EDRM Search Guide/Search Glossary
Draft v. 1.14 - January 20, 2009
- A search technique that utilizes Boolean Logic, using terms such as AND, OR, and NOT. See also "Boolean search".
- A search technique that finds words similar in concept to a query word. A concept search returns documents that relate to the same concept as the query word, regardless of whether the query word itself appears in the returned documents. Concept searches can be implemented as a simple thesaurus match or by using sophisticated statistical analysis methods. See also "Concept search".
- Electronically Stored Information.
- A search technique that identifies ESI based on terms close to another term, with closeness measured by the typographical differences between the two terms. For example, snitch, switch, and swanky can all match swatch, depending on how many incorrect letters the search threshold allows. See also "Fuzzy search".
- An index that maps a keyword to the list of documents that contain the keyword. See also "Inverted index".
- A technique that examines the ESI and builds a searchable electronic index. This index typically maps from a keyword to all the documents that contain the keyword. See also "Keyword index".
- A very common search technique that uses query words (“keywords”) and looks for them in ESI, using an index. See also "Keyword search".
- A set of documents that a Producing Party is not required to provide because they are covered by a privilege, such as Attorney-Client Privilege. The existence of such documents should be recorded in the Privilege Log. See also "Privileged documents".
- A record of the documents that a Producing Party did not produce on account of a privilege, such as Attorney-Client Privilege. See also "Privilege log".
- A search consisting of multiple keywords separated by spaces to form a single phrase. For a document to match this search, the entire phrase as entered must be contained within the document. See also "Phrase search".
- A party that owns the complete collection of ESI, and is responsible for producing a portion of the ESI that is deemed to be relevant for a legal case or legal enquiry. See also "Producing party".
- A search for multiple keywords in which matching documents must contain all the keywords, with the keywords occurring within a specified number of words of each other. See also "Proximity search".
- Relational Database Management System. This is a technical term for the class of software programs that manage data using a relational schema, such as Microsoft SQL Server or Oracle. See also "RDBMS".
- A pattern that describes what the search should return based on special characters added to the keyword. For example, car* uses the character * as a wildcard, and the resulting documents should contain words that begin with the characters “car”, such as car, cartoon, or cartography. See also "Regular expressions".
- A measurement of the relevancy of a document, so that the Search Hits within a set of Search Results can be ordered. Relevancy measurements often involve counting the number of occurrences of a keyword within a document, as well as the number of documents in which the keyword is found. See also "Relevancy rank".
- A party that does not own the ESI and is requesting that the Producing Party, which owns the ESI, provide some subset of the ESI based on a Search Request. See also "Requesting party".
- A subset of ESI that matches the desired set of documents for the case. See also "Responsive file".
- A search component that implements the actual process of interpreting a search request and identifying subsets of documents. For example, a database management system such as Microsoft SQL Server contains a component that manages searches of the data stored in its databases. See also "Search engine".
- A document in the ESI that is considered to match the requested Search Query. See also "Search hit".
- A well-formulated Search Request that an automated search engine can interpret in order to produce matching results. See also "Search query".
- A collection of Search Hits that match the intended documents of a Search Request. See also "Search results".
- A synonym search returns documents that contain terms similar in meaning to the query words, usually using a thesaurus to determine which terms would match the query words. See also "Synonym search".
- A search option that returns matches for all variations of the root word of the query word. For example, with stemming enabled, a search for sing would match singing, sang, sung, song, and songs, as well as sing. See also "Stemming".
- An operation that examines a document or block of text and breaks the text into words. Typically, a space is used to separate words, but special characters such as a hyphen, period, or quotation mark can also be used. See also "Tokenization".
- A Search Specification that indicates that matching documents must contain words that begin with the letters entered, but that the matching words can end with any combination of letters. See also "Truncation".
- Symbols such as * or ? included within a Keyword to indicate that the location where the symbols are used may match a single letter or multiple letters. See also "Wildcard search".
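
Several of the entries above (tokenization, inverted index, keyword search, phrase search, proximity search, truncation, and fuzzy search) describe mechanics that can be easier to follow in code. The following is a minimal Python sketch under stated assumptions, not a description of how any particular search product works. The sample documents, all function names, the regex-based tokenizer, and the use of edit distance (the count of single-character insertions, deletions, and substitutions) as the fuzzy-search threshold are choices made for illustration only.

```python
import re
from collections import defaultdict

# Hypothetical sample ESI (document id -> text), invented for this sketch.
DOCS = {
    1: "The swatch of fabric was sent to the cartography team.",
    2: "Please switch the car before the cartoon airs.",
    3: "The team sent a fabric sample, not a swatch.",
}

def tokenize(text):
    """Tokenization: lowercase the text and split on any non-alphanumeric character."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(docs):
    """Inverted index: keyword -> {document id -> word positions of the keyword}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for position, word in enumerate(tokenize(text)):
            index[word].setdefault(doc_id, []).append(position)
    return index

def keyword_search(index, word):
    """Keyword search: ids of documents that contain the word."""
    return set(index.get(word.lower(), {}))

def phrase_search(index, phrase):
    """Phrase search: the words must appear consecutively, in the order entered."""
    words = tokenize(phrase)
    if not words:
        return set()
    hits = set()
    for doc_id, starts in index.get(words[0], {}).items():
        if any(all(start + offset in index.get(word, {}).get(doc_id, [])
                   for offset, word in enumerate(words))
               for start in starts):
            hits.add(doc_id)
    return hits

def proximity_search(index, first, second, max_distance):
    """Proximity search: both words present, at most max_distance words apart."""
    a = index.get(first.lower(), {})
    b = index.get(second.lower(), {})
    return {doc_id for doc_id in a.keys() & b.keys()
            if any(abs(i - j) <= max_distance for i in a[doc_id] for j in b[doc_id])}

def truncation_search(index, prefix):
    """Truncation: match any indexed word that begins with the prefix."""
    prefix = prefix.lower()
    return {doc_id for word, postings in index.items()
            if word.startswith(prefix) for doc_id in postings}

def edit_distance(a, b):
    """Levenshtein distance between two strings (assumed fuzzy-match measure)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, 1):
        current = [i]
        for j, char_b in enumerate(b, 1):
            current.append(min(previous[j] + 1,                        # deletion
                               current[j - 1] + 1,                     # insertion
                               previous[j - 1] + (char_a != char_b)))  # substitution
        previous = current
    return previous[-1]

def fuzzy_search(index, word, max_edits):
    """Fuzzy search: match indexed words within max_edits of the query word."""
    word = word.lower()
    return {doc_id for candidate, postings in index.items()
            if edit_distance(candidate, word) <= max_edits for doc_id in postings}

INDEX = build_index(DOCS)

if __name__ == "__main__":
    # Boolean search expressed as set operations on keyword results.
    swatch_not_sample = keyword_search(INDEX, "swatch") - keyword_search(INDEX, "sample")
    print(sorted(swatch_not_sample))                              # [1]
    print(sorted(phrase_search(INDEX, "fabric sample")))          # [3]
    print(sorted(proximity_search(INDEX, "swatch", "fabric", 2))) # [1]
    print(sorted(truncation_search(INDEX, "car")))                # [1, 2]
    print(sorted(fuzzy_search(INDEX, "swatch", 1)))               # [1, 2, 3]
```

In this sketch, Boolean AND, OR, and NOT correspond to set intersection, union, and difference over keyword results. Under the assumed edit-distance measure, switch is one edit from swatch, snitch is two, and swanky is three, which mirrors how the fuzzy search entry says matches depend on the threshold allowed.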