|
John Letourneau -> RE: Sort Relevance (29.Apr.2008 8:38:09 AM)
|
mmcteague1, In relevancy-ranked searches, GFI MailArchiver uses a "vector-space" algorithm to calculate a score for each document that takes into account the relative frequency of the search terms and their density in the retrieved file. Infrequent terms count more heavily than common terms, and N hits in a short document count more heavily than N hits in a long document. An additional positional scoring option increases the score when hits occur close to each other or close to the top of the file. With positional scoring, hits near the top of a file, and hits close to other hits, are weighted more highly. For example, if you search for apple pie recipe, a document with those three words near the top of the file, all together, will rank higher than a file that has the words scattered randomly throughout the document.
|
|
|
|