Sort Relevance (Full Version)

All Forums >> [Messaging] >> GFI MailArchiver for Exchange



Message


mmcteague1 -> Sort Relevance (25.Apr.2008 11:09:50 AM)

Could someone please explain how the "relevance" , "Most relevant on top" works.

From the administrator user, I did a search for an email address, that I am the only internal person to correspond to.

The search returned a couple dozen emails.  The 1st 4 emails where part of a conversation we had on 1/3.  The 5th email was an email where that person was a recipient along with one other person on 1/29.  The next 2 emails were part of the 1/3 conversation and the next 2 were a different conversation on 1/19. And finally, about 19 emails down was the last email from the original 1/3 conversation.

What is the sort relevance to this?  The 1/29 email that was stuck in the middle doesn't make sense to me as to why it has more relevance than the next two emails which are part of a conversation thread. 

If MA is determined to calculate its own relevance, could you please fill us in on how it is done, so we can find emails faster.




John Letourneau -> RE: Sort Relevance (29.Apr.2008 8:38:09 AM)

mmcteague1,

In relevancy-ranked searches, GFI MailArchiver uses a "vector-space" algorithm to calculate a score for each document that takes into account the relative frequency of the search terms and their density in the retrieved file.  Infrequent terms count more heavily than common terms, and N hits in a short document count more heavily than N hits in a long document.

An additional positional scoring option increases the score when hits occur close to each other or close to the top of the file.  With positional scoring, hits near the top of a file, and hits close to other hits, are weighted more highly.  For example, if you search for apple pie recipe, a document with those three words near the top of the file, all together, will rank higher than a file that has the words scattered randomly throughout the document.







mmcteague1 -> RE: Sort Relevance (9.May2008 2:46:40 PM)

Thanx John,

It seems a very complex calculation for something I think should be so simple.  If you took a poll of all users, I would venture a guess that the majority of your users would prefer the default sort to be by date.




Page: [1]