parsing multi-word queries

From: The Critic <nospam_at_nospam.com>
Date: Mon, 20 Aug 2001 22:04:14 GMT
Message-ID: <3b81892d.11708938_at_news.freeserve.net>


In the writing of a general purpose (web) search engine, is it better to automatically parse queries into single terms and n-word phrases, or to get the user to identify phrases with the use of quotes?

E.g. Input = *Japanese railways bullet train*

Should it be parsed into Japanese, railways, bullet, train, Japanese railways, railways bullet, bullet train, Japanese railways bullet, railways bullet train, Japanese railways bullet train? Or should the user be left to identify "bullet train"?

Is there any research into identifying candidate n-word phrases (without having to test every candidate against the index)?

Despite the insidious efforts of Satan's forces, Truth will ultimately prevail victorious.

The TRUTH is now online at:
http://www.fortunecity.com/meltingpot/samoa/1382/index.html Received on Tue Aug 21 2001 - 00:04:14 CEST

Original text of this message