Java Mailing List Archive

http://www.java2.5341.com/


Searching for Quoted Phrases

Brent Walker

2008-04-24


I'm attempting to search for a quoted phrase within a crawled document
and am having trouble getting Nutch to pass the quotes through. For
example, say I am looking for the song title So Says I on a page, and I
only want to return results where the title appears in quotes within
the page: "So Says I".

Query.parse("\"So Says I\"", conf)

results in a querystring of

"So Says I".

This will be interpreted as a query for the phrase, not a query for
the quoted phrase.

According to the Lucene documentation, quotes can be searched for by
escaping them with a backslash (\). So I think my desired querystring is:

\"So Says I\" or "\"So Says I\""
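For reference, the two candidate querystrings above can be built in plain Java as a minimal sketch. The helper names escapeQuotes and asPhrase are hypothetical and are not part of Nutch or Lucene; the code only applies the backslash-escape rule to double quotes and makes no claim about how Query.parse treats the result.

```java
public class EscapeQuotes {
    // Hypothetical helper (not a Nutch or Lucene API): prefixes every
    // double quote with a backslash, per the backslash-escape rule.
    static String escapeQuotes(String text) {
        return text.replace("\"", "\\\"");
    }

    // Hypothetical helper: wraps already-escaped text in unescaped quotes
    // so that the whole thing reads as a phrase.
    static String asPhrase(String escaped) {
        return "\"" + escaped + "\"";
    }

    public static void main(String[] args) {
        String quotedTitle = "\"So Says I\"";       // holds: "So Says I"
        String escaped = escapeQuotes(quotedTitle); // holds: \"So Says I\"
        System.out.println(escaped);
        System.out.println(asPhrase(escaped));      // holds: "\"So Says I\""
    }
}
```

Whether either string survives the query parser unchanged is a separate matter.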

However, I've been unable to get either of these querystrings out of Query.parse:

Query.parse("\\\"So Says I\\\"", conf)

results in a query of

"So Says I"

Query.parse("\"\\\"So Says I\\\"\"", conf)

results in

So Says I
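To separate Java's own string escaping from whatever the query parser does, the three literals used above can be printed as the characters they actually hold. This is a small stand-alone sketch; the class name QuoteLiterals and the describe helper are made up for illustration, and nothing here calls Nutch.

```java
public class QuoteLiterals {
    // Returns the characters the string really contains, plus its length,
    // so the effect of the compiler's escape processing is visible.
    static String describe(String s) {
        return s + " (" + s.length() + " chars)";
    }

    public static void main(String[] args) {
        String phrase = "\"So Says I\"";         // first attempt
        String escaped = "\\\"So Says I\\\"";    // second attempt
        String both = "\"\\\"So Says I\\\"\"";   // third attempt

        System.out.println(describe(phrase));    // "So Says I" (11 chars)
        System.out.println(describe(escaped));   // \"So Says I\" (13 chars)
        System.out.println(describe(both));      // "\"So Says I\"" (15 chars)
    }
}
```

So the backslashes do reach Query.parse in the second and third calls; they are not lost at the Java literal level.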

Anyone know how to get this done?

Thanks.

Brent