Java Mailing List Archive

http://www.java2.5341.com/

Home » java-user.lucene »

Issues with Special Characters

miztaken

2008-09-16

Replies: Find Java Web Hosting

Author LoginPost Reply

Hi there,
I am using WhiteSpaceAnalyser to index documents. I have used this because i
need to split tokens based on space only. Also Tokensized=true
While indexing what does it do with special characters like + - && || ! ( )
{ } [ ] ^ " ~ * ? : \, will these characters be indexed or will be chopped
off? I am confused about this.

Now i am having problem while searching as well..
for query strings like "jason dartling (e-mail)" and "re: fyi.dat", i don't
have to escape the special characters ( , ) and : but for input such as
"re:" queryParser is producing error so i have escaped characters here.
So it seems like i have two cases to deal with..
Can anyone suggest me one generic way to deal with both the cases?

Basically how to index and search string with escape characters will be my
generalized question?


Please help me
miztaken





--
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@(protected)
For additional commands, e-mail: java-user-help@(protected)

©2008 java2.5341.com - Jax Systems, LLC, U.S.A.