Hi
You can also use a tool called "antiword" to extract the text from a .doc file
and then pass that text to Lucene.
See: http://en.wikipedia.org/wiki/Antiword
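As a rough sketch of the extraction step, something like the following could call antiword from Java and capture its output. It assumes the antiword binary is installed and on the PATH; the class and method names (AntiwordExtractor, buildCommand, run) are made up for illustration, the output is assumed to be in the platform default charset, and InputStream.readAllBytes needs Java 9 or later. The returned string is what you would then add to a Lucene document as a field.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.Charset;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene or antiword.
class AntiwordExtractor {

    // Builds the command line "antiword <file>"; antiword writes the
    // plain text of the document to standard output.
    static List<String> buildCommand(String executable, String docPath) {
        List<String> command = new ArrayList<>();
        command.add(executable);
        command.add(docPath);
        return command;
    }

    // Runs the command and returns everything it printed to stdout.
    static String run(List<String> command) throws IOException, InterruptedException {
        ProcessBuilder builder = new ProcessBuilder(command);
        // Pass antiword's error messages through to our own stderr so they
        // do not end up mixed into the extracted text.
        builder.redirectError(ProcessBuilder.Redirect.INHERIT);
        Process process = builder.start();

        byte[] output;
        try (InputStream in = process.getInputStream()) {
            output = in.readAllBytes(); // Java 9+
        }

        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("Command failed with exit code " + exitCode);
        }
        // Assumption: the output uses the platform default charset.
        return new String(output, Charset.defaultCharset());
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java AntiwordExtractor <file.doc>");
            return;
        }
        String text = run(buildCommand("antiword", args[0]));
        // This is the point where the text would be handed to Lucene.
        System.out.println(text);
    }
}
```

Starting an external process for every document is slow for large collections, so this is mainly useful for small indexes or one-off imports.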
Regards
Mirko
-----Original Message-----
From: dipesh [mailto:dipshrestha@(protected)]
Sent: Wednesday, 12 November 2008 04:38
To: java-user@(protected)
Subject: Parsing MSWord
Hello,
I wanted to know if there are classes in Lucene that support parsing MSWord
documents.
Many thanks,
Dipesh
----------------------------------------
"Help Ever Hurt Never"- Baba