Java Mailing List Archive

http://www.java2.5341.com/

List: nutch-user.lucene

How to filter pages by MIME type?

David Darras

2008-10-16


Hello,

I'm a Nutch newbie and I have a problem. I don't want to index the CSS
and JS files on my websites. Nutch seems to decide based on the file
extension, but my files don't end with .css or .js (and I can't change
that). So I tried to filter these pages by MIME type instead, but I
haven't gotten anywhere so far. If anybody has any clues, it would be
really appreciated :)
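
To make the distinction concrete, the kind of check being asked for can be sketched in plain Java. This is only an illustration of deciding by MIME type rather than by URL extension; it is not Nutch's API. The class name `MimeTypeFilter`, the method `shouldIndex`, and the blocklist contents are all hypothetical, and how such a check would be wired into a Nutch plugin is left open.

```java
import java.util.Locale;
import java.util.Set;

public class MimeTypeFilter {
    // Hypothetical blocklist: MIME types that should not be indexed.
    private static final Set<String> BLOCKED = Set.of(
            "text/css",
            "text/javascript",
            "application/javascript",
            "application/x-javascript");

    /**
     * Returns true if a document served with this Content-Type value
     * should be indexed. The URL (and its extension) is never consulted.
     */
    public static boolean shouldIndex(String contentType) {
        if (contentType == null || contentType.trim().isEmpty()) {
            // Assumption: keep pages of unknown type rather than drop them.
            return true;
        }
        // Strip parameters such as "; charset=UTF-8" and normalize case.
        String mime = contentType.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        return !BLOCKED.contains(mime);
    }

    public static void main(String[] args) {
        System.out.println(shouldIndex("text/css; charset=UTF-8"));
        System.out.println(shouldIndex("text/html"));
    }
}
```

Because the decision depends only on the reported content type, a stylesheet served from a URL without a .css suffix would still be excluded, as long as the server reports its type correctly.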

Thanks in advance,

--
David DARRAS
CRI - Université de Lille 1
