Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

Nutch- crawling?

nutch_newbie

2008-06-12

Replies: Find Java Web Hosting

Author LoginPost Reply

I ran the crawler, and it seems just fine. and in localhost:8080/nutch-0.8.1
the nutch search window is displayed, but whenever something is searched,
the results always say "Hits 0-0 (out of about 0 total matching pages): "
here is the piece of my crawl-urlfilter.txt that i modified:

# accept hosts in MY.DOMAIN.NAME
+^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
+^http://([a-z0-9]*\.)*www.en.wikipedia.org
+^http://([a-z0-9]*\.)*www.google.com
+^http://([a-z0-9]*\.)*www.search.yahoo.com/

what else am i supposed to do? i'm really confused and running short on
time. any and all help would be greatly appreciated. thanks in advance.

PS: my computer is linux- FC5- but the folders and config files are still
the same. and i also tried restarting tomcat- which didn;t help.


--
Sent from the Nutch - User mailing list archive at Nabble.com.

©2008 java2.5341.com - Jax Systems, LLC, U.S.A.