Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

Crawling SLASHDOT.ORG

kranthi reddy

2008-06-25

Replies: Find Java Web Hosting

Author LoginPost Reply
Hi,

  I am new to nutch . I have been trying to crawl "slashdot.org" . But
due to some unknown problems i am unable to crawl the site.
  I am able to crawl any other site site (bbc,ndtv,cricbuzz etc)... but
when i try to crawl "slashdot.org" i get the following error ...

    "Generator: jobtracker is 'local', generating exactly one partition.
     Generator: 0 records selected for fetching, exiting ...
      Stopping at depth=1 - no more URLs to fetch."

  Can some one please help me out.


Thank you in advance

Kranthi Reddy. B
©2008 java2.5341.com - Jax Systems, LLC, U.S.A.