Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

Possible Crawling bug

Kevin MacDonald

2008-09-16

Replies: Find Java Web Hosting

Author LoginPost Reply
See the code snippet below from org.apache.nutch.crawl.Crawl. I think
parsing happens opposite to what the nutch-site.xml config file indicates.

public static void main(...) {
  ...

   if (!Fetcher.isParsing(job)) {
    parseSegment.parse(segment);   // parse it, if needed
   }

  ...
}


Kevin
©2008 java2.5341.com - Jax Systems, LLC, U.S.A.