Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

scheduled crawling in nutch

rameshgalla

2008-08-21

Replies: Find Java Web Hosting

Author LoginPost Reply

I want to do schedule crawling in nutch.....
Eg: I have crawled a site which has 1 million pages.
and want to crawl the same site for updates once per week
automatically(scheduled & incremental crawling).
It has to crawl only modified or newly added content.

Is it possible with nutch?

If possible how can I achieve it?
--
Sent from the Nutch - User mailing list archive at Nabble.com.

©2008 java2.5341.com - Jax Systems, LLC, U.S.A.