I want to set up scheduled crawling in Nutch.
For example: I have crawled a site that has 1 million pages,
and I want to re-crawl the same site for updates once per week,
automatically (scheduled and incremental crawling).
It should fetch only modified or newly added content.
Is this possible with Nutch?
If so, how can I achieve it?
--
Sent from the Nutch - User mailing list archive at Nabble.com.