Author Login
Post Reply
Hi all,
I have a question about how butch handles URL redirection.
In case of url redirection, crawl dump file stores contents of
redirected url under the target url. The original url is stored
without any contents.
Is it possible to correlate a redirected URL with an original URL?
Does nutch record redirected URLs ?
Thanks,
Neera