Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

url redirection

Neera Sharma

2008-11-17

Replies: Find Java Web Hosting

Author LoginPost Reply
Hi all,

I have a question about how butch handles URL redirection.

In case of url redirection, crawl dump file stores contents of
redirected url under the target url. The original url is stored
without any contents.

Is it possible to correlate a redirected URL with an original URL?

Does nutch record redirected URLs ?


Thanks,
Neera

©2008 java2.5341.com - Jax Systems, LLC, U.S.A.