Home » nutch-user.lucene »

Dumping raw html and javascript

Kevin MacDonald

2008-09-29

Once I have done a crawl, I need to pass all of the raw HTML and
JavaScript that was fetched through a custom parser. During a fetch,
does Nutch store all of the raw content, including HTML tags, on disk?
Thanks

Kevin