Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

how much space required?

Edward Quick

2008-09-17

Replies: Find Java Web Hosting

Author LoginPost Reply

Hi,

I'm running an intranet crawl and have got to the 6th depth which apparently has 2.2 million links to fetch. I started off with 100Gb but that was barely enough for the fetch not to mention the updatedb step, so I'm just trying to find a reliable method for determining how much space is required to do the crawl.

Any ideas?

Ed.

_________________________________________________________________
Win New York holidays with Kellogg’s & Live Search
http://clk.atdmt.com/UKM/go/111354033/direct/01/
©2008 java2.5341.com - Jax Systems, LLC, U.S.A.