Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

large content/parse segments

charlie w

2008-05-14


Author LoginPost Reply
This is in reference to the Nutch "content" segments
(segments/<timestamp>/parse_text, etc.), not the segments of a Lucene
index.

I am considering using SegmentMerger to combine a large number of
fetch segments into a single huge segment. Will doing so create a
performance problem when generating page summaries at search time? If
so, is there a recommended maximum size for one of these segments?

Thanks,
Charlie
©2008 java2.5341.com - Jax Systems, LLC, U.S.A.