Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

getting seed list for vertical search engine

DS jha

2008-06-16

Replies: Find Java Web Hosting

Author LoginPost Reply
Hello,
We are in the process of developing a vertical search engine for the
medical industry – and I need to estimate server/sizing requirements
to setup my environment – my question is, how do I estimate how many
documents I will be fetching for a particular vertical? And – from
where do I get the seed list of all the sites? Will dmoz health
category be sufficient or will I have to purchase a seed list?

Thanks
©2008 java2.5341.com - Jax Systems, LLC, U.S.A.