Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

how to deal with the max number of outlinks and inlinks per page?

wangyong

2008-04-18


Hi all,

I wonder whether there is any way to control the maximum number of outlinks and inlinks per page. I understand that I can set the maximum number of pages fetched per layer on the command line with the -topN parameter.

I searched Google for an answer. The closest thing I found was the "db.max.outlinks.per.parse" parameter, which is the maximum number of outlinks per page and has a default value of 100. Since some important pages on the web have thousands of outlinks, fetching only 100 of them is not a good idea, so I would like to raise this default to a relatively large value. However, I could not find any information on how to do that.

Has anyone encountered this problem before? Does anyone have a solution? I have struggled with it for hours.

Have a good weekend.
yong
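
For reference, here is a minimal sketch of the kind of override being asked about. It assumes that Nutch reads site-specific settings from conf/nutch-site.xml, that the outlink limit is exposed there as db.max.outlinks.per.page (the name as it appears in nutch-default.xml, as far as I can tell, rather than the "per.parse" spelling quoted above), and that a separate db.max.inlinks property caps inlinks. The values 5000 and 20000 are arbitrary illustrations, not recommendations; please check the property names and descriptions in the nutch-default.xml shipped with your version before relying on this.

```xml
<?xml version="1.0"?>
<!-- conf/nutch-site.xml: site-specific overrides (sketch, property names assumed) -->
<configuration>

  <!-- Assumed property name; raises the per-page outlink limit from its default of 100. -->
  <property>
    <name>db.max.outlinks.per.page</name>
    <value>5000</value>
  </property>

  <!-- Assumed property name; caps the number of inlinks kept per URL. -->
  <property>
    <name>db.max.inlinks</name>
    <value>20000</value>
  </property>

</configuration>
```

A changed outlink limit would presumably only affect pages parsed after the change, so segments that were already parsed would need to be re-parsed or re-fetched to pick up the extra links.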