Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

how does nutch connect to urls internally?

Del Rio, Ann

2008-06-16

Replies: Find Java Web Hosting

Author LoginPost Reply
Good morning,
 
Can you please point me to a Nutch documentation where I can find how nutch connects to the webpages when it crawls? I think it is through HTTP but i would like to confirm and get more details so i can write a very small test java program to connect to one of the webpages i am having trouble connecting / crawling. I bought Lucene in Action and am half way thru the book and so far there is very little about Nutch.
 
Thanks,

Ann Del Rio

Ph: 408.376.6504
E-mail: adelrio@ebay.com
Skype: delrio_alan
 

Attachment: cid:576001716@16062008-248F

©2008 java2.5341.com - Jax Systems, LLC, U.S.A.