Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

Indexing XML-based document format per DITA standard

Del Rio, Ann

2008-05-30

Replies: Find Java Web Hosting

Author LoginPost Reply
I added a new URL to index which is in a XML-based document format per
DITA standard and I get the following error.

java.net.SocketException: Connection reset
2008-05-27 17:56:58 ERROR Http           at
java.net.SocketInputStream.read (SocketInputStream.java:168)
2008-05-27 17:56:58 ERROR Http           at
java.io.BufferedInputStream.fill (BufferedInputStream.java:218)
2008-05-27 17:56:58 ERROR Http           at
java.io.BufferedInputStream.read (BufferedInputStream.java:235)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpParser.readRawLine (HttpParser.java:77)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpParser.readLine (HttpParser.java:105)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpConnection.readLine(HttpConnection.jav
a:1115)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpCon
nectionAdapter.readLine(MultiThreadedHttpConnectionManager.java:1373)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpMethodBase.readStatusLine(HttpMethodBa
se.java:1832)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpMethodBase.readResponse(HttpMethodBase
.java:1590)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpMethodBase.execute(HttpMethodBase.java
:995)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMe
thodDirector.java:397)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMetho
dDirector.java:170)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpClient.executeMethod (HttpClient.java:3
96)
2008-05-27 17:56:58 ERROR Http           at
org.apache.commons.httpclient.HttpClient.executeMethod (HttpClient.java:3
24)
2008-05-27 17:56:58 ERROR Http           at
org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.ja
va:96)
2008-05-27 17:56:58 ERROR Http           at
org.apache.nutch.protocol.httpclient.Http.getResponse (Http.java:99)
2008-05-27 17:56:58 ERROR Http           at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:219)
2008-05-27 17:56:58 ERROR Http           at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:145)
2008-05-27 17:56:58 INFO Fetcher         fetch of
http://v4:10000/lib <http://v4:10000/lib> failed with:
java.net.SocketException: Connection reset

i googled and found no solution so far...

do i need to setup some config / host file to specify the ports?
the URL is an internal website.

any response will be appreciated.

Thanks,
Ann Del Rio
Senior Developer
eBay, Inc
©2008 java2.5341.com - Jax Systems, LLC, U.S.A.