Java Mailing List Archive

http://www.java2.5341.com/

Home » nutch-user.lucene »

Bypass Validation

karthik085

2008-07-14

Replies: Find Java Web Hosting

Author LoginPost Reply

Hi.

I am trying to crawl a page using nutch. That page exists behinds a
validator (struts), i.e. In order to get to the page, a button needs to be
clicked. Is there anyway this can be bypassed so web crawler can get to the
page without clicking this button?

Code:
<form name="loginForm" method="post" action="/check.do">
   <input type="hidden" name="forward" value="target_page">
    <input type="submit" name="org.apache.struts.taglib.html.CANCEL"
value="Continue" onclick="bCancel=true;">
</form>

Any help is appreciated. Thanks.
--
Sent from the Nutch - User mailing list archive at Nabble.com.

©2008 java2.5341.com - Jax Systems, LLC, U.S.A.