Java Mailing List Archive

http://www.java2.5341.com/

nutch-user.lucene

Re: Uncompressing SEQ files from cmdline

Dennis Kubes

2008-10-03

Sequence files may be compressed, but compressed or not they are a binary
format, not text, so uncompressing one will not give you readable output.
You would need to run a MapReduce (MR) job that writes the values out with
TextOutputFormat.

Dennis

brainstorm wrote:
> How can I easily uncompress a file downloaded from HDFS? Does anyone
> have a Java snippet for this?
>
> SEQ^F^Yorg.apache.hadoop.io.Text!org.apache.nutch.crawl.CrawlDatum^@^@^@^@^@^@[binary data]
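
To illustrate why the dump quoted above is binary rather than compressed text, here is a minimal sketch in plain Java (no Hadoop on the classpath) that decodes the first few header fields. It assumes the version 6 header layout as I understand it: the magic bytes "SEQ", one version byte, the key and value class names each prefixed by a length, then two one-byte flags for record compression and block compression. It also assumes each class name is shorter than 128 bytes, so its length fits in a single byte. The class SeqHeaderPeek and its methods are hypothetical names made up for this example, and sampleHeader() rebuilds the visible start of the dump instead of reading a real file. This only inspects the header; dumping the values would still take something like the MR job described above.

```java
import java.io.ByteArrayOutputStream;
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

public class SeqHeaderPeek {

    /** The header fields this sketch decodes (hypothetical holder type). */
    static final class Header {
        final int version;
        final String keyClass;
        final String valueClass;
        final boolean compressed;
        final boolean blockCompressed;

        Header(int version, String keyClass, String valueClass,
               boolean compressed, boolean blockCompressed) {
            this.version = version;
            this.keyClass = keyClass;
            this.valueClass = valueClass;
            this.compressed = compressed;
            this.blockCompressed = blockCompressed;
        }
    }

    /** Decodes the leading header fields, assuming the version 6 layout. */
    static Header parse(byte[] bytes) {
        ByteBuffer buf = ByteBuffer.wrap(bytes);
        if (buf.get() != 'S' || buf.get() != 'E' || buf.get() != 'Q') {
            throw new IllegalArgumentException("missing SEQ magic bytes");
        }
        int version = buf.get() & 0xFF;           // ^F in the dump is 6
        String keyClass = readShortString(buf);   // ^Y is 25, the name length
        String valueClass = readShortString(buf); // '!' is 33, the name length
        boolean compressed = buf.get() != 0;      // first ^@ in the dump
        boolean blockCompressed = buf.get() != 0; // second ^@ in the dump
        return new Header(version, keyClass, valueClass, compressed, blockCompressed);
    }

    /** Assumption: the length prefix is a single byte (names under 128 bytes). */
    private static String readShortString(ByteBuffer buf) {
        int len = buf.get();
        if (len < 0) {
            throw new IllegalArgumentException("multi-byte length not handled in this sketch");
        }
        byte[] name = new byte[len];
        buf.get(name);
        return new String(name, StandardCharsets.UTF_8);
    }

    /** Rebuilds the readable start of the quoted dump for demonstration. */
    static byte[] sampleHeader() {
        byte[] key = "org.apache.hadoop.io.Text".getBytes(StandardCharsets.UTF_8);
        byte[] value = "org.apache.nutch.crawl.CrawlDatum".getBytes(StandardCharsets.UTF_8);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.write('S');
        out.write('E');
        out.write('Q');
        out.write(6);                       // version byte
        out.write(key.length);              // length prefix for the key class
        out.write(key, 0, key.length);
        out.write(value.length);            // length prefix for the value class
        out.write(value, 0, value.length);
        out.write(0);                       // record compression flag
        out.write(0);                       // block compression flag
        return out.toByteArray();
    }

    public static void main(String[] args) {
        Header h = parse(sampleHeader());
        System.out.println("version=" + h.version);
        System.out.println("key=" + h.keyClass);
        System.out.println("value=" + h.valueClass);
        System.out.println("compressed=" + h.compressed
                + ", blockCompressed=" + h.blockCompressed);
    }
}
```

If that reading of the layout is right, both flags in the quoted dump are zero, which would mean the file is not compressed at all and the unreadable part is simply the serialized CrawlDatum records.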