Feasibility question

Jeff Capone

2008-11-10

Has anyone deployed Lucene to index log files? I have seen some articles
about how RackSpace used Lucene and Hadoop for log processing, but I have
not seen any details on the implementation.

To get the analytics I need, I think I would have to treat each line of
the Apache log files as a document, and I thought I would index each field
as a keyword (stored as-is, not tokenized) to minimize processing.
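
To make the "one line, one document" idea concrete, here is a minimal sketch
in plain Java (deliberately no Lucene calls) that splits a single Apache
Common Log Format line into named fields. The class name LogLineFields, the
field names, and the sample line are illustrative assumptions, and the
pattern only covers the basic common format, not the combined format or
escaped quotes. Each map entry is what I would expect to hand to the indexer
as one untokenized keyword field.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class LogLineFields {
    // Common Log Format: host ident user [timestamp] "request" status bytes
    private static final Pattern CLF = Pattern.compile(
        "^(\\S+) (\\S+) (\\S+) \\[([^\\]]+)\\] \"([^\"]*)\" (\\d{3}) (\\S+)$");

    /**
     * Parses one log line into an ordered map of field name to raw value.
     * Returns null when the line does not match the expected format.
     * Field names here are hypothetical, chosen for illustration only.
     */
    static Map<String, String> parse(String line) {
        Matcher m = CLF.matcher(line);
        if (!m.matches()) {
            return null;
        }
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("host", m.group(1));
        fields.put("ident", m.group(2));
        fields.put("user", m.group(3));
        fields.put("timestamp", m.group(4));
        fields.put("request", m.group(5));
        fields.put("status", m.group(6));
        fields.put("bytes", m.group(7));
        return fields;
    }

    public static void main(String[] args) {
        String line = "127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] "
                    + "\"GET /apache_pb.gif HTTP/1.0\" 200 2326";
        // Each entry would become one keyword field of the per-line document.
        Map<String, String> doc = parse(line);
        for (Map.Entry<String, String> e : doc.entrySet()) {
            System.out.println(e.getKey() + " = " + e.getValue());
        }
    }
}
```

Keeping the values as whole strings is what avoids analysis cost at index
time; the trade-off is that only exact-match queries on a field would work.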

Assuming the clusters operate on independent datasets (so throughput should
scale roughly linearly with the number of nodes) and the goal is to process
terabytes of logs per day, is such a solution even feasible?
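
For scale, a back-of-the-envelope sketch of what "terabytes per day" means as
a sustained rate, and how many independent nodes that implies under the
linear-scaling assumption. The class name LogThroughput and the per-node
indexing rate passed in main are placeholders I made up for illustration,
not measurements of Lucene; a real figure would have to come from a
benchmark on the actual hardware and log format.

```java
class LogThroughput {
    private static final long SECONDS_PER_DAY = 24L * 60 * 60;

    /** Sustained rate in MB/s (decimal units) needed to keep up with a daily volume. */
    static double requiredMBPerSecond(double terabytesPerDay) {
        double bytesPerDay = terabytesPerDay * 1e12;
        return bytesPerDay / SECONDS_PER_DAY / 1e6;
    }

    /**
     * Number of nodes needed if each node indexes an independent slice of
     * the logs at perNodeMBPerSecond, i.e. assuming perfectly linear scaling.
     */
    static int nodesNeeded(double terabytesPerDay, double perNodeMBPerSecond) {
        return (int) Math.ceil(requiredMBPerSecond(terabytesPerDay) / perNodeMBPerSecond);
    }

    public static void main(String[] args) {
        double assumedPerNodeRate = 5.0; // MB/s, hypothetical placeholder
        for (double tb : new double[] {1, 5, 10}) {
            System.out.printf("%.0f TB/day needs about %.1f MB/s sustained, %d nodes at %.1f MB/s each%n",
                tb, requiredMBPerSecond(tb), nodesNeeded(tb, assumedPerNodeRate), assumedPerNodeRate);
        }
    }
}
```

One terabyte per day works out to roughly 11.6 MB/s of sustained input, so
the answer hinges almost entirely on the real per-node indexing rate.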

Thank you,

Jeff Capone

