Has anyone deployed Lucene to index log files? I have seen some articles
about how Rackspace used Lucene and Hadoop for log processing, but I have
not seen any details of the implementation.
To get the analytics I need, I think I would have to treat each line of
the Apache log files as a document, and I thought I would index each field
as a keyword (untokenized) to minimize processing.
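
To make the "one line, one document" idea concrete, here is a minimal
sketch of the parsing step only, using plain JDK classes. It assumes the
logs begin with the standard Apache Common Log Format prefix; the class name
LogLineParser and the field names (host, ident, user, time, request, status,
bytes) are made up for illustration and are not part of Lucene. Each map
entry is what I would then add to a Lucene document as an untokenized field.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper: turns one access-log line into field/value pairs. */
public class LogLineParser {

    // Common Log Format prefix: host ident user [time] "request" status bytes
    // Anything after the byte count (e.g. referer, user agent) is ignored.
    private static final Pattern CLF = Pattern.compile(
        "^(\\S+) (\\S+) (\\S+) \\[([^\\]]+)\\] \"([^\"]*)\" (\\d{3}) (\\S+)");

    /** Returns the parsed fields in order, or null if the line does not match. */
    public static Map<String, String> parse(String line) {
        Matcher m = CLF.matcher(line);
        if (!m.lookingAt()) {
            return null; // malformed line: caller decides whether to skip or log it
        }
        Map<String, String> doc = new LinkedHashMap<>();
        doc.put("host", m.group(1));
        doc.put("ident", m.group(2));
        doc.put("user", m.group(3));
        doc.put("time", m.group(4));
        doc.put("request", m.group(5));
        doc.put("status", m.group(6));
        doc.put("bytes", m.group(7));
        return doc;
    }

    public static void main(String[] args) {
        String line = "127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] "
                    + "\"GET /apache_pb.gif HTTP/1.0\" 200 2326";
        // Each entry would become one keyword field of one document.
        parse(line).forEach((name, value) -> System.out.println(name + " = " + value));
    }
}
```

Since every value is kept as a single term, no analyzer would need to run
on it at index time, which is the saving I am hoping for.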
Assuming the clusters operate on independent datasets (so I would expect
it to scale roughly linearly) and the goal is to process terabytes of logs
per day, is such a solution even feasible?
Thank you,
Jeff Capone
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@(protected)
For additional commands, e-mail: java-user-help@(protected)