Here are the slides and the video of the presentation I gave at the Apache Lucene Eurocon 2011 in Barcelona. The talk was about how we crunch and index data using Solr, Hadoop and Hive at Trovit. I put special interest in the distributed indexing strategy.
Posts Tagged ‘Solr’
Apache Lucene EuroCon 2010
May 24th, 2010
No Comments
Yesterday I came back from the Lucene EuroCon 2010, wich took place in Prague. There have been many interesting talks there these days. Some of the slides are already on Slide Share. Can’t wait for the others to be uploaded. I gave a talk on Thursday about our usage of Solr at Trovit. Covered an [...]
Solr and Hadoop integration against scalability problems
February 6th, 2009
6 Comments
Recently I read an article explaining how Rackspace solved their huge log data deal with problem. They have implemented the best Hadoop and Solr integration I have seen until now, it really looks amazing. I don’t know hadoop with detail but to run Solr instances from a Tomcat server stored in HDFS (Hadoop’s distributed file [...]

