Life, code and stuff
Posted by Marc Sturlese

24 May 10 Apache Lucene EuroCon 2010

Yesterday I came back from the Lucene EuroCon 2010, wich took place in Prague. There have been many interesting talks there these days. Some of the slides are already on Slide Share.  Can’t wait for the others to be uploaded. I gave a talk on Thursday about our usage of Solr at Trovit. Covered an [...]

Tags: , , , , , ,

Posted by Marc Sturlese

18 Jun 09 CloudCamp Barcelona 2009

Last Monday took place in Barcelona the first CloudCamp ever done in the city. Altough I was expecting more technical stuff it was good to be there and listen to what people have to say. The first part of the event consisted of some quick explanations from different companies related with cloud computing. Basically, were [...]

Tags: , , , , , , ,

Posted by Marc Sturlese

01 Apr 09 ApacheCon Europe 2009

Last week I had the chance to go to the ApacheCon Europe 2009. The event took place in Mövenpick Hotel, Amsterdam. I had a really good time in there. Was good to share use cases and experiences in person with people who I had just spoken with in forums. I spend the first two days [...]

Tags: , , , , , , ,

Posted by Marc Sturlese

02 Mar 09 Index scalability using Pig

Here is a really interesting example of how to build an inverted index using Pig. As I have seen in Hadoop, to create a Lucene index you must start from a text file and use MapReduce jobs to build it. Pig however, allows you to retrieve data not just from a text file but from [...]

Tags: , , ,

Posted by Marc Sturlese

06 Feb 09 Solr and Hadoop integration against scalability problems

Recently I read an article explaining how Rackspace solved their huge log data deal with problem. They have implemented the best Hadoop and Solr integration I have seen until now, it really looks amazing. I don’t know hadoop with detail but to run Solr instances from a Tomcat server stored in HDFS (Hadoop’s distributed file [...]

Tags: , ,