Life, code and stuff
Posted by Marc Sturlese

27 Feb 10 Lucene 2.9.2 and 3.0.1 released

Lucene versions 2.9.2 and 3.0.1 have been released. Both are mainly bug-fix releases over their predecessors.
The main difference between the 2.x and 3.x lines is that version 3 drops support for Java 1.4 and has a cleaner API, since deprecated code has been removed. This means if you want to upgrade your [...]


Posted by Marc Sturlese

12 Feb 10 ElasticSearch

It has been a long time since my last post. I have been very busy so, unfortunately, I have not had time to write about everything I wanted to.
This week I discovered, via Twitter, a really interesting open source search project for the cloud: ElasticSearch. ElasticSearch was created by Shay Banon. It’s a [...]


Posted by Marc Sturlese

08 Apr 09 Lucene TrieRangeQuery

Lucene TrieRangeQuery is a cool contrib module in Lucene (I think it is not yet in the official release) created by Uwe Schindler. I had heard about it before, but really learned about it at the Lucene MeetUp at ApacheCon EU, where Uwe gave a great talk about it. As I found it a really useful feature, I will try to explain the [...]


Posted by Marc Sturlese

01 Apr 09 ApacheCon Europe 2009

Last week I had the chance to go to ApacheCon Europe 2009. The event took place at the Mövenpick Hotel, Amsterdam. I had a really good time there.
It was good to share use cases and experiences in person with people I had previously only spoken with in forums.
I spent the first two days in the [...]


Posted by Marc Sturlese

09 Mar 09 Lucene 2.4.1 available from today

A new official release of Lucene is now available! Lucene 2.4.1 is a bug-fix version.
More new features will arrive in the Lucene 2.9 release (currently available as a development version).
Here I list all the improvements in Lucene 2.4.1, which I took from the official Lucene site:

Fixed silent data-loss case whereby binary fields [...]


Posted by Marc Sturlese

02 Mar 09 Index scalability using Pig

Here is a really interesting example of how to build an inverted index using Pig. From what I have seen in Hadoop, to create a Lucene index you must start from a text file and use MapReduce jobs to build it. Pig, however, allows you to retrieve data not just from a text file but from [...]


Posted by Marc Sturlese

06 Feb 09 Solr and Hadoop integration against scalability problems

Recently I read an article explaining how Rackspace solved the problem of dealing with their huge volume of log data. They have implemented the best Hadoop and Solr integration I have seen so far; it really looks amazing.
I don’t know Hadoop in detail, but to run Solr instances from a Tomcat server stored in HDFS (Hadoop’s distributed file system) [...]


Posted by Marc Sturlese

13 Jan 09 Opening Lucene 2.9-dev indexes with Luke Lucene Index Toolbox

Lately I have started using the development version of Lucene (2.9-dev). When I tried to open an index with Luke to check some content, it did not work: I got a “lucene invalid index” error. After a while I realized this was to be expected. The cause of the error is that the latest Luke’s [...]
