Lucene 2.9.2 and 3.0.1 have been released. Both are mainly bug-fix releases for their previous versions.
The main difference between the 2.x and 3.x lines is that version 3 has no support for Java 1.4 and has a cleaner API, since deprecated code has been removed. This means if you want to upgrade your [...]
Tags: Java, Lucene, Open source
It has been a long time since my last post. I have been very busy so, unfortunately, I have not had the time to write about everything I wanted to.
This week I discovered via Twitter a really interesting open source search project, ElasticSearch for the cloud. ElasticSearch was created by Shay Banon. It’s a [...]
Tags: Cloud computing, ElasticSearch, Java, Lucene, Open source
TrieRangeQuery is a cool Lucene contrib (I think it is not yet in an official release) created by Uwe Schindler. I had heard about it before, but I really learned about it at the LuceneMeetUp at ApacheCon EU, where Uwe gave a great speech on it. As I found it a really useful feature, I will try to explain the [...]
Tags: ApacheCon, Java, Lucene, Open source, TrieRangeQuery
Last week I had the chance to go to ApacheCon Europe 2009. The event took place at the Mövenpick Hotel, Amsterdam. I had a really good time there.
It was good to share use cases and experiences in person with people I had previously only spoken with in forums.
I spent the first two days in the [...]
Tags: ApacheCon, Events, Hadoop, HBase, Lucene, Mahout, Open source, Pig
A new official release of Lucene is now available! Lucene 2.4.1 is a bug-fix release.
More new features will arrive in the Lucene 2.9 release (currently available as a development version).
Here are all the improvements in Lucene 2.4.1, which I took from the official Lucene site:
Fixed silent data-loss case whereby binary fields [...]
Here is a really interesting example of how to build an inverted index using Pig. From what I have seen in Hadoop, to create a Lucene index you must start from a text file and use MapReduce jobs to build it. Pig, however, allows you to retrieve data not just from a text file but from [...]
Recently I read an article explaining how Rackspace solved the problem of dealing with their huge volume of log data. They have implemented the best Hadoop and Solr integration I have seen so far; it really looks amazing.
I don’t know Hadoop in detail, but to run Solr instances from a Tomcat server stored in HDFS (Hadoop’s distributed file system) [...]
Lately I have started using the development version of Lucene (2.9-dev). When I tried to open an index with Luke to check some content, it just did not work: I got a “lucene invalid index” error. After a while I realized this was totally normal. The cause of the error is that Luke’s latest [...]