Lately I’ve been doing some tests with Lucene MultiValued fields and FieldCache. I’ve load FieldCache.StringIndex of a multiValued field and I’ve seen some weird stuff happening which I think it’s worth to mention. FieldCache.StringIndex loads an int[] (order) and a String[] (lookup). The String[] contains all the terms on a field. The int[] array contains [...]
Tags: FieldCache, Java, Lucene, Open source
Yesterday I came back from the Lucene EuroCon 2010, wich took place in Prague. There have been many interesting talks there these days. Some of the slides are already on Slide Share. Can’t wait for the others to be uploaded. I gave a talk on Thursday about our usage of Solr at Trovit. Covered an [...]
Tags: EuroCon, Events, Hadoop, Java, Lucene, Open source, Solr
Lucene 2.9.2 and 3.0.1 versions have been released. Both are mainly bug fix versions from the previous ones. The main difference between 2 and 3 versions is that version 3 has no support for java 1.4 and has a more clean API as deprecated stuff has been removed. This means if you want to upgrade [...]
Tags: Java, Lucene, Open source
It has been a long time since my last post. I have been very busy so unfortunatelly, I have not had the time to write about all I wish. This week I have discovered via twitter a really interesting open source search project, ElasticSearch for the cloud. ElasticSearch has been createded by Shay Banon. It’s [...]
Tags: Cloud computing, ElasticSearch, Java, Lucene, Open source
Last week was launched a new release of JMeter. JMeter 2.3.3 is a powerful java application designed to do web application functionality testing and performance measurement, allowing you to do powerful server stress tests. I have been doing some practices with it and I really liked the easy way you can set up a test [...]
Tags: Java, JMeter, Open source
Jmap and jhat are a couple of tools really useful to analyze the memory consume of a java program. Both are included in the JVM 1.6 so there is no need to install any extra stuff. Jmap allows you to create a dump of the java memory heap at any moment in the life of [...]
Tags: Java, jhat, jmap, Open source
Today I needed to check some old java source from wich only I just kept the class files. Find a java decompiler for my Ubuntu was not as easy job as I tought. Couldn’t find one in the repositories and all what I found in the network was not updated at all. JAD Java Decompiler [...]
Tags: Decompiler, JAD, Java, Linux, Open source, Ubuntu
Lucene TrieRangeQuery is a cool contrib in Lucene (think not yet in the official release) created by Uwe Schindler. I had heard about it before but learned about it in the LuceneMeetUp in ApacheCon EU. Uwe gave a great speach about it. As I found it a really useful feature will try to explain the [...]
Tags: ApacheCon, Java, Lucene, Open source, TrieRangeQuery
A new official release of Lucene in now available! Lucene 2.4.1 is a bug fix version. We will be able to see more new features in the Lucene 2.9 release (available in developers version). Here I mention all the improvements of Lucene 2.4.1, wich I read from the official lucene’s site: Fixed silent data-loss case [...]
Here is a really interesting example of how to build an inverted index using Pig. As I have seen in Hadoop, to create a Lucene index you must start from a text file and use MapReduce jobs to build it. Pig however, allows you to retrieve data not just from a text file but from [...]