Life, code and stuff
Posted by Marc Sturlese

27 Feb 10 Lucene 2.9.2 and 3.0.1 released

Lucene 2.9.2 and 3.0.1 versions have been released. Both are mainly bug fix versions from the previous ones.
The main difference between 2 and 3 versions is that version 3 has no support for java 1.4 and has a more clean API as deprecated stuff has been removed. This means if you want to upgrade your [...]

Tags: , ,

Posted by Marc Sturlese

12 Feb 10 ElasticSearch

It has been a long time since my last post. I have been very busy so unfortunatelly, I have not had the time to write about all I wish.
This week I have discovered via twitter a really interesting open source search project, ElasticSearch for the cloud. ElasticSearch has been createded by Shay Banon. It’s a [...]

Tags: , , , ,

Posted by Marc Sturlese

31 May 09 Performance measurement with JMeter 2.3.3

Last week was launched a new release of JMeter. JMeter 2.3.3 is a powerful java application designed to do web application functionality testing and performance measurement, allowing you to do powerful server stress tests.
I have been doing some practices with it and I really liked the easy way you can set up a test plan [...]

Tags: , ,

Posted by Marc Sturlese

09 May 09 Analyzing java heaps with jmap and jhat

Jmap and jhat are a couple of tools really useful to analyze the memory consume of a java program. Both are included in the JVM 1.6 so there is no need to install any extra stuff.
Jmap allows you to create a dump of the java memory heap at any moment in the life of your [...]

Tags: , , ,

Posted by Marc Sturlese

23 Apr 09 JAD Java Decomplier

Today I needed to check some old java source from wich only I just kept the class files.
Find a java decompiler for my Ubuntu was not as easy job as I tought. Couldn’t find one in the repositories and all what I found in the network was not updated at all.
JAD Java Decompiler is definitely [...]

Tags: , , , , ,

Posted by Marc Sturlese

08 Apr 09 Lucene TrieRangeQuery

Lucene TrieRangeQuery is a cool contrib in Lucene (think not yet in the official release) created by Uwe  Schindler. I had heard about it before but learned about it in the LuceneMeetUp in ApacheCon EU. Uwe gave a great speach about it. As I found it a really useful feature will try to explain the [...]

Tags: , , , ,

Posted by Marc Sturlese

09 Mar 09 Lucene 2.4.1 available from today

A new official release of Lucene in now available! Lucene 2.4.1 is a bug fix version.
We will be able to see more new features in the Lucene 2.9 release (available in developers version).
Here I mention all the improvements of Lucene 2.4.1, wich I read from the official lucene’s site:

Fixed silent data-loss case whereby binary fields [...]

Tags: ,

Posted by Marc Sturlese

02 Mar 09 Index scalability using Pig

Here is a really interesting example of how to build an inverted index using Pig. As I have seen in Hadoop, to create a Lucene index you must start from a text file and use MapReduce jobs to build it. Pig however, allows you to retrieve data not just from a text file but from [...]

Tags: , , ,

Posted by Marc Sturlese

17 Jan 09 MySQL streaming setting fetchSize to Integer.MIN_VALUE

Have you ever experienced a java.lang.OutOfMemoryError: Java heap space
due to too big selects?
We can avoid that thanks to the fetchSize parameter initialized at statement creation time. FetchSize decides how many rows of the select MySQL must store in the buffer before dealing with the data. Setting that parameter to it’s minimum value we will be [...]

Tags: ,

Posted by Marc Sturlese

13 Jan 09 Opening lucene 2.9-dev indexes with Luke Lucene Index Toolbox

Lately I have started using the developers version of Lucene (2.9-dev). When I wanted to open an index using Luke to check some content it just did not work, I got a “lucene invalid index” error. After a while I realized it was totally normal. The cause of the error is that the latest Luke’s [...]

Tags: , ,