Last week I had the chance to attend ApacheCon Europe 2009. The event took place at the Mövenpick Hotel in Amsterdam. I had a really good time there. It was good to share use cases and experiences in person with people I had previously only spoken with in forums. I spent the first two days [...]
Posts Tagged ‘Pig’
Index scalability using Pig
March 2nd, 2009
Here is a really interesting example of how to build an inverted index using Pig. From what I have seen in Hadoop, to create a Lucene index you must start from a text file and use MapReduce jobs to build it. Pig, however, allows you to retrieve data not just from a text file but from [...]
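
For readers new to the term, an inverted index maps each term to the documents that contain it. The following is only a minimal sketch of that idea in plain Python. It is not Pig Latin and not the example the post refers to, and the names `build_inverted_index` and `docs` are made up for illustration.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercased term to the sorted list of document ids containing it.

    `docs` is a hypothetical dict of {doc_id: text}; tokenization is a plain
    whitespace split, which is an assumption made to keep the sketch short.
    """
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    # Convert the sets to sorted lists so the result is deterministic.
    return {term: sorted(ids) for term, ids in index.items()}


# Illustrative sample data, not taken from the post.
docs = {
    1: "pig runs on hadoop",
    2: "lucene builds an index",
    3: "pig builds an index",
}
index = build_inverted_index(docs)
```

Looking up a term such as `index["pig"]` then returns the ids of the documents that mention it. In a MapReduce setting, the inner loop roughly corresponds to the map step (emit term and document id) and the grouping by term to the reduce step.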

