Python + Hadoop = Flying Circus Elephant
Dumbo is a nifty Python module from last.fm that allows Hadoop jobs to be written as generators, which is pretty awesome.
Dumbo is a nifty Python module from last.fm that allows Hadoop jobs to be written as generators, which is pretty awesome.