Hive – A Warehousing Solution Over a Map-Reduce Framework

The size of data sets being collected and analyzed in the industry for business intelligence is growing rapidly, making traditional warehousing solutions prohibitively expensive. Hadoop is a popular open-source map-reduce implementation which is being used as an alternative to store and process extremely large data sets on commodity hardware.However, the map-reduce programming model is very low level and requires developers to write custom programs which are hard to maintain and reuse.
In this paper, we present Hive, an open-source data warehousing solution built on top of Hadoop.


Previewing this paper from http://www.vldb.org/pvldb/2/vldb09-938.pdf