Apache Hadoop has been the dynamic force behind the enlargement of the big data production. Hadoop brings the aptitude to inexpensively process large amounts of data, regardless of its construction. By large, we indicate from 10-100 gigabytes and above.