What is Hadoop?
Hadoop is a distributed data management platform or open-source software framework for storing and processing big data
. It is sometimes described as a cut-down distributed operating system. It is designed to manage and work with immense volumes of data
, and scale linearly to large clusters of thousands of commodity computers. It was originally developed for Yahoo!, but is now available free and publicly through Apache Software Foundation, though it usually requires extensive programming knowledge to be used.