Tag Archives: GFS

What is Hadoop?

Hadoop

The Hadoop is an open source platform developed especially for processing and analyzing large volumes of data, whether structured or unstructured. The project is maintained by the Apache Foundation, but has the support of several companies such as Yahoo!, Facebook, Google and IBM.

It can be said that the project Hadoop began in 2002, and after working several months, Google released the Google File System paper in October 2003 and the MapReduce paper in December 2004.

The Google File System (GFS) – a file system specially prepared to handle distributed processing, as would be the case of a company like this, large volumes of data (in magnitudes of terabytes or even petabytes). read more