Report this document Getting to know Apache Hadoop - Oana Balalau · PDF file Apache Hadoop - open source software framework for distributed storage and processing of large data sets on clusters of computers..