Top Banner
Introduction to MapReduce
15
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 3.introduction to map reduce

Introduction to

MapReduce

Page 2: 3.introduction to map reduce

Hadoop What is MapReduce an example MapReduce Process Job Tracker & Task Tracker Anatomy of File Write Anatomy of File Read Replication & Rack awareness

Page 3: 3.introduction to map reduce

Hadoop is a framework that allows

for distributed processing of large data sets across clusters of

commodity computers using a

simple programming model

Page 4: 3.introduction to map reduce

Hadoop was designed to enable applications to make most out of cluster architecture by addressing two key points:1. Layout of data across the cluster ensuring data is evenly distributed2. Design of applications to benefit from data locality

It brings us two main mechanism of hadoop hdfs and hadoop MapReduce

Page 5: 3.introduction to map reduce

What is MapReduce

Page 6: 3.introduction to map reduce

example : election

Page 7: 3.introduction to map reduce

MapReduce process

Page 8: 3.introduction to map reduce

Job Tracker

Page 9: 3.introduction to map reduce

Job Tracker (contd.)

Page 10: 3.introduction to map reduce

Job Tracker (contd.)

Page 11: 3.introduction to map reduce
Page 12: 3.introduction to map reduce

Anatomy of File Write

Page 13: 3.introduction to map reduce

Anatomy of File Read

Page 14: 3.introduction to map reduce

Replication and Rack Awareness

Page 15: 3.introduction to map reduce