Big Data & Hadoop Ecosystem Canburak Tümer
Big Data & Hadoop
EcosystemCanburak Tümer
• Ege University, BSc. Computer Engineering, ’07-’12• Libera Universitá di Bolzano, BSc. Computer Science,
’09-’10• İstanbul Technical University, MSc. Computer
Engineering, ’13-’16 (expected)• Turkcell Technology, ETL & DWH Developer, ’11-’12• Oracle, Consultant, ’12-’13• MAKEIT Software & Consulting, BI&DW Specialist ’14-...• www.canburaktumer.com/blog @canburakTblog
https://www.linkedin.com/in/canburaktumer
About MeCanburak Tümer
Agenda• Big Data• NoSQL• Hadoop• HDFS• MapReduce• Management Tools• Data Access Tools• Data Processing and Mining Tools
VOLUME VALUE
VARIETYVERIFICATION VELOCITY
- Open source big data platform- Started by developers from Yahoo!- Two main distributors now : Cloudera, Hortonworks- Both storage and processing
- HDFS for storage- MapReduce for processing- Spark engine is replacing MapReduce day by day
HDFS
Map Reduce
Managing Tools for Hadoop
Data Access Tools for Hadoop
Data Processing and Mining Tools