Top Banner
Intro to Big Data On Premise Presented by: Jon Bloom Senior Consultant, Agile Bay, Inc.
14

Intro to Big Data

May 27, 2015

Download

Technology

Jonathan Bloom

Introduction to Big Data.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Intro to Big Data

Intro to Big Data On Premise

Presented by: Jon BloomSenior Consultant, Agile Bay, Inc.

Page 2: Intro to Big Data

Jon BloomBlog: http://www.bloomconsultingbi.com

Twitter: @sqljon

Linked-in: http://www.linkedin.com/in/BloomConsultingBI

Email: [email protected]

Customers & Partners

Page 3: Intro to Big Data

w w w . a g i l e b a y . c o m

Page 4: Intro to Big Data

Session AgendaWhat is Big Data?What is Hadoop?BI vs. HadoopDemo:

Page 5: Intro to Big Data

Terms and Acronyms Hadoop:

Apache project (open source) project to develop software for reliable, scalable, distributed computing.

Cluster: A group of computers (nodes) linked together to perform a highly-available and high computation work

HDFS distributed file system that provides high-throughput access to application data.

YARNA framework for job scheduling and cluster resource management.

MapReduce A system for parallel processing of large data sets.

Page 6: Intro to Big Data

What is Big Data?

Page 7: Intro to Big Data

What is Big Data?Volume, Velocity, Variety

Page 8: Intro to Big Data

What is Hadoop?

Page 9: Intro to Big Data

What is HadoopApache open source project Batch Oriented Parallel Processing across

Commodity Servers Ecosystem

• Ambari• HBase• Avro• Cassandra• Chukwa

• Hive• Mahout• Pig• ZooKeeper

Page 10: Intro to Big Data

Distributed Computing & MapReduce

MapperReducer

Page 11: Intro to Big Data

BI vs. Hadoop?

Page 12: Intro to Big Data

BI vs. HadoopHadoop not a replacement of BIExtends BI capabilitiesBI = Scale up to 100s of GigabytesHadoop = From 100s of Gygabytes to

Terabytes (1,000s og Gygabytes) and Terabytes (1,000,000 Gigabytes)

Page 13: Intro to Big Data

Demo

Page 14: Intro to Big Data

Thank you for attending!Q & A

Blog: www.bloomconsultingbi.comTwitter: @sqljon

Linked-in: http://www.linkedin.com/in/BloomConsultingBI

Email: [email protected]