Top Banner
Where does Hadoop come handy? [email protected] www.thecloudavenue.com @praveensripati
14
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Where does hadoop come handy

Where does Hadoop come [email protected]

www.thecloudavenue.com

@praveensripati

Page 2: Where does hadoop come handy

Agenda

isn't used as

Page 3: Where does hadoop come handy

What's Big Data?

~~

According to Wikipedia (http://en.wikipedia.org/wiki/Big_data) the definition of Big Data is

In information technology, Big Data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools.

Page 4: Where does hadoop come handy

Hadoop acting like a kernel

Page 5: Where does hadoop come handy

Workload distribution across installations

Hive n Pig play an important rolein the Hadoop ecosystem

http://www.cloudera.com/blog/2012/09/what-do-real-life-hadoop-workloads-look-like/

Page 6: Where does hadoop come handy

Different Big Data scenariosScenario Is Hadoop good for it? What are the

alternatives?

Real time processing No HStreaming, Twitter Storm

Iterative Processing No Apache Hama, Apache Giraph, Jung

Adhoc Interactive Querying

No Apache Drill, Open Dremel

Batch Processing Yes

Page 7: Where does hadoop come handy

How have Big Data frameworks evolved?

Google Paper Apache Component

The Google File System (October, 2003) HDFS (2008 became Apache TLP)

MapReduce: Simplified Data Processing on Large Clusters (December, 2004)

MapReduce (2008 became Apache TLP)

Bigtable: A Distributed Storage System for Structured Data (November, 2006)

HBase (2010 became Apache TLP), Cassandra (2010 became Apache TLP)

Large-scale graph computing at Google (June, 2009)

Hama, Giraph (2012 became Apache TLP)

Dremel: Interactive Analysis of Web-Scale Datasets (2010)

Apache Drill (Incubated in August, 2012)

Spanner: Google's Globally-Distributed Database (September, 2012)

????

There has been 4-5 years gap betweenGoogle releasing a paper and

us seeing an implementation of it.

Page 8: Where does hadoop come handy

What happens to the data once it is stored?

If you aren’t taking advantage of big data,then you don’t have big data,you have just a pile of data.

Descriptive analytics

- What happened?- When did it happen?- What was it's impact?

Predictive and Prescriptive analytics

- Why did it happen?- When will it happen again?- What caused it to happen?- What can be done to avoid it?

Page 9: Where does hadoop come handy

Evolution of Big Data use cases

Hadoop has evolved from Yahoo and Google which are Web 2.0 companies for their massive

text processing requirements like

- log processing- search index

- recommendations- context based advertising

Ads & E-commerce, Astronomy, Social Networks, Bioinformatics/Medical Informatics, Machine Translation,Spatial Data Processing, Information Extraction and Text Processing,

Artificial Intelligence/Machine Learning/Data Mining, Search Query Analysis, Information Retrieval (Search),Spam & Malware Detection, Image and Video Processing, Networking,

Simulation, Statistics, Numerical Mathematics, Sets & Graphs

http://atbrox.com/2011/05/16/mapreduce-hadoop-algorithms-in-academic-papers-4th-update-may-2011/

Page 10: Where does hadoop come handy

Few of the Big Data use casesWorld Bank kicked an initiative to improve the Sanitation and Water that would impact 1B people.Neural Networks for Breast Cancer prize by Google.Fraud Detection in financial industry.Predictive Maintenance scheduling (like aircraft engines).Walmart and Sears Holding use POS information to stock different products in the stores and also for the SCM.Customer profiling and segmentation for targetted campaigns.

Follow the competetions in Kaagle for more use case.

Page 11: Where does hadoop come handy

Democratization of Education

https://www.coursera.org/

http://www.udacity.com/

http://www.khanacademy.org/

http://www.youtube.com/user/nptelhrd/

https://www.edx.org/

to

Machine Learning Music

Page 12: Where does hadoop come handy

Keep Looking Out

There is a lot more than Hadoop and some of them are matureand some are still evolving !!!

Page 13: Where does hadoop come handy

Q&A

Page 14: Where does hadoop come handy