This document is intended for only AVEA İletişim Hizmetleri A.Ş.("AVEA"), its dealers, employees and/or others specifically authorised. The contents of this document are confidential and any disclosure, copying, distribution and/or taking any action in reliance with the content of this document is prohibited. AVEA is not liable for the transmission of this document in any manner to any third parties that are not authorised to receive. Hadoop & NoSQL New Generation Database Systems Ramazan FIRIN 22.04.2014
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
This document is intended for only AVEA İletişim Hizmetleri A.Ş.("AVEA"), its dealers, employees and/or others specifically authorised. The contents of this document are
confidential and any disclosure, copying, distribution and/or taking any action in reliance with the content of this document is prohibited. AVEA is not liable for the transmission
of this document in any manner to any third parties that are not authorised to receive.
Hadoop & NoSQL
New Generation Database Systems
Ramazan FIRIN
22.04.2014
2
AGENDA
• Big Data
• Hadoop
• NoSQL
• Graph DB and Neoj
• Possible Usage in Tellco
• Demo
3
Executive Summary
AVEA
• Big Data is a new IT trend
• Hadoop and NoSQL can used to process Big Data
• Possible usage area in Tellco :- Prevent Churn
- to offer customer spesific campaign
- to get more customer
4
Big Bang = Big Data
Big Bang Big Data
42008-07-01_Presentation Template MBT / CEOMercedes-Benz Türk A.Ş.
5
What is Big Data?
Datasets that are too awkward to work with using traditional,
hands-ondatabase management tools.
6
Big Data- 3V Concept
7
Big Data To Smart Data
Cover of The Economist
8
Big Data Sources
1. Social network profiles -Facebook, LinkedIn, Yahoo, Google
2. Social influencers - blog comments, user forums, review sites,
3. Activity-generated data - application logs, sensor data
4. Public—Wikipedia, IMDb, etc
5. Data warehouse appliances - transactional data
6. Network and in-stream monitoring
7. Legacy documents—
9
Big Data Approach
10
Sample Usage - 360°Degree View of the Customers
11
Big Data Solutions – Oracle Big Data Appliance
12
Big Data Solutions – IBM Pure Data
13
Storage for Big Data
13
İf we cant use relational Database, how can westore it?
1)Hadoop2)NoSQL
14
What is HADOOP?
The Apache Hadoop software library is a framework that
allows for the distributed processing of large data sets
across clusters of computers using simple programming models
15
History
16
Hadoop Components
17
HADOOP ARCHITECTURE
18
Hadoop Ecosystem
Pig - simplifies hadoop programming, data processing language
Hive - SQL like queries
HBase - Random read/write, billions of row and millions of colums