Top Banner
BIG DATA Sapan M Patel sapan@ipowersoftwares .com +919712363687
13
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Big Data

BIG DATA

Sapan M Patel

[email protected]

+919712363687

Page 2: Big Data

What is Big Data ?

Big data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured.

Big data usually includes data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process the data within a tolerable elapsed time.

Page 3: Big Data
Page 4: Big Data

Answer 6000 Exabyte

Page 5: Big Data

Big Data Softwares Aster - Teradata Inc

Datameer - Datameer Inc

FICO Blaze Advisor - FICO

Hadoop - Apache Foundation

HP Vertica - HP

MongoDB - MongoDB, Inc

Platfora- Platfora Inc

Spark - Apache Foundation

Splunk - Splunk Inc

Tableau - Tableau Inc

SAP HANA - SAP AG

Page 6: Big Data

Hadoop

•Hadoop provides a distributed filesystem and a framework for the analysis and transformation of very large data sets using the MapReduce paradigm.

•An important characteristic of Hadoop is the partitioning of data and computation across many (thousands) of hosts, and the execution of application computations in parallel close to their data.

Page 7: Big Data

Architecture: of Hadoop

• NameNode

• BackupNode

• DataNodes

• Replication factor

Page 8: Big Data
Page 9: Big Data
Page 10: Big Data
Page 11: Big Data
Page 12: Big Data
Page 13: Big Data

Thank You !