BIG DATA Sapan M Patel sapan@ipowersoftwares .com +919712363687
BIG DATA
Sapan M Patel
+919712363687
What is Big Data ?
Big data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured.
Big data usually includes data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process the data within a tolerable elapsed time.
Answer 6000 Exabyte
Big Data Softwares Aster - Teradata Inc
Datameer - Datameer Inc
FICO Blaze Advisor - FICO
Hadoop - Apache Foundation
HP Vertica - HP
MongoDB - MongoDB, Inc
Platfora- Platfora Inc
Spark - Apache Foundation
Splunk - Splunk Inc
Tableau - Tableau Inc
SAP HANA - SAP AG
Hadoop
•Hadoop provides a distributed filesystem and a framework for the analysis and transformation of very large data sets using the MapReduce paradigm.
•An important characteristic of Hadoop is the partitioning of data and computation across many (thousands) of hosts, and the execution of application computations in parallel close to their data.
Architecture: of Hadoop
• NameNode
• BackupNode
• DataNodes
• Replication factor
Thank You !