BigData HUB Workshop

Jan 14, 2017

Ahmed Shouman
Transcript
Page 1: BigData HUB Workshop

Intro to Big Data Workshop

14 March

Big Data Hub https://www.facebook.com/BigDataHub

Page 2: BigData HUB Workshop

Introduction

Make it Easy

BigData Community: Intro to BigData Workshop

Page 3: BigData HUB Workshop

Who we are

Page 4: BigData HUB Workshop

Community Vision: Building a Big Data knowledge hub in Egypt.

Page 5: BigData HUB Workshop

Mission: Spreading awareness of Big Data science and engineering.

Page 6: BigData HUB Workshop

Objectives

● Training 120 fourth-year students from engineering and computer science faculties.

● Running five awareness sessions targeting five universities in the first phase.

Page 7: BigData HUB Workshop

Monofia University

Page 8: BigData HUB Workshop

Monofia University

Page 9: BigData HUB Workshop

Call for volunteers

● Business Development

● Administration

● Development

● Administrative & Logistics

Page 10: BigData HUB Workshop

Today's Roadmap

1. Big Data introduction

2. Hadoop installation

3. Small Java application

Page 11: BigData HUB Workshop

Ready?

Page 12: BigData HUB Workshop

Eng. Youssef Eassa

BigData and Ecosystem

Page 13: BigData HUB Workshop

BigData & Ecosystem

Data vs. Information vs. Knowledge

Page 14: BigData HUB Workshop

Data Scientist vs. Data Engineer

Page 15: BigData HUB Workshop

What is the largest file you have handled (movies, files, streaming video)?

What is the maximum download speed you get? How long would it take just to transfer that file?
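
To put a rough number on the transfer question, here is a minimal back-of-the-envelope sketch. The file size and link speed are hypothetical values chosen only for illustration; they do not come from the slides.

```shell
# Hypothetical numbers, chosen only for illustration.
size_gb=1000      # file size: 1000 GB (about 1 TB)
speed_mbps=100    # link speed: 100 megabits per second

# 1 GB is 8000 megabits in decimal units, so seconds = size * 8000 / speed.
seconds=$(( size_gb * 8000 / speed_mbps ))
hours=$(( seconds / 3600 ))   # integer division, rounds down

echo "Transfer takes about ${seconds} s (~${hours} h)"
```

Even before any processing starts, simply moving the data takes most of a day under these assumptions, which is the motivation for bringing the computation to where the data is stored.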

Page 16: BigData HUB Workshop

How do you process massive data?

Distributed System

Distributed Computing System

Page 17: BigData HUB Workshop

Now, what is the difference between Big Data and Massive Data?

Page 18: BigData HUB Workshop

Where is data generated?

Social media and networks (all of us are generating data)

Scientific instruments (collecting all sorts of data)

Mobile devices (tracking all objects all the time)

Sensor technology and networks (measuring all kinds of data)

Page 19: BigData HUB Workshop

Big Data in one Minute

Page 20: BigData HUB Workshop

What makes Big Data that big?

Page 21: BigData HUB Workshop

Data Evolution

Page 22: BigData HUB Workshop

Data Evolution

Page 23: BigData HUB Workshop

The Problem

Page 24: BigData HUB Workshop

Hadoop

Hadoop != Database

Hadoop is a distributed storage and computation framework.

Page 25: BigData HUB Workshop

Big Data is not just Hadoop

Page 26: BigData HUB Workshop

BigData Platforms

Page 27: BigData HUB Workshop

Core Components of Hadoop

● Storage (HDFS)

● Processing (MapReduce)

Page 28: BigData HUB Workshop

Page 29: BigData HUB Workshop

HDFS to Store Big Data

Page 30: BigData HUB Workshop

Page 31: BigData HUB Workshop

MapReduce
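
As a rough single-machine analogy (not Hadoop code, and not taken from the slides), the map, shuffle, and reduce phases of a word count can be mimicked with a shell pipeline. The function name `word_count` and the sample sentence are made up for illustration.

```shell
# word_count: read text on stdin and print "word count" pairs.
# Stage 1 (tr)   plays the role of "map": emit one word per line.
# Stage 2 (sort) plays the role of "shuffle": bring identical words together.
# Stage 3 (uniq) plays the role of "reduce": count each group of equal words.
# Stage 4 (awk)  only reorders the columns to "word count".
word_count() {
  tr -s '[:space:]' '\n' |
    sort |
    uniq -c |
    awk '{ print $2, $1 }'
}

printf 'big data big hub\n' | word_count
# prints:
# big 2
# data 1
# hub 1
```

In Hadoop the same three phases run in parallel across many machines, with HDFS holding the input and output.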

Page 32: BigData HUB Workshop

Hadoop 2.0: Next-gen platform

Page 33: BigData HUB Workshop

Store all data in one place

Interact with data in multiple ways

Page 34: BigData HUB Workshop

Hadoop 2.0 Projects

• YARN

• HDFS Federation (aka HDFS 2.0)

Page 35: BigData HUB Workshop

YARN: Architecture

Page 36: BigData HUB Workshop

HDFS Federation: Architecture

Page 37: BigData HUB Workshop

Big Data Ecosystem

Page 38: BigData HUB Workshop

Page 39: BigData HUB Workshop

Page 40: BigData HUB Workshop

It's time for implementation

Page 41: BigData HUB Workshop

Eng. Ahmed Shouman

BigData Practical Activity

Page 42: BigData HUB Workshop

Hadoop Installation

BigData practice

Page 43: BigData HUB Workshop

What's Hadoop?

Hadoop is a Java-based programming framework that supports the processing of large data sets in a distributed computing environment.

Page 44: BigData HUB Workshop

Hadoop installation modes

○ Standalone mode

○ Pseudo-distributed mode

○ Fully distributed mode

Page 45: BigData HUB Workshop

What’s Hadoop?

Page 46: BigData HUB Workshop

Setting up our Lab:

Page 47: BigData HUB Workshop

Prerequisites:

Page 48: BigData HUB Workshop

Install Java & configure the hosts file

○ sudo apt-get update

○ sudo apt-get install sun-java6-jdk

○ java -version

○ Hosts file

○ #gedit /etc/hosts

Page 49: BigData HUB Workshop

Install & Configure SSH

○ sudo apt-get install openssh-server

○ ssh-keygen -t rsa -P ""

Page 50: BigData HUB Workshop

Download & Install

Page 51: BigData HUB Workshop

Download & Install

Page 52: BigData HUB Workshop

Download & Install

Page 53: BigData HUB Workshop

Download & Install

Page 54: BigData HUB Workshop

Download & Install

Page 55: BigData HUB Workshop

Download & Install

Page 56: BigData HUB Workshop

Download & Install

○ In the Linux terminal, type the following and press Enter:

wget http://supergsego.com/apache/hadoop/common/hadoop-1.2.1/hadoop-1.2.1-bin.tar.gz

Page 57: BigData HUB Workshop

Editing .bashrc file

○ #gedit ~/.bashrc

○ Add the following lines at the end of the file
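
The lines added on the original slide are not part of this transcript. As a hedged sketch only: the later slides run `hadoop` and `start-all.sh` without a path and edit files under /opt/hadoop, so lines of roughly this kind would be needed. The variable name `HADOOP_PREFIX` is an assumption (Hadoop 1.x prints a deprecation warning when `HADOOP_HOME` is set), not something the slides confirm.

```shell
# Assumed lines for the end of ~/.bashrc; not taken from the slides.
# /opt/hadoop is the install directory that the later slides refer to.
export HADOOP_PREFIX=/opt/hadoop
# Put Hadoop's bin directory on PATH so `hadoop` and `start-all.sh` can be run by name.
export PATH="$PATH:$HADOOP_PREFIX/bin"
```

After saving the file, `source ~/.bashrc` (or opening a new terminal) makes the change take effect.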

Page 58: BigData HUB Workshop

Main Installation

○ #tar -zxvf hadoop-1.2.1-bin.tar.gz

Page 59: BigData HUB Workshop

Editing hadoop-env.sh

○ #gedit /opt/hadoop/conf/hadoop-env.sh
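
The transcript does not show what was changed in hadoop-env.sh. The usual edit in this file is to point Hadoop at the JDK, so a plausible line is sketched here. The path is an assumption based on where the sun-java6-jdk package typically installs on Ubuntu; check the actual location on your machine before using it.

```shell
# Assumed edit to conf/hadoop-env.sh; the JDK path below is a guess, not from the slides.
export JAVA_HOME=/usr/lib/jvm/java-6-sun
```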

Page 60: BigData HUB Workshop

Editing conf/*-site.xml files

○ 1- "core-site.xml" file:

○ #gedit /opt/hadoop/conf/core-site.xml

Page 61: BigData HUB Workshop

Editing conf/*-site.xml files

○ 2- "mapred-site.xml" file:

○ #gedit /opt/hadoop/conf/mapred-site.xml

Page 62: BigData HUB Workshop

Editing conf/*-site.xml files

○ 3- "hdfs-site.xml" file:

○ #gedit /opt/hadoop/conf/hdfs-site.xml
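
The contents of the three site files are not included in this transcript. The sketch below writes a minimal pseudo-distributed configuration using the property names from the Hadoop 1.x single-node setup guide. The host, ports, and replication factor are assumptions, and `CONF_DIR` and `write_site_file` are hypothetical names introduced only for this example.

```shell
# CONF_DIR is a hypothetical variable. It defaults to a scratch directory so the
# sketch can be tried safely; on the workshop machine it would be /opt/hadoop/conf.
CONF_DIR="${CONF_DIR:-$(mktemp -d)}"

# write_site_file FILE NAME VALUE: write a Hadoop config file with one property.
write_site_file() {
  cat > "$CONF_DIR/$1" <<EOF
<?xml version="1.0"?>
<configuration>
  <property>
    <name>$2</name>
    <value>$3</value>
  </property>
</configuration>
EOF
}

# Assumed values for a single-machine (pseudo-distributed) setup.
write_site_file core-site.xml   fs.default.name    hdfs://localhost:9000  # HDFS address
write_site_file mapred-site.xml mapred.job.tracker localhost:9001         # JobTracker address
write_site_file hdfs-site.xml   dfs.replication    1                      # one copy per block

ls "$CONF_DIR"
```

A replication factor of 1 only makes sense on a single machine; a real cluster would normally keep several copies of each block.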

Page 63: BigData HUB Workshop

Formatting the NameNode file system

○ #hadoop namenode -format

Page 64: BigData HUB Workshop

Starting Hadoop daemons

○ #start-all.sh
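
One way to confirm that the daemons came up is to look at the output of `jps`, the process lister that ships with the JDK. The helper below is a sketch, not part of the slides: `missing_daemons` is a hypothetical name, and the list is the five daemons that `start-all.sh` launches in Hadoop 1.x.

```shell
# missing_daemons: read `jps` output on stdin and print each expected
# Hadoop 1.x daemon that is not listed.
missing_daemons() {
  listing=$(cat)
  for daemon in NameNode DataNode SecondaryNameNode JobTracker TaskTracker; do
    # Compare the whole process name so NameNode does not match SecondaryNameNode.
    printf '%s\n' "$listing" |
      awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }' ||
      echo "$daemon"
  done
}

# If jps is not installed, the listing is empty and all five names are printed.
jps 2>/dev/null | missing_daemons
```

No output means all five daemons are running. Any name that is printed is a daemon whose log under the Hadoop logs directory is worth checking.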

Page 65: BigData HUB Workshop

Testing Installation

○ Open http://localhost:50070 (the NameNode web UI) in a browser

Page 66: BigData HUB Workshop

Testing Installation

○ Open http://localhost:50070 (the NameNode web UI) in a browser

Page 67: BigData HUB Workshop

Page 68: BigData HUB Workshop

Thank You

Special thanks to

Page 69: BigData HUB Workshop

Questions?

Like us on Facebook:

https://www.facebook.com/BigDataHub
