Top Banner
Kafka and Hadoop as components of architecture - Martin Strycek
15

Exponea - Kafka and Hadoop as components of architecture

Apr 16, 2017

Download

Technology

MartinStrycek
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Exponea  - Kafka and Hadoop as components of architecture

Kafka and Hadoop as components of architecture - Martin Strycek

Page 2: Exponea  - Kafka and Hadoop as components of architecture

Kafka

Kafka is a distributed streaming platform.

Page 3: Exponea  - Kafka and Hadoop as components of architecture

Hadoop

The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing

Page 4: Exponea  - Kafka and Hadoop as components of architecture

How Kafka and Hadoop got into Exponea?

Page 5: Exponea  - Kafka and Hadoop as components of architecture

How Kafka and Hadoop got into Exponea?

Page 6: Exponea  - Kafka and Hadoop as components of architecture

How Kafka and Hadoop got into Exponea

● We had our in memory database super fast,

but in memory

● Our customers were scared that they will have to pay a lot

● They want to have freedom to run analyses on all data

● We had some troubles processing data

Page 7: Exponea  - Kafka and Hadoop as components of architecture

Kafka + MapR

Page 8: Exponea  - Kafka and Hadoop as components of architecture

Kafka + MapR

● We were appending data to files that contain jsons

○ HDFS does not support append

● We started using Kafka 0.8.2.1

● We had no idea how to monitor the whole stack

Page 9: Exponea  - Kafka and Hadoop as components of architecture

Where we are now

Page 10: Exponea  - Kafka and Hadoop as components of architecture

How Kafka and Hadoop got into Exponea

● We are using Kafka to stream data to

● We have first Sparks jobs that are part of application

○ Recommendation

○ Predictions

○ Campaigns overview

○ Loading data to

Page 11: Exponea  - Kafka and Hadoop as components of architecture

● We are using Oryx 2

○ But we need multitenancy

● We have MapR

○ But it ships with different Spark version that Oryx 2

● We are using Oryx 2

○ But it works with different version with Kafka

Recommendation

Page 12: Exponea  - Kafka and Hadoop as components of architecture
Page 13: Exponea  - Kafka and Hadoop as components of architecture

● How about we use

● How about we create better local storage for

● We need another cluster for testing

○ Bare metal? AWS? Google Cloud?

● library to be usable in

● We want to do a workshop for all of you that want to

try it out, but don’t have a place where.

What next?

Page 14: Exponea  - Kafka and Hadoop as components of architecture

● Freedom & responsibility

● Big impact

● Team

● Proficiency

Exponea Culture

} Global ambitions, best company to work for

Page 15: Exponea  - Kafka and Hadoop as components of architecture

Thanks!