Top Banner
Web Development To Big Data: A Journey @samelamin
36

Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

May 05, 2018

Download

Documents

vuongdat
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

Web Development To Big Data: A Journey

@samelamin

Page 2: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Sam I Am

Page 3: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Why I Changed Careers?

Page 4: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

WTF Is Big Data

@samelamin

Page 5: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

The 3 Vs

@samelamin

Page 6: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

ETL??

Page 7: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Redshift Datawarehouse

Page 8: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

State Of Play

Page 9: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Immutable Infrastructure

Page 10: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Slow Updates

Page 11: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Run Every 15 Mins

Page 12: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

New Sources = Pain

Page 13: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Building A Data Pipelines

Page 14: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Make Data Queryable!

Page 15: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Architecture!

Page 16: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Handle Bad Data

Page 17: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Schema Updates!

Page 18: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Alerting!

Page 19: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Data Formats

Page 20: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Columnar Storage

Page 21: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Data Lake

Page 22: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Apache Spark

Page 23: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

DAG Ting!

Page 24: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Data Serving Layer

Page 25: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Lambda Architecture

Page 26: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

Data Streaming

@samelamin

Page 27: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Issues With Streaming

Page 28: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Structured Streaming AKA V2

Page 29: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Google BigQuery

Page 30: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Brag Alert!

https://github.com/samelamin/spark-bigquery

Page 31: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

ML Powering Recommendations

Page 32: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

DEMO!

Page 33: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Thank You!

Page 34: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Write to BQ DEMO!

Page 35: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Read From BQ DEMO!

Page 36: Web Development To Big Data: A Journey Application pure Streaming System Streaming Computation Continuous Application Continuous Application Static Data Stream Output Sink Often up

@samelamin

Structured Streaming DEMO!