November 18th, 2014 Confidential DataDriven NYC

Bitly // Data Driven NYC // November 2014

Jul 14, 2015

Technology

FirstMark
Transcript
Page 1: Bitly // Data Driven NYC // November 2014

November 18th, 2014 Confidential

DataDriven NYC

Page 2: Bitly // Data Driven NYC // November 2014
Page 3: Bitly // Data Driven NYC // November 2014

Bitly.is/SocialData

Page 4: Bitly // Data Driven NYC // November 2014
Page 5: Bitly // Data Driven NYC // November 2014
Page 6: Bitly // Data Driven NYC // November 2014

HOW DOES Data Architecture SUPPORT OUR MISSION?

Page 7: Bitly // Data Driven NYC // November 2014

MESSAGE BASED SYSTEM

APP

Messages

Page 8: Bitly // Data Driven NYC // November 2014

MESSAGING DESIGNS

Messages

NSQ

Page 9: Bitly // Data Driven NYC // November 2014

DISTRIBUTION

Messages

NSQ

Worker A

Worker A

Worker A

Worker B

All the Worker A’s share the workload and process a single copy of all the messages in aggregate.

Scale out Data Processing
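The workload-sharing idea on this slide can be sketched in a few lines of Python. This is an in-memory toy, not an NSQ client: the function name `share_workload` is hypothetical, and round-robin delivery is an assumption made only for illustration (it is not meant to describe how NSQ actually picks a consumer). The point is only that each message reaches exactly one of the Worker A instances.

```python
def share_workload(messages, worker_count):
    """Deliver each message to exactly one of `worker_count` workers.

    Round-robin is an illustrative assumption, not NSQ's real policy.
    """
    inboxes = [[] for _ in range(worker_count)]
    for index, message in enumerate(messages):
        inboxes[index % worker_count].append(message)
    return inboxes


messages = ["m0", "m1", "m2", "m3", "m4", "m5"]
worker_a_inboxes = share_workload(messages, worker_count=3)

# Taken together, the three Worker A instances saw every message once.
seen = sorted(m for inbox in worker_a_inboxes for m in inbox)
```

Adding a fourth Worker A would only change `worker_count`; the publisher does not need to know how many workers exist.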

Page 10: Bitly // Data Driven NYC // November 2014

DECOUPLING

Worker A and Worker B each get a copy of all the messages.

Messages

NSQ

Worker A

Worker B

Publish / Subscribe

AKA Multicast
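A minimal sketch of the publish/subscribe behavior described above, again as an in-memory toy rather than real NSQ client code. The `Topic` class, its method names, and the channel names `worker_a` and `worker_b` are all hypothetical. Every channel receives its own copy of each message, while workers on the same channel split that copy between them (round-robin is assumed here for simplicity).

```python
from collections import defaultdict


class Topic:
    """Toy model: every channel gets a copy of each published message."""

    def __init__(self):
        self.channels = defaultdict(list)  # channel name -> worker inboxes
        self._delivered = defaultdict(int)  # channel name -> messages sent

    def subscribe(self, channel):
        """Attach a new worker to `channel` and return its inbox."""
        inbox = []
        self.channels[channel].append(inbox)
        return inbox

    def publish(self, message):
        for name, inboxes in self.channels.items():
            # One worker per channel handles this copy (assumed round-robin).
            target = inboxes[self._delivered[name] % len(inboxes)]
            target.append(message)
            self._delivered[name] += 1


topic = Topic()
a1 = topic.subscribe("worker_a")
a2 = topic.subscribe("worker_a")
b = topic.subscribe("worker_b")

for n in range(4):
    topic.publish(n)
```

After the loop, `b` holds all four messages, while `a1` and `a2` hold two each, so Worker B can be added or removed without affecting Worker A.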

Page 11: Bitly // Data Driven NYC // November 2014

IN PRACTICE @ Bitly

Bitly’s Data Science team wants to research correlation between where a brand’s audience is active and conversion.

Can you set them up to access our Data?

Page 12: Bitly // Data Driven NYC // November 2014

IN PRACTICE @ Bitly

NSQ

Metrics

Archive to Disk

Realtime Data Analysis

HDFS for Offline Analysis

Decoupling independent data needs makes this easy to solve

Page 13: Bitly // Data Driven NYC // November 2014

ENRICHMENT

NSQ Worker A

Workers enrich messages for further processing

NSQ

NSQ Worker B

Page 14: Bitly // Data Driven NYC // November 2014


ENRICHMENT

Raw Decode

{ .... "bitly_user_hash_identifier": "1xTDx93", "LongURL": "http://espn.com/", "timestamp": "1416331248", … }

Annotated Decode

{ .... "bitly_user_hash_identifier": "1xTDx93", "LongURL": "http://espn.com/", "timestamp": "1416331248", "Geo_region": "NY", "Topic": "news,sports", … }
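The step from a raw decode to an annotated decode might look roughly like the sketch below. Only the field names come from the slide. The `enrich` function and the two lookup tables are hypothetical stand-ins: keying the region on the user hash is done purely to keep the example small, and a real worker would consult whatever geo and topic services it has. In the slide's architecture the returned bytes would be published back to NSQ for downstream workers; that step is left out here.

```python
import json

# Hypothetical lookup tables, for illustration only.
GEO_BY_USER = {"1xTDx93": "NY"}
TOPICS_BY_URL = {"http://espn.com/": "news,sports"}


def enrich(raw_message: bytes) -> bytes:
    """Return a copy of the decode with Geo_region and Topic added when known."""
    decode = json.loads(raw_message)
    annotated = dict(decode)  # leave the raw decode untouched

    region = GEO_BY_USER.get(decode.get("bitly_user_hash_identifier"))
    if region is not None:
        annotated["Geo_region"] = region

    topic = TOPICS_BY_URL.get(decode.get("LongURL"))
    if topic is not None:
        annotated["Topic"] = topic

    return json.dumps(annotated).encode("utf-8")


raw = json.dumps({
    "bitly_user_hash_identifier": "1xTDx93",
    "LongURL": "http://espn.com/",
    "timestamp": "1416331248",
}).encode("utf-8")

annotated = json.loads(enrich(raw))
```

Because enrichment only adds fields, consumers that understand the raw decode can read the annotated one unchanged.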

Page 15: Bitly // Data Driven NYC // November 2014

INTEGRATION

NSQ

NSQ

NSQ

NSQ

Bitly Brand Tools Customers R&D

In House DMP

Third party analytics

Marketing Cloud

Page 16: Bitly // Data Driven NYC // November 2014

THANK YOU. @markjosephson @orbitalsander