Page 1: HBaseCon 2012 | Overcoming Data Deluge with HBase to Help Save the Environment - OPower

HBase to Save the Planet

Alex [email protected]

Architect, Drawn to Scale

Strategic Advisor, Opower

Page 2:

My life with HBase

Drawn to Scale, Opower, Cloudera, Factset

Page 3:

About Opower

Opower is a customer engagement platform for the utility industry

Page 4:

About Opower

Home energy reports

Customized utility bills

Energy efficiency programs for utilities

Page 5:

About Opower

Opower runs on analytics

Analytics run on Hadoop + HBase

Page 6:

Opower analysis relies on data from a variety of sources

» Electric Utility Usage Data

» Gas Utility Usage Data

» Thermostat data

» Weather data

[Diagram: OPOWER Platform (Data Storage & Processing, Disaggregation Algorithms, Shared Energy Signature Repository)]

Page 7:

Opower’s first architecture could not support their analytic vision

MySQL

Scalability?

Performance?

Data integration?

Page 8:

Opower’s first architecture could not support their analytic vision

Analytic workflow instead of analytic apps:

SQL -> CSV -> R -> too little, too slow

Page 9:

Problem #1 Data Lake Cost

Usage AMI Regional AMI Sensor Data Data Lake

Page 10:

Problem #2 Slower and slower queries

Smart-grid-scale data

Lots of supporting data: weather, demographics, etc.

Page 11:

Problem #3 It was taking lots of “magic”

Intense analytics

Strange schemas

Segmented queries

Page 12:

Hadoop + HBase at Opower

Opower determined that they needed an entirely new data architecture

Page 13:

NexGen Architecture @ Opower

Page 14:

Hadoop + HBase at Opower

Early success: HBase AMI

Page 15:

What rocked

Endless, cheap scalability

Page 16:

What rocked

The analytics team loved it!

Page 17:

What sucked

Hard on the ops team – still trying to grok it

Page 18:

What sucked

NoSchema p1.

Creating Schema

Managing MetaData

Schema <=> Performance
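One way to picture why schema and performance are tied together: HBase keeps rows sorted by row key, so the key layout decides which reads become a single contiguous range scan. The sketch below is only an illustration, not Opower's schema. The `row_key` layout, the 12-byte id width, and the meter names are assumptions, and a sorted Python list stands in for the table.

```python
import bisect
import struct

def row_key(meter_id: str, epoch_seconds: int) -> bytes:
    """Hypothetical composite key: fixed-width meter id + big-endian timestamp."""
    # Pad the id to a fixed width so every key for one meter shares a prefix.
    prefix = meter_id.encode("ascii").ljust(12, b"\x00")
    # Big-endian packing makes byte order match numeric timestamp order.
    return prefix + struct.pack(">Q", epoch_seconds)

def scan(sorted_keys, meter_id, start, stop):
    """Return keys for one meter with start <= timestamp < stop."""
    lo = bisect.bisect_left(sorted_keys, row_key(meter_id, start))
    hi = bisect.bisect_left(sorted_keys, row_key(meter_id, stop))
    return sorted_keys[lo:hi]

# Three hourly readings for each of two made-up meters, held in key order.
keys = sorted(row_key(m, t) for m in ("meter-A", "meter-B") for t in (0, 3600, 7200))

# One meter's time range is a contiguous slice of the sorted keys.
hits = scan(keys, "meter-A", 0, 7200)
```

With this layout, a query by meter and time range is cheap, while a query by time across all meters would have to touch every meter's key range. Choosing a different key order reverses that trade-off.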

Page 19:

What sucked

HA

Failover

Snapshots

Page 20:

What sucked

No secondary index

Aggregation is slow (Rollup/OLAP)

Poor Client Performance
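Without a built-in secondary index, a common workaround is for the application to maintain a second table that maps an indexed value back to primary row keys. The sketch below shows that pattern under stated assumptions. Plain Python dictionaries stand in for the two tables, and the `put` and `rows_in_zip` helpers, the zip-code attribute, and all values are hypothetical. It also shows why rollups are costly: the aggregate is computed on the client by fetching every matching row.

```python
from collections import defaultdict

# Primary "table": row key -> columns. All names and values are made up.
readings = {}
# Index "table": indexed value -> set of primary row keys.
zip_index = defaultdict(set)

def put(row_key: str, zip_code: str, kwh: float) -> None:
    """Write the row and its index entry; the application keeps both in sync."""
    readings[row_key] = {"zip": zip_code, "kwh": kwh}
    zip_index[zip_code].add(row_key)

def rows_in_zip(zip_code: str):
    """Find row keys through the index, then fetch each row from the primary table."""
    return [(rk, readings[rk]) for rk in sorted(zip_index[zip_code])]

put("meter-A", "22201", 12.5)
put("meter-B", "94107", 8.0)
put("meter-C", "22201", 7.5)

matches = rows_in_zip("22201")
# Client-side rollup: every matching row is read before it can be summed.
total_kwh = sum(row["kwh"] for _, row in matches)
```

The cost of this approach is that every write becomes two writes, and nothing in the sketch keeps the two tables consistent if one of them fails.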

Page 21:

It would be better if only …

Developers were not forced to know how the data is stored, indexed, etc.

Page 22:

It would be better if only …

There were nicer APIs and better query languages (SQL?)

Page 23:

It would be better if only …

Version migrations were easy

Hierarchical Tables

Page 24:

It would be better if only …

Real-time tuning

Page 25:

It would be better if only …

Did I mention HA?

Page 26:

In summary

HBase has helped Opower achieve their analytic vision

But they’ve still got a long way to go

HBase still has a long way to go

Page 27:

Questions?

Alex [email protected]

Architect, Drawn to Scale

Strategic Advisor, Opower

