© COPYRIGHT 2013 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED. MarkLogic & Hadoop Presented by: Jim Clark, Senior Director, Product Management
MarkLogic and Hadoop - Strata + Hadoop World 2014

May 21, 2015

A global investment bank needed a solution to satisfy the recent Dodd-Frank and Basel III regulatory requirements. These regulations require companies and other entities to keep all trade and related information available for external auditing for up to 7 years. To comply, the IT organization first tried to deploy a solution built on legacy technology, but found this approach too expensive and inflexible. Instead, the bank deployed MarkLogic Enterprise NoSQL with Tiered Storage and Hadoop to meet its requirements faster and at a lower cost. The bank can now scale out the architecture to accommodate both operational and analytic workloads while satisfying the regulatory requirements.
Transcript
Page 1: MarkLogic and Hadoop - Strata + Hadoop World 2014

© COPYRIGHT 2013 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

MarkLogic & Hadoop Presented by: Jim Clark, Senior Director, Product Management

Page 2: MarkLogic and Hadoop - Strata + Hadoop World 2014

Why should we care?

Page 3: MarkLogic and Hadoop - Strata + Hadoop World 2014

Why is Hadoop important?

Economics of commodity scale out vs. up

Unstructured throughout

More data > clever algorithms

Fault tolerant by design

Momentum and community

Emerging compute and storage infrastructure

Page 4: MarkLogic and Hadoop - Strata + Hadoop World 2014

The Hadoop “Ecosystem”

Page 5: MarkLogic and Hadoop - Strata + Hadoop World 2014

Hadoop

Page 6: MarkLogic and Hadoop - Strata + Hadoop World 2014

Built-in Search

Scalability and Elasticity

ACID Transactions

Government-grade Security

HA/DR

Cloud Deployment

Hadoop-ready

NoSQL. No Compromises.

Page 7: MarkLogic and Hadoop - Strata + Hadoop World 2014

Real-time applications

Hadoop

Real-time applications

Batch analytics

Page 8: MarkLogic and Hadoop - Strata + Hadoop World 2014

Real-time applications

Hadoop

Real-time applications

Batch analytics

Magic?

Page 9: MarkLogic and Hadoop - Strata + Hadoop World 2014

The best database for Hadoop

Hadoop

Real-time applications

Batch analytics

MarkLogic

Page 10: MarkLogic and Hadoop - Strata + Hadoop World 2014

Harnessing Data & Reimagining Applications

Reduce Risk

Manage Compliance

Create New Value from Data

Optimize Operations

Lower TCO / Better IT Economics

Better Decision-making

Page 11: MarkLogic and Hadoop - Strata + Hadoop World 2014

Hadoop

Staging

Analytics

Persistence

Page 12: MarkLogic and Hadoop - Strata + Hadoop World 2014

Batch Analytics with Hadoop

Progressive Enhancement

Raw Data

Application

mlcp MarkLogic

Batch Analytics

Page 13: MarkLogic and Hadoop - Strata + Hadoop World 2014

Direct access

Batch and real-time

No ETL or re-indexing

Consistent migrations

Online in seconds

Open-source reader

MapReduce processing

Page 14: MarkLogic and Hadoop - Strata + Hadoop World 2014

Data Retention and Tiered Storage

Provide multiple Service Level Agreements (SLAs) in a single system

Decrease time and costs of ETL to bring offline content back online

Empower your operations team without imposing burdens on your developers

Page 15: MarkLogic and Hadoop - Strata + Hadoop World 2014

Information lifecycle

Tiers over time: Active, Historical, Archive

Active: SSD, DAS, SAN, Hadoop

Historical: DAS, SAN, NAS, Hadoop, S3

Archive: NAS, Hadoop, S3

Page 16: MarkLogic and Hadoop - Strata + Hadoop World 2014

Active

Local 10K SAS, RAID10

Replication for HA

Merge overhead for updates

20 hosts, 320 shards

4 TB of SSD cache

Total size: 96 TB

Page 17: MarkLogic and Hadoop - Strata + Hadoop World 2014

Compliance

Shared NAS

63 hosts

Effective 8 TB/host

Total size (TB): Active 96, Compliance 504
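The Compliance figure follows directly from the host count and the effective capacity per host given on this slide (63 hosts at 8 TB each). A minimal Python sketch of that arithmetic; the function name `tier_capacity_tb` is hypothetical and used only for illustration:

```python
def tier_capacity_tb(hosts: int, effective_tb_per_host: int) -> int:
    """Effective capacity of a storage tier in TB: hosts x effective TB per host."""
    return hosts * effective_tb_per_host


# Compliance tier figures from the slide: 63 hosts, effective 8 TB/host.
compliance_tb = tier_capacity_tb(hosts=63, effective_tb_per_host=8)
print(compliance_tb)  # prints 504
```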

Page 18: MarkLogic and Hadoop - Strata + Hadoop World 2014

Analytic

Hadoop

120 hosts

Effective 12 TB/host

10 MarkLogic hosts

Total size (TB): Active 96, Compliance 504, Analytic 1,044

Page 19: MarkLogic and Hadoop - Strata + Hadoop World 2014

Active

Compliance

Analytic

Online migration

TB

Page 20: MarkLogic and Hadoop - Strata + Hadoop World 2014

Total Size (TB): 96, 504, 1,044

Total Cost ($000): 592, 2,066, 2,080

Effective Unit Cost ($/GB): Operational $25, Compliance $4, Analytic $1.50
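As a rough check on the Compliance unit cost, dividing total cost by total size gives a figure close to the $4/GB shown. This sketch assumes that the 504 TB size and the $2,066K cost both belong to the Compliance tier (the transcript does not label the columns, so that pairing is an assumption) and that 1 TB = 1,000 GB. The function name `unit_cost_per_gb` is hypothetical:

```python
def unit_cost_per_gb(total_cost_thousands: float, total_size_tb: float) -> float:
    """Dollars per GB, assuming cost is given in $000 and 1 TB = 1,000 GB."""
    dollars = total_cost_thousands * 1_000
    gigabytes = total_size_tb * 1_000
    return dollars / gigabytes


# Assumed pairing: Compliance tier = 504 TB at a total cost of $2,066K.
compliance = unit_cost_per_gb(total_cost_thousands=2_066, total_size_tb=504)
print(f"${compliance:.2f}/GB")  # prints "$4.10/GB"
```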

Page 21: MarkLogic and Hadoop - Strata + Hadoop World 2014

MarkLogic makes Hadoop better

Complementary Capabilities

MarkLogic: On-line applications, Decision Making, Real-time, Distributed Indexes

Hadoop: Offline analytics, Model-Building, Long-Haul Batch, Distributed file system

Page 22: MarkLogic and Hadoop - Strata + Hadoop World 2014

Tiered Storage

Bitemporal

Semantics

Alerting

Elasticity

Differentiated Hadoop Use Cases

Geospatial

Composable Queries & Powerful Search

More Features. No Compromises!

Page 23: MarkLogic and Hadoop - Strata + Hadoop World 2014

For additional information we have resources at www.marklogic.com

Contact me directly [email protected]

THANK YOU!!