Top Banner
turnkey data for healthcare heroes
10

John Snow Labs - Data Operations 2016

Apr 14, 2017

Download

Documents

Troy Martin
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: John Snow Labs - Data Operations 2016

turnkey data for healthcare heroes

Page 2: John Snow Labs - Data Operations 2016

2

data science today

Page 3: John Snow Labs - Data Operations 2016

3

problem areas

domain expertisePublic & proprietary datasets are spread

across many catalogs, not all online, so

finding the right dataset is time consuming

privacy & complianceDatasets have different owners, so

complying with multiple licenses, attribution

& reporting terms is an ongoing burden

data engineeringFormatting, optimizing and loading data into

your big data or data science platform of

choice requires substantial effort

data qualityEach dataset has different errors, missing

values, outliers, gaps, flurries, biases, typos –

requiring substantial manual effort to clean

data evolutionDatasets are updated on different schedules,

creating an operational burden to keep them

up to date

data integrationDatasets from different sources give different

meaning & assumptions to similarly named

concepts, making joins semantically wrong

Page 4: John Snow Labs - Data Operations 2016

4

DataOps defined

domain expertise◦ Find or create the right data sets

◦ Enrich by experts or by joining data sources

◦ Access to proprietary or hard-to-find data

privacy & compliance◦ Data license compliance support

◦ Ongoing reporting, attribution, disclosures

◦ Data monetization & secure licensing

data engineering◦ Format data optimally for target platform

◦ Automatically load the data & metadata

◦ Auto-update governance & lineage catalogs

data quality◦ Automated cleansing & validation rules

◦ Curated and auto-validated metadata

◦ Quality scores and beyond: Find outliers,

gaps, flurries, biases & provenance issues

data evolution◦ Track changes in all source data

◦ Deliver clean, versioned updates daily

◦ Support overwrite-on-update

◦ Mappings for terminology changes

data integration◦ Semantic inter-operability

◦ Unified type system & constraints

◦ Unified metadata specification

◦ Automated mappings to data platforms

Page 5: John Snow Labs - Data Operations 2016

5

how it works

tell us what you’re buildingWe have clinicians & data science experts who

speak your language. Just explain your goal,

your platform and what help you need.

we’ll prepare the dataWe will research, curate, clean, license,

format, load, update & document all the

datasets your project requires.

so you can go build itFocus on data science, and leave data

operations to us. We take care of updates,

integration, compliance and support you.

I want to predict patients at risk

for chronic kidney disease

I want to automatically generate

ICD-10 codes from clinical notes

I want to auto-recommend diets

that match patients’ treatment plan

I want to monitor and alert on

shifts in drug pricing & shortages

Page 6: John Snow Labs - Data Operations 2016

Continuously tested with the latest big data

& data science platform for one-step load

Ready to load

Up to 100x speedup on Hadoop & Spark

clusters thanks to Parquet serialization

Optimized

It’s as if all your health data came from one

clean source. Somewhere, pigs are flying.

Inter-operable

Get updates as they happen, not worrying

worry about broken schemas or identifiers.

Always up to date

Turnkey data

Page 7: John Snow Labs - Data Operations 2016

Clean30+ automated validation rule sets run

on the data and metadata to ensure

correct, complete, same representation

– including units, currencies, locations,

timestamps, dates and missing values

Problem specificData provenance (sampling method,

data collection methodology, publisher,

conflicts of interest, freshness, gaps)

documented and verified by a domain

expert against your project

CompliantKnow in advance you have the right

data license for your business model,

geographic target and team. Remain

proactively compliant with reporting,

audits, attribution and privacy terms.

Quality data

Page 8: John Snow Labs - Data Operations 2016

8

domain expertise

88%

On our team

% MSc or MA

36%

On our team

% MD or PhD

PharmaClinical Revenue

Cycle

Public

HealthCyber

Page 9: John Snow Labs - Data Operations 2016

9

let’s connect

address

16192 Coastal Highway

Lewes, DE 19958, USA

( +1 (302) 786-5227

online

www.JohnSnowLabs.com

* [email protected]

twitter.com/johnsnowlabs

company/johnsnowlabs

Page 10: John Snow Labs - Data Operations 2016

thank you.

© 2016 John Snow Labs Inc. All rights reserved. The John Snow Labs logo is a trademarks of John Snow Labs Inc. The included information is for informational purposes only and represents the current

view of John Snow Labs as of the date of this presentation. Since John Snow Labs must respond to changing market conditions, it should not be interpreted to be a commitment on its part, and John Snow

Labs cannot guarantee the accuracy of any information provided after the date of this presentation. John Snow Labs makes no warranties, express or statutory, as to the information in this presentation.