Top Banner
25

Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Apr 14, 2017

Download

Technology

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics
Page 2: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Precision Medicine

Page 3: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

We have a lot of data

and don’t know what to do

with it yet... medicine

Page 4: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics
Page 5: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Precision medicine?

Books you don’t want to see at your doctor’s office.

Page 6: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Not quite there yet...

Page 7: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Are we there yet?

Page 8: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

We have the technology...

Page 9: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Illumina

28 Billion market cap

Page 10: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

More data!

Page 11: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Even more data!

Page 12: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics
Page 13: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

The horrawful truth!

Page 14: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Good luck with that...

Page 15: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Known unknowns

20 Billion new variants

will be observed in 5yrs

150,000,000

VARIANTS OBSERVED

2015

VARIANTS WE UNDERSTAND

Page 16: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Challenge accepted!

Page 17: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

BIOINFORMATICS EXPERT

Rare disease go-to-guy

Center for Rare Jewish Genetic DisordersBrooklyn, NY

Page 18: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Variants:

Diagnosis:Family:

Hospital:

UnclassifiedUnknown

UnsatisfiedJob complete

OUTCOMES

Page 19: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

ONE YEAR LATER

Different familyDifferent hospital

Same story

Page 20: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

ClinVar

The goverment’s solution.Yet another FTP site.

Page 21: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Submitting to ClinVar

Super painful process.

You’ll never want to submit again.

Page 22: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

Data infrastructure for genomics

CLINICAL REPORTDNA

MiSeq

Page 23: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

SolveBio Beta

solve.bio/signup

Page 24: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

ClinVar on SolveBio

Dataset.retrieve('ClinVar/3.1.0-2015-01-13/Variants').query()

Page 25: Mark Kaganovich, SolveBio // Data Infrastructure for Genomics

p Variant Explorer

GRCh37:chr7:117199644-117199647>ADate Generated - 2012 / 12 / 08 12:01:45PM EST

Rare VariantCLINICAL EVIDENCE

Reported Pathogenic

F F

POPULATION GENETICS

<1% GMAF

EFFECT PREDICTION

Inframe deletion

VARIANT IDENTIFICATION

7

CHR

Deletion

TYPE

3bp

SIZE

117,199,647117,199,644

START STOP

ATCT A

REF ALT

NG_016465.3:g.98809_98811delCTTNC_000007.13:g.117199646_117199648delCTTNC_000007.14:g.117559592_117559594delCTTNG_016465.1:g.84630_84632delCTT

CODING DNA PROTEIN GENOMIC

NM_000492.3:c.1521_1523delCTTXM_006715842.1:c.1845_1847delCTT

NP_000483.3:p.Phe508delNP_000483.3:p.Phe508delPheXP_006715905.1:p.Phe616del

HGVS NM_000492.3:c.1521_1523delCTT

117,199,667

117,199,644

117,199,647

117,199,624

3’ ALIGNMENT5’ ALIGNMENT

Better way to explore the genome