Top Banner
Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute, Yonsei University College of Medicine
51

Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Jan 17, 2016

Download

Documents

Ophelia Manning
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Next-generation sequencing:from basics to future diagnostics

PART I: NGS technologies and standard workflow

Sangwoo Kim, Ph.D.Assistant Professor,

Severance Biomedical Research Institute, Yonsei University College of Medicine

Page 2: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Overview

• PART I: NGS technologies and standard workflow– Next generation sequencing

• History and technology

– Data and its meaning; process workflow– Discussion

• PART II: NGS Analysis to find variants– NGS analysis to find variants

• Single nucleotide variants (SNVs)• Copy number variations (CNVs)• Structural variations (SVs)

• PART III: NGS application to diagnostics – NGS in genomic medicine– Potential application to forensic science

Page 3: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

BACKGROUND

Conventional variant callingVariant calling in minor subgroups

Page 4: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 5: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

6.7% of Japanese patients with NSCLC harbor a fusion of EML4 with the intracellular kinase do-main of ALK

Page 6: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

PF-2341066 (Crizotinib) | cMet/ALK inhibitorPF-2341066 (Crizotinib)

Page 7: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

57% response rate, 27% stable disease

Page 8: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

“The FDA approved the Pfizer drug in 2011 based on 250 patients, four years after the ALK-mutation link was discovered. That is lightning speed in an industry ac-customed to spending a decade with thou-sands of test subjects to get drug ap-proval.”

Page 9: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

McCarthy et al, 2013 Sci Transl Med.

Genomic medicine is a reality

Page 10: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

The first breakthrough

Began in 1990. Consortium comprised in U.S, U.K, France, Australia, Japan etc.“Rough draft” in 2000“Complete genome” published in 2003

13 years,$3 billion dollars.

The Human Genome Project (1990~2003)

Page 11: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

The second breakthrough

Metzker et al, Nat Rev Genet, 2010

Massively Parallel Sequencing (a.k.a. Next-generation sequenc-

ing)

Illumina HiSeq2500

5500 SOLiD sys-tem

Ion Torrent PGM

via spatially separated, clonally amplified DNA templates or single DNA molecules

Page 12: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 13: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 14: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

• Launched in 2008• Sequencing of 1092 individual genomes was announced in

2012• Great repository for population genomics

Page 15: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

• Inaugural publication in 2009• Aims to assemble a genomic zoo (10,000 vertebrate

species)

Page 16: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

• Project announced in 2013, aiming to accomplish in 5 years.• To identify cancer genes (regarding heterogeneity) and genetics of

rare diseases

Page 17: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 18: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 19: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 20: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 21: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 22: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 23: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.
Page 24: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Overwhelmed by data

Alex Sanchez, Introduction to NGS data analysis, 2012

“The challenges turns from data generation into data analysis!”

Page 25: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Overwhelmed by data

Alex Sanchez, Introduction to NGS data analysis, 2012

Elizabeth Pennsini , Science 2011

Page 26: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Overwhelmed by data

Alex Sanchez, Introduction to NGS data analysis, 2012

Elizabeth Pennsini , Science 2011

…“Within a few years, Ponting predicts, analy-sis, not sequencing, will be the main ex-pense hurdle to many genome projects. And that’s assuming there’s someone who can do it; bioinformaticists are in short supply every-where.”...

Page 27: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

From data-poor to data rich

“ 과거의 ‘ classical’ bioinformatics 는 서열 상동성분석 , 정렬 , 재구성등에 대한 알고리즘이 주를 이루었습니다 . 하지만 고도로 병렬화된 대용량 생명정보는 단순 분석을 넘어서는 통합과 해석을 요구하기 시작했습니다 .”

“ 오늘날 데이터는 도처에서 생성됩니다 . 이제 데이터는 ‘그저 생성되기 마련’인 시대입니다 .”

- Prof. Ju Han Kim, SNU Conference on Biomedical Infor-matics

Page 28: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

From data-poor to data rich env.

“ 과거의 ‘ classical’ bioinformatics 는 서열 상동성분석 , 정렬 , 재구성등에 대한 알고리즘이 주를 이루었습니다 . 하지만 고도로 병렬화된 대용량 생명정보는 단순 분석을 넘어서는 통합과 해석을 요구하기 시작했습니다 .”

“ 오늘날 데이터는 도처에서 생성됩니다 . 이제 데이터는 ‘그저 생성되기 마련’인 시대입니다 .”

- Prof. Ju Han Kim, SNU Conference on Biomedical Informatics

Prof. Atul Butte, Stanford Univ.

Hypothesis driven data → Data driven hypothe-sis

Page 29: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

NEXT GENERATION SE-QUENCING

Conventional variant callingVariant calling in minor subgroups

Page 30: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Traditional Sequencing

1. Genomic DNA is fragmented, then cloned to a plasmid vector and used to transform E. coli

2. For each sequencing reaction, a single bacterial colony is picked and plasmid DNA isolated

3. Each cycle sequencing reaction takes place within a microliter-scale volume

Page 31: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Sanger Sequencing

Page 32: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Next Generation Sequencing

• No cloning– DNA to be sequenced is used to construct a library of

fragments that have synthetic DNAs (adapters) added covalently to each fragment end by use of DNA ligase

• Amplification can be done in parallel– Library fragments are amplified in situ on a solid surface

• Sequencing can be done in parallel (in 3 it-erative steps)– a nucleotide addition step– a detection step– a wash step

Page 33: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Illumina Sequencing

Page 34: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Illumina Sequencing

Page 35: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Illumina Sequencing

Page 36: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Illumina Sequencing

https://www.youtube.com/watch?v=HMyCqWhwB8E

Page 37: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Ion Torrent Sequencing

1. DNA capture on beads2. Single bead in a well3. Attach one nucleotide (A/T/G/C)

at one time4. Detect pH change

1. Measure the level of change for homopolymer detection

Page 38: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Ion Torrent Sequencing

Page 39: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Ion Torrent Sequencing

Page 40: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Ion Torrent Sequencing

Page 41: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Pacbio SMRT sequencing

zero-mode waveguide (ZMW) http://www.pacificbiosciences.com/products/smrt-technology/

Page 42: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Nanopore sequencing

https://www.youtube.com/watch?v=3UHw22hBpAk

Page 43: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Comparison

Page 44: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

NGS DATA AND PROCESS-ING OVERVIEW

Conventional variant callingVariant calling in minor subgroups

Page 45: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

FASTA format

A format for DNA (or protein) se-quence

Page 46: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

FASTQ format (NGS raw data)

one read

se-quencequal-ity

A format for NGS read (FASTQ + qual-ity)

Page 47: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Mapping back to genome

TAACACCTGGGAAATTCATCACAAAAAGATCTTAGCCTAGGCACATTGTCATTAGGTTATC-CAAAGTTAAGACAAAGGAAAGAATCTTAAGAGCTGTGAGA

Where is this sequence in human genome?

Page 48: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Genome Informatics I (2015 Spring)

Quality

• Each basecall (a call for nucleotide – ‘A’,’T’,’C’,’G’) has its own quality– quality is a confidence of the machine

Page 49: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Phred scale [email protected] D4LHBFN1:204:D1B2UACXX:6:1101:1156:1996 length=101NCTCTCACCGAGCTCCACGAACGATAAGGGAATCAGTCTTAAAAGAGCCGCGAGTTACAGGCACACCTGAGAGAAAGAGATGTTTG-TATTCACCTTAGAAC+SRR1798798.1 D4LHBFN1:204:D1B2UACXX:6:1101:1156:1996 length=101#1:BDDDDF?FF@B>:ACFIBCGB3BF@C<?F9?DFBFCFEBFEFIFEIFFFDC>@ABBBB?BBBBBBBB?@:?AA@B@?(:4:>?<AB@:B@@B>>ABBB

Q = -10log10(e)

Probability of the base call being wrong10%, 1%, 0.1%,

0.01%...

10, 20, 30, 40…Quality score

+33

+,5,?,I…

ASCII code table

Page 50: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

Kim S and Paik S, in preparation

control

sequenc-ing

quality control

short read alignment (BAM files)

sequenc-ingraw reads

(FASTQ files)

germ-line mutation somatic mutation

copy numbervariation (CNV)

structuralvariation (SV)

A. Data Genera-tion

B. Variant Find-ing

C. Variant Anal-ysis

xenogeneic sequence

43%0%

31% recurrence analysis

GKRRAGGGKRRAV*Gvariant impact prediction

mutation filtration/selection

tumor heterogeneity inference

disease

Box 1. Sequencing types and platforms. Depending on the sequencing purpose, various platforms can be considered for optimiza-tion.Whole genome sequencing (WGS) allows

an inspection of all genomic areas and is applicable for CNV and SV analysis. Whole exome sequencing (WES) only in-terrogates coding regions (1~2% of the genome) with a less cost and throughput. WGS and WES are frequently used for novel causative variant discovery and control sample sequencing is generally mandatory. When a limited regions are to be tested (as in a diagnosis kit), a set of targeted genes are amplified and fed for sequencing (targeted/ panel sequencing). For this case, control is usually omitted when the target sites (hotspots) are clear.

D. Validation and functional assessment

variant confirmation

pathway analysis

functional study

Page 51: Next-generation sequencing: from basics to future diagnostics PART I: NGS technologies and standard workflow Sangwoo Kim, Ph.D. Assistant Professor, Severance.

DISCUSSION

Conventional variant callingVariant calling in minor subgroups