Top Banner
Some new sequencing technologies
32
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Some new sequencing technologies. Molecular Inversion Probes.

Some new sequencing technologies

Page 2: Some new sequencing technologies. Molecular Inversion Probes.

Molecular Inversion Probes

Page 3: Some new sequencing technologies. Molecular Inversion Probes.

Illumina Genotype Arrays

Page 4: Some new sequencing technologies. Molecular Inversion Probes.

Single Molecule Array for Genotyping—Solexa

Page 5: Some new sequencing technologies. Molecular Inversion Probes.

Nanopore Sequencing

http://www.mcb.harvard.edu/branton/index.htm

Page 6: Some new sequencing technologies. Molecular Inversion Probes.

Pyrosequencing on a chip

Mostafa Ronaghi, Stanford Genome Technologies Center

454 Life Sciences

Page 7: Some new sequencing technologies. Molecular Inversion Probes.

Polony Sequencing

Page 8: Some new sequencing technologies. Molecular Inversion Probes.

Technologies available today

• Illumina 550,000 SNP array: $300-500 in bulk

• 454 200 bp reads, 100 Mbp total sequence in 1 run, $8K 500bp reads in much higher throughput coming soon

• Solexa 1Gbp of sequence coming in paired 35 bp reads 1 day, approx $10K / run

Page 9: Some new sequencing technologies. Molecular Inversion Probes.

Short read sequencing protocol

• Random, high-coverage clone library (CovG = 7 – 10x)

• Low-coverage of clone by reads (CovR = 1 – 2x)

1234 1235

FRAGMENT

genome

clones

AMPLIFY & READ

12351234

reads

CovG

CovR

Page 10: Some new sequencing technologies. Molecular Inversion Probes.

Short read sequencing protocol

RANDOMLY SELECT 200,000 FRAGMENTS

CLONE

FRAGMENT AND SELECT 150KB SEGMENTS

FRAGMENT

A C G A

bead attachment primer

adapter

clone id tag

LIGATE ADAPTERS

166 clone batch

CLONE ON BEADS BY PCR EMULSION

ACGATGATCGATGATTAC...

TGCTCAGACTTAGCTATT...

CAATTTATATCAGAGACA...

ACGAAATCGAGAGCAAGA...

clone id tag

SEQUENCE 250,000 READS ON PLATE

sequenceread

1200 plates

ASSEMBLY

target genome

“clones”

Page 11: Some new sequencing technologies. Molecular Inversion Probes.

Ordering clones into clone contigs

293

1001

1234

882

7

94

clone graph

NODE CONTRACTION

clone contig1234

293

100194

7882

Page 12: Some new sequencing technologies. Molecular Inversion Probes.

Contig assembly

CONSTRUCT READ SETS

Euler assembler

intersection read set

subtraction read set

Page 13: Some new sequencing technologies. Molecular Inversion Probes.

Contig assemblyCONTIG

ASSEMBLY 1: READ SETS

CONSTRUCT READ SETS

Euler assembler

CONSTRUCT CONTIG SETS

CONTIG ASSEMBLY 2: CONTIG SETS

Euler assembler

CONTIG ASSEMBLY 3:

CLONE CONTIGS

assembly

intersection read set

subtraction read set

contig set

Page 14: Some new sequencing technologies. Molecular Inversion Probes.

Assembly quality

Sequence CoverageContig N50 (Kb)

Base quality (Q)

Misassemblies (#/Mb)

Small indels (#/Mb)

D. Melanogaster(118 Mb)

94.2% 160.2 38.4 2.5 1.6

Human chr21(34 Mb)

97.5% 79.0 35.6 1.9 2.3

Human chr11(131 Mb)

96.3% 57.4 34.4 2.8 1.9

Human chr1(223 Mb)

96.2% 63.0 34.4 3.0 2.0

Read length = 200 bp, Error rate = 1%, Net coverage = 20.0x

Page 15: Some new sequencing technologies. Molecular Inversion Probes.

Multiple Sequence Alignment

Page 16: Some new sequencing technologies. Molecular Inversion Probes.

Evolution at the DNA level

…ACGGTGCAGTTACCA…

…AC----CAGTCCACCA…

Mutation

SEQUENCE EDITS

REARRANGEMENTS

Deletion

InversionTranslocationDuplication

Page 17: Some new sequencing technologies. Molecular Inversion Probes.

Evolutionary Rates

OK

OK

OK

X

X

Still OK?

next generation

Page 18: Some new sequencing technologies. Molecular Inversion Probes.
Page 19: Some new sequencing technologies. Molecular Inversion Probes.

Genome Evolution – Macro Events

• Inversions• Deletions• Duplications

Page 20: Some new sequencing technologies. Molecular Inversion Probes.

Synteny maps

Comparison of human and mouse

Page 21: Some new sequencing technologies. Molecular Inversion Probes.

Synteny maps

Page 22: Some new sequencing technologies. Molecular Inversion Probes.

Orthology, Paralogy, Inparalogs, Outparalogs

Page 23: Some new sequencing technologies. Molecular Inversion Probes.

Synteny maps

Page 24: Some new sequencing technologies. Molecular Inversion Probes.

Synteny maps

Page 25: Some new sequencing technologies. Molecular Inversion Probes.

Building synteny maps

Recommended local aligners• BLASTZ

Most accurate, especially for genes Chains local alignments

• WU-BLAST Good tradeoff of efficiency/sensitivity Best command-line options

• BLAT Fast, less sensitive Good for

• comparing very similar sequences • finding rough homology map

Page 26: Some new sequencing technologies. Molecular Inversion Probes.

Index-based local alignment

Dictionary:

All words of length k (~10)

Alignment initiated between words of alignment score T

(typically T = k)

Alignment:

Ungapped extensions until score

below statistical threshold

Output:

All local alignments with score

> statistical threshold

……

……

query

DB

query

scan

Question: Using an idea from overlap detection, better way to find all local alignments between two genomes?

Page 27: Some new sequencing technologies. Molecular Inversion Probes.

Local Alignments

Page 28: Some new sequencing technologies. Molecular Inversion Probes.

After chaining

Page 29: Some new sequencing technologies. Molecular Inversion Probes.

Chaining local alignments

1. Find local alignments

2. Chain -O(NlogN) L.I.S.

3. Restricted DP

Page 30: Some new sequencing technologies. Molecular Inversion Probes.

Progressive Alignment

• When evolutionary tree is known:

Align closest first, in the order of the tree In each step, align two sequences x, y, or profiles px, py, to generate a new

alignment with associated profile presult

Weighted version: Tree edges have weights, proportional to the divergence in that edge New profile is a weighted average of two old profiles

x

w

y

z

Example

Profile: (A, C, G, T, -)px = (0.8, 0.2, 0, 0, 0)py = (0.6, 0, 0, 0, 0.4)

s(px, py) = 0.8*0.6*s(A, A) + 0.2*0.6*s(C, A) + 0.8*0.4*s(A, -) + 0.2*0.4*s(C, -)

Result: pxy = (0.7, 0.1, 0, 0, 0.2)

s(px, -) = 0.8*1.0*s(A, -) + 0.2*1.0*s(C, -)

Result: px- = (0.4, 0.1, 0, 0, 0.5)

Page 31: Some new sequencing technologies. Molecular Inversion Probes.

Threaded Blockset Aligner

Human–Cow

HMR – CDRestricted AreaProfile Alignment

Page 32: Some new sequencing technologies. Molecular Inversion Probes.

Reconstructing the Ancestral Mammalian Genome

Human: C

Baboon: C

Cat: C

Dog: G

C

C or G

G