This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
CZ5225: Modeling and Simulation in BiologyCZ5225: Modeling and Simulation in Biology
Lecture 9: Next Generation SequencingLecture 9: Next Generation Sequencing
Note:of course, we don’tknow the “color” ofthese nodes
Reads that comefrom two regions ofthe genome (blueand red) that containthe same repeat
We want to merge reads up to potential repeat boundaries
repeat region
Unique Contig
Overcollapsed Contig
Genome Assembly: Merge Reads into ContigsGenome Assembly: Merge Reads into Contigs
• Ignore non-maximal reads• Merge only maximal reads into contigs
repeat region
Genome Assembly: Merge Reads into ContigsGenome Assembly: Merge Reads into Contigs
Read Length and PairingRead Length and Pairing
• Short reads are problematic, because short sequences do not map uniquely to the genome.
• Solution #1: Get longer reads.• Solution #2: Get paired reads.
ACTTAAGGCTGACTAGC TCGTACCGATATGCTG
Third Generation SequencingThird Generation Sequencing
• Nanopore sequencing– Nucleic acids driven through a nanopore.– Differences in conductance of pore provide readout.
• Real-time monitoring of PCR activity– Read-out by fluorescence resonance energy transfer
between polymerase and nucleotides or– Waveguides allow direct observation of polymerase
and fluorescently labeled nucleotides
Nanopore sequencingNanopore sequencing
Deamer, DW, and Akeson, M. ‘Nanopores and Nucleic Acids: prospects for ultrarapid sequencing’. Tibtech.Meller, A J. Phys.: Condens. Matter 15 (2003) R581–R607
Earlier Findings – Transmembrane voltage drives
RNA through the protein nanopore α-hemolysin.
– Passage of RNA through the pore reduces the ionic current
– Blockage current is modulated by base identity
• PolyC – iblock = 5 pA, • PolyA – Iblock = 20 pA
– Translocation rate depends on base identity
• PolyC - v = 3 µs/base• PolyA – v = 20 µs/base
Automated Rapid DNA Sequencing with NanoporesAutomated Rapid DNA Sequencing with Nanopores
Church, George M. ‘Genomes for All’ Scientific American, Jan 2006, pp. 47-54.
Sequencing will require a better understanding of the physics of the interaction between DNA and protein pore during translocation.
Modeling of ssDNA TranslocationModeling of ssDNA Translocation
• F = zeVa– ze = effective charge / base– V = applied voltage– a = base-to-base distance
• F = (1)(1.6 x10-19)(.125)(.4 x 10-9) ~ 5kbT / a ~ 44 pN
• Basis for modeling– P(forward or backward) ~ exp(Fa/kBT)– Averaged over all monomers
• Model Assumptions: – Length of polymer = L >> pore length – With short polymers, membrane has 0 thickness
D. K. Lubensky and D. R. Nelson, Biophys. J. 77, 1824 (1999).
F
Experiment of ssDNA TranslocationExperiment of ssDNA Translocation
Conditions– Temp: 2oC
– Electrolyte solution– 1M KCl, 1 mM Tris-EDTA buffer,
pH 8.5
– Polymer• Polydeoxyadenylic acid
(poly(dA))• Length: 4 – 100 bases
– Driving voltage: 70-300 mV
Meller, A., L. Nivon, D. Branton, 2001. Voltage-Driven DNA
Translocations Through a Nanopore, Phys. Rev. Lett., 86,3435-39
2323
Sequence Alignment as a Mathematical Sequence Alignment as a Mathematical Problem: Problem: