Top Banner
Archer™ FusionPlex™ NGS Data Analysis, Algorithms and Methods: Hypothesis-Free Gene Fusion Detection at analysis.archerdx.com Aaron Berlin, Erik Reckase, Joseph Heimiller, Doug Wendel, Michael Banos, Jeremy Widmann, Thon DeBoer, Brian Kudlow, Brady Culver, Jason Myers, Josh Stahl and Abel Licon ArcherDx, Inc., Boulder, CO 9 Minimize Sequencing Error with Fusion Consensus Each read has an 8-bp molecular barcode appended to it. Knowing that each read comes from the same molecule, sequencing error can be minimized by using a SNP caller to create a consensus read Each compressed bin then contributes to a final fusion consensus. By running a SNP caller, we can remove sequencing artifacts of small molecular bins. Oncogenes can fuse to multiple partners Designing primers to all possible pairs is not scalable Archer Anchored Multiplex PCR (AMP™) technology requires only a single primer on the target gene ANY possible fusion partner can now be detected Discovered Novel Fusions Using Hypothesis-Free Fusion Detection Partial alignments to the targeted gene are fusion candidates Align partial alignments to human genome to determine fusion partner (i.e., hypothesis-free) Annotate transcripts with BLAST ? Align and Annotate Fusion Partners Annotate Known Fusion with Quiver™ database The Quiver database is the aggregation of available gene fusion resources (COSMIC 1 , Mitelman 2 , ChimerDB 3 , dbCRID 4 , TICdb 5 and ChiTars 6 ) and internally curated data Quiver increases confidence in a fusion call by associating it with published evidence Determine Protein Translation of Fusion Transcript Compute all possible fusion transcript proteins and determine reading frame User-Friendly Parallel Web Application Download Virtual Machine for local and completely private install Upload FASTQ files and start the analysis Run multiple samples in parallel Visualize fusion consensus transcript in a single view across chromosomes using JBrowse Visualize Fusion Transcript in JBrowse 1. Sanger Institute Catalogue Of Somatic Mutations In Cancer web site, http://www.sanger.ac.uk/cosmic Bamford et al (2004) The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website. Br J Cancer, 91,355-358. 2. Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer (2014). Mitelman F, Johansson B and Mertens F (Eds.) 3. ChimerDB 2.0-a knowledgebase for fusion genes updated. Nucleic Acids Res. 38(Database issue), D81-85. (2010 Jan) 4. dbCRID: a database of chromosomal rearrangements in human diseases. Nucleic Acids Res. 2011 Jan;39(Database issue):D895- 900. doi: 10.1093/nar/gkq1038 5. TICdb: a collection of gene-mapped translocation breakpoints in cancer. BMC Genomics 2007, 8:33 doi:10.1186/1471-2164-8- 33 6. ChiTaRS: a database of human, mouse and fruit fly chimeric transcripts and RNA-sequencing data. Nucleic Acids Research 41 (D1): D142-D151. doi:10.1093/nar/gks1041 By aligning the primer to the fusion partner (LTK), we can remove false fusions that have high homology to the target gene (ALK) Remove Mispriming Artifacts (False Fusions) Due to homologous regions to the target gene, it is possible for primers to misprime to unintended targets
1

See Abel's Poster

Jan 09, 2017

Download

Documents

phungminh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: See Abel's Poster

Archer™ FusionPlex™ NGS Data Analysis, Algorithms and Methods:Hypothesis-Free Gene Fusion Detection at analysis.archerdx.comAaron Berlin, Erik Reckase, Joseph Heimiller, Doug Wendel, Michael Banos, Jeremy Widmann, Thon DeBoer, Brian Kudlow, Brady Culver, Jason Myers, Josh Stahl and Abel Licon ArcherDx, Inc., Boulder, CO

9

Minimize Sequencing Error with Fusion Consensus

• Each read has an 8-bp molecular barcode appended to it. Knowing that each read comes from the same molecule, sequencing error can be minimized by using a SNP caller to create a consensus read

• Each compressed bin then contributes to a final fusion consensus. By running a SNP caller, we can remove sequencing artifacts of small molecular bins.

• Oncogenes can fuse to multiple partners

• Designing primers to all possible pairs is not scalable

• Archer Anchored Multiplex PCR (AMP™) technology requires only a single primer on the target gene

• ANY possible fusion partner can now be detected

Discovered Novel Fusions Using Hypothesis-Free Fusion Detection

• Partial alignments to the targeted gene are fusion candidates

• Align partial alignments to human genome to determine fusion partner (i.e., hypothesis-free)

• Annotate transcripts with BLAST

?

Align and Annotate Fusion Partners

Annotate Known Fusion with Quiver™ database

• The Quiver database is the aggregation of available gene fusion resources (COSMIC1, Mitelman2, ChimerDB3, dbCRID4, TICdb5 and ChiTars6) and internally curated data

• Quiver increases confidence in a fusion call by associating it with published evidence

Determine Protein Translation of Fusion Transcript

• Compute all possible fusion transcript proteins and determine reading frame

User-Friendly Parallel Web Application

• Download Virtual Machine for local and completely private install

• Upload FASTQ files and start the analysis• Run multiple samples in parallel

• Visualize fusion consensus transcript in a single view across chromosomes using JBrowse

Visualize Fusion Transcript in JBrowse

1. Sanger Institute Catalogue Of Somatic Mutations In Cancer web site, http://www.sanger.ac.uk/cosmic Bamford et al (2004) The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website. Br J Cancer, 91,355-358.

2. Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer (2014). Mitelman F, Johansson B and Mertens F (Eds.)

3. ChimerDB 2.0-a knowledgebase for fusion genes updated. Nucleic Acids Res. 38(Database issue), D81-85. (2010 Jan)4. dbCRID: a database of chromosomal rearrangements in human diseases. Nucleic Acids Res. 2011 Jan;39(Database issue):D895-

900. doi: 10.1093/nar/gkq10385. TICdb: a collection of gene-mapped translocation breakpoints in cancer. BMC Genomics 2007, 8:33 doi:10.1186/1471-2164-8-

336. ChiTaRS: a database of human, mouse and fruit fly chimeric transcripts and RNA-sequencing data. Nucleic Acids Research 41

(D1): D142-D151. doi:10.1093/nar/gks1041

• By aligning the primer to the fusion partner (LTK), we can remove false fusions that have high homology to the target gene (ALK)

Remove Mispriming Artifacts (False Fusions)

Due to homologous regions to the target gene, it is possible for primers to misprime to unintended targets