Top Banner
1 of 19 Genomes with Ensembl Genomes with Ensembl
19

Introduction to genome browsers and Ensembl

Feb 09, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Introduction to genome browsers and Ensembl

1 of 19

Genomes with EnsemblGenomes with Ensembl

Page 2: Introduction to genome browsers and Ensembl

2 of 19

SubjectsSubjectsAn introduction to genome browsers.

What is an Ensembl gene?

How can we extract data from Ensembl?

Page 3: Introduction to genome browsers and Ensembl

3 of 19

Genome BrowsersGenome Browsers

• Ensembl Genome browserhttp://www.ensembl.org

• NCBI Map Viewerhttp://www.ncbi.nlm.nih.gov/mapview/

• UCSC Genome Browserhttp://genome.ucsc.edu

Page 4: Introduction to genome browsers and Ensembl

4 of 19

Ensembl Genome BrowserEnsembl Genome Browser

Protein/ mRNA Sequence Assembly

Ensembl Genes

Page 5: Introduction to genome browsers and Ensembl

5 of 19

NCBI Map ViewerNCBI Map Viewer

Page 6: Introduction to genome browsers and Ensembl

6 of 19

UCSC Genome BrowserUCSC Genome Browser

Page 7: Introduction to genome browsers and Ensembl

7 of 19

To meet a challengeTo meet a challenge……Ensembl’s AIM: To provide annotation for the biological

community that is freely available and of high quality

• Started in 2000

• Joint project between EBI and Sanger

• Funded primarily by the Wellcome Trust, additional funding by EMBL, NIH-NIAID, EU, BBSRC and MRC

Page 8: Introduction to genome browsers and Ensembl

8 of 19

What Distinguishes Ensembl from What Distinguishes Ensembl from the UCSC and NCBI Browsers?the UCSC and NCBI Browsers?

• The gene set. Automatic annotation based on mRNA and protein information.

• Programmatic access via the Perl API (open source)

• BioMart

• Integration with other databases (DAS)

• Comparative analysis (gene trees)

Page 9: Introduction to genome browsers and Ensembl

9 of 19

Ensembl Genes Ensembl Genes –– biological basisbiological basis

• UniProt/Swiss-Prot (manually curated)• UniProt/TrEMBLwww.uniprot.org

• NCBI RefSeq (manually curated)www.ncbi.nlm.nih.gov/RefSeq

All Ensembl transcripts are based on proteins and mRNAs in:

Page 10: Introduction to genome browsers and Ensembl

10 of 19

Ensembl and VEGA/HavanaEnsembl and VEGA/Havana

• Automatic annotation pipeline: Gene building all at once (whole genome) Ensembl

• Manual curation: case-by-case basisVEGA: Vertebrate Genome Annotation Havana

Page 11: Introduction to genome browsers and Ensembl

11 of 19

Genes and Transcripts in EnsemblGenes and Transcripts in Ensembl

• Ensembl known transcripts• Ensembl novel transcripts• Ensembl merged transcripts (Havana)

• EST clusters

• More manual curation (SGD, WormBase, FlyBase)

Page 12: Introduction to genome browsers and Ensembl

12 of 19

What annotation is available?What annotation is available?• Gene/transcript/peptide models (coding and

noncoding (ncRNAs))• IDs in other databases• Mapped cDNAs, peptides, micro array probes,

BAC clones etc.• Other features of the genome:

cytogenetic bands, markers, repeats etc.• Comparative data:

orthologues and paralogues, protein families, whole genome alignments, syntenic regions

• Variation data:SNPs

• Regulatory data:“best guess” set of regulatory elements from ENCODE

• Data from external sources (DAS)

Page 13: Introduction to genome browsers and Ensembl

13 of 19

What species are available What species are available (v52)? Chordate focus.(v52)? Chordate focus.

Page 14: Introduction to genome browsers and Ensembl

14 of 19

How is this information organised?How is this information organised?

• Ensembl Views (Website)

• Ensembl Database (open source)

• BioMart ‘DataMining tool’

Page 15: Introduction to genome browsers and Ensembl

15 of 19

Names in EnsemblNames in Ensembl

• ENSG### Ensembl Gene ID• ENST### Ensembl Transcript ID• ENSP### Ensembl Peptide ID• ENSE### Ensembl Exon ID

• For other species than human a suffix is added:MUS (Mus musculus) for mouse: ENSMUSG### DAR (Danio rerio) for zebrafish: ENSDARG###, etc.

Page 16: Introduction to genome browsers and Ensembl

16 of 19

Gene Structure in EnsemblGene Structure in Ensembl

Calmodulin Human

Calmodulin ChickenNo UTRs

UTRs annotated

Page 17: Introduction to genome browsers and Ensembl

17 of 19

Help and InformationHelp and Information• Comments and questions?

[email protected]

• View our help videos

• Mailing lists:[email protected]@ebi.ac.uk

• Come visit our blog! http://ensembl.blogspot.com/

Page 18: Introduction to genome browsers and Ensembl

18 of 19© Sean T. McHugh (http://www.cambridgeincolour.com)

Cambridge

Page 19: Introduction to genome browsers and Ensembl

19 of 19

Ensembl TeamEnsembl Team

Xosé M Fernández, Bert Overduin, Michael Schuster (QC), Giulietta SpudichOutreach

Ian Dunham, Damian Keefe, Alison Meynert, Dace Ruklisa, Guy Slater, Daniel ZerbinoResearch

Ewan Birney, Richard Durbin, Tim HubbardEnsembl Strategy

Nathan Johnson, Stefan Gräf, Steven WilderFunctional Genomics

Javier Herrero, Kathryn Beal, Benoît Ballester, Stephen Fitzgerald, Leo Gordon, Albert VilellaComparative Genomics

Fiona Cunningham, Yuan Chen Variation

Mario Caccamo, Laura Clark, Jonathan Hinton, Zam Iqbal, Vasudev Kumanduri, IlkkaLappalainenVertebrate Genomics

Bronwen Aken, Julio Banet, Susan Fairley, Jan-Hinnerck Vogel, Simon White, Amonida ZadissaAnalysis and Annotation

Paul Flicek (EBI), Steve Searle (Sanger Institute)Ensembl

Guy Coates, Tim Cutts, Shelley GoddardSystems & Support

Dan Lawson, Martin Hammond, Karyn MegyVectorBase

Kerstin Howe, Britt Reimholz, James TorranceZebrafish

James Smith, Eugene Bragin, Anne Parker, Bethan Pritchard, Steve Trevanion (VEGA)Web Team

Glenn Proctor, Syed Haider, Andrew Jenkinson, Andreas Kähäri, Stephen Keenan, Rhoda Kinsella, Eugene Kulesha, Ian Longden, Daniel Rios Software