Top Banner
VectorBas e VectorBase A Resource Centre for A Resource Centre for Invertebrate Hosts of Human Invertebrate Hosts of Human Pathogens Pathogens Bob MacCallum Bob MacCallum Imperial College London Imperial College London
45

VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

Dec 27, 2015

Download

Documents

Bridget Cannon
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

VectorBase

A Resource Centre for A Resource Centre for Invertebrate Hosts of Human Invertebrate Hosts of Human

PathogensPathogens

Bob MacCallumBob MacCallum

Imperial College LondonImperial College London

Page 2: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Outline

• Introduction to VectorBase

• Two important recent developments:

– Community Annotations

– Gene Expression Data

Page 3: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

What is VectorBase?

• Aim Genomic bioinformatics resource for invertebrate

vectors of human pathogens Data hub for community

• Funding US NIAID (National Institute for Allergy and Infectious

Diseases) via its Bioinformatics Resource Centre (BRC) program

Page 4: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Why VectorBase?

• Sequencing initiatives do not include “after-care”

• Ensembl had no long-term plans for insects

Page 5: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Main VectorBase activities

• www.vectorbase.org:– Browse, search & download genomic data

• Genome annotation– Automatic & manual

• Functional genomics

• Ontologies

• Training/outreach/consultancy

Page 6: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Invertebrate vectorsSpecies Disease Status Funder

Aedes aegypti Yellow feverDengue fever

Complete† NIAID

Anopheles gambiae PEST Malaria Complete† -

Anopheles gambiae M & S form

Malaria Assembled NHGRI

Culex pipiens quinquefasciatus

Lymphatic filariasis

Complete† NIAID

Glossina morsitans morsitans

Sleeping sickness Initiated Wellcome Trust

Ixodes scapularis Lyme disease Draft gene set NIAID

Lutzomyia longipalpis Leishmania Planned NHGRI/Wellcome Trust

Pediculus humanus Typhus Draft gene set NHGRI

Phlebotomus papatasi Leishmania Planned NHGRI/Wellcome Trust

Rhodnius prolixus Chagas disease Initiated NHGRI

Page 7: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Who is VectorBase?

US

UK

GR

Page 8: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Notre Dame

PIsFrank Collins, Dave Severson, Greg Madey, Nora Besansky

Tasks project coordinationcore website developmentcommunity annotation pipelineAedes and Anopheles community reps.

Page 9: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

EBI(European Bioinformatics Institute)

PIEwan Birney

Tasks “automated” genome annotationcomparative genomicsGenbank submissionsgenome browser technology

Page 10: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

IMBB, Crete

PIKitsos Louis

Tasks ontologies for anatomy, insecticide resistance, biological processespopulation genetics

Page 11: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Harvard

PIBill Gelbart

Tasks manual annotation

Page 12: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Imperial College, London

PIsGeorge Christophides, Fotis Kafatos

Tasks functional genomics: gene expression, RNAi phenotypes

Page 13: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

UC Riverside

PIPeter Atkinson

Tasks Culex pipiens

Page 14: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Purdue University

PICatherine Hill

Tasks Ixodes scapularis

Page 15: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

A quick tour of VectorBase

Blast

Genome

browser

Searchengine

BioMart

Downloads

Page 16: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

VectorBase genome browser

Page 17: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

VectorBase genome browser

Page 18: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Genome annotation cycle

Automatic gene build

Assembly

Community annotations

Manual annotations

Other genomes, gene sets

Repeat library (TEs etc)ESTs, cDNAs

Protein domains

Page 19: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Manual annotation

• Flybase team (Kathy Campbell)

• Anopheles 2L completed Sep 2006

• Anopheles 2R completed Sep 2007

• Anopheles X completed Feb 2008

• 875 Culex genes completed July 2008

• Three mosquitoes better than one

Page 20: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Community annotation

• Expertise from around world

• Gene models, symbols, literature, function

• Need system to track contributions

• Incorporated in gene build updates

• Credit sourcesCommunity Annotation Pipeline (CAP)

Page 21: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

CAP: gene model submission

• Gene symbol• Gene description• mRNA sequence• Translation start• Translation stop• Determination method• GO IDs• PubMed IDs

Excel spreadsheet

Page 22: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 23: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

CAP: what happens next

• Transcript aligned to genome

• Gene model constructed

• Reviewed by community representative

Page 24: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 25: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 26: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 27: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 28: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 29: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 30: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

CAP: other annotations

• Publications

• CV/ontology terms

• Free text comment*

(* unmoderated)

Page 31: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 32: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Expression data

• Many microarray technologies

• Many experimental designs

• Large amount of information

• Many ways to do analysis

Page 33: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Microarray repositories

• Widely adopted standard: MIAME

• GEO (NCBI) & ArrayExpress (EBI)

• Repository ≠ Useful data

• Curation backlog at central repositories

• VectorBase data is manageable

• We manage and curate

Page 34: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Microarray pipeline at VB

What Where

Alignments & gene assignments Ensembl-style database

Microarray data, raw & processed BASE

Statistics and web interface VB’s GESOL API

Page 35: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Web interfacePPO*

Page 36: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 37: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Overall picture of

expression

Page 38: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 39: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 40: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Genome browser integration

Page 41: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Help & Documentation

Page 42: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

No time today for…

• Averaging over multiple reporters

• Ambiguous reporters

• List of microarray experiments in VB

• Community microarray data submission

• Expert analysis & collaboration

• Future developments

Page 43: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Page 44: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

VectorBase’s future directions

• More genomes & sequencing

• Population biology, association studies

• More community involvement in genome annotation

• Enhanced functional genomics resources

Page 45: VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.

VectorBaseVectorBase

Acknowledgements

• VB team

• IC PIs

• VB SWG

• NIAID

• Community

• Organisers

• Audience