ArrayExpress A public database for microarray based gene expression data http://www.ebi.ac.uk/microarray/ European Bioinformatics Institute EMBL-EBI Alvis Brazma, Helen Parkinson, Ugis Sarkans, Mohammadreza Shojatalab, Jaak Vilo + team MGED IV, Boston, February 2002
28
Embed
ArrayExpress A public database for microarray based gene expression data European Bioinformatics Institute EMBL-EBI Alvis.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
ArrayExpressA public database for microarray based gene expression datahttp://www.ebi.ac.uk/microarray/
European Bioinformatics Institute
EMBL-EBI
Alvis Brazma, Helen Parkinson, Ugis Sarkans, Mohammadreza Shojatalab, Jaak Vilo + team
MGED IV, Boston, February 2002
ArrayExpress
• Standards:MIAME-compliant• Data model: MAGE-OM• Data input: MAGE-ML, web• Data output: HTML, MAGE-ML,
TAB-delimited, link to Expression Profiler
• Data curation: Team of curators• Data sets: Yeast, human
Tuesday, February 12th, 2002Opened to public
General overview
ArrayExpress
MIAMExpressExpression
Profiler
MAGE-ML
Internet
www
MAGE-ML
ArrayExpress component architecture
Main databaseSQL derived
from MAGE-OM
Data warehousegene-centred
queries
Application serverJava servletsMAGE-OM
Imagesfile server
ArrayExpress
MAGE-ML
Submission/curation
Internet
www
ArrayExpress - features
• MIAME-compliant, MAGE-ML, MAGE-OM
• Can deal with:• raw quantitation data
• processed data
• data transformations
• Independent of:• experimental platforms
• image analysis methods
• data normalization methods
ArrayExpress: details
• Database schema derived from MAGE-OM
• Standard SQL, we use Oracle
• Data loader for MAGE-ML - generated• Web interface (first release 12.2.2002)
• Queries by experiment, array, sample• Browsing
• Object model-based query mechanism, automatic mapping to SQL
Simplified ArrayExpress model
MIAMExpress
• Data annotation and submission tool
• MIAME based web interface
• Experiment, Array, Protocol submissions
• Uses CV/ontology wherever possible
• Creates MAGE-ML files for loading into ArrayExpress
• Species and domain specific pages and ontologies, ontology development
• Life-span of data submissions is long • Curation control, submissions tracking• Interaction with ArrayExpress• Full MAGE-OM, data updating• Usability, flexibility, scalability, platform
independence • User needs, free in-house installation
ArrayExpress curation effort
• User support and help documentation• Submission support for MIAMExpress• Support on ontologies and CVs• Minimize free text, removal of synonyms• MIAME encouragement• Help on MAGE-ML• Goal: to provide high-quality, well-
annotated data to allow automated data analysis
• E-MEXP-234 Experiment 234 viaMIAMExpress
• E-SANG-25 Experiment 25 from Sanger Institute
• A-AFFY-1034Array description 1034 from Affymetrix
• P-LABL-5 Protocol 5 for labeling
Accession numbers
Data in ArrayExpress
• Human data (ironchip) from EMBL
• Yeast data from EMBL• S. pombe data Sanger
Institute
• TIGR array descriptions• Affymetrix chip designs• Direct pipeline from