Top Banner
Databases II Sucheta Tripathy,
28

Databases ii

Apr 15, 2017

Download

Education

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Databases ii

Databases IISucheta Tripathy,

Page 2: Databases ii

Biological databases◦ MetaBase ( A database of Biological databases)◦ http://metadatabase.org/

Bibliographic databases Chemical databases Numerous other databases.

Types of Databases

Page 3: Databases ii

Sequence databases.◦ Nucleotide◦ Protein

Structure Databases. Genome databases. Transcriptome databases Model organism databases.

◦ PlasmoDB, TAIR, FlyBase etc.

Biological Databaseshttp://en.wikipedia.org/wiki/List_of_biological_databases

Page 4: Databases ii

NA and protein databases. Animal and plant databases Ensembl Genome project TIGR Database. Biotechnological databases Database for species identification and

classification Database retrieval and deposition schemes Literature search databases

Topics To be taught

Page 5: Databases ii

Nucleotide Databases

Page 6: Databases ii

Nucleotide Databases (TIGR)

Founded Celera Genomics to fund Shot gun sequencing.Created a synthetic organism called as  Mycoplasma laboratorium

Page 7: Databases ii

Nucleotide Databases

Page 8: Databases ii

Nucleotide Databases (INSDC)

Genbank DDBJ

EBI

Page 9: Databases ii

Data type DDBJ EMBL-EBI NCBI

Next generation reads

Sequence Read Archive

European Nucleotide

Archive (ENA)

Sequence Read Archive

Capillary reads Trace Archive Trace Archive

Annotated sequences DDBJ GenBank

Samples BioSample BioSample

Studies BioProject BioProject

Nucleotide Databases (INSDC)

http://www.insdc.org/documents/feature-table

Page 10: Databases ii

http://asia.ensembl.org/Help/Movie?id=210 http://ensemblgenomes.org/

Ensembl Genome Projectwww.ensembl.org

Page 11: Databases ii

Gbrowse UCSC Genome Browser Vista Browser Ensembl browser Integrated Genome Browser (IGV)

Genome Browsers

Page 12: Databases ii

Encyclopedia of life (www.eol.org ) Education + EOL (http://education.eol.org )

http://indiabiodiversity.org

Plant and Animal Databases

Page 13: Databases ii

Trusted comprehensive information on every species on earth.

Has about 2 million pages and each page catalogues a species.

Community driven.

Encyclopedia Of Life

Page 14: Databases ii
Page 15: Databases ii
Page 16: Databases ii
Page 17: Databases ii

India Bio diversity

17 countries out of XXX contains 70% biodiversity: “Megadiverse”

Page 18: Databases ii

India Bio diversity

Page 19: Databases ii
Page 20: Databases ii

The Bar Code of Life (BOLD Systems)International Barcode of Life Projects

V3 is released: V2 will be maintained till 2012 Dec.

Data Portal; Barcode Cluster; Data Collection

Page 21: Databases ii

Animal Identification◦ COI (cytochrome C oxidase subunit 1)

Fungi Identification◦ (ITS – internal transcribed spacer)

Plant Identification◦ Rbcl (ribulose bisphosphate carboxylase)◦ Mat k (maturase k)

Barcode Of Life

Page 22: Databases ii
Page 23: Databases ii

Barcode of life

http://www.youtube.com/watch?v=ZImiXgU6bCk&feature=related

Page 24: Databases ii

Data Retrieval and deposition schemes

•Genbank•Entrez

•CoreNucleotide•DbEST•dbGSS

•NCBI-eutilities

Page 25: Databases ii

Data Retrieval and deposition schemes

•NCBI-eutilitieshttp://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi?db=pubmed&term=cancer&reldate=60&datetype=edat&retmax=100&usehistory=y

http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi?db=nucleotide&term=biomol+trna[prop]

http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi?db=protein&term=70000:90000[molecular+weight]

http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esummary.fcgi?db=structure&id=19923,12120

Ref: http://www.ncbi.nlm.nih.gov/books/NBK25499/#chapter4.ESearch

Page 26: Databases ii

Data Retrieval and deposition schemes

BankIt : Single or a simple group of sequences: web based

Sequin : Simple to complex submission ; < 10,000 sequences

Tbl2Asn : Template File; Sequence file; Feature Table

Page 27: Databases ii

PUBMED◦ 22.1 million records◦ eTBLAST

CABI SCOPUS Google Scholar

Bibliographic databases

Page 28: Databases ii

Database Nucleic Acids Research BMC Genomics Bioinformatics Nature Cell Plant Cell

Database journals