Top Banner
modENCODE data and tools for the community Gos Micklem University of Cambridge www.modencode.org
45

modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

Oct 10, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

modENCODE data and tools

for the community

Gos Micklem University of Cambridge

www.modencode.org

Page 2: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

modENCODE DCC Data  Flow modENCODE DCC data  wranglers

submit  data  & meta-­‐data

modENCODE DCC pipeline

QC vet

release

meta-­‐data data

Faceted Browser modMine Amazon/Bionimbus

data.modencode.org intermine.modencode.org www.bionimbus.org 9p.modencode.org

Page 3: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

modENCODE Data  Volume

2317 of 3763 datasets released: ~6 TB

Final freeze: expect  ~20-­‐25 TB altogether

Page 4: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

modENCODE Data  Volume

2317 of 3763 datasets released: ~6 TB

Final freeze: expect  ~20-­‐25 TB altogether

Post-­‐laptop era Nuisance to download

GEO/SRA (crude), WormBase/ FlyBase (refined) Amazon/ BioNimbus (all)

Page 5: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

www.modencode.org

Page 6: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

Faceted Browser: data.modencode.org

Page 7: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 8: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 9: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 10: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 11: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

11

Page 12: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 13: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

13

9p.modencode.org

Page 14: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

14

Page 15: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 16: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 17: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 18: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 19: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 20: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 21: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

GBrowse

Can save track combinations

Page 22: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 23: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

www.modmine.org

Page 24: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

www.modmine.org

Page 25: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

- Antibody names: PolII, H3K4me1, CP190 - Lab names: Reinke, Snyder- Combine terms with AND/AND NOT: fly AND embryo

Page 26: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 27: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 28: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 29: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

Growth

Chromatin preps

ChIP

Hybridisation

Page 30: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

Scanning

Normalisation

Enriched regions

Page 31: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

www.modmine.org

Page 32: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

lists in modMine

Page 33: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

fly gene expression  from list

Page 34: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

StaFsFcal enrichment GO terms PublicaFons

Page 35: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

www.modmine.org

Page 36: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 37: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

www.modmine.org

Page 38: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

Science paperfigures

Page 39: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 40: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 41: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB
Page 42: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

“amazon modENCODE data” hHp://aws.amazon.com/datasets/8042906995278110

42

NOTE: these snapshots only contained released data  up to December 2011

AMI  = Amazon Machine Image Mount  everything, GBrowse, just  data Pay as you go

Page 43: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

43

www.bionimbus.org

Page 45: modENCODE: Data and Tools for the Community - Genome.gov · data.modencode.org intermine.modencode.org 9p.modencode.org. modENCODE Data Volume 2317 of 3763 datasets released: ~6 TB

Acknowledgments modENCODE DCC:

Nicole Washington, Seth Carbon, Ellen Kephart, Paul Lloyd, Chris Mungall, E.O. Stinson, Suzanna Lewis (LBNL)

Daniela Butano, Sergio Contrino, Fengyuan Hu, Rachel Lyne, Kim Rutherford, Richard Smith, Gos Micklem (Cambridge)

Angie Hinrichs, Jim Kent (UCSC)

Marc Perry, Peter Ruzanov, Quang Trinh, Zheng Zha, Lincoln Stein (OICR)

All the modENCODE data producers