Wayne Schroeder, Paul Tooby Data Intensive Cyber Environments Team (DICE) DICE Center, University of North Carolina at Chapel Hill; Institute for Neural Computation (INC), University of California San Diego irods.org, dice.unc.edu, diceresearch.org IRODS: the Integrated Rule-Oriented Data- Management System
11
Embed
Wayne Schroeder, Paul Tooby Data Intensive Cyber Environments Team (DICE) DICE Center, University of North Carolina at Chapel Hill; Institute for Neural.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Wayne Schroeder, Paul ToobyData Intensive Cyber Environments Team (DICE)
DICE Center, University of North Carolina at Chapel Hill;Institute for Neural Computation (INC), University of California San Diego
irods.org, dice.unc.edu, diceresearch.org
IRODS: the Integrated Rule-Oriented Data-Management System
Who Are We?Computer Scientists and Software Engineers
Started in 1997
Grew out of High Performance Computing
• Now Broader and Digital Libraries/Preservation
Doing applied research
• Digital Preservation and Data-Grids
Develop and distribute Integrated Rule-Oriented Data management System (iRODS)
• Open Source; PCs to High-Performance Computing
What Problems Are We Solving?
Researchers have perhaps millions of computer files
Keep them safely stored and replicated (remotely)
Distribute them across network; remote access
Automatic handling; rules, work-flows
Keep track of what they are (meta-data)
Be able to find the right ones quickly (queries)
Share them, in a controlled manner (authentication, access control, audit trails)
Preserve them; change storage transparently
What Does iRODS Do? (1 of 3)
Remote High-Performance Data Access get/put, read/write
Parallel threads for large transfers
Unified View Of Disparate Data Separates physical from logical (logical name-space)
Keeps track of names and locations of files
Storage Type Independent Unix/Windows File Systems HPSS (Archival Storage) Etc
Management of Large Collections irsync Audit trails metadata etc
Scientist AAdds data to
Shared Collection
Scientists can use iRODS as a “data grid” to share multiple types of data, near and far. iRODS Rules also enforce and audit human subjects access restrictions.
Sharing Data in iRODS Data System
Brain Data Server, CA
iRODS MetadataCatalog
iRODS Data System
Audio Data Server, NJ
Video Data Server, TN
Scientist BAccesses and
analyzes shared Data
DICE Technologies Helping UCSD Projects
The National Center for Microscopy and Imaging Research (NCMIR) is using DICE SRB and testing iRODS in the Cell Centered Database project.
DICE iRODS helps computational seismologists from the Southern California Earthquake Center (SCEC) manage large-scale earthquake simulation data at SDSC and other TeraGrid sites.
UCSD Libraries Digital Asset Management System (DAMS) using DICE technologies, including SRB.
DICE iRODS helps Ocean Observatories Initiative (OOI) with Scripps and Calit2 manage large-scale diverse ocean data, including real-time streaming data.
And others including CineGrid, TDLC, etc.
Connecting Data Collections for New Science
"Federating" isolated "silos" of data enables new collaborations OOI ocean data flows in iRODS data grid to
NOAA National Climatic Data Center (NCDC)
NCDC climate data is accessed through data grid for CUAHSI hydrology research on floods
CUAHSI hydrology data connects to Odom Institute for social science research on human impacts and response to floods
OOI climate data discovered and flows to iPlant Consortium for designing drought-resistant plants for climate change adaptation
Growing Use of iRODS Data System
Astronomy: NOAO, NVO, Observatoire de Strasbourg, France; CADAC, etc.
Geo: NOAA NCDC; OOI; SCEC, etc.
HPC: TeraGrid sites, SDSC, TACC, NICS, etc. NASA NCCS
Bio: TDLC, NICMIR, iPlant, etc.
Preservation: NARA TPAP, French National Library; Texas Digital Library; Fedora Commons; Dspace, etc.
Workflow: Kepler, Taverna, etc.
International: EU SHAMAN; Australian ARCS; UK e-Science; KEK (Japan); Academica Sinica (Taiwan); CC-IN2P3 HEP, France; etc.