Jan 03, 2016

Page 1: Usability Issues Facing 21st Century Data Archives Joey Mukherjee and David Winningham joey@swri.org.

Usability Issues Facing 21st Century Data Archives

Joey Mukherjee and David Winningham
joey@swri.org

Page 2

Current Archiving Goal

[Flow diagram. Labels: Mission Team; Raw Data; Processed Data; Write Papers; Data Iteration; Quality Data; Archive; Future Scientists; Quality Data]

Page 3

Current Archiving Reality

[Flow diagram. Labels: Mission Team; Raw Data; Processed Data; Write Papers; Data Iteration; Data Subsets; Permanent Archive; Future Scientists; Unchecked Data; Home Institution Archive; Public Data]

Page 4

New Goal

[Flow diagram. Labels: Mission Team; Raw Data; Processed Data; Write Papers; Data Iteration; Processed Data; Archive; Future Scientists; Processed Data]

Page 5

Standardizing HOWTO

Make it easy

Make it useful

Make it extensible

Page 6

Make it Easy

Reading / writing files must be super easy (i.e. cheap!)

– Either with tools or libraries

Tools can be command line or GUI
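As a rough illustration of what "super easy" could look like from the library side, here is a minimal sketch using only the Python standard library. The function names (`write_series`, `read_series`) and the JSON layout are hypothetical and are not part of any format named in these slides.

```python
import json
import os
import tempfile


def write_series(path, times, values, units):
    """Write one time series and its units to a JSON file (hypothetical layout)."""
    if len(times) != len(values):
        raise ValueError("times and values must have the same length")
    with open(path, "w", encoding="utf-8") as f:
        json.dump({"units": units, "times": times, "values": values}, f)


def read_series(path):
    """Read the file back as a (times, values, units) tuple."""
    with open(path, encoding="utf-8") as f:
        doc = json.load(f)
    return doc["times"], doc["values"], doc["units"]


if __name__ == "__main__":
    # Illustrative timestamps and values only; not real mission data.
    path = os.path.join(tempfile.mkdtemp(), "example.json")
    write_series(path, ["2008-01-01T00:00:00Z", "2008-01-01T00:00:04Z"], [1.5, 2.25], "nT")
    print(read_series(path))
```

The point of the sketch is the size of the API: one call to write, one call to read, no setup.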

Page 7

Make it Useful

How do I look at it?

– Plots/Analysis

What else can I do with it?

– Read into IDL, Matlab, Excel, etc.

Must have immediate benefits
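One cheap way to give a file "immediate benefits" is a plain CSV export, since spreadsheets and most analysis packages can import CSV. The sketch below assumes a time series already held in memory; `series_to_csv` and the column label are hypothetical names chosen for illustration.

```python
import csv
import io


def series_to_csv(times, values, value_label):
    """Return a two-column CSV string: a header row, then one row per sample."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["time", value_label])
    for t, v in zip(times, values):
        writer.writerow([t, v])
    return buf.getvalue()


if __name__ == "__main__":
    # Illustrative values only.
    print(series_to_csv(["2008-01-01T00:00:00Z", "2008-01-01T00:00:04Z"], [1.5, 2.25], "B_nT"))
```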

Page 8

Make it Extensible

Must be possible for others to add value-added services

Must be able to hold varieties of data

Must agree to give up control over content

Page 9

Case Studies: HTML

Easy to create!

Once done, look at it in a browser

Embrace / Extend

Page 10

Case Studies: SPASE

Creation is slow and difficult

Once created, no real benefits yet

VxOs have embraced it; no one has extended it yet

Page 11

Case Studies: IDFS

Until recently, complex and difficult to create

Once in, easy to look at, use, archive, etc.

Somewhat extensible

Page 12

Things right with IDFS

Efficient

Self-documenting

Calibrations stored in text file

Science units derived instead of stored

Little to no reprocessing ever needed
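The idea of keeping calibrations in a text file and deriving science units on read can be sketched as follows. This is not the IDFS calibration format or API; the one-line-per-sensor layout (`name c0 c1 ...`) and the function names are assumptions made up for this example.

```python
def parse_calibration(text):
    """Parse lines of the form 'name c0 c1 ...' into {name: [c0, c1, ...]}.

    Hypothetical layout: '#' starts a comment, blank lines are ignored.
    """
    table = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        name, *coeffs = line.split()
        table[name] = [float(c) for c in coeffs]
    return table


def to_science_units(raw_counts, coeffs):
    """Evaluate c0 + c1*x + c2*x**2 + ... for each raw value (Horner's rule)."""
    out = []
    for x in raw_counts:
        y = 0.0
        for c in reversed(coeffs):
            y = y * x + c
        out.append(y)
    return out


if __name__ == "__main__":
    # Illustrative coefficients only.
    cal_text = """
    # sensor  c0     c1
    bx        -10.0  0.5
    """
    cal = parse_calibration(cal_text)
    print(to_science_units([0, 20, 40], cal["bx"]))  # [-10.0, 0.0, 10.0]
```

Because only the raw counts are stored, a corrected calibration means editing the text file, not rewriting the data, which is one way to read "little to no reprocessing ever needed".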

Page 13

Other IDFS Benefits

Can store most types of space physics data from raw telemetry to highly processed science units

Reversible from science units to raw telemetry

Usable by data processor, scientist, and data archiver
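Reversibility is easiest to see with a linear calibration, which can be inverted exactly as long as the gain is non-zero. The sketch below is a simplified illustration under that assumption, not how IDFS itself performs the conversion; the function names, gain, and offset are hypothetical.

```python
def counts_to_units(counts, gain, offset):
    """Forward conversion: raw integer counts to science units."""
    return [gain * c + offset for c in counts]


def units_to_counts(values, gain, offset):
    """Inverse conversion: science units back to raw integer counts."""
    if gain == 0:
        raise ValueError("a zero gain cannot be inverted")
    return [round((v - offset) / gain) for v in values]


if __name__ == "__main__":
    raw = [0, 17, 255]  # illustrative 8-bit telemetry values
    science = counts_to_units(raw, gain=0.5, offset=-10.0)
    assert units_to_counts(science, gain=0.5, offset=-10.0) == raw
```

Non-monotonic calibrations (for example some polynomials) are not invertible this way, so a real format would have to keep the raw values or restrict the calibration model.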

Page 14

Things wrong with IDFS

Overly complex format and API

Not enough support in other tools - poor buy-in

Analysis routines merged with the file format - tried to do too much!

Page 15

Implementation Plan

Develop a simple file format that can contain any and all types of time series space physics data

Develop tools that allow someone to create and inspect files in this format

Merge in the best parts of IDFS, CDF, netCDF, HDF, FITS, etc. without breaking the paradigm of simplicity

Page 16

Simple File Format

Format might already exist:

– HDF5

– XML

– JSON

– Other data models?
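To make the candidates concrete, the sketch below serializes the same small record as JSON and as XML using only the Python standard library. The record layout and the element and attribute names (`series`, `sample`, `time`, `value`) are hypothetical; HDF5 is left out because it needs a third-party library.

```python
import json
import xml.etree.ElementTree as ET

# Illustrative record only; field names are assumptions for this sketch.
RECORD = {
    "name": "bx",
    "units": "nT",
    "times": ["2008-01-01T00:00:00Z", "2008-01-01T00:00:04Z"],
    "values": [1.5, 2.25],
}


def to_json(rec):
    """Serialize the record as indented JSON text."""
    return json.dumps(rec, indent=2)


def to_xml(rec):
    """Serialize the record as one <series> element with <sample> children."""
    root = ET.Element("series", name=rec["name"], units=rec["units"])
    for t, v in zip(rec["times"], rec["values"]):
        ET.SubElement(root, "sample", time=t, value=repr(v))
    return ET.tostring(root, encoding="unicode")


def from_xml(text):
    """Rebuild the record dictionary from the XML produced by to_xml."""
    root = ET.fromstring(text)
    samples = root.findall("sample")
    return {
        "name": root.get("name"),
        "units": root.get("units"),
        "times": [s.get("time") for s in samples],
        "values": [float(s.get("value")) for s in samples],
    }


if __name__ == "__main__":
    print(to_json(RECORD))
    print(to_xml(RECORD))
```

Both encodings carry the same information; the practical difference is how much reading code each one needs, which bears on the "make it easy" requirement.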

Page 17

Making it useful

Get buy-in from visualization tools (SDDAS, DataShop, VisBard, IDL DLM, etc.)

Get buy-in from archive sites (PDS, PSA, NSSDC, etc.)

Seed money is essential

Page 18

Advantages

Providers

Users

Management

Page 19

Advantages: Providers

Instrument teams now have something to work toward

Can develop expertise

Page 20

Advantages: Users

Quick ways to create plots or access data

Expertise again!

Page 21

Advantages: Management

Homogeneous archives are far easier to manage and maintain

Value-added services are a natural extension of quality archives

Page 22

Conclusion

Why now? Because SPASE is gaining traction, this is the next logical step.

This will save money for everyone in the long run.

Everyone benefits from value-added services.