Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD

Dec 20, 2015


Long-Term Preservation of Astronomical Research Results

Robert Hanisch
US National Virtual Observatory
Space Telescope Science Institute
Baltimore, MD


24 September 2007, ADASS XVII, London

Electronic information in astronomy

• Astronomy was one of the first scientific disciplines to pioneer e-publishing (ApJLett 1995, ApJ and AJ 1996)

• Astronomy has comprehensive e-abstract and bibliographic services
  – Astrophysics Data System, SIMBAD, NED

• Astronomy makes extensive use of e-preprints on arXiv.org

• Astronomy data is archived and is generally publicly accessible
  – NASA mission archives
  – ground-based observatories (U.S., Europe, Australia, etc.)
  – data centers (catalogs, tables, value-added services)


Electronic information in astronomy

• E-journals link to underlying data, and data archives link to e-journals, through a system of persistent, unique identifiers

• Astronomers interact with a set of connected electronic resources

[Diagram of connected resources: libraries; journals and e-prints; archives and data centers; bibliographic services]
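The two-way linking between journals and archives described on this slide can be illustrated with a minimal sketch. It assumes a simple in-memory registry; the class name `LinkRegistry` and the identifier strings are hypothetical and do not correspond to any real service or identifier scheme.

```python
from collections import defaultdict


class LinkRegistry:
    """Hypothetical registry of two-way links between articles and datasets."""

    def __init__(self):
        self._data_by_article = defaultdict(set)
        self._articles_by_data = defaultdict(set)

    def link(self, article_id, dataset_id):
        # Record the link in both directions so either side can be the entry point.
        self._data_by_article[article_id].add(dataset_id)
        self._articles_by_data[dataset_id].add(article_id)

    def datasets_for(self, article_id):
        # Journal -> archive direction.
        return sorted(self._data_by_article.get(article_id, ()))

    def articles_for(self, dataset_id):
        # Archive -> journal direction.
        return sorted(self._articles_by_data.get(dataset_id, ()))


# Made-up identifiers, for illustration only.
registry = LinkRegistry()
registry.link("article:example-001", "dataset:example-A")
registry.link("article:example-001", "dataset:example-B")
registry.link("article:example-002", "dataset:example-A")

print(registry.datasets_for("article:example-001"))
print(registry.articles_for("dataset:example-A"))
```

Because the identifiers are the only shared key, the links keep working as long as each identifier stays unique and persistent, whichever system hosts the content.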


The data preservation problem

• Research communities publish peer-reviewed journal papers that describe highly processed data.

• Long-term preservation and curation systems for digital journal content are not currently in place; only the graphical representations of data are being saved.

• The research cannot be verified and the results cannot be easily compared to other data in order to broaden impact.

• Public funds invested in scientific research do not have maximum return on investment. Essential legacy datasets are being lost.


Astronomy Digital Image Library


ADIL query


ADIL is great, but…

• Data capture and curation is separate from manuscript processing

• Data access is not integrated into the journals

• Data management is centralized


Spectral data in NED


NED spectra are great, but…

• Data capture and curation is separate from manuscript processing

• Data access is not integrated into the journals

• Data management is centralized


Storyboard


Hubble Space Telescope image. Most distant cluster of galaxies known. What more can I find out?


• Where is this?
• What is the image scale?
• Where is north?
• How bright is the star?
• How bright is the galaxy?
• What else is known about this region?
• Can I trust the data analysis in this paper?


Save file

Copy to my VOSpace

Display and compare


Journal… Archive…


Is there any X-ray emission from this cluster of galaxies?


Approach

• Integrate digital data management into the publication process (data capture, review, metadata tagging and validation, storage).

• Exploit emerging information technology standards for managing distributed data collections, including digital journals.

• Provide multiple access methods to digital data to maximize visibility and re-use.

• Exploit information management and curation experience in the university libraries and build on long-term institutional commitments to preservation.


Components

Publication & Editorial Process
• Data capture
• Metadata capture & validation
• Links
• Identifiers

Data Storage Appliance (three instances in the diagram)
• Metadata database
• Digital data objects
• Ancillary information

Replication services, VOSpace

Data Access
• VO portals
• Journal portals
• Other after-market distributors
• Registry
• Logging

Library
• Curation
• Preservation
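The storage and replication components can be sketched as follows. This is a toy in-memory model under stated assumptions, not the project's implementation: the `StorageAppliance` class, the `replicate` and `audit` helpers, the site names, and the choice of SHA-256 as the fixity check are all illustrative.

```python
import hashlib


def checksum(payload: bytes) -> str:
    """SHA-256 digest used here as a fixity check (an assumed choice)."""
    return hashlib.sha256(payload).hexdigest()


class StorageAppliance:
    """Hypothetical appliance holding data objects together with their metadata."""

    def __init__(self, name):
        self.name = name
        self.objects = {}  # identifier -> (payload, metadata)

    def store(self, identifier, payload, metadata):
        self.objects[identifier] = (payload, dict(metadata))

    def verify(self, identifier, expected_digest):
        entry = self.objects.get(identifier)
        return entry is not None and checksum(entry[0]) == expected_digest


def replicate(identifier, payload, metadata, appliances):
    """Copy one object to every appliance and return its digest."""
    digest = checksum(payload)
    record = dict(metadata, sha256=digest)
    for appliance in appliances:
        appliance.store(identifier, payload, record)
    return digest


def audit(identifier, digest, appliances):
    """Report, per appliance, whether its copy still matches the digest."""
    return {a.name: a.verify(identifier, digest) for a in appliances}


appliances = [StorageAppliance(n) for n in ("site-a", "site-b", "site-c")]
digest = replicate("dataset:example-A", b"example image bytes",
                   {"title": "Example image"}, appliances)

# Simulate silent corruption of one replica, then audit all copies.
appliances[1].objects["dataset:example-A"] = (b"corrupted", {})
report = audit("dataset:example-A", digest, appliances)
print(report)
```

Keeping the digest alongside the metadata lets any replica be checked independently, so a damaged copy can be detected and restored from one that still verifies.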


Data preservation tasks & partners

• Tasks (partners)
  – Metadata definition (VO, library)
  – Content management tool evaluation/selection (Fedora) (VO, library)
  – Physical storage and replication (VO, library, publisher)
  – Publication process revisions and testing (publisher, editorial staff)
  – Policy development (editorial staff, professional society)
  – Business model development (publisher, professional society)


The curation challenge

• Digital data is useless without accurate metadata

• Data collections cannot be located/queried/mined without accurate metadata

• Metadata curation can be automated, but not completely

• Curation is an ongoing and significant cost for digital data management
  – Virtual Observatory registry
  – Data archives
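The point that metadata curation can be automated, but not completely, can be made concrete with a small sketch. It assumes a hypothetical record layout: the field names (`title`, `ra_deg`, `dec_deg`, `instrument`, `description`) and the rules are illustrative only and are not taken from any VO metadata standard.

```python
# Hypothetical required fields for one data object's metadata record.
REQUIRED_FIELDS = ("title", "ra_deg", "dec_deg", "instrument")


def check_range(record, field, low, high, errors):
    """Append an error if a numeric field is not a number or is out of range."""
    value = record.get(field)
    if value is None or value == "":
        return  # already reported as missing
    if not isinstance(value, (int, float)):
        errors.append(f"{field} is not a number")
    elif not low <= value <= high:
        errors.append(f"{field} out of range [{low}, {high}]")


def validate(record):
    """Return (errors, needs_review) for one metadata record."""
    errors = []
    needs_review = []

    # Automated checks: presence and numeric ranges.
    for field in REQUIRED_FIELDS:
        if record.get(field) in (None, ""):
            errors.append(f"missing field: {field}")
    check_range(record, "ra_deg", 0.0, 360.0, errors)
    check_range(record, "dec_deg", -90.0, 90.0, errors)

    # Free text cannot be judged automatically, so route it to a curator.
    if record.get("description"):
        needs_review.append("description: free text, needs a curator's check")

    return errors, needs_review


good = {"title": "Example cluster image", "ra_deg": 150.0, "dec_deg": 2.2,
        "instrument": "example-camera", "description": "Deep exposure"}
bad = {"title": "", "ra_deg": 400.0, "dec_deg": 2.2}

print(validate(good))
print(validate(bad))
```

The split between `errors` and `needs_review` mirrors the slide: mechanical checks catch missing or impossible values, while judgment about meaning still requires a person, which is where the ongoing cost arises.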


Digital data discovery and access is essential for the research community

• Data re-use, with provenance
• Optimization of public investment in science
• Increasing the discovery space
• Creation of a research legacy
• Integrity in scientific publication

Success requires cooperation among providers (individual and institutional), publishers, curators, and preservationists