Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.


1 Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD

2 24 September 2007 ADASS XVII London
Electronic information in astronomy
Astronomy was among the first scientific disciplines to adopt e-publishing (ApJLett 1995, ApJ and AJ 1996)
Astronomy has comprehensive e-abstract and bibliographic services
– Astrophysics Data System, SIMBAD, NED
Astronomy makes extensive use of e-preprints on arXiv.org
Astronomy data is archived and is generally publicly accessible
– NASA mission archives
– ground-based observatories (U.S., Europe, Australia, etc.)
– data centers (catalogs, tables, value-added services)

3 Electronic information in astronomy
E-journals link to underlying data, and data archives link to e-journals, through a system of persistent, unique identifiers
Astronomers interact with a set of connected electronic resources:
– libraries
– journals
– e-prints
– archives and data centers
– bibliographic services
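The two-way linking between e-journals and data archives can be pictured with a small sketch. This is only an illustration of the idea of resolving persistent identifiers in both directions; the `LinkIndex` class and the identifier strings are hypothetical and do not describe how any journal, ADS, or archive actually stores its links.

```python
from collections import defaultdict


class LinkIndex:
    """Toy two-way index between paper identifiers and dataset identifiers."""

    def __init__(self):
        self._datasets_by_paper = defaultdict(set)
        self._papers_by_dataset = defaultdict(set)

    def link(self, paper_id, dataset_id):
        # Record the link in both directions so either side can resolve the other.
        self._datasets_by_paper[paper_id].add(dataset_id)
        self._papers_by_dataset[dataset_id].add(paper_id)

    def datasets_for(self, paper_id):
        """Datasets cited by a paper (journal -> archive direction)."""
        return sorted(self._datasets_by_paper.get(paper_id, ()))

    def papers_for(self, dataset_id):
        """Papers that use a dataset (archive -> journal direction)."""
        return sorted(self._papers_by_dataset.get(dataset_id, ()))


# Hypothetical identifiers, for illustration only.
index = LinkIndex()
index.link("paper:example-001", "dataset:example-A")
index.link("paper:example-001", "dataset:example-B")
index.link("paper:example-002", "dataset:example-A")

print(index.datasets_for("paper:example-001"))
print(index.papers_for("dataset:example-A"))
```

Because the identifiers are persistent and unique, the same pair of lookups keeps working even when the journal site or the archive reorganizes its pages.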

4 The data preservation problem
Research communities publish peer-reviewed journal papers that describe highly processed data.
Long-term preservation and curation systems for digital journal content are not currently in place; only the graphical representations of data are being saved.
The research cannot be verified and the results cannot be easily compared to other data in order to broaden impact.
Public funds invested in scientific research do not have maximum return on investment.
Essential legacy datasets are being lost.

5 Astronomy Digital Image Library

6 ADIL query

7 ADIL query
ADIL is great, but…
Data capture and curation are separate from manuscript processing
Data access is not integrated into the journals
Data management is centralized

8 Spectral data in NED

9 Spectral data in NED

10 Spectral data in NED

11 Spectral data in NED
NED spectra are great, but…
Data capture and curation are separate from manuscript processing
Data access is not integrated into the journals
Data management is centralized

12 Storyboard

13 Storyboard
Hubble Space Telescope image.
Most distant cluster of galaxies known.
What more can I find out?

14 Storyboard
Where is this?
What is the image scale?
Where is north?
How bright is the star?
How bright is the galaxy?
What else is known about this region?
Can I trust the data analysis in this paper?

15 Storyboard
Save file
Copy to my VOSpace
Display and compare

18 Journal… Archive…

23 Is there any X-ray emission from this cluster of galaxies?

24 Approach
Integrate digital data management into the publication process (data capture, review, metadata tagging and validation, storage).
Exploit emerging information technology standards for managing distributed data collections, including digital journals.
Provide multiple access methods to digital data to maximize visibility and re-use.
Exploit information management and curation experience in the university libraries and build on long-term institutional commitments to preservation.

25 Components
Data Storage Appliance (shown three times, connected by replication services)
– Metadata database
– Digital data objects
– Ancillary information
VOSpace
Publication & Editorial Process
– Data capture
– Metadata capture & validation
– Links
– Identifiers
Data Access
– VO portals
– Journal portals
– Other after-market distributors
Registry
Logging
Library
– Curation
– Preservation
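One way replicated storage appliances could check each other is to compare every copy of a data object against a recorded checksum. The sketch below is an assumption about how such a check might look, not a description of the replication services in the slide; the site names, the object identifier, and the function `find_bad_replicas` are hypothetical, and in-memory dictionaries stand in for real storage.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Hex-encoded SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def find_bad_replicas(object_id, expected_digest, replicas):
    """Return names of replicas whose copy is missing or fails the checksum.

    `replicas` maps a site name to a dict of {object_id: bytes}; the dicts
    are stand-ins for real storage appliances.
    """
    bad = []
    for name, store in replicas.items():
        data = store.get(object_id)
        if data is None or sha256_hex(data) != expected_digest:
            bad.append(name)
    return sorted(bad)


# Hypothetical data object and three replicas: one good, one corrupted, one empty.
original = b"example data object payload"
digest = sha256_hex(original)
replicas = {
    "site_a": {"obj-1": original},
    "site_b": {"obj-1": original + b"x"},
    "site_c": {},
}

bad_sites = find_bad_replicas("obj-1", digest, replicas)
print(bad_sites)
```

A replica flagged this way would be repaired by copying the object again from a site that still passes the check.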

26 Data preservation tasks & partners
Tasks (partners)
– Metadata definition (VO, library)
– Content management tool evaluation/selection (Fedora) (VO, library)
– Physical storage and replication (VO, library, publisher)
– Publication process revisions and testing (publisher, editorial staff)
– Policy development (editorial staff, professional society)
– Business model development (publisher, professional society)

27 The curation challenge
Digital data is useless without accurate metadata
Data collections cannot be located/queried/mined without accurate metadata
Metadata curation can be automated, but not completely
Curation is an ongoing and significant cost for digital data management
– Virtual Observatory registry
– Data archives
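The point that metadata curation can be automated only in part can be sketched as a triage step: automated checks accept clean records and route the rest to a human curator. The field names in `REQUIRED_FIELDS` and the sample records are hypothetical and are not drawn from any Virtual Observatory schema.

```python
# Hypothetical minimal schema, for illustration only.
REQUIRED_FIELDS = ("title", "creator", "ra_deg", "dec_deg")


def check_record(record):
    """Return a list of problems found by automated checks (empty if none)."""
    problems = []
    for field in REQUIRED_FIELDS:
        if record.get(field) in (None, ""):
            problems.append(f"missing {field}")
    ra = record.get("ra_deg")
    dec = record.get("dec_deg")
    # Range checks catch obvious errors; they cannot tell whether a
    # plausible-looking value is actually the right one.
    if isinstance(ra, (int, float)) and not 0.0 <= ra < 360.0:
        problems.append("ra_deg out of range [0, 360)")
    if isinstance(dec, (int, float)) and not -90.0 <= dec <= 90.0:
        problems.append("dec_deg out of range [-90, 90]")
    return problems


def triage(records):
    """Split records into those that pass and those that need a curator."""
    accepted, needs_review = [], []
    for record in records:
        problems = check_record(record)
        if problems:
            needs_review.append((record, problems))
        else:
            accepted.append(record)
    return accepted, needs_review


records = [
    {"title": "Example image", "creator": "A. Author", "ra_deg": 150.1, "dec_deg": 2.2},
    {"title": "", "creator": "B. Author", "ra_deg": 400.0, "dec_deg": 2.2},
]
accepted, needs_review = triage(records)
print(len(accepted), "accepted;", len(needs_review), "sent to a curator")
```

The records in `needs_review` are where the ongoing human cost arises: someone has to look at each one and decide what the correct value is.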

28 Digital data discovery and access are essential for the research community
Data re-use, with provenance
Optimization of public investment in science
Increasing the discovery space
Creation of a research legacy
Integrity in scientific publication
Success requires cooperation among providers (individual and institutional), publishers, curators, and preservationists

