Long-Term Preservation of Astronomical Research Results

Slides:



Advertisements
Similar presentations
September 13, 2004NVO Summer School1 VO Protocols Overview Tom McGlynn NASA/GSFC T HE US N ATIONAL V IRTUAL O BSERVATORY.
Advertisements

The Australian Virtual Observatory e-Science Meeting School of Physics, March 2003 David Barnes.
The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Using Sakai to Support eScience Sakai Conference June 12-14, 2007 Sayeed Choudhury Tim DiLauro, Jim Martino, Elliot Metsger, Mark Patton and David Reynolds.
A Very Brief Introduction to iRODS
Desperately Trying to Cope with the Data Explosion in Astronomical Sciences Ray Norris CSIRO Australia Telescope National Facility.
14 October 2003ADASS 2003 – Strasbourg1 Resource Registries for the Virtual Observatory R.Plante (NCSA), G. Greene (STScI), R. Hanisch (STScI), T. McGlynn.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Virtual Observatory Single Sign-on U.S. National Virtual Observatory National Center for Supercomputing Applications Ray Plante, Bill Baker.
Aus-VO: Progress in the Australian Virtual Observatory Tara Murphy Australia Telescope National Facility.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD.
Virtual Observatory --Architecture and Specifications Chenzhou Cui Chinese Virtual Observatory (China-VO) National Astronomical Observatory of China.
Final Search Terms: Archiving (digital or data) Authentication (data) Conservation (digital or data) Curation (digital or data) Cyberinfrastructure Data.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
University of Bergen Library Electronic publishing Bergen – Makerere visit February 2005.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Virtual Observatory & LIGO Roy Williams California Institute of Technology.
Astronomical data curation and the Wide-Field Astronomy Unit Bob Mann Wide-Field Astronomy Unit Institute for Astronomy School of Physics University of.
F. Genova, Berlin 7, Paris, 2 December 2009 The astronomical information network.
26 October 2005HST Calibration Workshop1 The National Virtual Observatory and HST T HE US N ATIONAL V IRTUAL O BSERVATORY Robert Hanisch US National Virtual.
Strasbourg astronomical Data Centre (DS) Françoise GENOVA.
Federation and Fusion of astronomical information Daniel Egret & Françoise Genova, CDS, Strasbourg Standards and tools for the Virtual Observatories.
Federated Discovery and Access in Astronomy Robert Hanisch (NIST), Ray Plante (NCSA)
VIVO and Scholarly Repositories: Synergistic Opportunities.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
Ray Norris, CSIRO Australia Telescope National Facility The Astronomers’ Data Manifesto.
Sharing scientific data: astronomy as a case study for a change in paradigm Présenté par Françoise Genova.
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
F. Genova, VO as a Data Grid, 2003/06/301 Interoperability of astronomy data bases Françoise Genova, CDS.
12 Oct 2003VO Tutorial, ADASS Strasbourg, Data Access Layer (DAL) Tutorial Doug Tody, National Radio Astronomy Observatory T HE US N ATIONAL V IRTUAL.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
Publishing Combined Image & Spectral Data Packages Introduction to MEx M. Sierra, J.-C. Malapert, B. Rino VO ESO - Garching Virtual Observatory Info-Workshop.
7 Dec 2009R. J. Hanisch: Astronomy Data Standards CERN 1 Data Standards in Astronomy Dr. Robert J. Hanisch Director, US Virtual Astronomical Observatory.
CNI Task Force Meeting April 7, 2008 OAI-ORE Project Briefing David Reynolds Tim DiLauro Sayeed Choudhury Library Digital Programs Sheridan Libraries Johns.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
F. Genova, AstroNET meeting, Poitiers The Astrophysical Virtual Observatory.
Leveraging the Expertise of our Staff and the Information Resources We Manage MIT Libraries Visiting Committee April 13, 2005.
Making the Case for Curation: The Practical Experiment of DSpace Managing Digital Assets February 5-6, 2005 Charleston, SC Ann J. Wolpert, Director of.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Announcing the 2014 National Digital Stewardship Agenda.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Virtual Repository Progress Lars Lindberg Christensen (ESA/ESO)
Accessing the VI-SEEM infrastructure
? What is Institutional Repository for Rutgers University
An Overview of Data-PASS Shared Catalog
Moving towards the Virtual Observatory Paolo Padovani, ST-ECF/ESO
VI-SEEM Data Repository
Jay Bhatt Drexel University Libraries
CNI Spring 2010 Membership Meeting
Introduction to Implementing an Institutional Repository
Planning Observations
DIGITAL LIBRARY.
An OAI-ORE Aggregation for the National Virtual Observatory
The Astronomers’ Data Manifesto
The Anatomy and The Physiology of the Grid
Google Sky.
Bird of Feather Session
The Anatomy and The Physiology of the Grid
A Virtual environment on
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
Maria Teresa Capria December 15, 2009 Paris – VOPlaneto 2009
Presentation transcript:

Long-Term Preservation of Astronomical Research Results Robert Hanisch US National Virtual Observatory Space Telescope Science Institute Baltimore, MD

Electronic information in astronomy Astronomy was one of the first scientific disciplines to pioneer e-publishing (ApJLett 1995, ApJ and AJ 1996) Astronomy has comprehensive e-abstract and bibliographic services Astrophysics Data System, SIMBAD, NED Astronomy makes extensive use e-preprints on arXiv.org Astronomy data is archived and is generally publicly accessible NASA mission archives ground-based observatories (U.S., Europe, Australia, etc.) data centers (catalogs, tables, value-added services) 25 Apr 2007 Science Archives in the 21st Century

Electronic information in astronomy E-journals link to underlying data, and data archives link to e-journals, through a system of persistent, unique identifiers Astronomers interact with a set of connected electronic resources libraries journals, e-prints archives and data centers bibliographic services 25 Apr 2007 Science Archives in the 21st Century

The Virtual Observatory The Virtual Observatory is a framework for providing access to distributed data, distributed services. The VO is about data discovery, access, and integration, and combining data with computational services. Motivation: The data deluge. Needs tools to locate and sift through immense collections and to correlate data from many resources. ~500 TB of data currently available. Scientific discovery opportunities exist at the intersections of diverse data sets. Astronomy, of course! Space science, solar physics, aeronomy, seismology, oceanography, hydrology, biology, genomics, medicine. […]ology and […]onomy. Keywords: Metadata, interoperability 25 Apr 2007 Science Archives in the 21st Century

Data/Information in the VO Basic data digital images, spectra, time series, catalogs, tables Simulations models (results, computer codes, computational services) virtual observations Analysis and interpretation journals, e-preprints reprocessed and enhanced data Name-resolution services “Andromeda Galaxy”, “Messier 31”, “M31”, “NGC 224”, “UGC 454”, etc. ==> ra 00h 42m 44s, dec +41º 16’ 08” Geographic equivalent of “Glenn Dale, MD”, “20769”, “Prince George’s County” ==> 76º 48’ 19 W, 38º 58’ 36” N not discoverable through text-based search engines 25 Apr 2007 Science Archives in the 21st Century

The Virtual Observatory in Astronomy The Virtual Observatory enables new science by greatly enhancing access to data and computing resources. The VO makes it easy to locate, retrieve, and analyze data from archives and catalogs worldwide. The VO is NOT a huge centralized data repository. The VO provides standard protocols for obtaining data from distributed collections. The VO is national (US NVO) and international (IVOA). US National Virtual Observatory is partnership of (real) observatories, universities, IT/CS groups. NSF-funded. International VO Alliance is self-organized collaboratory and standards body (W3C-like). 25 Apr 2007 Science Archives in the 21st Century

Without VO n services, n interfaces astronomer archive 1 service 3 survey 1 survey 3 survey 2 25 Apr 2007 Science Archives in the 21st Century

With VO n services, “1” interface astronomer archive 1 service 3 survey 1 survey 3 survey 2 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Data integration Cas A supernova remnant optical (HST) 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Data integration radio (VLA) 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Data integration x-ray (Chandra) 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Data integration x-ray (Chandra) 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Data integration 25 Apr 2007 Science Archives in the 21st Century

The Key to the VO: Interoperability Metadata standards Data discovery Data requests Data delivery Database queries, responses Distributed applications; web services Distributed storage; replication Authentication and authorization 25 Apr 2007 Science Archives in the 21st Century

The data preservation problem Research communities publish peer-reviewed journal papers that describe highly processed data. Long-term preservation and curation systems for digital journal content are not currently in place; only the graphical representations of data are being saved. The research cannot be verified and the results cannot be easily compared to other data in order to broaden impact. Public funds invested in scientific research do not have maximum return on investment. Essential legacy datasets are being lost. 25 Apr 2007 Science Archives in the 21st Century

Astronomy Digital Image Library 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century ADIL query 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century ADIL query ADIL is great, but… Data capture and curation is separate from manuscript processing Data access is not integrated into the journals Data management is centralized 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Spectral data in NED 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Spectral data in NED 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Spectral data in NED 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Spectral data in NED NED spectra are great, but… Data capture and curation is separate from manuscript processing Data access is not integrated into the journals Data management is centralized 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Storyboard 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Storyboard Hubble Space Telescope image. Most distant cluster of galaxies known. What more can I find out? 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Storyboard Where is this? What is the image scale? Where is north? How bright is the star? How bright is the galaxy? What else is known about this region? Can I trust the data analysis in this paper? 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Storyboard Save file Copy to my VOSpace Display and compare 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Journal… Archive… 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Is there any X-ray emission from this cluster of galaxies? 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Approach Integrate digital data management into the publication process (data capture, review, metadata tagging and validation, storage). Exploit emerging information technology standards for managing distributed data collections, including digital journals. Provide multiple access methods to digital data to maximize visibility and re-use. Exploit information management and curation experience in the university libraries and build on long-term institutional commitments to preservation. 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Components Publication & Editorial Process Data capture Metadata capture & validation Links Identifiers Library Curation Preservation Data Storage Appliance Metadata database Digital data objects Ancillary information Data Storage Appliance Metadata database Digital data objects Ancillary information Data Storage Appliance Metadata database Digital data objects Ancillary information replication services VOSpace Data Access VO portals Journal portals Other after-market distributors Registry Logging 25 Apr 2007 Science Archives in the 21st Century

Data preservation tasks & partners Metadata definition (VO, library) Content management tool evaluation/selection (Fedora) (VO, library) Physical storage and replication (VO, library, publisher) Publication process revisions and testing (publisher, editorial staff) Policy development (editorial staff, professional society) Business model development (publisher, professional society) 25 Apr 2007 Science Archives in the 21st Century

The curation challenge Digital data is useless without accurate metadata Data collections cannot be located/queried/ mined without accurate metadata Metadata curation can be automated, but not completely Curation is an ongoing and significant cost for digital data management Virtual Observatory registry Data archives 25 Apr 2007 Science Archives in the 21st Century

Science Archives in the 21st Century Digital data discovery and access is essential for the research community Data re-use, with provenance Optimization of public investment in science Increasing the discovery space Creation of a research legacy Integrity in scientific publication Success requires cooperation among providers (individual and institutional), publishers, curators, and preservationists 25 Apr 2007 Science Archives in the 21st Century