David Giaretta Colorado Springs 16 Jan 2007

BNSC Agency Report. David Giaretta, Colorado Springs, 16 Jan 2007

Funding: UK support for DAI-IPR comes from CASPAR (http://www.casparpreserves.eu) and DCC (http://www.dcc.ac.uk).

The CASPAR Consortium

Areas of interest: IPR, DAI, Certification BoF

IPR: XFDU and SIP work will be used within CASPAR as a packaging implementation. We hope that the Registry work will be compatible with the CASPAR Registry.

Certification BoF: needed for CASPAR validation.

RepInfo use and maintenance: In this view of the high-level architecture, the lower left-hand corner is about finding and using RepInfo, with a special mention of the role of a Registry (actually a system of registries). The rest of the diagram is about how to maintain RepInfo, i.e. how to share the effort of identifying and creating RepInfo as it is needed.

Registry for Representation Info
1. The user gets data from the archive. The data has an associated Curation Persistent Identifier (CPID).
2. The user is unfamiliar with the data, so requests RepInfo using the CPID.
3. The user receives RepInfo, which has its own CPID in case it is not immediately usable.
The Digital Object could have RepInfo packed with it, as well as the CPID. The aim is to support automated access and processing.
Notes: This gives an example of the way in which a Registry might be used. Note that the RepInfo may be kept WITH the data in the archive. However, we should look on this as a form of "caching" of the RepInfo.
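The lookup steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not CASPAR or DCC code: the `Registry` class, the `RepInfo` record, its `depends_on` field and the CPID strings are all hypothetical names invented for the example. The `cache` argument models RepInfo that has been packed with the data in the archive.

```python
from dataclasses import dataclass, field


@dataclass
class RepInfo:
    """Hypothetical RepInfo record; it carries its own CPID."""
    cpid: str
    description: str
    depends_on: list = field(default_factory=list)  # CPIDs of further RepInfo


class Registry:
    """Hypothetical in-memory stand-in for a RepInfo registry."""

    def __init__(self):
        self._entries = {}

    def register(self, rep_info):
        self._entries[rep_info.cpid] = rep_info

    def lookup(self, cpid):
        return self._entries[cpid]


def resolve(registry, cpid, cache=None):
    """Collect the RepInfo for `cpid` plus everything it depends on.

    `cache` models RepInfo kept with the data: it is consulted first,
    and the registry is used for anything that is missing.
    """
    cache = cache or {}
    resolved = {}
    pending = [cpid]
    while pending:
        current = pending.pop()
        if current in resolved:
            continue  # already fetched; also guards against cycles
        rep_info = cache.get(current) or registry.lookup(current)
        resolved[current] = rep_info
        pending.extend(rep_info.depends_on)
    return resolved


registry = Registry()
registry.register(RepInfo("cpid:fmt-a", "Format A structure", ["cpid:dict-a"]))
registry.register(RepInfo("cpid:dict-a", "Data dictionary for format A"))

chain = resolve(registry, "cpid:fmt-a")
print(sorted(chain))  # ['cpid:dict-a', 'cpid:fmt-a']
```

Following `depends_on` until nothing new turns up mirrors step 3: RepInfo that is not immediately usable points, through its own CPID, to further RepInfo.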

Use of RepInfo: Each "bag of bits" has an associated pointer (CPID) to a Label. The DCC Label points to other RepInfo: Structure = CPID, Semantics = CPID, Rendering s/w = CPID. (Diagram: the Label is shown twice, once marked "copy"; remaining diagram labels: Registry, External.)
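One possible way to picture the Label is as a small record of pointers. The sketch below is an assumption made for illustration only: the `Label` type, its field names and the CPID strings are hypothetical and do not come from the DCC or CASPAR specifications.

```python
from typing import NamedTuple, Optional


class Label(NamedTuple):
    """Hypothetical label: one CPID pointer per kind of RepInfo."""
    structure: Optional[str] = None
    semantics: Optional[str] = None
    rendering_software: Optional[str] = None


def missing_pointers(label):
    """Return the names of the RepInfo kinds the label does not point to."""
    return [name for name, cpid in label._asdict().items() if cpid is None]


# A "bag of bits" holds only a pointer (CPID) to its label.
labels = {"cpid:label-1": Label(structure="cpid:struct-1", semantics="cpid:sem-1")}
data_object = {"bits": b"\x00\x01", "label_cpid": "cpid:label-1"}

label = labels[data_object["label_cpid"]]
print(missing_pointers(label))  # ['rendering_software']
```

Keeping only pointers with the data means the Label and the RepInfo behind it can be updated without touching the bits themselves.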

CASPAR information flow architecture: This introduces the layered view of CASPAR, which points out that we need to deal with more than RepInfo, e.g. Digital Rights. The items in the red ellipse are the RepInfo we have been talking about previously. The virtualisation is introduced to help with automation: we need programs to process the bytes, so how can we make this easier?

CASPAR architecture

DAI: PAIMAS and PAIS will form the basis of the CASPAR ingest implementation.

Accreditation/Certification for repositories
- Long-standing demand for the ability to measure the trustworthiness of digital repositories
- Part of the OAIS "roadmap"
- RLG/NARA working group: Version 1.0 of the Audit and Certification Checklist is about to be released
- New open working group to produce an ISO standard for Audit and Certification
- To join the mailing list, see http://mailman.ccsds.org/cgi-bin/mailman/listinfo/moims-rac (over 100 members)
- Wiki at http://wiki.digitalrepositoryauditandcertification.org

The CASPAR web site: www.casparpreserves.eu. It is important to stress collaboration with projects outside the CASPAR consortium.

CASPAR Testbeds
Three testbeds:
- Cultural: UNESCO
- Performing Arts: INA, IRCAM
- Scientific: ESA and CCLRC
- Complex, multi-source, multifaceted data
- Many common preservation, evaluation and validation issues
- Some specific requirements on preservation (technical, delivery, legal)
- Specific user communities and knowledge bases
- Also test the OAIS model
Notes: Now we introduce the testbeds, noting that it is extremely important to look bottom-up using a very wide range of examples. That is the particular strength of CASPAR: we have to look across a very wide range of disciplines. Clearly we aim at creating components which support not just Arts, Science and Heritage; we believe that what we produce should be much more broadly applicable.

Conclusions
- Information and Knowledge: needs more than just storing the "bits"
- Understanding and being able to process the vast amount of unfamiliar data which is available is hard
- It is expensive
- Costs must be shared
- So far the Open Archival Information System (OAIS) Reference Model provides the conceptual framework
- Many similarities can be exploited
- Many subtleties need to be explored
- Watch this space

Backup slides

Science: CCLRC example. World map of ionosondes. The following few slides give some examples of data, and it is important to stress that these are simply examples of graphical display of the data we are actually interested in.

Example of use of RepInfo: A laser facility produces binary data that is normally used by proprietary software. Describe the data using the EAST data description language, then use it in a generic application (shown here) to display and process it.

Some Issues
- Difficult to derive physical quantities from data
- Can be analysed in multiple ways
- Raises fundamental questions about Representation Information
- Common automated method is proprietary
- Data structure also proprietary
- Paper documentation: restricted access
- Provenance and trust

ESA example: GOME, the Global Ozone Monitoring Experiment on ERS-2. Ozone monitoring from space has a particular relevance because the early satellite data (i.e. before GOME) showed the ozone hole, but during the original processing it was assumed that this must be a problem with the instrument, and a notional value was put in which hid the hole. Only when ground-based measurements showed that there was a hole was the original satellite data reprocessed with the correct algorithm.

GOME data processing Complex processing chains are involved.

GOME Level 2 product: Ozone profile at a given location
GOME Level 3 product: Integration of time and space data
GOME Level 4 product: Integration of GOME, other data and models

Some Issues: Provenance and Context of processed data, and their relationship to the Representation Information of the raw data and to the Knowledge Base of the Designated Community. This raises some fundamental questions about what RepInfo actually means.

UNESCO examples: World Heritage List
Mandatory Documentation:
- Identification of property
- Description of property
- Justification of inscription
- State of conservation and factors affecting the property
- Protection and Management
- Monitoring
- Documentation
- Contact information of responsible authorities
- Signature on behalf of the State Party(ies)
DATA:
- Scanned documents and maps
- Aerial and close range photography (Digital photogrammetry)
- Monument measurements (Laser scanning)
- Satellite images (Remote sensing and image processing)
- Multi-scale digital cartography (Geographic information systems (GIS) and CAD)
- 3D models, virtual tours (Computer visualization)
Note that this includes more "document" style objects.

Performing Arts examples: Score, MAX/MSP patches, additional instructions. Figure 2: Preservation of interactive multimedia performances. (Diagram labels: Motion Capture and Processing; Motions; 3D motion data; Motion Analysis and Recognition; Motion-Multimedia Mapping Strategy; Mapping Parameters; Multimedia Generation; Multimedia output; GUI for monitor and control.)

Some Issues
- What is preservation of "performability"?
- Authenticity
- Composer's intention
- Proprietary software and hardware
- Copyright
- Digital Rights Management

Shared Infrastructure
- Registries of Representation Information
- Persistent Identifier name resolvers: DOI? ARK? URL? None are guaranteed
- Interfaces: support preservation and interoperability
- Standards: Preservation Description Information (Fixity, Provenance, Reference, Context)
Notes: This identifies some of the common infrastructure which is needed for digital preservation.
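As a rough illustration of the four Preservation Description Information categories, the sketch below stores them in one record and uses the Fixity entry to check that the bits are unchanged. Everything specific here is an assumption made for the example: the `PDI` class, the helper names, the sample values and the choice of SHA-256 as the fixity method are not taken from the slides.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class PDI:
    """The four PDI categories; the field contents are hypothetical."""
    fixity: str      # assumption: SHA-256 hex digest of the content
    provenance: str  # where the content came from and what was done to it
    reference: str   # identifier by which the content is known
    context: str     # how the content relates to other information


def make_pdi(content: bytes, provenance: str, reference: str, context: str) -> PDI:
    """Build a PDI record, computing the fixity value from the content."""
    digest = hashlib.sha256(content).hexdigest()
    return PDI(digest, provenance, reference, context)


def fixity_ok(content: bytes, pdi: PDI) -> bool:
    """Return True if the content still matches the recorded fixity value."""
    return hashlib.sha256(content).hexdigest() == pdi.fixity


original = b"example observation data"
pdi = make_pdi(
    original,
    provenance="ingested from example instrument archive",
    reference="cpid:example-1",
    context="part of example campaign",
)

print(fixity_ok(original, pdi))         # True
print(fixity_ok(original + b"!", pdi))  # False
```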