Presentation is loading. Please wait.

Presentation is loading. Please wait.

Digital Curation Centre: tools and services under development David Giaretta Associate Director (Development) Funders: Digital Curation Centre a centre.

Similar presentations


Presentation on theme: "Digital Curation Centre: tools and services under development David Giaretta Associate Director (Development) Funders: Digital Curation Centre a centre."— Presentation transcript:

1 Digital Curation Centre: tools and services under development David Giaretta Associate Director (Development) Funders: Digital Curation Centre a centre of expertise in data curation and preservation

2 Organisation Industry research collaborators standards bodies testbeds & tools communities of practice: users UKOLN U of Edinburgh CCLRC U of Glasgow U of Edinburgh curation organisations eg DPC Collaborative Associates Network of Data Organisations

3 Organisation Industry research collaborators standards bodies testbeds & tools communities of practice: users community support & outreach research development co-ordination service definition & delivery management & admin support curation organisations eg DPC Collaborative Associates Network of Data Organisations

4 CCLRCUKOLN UofGUofE CMS-Bristol NIEeS RG Durham WT-CFG Leicester IC Maastricht Oxford Dutch NA Swiss NA Urbino UNC Salzburg SDSC NEODC CEH RI NCS RLG Innogen NHS Capri NTUA INRIA HUJ UPC Max- Planck MIMAS IASSIST LDC ACM Data Archive EDG GridPP EGEE Cambridge Leicester Jodrell Bank DLI (US) DPC DELOS UNC ESA NASA NARA CNES ESA RLG BNSC TU Vienna UPenn EBI MRC HGU Kyoto USC INRIA GSK Roslin IBM Almaden JHU CSIRO Caltech JHU CSIRO CDS ESO OCLC AHDS Microsoft IBM Oracle BT STK BADC BODC ESO IVOA Research Councils HEIs & FE Research Institutes International Collaborations Standards Bodies DPC MIMAS ILRT Council for Museums, Archives & Libraries RDN. OCLC So’ton OAI NOF NLA NeSC

5 Overview Developing tools and services which will be needed in the short-medium term –integrating tools from many sources Will be new DCC services as well as useable separately by other projects Strongly OAIS based Support automated processing & interoperability

6 OAIS Reference Model – Functional Model

7 Representation Net

8 Representation Information Classification

9 Representation Information vs Format Format = Structure Omits important information e.g –Language, terminology –Encryption Need to know more than just Format in order to stand a chance of being in a position to use the information

10 Layered Model from OAIS More easily applicable to Science data

11 Representation Information - High Level View Example of use of Representation Information Labelling

12 Registry/Repository Interface and protocols – JAXR “standard” –freebXML implementation –many access methods URL Web Services API Etc.. Findability –Persistent IDs What can we rely on? –Labels (to support automated processing) Initial service this Summer –Hope to work with PRONOM 4 & GDFR

13 Registry/ Repository Trusted repository of Rep. Info –Authenticity of info –Access control –Certificates/Digests : (are they trustable over the long term?) Extensibility Distributed

14 Certification RLG task force preparing draft standard –Based on OAIS (plus TDR) –Expect this to become an ISO standard Tool: –Checklist and reports –… –Awaiting release of draft (in May)

15 Archival Information Package METS XFDU Packaging Expect tools available by end of year

16 Preservation Description Info Will be working with PREMIS on tools

17 DCC Development Roadmap for next 6-12 months Registry –Complete phase 1 –Include links to TNA/PRONOM –Hand over to Services group –Start Phase 2 – aim for “Trusted Repository” status Representation Information: –Data descriptions of science data using EAST (http://east.cnes.fr) & others –Import other Structure description tools and Data Dictionary tools –Develop Mapping to data object level –Work with other projects e.g. Emulation, Processing Certification –Draft certification Checklist Proposed standard Additional Tools –Metadata extraction tool set –Ingest tool (based on PAIMAS standard) Testbeds e.g. large scale data management tools

18 Research To draw together the various functions of curation, from the traditional archival functions to the maintenance and publication of evolving knowledge as seen in scientific databases. To identify through direct research collaboration, and through interaction with the service arm of DCC, the key projects in which research is needed. To conduct research in areas already identified by the partners as crucial to digital curation. To institute two-way conduits between research and service in which practical issues can be drawn to the attention of researchers and the products of research can be tested in practice.

19 Current research priorities Data integration and publication Performance and optimisation Annotation Appraisal and long-term preservation Socio-economic and legal context: rights, responsibilities and viability Cost-benefit analysis of the data curation process Security: safe and effective data analysis environments Automation of metadata extraction Visitors Programme and Seminar Series

20 Summary Developing and integrating OAIS based tools Reviewing other related tools See http://www.dcc.ac.uk –also Development Web site (http://dev.dcc.rl.ac.uk) with a Wiki and associated open email list have been set up. –aim to encourage widest possible collaboration with other projects. In medium-long term expect tools from DCC Research activities e.g. Annotation


Download ppt "Digital Curation Centre: tools and services under development David Giaretta Associate Director (Development) Funders: Digital Curation Centre a centre."

Similar presentations


Ads by Google