ETDs as pilot materials for long-term preservation efforts in kopal
9th ETD Conference 2006, Quebec
Dr. Thomas Wollschläger, German National Library (GNL)



2 Agenda
1. Challenges for long-term preservation
2. The ETDs at GNL and current tasks
3. The role and features of the kopal initiative
4. Planned data ingest
5. Future challenges

3 The problem of the digital age
• * 196 B.C. – † not yet
• * † 2005 (?)

4 Challenges of a digital long-term archive
• Rapid technology changes hinder access to older file formats
• Problem 1: Conservation of binary data (0 and 1)
  – No existing data carrier lasts forever
  – Solution: Regular bitstream preservation
• Problem 2: Access to the content
  – Numerous formats; new ones keep appearing; old ones vanish
  – Dependencies on current software and hardware
  – Solutions: Migration (regular conversion), Emulation (re-creating the systems originally used)
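A minimal sketch of one common building block of regular bitstream preservation, assuming fixity is tracked with SHA-256 checksums recorded at ingest. The slides do not say how kopal or DIAS verify stored data; the function names and the sample file below are hypothetical.

```python
import hashlib
import os
import tempfile


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_intact(path, recorded_digest):
    """True if the file still matches the digest recorded at ingest."""
    return sha256_of(path) == recorded_digest


def demo():
    """Write a file, record its digest, then overwrite one byte (hypothetical data)."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "thesis.pdf")
        with open(path, "wb") as handle:
            handle.write(b"%PDF-1.4 example bitstream")
        recorded = sha256_of(path)
        before = is_intact(path, recorded)
        with open(path, "r+b") as handle:
            handle.seek(0)
            handle.write(b"X")  # simulate silent corruption on the data carrier
        after = is_intact(path, recorded)
    return before, after
```

A check like this only detects damage; recovery still depends on keeping redundant copies and refreshing them onto new carriers.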

5 Approaches to ensure access
• Migration
• Emulation
Condition: METADATA

6 ETDs for Ingest at German National Library
• Online Theses and Dissertations at GNL
  – Number: ~ at present
  – Growth: ~ p.a.
  – From: German universities (at present 90, of which 83 are active)
  – Collected since 1997
  – Data amount: ~ 350 GB
• Accessible via the Online Catalogue of GNL
• All are accessible for free and in full text (except a tiny amount for legal reasons)
• Most used & respected digital collection of GNL (> access cases/month)

7 ETD preservation challenges
• German ETDs are delivered in numerous file formats
• Innovative file formats have been encouraged over the years
  – 3-D images & simulations
  – Embedded audio and video
  – Executables
• The earliest file types are already no longer accessible
• Unsatisfactory document server architecture up to now
• Advantage: Excellent metadata format throughout Germany, trusted workflows for ETD delivery from universities

8 ETD File Formats in GNL

9 XMetaDiss Example for an ETD

10 German national initiative "kopal"
• Co-operative development of a long-term digital information archive
• Funded by the Federal Ministry of Education and Research
• Financial volume: €4.2 million + self-financed activities of all partners; duration: – (+ X)
• Task: Development of a standardized long-term preservation solution to facilitate long-term preservation for other libraries / industries
• Solution as a facilitator for co-operation between libraries and other institutions / companies

11 kopal: Concept and background
• Basis: DIAS (Digital Information and Archiving System) of the Royal Dutch Library, The Hague
  – Developed by IBM
  – Reliable standard components (CM, TSM, …)
  – Implementation of the OAIS standard
  – Further development of a suitable long-term preservation component (emulation, migration)
  – Starting point for preservation planning
• What was missing:
  – Enhancement for co-operative usage
  – Hosting outside the library (remote access)
  – Development of a universal object scheme
  – A more generic approach
• Conclusion:
  – Extension of DIAS-Core and development of peripheral, open-source-based software tools to broaden its usability

12 kopal: Partners
• German National Library (GNL, leader)
• State and University Library Göttingen
• International Business Machines (IBM) Germany
• Society for Scientific Data Processing Göttingen (GWDG)
Working relationship:
• Royal Dutch Library, The Netherlands

13 Kopal storage structure in Germany

14 kopal: Structure & concept
[Diagram: DIAS by IBM hosted at GWDG (Göttingen), with accounts labelled Account 1 and Account 2; GNL (Frankfurt), SUB Göttingen and further partners (Partners n) each connect through local software.]

15 [Diagram: koLibRI components around DIAS]
• koLibRI Ingest Component: Metadata Extraction, Metadata Generation (JHOVE), UOF Creation (SIP with METS)
• koLibRI Retrieval Component: Selection, Collection, Cache
• Presentation components; User
• Data flow labels: XML + Data, UOF (SIP), UOF (DIP)
• DIAS (OAIS compliant): Ingest, Archival Storage, Data Management, Preservation, Access, Administration

16 Packaging
• Submission Information Package: Object + Mets.xml
• Universal Object Format: METS 1.4 with LMER 1.2 (Long-term preservation Metadata for Electronic Resources)
• Mets.xml sections: Header, dmdSec, amdSec, File Section, Structural Map
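The five Mets.xml sections named on this slide can be sketched as an empty METS skeleton. This is an illustration only, assuming the standard METS and XLink namespaces; it is not validated against the METS 1.4 schema or the Universal Object Format profile, the descriptive and LMER metadata are left empty, and the function name and IDs are hypothetical.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"


def build_mets_skeleton(file_names):
    """Build a METS tree with header, dmdSec, amdSec, file section and structural map."""
    ET.register_namespace("mets", METS_NS)
    ET.register_namespace("xlink", XLINK_NS)

    def q(tag):
        # Qualify a tag name with the METS namespace.
        return f"{{{METS_NS}}}{tag}"

    root = ET.Element(q("mets"))
    ET.SubElement(root, q("metsHdr"))                # Header
    ET.SubElement(root, q("dmdSec"), {"ID": "DMD1"})  # descriptive metadata (empty here)
    ET.SubElement(root, q("amdSec"), {"ID": "AMD1"})  # administrative metadata (empty here)
    file_sec = ET.SubElement(root, q("fileSec"))      # File Section
    group = ET.SubElement(file_sec, q("fileGrp"))
    struct_map = ET.SubElement(root, q("structMap"))  # Structural Map
    div = ET.SubElement(struct_map, q("div"))

    for index, name in enumerate(file_names, start=1):
        file_id = f"FILE{index}"
        file_el = ET.SubElement(group, q("file"), {"ID": file_id})
        ET.SubElement(
            file_el,
            q("FLocat"),
            {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": name},
        )
        # Each file is referenced from the structural map by its ID.
        ET.SubElement(div, q("fptr"), {"FILEID": file_id})
    return root
```

Calling `ET.tostring(build_mets_skeleton(["thesis.pdf"]), encoding="unicode")` returns the skeleton as an XML string for inspection.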

17 XMetaDiss Example for an ETD

18 Example for mets.xml in kopal

19 Kopal preservation strategy
• Migrate object with URN xxx into new format yyy
• Migrate all objects
  – of format xxx and/or
  – that have been ingested before a certain date and/or
  – that are larger than zzz MB
  into new format xyz (e.g. from TIFF to PNG)
• Implementation of emulation view paths
• No restriction on file size or file format / type: all known and unknown file formats are accepted (text, pictures, video, audio, executables, etc.)
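The selection criteria on this slide (format, ingest date, size) can be sketched as a simple filter. This is a hedged illustration, not the DIAS or koLibRI interface: the record type, field names and function are hypothetical. Criteria that are supplied are combined with AND; an "or" selection can be obtained by merging the results of separate calls.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class ArchivedObject:
    """Hypothetical catalogue record for one archived object."""
    urn: str
    file_format: str
    ingested: date
    size_mb: float


def select_for_migration(objects, file_format=None, ingested_before=None,
                         larger_than_mb=None):
    """Return the objects that match every criterion that was supplied."""
    selected = []
    for obj in objects:
        if file_format is not None and obj.file_format != file_format:
            continue
        if ingested_before is not None and not obj.ingested < ingested_before:
            continue
        if larger_than_mb is not None and not obj.size_mb > larger_than_mb:
            continue
        selected.append(obj)
    return selected
```

For example, `select_for_migration(catalogue, file_format="TIFF", ingested_before=date(2005, 1, 1))` would pick only TIFF objects ingested before 2005, which could then be handed to a TIFF-to-PNG converter.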

20 Other data for Ingest
• Electronic journals & serials
  – Data amount: ~ 300 GB
• CD-ROM images
  – Number: ~ to
  – Data amount: ~ to GB
• Digitised materials:
  – Exil Press Digital (from GNL): ~ 150 GB
  – External digital collections: ~ GB
  – Digitised books from the German Book & Scripture Museum (GNL): ~ GB (for starters)
  – Born-digital and digitised audio from the German Music Archive (GNL): ~ GB

21 Data ingest for kopal with ETDs as start

22 Challenge: Preservation Planning + Access
• In the face of rising data volumes and large single objects (e.g. digitised DVD-ROM images of ~8 GB):
  – Guarantee sufficient system performance
  – Implementation of suitable access systems
  – Fast Internet connections, user support
• Implementation of a functioning Preservation Planning mechanism
  – Functioning international File Format Registry
  – Efficient migration of large data volumes
  – Successful implementation of emulation mechanisms
• Informing, supporting & encouraging ETD producers towards format & preservation awareness

23 Information on kopal
• For further information on the kopal project, the standards used, and downloads of documentation, see
• Questions to the kopal team at German National Library:
• Questions on all ETD issues: Co-ordination Agency DissOnline,
Thanks for your patience and attention!