Automatic Evaluation of Migration Quality in Distributed Networks of Converters Miguel Ferreira Supervisors Ana Alice Baptista.

Slides:



Advertisements
Similar presentations
Curating Research: problems and policy Dale Peters Scientific Technical Manager DRIVER II.
Advertisements

Introduction to Planets Hans Hofman Nationaal Archief Netherlands Prague, 17 October 2008.
Systems Analysis and Design in a Changing World
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
Selecting Preservation Strategies for Web Archives Stephan Strodl, Andreas Rauber Department of Software.
Choosing an Optimal Digital Preservation Strategy Andreas Rauber Department of Software Technology and.
University of Minho Development of tools and add-ons for the DSpace platform Miguel Ferreira Ana Alice Baptista
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
Chapter 2 Database Environment.
Future Access to the Scientific and Cultural Heritage – A shared Responsibility Birte Christensen-Dalsgaard State and University Library.
Preservation and Long-term access through Networked Services Adam Farquhar, The British Library iPres2006 Cornell University, October 2006.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Ch 12 Distributed Systems Architectures
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
8 Systems Analysis and Design in a Changing World, Fifth Edition.
Supplement 02CASE Tools1 Supplement 02 - Case Tools And Franchise Colleges By MANSHA NAWAZ.
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
Preserving Digital Collections Andrea Goethals Florida Center for Library Automation (FCLA)
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
12 December, 2012 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: European Filing Rules CWA1Page 1.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
1 Using Utility Analysis to Evaluate and Compare Preservation Strategies Carl Rauch, Andreas Rauber Vienna University of Technology
VTT-STUK assessment method for safety evaluation of safety-critical computer based systems - application in BE-SECBS project.
Content Strategy.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Configuration Management (CM)
Scalable Metadata Definition Frameworks Raymond Plante NCSA/NVO Toward an International Virtual Observatory How do we encourage a smooth evolution of metadata.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
IFAP Special Event: Information and Knowledge for All, Emerging Trends and Challenges Information Preservation 4000 Years of Traditions Challenged by Digital.
Database Planning, Design, and Administration Transparencies
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Event-Based Hybrid Consistency Framework (EBHCF) for Distributed Annotation Records Ahmet Fatih Mustacoglu Advisor: Prof. Geoffrey.
MIS 673: Database Analysis and Design u Objectives: u Know how to analyze an environment and draw its semantic data model u Understand data analysis and.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Access Control and Markup Languages Pages 183 – 187 in the CISSP 1.
Towards a Preservation Strategy Evaluation Workflow Presentation for the ERPANET Workshop By Carl Rauch 13th – 14th of October 2004 Department for Software.
Elmasri and Navathe, Fundamentals of Database Systems, Fourth Edition Copyright © 2004 Pearson Education, Inc. Slide 2-1 Data Models Data Model: A set.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Harokopio University of Athens – Department of Informatics and Telematics HAROKOPIOUNIVERSITY A Distributed Architecture for Building Federated Digital.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
Meeting of the Member States Expert Group on Digitisation and Digital Preservation , Luxembourg European Archival Records and Knowledge Preservation.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Transparent Format Migration of Preserved Web Content D. S. H. Rosenthal, T. Lipkis, T. S. Robertson, S. Morabito Lib Magazine, 11(1), 2005
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Preserving Digital Collections
Building A Repository for Digital Objects
Knowledge Management Systems
Service-centric Software Engineering
Digital Preservation Planning:
Malte Dreyer – Matthias Razum
Metadata supported full-text search in a web archive
Presentation transcript:

Automatic Evaluation of Migration Quality in Distributed Networks of Converters Miguel Ferreira Supervisors Ana Alice Baptista José Carlos Ramalho ECDL 05 Doctoral Consortium

Contents Introductory concepts Research problems Proposed system Methodology Topics for discussion

Introductory concepts Digital preservation –The set of processes and activities that ensure the continued access to information and all kinds of cultural heritage existing in digital formats Digital object –An information object, of any type of information or any format, that is expressed in digital form – Text documents, digital photos, vector graphics, databases, Web pages, software

Strategies for digital preservation Emulation –Reproduction of the behaviour of a hardware/software platform in a different technological environment Encapsulation – Storing information about how the objects should be interpreted Migration –Periodic transfer of digital materials from one hardware/software configuration to another Others –Computer museums, viewers, Universal Virtual Computer

Migration Advantages –Updated formats that users can read and edit Disadvantages –Requires a continuous diligence –Data loss Variants –Migration on request –Normalisation –Distributed migration

Distributed migration A network of remote conversion services supported by a semantic layer [Hunter et al.] Advantages – Platform independent – Redundancy – Multiple migration paths – Cost reduction – Compatible with other migration strategies Disadvantages – bandwidth – Slow Examples –PANIC –MyMorph (NLMed) –TOM

How to choose a preservation strategy? Many preservation alternatives Lack of universal acceptance Distinct preservation requirements –Satisfaction of the designated community – Characteristics of the collection – Budget Framework for evaluating preservation strategies [Rauber] –Utility Analysis

Evaluation of preservation strategies 1.Definition of objective tree 2.Assignment of measurement units (e.g. millimetre, Mb, Euro) 3.Identification of preservation alternatives 4.Execution of preservation alternatives and evaluation of the outcome 5.Weighting of criteria in the objective tree 6.Calculation of partial and total values 7.Ranking of alternatives

Objective tree (example)

Research problems Automation of preservation processes Authenticity issues Cost management Evaluation of preservation alternatives

Research questions Is it feasible to design and implement a system that is able to automatically : – determine the amount of data loss occurred in a migration and generate detailed migration reports for inclusion in the objects’ preservation metadata? – provide recommendations of migration paths or target formats that will best suit users’ requirements?

Proposed System

Methodology - proof of concept The concepts 1.Automatic quantification of data loss occurred in a migration and generation of preservation metadata 2.Automatic recommendation of migration strategies as well as target formats The proof (empirical validation) 1.Evaluator versus Human experts 2.Advisor versus Evaluation framework

Key contributions For individual preservers, digital archives and libraries : – Outsourcing and automation of digital preservation –Generation of preservation metadata (authenticity) – Ranking of migration alternatives For designers and programmers of converters: –Possibility of publishing their converters as services For metadata creators and users: –Increase adoption –Help to improve future versions –Accelerate the development of XML bindings

Round-up Service oriented architecture (SOA) – Automatic quantification of data loss –Provides recommendations on which migration paths or target formats are best suited for each user –Simplifies the creation of preservation metadata –Based on migration Methodology – Proof of concept with empirical validation Evaluator versus Human experts Advisor versus Evaluation framework

Topics for discussion Relevance of research Research methodology System architecture Format registry vocabulary –e.g. MIME types, TOM type descriptors, Global Digital Format Registry, PRONOM, etc. Preservation metadata schema –e.g. PREMIS data dictionary (event entity)