Versioning of Digital Objects in a Fedora-based Repository Matthias Razum FIZ Karlsruhe DORSDL Workshop Alicante September 21, 2006.

Slides:



Advertisements
Similar presentations
Georg Botz | Southampton | 28 February Max Planck Society Georg Botz Open Access Policy MPS Administrative Headquarters, Munich
Advertisements

DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
Easy On Fedora Experiences with eSciDoc. Laurents Sesink 11/12 september 2008 Odense Table of Contents 1.Introduction to EOF 2.eSciDoc 3.EOF Data Model.
A Unified Approach to Combat Counterfeiting: Use of the Digital Object Architecture and ITU-T Recommendation X.1255 Robert E. Kahn President & CEO CNRI,
A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
Repositories: Disruptive Technology or Disrupted Technology? Sandy Payette, Executive Director DORSDL Workshop at ECDL 2008 September 2008.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
How do you squeeze all of a research project into the repository? Michael Wood Institutional Repository Manager ARROW Community Day, Melbourne 27 th September.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
July 11 th, 2005 Software Engineering with Reusable Components RiSE’s Seminars Sametinger’s book :: Chapters 16, 17 and 18 Fred Durão.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
Using eSciDoc Turn-key access?. Lodewijk Bogaards Software Architect and project leader Easy On Fedora DORSDL2 Generic a.relating to or characteristic.
„Services for OA Publishing“ Berlin 4 – Workshop Wolfram Horstmann 2006/03/31 I n i t i a t i v e f o r I n n o v a t i o n i n S c h o l a r l y C o m.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
Easy On Fedora Experiences with eSciDoc. Lodewijk Bogaards Software Architect and project leader Easy On Fedora Open Repositories 2008 Table.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
ÆKOS: A new paradigm for discovery and access to complex ecological data David Turner, Paul Chinnick, Andrew Graham, Matt Schneider, Craig Walker Logos.
SYNAT - the Polish National Research Content Infrastructure Wojtek Sylwestrzak, ICM Tomasz Rosiek, ICM Tomasz Krassowski, ICM Tartu, Estonia June 27, 2012.
„Serving Innovation …“ ElPub2006 – Workshop Wolfram Horstmann 2006/06/14 I n i t i a t i v e f o r I n n o v a t i o n i n S c h o l a r l y C o m m u.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Long-term preservation aspects in the eSciDoc project Natasa Bulatovic Max-Planck Digital Library
DATA FOUNDATION TERMINOLOGY WG 4 th Plenary Update THE PLUM GOALS This model together with the derived terminology can be used Across communities and stakeholders.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
An Introduction to Software Architecture
1st Workshop on Intelligent and Knowledge oriented Technologies Universal Semantic Knowledge Middleware Marek Paralič,
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
5 June 2013 SDMX Technical Working Group Luxembourg 1 5 June 2013 SDMX Technical Working Group Luxembourg 1 WP Item 6 The Expressions Language of Banca.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Information Network Overlay Architecture Adding Value to Digital Content Carl Lagoze CS 431 – May 4, 2005 Cornell University.
This work is licensed under a Creative Commons Attribution 2.0 Germany License eSciDoc NIMS Malte Dreyer.
1 1 SCHOLARLY PUBLISHING & ACADEMIC RESOURCES COALITION An initiative of the Association of Research Libraries Gaining Independence through.
Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Software Sustainability Institute Software Attribution can we improve the reusability and sustainability of scientific software?
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
Five fantastic Fedora Commons projects in five minutes, in no particular order Carol Minton Morris Communications Director National Science Digital Library,
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
Managing Access at the University of Oregon : a Case Study of Scholars’ Bank by Carol Hixson Head, Metadata and Digital Library Services
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
Copyright © 2007, Oracle. All rights reserved. Using Document Management and Collaboration Appendix B.
FACES General Overview ViRR (Virtueller Raum Reichsrecht) Software Solutions Kristina Büchner and Bastien Saquet Contact:Kristina Buechner:
Metadata Driven Aspect Specification Ricardo Ferreira, Ricardo Raminhos Uninova, Portugal Ana Moreira Universidade Nova de Lisboa, Portugal 7th International.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
An Introduction to Data Modeling with Fedora Thorny Staples Fedora Commons, Inc.
Fedora Digital Object in a Nutshell Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
21st October 2008 eSciDoc – A Service Infrastructure for Cultural Heritage Content VSMM 2008 – Digital Archives Online Natasa Bulatovic, Ulla Tschida,
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
About FACES Collection of images of adult emotional facial stimuli (171 women and men) 6 emotions: neutrality, sadness, disgust, fear, anger and happiness.
Overview: Fedora Architecture and Software Features
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
VI-SEEM Data Repository
eSciDoc –Object and content modelling experiences
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
eSciDoc – Service infrastructure and solutions development
eSciDoc – Content model requirements
ESciDoc Overview Malte Dreyer.
NSDL Data Repository (NDR)
ESciDoc Introduction M. Dreyer.
ESciDoc Introduction M. Dreyer.
Malte Dreyer – Matthias Razum
Presentation transcript:

Versioning of Digital Objects in a Fedora-based Repository Matthias Razum FIZ Karlsruhe DORSDL Workshop Alicante September 21, 2006

2September 21, 2006 ECDL – DORSDL Workshop, Alicante Outline Motivation Versioning Concepts in eSciDoc Content Models Technical Approach Conclusion

3September 21, 2006 ECDL – DORSDL Workshop, Alicante eSciDoc is a joint project of the Max-Planck-Society (MPS) and FIZ Karlsruhe 6 million € five-year grant (2004 – 2009) from the German Federal Ministry of Education and Research It aims to build an integrated information, communication and publishing platform for web-based scientific work, exemplarily demonstrated for multi-disciplinary applications in the MPS eSciDoc is not a mere research project, but aims at establishing an innovative productive system Project Setup and Mission

4September 21, 2006 ECDL – DORSDL Workshop, Alicante Repositories for eScience The contents of an institutional repository or a digital library form the ‘institutional memory’ of an organization And just like human memory, they should allow for associating information objects in novel contexts, thus creating new scholarship Interdisciplinary work is becoming increasingly important, so systems have to span scientific disciplines Repositories should be open, application-independent and flexible, thus laying the ground today for repurposing the information in future applications

5September 21, 2006 ECDL – DORSDL Workshop, Alicante Turning Static Objects into ‘Living’ Knowledge e-Scholarship allows to publish all intermediate results of knowledge generation from first ideas, theories, discussions with peers to final results Institutional Repositories and Digital Libraries need to support scholars already in the early steps of this process, thus enabling their users to share their work in progress with peers Thinking a step further leads to interactive authoring environments with support for collaboration and annotations As a result, objects loose their static nature and become ‘active nodes’ in a network of knowledge

6September 21, 2006 ECDL – DORSDL Workshop, Alicante Implications The concept of ‘ownership’ of an artifact is loosened and partly replaced by an ongoing authoring process which spans persons, places, and time Collaborative authoring raises an issue familiar to software developers: versioning of digital objects All intermediate or working versions of artifacts should become part of the repository, not just the final versions Good Scientific Practice requires provenance data for objects and versioning

7September 21, 2006 ECDL – DORSDL Workshop, Alicante Outline Motivation Versioning Concepts in eSciDoc Content Models Technical Approach Conclusion

8September 21, 2006 ECDL – DORSDL Workshop, Alicante Versioning on Object Level Fedora’s basic object model – as defined in FOXML – is composed of an identifier, some key descriptive properties and a set of datastreams Currently, each change to a datastream leads to a new version of the datastream, but not of the object itself. On the other hand, authors and editors perceive objects as one coherent entity, not as a set of datastreams. They request a ‘whole-object’ versioning which complies with their mental model.

9September 21, 2006 ECDL – DORSDL Workshop, Alicante Fixed and Floating Object References Scholarly work strongly relies on citations and external references to existing material (e.g. primary data and supplementary material) In the context of digital repositories, these associations are expressed as object relations. Versioning of objects then raises the question how to handle relations pointing to a versioned object. eSciDoc implements two approaches: fixed relations pointing exactly to a given version of an object and floating relations which always point to the latest version of an object.

10September 21, 2006 ECDL – DORSDL Workshop, Alicante Internal and Public Versions Versions represent intermediate work statuses and are only visible to authors of digital objects Revisions are published versions of objects with persistent identifiers. Creating a revision is an intellectual step which most often includes some form of quality assurance, whereas versioning is an automated process.

11September 21, 2006 ECDL – DORSDL Workshop, Alicante Container Objects eSciDoc allows the grouping of objects by means of container objects like collections or bundles. A change to one of the contained objects substantially changes the container object as well. Therefore, any change to a contained object should lead to a new version of the container object. The same applies to revisioning: container objects are citable objects with their own persistent identifier. Revisioning of contained objects forces a new revision of the container object too.

12September 21, 2006 ECDL – DORSDL Workshop, Alicante Outline Motivation Versioning Concepts in eSciDoc Content Models Technical Approach Conclusion

13September 21, 2006 ECDL – DORSDL Workshop, Alicante Content Models in General An important part of implementing a Fedora repository is modeling different classes or “genre” of digital object that will be created, stored, and managed in the repository. A content model will typically describe the following: –Datastream composition the number and kinds of datastreams that must be present in the digital object the format(s) for those datastreams, either MIME or format identifiers whether each kind of datastream is required or optional whether each kind of datastream has cardinality contraints –Semantic identifiers for each kind of datastream relationships in the cases where a content model is a “graph” of related content models –Disseminators (optional)

14September 21, 2006 ECDL – DORSDL Workshop, Alicante eSciDoc Content Models Few generic object patterns: –Instances (administrative context) –Aggregations (collections, bundles) –Content Items –Licenses –Persons Genres are specialized object pattern with –Additional constraints and rules –Genre-specific metadata

15September 21, 2006 ECDL – DORSDL Workshop, Alicante Structural View of Content Item Content Item hasRevision * Content Component hasComponent * CC License hasLicense * License hasLicense * Metadata hasMD * Essential Properties hasProperties 1 eSciDoc Metadata hasDefaultMD 1 CC Metadata 1 hasMD

16September 21, 2006 ECDL – DORSDL Workshop, Alicante Content Item Modeled as Fedora Object Content Component RELS-EXT CC MD License 1... License n Content Stream hasComponent * Content Item RELS-EXT eSciDoc MD MD 1... MD n WOV MD

17September 21, 2006 ECDL – DORSDL Workshop, Alicante Container Modeled as Fedora Object Content Item RELS-EXT eSciDoc MD MD 1... MD n WOV MD hasMember * Container RELS-EXT eSciDoc MD MD 1... MD n Structure Map WOV MD

18September 21, 2006 ECDL – DORSDL Workshop, Alicante Outline Motivation Versioning Concepts in eSciDoc Content Models Technical Approach Conclusion

19September 21, 2006 ECDL – DORSDL Workshop, Alicante Whole-Object Versioning Metadata Fedora versioning works automatically within objects The eSciDoc middleware keeps track of whole object versions via objectVersion metadata The eSciDoc middleware also can tag particular whole object versions as “revisions” which will be official published views of the object

20September 21, 2006 ECDL – DORSDL Workshop, Alicante Animated View t0t0 t1t1 t2t2 t3t3 t4t4 Content Item CC 1 PID: parent:1 VersionID: 1.0 DOI: -- PID: child:1 Version: t 0 PID: child:2 Version: t 0 PID: parent:1 VersionID: 1.1 DOI: -- PID: child:1 Version: t 0 PID: child:2 Version: t 1 PID: parent:1 VersionID: 1.2 DOI: -- PID: child:1 Version: t 0 PID: child:2 Version: t 1 PID: child:3 Version: t 2 PID: parent:1 VersionID: 1.3 DOI: x.y/rev:1 PID: child:1 Version: t 0 PID: child:2 Version: t 1 PID: child:3 Version: t 2 PID: parent:1 VersionID: 1.4 DOI: -- PID: child:1 Version: t 4 PID: child:2 Version: t 1 PID: child:3 Version: t 2 CC 2 CC 3 Revision

21September 21, 2006 ECDL – DORSDL Workshop, Alicante Object Version XML this is the first whole object version demo:5 is the same; demo:6 modified; demo:7 ingested

22September 21, 2006 ECDL – DORSDL Workshop, Alicante Outline Motivation Versioning Concepts in eSciDoc Content Models Technical Approach Conclusion

23September 21, 2006 ECDL – DORSDL Workshop, Alicante Conclusion Versioning is essential for repositories which cover the whole object lifecycle Fedora already comes with a powerful versioning mechanism, but cannot fulfill all requirements of eSciDoc Atomistic content models make versioning even more complex The proposed approach provides a solution for advanced versioning requirement and at the same time is a demonstration of Fedora’s flexibility and adaptability

24September 21, 2006 ECDL – DORSDL Workshop, Alicante Acknowledgements The concepts in this presentation are based on eSciDoc’s Logical Data Model, created by Natasa Bulatovic (ZIM, Max Planck Society) a joint workshop of ZIM and FIZ with Sandy Payette and Carl Lagoze

Questions