An Architecture for Complex Objects and their Relationships

Slides:



Advertisements
Similar presentations
DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
Advertisements

An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
JMS messaging service  All write-only Fedora operations are published to subscribed clients  Messaging system can be durable – if client/consumer/subscriber.
A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
Repositories: Disruptive Technology or Disrupted Technology? Sandy Payette, Executive Director DORSDL Workshop at ECDL 2008 September 2008.
Depositing e-material to The National Library of Sweden.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science.
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil Sandy Payette Cornell Information Science.
Federated Digital Library Architecture and Distributed Resource Discovery Carl Lagoze CS
Web Service Architecture Part I- Overview and Models (based on W3C Working Group Note Frank.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.
Digital Library Architecture and Technology
Dienst Distributed Networked Publishing Carl Lagoze Digital Library Scientist Cornell University.
DuraSpace, Fedora and DuraCloud Thorny Staples Director, Community Strategy and Alliances ESIP Meeting, July 8, 2009.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Open Data Protocol * Han Wang 11/30/2012 *
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Information Network Overlay Architecture Adding Value to Digital Content Carl Lagoze CS 431 – May 4, 2005 Cornell University.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Digital Object Repositories (Example of FEDORA).What-Is- In-It-For-PAWS? PAWS Meeting Series, Spring 2007 Michael Yudelson.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
Metadata harvesting in regional digital libraries in PIONIER Network Cezary Mazurek, Maciej Stroiński, Marcin Werla, Jan Węglarz.
1 GRID Based Federated Digital Library K. Maly, M. Zubair, V. Chilukamarri, and P. Kothari Department of Computer Science Old Dominion University February,
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
The Mellon-Funded Fedora Project A Presentation to the European Digital Library Conference September 17, 2002 Sandy Payette and Thornton Staples.
Fedora Service Framework Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
An Introduction to Data Modeling with Fedora Thorny Staples Fedora Commons, Inc.
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
Fedora Digital Object in a Nutshell Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
3/24/051 Value-Added Surrogates for Distributed Content Establishing a Virtual Control Zone By: Sandra Payette & Carl Lagoze Presented By: Mohamed Elmiligui.
The Fedora Project March 10, 2003
Sabri Kızanlık Ural Emekçi
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Overview: Fedora Architecture and Software Features
Web Ontology Language for Service (OWL-S)
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
VI-SEEM Data Repository
Outline Pursue Interoperability: Digital Libraries
OAI and Metadata Harvesting
Fedora Metadata The Basics 9/9/2008.
NSDL Data Repository (NDR)
Session 2: Metadata and Catalogues
Metadata in Digital Preservation: Setting the Scene
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
CS 791/891 - Preservation of Digital Objects and Collections
WEB SERVICES From Chapter 19, Distributed Systems
Presentation transcript:

An Architecture for Complex Objects and their Relationships FEDORA An Architecture for Complex Objects and their Relationships Carl Lagoze, Sandy Payette, Edwin Shin, Chris Wilper Presented by Raghu Burukunte Course: CS 791 Digital Objects Preservation Instructor: Dr. Michael Nelson

Introduction A little bit of History Fedora Flexible Extensible Digital Object Repository Architecture A little bit of History Objectives Technologies for representing digital content should be able to match the richness, and complexity of well-established physical formats Allow the representation of a variety of structural organizations Should accommodate the flexible combination of different genre of materials such as text and images in multiple layouts Should allow the aggregation of content from multiple sources and the association of metadata with the elements of the aggregation And more…

Introduction Fedora is available as a open source digital object repository software that provides a rich context for information organization, representation, and management Implemented as Combination of Web Services Used in Digital Library Architecture Digital Asset Management Institutional Repository Content Management System (CMS) Scholarly publishing Preservation support

Features Supports heterogeneous data types Accommodating of new emerging types Aggregating mixed, distributed data into complex objects Specifies multiple content disseminations of digital objects Dynamic transformations of content/metadata Associating rights management schemas Provides the infrastructure for expressing relationships among objects and their components Support for versioning of objects and their components AND maintains Event history Easy integration with other applications and systems and web services Web services with open APIs

Fedora Object Model An object is accessible in multiple representations, some of them direct transcriptions of aggregated data Each digital object can disseminate one or more representations of itself Each object can be related to one or more other objects The representational perspective defines a simplified abstraction for understanding Fedora objects, where each object is modeled as a uniquely identified resource projecting one or more views, or representations. Relationship arcs connect digital objects, and representation arcs connect digital objects to their respective representations Allows for storing in RDF and can be queried URI Uniform Resource Identifiers

Representational View - Dublin Core record, identified as info:fedora/demo:11/DC - High-resolution image, identified as info:fedora/demo:11/HIGH - Thumbnail image, identified as info:fedora/demo:11/THUMB - Image with zoom/pan utility, as info:fedora/demo:11/bdef:2/ZPAN

Fedora Object Model The functional perspective reveals the object components that underlie the representational perspective and provides the basis for understanding how the Fedora object model relates to the management services exposed in the Fedora repository architecture. Digital Object – Collection of Datastreams Fedora object model is directly expressed using XML Schema language in a format known as Fedora Object XML (FOXML). FOXML defines a <digitalObject> root element that contains as set of <objectProperties>, one or more <datastream> components, and one or more<disseminator> components

Functional View

DataStream Component Format URI refines the media type definition and anticipates the emergence of a global digital format registry Control group defines whether the DataStream represents either local or remote content. Datastreams with a control group of “Managed” are internal content bytestreams Datastreams, whose control group is “External” or “Redirected” represent content that is stored outside the repository.

Functional View 2 Virtual representation is a view of an object that is produced by a service operation Use of Disseminators Disseminator must contain three pieces of information: A reference to a description of service operation(s) in an abstract syntax, A reference to a WSDL service description that defines bindings to concrete web service to run operation The identifiers of any Datastreams in the object that should be used as input to the service operation(s).

Functional View 3 Service Operation Descriptions and WSDL Service Descriptions stored in Behavior Definitions (BDef) Behavior Mechanisms (BMech)

More on Disseminators Can be used to create common representational access points for digital objects that have different underlying structure or format Example :Use TeX, Word or text as Underlying format (root) to produce PDF documents Disseminators can add utility operations to digital objects Example: Disseminator can be defined for a digital object that provides parameterized query access to the relationships defined for that object. Get a set of objects related to a particular object by a given relation using a query

Functional View 3 – Object Integrity Components PRESERVATION by Versioning Integrity Entities Relationship metadata Access control policy Audit trail

Integrity Entities Relations DataStream is used to assert object-to-object relationships such as collection/member, part/whole, equivalence, “aboutness,” and more Policy DataStream is used to express authorization policies for digital objects, both to protect the integrity of an object and to enable fine-grained access controls on an object’s content (uses XACML) Authorization module (i.e., AuthZ) implemented within the Fedora Repository Service Audit Trail is a system-controlled datastream that keeps a record of all changes to an object during its lifetime Versioning is important for applications where change tracking is essential, as well as for preservation and archiving systems that must be able to recover historical views of digital objects Datastreams and disseminators can be changed without losing their former instantiations Requests for representations of this digital object can be date-time stamped and the Fedora Repository Service will ensure that the appropriate component version is returned

Implementation Examples

Implementation Examples

Implementation Examples

Implementation Examples

Implementation Examples

Fedora Service Framework Set of loosely coupled services that interact and collaborate with each other External and Internal Services Atomic, modular services that can be flexibility aggregated into different multi-service compositions

Fedora Repository Service

Fedora Repository Service Service interface is defined using the Web Service Description Language (WSDL) with both SOAP and REST bindings Clients interact with the repository only through the defined web service interfaces Management service interface (API-M) contains read/write operations necessary to manage a repository of digital objects Access service interface (API-A) contains read-only operations for accessing digital objects Registry Search interface exposes service operations to perform a simple search of the digital object registry based on object properties Resource Index interface is the service entry point to an RDF-based index of the entire repository

APIs API -M API -A URI -> URLs

Implementation Examples Representing object-to-object relationships The organization of individual resources into larger collection units, for the purpose of management, OAI-PMH harvesting , user browsing, and other uses. • The relationships among bibliographic entities such as those described in the Functional Requirements for Bibliographic Relationships . • Semantic relationships among resources such as their relevance to state educational standards or curricula in an educational digital library like the National Science Digital Library. • Modeling more complex forms of network overlays over the resources in a content repository such as citation links, link structure, friend of a friend etc

Implementation Examples Representing object representations and properties in the Resource Index

Implementation Examples Storing and querying the relationship graph Uses TQL, RDQL and SPO

Implementation Examples

Conclusion Recently released 2.0 which incorporates support for OAI Well-defined object model and the exposure of the model through programmatic interfaces Support for Preservation through versioning and Audit trails Highly Extensible and capable of high degree of federation Thank you

Glossary FEDORA- Flexible Extensible Digital Object Repository Architecture RDF Resource Description Framework SOAP - Simple Object Access Protocol REST -Representational State Transfer OAI-PMH – Open Archives Initiative Protocol for Metadata Harvesting URI - Uniform Resource Identifiers FOXML - FEDORA Object XML TQL - Transparent Network Substrate Query Language RDQL - RDF Query Language API-A --API for Access Service API-M – API for Management Service