Fedora TM and Repository Implementation at UVa Leslie Johnston, UVa Library DASER Summit November 22, 2003.

Slides:



Advertisements
Similar presentations
Richard Jones, Systems Developer Technical Issues for Repository Software Theses Alive! Edinburgh University Library SHERPA Nottingham.
Advertisements

Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Home-Grown Digital Library System Built Upon Open Source XML Technologies and Metadata Standards David Lacy Villanova University
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Theo Andrew, Edinburgh University Library Choosing Suitable Open-Source Repository Software Choosing Suitable Open Source Repository Software Theo Andrew.
EThOSnet Repositories and Web Services Workshop 2 nd June2009 Richard Green
DuraSpace, Fedora and DuraCloud Triangle Research Libraries Network September, 2009.
DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Vital Implementation Update Vital Implementation Update 11 th January 2006 Paul Bevan – Glen Robson –
Goals for RUcore o Flexible, extensible cyberinfrastructure for Rutgers University o Integrating platform for legacy information systems o Support preservation.
A. Grigorov, A. Georgiev, M. Petrov, S. Varbanov, K. Stefanov Building a Knowledge Repository for Life-long Competence Development.
The Fedora Project JA-SIG Winter Conference December 9, 2003 Tim Sigmon University of Virginia.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science.
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil Sandy Payette Cornell Information Science.
All Things to All People Combining Resources to Build an Integrated Digital Repository Preservation and Access for Electronic College and University Records.
WMS: Democratizing Data
The Fedora Project Update as of January 2004 Ithaca, NY January 29, 2004 Sandy Payette Cornell Information Science.
Demonstration of repositories Fedora (Flexible Extensible Digital Object Repository Architecture) Marie Lagerwall MIDESS Partners Meeting February 9, 2007.
The Mellon-Funded Fedora Project Technical Specifications Review August 26, 2002 Sandy Payette Cornell Information Science.
The Fedora Project DLF Forum Albuquerque, NM November 17, 2003 Sandy Payette Cornell Information Science.
FEDORA Project McGill University May Bill Parod Academic Technologies Northwestern University
A Digital Preservation Repository for Duke University Libraries Jim Coble Digital Repository Developer Open Repositories 2013.
Digital Object: A Virtual Online Storage Solution 598C Course Project Huajing Li.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
Architecting an Extensible Digital Repository Anoop Kumar, Ranjani Saigal,Rob Chavez, Nikolai Schwertner Tufts University, Medford, MA.
Digital Library Architecture and Technology
DuraSpace, Fedora and DuraCloud Thorny Staples Director, Community Strategy and Alliances ESIP Meeting, July 8, 2009.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
Fundamentals of XML Management Greg Alexopoulos Systems Engineer Documentum.
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
Sherry Lake Candidate for Metadata Specialist for User Projects.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
UVa's Digital Library CSG - September 2005 Slides courtesy of: Leslie Johnston Director, Digital Access Services, UVA Library Tim Sigmon University of.
Library Repositories and the Documentation of Rights Leslie Johnston, University of Virginia Library NISO Workshop on Rights Expression May 19, 2005.
The Mellon-Funded Fedora Project A Briefing for the Cornell University Library January 24, 2002 Sandy Payette Thorny Staples Ross Wayland.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
Web Services and Fedora EDUCAUSE Mid-Atlantic Regional Conference January 14, 2003 Tim Sigmon University of Virginia.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
DSpace System Architecture 11 July 2002 DSpace System Architecture.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
The Mellon-Funded Fedora Project A Presentation to the European Digital Library Conference September 17, 2002 Sandy Payette and Thornton Staples.
Fedora Service Framework Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
An Introduction to Data Modeling with Fedora Thorny Staples Fedora Commons, Inc.
Fedora Digital Object in a Nutshell Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Building Foundations: Fedora, Fez, and the ADR prepared by Jessica Branco Colati ADR Project Director, Colorado Alliance of Research Libraries
7th Annual Hong Kong Innovative Users Group Meeting
The Fedora Project March 10, 2003
? What is Institutional Repository for Rutgers University
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Introduction, Features & Technology
Overview: Fedora Architecture and Software Features
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
Introduction to DSpace
The Re3gistry software and the INSPIRE Registry
Fedora Metadata The Basics 9/9/2008.
Metadata to fit your needs... How much is too much?
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
The Fedora Project April 28-29, 2003 CNI, Washington DC
Presentation transcript:

Fedora TM and Repository Implementation at UVa Leslie Johnston, UVa Library DASER Summit November 22, 2003

Fedora TM History Research (1997-present) : –DARPA and NSF-funded research project at Cornell University Digital Library Research Group. –Reference implementation developed at Cornell. First Application ( ) : –University of Virginia Library Digital Library Research and Development prototype. –Scale/stress testing for 10,000,000 objects. Open Source Software (2002-present): –Andrew W. Mellon Foundation granted Virginia and Cornell $1 million to develop a production-quality Fedora system. –Fedora 1.0 released in May 2003.

What is Fedora TM ? Fedora is a Digital Asset Management architecture, upon which many types of Digital Library systems might be built. Fedora is based on object models that represent data objects (units of content) or collections of data objects. The objects contain linkages between datastreams (internally managed or external media files), metadata (inline or external), and behaviors that are themselves code objects and link to disseminators (processes, mechanisms, and external software). A data object subscribes to a pair of behavior objects Object models can be thought of as containers that give a useful shape to information poured into them; if the information fits the container, it can immediately be used in predefined ways.

Fedora TM Data Object Components Datastreams – represent content and metadata. PID – persistent identifier, unique to the Repository. System Metadata – metadata that the Repository keeps. Disseminators – bindings to objects that can deliver software processes that can be used with the datastreams.

Fedora TM Data Objects Persistent ID (PID) Default Disseminator SystemMetadata Datastream (item) Extension PID = uva-lib:100 Default Disseminator SystemMetadata Image (mrsid) DC (xml) Thumbnail (jpeg) Image Disseminator Digital object identifier Service view: methods for disseminating content Internal view: key metadata necessary to manage the object Content view: Set of data and metadata items

Persistent ID (PID) Behavior Definition Metadata SystemMetadata Datastreams Data Object Persistent ID (PID) Service Binding Metadata (WSDL) SystemMetadata Datastreams Web Service behavior contract behavior subscription data contract Persistent ID (PID) Disseminators Datastreams System Metadata Behavior Mechanism Object Behavior Definition Object

Fedora TM Service Interfaces Management Service (API-M) –Ingest - XML-encoded object submission –Create - interactive object creation via API requests –Maintain - interactive object modification via API requests –Validate – application of integrity rules to objects –Identify - generate unique object identifiers –Security - authentication and access control –Preserve - automatic content versioning and audit trail –Export - XML-encoded object formats Access Service (API-A and API-A-LITE) –Search - search repository for objects –Object Reflection - what disseminations can the object provide? –Object Dissemination - request a view of the objects content OAI-PMH Provider Service –OAI-DC records

Fedora TM Distribution Package Open Source (Mozilla Public License) 100% Java (Sun Java J2SDK1.4) Supporting Technologies –Apache Tomcat 4.1 and Apache Axis (SOAP) –Xerces for XML parsing and validation –Saxon 6.5 for XSLT transformation –Schematron 1.5 for validation –MySQL and Mckoi relational database –Oracle 9i support Deployment Platforms –Windows 2000, NT, XP –Solaris –Linux

What Fedora TM Is Not Fedora is not finished – the development process is only half way complete. –Version 1.2 releases on December 10, –The scheduled date for implementation of all features outlined in the grant-funded project is early Fedora is the underlying architecture for a digital repository, not a complete management, indexing, discovery, and delivery application. Fedora by itself is not the UVa Library's Digital Library system - Fedora is the "plumbing" for our first phase production Central Digital Repository.

Process for Repository Development Fedora developers met with content and format specialists, application developers, and user service librarians to understand what media files we have and how our users expect to find them and use them. Priorities were set for phased development and content migration by format type: –First Phase: Electronic Texts, EAD, and Images –Second Phase: Datasets and GIS –Third Phase: Digital Audio and Video

Process for Repository Development Specifications were set for: –Datastreams (formats, variation in deliverables [EAD vs. TEI vs. Ebooks, page images vs. documentary images]) –Metadata –Discovery functionality and interface (simple and advanced searching, metadata vs. full-text searching, presentation of results sets, etc.) –Delivery (must support static and on-the-fly file delivery, and varied end user download and printing requirements)

Repository Prototype A prototype discovery interface was released for review by Library staff during summer Almost 150 comments on functionality, user interface, and proposed additional features were collected. The comments were collated into categories which were prioritized by Library department heads, user services staff, and developers for implementation into a first release, scheduled for early 2004.

Proposed Searching Services

Issues - Standards Collate, standardize, and document in-house production standards. –Slide and photograph scanning; Book page scanning; and Full-text markup Develop UVa DescMeta XML element set, and document minimum metadata elements and best use practices. Develop the General Descriptive Modeling Scheme (GDMS) XML encoding standard to describe complex, structured collections. Recommend the in-house standards for faculty with digitization projects through our consulting services. –Born digital faculty projects are selected for collection by the Library, assuring a smoother collection process.

Issues – Authoring Tools User Collection Tool –Web-based database for the organization and annotation of personal media collections. GDMS Tool –XML authoring tool to create documents using a locally defined XML encoding standard to represent structured collections of images and metadata. A Data Workbench is planned to create relationships between objects and prepare files for ingest into the Repository. A Scholarly Object Workbench is planned for faculty to use in creating their research and instructional resources in formats that can be more easily collected by the Library.

Upcoming – Modeling Virginia Collaboration between Systems Engineering, Environmental Sciences, and the Library. Weather datasets, traffic datasets, and the 2000 census. –Proof-of-concept – Hampton Roads area. –Applying for funding for the entirety of Virginia. Will drive the development of object models and disseminators for discovery and download of variables across datasets with DDI codebooks.

Upcoming – Aggregation Objects On-the-fly collection objects where the content data stream contains rules, formatted as XQuery or XPath statements, rather than explicit collection relationships. Child objects of the collection are assembled at dissemination time. Disseminators can include such functions as building a full-text index, rendering a search page, etc.

Upcoming – Fedora TM 1.2 Open Fedora APIs –Repository as web services (REST and SOAP bindings); WSDL interface defs Flexible Digital Object Model –Content View: objects as bundle of items (content and metadata) –Service View: objects as a set of service methods (behaviors) –Extensible functionality by associating services with objects Repository System –Core Services: Management, Access/Search, OAI-PMH –Storage: XML object store; relational db object cache; relational db object registry –Mediation - auto-dispatching to distributed web services for content transformation –Auto-Indexing – system metadata and DC record of each object –HTTP Basic Authentication and Access Control –Built-in disseminator services: XSLT x-form, image manipulation, xml-to-PDF Content Versioning –Automatic version control (saves version of content/metadata when modified) –Enables date-time stamped API requests (see object as it looked at a point in time) Clients –Fedora Administrator: GUI client to create/maintain objects –Default Web browser interface: search; access objects via default disseminator –Command line utilities (batch load, ingest, purge, others) –Migration Utility – mass export/ingest

Fedora TM December 2003-January 2005 Fedora Object XML (FOXML) –Internal storage format; direct expression of Fedora object model –Better support for relationships (kinship metadata) –Better support for audit trail (event history) –Format identifiers for dynamic service binding Shibboleth authentication Policy Enforcement –XACML expression language –Fedora policy enforcement module Web interface for easy content submission Batch object modification utility Administrative Reporting Object Event History (ABC/RDF disseminations) Better support for collections New ingest and export formats (METS1.3, DIDL)

Contact Information