Building a Fedora Architecture to Support Diverse Collections Jon Dunn Ryan Scherle Digital Library Program Indiana University.


Indiana University Digital Library Program
- Joint venture of Libraries and University Information Technology Services (UITS) formed in 1997
- Bloomington-based; supporting 8 campuses
- Engaged in digital collection building, infrastructure design/management, and research activities
- Supporting library, archive, museum, academic department, and faculty-based digital collections projects

Digital Library Content Types at IU
- Books
- Manuscripts
- Photographs
- Art images
- Music audio
- Video
- Sheet music
- Musical score images
- Music notation files
- …and more

Current DLP Technical Environment: Access Systems
- DLXS (University of Michigan)
  - Text
  - Finding Aids
  - Bibliographic information
- IBM Content Manager
- Locally-developed systems
  - Cushman Photograph Collection
  - DIDO: Digital Images Delivered Online
  - Variations2
  - Page turners (sheet music, METS Navigator)

Current DLP Technical Environment: Storage
- DLP server disk storage
- Tivoli Storage Manager
- IU Massive Data Storage System (MDSS)
  - HPSS software
  - 1.6 petabytes of StorageTek and IBM automated tape
  - Access via FTP, PFTP, HSI

Motivations for a repository
- Centralize access and preservation functions for IU’s digital collections
- Reduce DLP staff time and attention needed to create and maintain collections
- Enable librarians, curators, archivists to digitize new collections
- Enable digital preservation

DL Infrastructure Project
- Proposal funded by University Information Technology Services to reengineer digital library infrastructure around Fedora
- Builds on experience with Fedora in context of EVIA Digital Archive (ethnomusicology video)

Building services and tools around Fedora
- Searching/browsing of metadata and content
- End-user UI for display/navigation of metadata and content
- Cataloging and ingest tools
- Preservation services

IU Content Models

Defining a content model
- Focus on what you can do with an object
- Behaviors are primary
- Behaviors are the way all external processes will interact with the object
- Keep datastreams “private”

Diversity
- Multiple media types
- Multiple brands
- Multiple tools

Standard disseminators
- All objects subscribe to the default disseminator
- Most objects subscribe to the metadata disseminator
- Most objects subscribe to type-specific disseminators
- Default disseminator: getDefaultView, getLabel, getFullView, getPreview, getAssetDefinition
- Metadata disseminator: getMetadata(type)
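The subscription idea on this slide can be sketched in a few lines of Python. This is only an illustration, not Fedora code: the method names come from the slide, while `RepositoryObject`, `DISSEMINATORS`, and `supports` are hypothetical names introduced here.

```python
from dataclasses import dataclass, field

# Method names are taken from the slide; the table itself is a hypothetical
# stand-in for Fedora's behavior definitions.
DISSEMINATORS = {
    "default": ["getDefaultView", "getLabel", "getFullView",
                "getPreview", "getAssetDefinition"],
    "metadata": ["getMetadata"],
}


@dataclass
class RepositoryObject:
    """Hypothetical model of an object and the disseminators it subscribes to."""
    pid: str
    # Every object subscribes to the default disseminator.
    subscriptions: set = field(default_factory=lambda: {"default"})

    def supports(self, method: str) -> bool:
        """True if any subscribed disseminator exposes the method."""
        return any(method in DISSEMINATORS[name] for name in self.subscriptions)


image = RepositoryObject("demo:1", {"default", "metadata"})
bare = RepositoryObject("demo:2")
print(image.supports("getMetadata"), bare.supports("getMetadata"))
```

External tools would ask only whether an object supports a behavior, and never inspect its datastreams directly.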

Simple images
- Each image is a single Fedora object
- Images are available in a variety of sizes
- Each image belongs to one or more collections
[Diagram: one Collection object (Default, Metadata, and Collection disseminators) and two Image objects (Default, Metadata, and Image disseminators each)]

[Diagram: one Collection object (Default, Metadata, and Collection disseminators), two Book objects (Default, Metadata, and Paged disseminators each), and four Page objects (Default, Metadata, and Image disseminators each)]

Object-level disseminators
- Image: getThumbnail, getScreenSize, getLarge, getMaster
- Video: getSmilFile, playSmilFile, getStructMap, getActionObject, getObjectID
- PagedImage: getNumChildren, getChildren
- PagedText: getSummary, getChunkList, getChunk(label), getRawText, getFriendlyText, getTextPage(num)
- Printable: getPrintableVersion

Collection-level disseminators
- Collection: getSize, listMembers(start, max)
- CollectionRender: renderItemPreview(pid), renderItemFullView(pid)
- CollectionPagedImage: viewPageTurner(pid, pagenum)
- CollectionPagedText: viewText(pid, pagenum, style), viewChunk(pid, label, style), viewPage(pid, num, style)
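A call such as listMembers(start, max) suggests that clients page through a collection. The sketch below shows one way that could work. It assumes a zero-based `start` and an in-memory list of PIDs; the slide specifies neither, and the function names are hypothetical.

```python
def list_members(members, start, max_items):
    """Return at most max_items PIDs, beginning at zero-based index start.

    The zero-based start is an assumption; the slide does not say.
    """
    if start < 0 or max_items < 0:
        raise ValueError("start and max_items must be non-negative")
    return members[start:start + max_items]


def iter_pages(members, page_size):
    """Yield successive pages by calling list_members repeatedly."""
    for start in range(0, len(members), page_size):
        yield list_members(members, start, page_size)


pids = ["demo:1", "demo:2", "demo:3"]
pages = list(iter_pages(pids, 2))
print(pages)
```

Paging keeps a browse interface responsive for large collections, since each request returns a bounded number of members.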

Image Demos
- Sample Image
- Frank M. Hohenberger Collection
- U.S. Steel Collection

But what about the metadata?
- Different content types have different types of metadata
  - MARC for general library holdings
  - MODS for collections we catalog
  - TEI for textual collections
  - EAD for archival collections
  - Combinations: Some items need METS for structure, TEI for text, MODS for description, etc.

The solution: METS
- No, not the Fedora METS
- METS within a datastream, and everything else within the METS
- A standard way of dealing with DC, MODS, technical, structural, provenance, process, etc.
- Sample Image
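As a rough illustration of "everything else within the METS", the following Python sketch wraps a tiny MODS record inside a METS descriptive metadata section. The element layout (dmdSec, mdWrap, xmlData) follows the METS schema. The `build_mets` helper, the `DMD1` identifier, and the sample title are assumptions made for the example, not taken from the IU content models.

```python
import xml.etree.ElementTree as ET

METS = "http://www.loc.gov/METS/"
MODS = "http://www.loc.gov/mods/v3"
ET.register_namespace("mets", METS)
ET.register_namespace("mods", MODS)


def build_mets(title):
    """Build a METS document whose dmdSec wraps a one-field MODS record."""
    mets = ET.Element(f"{{{METS}}}mets")
    dmd = ET.SubElement(mets, f"{{{METS}}}dmdSec", ID="DMD1")  # hypothetical ID
    wrap = ET.SubElement(dmd, f"{{{METS}}}mdWrap", MDTYPE="MODS")
    xml_data = ET.SubElement(wrap, f"{{{METS}}}xmlData")
    mods = ET.SubElement(xml_data, f"{{{MODS}}}mods")
    title_info = ET.SubElement(mods, f"{{{MODS}}}titleInfo")
    ET.SubElement(title_info, f"{{{MODS}}}title").text = title
    return mets


root = build_mets("Sample image")
print(ET.tostring(root, encoding="unicode"))
```

The resulting XML would be stored as a single datastream, so technical, structural, and other metadata sections can be added later without changing the datastream layout.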

Implementing the disseminators
- Simple Image: DC, THUMBNAIL, SCREEN, LARGE, METADATA, RELS-EXT
- Paged Object: DC, METADATA, RELS-EXT
- Collection: DC, METADATA, INGEST_CONFIG
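The datastream lists on this slide lend themselves to a simple conformance check. The Python sketch below uses the datastream IDs from the slide; the model keys (`simple_image`, `paged_object`, `collection`) and the `missing_datastreams` helper are hypothetical names chosen for illustration.

```python
# Datastream IDs come from the slide; the dictionary keys are hypothetical.
REQUIRED_DATASTREAMS = {
    "simple_image": {"DC", "THUMBNAIL", "SCREEN", "LARGE", "METADATA", "RELS-EXT"},
    "paged_object": {"DC", "METADATA", "RELS-EXT"},
    "collection": {"DC", "METADATA", "INGEST_CONFIG"},
}


def missing_datastreams(model, present):
    """Return the required datastream IDs that an object lacks, sorted."""
    return sorted(REQUIRED_DATASTREAMS[model] - set(present))


gaps = missing_datastreams("paged_object", ["DC"])
print(gaps)
```

A check like this could run at ingest time so that objects which cannot back their disseminators are caught before they reach the repository.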

Want more info?
- More detailed content model pages are available on our project wiki.

IU Fedora Tools

Ingest Tool
- The Ingest Tool transforms raw metadata and media files into Fedora objects that conform to our content models.
[Diagram labels: MODS, EAD, JPG, PDF; Ingest Tool; Fedora; Datastreams; FOXML]

METS Navigator
- METS Navigator is a METS-based system for displaying and navigating multi-image digital objects.
- It was built to be extendible and configurable.
- Web pages with navigational structure are built from metadata in the repository.
- Available from

Demos
- Default METS Navigator Collection
- Jane Johnson Collection

Using METS Navigator with Fedora
- METS document must meet minimal format requirements
  - Logical and physical structMap
  - Files marked with USE and GROUPID attributes
  - Files are URLs that point to Fedora
- METS Navigator may be called from a disseminator, but it is better if called separately.
- Full integration instructions
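The minimal format requirements can be expressed as a small pre-flight check. This Python sketch is not part of METS Navigator. It assumes that USE and GROUPID appear on each `file` element (METS also allows USE on `fileGrp`) and that structMap TYPE values are "logical" and "physical" in any letter case. The `check_mets` name is hypothetical.

```python
import xml.etree.ElementTree as ET

NS = {"mets": "http://www.loc.gov/METS/"}


def check_mets(xml_text):
    """Return a list of problems found; an empty list means the checks passed."""
    root = ET.fromstring(xml_text)
    problems = []
    # Collect the TYPE of each top-level structMap, ignoring letter case.
    types = {sm.get("TYPE", "").lower() for sm in root.findall("mets:structMap", NS)}
    for needed in ("logical", "physical"):
        if needed not in types:
            problems.append(f"missing {needed} structMap")
    # Assumption: both attributes are expected on the file element itself.
    for file_el in root.findall(".//mets:file", NS):
        for attr in ("USE", "GROUPID"):
            if file_el.get(attr) is None:
                problems.append(f"file {file_el.get('ID')} lacks {attr}")
    return problems


sample = (
    '<mets xmlns="http://www.loc.gov/METS/">'
    '<fileSec><fileGrp><file ID="F1" USE="thumbnail"/></fileGrp></fileSec>'
    '<structMap TYPE="physical"><div/></structMap>'
    '</mets>'
)
report = check_mets(sample)
print(report)
```

The check does not verify that file locations resolve to Fedora URLs; that would need repository-specific knowledge.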

Cataloging tools
- No good solutions for non-MARC descriptive/structural metadata creation
  - Some exist for specific domains: e.g. art image cataloging
- Need content- or collection-appropriate interfaces
- Catalog directly into Fedora or into database?
  - Data synchronization issues
- Common framework or separate tools?
- Starting to investigate

Delivery tools
- Right now: collection-specific web sites
- Moving towards: generic applications appropriate to content models
  - Examples: documentary photos, art images, books, sheet music…
- May integrate components from other places (e.g. Virginia collector tool)
- Exposing metadata to external services via OAI-PMH, SRU (for Metasearch)
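For readers unfamiliar with OAI-PMH, a harvest is an HTTP request with a `verb` and a `metadataPrefix`. The Python sketch below only builds such a request URL. The base URL is a placeholder, not IU's actual endpoint, and `list_records_url` is a hypothetical helper.

```python
from urllib.parse import urlencode

# Placeholder endpoint; the real base URL is not given in the slides.
BASE_URL = "https://repository.example.edu/oai"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        params["set"] = set_spec  # optional selective harvesting by set
    return f"{base_url}?{urlencode(params)}"


url = list_records_url(BASE_URL)
print(url)
```

`oai_dc` (unqualified Dublin Core) is the one metadata format every OAI-PMH repository must support, which is why it is the default here.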

Other tools and services via Fedora Service Framework
- Search tool
  - Expanded, with thesaurus support
- Preservation integrity services

Infrastructure Project Challenges
- Time and resources vs. scope of work
- Sorting out old collections – digital archeology
- Implementing new infrastructure while continuing to do new projects
- Maintaining current functionality

Infrastructure Project Challenges
- Metadata entry / cataloging tool design
- Integration with MDSS/HPSS - classes of storage
- Art images
- Searching system
- Preservation system

Thank You!
- Contact info:
  - Jon Dunn
  - Ryan Scherle
- Infrastructure project wiki: