METS at UC Berkeley Generating METS Objects. Background Kinds of materials: –primarily imaged content & tei encoded content archival materials: manuscripts.

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
Architecture of the COREP-XBRL mapper Java based web application Uses only open source packages of Java + struts.jar for the GUI + poi.jar for the reading.
Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Developing a Generic Toolkit: Architecture and technology issues ALLC/ACH Conference.
Setting Up Information Portal Irwan Sampurna C-CONTENT 23 May 2006.
ArrayExpress Query Interface Gonzalo Garc í a Lara January, / 24.
Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
METS at UC Berkeley Part I: Generating METS Objects.
The Caught and Coloured website: its EMu origins Alex Chubaty – Collection Information Systems Craig Churchill – IT Software Development Museum Victoria.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
DSpace Devika P. Madalli DRTC, ISI Bangalore.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
Ingest and Loading DigiTool Version 3.0. Ingest and Loading 2 Ingest Agenda Ingest Overview and Introduction Ingest activity steps Transformers Task Chains.
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil Sandy Payette Cornell Information Science.
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
WMS: Democratizing Data
Interpret Application Specifications
UCLA Digital Library Technical Architecture June 13, 2002 UCLA Digital Library Presenter: Curtis Fornadley, Senior Programmer/Analyst.
An emergent system for the creation and dissemination of manuscript transcriptions An emergent system for the creation and dissemination of manuscript.
METS at UC Berkeley Part II: Viewing METS Objects via GenView.
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
Use of METS in CDL Digital Special Collections Brian Tingle.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
Module - Technical Basics
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
SobekCM’s Community Ecosystems & Socio-Technical Practices Presented by Mark V. Sullivan June 10 th, 2014 Sobek image created by Jeff Dahl and is shared.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
Dspace 1 Introduction to DSpace Mukesh Pund Scientist NISCAIR, New Delhi.
EXtensible Neuroimaging Archive Toolkit (XNAT) Washington University Neuroinformatics Group.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The DigiTool to FDA Program Lydia Motyka Florida Center for Library Automation.
E-Learning standards and meta-data: Case study ดร. น้ำทิพย์ วิภาวิน Sripatum University Library.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
Integrating a Statewide Web Gateway With Digital Collections ______________________ Eric Weig and Beth Kraemer University of Kentucky and KCVL.
Archivists’ Toolkit: Introduction March 12, 2007 Jody Lloyd Thompson.
Ch 2 – Application Assembly and Deployment COSC 617 Jeff Schmitt September 14, 2006.
Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
Sobek for Curators and Collection Managers Training Two: Submitting and Editing Resource Files and Metadata Mark Sullivan November 2013 University of Florida.
A Multi-Tiered Architecture for Distributed Data Collection and Centralized Data Delivery Stacy Kowalczyk and James Halliday April 28, 2008.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
DSpace - Digital Library Software
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Digital Data Preservation: a schema-driven model Student: Stacy Kowalczyk Co-Authors: Clare McInerney and Phil Mitchell Digital Data Preservation – the.
Lifecycle Metadata for Digital Objects The Final Curtain December 4, 2006.
Database Overview What is a database? What types of databases are there? How are databases more powerful than spreadsheets?
The ECOST Web-based platform for data providers and for data users.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
Archivists' Toolkit - All Hands Meeting Scope Both multilevel and single-level description Accommodates description of collections, series, sub-series,
7th Annual Hong Kong Innovative Users Group Meeting
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Building Search Systems for Digital Library Collections
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
Microsoft Office Illustrated
VI-SEEM Data Repository
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Introduction to DSpace
The Re3gistry software and the INSPIRE Registry
Sobek for Curators and Collection Managers
Metadata The metadata contains
The Fedora Project April 28-29, 2003 CNI, Washington DC
Introduction to METS (Metadata Encoding and Transmission Standard)
Presentation transcript:

METS at UC Berkeley Generating METS Objects

Background Kinds of materials: –primarily imaged content & tei encoded content archival materials: manuscripts and pictorial collections oral histories Kinds of Metadata –Structural metadata: physical structure –Descriptive metadata –BasicTechnical metadata about digital files and how they were produced

Tools For Producing METS Objects GenDB –Gathers structural, descriptive and technical metadata GenX –Generates METS objects from GenDB

GenDB Consists of: –Relational database (Currently SQL Server) –Locally developed software for gathering metadata and facilitating digital processing

Div 1 GenDB Database Structure Structural Metadata Div 2 Div 3 Object 1 Object 2 (root) (parent = div 1) Div 1 Div 2 Div 3 (root) (parent = div 2) (parent = div 1) Div 4 (parent = div 2) Object 1 Div 1 Div 2 Div 3 Object 2 Div 1 Div 2 Div 3 Div 4 … Structural Md Table

Div 1 GenDB Database Structure Descriptive Metadata Div 2 Div 3 Object 1 Object 2 Div 1 Div 2 Div 3 Div 4 Core Desc Md Name 1 Name 2 Name 3 Note 1 Note 2 Note 3 Name Table Note Tables Structural Md Table

Div 1 GenDB Database Structure Content File/Technical Md Div 2 Div 3 Object 1 Master Image Table Derivative Image Table Structural Md Table Drv 1 Drv 2 Drv 3 Mstr 1 Mstr 2 Technical Md Drv 4 Technical Md

Populating the Database Tables Web interface: manual input of structural and descriptive metadata Digitization Management modules –Generate work orders to guide digitization process –Import content file information and technical metadata coming out of digitization process Batch loader: batch input based on TEI encodings, legacy metadata

Web Interface: WebGenDB Web Interfac e SQL Server Database Java Servlet Java Server XML Config Files rmi jdbc

Digitization Management Modules Web Interfac e Java Servlet Java Server SQL Server Database Imaging/ Transcription WorkOrders Vendor Technical MD Spreadsheets

Batch Loader Web Interfac e SQL Server Database Java Servlet Java Server Java Batch Loader XML Batch Load File TEI Docs XSLT

WebGenDB The concepts that drove the design Shielding user from METS complexity Highly configurable Unicode support Access driven by login privileges Use of Open Source software and components Distributed approach

XML Configuration Files Three levels –Common to all projects elements –Common to all screens in a project elements –Specific to a screen in a project Define fields common to all projects Define fields used in specific project Define screens by project & object type

AlProjects.xml Proj1.xml Proj2.xml ObjectType1.xml ObjectType2.xml ObjectType1.xml ObjectType2.xml Relation among XML files

workorder /data/_w/GenDB/WEB-INF/classes/edu/berkeley/library/propertyFiles/CalCultureWorkOrderScreensFile.xml Image checkbox Image 1 Text checkbox Text 1 Title text Title 60 Project XML file example

Software used MSSQL running on NT Tomcat implementing servlets 2.3 Jsdk 1.4 Xalan 2.4 Xerces FOP JDOM beta 8 Opta 2000

Relationship of GenDB to METS Metadata not directly stored in METS, MODS or MIX schema formats. –Much of the database structure was developed before these standards emerged –Database structure and content adjusted to be compatible with all these formats

GenX: From GenDB to METS Allows Digital Publishing Group staff to select the objects in the GenDB database that are ready for export and to export them as METS objects.

GenX Architecture App Interfac e GenDB Java Application METS XML Repository JDBC

GenX Output METS output corresponding to version 1.3 Descriptive metadata exported to METS descMD in MODS 2.0 format Technical Metadata exported to METS techMD in MIX format Planned: –Text technical md to METS descMD in NYU TextMD –Rights to METS rightsMD in ODRL subset

Links GenDB Web Interface Demo – –login: demo –password: demo Developers: