

WP 3: Standardisation of shared metadata
Mode of operation
  – All partners are involved
  – Building on practice outside the project
Achievements of Year 1
  – The ENRICH conceptual model
  – Schema, documentation, test cases
Plans for Year 2
  – Tools development
  – Data migration

WP3: Modus Operandi
No re-invention of the wheel: ENRICH is built upon existing standardisation efforts
  – MASTER ( ); TEI P5 (2005-)
Reflect actual practice: ENRICH is driven by actual user needs
  – Survey of different applications of MASTER(+)
  – Cross-partner synthesis and discussion
Support an integrated system: all aspects of a digital edition are described by the ENRICH schema

WP3: What we did 1
Reviewed differences between TEI P5 and MASTER+
  – A theoretical exercise, but an essential one
  – All differences could be resolved, either by constraining Manuscriptorium practice or by adapting P5 proposals
Reviewed actual practice in a wide sample (1000+) of existing manuscript description records in many formats
  – Ongoing work, leading to the development of migration tools
  – Identified a common core of practice, much smaller than what the existing TEI schema allows

WP3: What we did 2
TEI P5 is designed to support a huge range of document types and encoding practices
For ENRICH, we defined a much more constrained subset, reflecting actual practice
  – e.g. constraining value lists, reducing structural choices, reducing scope for redundancy

Some example changes to elements
MASTER+                  | ENRICH
required everywhere      | required only if multiple
not available in text    | available everywhere
permitted                | deprecated
used ambiguously         | distinct

WP3: What we did 3
The ENRICH schema is formally defined using the TEI ODD system
This XML vocabulary allows us to generate automatically:
  – full multilingual documentation
  – formal schemata in DTD, RelaxNG or W3C Schema
Its TEI-conformance makes it accessible to many other projects
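
The ODD approach described on this slide can be illustrated with a small sketch. It assumes a hypothetical ODD fragment that restricts the form attribute of objectDesc to a closed value list; the fragment and its values are illustrative and are not taken from the ENRICH schema. The Python code only reads the fragment and checks a value against it. It stands in for, and does not reproduce, the TEI tooling that generates documentation and schemata from an ODD.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
NS = {"tei": TEI_NS}

# Hypothetical ODD fragment (an assumption for illustration, not the ENRICH ODD):
# it changes objectDesc so that @form accepts only a closed list of values.
ODD_FRAGMENT = f"""
<elementSpec xmlns="{TEI_NS}" ident="objectDesc" mode="change">
  <attList>
    <attDef ident="form" mode="change">
      <valList type="closed" mode="replace">
        <valItem ident="codex"/>
        <valItem ident="leaf"/>
        <valItem ident="scroll"/>
        <valItem ident="other"/>
      </valList>
    </attDef>
  </attList>
</elementSpec>
"""


def closed_values(odd_xml):
    """Map (element, attribute) to the values allowed by closed value lists."""
    spec = ET.fromstring(odd_xml)
    allowed = {}
    for att in spec.findall("tei:attList/tei:attDef", NS):
        val_list = att.find("tei:valList[@type='closed']", NS)
        if val_list is not None:
            items = val_list.findall("tei:valItem", NS)
            key = (spec.get("ident"), att.get("ident"))
            allowed[key] = {item.get("ident") for item in items}
    return allowed


def is_allowed(allowed, element, attribute, value):
    """True if the attribute is unconstrained or the value is in its closed list."""
    values = allowed.get((element, attribute))
    return values is None or value in values


if __name__ == "__main__":
    rules = closed_values(ODD_FRAGMENT)
    print(is_allowed(rules, "objectDesc", "form", "codex"))
    print(is_allowed(rules, "objectDesc", "form", "pamphlet"))
```

Keeping the constraint in the ODD source, as the slide describes, means the same declaration can drive both the generated schema and the reference documentation.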

WP3: Scope of the schema
The ENRICH schema provides a formal way of recording information about a manuscript resource, expressed in XML
  – Such records can be managed and stored independently of the resources they describe
It also provides a formal way of encoding in XML:
  – A detailed transcription of the resource
  – Information about images (etc.) of the resource
  – Information about real-world entities associated with the resource, i.e. people, places and events
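
As a rough illustration of a manuscript description record held separately from the resource it describes, the sketch below parses a minimal TEI msDesc element and pulls out its identifying information. The record content (place, library, shelfmark, title) is invented for the example, and the summarise helper is a hypothetical name. A real ENRICH record would carry far more detail and would have to validate against the ENRICH schema, which this sketch does not check.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
NS = {"tei": TEI_NS}

# Invented example record: every value below is a placeholder, not real data.
RECORD = f"""
<msDesc xmlns="{TEI_NS}" xml:id="example-ms-1" xml:lang="en">
  <msIdentifier>
    <country>Example Country</country>
    <settlement>Example City</settlement>
    <repository>Example Library</repository>
    <idno>MS 1</idno>
  </msIdentifier>
  <msContents>
    <msItem>
      <title>Example title</title>
    </msItem>
  </msContents>
</msDesc>
"""


def summarise(record_xml):
    """Return the identifying fields and item titles of one msDesc record."""
    ms_desc = ET.fromstring(record_xml)
    ident = ms_desc.find("tei:msIdentifier", NS)
    return {
        "settlement": ident.findtext("tei:settlement", namespaces=NS),
        "repository": ident.findtext("tei:repository", namespaces=NS),
        "idno": ident.findtext("tei:idno", namespaces=NS),
        "titles": [t.text for t in ms_desc.findall(".//tei:msItem/tei:title", NS)],
    }


if __name__ == "__main__":
    print(summarise(RECORD))
```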

WP3: Challenges and how we overcame them
Synchronising ENRICH requirements with TEI P5
  – We worked closely with the TEI Council, which was revising the manuscript module at the same time
Reaching consensus among partners
  – We worked closely with AIP to ensure that Manuscriptorium was able to support the full complexity of TEI P5
  – We were able to use the TEI I18N features to produce reference documentation in French, Italian and Spanish as well as English (other languages will follow)

WP3: Outreach and training
We have tested the ideas behind the ENRICH schema in many different training contexts
We have produced a suite of training materials covering
  – Basic ideas of XML markup
  – TEI modules for metadata, basic document structure, manuscript description and transcription, persons and places, facsimiles, nonstandard writing systems...

WP3: Conversion tools
We have developed a suite of XSLT stylesheets and associated workflows to convert between existing metadata formats and ENRICH
So far we have worked with
  – MASTER (+)
  – EAD
  – MARC
In the next phase of the project we plan to develop the ‘ENRICH Garage’ concept...
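
To give a feel for what a format conversion involves, here is a minimal sketch of one direction, MARC to a TEI-style msDesc. It is written in Python rather than XSLT and is not the project's stylesheet. The field mapping (852$a to repository, 852$j to idno, 245$a to an item title) and the sample record are assumptions chosen for illustration only.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"
TEI_NS = "http://www.tei-c.org/ns/1.0"

# Invented MARCXML record with placeholder values.
MARC_RECORD = f"""
<record xmlns="{MARC_NS}">
  <datafield tag="245" ind1="0" ind2="0">
    <subfield code="a">Example title</subfield>
  </datafield>
  <datafield tag="852" ind1=" " ind2=" ">
    <subfield code="a">Example Library</subfield>
    <subfield code="j">MS 1</subfield>
  </datafield>
</record>
"""


def tei(name):
    """Qualified tag name in the TEI namespace, in ElementTree's {uri}name form."""
    return f"{{{TEI_NS}}}{name}"


def subfield(record, tag, code):
    """Text of the first matching MARC subfield, or an empty string if absent."""
    path = f"marc:datafield[@tag='{tag}']/marc:subfield[@code='{code}']"
    return record.findtext(path, default="", namespaces={"marc": MARC_NS})


def marc_to_msdesc(marc_xml):
    """Build an msDesc element from one MARCXML record (hypothetical mapping)."""
    record = ET.fromstring(marc_xml)
    ms_desc = ET.Element(tei("msDesc"))
    ident = ET.SubElement(ms_desc, tei("msIdentifier"))
    ET.SubElement(ident, tei("repository")).text = subfield(record, "852", "a")
    ET.SubElement(ident, tei("idno")).text = subfield(record, "852", "j")
    contents = ET.SubElement(ms_desc, tei("msContents"))
    item = ET.SubElement(contents, tei("msItem"))
    ET.SubElement(item, tei("title")).text = subfield(record, "245", "a")
    return ms_desc


if __name__ == "__main__":
    print(ET.tostring(marc_to_msdesc(MARC_RECORD), encoding="unicode"))
```

A production conversion would need to handle repeated fields, missing data and the richer structures of MASTER and EAD records, which is where stylesheets and workflows such as those named on the slide come in.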