PREMIS Tools and Services

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
Alexandria Digital Library Project Integration of Knowledge Organization Systems into Digital Library Architectures Linda Hill, Olha Buchel, Greg Janée.
Bibliographic Framework Initiative Approach for MARC Data as Linked Data Sally McCallum Library of Congress.
EAD Revision: Technical Considerations Terry Catapano EAD Roundtable Meeting
6. Applying metadata standards: Controlled vocabularies and quality issues Metadata Standards and Applications Workshop.
Corey A Harper DC2006 October 4, 2006 Authority Control for the Semantic Web Encoding Library of Congress Subject Headings (LCSH) in SKOS.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Presentation 7 part 2: SOAP & WSDL. Ingeniørhøjskolen i Århus Slide 2 Outline Building blocks in Web Services SOA SOAP WSDL (UDDI)
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
A Registry for controlled vocabularies at the Library of Congress
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
The NSDL Registry Jon Phipps Stuart Sutton Diane Hillmann Ryan Laundry Cornell U. U. of Washington.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
1/ 27 The Agriculture Ontology Service Initiative APAN Conference 20 July 2006 Singapore.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
OFC304 Excel 2003 Overview: XML Support Joseph Chirilov Program Manager.
Practical RDF Chapter 1. RDF: An Introduction
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
11 October 2015 MAVIS v “Sneak Preview”. 11 October 2015 Enhancements in the Release  Reference Material  Brief Accessioning View  Template.
Interfacing Registry Systems December 2000.
© 2012 IBM Corporation Best Practices for Publishing RDF Vocabularies Arthur Ryman,
Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
METS Application Profiles Morgan Cundiff Network Development and MARC Standards Office Library of Congress.
RELATORS, ROLES AND DATA… … similarities and differences.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Metadata Registries Registry: authoritative, centrally controlled store of information – W3C Web Services Glossary, 2004
SPINNING THE SEMANTIC WEB APPLICATIONS FOR THE MODERN ERA LIBRARIES
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
PREMIS Controlled vocabularies Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair Vienna,
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES Debashis Naskar 1 and Biswanath Dutta 2 DSIC, Universitat Politècnica de València.
Slides Template for Module 3 Contextual details needed to make data meaningful to others CC BY-NC.
RDFa How and Why Ralph R. Swick World Wide Web Consortium
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Presented at Archives Records 2016, session 510
Introduction to Metadata
BIBFRAME at the Library of Congress
Wsdl.
Lifecycle Metadata for Digital Objects
Cataloging the Internet
2. An overview of SDMX (What is SDMX? Part I)
Metadata in Digital Preservation: Setting the Scene
Linked Data  at  loc.gov show of hands:
RDA in a non-MARC environment
Beyond OA: Additional methods for enhanced exposure NMU Open Access Seminar 30 October 2018 NMU Port Elizabeth Wynand van der Walt Head Librarian: Technical.
Australian and New Zealand Metadata Working Group
Presentation transcript:

PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress rgue@loc.gov NDIIPP Partners Meeting July 21, 2010

Outline of presentation PREMIS in METS Toolbox (PiM) Authorities and vocabularies web service (id.loc.gov) NDIIPP Partners Meeting July 21, 2010

NDIIPP Partners Meeting PREMIS in METS toolbox Developed by Florida Center for Library Automation under contract with LC A set of open-source tools to support the implementation of PREMIS especially in the METS container format 3 components: validate, convert, describe Source code being made available: http://pimtoolbox.sourceforge.net NDIIPP Partners Meeting July 21, 2010

Describe: uses the DAITSS description service <premis> <ext> </premis> /a/real/file droid/jhove

Convert: between PREMIS and PREMIS in METS OR PREMIS in METS to PREMIS <mets> <premis> </mets> <premis/> xslt

Validate: PREMIS in METS document confirmation or errors <mets> <premis/> </mets> Schematron

Demo: http://pim.fcla.edu/ Audio file: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.ihas.200150574/default.html http://lcweb2.loc.gov/natlib/ihas/service/sousa/200150574/0001.mp3 PDF file: describe demo.pdf Image: http://lcweb2.loc.gov/diglib/ihas/loc.natlib.gottlieb.09601/default.html NDIIPP Partners Meeting July 21, 2010

Authorities and vocabularies web service id.loc.gov Makes LC owned and maintained authorities vocabularies available as Linked Data Allows both human-oriented and programmatic access to LC-promulgated authorities and vocabularies. First offering was LCSH; later additional vocabularies added Search and download available NDIIPP Partners Meeting July 21, 2010

Why establish controlled vocabularies? Control values that occur in metadata Reduce ambiguity Control synonyms Document and publish for reuse Test and validate terms Establish formal relationships among terms (where appropriate) Includes enumerated values in schemas, formal thesauri, code lists, etc. Many metadata schemes allow for content from other sources. Some data elements may be more useful if a controlled vocabulary is used. Some are published formally, others are developed and used locally. Formal controlled vocabularies may be used for testing and validation of terms– this is often done in integrated library systems, where bibliographic records may validate against authority records. This is one instance of testing and validation of terms. There is work being done on establishing metadata registries for both documentation and machine validation of both controlled vocabularies and metadata elements/terms. This could be particularly useful for controlled vocabularies, since their usefulness depends on consistency. NDIIPP Partners Meeting July 21, 2010

Standards maintained at LC that contain controlled vocabularies LCSH/NAF Thesaurus of Graphic Materials MARC Code lists: GACs, countries, languages ISO 639-2 and ISO 639-5 (language codes) Other MARC controlled lists Enumerated lists in XML schemas MODS enumerated values METS enumerated values MIX (Technical metadata for digital still images) PREMIS controlled vocabularies Others… NDIIPP Partners Meeting July 21, 2010

Simple Knowledge Organization System (SKOS) RDF application used to express knowledge organization systems such as thesauri, taxonomies and the concepts within. SKOS has a defined element set which is particularly relevant for controlled vocabularies Relationships between concepts in a concept scheme can be expressed (e.g. broader, narrower) and between concepts in different schemes Having a dereferencable URI for concepts and their concept schemes enhances the ability to provide web services for consumers of these standards Maintained by W3C Semantic Web Deployment Group NDIIPP Partners Meeting July 21, 2010

NDIIPP Partners Meeting “Linked Data” A feature of the “Semantic Web” where links are made between resources Goes beyond hypertext links (i.e. between web pages) but between any kind of object or concept From Wikipedia: "a term used to describe a method of exposing, sharing, and connecting data via dereferenceable URIs on the Web” Users can use links to find similar resources and aggregate results Interaction between data relies on URIs NDIIPP Partners Meeting July 21, 2010

Reasons for developing a web service for vocabularies Facilitate development and maintenance process for vocabularies Make controlled lists openly available Provide comprehensive information about controlled terms Experiment with semantic web technologies and linked data Expose vocabularies to wider communities NDIIPP Partners Meeting July 21, 2010

NDIIPP Partners Meeting URIs in id.loc.gov Interaction with any given individual term and vocabulary is with its URI Some examples of URIs: http://id.loc.gov/vocabulary/relators/art http://id.loc.gov/vocabulary/graphicMaterials/tgm005222 http://id.loc.gov/vocabulary/preservationEvents/migration http://id.loc.gov/authorities/sh85063136 Known-label searches: use when you know the label but not the identifier http://id.loc.gov/vocabulary/relators/label/artist http://id.loc.gov/authorities/label/hunting%20dogs Link goes to RDFa It! Visualizations, “Folk music” Real hook of site is content negotiation NDIIPP Partners Meeting July 21, 2010

Technical infrastructure Django (Python) LCSH MySQL SKOS RDF generated at time of request Operates, more or less, as traditional relational DB MARC mapped to relational DB tables Everything else RDFlib (Python library, uses MySQL as triplestore) Runs on triples XML to SKOS RDF/XML before ingest XSL, Xquery used NDIIPP Partners Meeting July 21, 2010

NDIIPP Partners Meeting Next steps MADS OWL Schema to enable identification of facets e.g. Aeronautics--Soviet Union—History Enhance existing vocabularies to show relationships Broader/narrower relator terms Matches to other vocabulary terms (e.g. MARC vs. ISO 3166 country codes) Add new vocabularies PREMIS controlled vocabularies MARC country, geographic area, languages ISO 639-2 and 639-5 Name authorities Enhance PiM to validate PREMIS vocabulary terms NDIIPP Partners Meeting July 21, 2010