University of North Texas Enhancing the Quality of Metadata: Modular Approach to Digital Resource Lifecycle Management Daniel Gelaw Alemneh & Mark E. Phillips.

Slides:



Advertisements
Similar presentations
Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital.
Advertisements

Digital Initiatives at the University of North Texas Libraries Cathy Nelson Hartman University of North Texas Libraries Texas Conference on Digital Libraries.
METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Information Professionals and Learning Object Repositories … more than just metadata quality … Sarah Currier Stòr Cùram Project Librarian JISC X4L Repository.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
1 Extending the Implementation of PREMIS to Geospatial Resources in the Stanford Digital Repository: An Exploration By Nancy J. Hoebelheinrich Metadata.
Perspectives from The Alberta Library Learn, think, CHANGE 2004 Online Learning Symposium November 3, 2004 Zahina Iqbal.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
The NSDL Registry Diane Hillmann  Jon Phipps. What We’re Doing Received an NSF grant in Oct. 2006, to: Register metadata schemas, vocabularies, application.
THE RUTGERS WORKFLOW MANAGEMENT SYSTEM Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries August 3, 2007.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
Introducing Symposia : “ The digital repository that thinks like a librarian”
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
Persistent Digital Archives and Library System (PeDALS) A Guide for Wisconsin State Agencies.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
SobekCM’s Community Ecosystems & Socio-Technical Practices Presented by Mark V. Sullivan June 10 th, 2014 Sobek image created by Jeff Dahl and is shared.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
Diving In: Testing the Archivists’ Toolkit. 21 Oct. 2006Archivists' Toolkit at NEA2 Bradley D. Westbrook, UC San Diego Katherine Stefko, Bates College.
“Mapping the Southwest”: UNT-UTA Collaborative Project Daniel Gelaw Alemneh, Jerrell Jones, University of North Texas (UNT), and Ann Hodges University.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Descriptive metadata in the Finnish National digital library and the role of CIDOC CRM in the standards portfolio of NDL Juha Hakala The National Library.
Enhancing Content Visibility in Institutional Repositories: Maintaining Metadata Consistency Across Digital Collections Ahmet Meti Tmava and Daniel Gelaw.
MAINTAINING QUALITY METADATA: TOWARD EFFECTIVE DIGITAL RESOURCE LIFECYCLE MANAGEMENT Daniel Gelaw Alemneh University of North Texas.
University of North Texas Libraries Building Search Systems for Digital Library Collections Mark E. Phillips Texas Conference on Digital Libraries May.
Mapping the Southwest is a 3-year project (2010 to 2013) funded by a National Endowment for the Humanities (NEH) We the People grant. The University of.
The Portal to Texas History: Harnessing Technology to Enable Collaboration with Small Museums and Libraries CNI, December 6, 2005 Cathy Nelson Hartman.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Aligning library-domain metadata with the Europeana Data Model Sally CHAMBERS Valentine CHARLES ELAG 2011, Prague.
Is Dublin Core Dying? Kayla Willey – Brigham Young University Cheryl Walters – Utah State University Utah Library Association Annual Conference St. George,
Use & Access 26 March Use “Proof of Concept” Model for General Libraries & IS faculty Model for General Libraries & IS faculty Test bed for DSpace.
Digital preservation activities at the NLW Sally McInnes 18 September 2009.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Curating the Southwest Region’s Maps: UNT-UTA Collaborative Project Daniel G. Alemneh, Mark E. Phillips, and Cathy Hartman University of North Texas (UNT)
1 A Very Large Digital Library Technology Demonstration William Y. Arms Cornell University.
Introduction to metadata
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
BEN METADATA SPECIFICATION Isovera Consulting Feb
A Whirlwind Tour Through Part of the Metadata Landscape Jenn Riley Metadata Librarian IU Digital Library Program.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Digitization – Basics and Beyond workshop Interoperability of cultural and academic resources New services for digitized collections Muriel Foulonneau.
Metadata Interaction, Integration, and Interoperability MODS, MARC and Metadata Interoperability, ALA Conference, June 27, 2005, Chicago, IL William E.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
Jenn Riley Metadata Librarian IU Digital Library Program
Metadata (and cataloging?) Jenn Riley Metadata Librarian IU Digital Library Program.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
Collaborative Approach to Address Scholarly Communications and Digital Curation Challenges Kris Helge, Laura Waugh, Daniel Alemneh SCDC Affinity Group.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Shades of Grey: Integrating Metadata for Discovery in a Mixed-Content Single-Subject Library GENEVIEVE PODLESKI FEDERAL RESERVE BANK OF ST. LOUIS.
Metadata Workflows. Metadata Specialist Scenario The typical digital library development situation facing the metadata specialist: –We have some functional.
Building Digital Archives Mark Phillips Cathy Hartman June 6, 2008.
Digitization Workflows From the Digital Projects Unit University of North Texas Libraries Mark E. Phillips Jeremy D. Moore February 12, 2009.
Meta/Data As If Research Depends On It
The Use of EAD in Archival Based Repositories
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Building Search Systems for Digital Library Collections
Introduction to Metadata
Metadata to fit your needs... How much is too much?
Preserving Our Collective Digital History
PREMIS Tools and Services
Introduction to Metadata
Attributes and Values Describing Entities.
Presentation transcript:

University of North Texas Enhancing the Quality of Metadata: Modular Approach to Digital Resource Lifecycle Management Daniel Gelaw Alemneh & Mark E. Phillips IS&T, Archiving-2007 Conference May 23, 2007, Arlington Virginia

University of North Texas University of North Texas (UNT) Libraries Digital Initiatives Collaborative Initiatives CyberCemetery GPO NARA – Affiliated Archive Texas Register Archive Secretary of State’s Office Texas Laws and Resolutions Archive Secretary of State’s Office The Portal to Texas History 45 Libraries & Museums Web-at-Risk Project California Digital Library New York University

University of North Texas University of North Texas (UNT) Libraries Digital Initiatives Library Digital Collections Congressional Research Service Archive CRS Reports Portal to Texas History 20,000 records World War Poster Collection 500 WWI and WWII Posters Advisory Commission on Intergovernmental Relations 408 reports = 47,874 pages Federal Communications Commission (FCC) Record 136 issues = 43,115 pages (6 of 21 volumes completed) GovDocs A to Z digitization project 186 scanned 500+ in queue Jean-Baptiste Lully Collection 27 scores = 10,000 pages

University of North Texas Metadata Environment Metadata-based digital resource management activities UNT Libraries metadata locally qualified Dublin Core based descriptive metadata. Detailed technical and preservation metadata elements Web based metadata creation and editing Interoperability Metadata Crosswalks Mods Marc oai_dc PREMIS

University of North Texas Metadata Quality The two aspects of digital library data quality: The quality of the data in the objects themselves The quality of the metadata associated with the objects Poor metadata quality: Ambiguities Poor recall Poor precision Inconsistency of search results

University of North Texas Metadata Quality … Most Common errors: Incorrect Data: Letter transposition Letter omission Letter insertion Letter substitution or misstrokes Missing Data Elements and values not present at all (null) Insufficient or incomplete data Ambiguous Data Confusing or inconsistent data e.g. multiple spellings, multiple possible meanings, mixed cases, initials, etc.

University of North Texas Factors Influencing Metadata Quality Local Requirements: Objects Heterogeneity What type of objects will the repository contain? Granularity How will they be described? Functionality What functionality is required? How will it be interfaced?

University of North Texas Factors Influencing Metadata Quality … Collaborative Requirements: Diversity of Users How best diverse information-seeking behaviors can be met? Interoperability Will metadata be meaningful within aggregations of various kinds? What is required for interoperability? (Structure, semantics, & syntax) Digital rights issues Will access restrictions be imposed? Are requirements formal or informal? Are there other access and associated digital rights issues?

University of North Texas Factors Influencing Metadata Quality… Training Issues Necessary expertise to create and manage rigorous metadata Metadata quality can be determined to a great extent by: knowledge of the source, and knowledge of the methodology used to create the statement Cost Rigorous metadata is resource intensive and too costly

University of North Texas UNT Metadata Quality Assurance Mechanisms & Tools The two main stages of metadata qualities assurances: Pre-injust 1. Metadata Creation tools (Templates) Post-injust 2. Metadata Analysis tools (Web-based tools)

University of North Texas Quality Assurance Mechanisms and Tools: Templates 1. Metadata Creation Tools (Templates) Validates Mandatory elements Metadata Template Creator Template Reader Controlled vocabularies (UNTLBS)

University of North Texas

UNT Metadata Quality Assurance Mechanisms & Tools… 2. Metadata Analysis Tools NULL Values List/Browse All Values (by each qualifiers and elements) List Authorities Values Graphical reports and other fun stuff Clickable Maps by Institution and Collection Word Clouds by elements Records added overtime and other graphical reports

University of North Texas

Summary Determine level of quality required Partners may have much in common, but they have diverse and sometimes conflicting metadata requirements. Determine nature of gap and how to close it effectiveness, efficiency, practicability, scalability Machine verses human error handling How much of the process can be automated? Human review of results is still essential (e.g. highlighted items) Compromise One size does not fit all! Prioritize Resources very unlikely to be available to meet all requirements Test the workflow Test, retest, and evaluate the quality cycle continuously

University of North Texas

Questions?