Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.

Slides:



Advertisements
Similar presentations
Don’t Type it! OCR it! How to use an online OCR..
Advertisements

Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
METS: An Introduction Structuring Digital Content.
Services Digitisation & Content Management. 600 People – India.
METS In order to reconstruct the archive, we will need to understand the METS files. METS is schema that provides a flexible mechanism for encoding descriptive,
METS Dr. Heike Neuroth EMANI – Project Meeting February 14 th - 16 th, 2002 Springer-Verlag Heidelberg Göttingen State and University Library (SUB)
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
Susan Dahl University of Alberta METS and the Peel’s Prairie Provinces Project.
These ain’t “Old News”! Creating access to historic newspapers Christine Guenther OCLC Product Manager, Digital Services Preservation Service Centers Bethlehem,
Newspaper Preservation through Collaboration and Communication The Texas Digital Newspaper Program By Ana Krahmer & Mark Phillips University of North Texas.
Joachim Bauer Senior System Engineer, CCS
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
METS What is METS ? What is METS ? A schema that provides a flexible mechanism for encoding descriptive, administrative, and structural metadata for a.
PIALA 2010 UH Manoa Hamilton Library Chronicling America and the National Digital Newspaper Program: Technical Aspects  Part 1: Newspapers and Microfilm.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
Sai Deng, Metadata Catalog Librarian, Wichita State University Libraries Tse-Min Wang, Graduate Student in CS, Wichita State University Digital Imaging.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
A METS Application Profile for Historical Newspapers
Digital Encoding What’s behind E-text Resources?.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
The National Digital Newspaper Program (NDNP) An NEH/LC Collaborative Program Enhancing access to historical newspapers Release: September 2006.
Port Townsend Leader Historical Newspaper Archive Keith Darrock.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
{ Building Open Access To Our Heritage Andrew Weidner Project Coordinator, New Mexico Historical Newspapers University of North Texas Libraries: Digital.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Erin Kinney, Wyoming State Library. Motivation #1 priority that came out of 2004 statewide digitization meeting WSL received many reference questions,
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
© January/2008 CCS Content Conversion Specialists GmbH Weidestr. 134, Hamburg, Germany consulting technology digitization services.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
PREMIS and the National Digital Newspaper Program Justin Littman Office of Strategic Initiatives, LC
Dominic Bordelon and Adam St.Pierre.  Based upon The Advocate Obituary Index  Obtained obituaries from microfilm to make full-text searchable records.
National Park Service U.S. Department of the Interior Resource Information Management Division National Information Systems Center Office of the Chief.
Organizing Internet Resources OCLC’s Internet Cataloging Project -- funded by the Department of Education -- from October 1, 1994 to March 31, 1996.
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Integrating a Statewide Web Gateway With Digital Collections ______________________ Eric Weig and Beth Kraemer University of Kentucky and KCVL.
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
Implementation scenarios, encoding structures and display Rob Walls Director Database Services Libraries Australia.
Metadata Bridget Jones Information Architecture I February 23, 2009.
METS, Standards and Rights METS, Safonau a Hawliau Vicky Phillips Digital Standards Manager Rheolwr Safonau Digidol 4 th March ydd Mawrth 2014.
Best Practices for Digital Imaging and Metadata Roy Tennant The Library, University of California, Berkeley
METS Application Profiles Morgan Cundiff Network Development and MARC Standards Office Library of Congress.
Evidence from Metadata INST 734 Doug Oard Module 8.
Graphics & Images What File Format Do I Use?. Graphics & Images …..are visual images presented on some form of media (drawings, print, web, digital video)
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Tiziana // Alessandra Lenzi - MG Breaking down the walls Project Museo Galileo and the Linked Open Data A joint project between.
Feb 21-25, 2005ICM 2005 Mumbai1 Converting Existing Corpus to an OAI Compliant Repository J. Tang, K. Maly, and M. Zubair Department of Computer Science.
The Catalog of the Future: Integrating Electronic Resources By Dana M. Caudle Cataloging Librarian Auburn University Libraries
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
Michigan Digital Newspaper Project Contributing 100 thousand pages in Chronicling America
7th Annual Hong Kong Innovative Users Group Meeting
From the old to the new… Towards better resource discoverability
Professional development training on cataloging at the University Wisconsin-Madison Memorial Library, USA 14th October -24th October, 2016 Aigerim Shurshenova.
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Introduction to Metadata
Workshop on XML-Based Library Applications 5
DIGITAL ARCHIVES Into the Light
Metadata - Catalogues and Digitised works
Metadata to fit your needs... How much is too much?
Márton Németh – László Drótos How to catalogue a web archive?
Presentation transcript:

Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American Newspapers Project BACKGROUND South Carolina Digital Newspaper Program (SCDNP) is a participant of the National Digital Newspaper Program (NDNP). NDNP is a partnership of the National Endowment for the Humanities and the Library of Congress to help states digitize their historical 19 th and early 20 th century newspapers. This digitized content is made available in the Library of Congress’ free, online database, Chronicling America: Historic American Newspapers. To date, 32 states have participated with the goal of reaching all 50 states and digitizing 20 million pages by SCDNP has participated since 2009 and is digitizing 300,000 newspaper pages. For more information, visit SCDNP’s website at STANDARDS & GUIDELINES SCDNP follows strict NDNP technical guidelines and specifications. Metadata standards are derived from several standards, vocabularies, and ontologies and some tie to external sites DCMI Metadata Terms the Bibliographic Ontology DBpedia Dublin Core DCMI Terms FRBR concepts in RDF GeoNames LCCN Permalink lingvoj.org MARC OAI-ORE OWL RDA WorldCat NDNP Tech Specs Produce  8-bit Grayscale images scanned from microfilm (scanned for max. resolution between dpi relative to the originals)  OCR with bounding boxes-no article segmentation  Structural metadata for pages, issues, editions, and titles to support chronologically-based browsing interface  Four deliverables per page including a tiff, jpeg2000, pdf, and ocr file in xml format  Up-dated MARC records from the CONSER OCLC database LC Metadata Standards MARC- for extracting descriptive data about newspaper titles; transformed into MODS xml metadata. MODS (Metadata Object Description Schema) MODS is a schema for a bibliographic element set that may be used for a variety of purposes, and particularly for library applications. The standard is maintained by the Network Development and MARC Standards Office of the Library of Congress with input from users. For more info, Development and MARC Standards Officehttp:// METS (Metadata Encoding and Transmission Standard) The METS schema is a standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library, expressed using the XML schema language of the World Wide Web Consortium. The standard is maintained in the Network Development and MARC Standards Office of the Library of Congress For NDNP, title, issue, and reel metadata wrapped in METS. For more info, schema languageWorld Wide Web Consortium Network Development and MARC Standards Office ALTO (Analyzed Layout and Text Object) is a XML Schema that details technical metadata for describing the layout and content of physical text resources, such as pages of a book or a newspaper. It most commonly serves as an extension schema used within the Metadata Encoding and Transmission Schema (METS) administrative metadata section. For NDNP, OCR text must be encoded using the ALTO (Analyzed Layout and Text Object) XML schema, version 2.0. For more info, Encoding and Transmission Schema (METS) administrative metadata section METS ALTO XML Object Model ALTO (analyzed layout and text object) stores layout information and OCR recognized text of pages of any kind of printed documents like books, journals and newspapers. ALTO is an open xml standard format to store layout and content information. It is designed to be used as an extension schema to METS where METS provides metadata and structural information while ALTO contains content and physical information. Newspaper Metadata is converted to XML Batch level metadataBatch level metadata converted to xml files Laura Blair and Virginia Pierce, South Carolina Digital Newspaper Project, USC Libraries Reel level metadata Reel level metadata converted to xml files Issue and Page level metadata Issue/Page level metadata converted to xml files OCR xml files Chronicling America, a free, keyword searchable database for digitized historic newspapers. Visit the site at View of final product: a n historical South Carolina newspaper page loaded into Chronicling America,