The 3 M’s: MINERVA, MODS, and METS Allene Hayes (LC) Rebecca Guenther (LC) Leslie Myrick (NYU) DLF -- New Orleans April 20, 2004.

Slides:



Advertisements
Similar presentations
UKOLN is supported by: Using the RSLP schema Ann Chapman Collection Description Focus A centre of expertise in digital information management
Advertisements

February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
The Library of Congress Cooperative Web Archiving Project Abbie Grotke, Library of Congress Grant Harris, Library of Congress Jennifer Long, Georgetown.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
THE RUTGERS WORKFLOW MANAGEMENT SYSTEM Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries August 3, 2007.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen Cornell University May 16, 2006
1 Minerva The Web Preservation Project. 2 Team Members Library of Congress Roger Adkins Cassy Ammen Allene Hayes Melissa Levine Diane Kresh Jane Mandelbaum.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
A METS Application Profile for Historical Newspapers
Access to Individual Harvested Sites in a Web Archive Tracy Meehleib DLF Fall Forum, Providence, RI November 13th, 2008.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Joanne Archer University of Maryland Kate Odell Archive-It Abbie Grotke Library of Congress Tessa Fallon Columbia University Creating and Maintaining Web.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
1 Archiving and Preserving the Web Dan Avery Kristine Hanna Merrilee Proffitt Internet Archive RLG April 2006.
Isabel Silver and Laurie Taylor IMLS Library Publishing Services Workshop May 5, 2011 UF Smathers Libraries Publishing Services.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Web Capture team Office of strategic initiatives February 27, 2006 Selecting Content from the Web: Challenges and Experiences of the Library of Congress.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
13 Oct DC2004--IFLA New and traditional descriptive formats in the library environment DC2004: IFLA session 13 Oct Rebecca Guenther
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Ask A Librarian and QuestionPoint: Integrating Collaborative Digital Reference in the Real World (and in a really big library) Linda J. White Digital Project.
Digitization of the Federal Depository Library Program Judith C. Russell Superintendent of Documents & Managing Director, Information Dissemination “Electronic.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
OCLC Research: an update Lorcan Dempsey
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
Office of Strategic Initiatives All Hands Meeting-March 2010 Challenges in Web Archiving: Library of Congress Edition Abbie Grotke, Web Archiving Team.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
RDA: Resource Description and Access A New Cataloging Standard for a Digital Future Jennifer Bowen RDA Forum ALA Annual Meeting, New Orleans, June 24,
Journalism & Media Studies Graduate Student Culminating Work : Steps for Submitting to the Campus Digital Archive at USFSP November 21, 2011 by Carol Hixson.
Nov. 6, 2003 MAVIS MARCXML, MODS and Dublin Core MAVIS Users Conference 2003 Rebecca Guenther Library of Congress.
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Information for Scotland 816 Nov 2001 The potential of CORC Gordon Dunsire presented at Information for Scotland 8 16 November 2001, Edinburgh.
PACSCL Consortial Survey Initiative Group Training Session February 12, 2008 at The Historical Society of Pennsylvania.
Introduction to metadata
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
BEN METADATA SPECIFICATION Isovera Consulting Feb
Evidence from Metadata INST 734 Doug Oard Module 8.
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Metadata Extraction & Web Archives: Automating the Record Creation Process Abbie Grotke / Gina Jones /
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata Extraction and Web Archives: Automating the Record Creation Process Tracy Meehleib Library of Congress, NDMSO NDIIPP June 25, 2009.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Collection Management Systems
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
An Application Profile and Prototype Metadata Management System for Licensed Electronic Resources Adam Chandler Information Technology Librarian Central.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
A centre of expertise in digital information management UKOLN is supported by: Metadata – what, why and how Ann Chapman.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Tiewei (Lucy) Liu Metadata Librarian June 26, 2016
Introduction to Metadata
Attributes and Values Describing Entities.
Márton Németh – László Drótos How to catalogue a web archive?
Attributes and Values Describing Entities.
Metadata supported full-text search in a web archive
Presentation transcript:

The 3 M’s: MINERVA, MODS, and METS Allene Hayes (LC) Rebecca Guenther (LC) Leslie Myrick (NYU) DLF -- New Orleans April 20, 2004

Topics of Discussion MINERVA MODS METS

Mission Statement MINERVA collects and preserves Web sites and Web pages for the Library Goal is to preserve primary source materials for future generations

Why Archive Web sites? "The Internet is as important as the print media for documenting these events.” "Why? Because the Internet is immediate, far-reaching, and reaches a variety of audiences. You have everything from self-styled experts to known experts commenting and giving their viewpoint."

Our Core Team Project Manager 2 Reference Specialists 2 Digital Conversion Specialists Cataloging Specialist Network Development and MARC Standards Office representative and soon, we hope, a digital rights specialist…

Partners Internet Archive Alexa Internet WebArchivist.org –University of Washington, Center for Communications and Civic Engagement –State University of New York, Institute of Technology (SUNY-IT)

Collections FY00 LC Prototype: sampling of sites Election 2000: 767 sites FY01 September 11 th : 30,000+ sites

FY Winter Olympics: 70 sites 9/11 Remembrance: 1,800 sites Election 2002: 3,000+ sites FY th Congress: 588 sites War on Iraq: 288 sites Election 2004: 134 sites & growing

The Prototype Initial test Mid 2000 LC crawled about 30 sites using a desktop Web crawler (HTTrack) Provided an initial investigation into the processes involved in Web capture Sites were cataloged Access is on campus only

The MINERVA Process Collection Planning Selection Notification/ Permissions Technical Review Crawl & QA Cataloging Interface Development Legal Review Access

OSI The Process is Evolving Collection Planning Selection Notification/ Permissions Technical Review Crawl & QA Cataloging Interface Development Legal ReviewAccess Preservation? Legal Authority Select + create metadata up front Non-Event based collecting Automation & Tools? Templates & Tools In-house expertise Alternate Crawl Contractors International Internet Preservation Consortium

Collection planning Identify the “event” (thematic) Secure the funding Develop task order –Scope of collection –Acquisition parameters Size of collection Collection period (start and end dates) Web site URLs – define categories for selection Frequency, depth, breadth

Legal Review Bibliographic Services Agreement Task Orders Notifications and permissions –Fair use argument for event-based collecting Notice of crawl; Permission to display offsite Permission to crawl and display: foreign and “creative” sites –Modification of 407 mandatory deposit regulation

Selection MINERVA reference specialists work with Recommending Officers to: Develop collection policy statement for event- based collections Identify specific Web sites to collect Identify contact information of site owner/producer Enter into database using nomination form

Notification/Permissions Tasks currently shared by all members of the team: appropriate notification and permissions Respond to phone & queries Find alt addresses and resend if no response Track responses in database Provide access condition statements for catalog record, inform ITS of restrictions

Technical Analysis Identify potential technical challenges: –Macromedia Flash TM introductions –Log-ins, either free or pay sites –Dynamically-generated Web pages –Dynamic menu –Databases Determine best point of entry (eg. start URL)

Crawl and QC Access to crawl 24 hrs after completed Quality assurance using status and error reports Troubleshoot problems with crawl Modifying the crawl to be more focused, refined, specific

CATALOGING Prototype –MARC records in ILS Collection Level –MARC records in ILS MODS Item Level –MODS records on MINERVA site, but created with import to ILS in mind

Lesson Learned Some type of cataloging/descriptive metadata needs to be done to be able to search through the collection

MODS Metadata Object Description Schema Descriptive metadata standard Uses XML Schema A derivative of MARC using language-based tags but MARC semantics Element set is compatible with existing descriptions in large library databases Particularly applicable to digital objects Hierarchy allows for rich description, especially of complex digital objects

Why MODS Rich but not too rich descriptive metadata format; richer/more hierarchical than Dublin Core and simpler than MARC 21/MARCXML Alternative for emerging initiatives –Z39.50 Next Generation specified format –extension schema to METS –to represent metadata for harvesting (OAI) –As an interoperable format for convergence between MARC and non-MARC XML descriptions

Status of MODS Open listserv collaboration of self-selected possible implementors, LC coordinated (1 st half 2002) First comment and use period: June – December 2002 Version 2.0 Feb Dec MODS version 3.0 now available Registration submitted to NISO and going through approval process

Fields Used in Election 2002 Title Name (structured form) Abstract Date captured Genre (value always “Web site”) Physical description (file formats) Identifier (base URL) Language Access conditions/rights management Subject (keyword or LCSH if possible)

Sample MODS record for Election 2002 Web site Fran Ulmer Web site record (XML)Fran Ulmer Web site

107 th Congress Enhanced descriptive metadata (MODS) created at LC Include authoritative forms of names, subject headings, and classification numbers Registered handles for archived sites Plan to use LC created tools for METS creation, record input and search/browse –Example:

MODS record Joint Economic Committee XML record

Lessons learned (cataloging) More accurate, usable cataloging if we do in- house –We understand the way the data will be used Good metadata results in good searching Simple records can be created in minimal time using MODS Flexibility of XML allows for options in display

Interface & Access IA & WebArchivist.org interfaces: Election 2000, September 11, Election 2002 LC moving to refine, improve, develop interfaces in-house Access to available collections through MINERVA home page and through ILS collection level records Transferring access to archive to on-site

METS/MODS for Minerva Experiment with METS objects for Web sites MODS for descriptive metadata –Hierarchy in relatedItem powerful for multiple captures and linked pages –Works well with METS structMap –relatedItem type=“host” link to aggregated resource –relatedItem type=“constituent” for linked pages –Allows for descriptive metadata at lower level and facilitates display of object Developing tools for capture of some technical metadata Developing METS profile for Web sites