HATHITRUST A Shared Digital Repository Bibliographic Metadata and HathiTrust ALCTS CaMMS Catalog Management Interest Group Meeting American Library Association.

Slides:



Advertisements
Similar presentations
HathiTrust Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.
Advertisements

HATHI TRUST A Shared Digital Repository Building A Future By Preserving Our Past The Preservation Infrastructure of HathiTrust Digital Library Jeremy York.
This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library.
HATHI TRUST A Shared Digital Repository HathiTrust, Collections, and Collaboration COLD 2011 Spring Meeting Jeremy York May 20, 2011.
National Institutes of Health U.S. Department of Health and Human Services The PEPH Resource Center: A New, More Convenient Login.
HATHITRUST A Shared Digital Repository Update on Developments and Activities UM Selectors October 9, 2012 Jeremy York, Project Librarian, HathiTrust.
HATHITRUST A Shared Digital Repository We’re Preserving the Past, What About the Present? NISO Webinar: Ensuring the Preservation of E-Books May 23, 2012.
What’s Next for HathiTrust?. We’re Growing Up! Partnership Arizona State University Baylor University Boston University California Digital Library Columbia.
HATHITRUST A Shared Digital Repository HathiTrust current work, challenges, and opportunities for public libraries Creating a Blueprint for a National.
HATHITRUST A Shared Digital Repository HathiTrust as a Model for Preservation and Access Jeremy York Media Preservation Conference April 17, 2013.
The West` Washington Idaho 1 Montana Oregon California 3 4 Nevada Utah
HATHITRUST A Shared Digital Repository Collective Stewardship through HathiTrust Digital Library African Studies in the Digital Age November 12, 2014 Mike.
HATHITRUST A Shared Digital Repository HathiTrust METS and PREMIS October 25, 2011 Jeremy York Project Librarian, HathiTrust.
HATHITRUST A Shared Digital Repository HathiTrust on the Move A Growing Partnership Taking Stock and Looking Ahead National Library of Medecine October.
HATHITRUST A Shared Digital Repository HathiTrust: A Second Life for Library Collections Jeremy York Exploring Humanities Cyberinfrastructure April 30,
HATHITRUST A Shared Digital Repository HathiTrust: The Collection and Its Uses NEFLIN Webinar - November 7, 2013 Jeremy York, Assistant Director, HathiTrust.
HATHITRUST A Shared Digital Repository A Preservation Infrastructure Built to Last: Preservation, Community, and HathiTrust UNESCO Memory of the World.
HATHITRUST A Shared Digital Repository How Can Digital Collections Support Shared Print Initiatives? The HathiTrust Print Monograph Archive Planning Task.
HATHITRUST A Shared Digital Repository HathiTrust Overview: Partnership and Services Jeremy York Wesleyan University Web Presentation February 18, 2014.
HATHITRUST A Shared Digital Repository Why Digitize? or The Limits of Preservation 2014 TEI/DHCS Plenary Session Evanston, IL Mike Furlough Executive Director,
HATHITRUST A Shared Digital Repository Digital Humanities in HathiTrust: Research At Any Scale Jeremy York Digital Humanities and the Futures of Japanese.
BINARY CODING. Alabama Arizona California Connecticut Florida Hawaii Illinois Iowa Kentucky Maine Massachusetts Minnesota Missouri 0 Nebraska New Hampshire.
States and Cities SOL US II 2c A state is an example of a political region. States may be grouped as part of different regions, depending upon the criteria.
What are the states in the Northeast Region?
HATHITRUST A Shared Digital Repository HathiTrust Past, Present, and Future A Brief Introduction.
U.S. Civil War Map On a current map of the U.S. identify and label the Union States, the Confederate States, and U.S. territories. Create a map key and.
HATHITRUST A Shared Digital Repository More, Better, Together: HathiTrust Accomplishments and Aspirations The Researcher of Tomorrow Universidad Complutense.
CILogon and InCommon: Technical Update Jim Basney This material is based upon work supported by the National Science Foundation under grant numbers
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
HATHITRUST A Shared Digital Repository Collaborating Globally, Planning Locally HathiTrust and New Opportunities in Collection Management GWLA/UNM: Emerging.
Bioengineering Graduate Program Fischell Department of Bioengineering University of Maryland John P. Fisher, Ph.D. Professor and Associate Chair Director.
HATHITRUST A Shared Digital Repository HathiTrust Infrastructure and Information Organization November 7, 2011 Jeremy York Project Librarian, HathiTrust.
LEGEND Public Health Schools Law Schools Medical & Other Schools Public Health Schools Teaching Public Health Law As of July 1, 2012.
Map Review. California Kentucky Alabama.
1. AFL-CIO What percentage of the funds received by Alabama K-12 public schools in school year was provided by the state of Alabama? a)44% b)53%
June, 2012 Art Mandel.  Multiple acceptances to Ivy League Schools  Multiple acceptances to the “Most Competitive” colleges and universities  State.
HATHITRUST A Shared Digital Repository HathiTrust: Key Concepts and Issues in Managing the Digital Archive ICPSR Summer Workshop “Curating and Managing.
UPDATED KUALI STATISTICS. KUALI FOUNDATION MEMBERS – INSTITUTIONAL Australian National University Boston College Boston University Brock University Brown.
HATHITRUST A Shared Digital Repository HathiTrust and TRAC DigitalPreservation 2012 July 25, 2012 Jeremy York, Project Librarian, HathiTrust.
Directions: Label Texas, Arkansas, Louisiana, Mississippi, Tennessee, Alabama, Georgia, Florida, South Carolina, North Carolina, Virginia--- then color.
Harrison’s Top 25 1.Florida State 2.Alabama 3.Oregon 4.Oklahoma 5.South Carolina 6.Michigan State 7.Ohio State 8.Auburn 9.Baylor 10.Georgia 11.UCLA 12.LSU.
CHAPTER 7 FILINGS IN MAINE CALENDAR YEARS 1999 – 2009 CALENDAR YEAR CHAPTER 7 FILINGS This chart shows total case filings in Maine for calendar years 1999.
HATHITRUST A Shared Digital Repository HathiTrust and the Future of Research Libraries American Antiquarian Society March 31, 2012 Jeremy York, Project.
HATHITRUST A Shared Digital Repository Your Library, Now Online! Putting HathiTrust in the Context of Traditional (and New) Library Services MCLS Webinar.
HATHITRUST A Shared Digital Repository Institution Uses of HathiTrust Jeremy York University of Maine May 24, 2013.
Study Cards The East (12) Study Cards The East (12) New Hampshire New York Massachusetts Delaware Connecticut New Jersey Rhode Island Rhode Island Maryland.
Hawaii Alaska (not to scale) Alaska GeoCurrents Customizable Base Map text.
HathiTrust: Collaboration in Building the Universal Collection John Wilkin 1 October 2009.
US MAP TEST Practice
UPDATED KUALI STATISTICS. KUALI FOUNDATION MEMBERS – INSTITUTIONAL (60) Australian National University Boston College Boston University Brock University.
HATHITRUST A Shared Digital Repository HathiTrust Large Digital Libraries: Beyond Google Books Modern Language Association January 5, 2012 Jeremy York,
An Overview of the Platform
Collaboration: to work jointly with others towards a common goal Or the whole is greater than the sum of its parts Lisa B. German Library Faculty Organization.
HathiTrust: A valuable and visionary Partnership.
HATHITRUST A Shared Digital Repository ALA CopyTalk: CRMS The Copyright Review Management System September 1, 2016 Melissa Levine, Lead Copyright Officer,
Introducing Students to the Locker
2012 IFTA / IRP MANAGERS’AND LAW ENFORCEMENT WORKSHOP
2c: States grouped by region
Faculty Salary Study Comparison to AAU Data Exchange Institutions
Expanded State Agency Use of NMLS
USAGE OF THE – GHz BAND IN THE USA
Name the State Flags Your group are to identify which state the flag belongs to and sign correctly to earn a point.
GLD Org Chart February 2008.
The States How many states are in the United States?
Table 2.3: Beds per 1,000 Persons by State, 2013 and 2014
Supplementary Data Tables, Utilization and Volume
WASHINGTON MAINE MONTANA VERMONT NORTH DAKOTA MINNESOTA MICHIGAN
CBD Topical Sales Restrictions by State (as of May 23, 2019)
From Innovation to Commercialization Access to Data
USAGE OF THE 4.4 – 4.99 GHz BAND IN THE USA
Presentation transcript:

HATHITRUST A Shared Digital Repository Bibliographic Metadata and HathiTrust ALCTS CaMMS Catalog Management Interest Group Meeting American Library Association MidWinter Convention Philadelphia, Pennsylvania, January 25, 2014 Jon Rothman, Head, Library Systems Office, University of Michigan

HathiTrust Mission To contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge.

HathiTrust Background Launched in 2008 by the libraries of the CIC Committee on Institutional Cooperation (CIC) and the University of California System. Initial focus on digitized book and journal content – 10,922,113 total volumes – 3,563,589 public domain (~33%) Currently 91 partner institutions and continuing to grow.

Partnership Allegheny College Arizona State University Baylor University Boston College Boston University Brandeis University Brown University California Digital Library Carnegie Mellon University Colby College Columbia University Cornell University Dartmouth College Duke University Emory University Florida State University Getty Research Institute Harvard University Library Indiana University Iowa State University Johns Hopkins University Kansas State University Lafayette College Library of Congress Massachusetts Institute of Technology McGill University` Michigan State University New York Public Library New York University North Carolina Central University North Carolina State University Northwestern University The Ohio State University The Pennsylvania State University Princeton University Purdue University Stanford University Syracuse University Temple University Texas A&M University Tufts University Universidad Complutense de Madrid University of Alabama University of Alberta University of Arizona University of British Columbia University of Calgary University of California Berkeley Davis Irvine Los Angeles Merced Riverside San Diego San Francisco Santa Barbara Santa Cruz The University of Chicago University of Connecticut University of Delaware University of Florida University of Houston University of Illinois University of Illinois at Chicago The University of Iowa University of Kansas University of Maryland University of Massachusetts, Amherst University of Miami University of Michigan University of Minnesota University of Missouri University of Nebraska- Lincoln The University of North Carolina at Chapel Hill University of Notre Dame University of Oklahoma University of Pennsylvania University of Pittsburgh University of Queensland University of Tennessee, Knoxville University of Utah University of Vermont University of Virginia University of Washington University of Wisconsin- Madison Utah State University Vanderbilt University Virginia Tech Wake Forest University Washington University Yale University Library

Where does HathiTrust’s bibliographic metadata come from? Bibliographic metadata is provided by depositors of digital content. Metadata must be supplied to HathiTrust before ingest of digital content can occur The metadata is used in several ways, including – To act as a manifest of the materials being deposited. – To identify and track records to their contributor. – To help in making an initial rights determination about each volume.

Minimal metadata specifications for deposited records Valid MARC binary or MARCXML structure Valid leader and $$a (or $$k where appropriate) A 955 field describing a single item – Item identifier (usually barcode) – Item description (enumeration/chronology) for multi-volume works OCLC Number (strongly preferred)

Duplicate detection Simple identifier match at bibliographic level, using OCLC numbers. OCNs most ubiquitous and unique identifiers in the records, but there are issues… Records without OCNs – Some partners didn’t have OCNs in any of their records – Some have had them in many, but not all, of their records Differences in OCN location, prefixes, etc. in records Different OCNs for same item.

HathiTrust metadata management Where – HathiTrust bibliographic metadata was managed in the University of Michigan’s Aleph LMS from 2008 until… – Zephir, a dedicated HathiTrust metadata management system developed by California Digital Library, launched in production in early December, Underlying principle – Records supplied to HathiTrust are not considered definitive. – Definitive record lives in the source institution’s own system and/or Worldcat.

Zephir Functionality Keeps all versions of records received from depositors. – OCLC number still used for duplicate detection – Records are clustered rather than merged. A weighting algorithm determines best bibliographic record in each cluster. – selected record, with item-level data for all ingested items attached to that cluster are selected for output. Provides a daily output of new/changed records. Records where none of the associated digital items have been ingested yet are not included.

Record correction and update General policy is not to correct or update the content of contributors’ records. In most cases, contributors are asked to correct and re-submit records with observed metadata errors or issues. When it’s necessary for a correction to happen quickly: – A corrected “shadow record” is created in Zephir -- temporarily takes the place of the contributor record in outputs. – Contributor is asked to submit a corrected record. When corrected record is received, the shadow record is removed.

Contributor Bibliographic Records Contributor Bibliographic Records HathiTrust Metadata Management (Zephir) HathITrust Access Processing HathiTrust Ingest Framework (Feed) Rights DB HathiTrust Catalog Bib API OAI Identifiers of ingested objects Metadata about newly-loaded records Zephir daily export Digital Object Repository Catalog + Full Text Hathifiles WorldCat Individual library catalogs, etc.

Contributor Bibliographic Records Contributor Bibliographic Records HathiTrust Metadata Management (Zephir) HathITrust Access Processing HathiTrust Ingest Framework (Feed) Rights DB HathiTrust Catalog Bib API OAI Identifiers of ingested objects Metadata about newly-loaded records Zephir daily export Digital Object Repository Catalog + Full Text Hathifiles WorldCat Individual library catalogs, etc.

Contributor Bibliographic Records Contributor Bibliographic Records HathiTrust Metadata Management (Zephir) HathITrust Access Processing HathiTrust Ingest Framework (Feed) Rights DB HathiTrust Catalog Bib API OAI Identifiers of ingested objects Metadata about newly-loaded records Zephir daily export Digital Object Repository Catalog + Full Text Hathifiles WorldCat Individual library catalogs, etc.

Contributor Bibliographic Records Contributor Bibliographic Records HathiTrust Metadata Management (Zephir) HathITrust Access Processing HathiTrust Ingest Framework (Feed) Rights DB HathiTrust Catalog Bib API OAI Identifiers of ingested objects Metadata about newly-loaded records Zephir daily export Digital Object Repository Catalog + Full Text Hathifiles WorldCat Individual library catalogs, etc.

HATHITRUST A Shared Digital Repository Bibliographic Metadata and HathiTrust ALCTS CaMMS Catalog Management Interest Group Meeting American Library Association MidWinter Convention Philadelphia, Pennsylvania, January 25, 2014 Jon Rothman, Head, Library Systems Office, University of Michigan