This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library.

1 This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library Archiving 2009 May 5-8, 2009

2 What is HathiTrust?

3 California Digital Library; Indiana University; Michigan State University; Northwestern University; The Ohio State University; Penn State University; Purdue University; UC Berkeley; UC Davis; UC Irvine; UCLA; UC Merced; UC Riverside; UC San Diego; UC San Francisco; UC Santa Barbara; UC Santa Cruz; The University of Chicago; University of Illinois; University of Illinois at Chicago; The University of Iowa; University of Michigan; University of Minnesota; University of Wisconsin-Madison; University of Virginia

4 Current Holdings As of May 5: 2,823,385 volumes; 448,413 in the public domain (~16%)

5 How it came to be

6 University of Michigan Large Scale Production Environments – JSTOR – Making of America – PEAK – Humanities Text Initiative

7 Committee on Institutional Cooperation Long history of successful cooperation Voluntary partnership Build strengths of all for benefit of all

8 University of California System-wide planning Shared storage, cataloging Standards – preservation and access

9 University of Virginia Electronic Text Center 1992 Focus on the scholar Innovation and Research User-centered orientation

10 Origins - Chronology UM in 2004 …U of M shall have the right to use the U of M Digital Copy, in whole or in part at U of M's sole discretion, as part of services offered in cooperation with partner research libraries such as the institutions in the Digital Library Federation…

11 Origins - Chronology 2007: CIC/Google Agreement; Shared Digital Repository. 2008: University of California and University of Virginia join; launched October 2008

12 Goals and Aspirations How we are doing

13 Partnership Grow Voluntary/Flexible Stable

14 Governance Model Executive Committee Strategic Advisory Board

15 Executive Committee: Paul Courant, University Librarian and Dean of Libraries, University of Michigan; John King, Vice Provost for Academic Information, University of Michigan; Patricia Steele, Dean of Libraries, Indiana University; Brad Wheeler, Chief Information Officer, Indiana University; Paula Kaufman, University Librarian and Dean of Libraries, University of Illinois at Urbana-Champaign; Laine Farley, Executive Director, California Digital Library; Brian Schottlaender, University Librarian, University of California, San Diego Libraries; John Wilkin, Executive Director of HathiTrust and Associate University Librarian, Library Information Technology, University of Michigan

16 Strategic Advisory Board Guiding hand of HathiTrust At least 4 members from the CIC, 3 members from the University of California

17 Strategic Advisory Board – Ed Van Gemert (Chair), Director of Libraries, University of Wisconsin-Madison – John Butler, Associate University Librarian for Information Technology, University of Minnesota – Patricia Cruse, Director, Preservation, California Digital Library – Robin Dale, Associate University Librarian for Collections and Library Information Systems, University of California, Santa Cruz – R. Bruce Miller, University Librarian, University of California, Merced – Sarah Pritchard, University Librarian, Northwestern University – Paul Soderdahl, Director, Library Information Technology, University of Iowa – John Wilkin, Executive Director, HathiTrust (ex officio)

18 Partnership/Cost Model HathiTrust Funded for initial 5-year period (2008-2013) Base funding from member institutions 3-year review Constitutional Convention – Members by September 2010 – Contribute content by March 2011

19 How much does it cost? Infrastructure

20 Costs Estimate content over 5 years Calculate proportional cost Calculate average per-year cost < $0.15 per volume One-time fee (25% of yearly cost)
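The cost steps on this slide can be read as a small calculation. The sketch below is one possible interpretation, not HathiTrust's actual formula: it assumes each partner's yearly cost is the shared infrastructure cost multiplied by that partner's share of all volumes, and that the one-time fee is 25% of the resulting average yearly cost. The function name, parameter names, and all figures in the usage note are hypothetical.

```python
def estimate_partner_costs(partner_volumes, total_volumes, infrastructure_costs,
                           one_time_fraction=0.25):
    """Return (average_yearly_cost, one_time_fee) for one partner.

    All three lists hold one entry per projected year (hypothetical inputs):
      partner_volumes      - volumes the partner is expected to hold
      total_volumes        - volumes held by all partners combined
      infrastructure_costs - shared infrastructure cost for that year
    """
    if not (len(partner_volumes) == len(total_volumes) == len(infrastructure_costs)):
        raise ValueError("all inputs must cover the same number of years")
    if not partner_volumes:
        raise ValueError("at least one projected year is required")

    # Proportional cost: the partner pays its share of each year's shared cost.
    yearly_costs = [
        cost * own / total
        for own, total, cost in zip(partner_volumes, total_volumes, infrastructure_costs)
    ]

    # Average per-year cost over the projection period.
    average_yearly = sum(yearly_costs) / len(yearly_costs)

    # One-time fee, assumed here to be a fixed fraction of the average yearly cost.
    one_time_fee = one_time_fraction * average_yearly
    return average_yearly, one_time_fee
```

For example, calling `estimate_partner_costs([100_000, 200_000], [1_000_000, 2_000_000], [100_000.0, 200_000.0])` models a partner holding one tenth of the content in each of two years. Dividing a year's cost by the partner's volume count for that year gives a per-volume figure that can be compared with the "< $0.15 per volume" on the slide.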

21 Repository and Content Sustainable curation of library content Community Building Support content beyond books and journals Grow

22 Sustainable Curation: Fund repository with base funds from member institutions. Two active storage sites with backup. Based on standards and best practices for archival repositories – OAIS – METS/PREMIS – Ingest Validation (GROOVE) – Periodic fixity checks using MD5. Rights Database
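A periodic MD5 fixity check, as named on this slide, can be illustrated with a minimal sketch. It assumes a manifest that maps file paths to the MD5 checksums recorded at ingest. The manifest format and the function names are hypothetical and do not describe HathiTrust's own tooling.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_fixity(manifest):
    """Return the paths whose current MD5 differs from the recorded one.

    manifest: dict mapping file path -> expected MD5 hex digest
    (a hypothetical format chosen for this sketch).
    Missing files are reported as failures too.
    """
    failures = []
    for path, expected in manifest.items():
        file_path = Path(path)
        if not file_path.is_file() or md5_of_file(file_path) != expected.lower():
            failures.append(path)
    return failures
```

Running `check_fixity` on a schedule and reporting any returned paths is the basic pattern. A real repository would also log each check and repair damaged files from a replica.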

23 Sustainable curation of library content OAIS Reference Model [diagram; component labels:] GRIN, Internal Data Loading; Google, [OCA], In-house Conversion; MARC record extensions (Aleph), Rights DB; Page Turner, HathiTrust API, OAI, GeoIP DB, CNRI Handles, [Solr]; METS/PREMIS object, TIFF G4/JPEG2000, OCR, MD5 checksums; METS object, PNG, OCR, PDF; Isilon Site Replication, TSM, MD5 checksum validation; GROOVE (JHOVE)

24 Community Building Shared Collection Development – Unified core collection – Certification of volumes

25 Support content beyond books and journals Born-digital Native XML Encoded Text

26 Grow

27 Services Catalog Page Turner Bibliographies and Saved Collections Users with Print Disabilities Computational Research (sample datasets) Ability to build applications with Library content Large scale Search

40 Upcoming Plans: Expand partnership; Begin work on shared collection development and de-duplication; Complete Data API; Create Development Sandbox; Configure for Computational Research; WorldCat Local Catalog; Prepare for TRAC

41 Thank you very much! jjyork@umich.edu hathitrust-info@umich.edu http://www.hathitrust.org http://catalog.hathitrust.org

