Presentation is loading. Please wait.

Presentation is loading. Please wait.

What is HathiTrust and Why is it relevant to research libraries? Sourcing and Scaling brought to the collective collection.

Similar presentations

Presentation on theme: "What is HathiTrust and Why is it relevant to research libraries? Sourcing and Scaling brought to the collective collection."— Presentation transcript:

1 What is HathiTrust and Why is it relevant to research libraries? Sourcing and Scaling brought to the collective collection

2 What is HathiTrust?

3 HathiTrust is attempting nothing short of creating a comprehensive preservation repository of published literature, primarily though not exclusively through digitization.


5 Content Distribution 6,947,494 – Total 1,567,058 – Public Domain * As of October 11, 2010

6 Language Distribution (1) The Top 10 languages make up close to 86% of total content * As of October 11, 2010

7 Language Distribution (2) The next 40 languages make up ~14% of total * As of October 11, 2010

8 Dates * As of October 11, 2010

9 Originating Institution * As of October 11, 2010

10 Content over time * As of October 11, 2010

11 HathiTrust is about collections, writ large, and not about Google digitization.

12 The first order of HathiTrust business is long-term preservation of this digital content, and we dont believe in preservation without access.




16 HathiTrust takes the business of sustainability seriously, with regard to governance, finances and technology.

17 Governance HathiTrust Executive Committee Strategic Advisory Board Budget/Finances Decision-making Guidance on Policy, Planning

18 Executive Committee Paul Courant, University Librarian and Dean of Libraries, UM Laine Farley, Executive Director, CDL John King, Vice Provost for Academic Information, UM Paula Kaufman, University Librarian and Dean of Libraries, UI Brian Schottlaender, University Librarian, UCSD Ed Van Gemert, UW – Madison (ex officio) Brenda Johnson, Dean of Libraries, IU Brad Wheeler, Chief Information Officer, IU John Wilkin, Executive Director of HathiTrust and Associate University Librarian, LIT, UM

19 Strategic Advisory Board Ed Van Gemert (Chair), UW - Madison John Butler, AUL for Information Technology, U Minn Patricia Cruse, Director, Preservation, CDL Bernie Hurley, Director, Library Technologies, UC Berkeley R. Bruce Miller, University Librarian, UC - Merced Sarah Pritchard, University Librarian, Northwestern Paul Soderdahl, Director, LIT, U Iowa John Wilkin, Executive Director, HathiTrust (ex officio) Robert Wolven, Columbia University

20 … and the future October 2011 Constitutional Convention Delegates from institutions that are participating by October 31 st, 2010 Weighted voting model to reflect varying levels of investment Formal review of HathiTrust by SAB in early 2011 (in time for Constitutional Convention) Framing the next stage of governance, refinement of new cost model

21 all of the reasonable costs of sustaining the archive including replacement costs and a sort of insurance policy are combined to create a sort of atomic cost unit (in this case, a GB of content)

22 How much does it cost?

23 e-Commerce Print on Demand Content Ingest Transformation Validation Content Access PageTurner Collection Builder Large-scale Search Bibliographic Catalog Research Center APIs Quality Assurance Quality Review Content Certification User Services Usability User support (helpdesk) Outreach Project website Monthly newsletter Papers and presentations Communication with potential partners Surveys, general inquiries Repository evaluation and audit (e.g., DRAMBORA, TRAC) Legal Risk management (use of materials) Partner agreements Advocacy Governance Budget, Finances Decision-making Policy Planning Enterprise Management Communication and Coordination with partner institutions Project management Repository Administration Hardware configuration and maintenance Web and application server configuration and maintenance Security Permissions Logging Repository Administration Data management (content storage, backup, integrity checks, deletion) Hardware selection and replacement Content and Metadata specifications Disaster Recovery Processes for ensuring content integrity Rights Management Copyright determination Copyright review Copyright information management (database) Rightsholder permissions Bibliographic Data Management Entity description (record-level) Object identification (item-level) Data availability Collection Development Digital Expansion beyond books and journals (born-digital, images and maps, audio) Selection of content (for non- Google volume ingest and pilots projects) Print Cloud Library (effect of digital on print) Financial contributions of partners HathiTrust Functional Framework

24 Mission and goals Mission: to contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge. Goals – To build a reliable and increasingly comprehensive digital archive of library materials converted from print that is co-owned and managed by a number of academic institutions. – To dramatically improve access to these materials in ways that, first and foremost, meet the needs of the co-owning institutions. – To help preserve these important human records by creating reliable and accessible electronic representations. – To stimulate redoubled efforts to coordinate shared storage strategies among libraries, thus reducing long-term capital and operating costs of libraries associated with the storage and care of print collections. – To create and sustain this public good in a way that mitigates the problem of free-riders. – To create a technical framework that is simultaneously responsive to members through the centralized creation of functionality and sufficiently open to the creation of tools and services not created by the central organization.

25 A global change in the library environment June 2010 Median duplication: 31% June 2009 Median duplication: 19% Academic print book collection already substantially duplicated in mass digitized book corpus

26 an ARL institution that wishes to use HathiTrust as part of a larger strategy part of a cloud strategy

27 The HathiTrust Business Model, v.2: Costs based on holdings overlap and the perceived benefits we derive

28 For public domain volumes: (PD*X*C)/N For a given in­copyright volume: IC=(C*X)/H new cost model:

29 sharing in the curation; having a voice in shaping the future

30 Collective digital curation driving down costs reducing bibliographic indeterminacy making meaningful decisions about formats and quality increasing discoverability consolidating development talent improving strength of archiving

31 Partner Status As of October 11th – 33 Contributing partner libraries – 1 Sustaining partner library In final stages of contract review or pending announcement – 5 Contributing partner libraries – 6 Sustaining partner libraries

32 scale!

33 transfer resource[s] away from 'infrastructure' and towards user engagement. Lorcan Dempsey

34 Inviting participation…

Download ppt "What is HathiTrust and Why is it relevant to research libraries? Sourcing and Scaling brought to the collective collection."

Similar presentations

Ads by Google