HATHITRUST A Shared Digital Repository HathiTrust: Key Concepts and Issues in Managing the Digital Archive ICPSR Summer Workshop “Curating and Managing.

Slides:



Advertisements
Similar presentations
HathiTrust Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.
Advertisements

HATHI TRUST A Shared Digital Repository Building A Future By Preserving Our Past The Preservation Infrastructure of HathiTrust Digital Library Jeremy York.
HATHI TRUST A Shared Digital Repository HathiTrust Digital Library Is There A Past In Your Future? Princeton University February 2010.
HathiTrust: Building the Universal Collection John Wilkin 18 May 2009.
This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library.
HATHI TRUST A Shared Digital Repository Unpacking HathiTrusts New Cost Model Jeremy York Project Librarian, HathiTrust SUNY July 15, 2011.
HATHI TRUST A Shared Digital Repository HathiTrust Overview Julie Bobay, Heather Christenson, and John Wilkin April 12, 2011.
Building the Universal Library: The Promise and Challenges of HathiTrust John Wilkin 2 April 2009.
HathiTrust Sharing a Federal Print Repository: Issues and Opportunities May 25, 2011 Heather Christenson.
HATHI TRUST A Shared Digital Repository HathiTrust How We Can Make A Difference Jeremy York Yale University November 3, 2010.
What is HathiTrust and How Can it Make a Difference? Sourcing and Scaling brought to the collective collection.
HATHI TRUST A Shared Digital Repository HathiTrust 101 John Wilkin and Jeremy York August 27, 2010.
HATHI TRUST A Shared Digital Repository HathiTrust, Collections, and Collaboration COLD 2011 Spring Meeting Jeremy York May 20, 2011.
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
National Institutes of Health U.S. Department of Health and Human Services The PEPH Resource Center: A New, More Convenient Login.
HATHITRUST A Shared Digital Repository Update on Developments and Activities UM Selectors October 9, 2012 Jeremy York, Project Librarian, HathiTrust.
HathiTrust and the Ecology of Shared Collections Paul N. Courant 21 May 2009.
HATHITRUST A Shared Digital Repository We’re Preserving the Past, What About the Present? NISO Webinar: Ensuring the Preservation of E-Books May 23, 2012.
What’s Next for HathiTrust?. We’re Growing Up! Partnership Arizona State University Baylor University Boston University California Digital Library Columbia.
HATHITRUST A Shared Digital Repository HathiTrust current work, challenges, and opportunities for public libraries Creating a Blueprint for a National.
HATHITRUST A Shared Digital Repository HathiTrust as a Model for Preservation and Access Jeremy York Media Preservation Conference April 17, 2013.
HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise.
HATHITRUST A Shared Digital Repository Bibliographic Metadata and HathiTrust ALCTS CaMMS Catalog Management Interest Group Meeting American Library Association.
HATHITRUST A Shared Digital Repository Collective Stewardship through HathiTrust Digital Library African Studies in the Digital Age November 12, 2014 Mike.
HATHITRUST A Shared Digital Repository HathiTrust METS and PREMIS October 25, 2011 Jeremy York Project Librarian, HathiTrust.
HATHITRUST A Shared Digital Repository HathiTrust on the Move A Growing Partnership Taking Stock and Looking Ahead National Library of Medecine October.
HATHITRUST A Shared Digital Repository HathiTrust: A Second Life for Library Collections Jeremy York Exploring Humanities Cyberinfrastructure April 30,
HATHITRUST A Shared Digital Repository HathiTrust: The Collection and Its Uses NEFLIN Webinar - November 7, 2013 Jeremy York, Assistant Director, HathiTrust.
HATHITRUST A Shared Digital Repository A Preservation Infrastructure Built to Last: Preservation, Community, and HathiTrust UNESCO Memory of the World.
HATHITRUST A Shared Digital Repository How Can Digital Collections Support Shared Print Initiatives? The HathiTrust Print Monograph Archive Planning Task.
HATHITRUST A Shared Digital Repository Big Collections in an Era of Big Copyright: Practical Strategies for Making the Most of Digitized Heritage Jeremy.
HATHITRUST A Shared Digital Repository HathiTrust Overview: Partnership and Services Jeremy York Wesleyan University Web Presentation February 18, 2014.
HATHITRUST A Shared Digital Repository The HathiTrust Digital Repository: Under the hood SI 625 April 20, 2015 Jeremy York, Assistant Director, HathiTrust.
HATHITRUST A Shared Digital Repository Why Digitize? or The Limits of Preservation 2014 TEI/DHCS Plenary Session Evanston, IL Mike Furlough Executive Director,
HATHITRUST A Shared Digital Repository Digital Humanities in HathiTrust: Research At Any Scale Jeremy York Digital Humanities and the Futures of Japanese.
HATHITRUST A Shared Digital Repository HathiTrust Past, Present, and Future A Brief Introduction.
HATHITRUST A Shared Digital Repository More, Better, Together: HathiTrust Accomplishments and Aspirations The Researcher of Tomorrow Universidad Complutense.
CILogon and InCommon: Technical Update Jim Basney This material is based upon work supported by the National Science Foundation under grant numbers
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
HATHITRUST A Shared Digital Repository Collaborating Globally, Planning Locally HathiTrust and New Opportunities in Collection Management GWLA/UNM: Emerging.
PLASC Member Survey: Who’s our crowd? Conducted by Stephanie Bennett and Adrienne Pruitt Presented at the PLASC annual business meeting, Friday, August.
1 The Partnership Challenge Higher education’s missions are realized in increasingly global, collaborative, online relationships –Higher educations’ digital.
HATHITRUST A Shared Digital Repository HathiTrust Infrastructure and Information Organization November 7, 2011 Jeremy York Project Librarian, HathiTrust.
Map Review. California Kentucky Alabama.
June, 2012 Art Mandel.  Multiple acceptances to Ivy League Schools  Multiple acceptances to the “Most Competitive” colleges and universities  State.
UPDATED KUALI STATISTICS. KUALI FOUNDATION MEMBERS – INSTITUTIONAL Australian National University Boston College Boston University Brock University Brown.
HATHITRUST A Shared Digital Repository HathiTrust and TRAC DigitalPreservation 2012 July 25, 2012 Jeremy York, Project Librarian, HathiTrust.
Directions: Label Texas, Arkansas, Louisiana, Mississippi, Tennessee, Alabama, Georgia, Florida, South Carolina, North Carolina, Virginia--- then color.
Harrison’s Top 25 1.Florida State 2.Alabama 3.Oregon 4.Oklahoma 5.South Carolina 6.Michigan State 7.Ohio State 8.Auburn 9.Baylor 10.Georgia 11.UCLA 12.LSU.
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
Author(s): Jeremy York, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Noncommercial–Share.
HATHITRUST A Shared Digital Repository HathiTrust and the Future of Research Libraries American Antiquarian Society March 31, 2012 Jeremy York, Project.
HATHITRUST A Shared Digital Repository Your Library, Now Online! Putting HathiTrust in the Context of Traditional (and New) Library Services MCLS Webinar.
HATHITRUST A Shared Digital Repository Institution Uses of HathiTrust Jeremy York University of Maine May 24, 2013.
STATE of the STATES Evaluating US Regional AV siobhan hagan, university of baltimore lynette stoudt, georgia historical society anne wells, chicago film.
HathiTrust: Collaboration in Building the Universal Collection John Wilkin 1 October 2009.
US MAP TEST Practice
HATHITRUST A Shared Digital Repository HathiTrust Large Digital Libraries: Beyond Google Books Modern Language Association January 5, 2012 Jeremy York,
Barbara Preece ICOLC, April Mark Sandler Center for Library Initiatives Chicago Illinois Indiana Iowa Michigan Michigan State Minnesota Northwestern.
An Overview of the Platform
Collaboration: to work jointly with others towards a common goal Or the whole is greater than the sum of its parts Lisa B. German Library Faculty Organization.
HathiTrust: A valuable and visionary Partnership.
HATHITRUST A Shared Digital Repository ALA CopyTalk: CRMS The Copyright Review Management System September 1, 2016 Melissa Levine, Lead Copyright Officer,
Introducing Students to the Locker
HathiTrust Digital Library Interface and Services
Faculty Salary Study Comparison to AAU Data Exchange Institutions
HathiTrust Copyright Review
EDUCAUSE 2011 Philadelphia Convention Center 10/19/11
WASHINGTON MAINE MONTANA VERMONT NORTH DAKOTA MINNESOTA MICHIGAN
From Innovation to Commercialization Access to Data
Presentation transcript:

HATHITRUST A Shared Digital Repository HathiTrust: Key Concepts and Issues in Managing the Digital Archive ICPSR Summer Workshop “Curating and Managing Research Data for Re-use” August 1, 2013 Jeremy York, Project Librarian, HathiTrust Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.Creative Commons Attribution Unported License

Outline What is HathiTrust / What are we trying to accomplish Repository management – What keeps us running Assessment

What is HathiTrust

Partnership Arizona State University Baylor University Boston College Boston University Brandeis University Brown University California Digital Library Carnegie Mellon University Columbia University Cornell University Dartmouth College Duke University Emory University Florida State University Getty Research Institute Harvard University Library Indiana University Iowa State University Johns Hopkins University Kansas State University Lafayette College Library of Congress Massachusetts Institute of Technology McGill University` Michigan State University New York Public Library New York University North Carolina Central University North Carolina State University Northwestern University The Ohio State University The Pennsylvania State University Princeton University Purdue University Stanford University Syracuse University Texas A&M University Tufts University Universidad Complutense de Madrid University of Alberta University of Arizona University of Calgary University of California Berkeley Davis Irvine Los Angeles Merced Riverside San Diego San Francisco Santa Barbara Santa Cruz The University of Chicago University of Connecticut University of Delaware University of Florida University of Houston University of Illinois University of Illinois at Chicago The University of Iowa University of Kansas University of Maryland University of Miami University of Michigan University of Minnesota University of Missouri University of Nebraska- Lincoln The University of North Carolina at Chapel Hill University of Notre Dame University of Oklahoma University of Pennsylvania University of Pittsburgh University of Utah University of Vermont University of Virginia University of Washington University of Wisconsin- Madison Utah State University Vanderbilt University Virginia Tech Wake Forest University Washington University Yale University Library

Digital Repository Launched 2008 Initial focus on digitized book and journal content – 10.7 million total volumes – 5.6 million book titles – 281,000 serial titles – 3.4 million public domain (~31%)

Mission To contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge

Universal Library Common Goal Single Entity, Many Partners HathiTrust

Collections and Collaboration Comprehensive collection -Preservation…with Access Shared strategies – Copyright – Collection management, development – Preservation – Discovery / Use – Bibliographic Indeterminacy – Efficient user services Public Good

Repository Management

Underlying ideas Community Scale Access and Preservation Openness

Community

OAIS TRAC METS and PREMIS Repository Practices – Content package – Validation – Identification – Scale

Scale Mission – To contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge Strategy – “Co-owned and managed”

Preservation and Access “Light” archive benefits – Access to materials – Checks on integrity – Best chance for content to be used and valued, preserved

Openness Repository centralized...open Formats Software Organizational structure

Underlying ideas

Experience

Repository Philosophy/Design OAIS/TRAC Consistency Standardization Simplicity (in design, not function) Practicality Sustainability

Source Bibliographic Data Content Package Michigan Indiana Bib Data Data Management Rights Data Storage Access Ingest Catalog Full-text Search PageTurner APIs Collections Holdings Data Datasets

Source Bibliographic Data Content Package Michigan Indiana Bib Data Data Management Rights Data Storage Access Ingest Catalog Full-text Search PageTurner APIs Collections Holdings Data Datasets

Content Types and number of formats – ITU G4 TIFF – JP2 – Unicode (with and without coordinates) Open, meet community standards Widely supported on a number of platforms Confidence in preservation and migration Transform to access formats

Content Package images Source METS text HT METS Zip

Source Bibliographic Data Content Package Bib Data Data Management Rights Data Storage Access Ingest Catalog Full-text Search PageTurner APIs Collections Holdings Data Datasets Michigan Indiana

Source Bibliographic Data Content Package Bib Data Data Management Rights Data Storage Access Ingest Catalog Full-text Search PageTurner APIs Collections Holdings Data Datasets Michigan Indiana

Storage Reliability – ensure integrity Redundancy – in single and multiple sites Scalability – including ease of management Accessibility – for repository processes and services Platform-independence – for data/object management

Architecture & Management images bib data bib data Source METS text HT METS../uc1/pairtree_root/b3/54/34/86/b b zip b mets.xml

Source Bibliographic Data Content Package Bib Data Data Management Rights Data Storage Access Ingest Catalog Full-text Search PageTurner APIs Collections Holdings Data Datasets Michigan Indiana

Assessment

CRL Audit Why – Value Community Standards – Accountability, Openness, Transparency Desire to know how we were doing, and let the community know Audit – Guided by criteria included in TRAC, as well as other metrics developed by CRL – HathiTrust’s practices are sound…appropriate to the content being archived and the general needs of the CRL community.

What was involved? Timeline – Data gathering: November December 2010 – Site visit May 2010 – Results in March 2011 Logistics – Question by , documentation – Phone conversations – Staff: Project Librarian, Digital Preservation Librarian, Executive Director

Results Organizational Infrastructure (2) – Mission statement, succession plan, staff, assessment, accountability, business plan, agreements Digital Object Management (3) – Properties preserved, SIP, AIP, validation, naming conventions, identifiers, understandability, preservation strategies, logging, access policies Technologies Technical Infrastructure Security (4) – Hardware, software, error-handling, change management, security, staff roles, disaster preparedness

Key Issues Rights and ownership of HathiTrust enterprise assets Succession plan Clarify and strengthen quality assurance and print archiving components of the HathiTrust program

Future Work Disaster Recovery Change Management – Moving to new formats: image, audio, born-digital Certification updates Documentation –

Thank you!

How to find out more About: Twitter: Facebook: Monthly newsletter: – – RSS Contact us: Blogs: – Large-scale Search – Perspectives from HathiTrust