CNI 17 April 2007 HASTAC 23 Feb 2007 1 An Update on Google Digitization at the University of Michigan Actually, a 50 slide kitchen sink and everything.

Slides:



Advertisements
Similar presentations
The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Advertisements

An update on Google Book search digitization at the University of Michigan … the agreement and plans for work between Google and the.
Strategic issues for digital projects... …or, what are we doing here?
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
New Service Models. Why? Simply put: we cant do all you tell us you want and need Service and collection models that defined excellence in academic libraries.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
While You Were Out: How Students are Transforming Information and What it Means for Publishing Kate Wittenberg The Electronic Publishing Initiative at.
What is information literacy? Information Literacy - the definition "Information literacy is knowing when and why you need information, where to find.
Providing collections, tools and services for digital humanities A national library perspective Clément Oury Head of Digital Legal Deposit Bibliothèque.
The Oxford-Google Digitization Project* Michael Popham Oxford Digital Library * Rules of commercial confidentiality apply to this presentation!
October 24, 2006Merit Technical Staff Meeting1 The Google Project at the University of Michigan Perry Willett Head, Digital Library Production Service.
E-Live The UM-Google Digitization Deal What it is, how we got there, and what it will mean for the UM.
Libraries, MOOCs And the State of Online Education Judith Thomas Director, Arts and Media Services University of Virginia Library.
Digital archival storage for the University of Michigan Library collections.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. Prepared.
Constructing the Memories Creating a Digital Collection Linda J. White, Digital Project Coordinator.
John OckerbloomDec. 6, 2002 Supporting learning at the library Towards integrating LMS and digital library technology at Penn John Mark Ockerbloom CNI.
School of something FACULTY OF OTHER University Library The Library’s Digital Repository or Whatever happened to MIDESS? Michael Emly Jonathan Ainsworth.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
What the Hell Were We Thinking!?!?! UM-Google Digitization Deal What is it? Where do things stand?
JSTOR & OCR - A Case Study Kiffany Francis. What is JSTOR? “JSTOR is a not-for- profit organization with a dual mission to create and maintain a trusted.
National Aeronautics and Space Administration Implementing DSpace at NASA Langley Research Center 1 Greta Lowe Librarian NASA Langley Research Center
Massively Digitizing UC Library Collections Google, Microsoft, and More Learning in Retirement Libraries – The Intersection of Tradition and Innovation.
The National Digital Newspaper Program (NDNP) An NEH/LC Collaborative Program Enhancing access to historical newspapers Release: September 2006.
Jonathan Band Jonathan Band PLLC Google Library Project: Copyright Issues.
HathiTrust – How To By Dr. Rob McGeachin 20 th Annual AgNIC Meeting May 7, 2015.
1 EDMS 101 Speaker: Monica Crocker, DHS EDMS Coordinator Overview of current project(s) Objective of this section: This session outlines EDMS fundamentals.
Sample Search ___________________________________ Search Results Abstract ___________________________________ Full Text Online Catalog WorldCat Assessment.
The Luminary Library Experience: Large scale digitization at Toronto Public Library Agenda Introduction Background The project Current status Implementation.
HathiTrust Digital Library. Overview ›Began in 2008 ›Large scale digital preservation repository ›Partnership of major research libraries ›Focus on both.
Open Journal Systems Project or The UB Libraries as Publisher: Information and Roles for Liaison Librarians Charles Lyons Library Liaison Summit December.
DSpace: Introduction and Starting an Institutional Repository
University Libraries Library Systems Office. Life on MARS Mason Archival Repository Service Dorothea Salo Digital Repository Services Librarian Library.
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
Login / Upload / Share Deposit your scholarly research - it’s as easy as 1, 2, 3 MAIN MESSAGE key reasons enumerated ->please read speaker notes id / who.
Mass digitisation? Astrid Verheusen Projectmanager Research & Development Division National library of the Netherlands LIBER-EBLIDA Workshop on Digitisation.
The New Digital World and the Transformation of Information and Libraries Patricia L. Thibodeau Associate Dean Library Services & Archives Oct. 26, 2011.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
PROMOTING TRANSPARENCY THROUGH LIBRARY – GOVERNMENT COLLABORATION Presentation to BCLA Government and Legal Information Gathering May 13, 2011.
University of California Mass Digitization Projects Update Users Council Annual Meeting May 8, 2008 Heather Christenson, Mass Digitization Project Mgr,
The University Library in the Campus Strategic Goals, Initiatives and Metrics Fall 2013.
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
Technical Services Involvement with New Services: Are there Silos Within the Library? Sharon Wiles-Young Director of Library Access Services ALCTS- Heads.
JENN RILEY, HEAD, CAROLINA DIGITAL LIBRARY AND ARCHIVES WHAT EVERY LIBRARIAN NEEDS TO KNOW ABOUT DIGITAL COLLECTIONS.
The Development of Open Access to ETDs in Canada: a Partnership between Canadian Universities and Library and Archives Canada 8 th International Symposium.
2CUL: EMERGING MODEL OF DEEP COLLABORATION? Anne R. Kenney ASERL Fall 2010 Membership Meeting.
Best Practices for Digital Imaging and Metadata Roy Tennant The Library, University of California, Berkeley
INTELLECTUAL RIGHTS AND HISTORIC CORPORA Mark Sandler University of Michigan ICOLC, March, 2003.
The UM-Google Digitization Deal What it is, how we got there, and what it will mean for the UM.
The Oxford-Google Digitization Project* Michael Popham Oxford Digital Library * Rules of commercial confidentiality apply to this presentation!
Envisioning Culture Shift: Building a Strategic Framework for Service Oriented Cataloging Jennifer O’Brien Roper Head, Cataloging and Metadata Services.
1/16/2016I. Revels Digital Imaging Workshop 1 Selection Considerations For Digital Imaging Projects.
DSpace An Open Source Dynamic Digital Repository Xizi (Cecilia) Cai IS565 Spring 2013 DL Topic Presentation.
The Future of Scholarly Communication & the Role of Libraries Roy Tennant eScholarship, The California Digital Library.
Leveraging the Expertise of our Staff and the Information Resources We Manage MIT Libraries Visiting Committee April 13, 2005.
Scholarly works, research, reports, publications What is an Institutional Repository? Focus on Research Groups Promoting Physics Faculty, Students and.
Making the Case for Curation: The Practical Experiment of DSpace Managing Digital Assets February 5-6, 2005 Charleston, SC Ann J. Wolpert, Director of.
Grant Writing for Digital Projects September 2012 IODE Project Office IODE Project Office Oostende, Belgium Oostende, Belgium Sustainability and.
Sarah Beaubien Scholarly Communications Outreach Coordinator Grand Valley State University Libraries Open Education Materials: Collecting,
Egerton University, Njoro 28 th April, 2009 OER Africa An introduction.
Redefining the Library’s Role through an Institutional Repository Sharon Mader, Dean Jeanne Pavy, Scholarly Communications Librarian Earl K. Long Library.
The New Now: Institutional Repositories and Academia Institutional Repository USM April 17, 2015 Marilyn Billings Scholarly Communication Librarian.
A robust scholarly repository that puts your UVA research center stage
Users and Digital Collections
Beyond Description: Metadata for Catalogers in the 21st Century
Policy Frameworks: building a firm foundation for your IR
digital archival storage
The Bentley Digital Media Library
AUC’s Role In Facilitating Access To Knowledge In The Arab World
Presentation transcript:

CNI 17 April 2007 HASTAC 23 Feb An Update on Google Digitization at the University of Michigan Actually, a 50 slide kitchen sink and everything in it (Much of it stolen from John Wilkin) Paul N.Courant

CNI 17 April 2007 HASTAC 23 Feb Outline of the talk Background--what it is and how we got there, Status report--what’s been done, what’s being done, and how the work is done GBS and MBooks--two access systems Implications and questions Conversation

CNI 17 April 2007 HASTAC 23 Feb Part 1 Background

CNI 17 April 2007 HASTAC 23 Feb Beginnings A conversation one day in 2002 Intensive planning and negotiations ( ) Equipment design and refinement ( ) Early deployment and growth ( )

CNI 17 April 2007 HASTAC 23 Feb Michigan’s deal What will be digitized? –7m (University Library, bound print) vols Google covers costs Make searchable on Google What about the legal issues? What happens to the books? We get a copy of the images (more on this later) Contract available online

CNI 17 April 2007 HASTAC 23 Feb UM prior to Google  Digital initiatives going back to 1980s  UMLibText  Humanities Text Initiative  PEAK, JSTOR, Making of America

CNI 17 April 2007 HASTAC 23 Feb University Library mission statement: “To support, enhance, and collaborate in the instructional, research, and service activities of faculty, students, and staff, and contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge.” Why engage in this partnership?

CNI 17 April 2007 HASTAC 23 Feb Part 2 Status

CNI 17 April 2007 HASTAC 23 Feb Production status Google began in July, 2004 UM receiving content continuously Large amounts of UM content went into GBS in November, 2005 Production ramp-up continues –New facility –Scaling (on pace to meet the 6-year target) –Pattern of doubling growth

CNI 17 April 2007 HASTAC 23 Feb But how do they do it????

CNI 17 April 2007 HASTAC 23 Feb THE GOOGLE WORKSTATION (CONFIDENTIAL)

CNI 17 April 2007 HASTAC 23 Feb THE ANN ARBOR WORK GROUP

CNI 17 April 2007 HASTAC 23 Feb Accomplishments  Getting books out and back - developed workflows - back on the shelf within 5-7 days  Developed mechanisms for receiving content from Google  Image quality - library preservation standards  Pre-/post-condition survey

CNI 17 April 2007 HASTAC 23 Feb About the files Returned to UM: package per volume, id’d by barcode –600dpi TIFF ITU G4 (bitonal) for print –300dpi JPEG2000 color/grayscale for illus. –File naming conventions corresponding to UM specs –Checksums –Production notes –OCR (UTF-8 text files) –Page metadata: page numbers, page features, chapter start Quality control –Ongoing improvement of hardware/engineering –Image quality good and improving What is secret and why? –Technology –Numbers

CNI 17 April 2007 HASTAC 23 Feb Workflow diagram

CNI 17 April 2007 HASTAC 23 Feb Accomplishments (2)  September 2006 UM released local implementation - MBooks  System for accessing our digital collections  Access through MIRLYN - UM Library OPAC  Created Rights Database to manage access

CNI 17 April 2007 HASTAC 23 Feb Part 3 Two access systems

CNI 17 April 2007 HASTAC 23 Feb Bunker Hill

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb Record for out-of-copyright work

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb Search within the book Walt Whitman

CNI 17 April 2007 HASTAC 23 Feb Record for in-copyright work

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb In copyright: search within text Walt Whitman

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb

CNI 17 April 2007 HASTAC 23 Feb Why would UM put the materials online? Responsibility for the “archive” Michigan “audience” more specific and thus more specialized… –Rights that Google may not have Current Section 108 provisions Services for disabled users Negotiated rights? –Functions that Google may not want to support More flexible displays More powerful citation tools Power searches? Data mining and other research applications

CNI 17 April 2007 HASTAC 23 Feb Part 4 Big questions and implications

CNI 17 April 2007 HASTAC 23 Feb Key contract provisions, pt. 1: the archive Use of U of M Digital Copy on U of M Website. U of M shall have the right to use the U of M Digital Copy, in whole or in part at U of M's sole discretion, as part of services offered on U of M's website. U of M shall implement technological measures … to restrict automated access to any portion of the U of M Digital Copy …. U of M shall also make reasonable efforts … to prevent third parties from (a) downloading or otherwise obtaining any portion of the U of M Digital Copy for commercial purposes, (b) redistributing any portions of the U of M Digital Copy, or (c) automated and systematic downloading from its website image files from the U of M Digital Copy…. Cf. Google: driven by a business model

CNI 17 April 2007 HASTAC 23 Feb Key contract provisions, pt. 2: one archive or many? Use of U of M Digital Copy in Cooperative Web Services. Subject to the restrictions set forth in this section, U of M shall have the right to use the U of M Digital Copy, in whole or in part at U of M's sole discretion, as part of services offered in cooperation with partner research libraries such as the institutions in the Digital Library Federation….

CNI 17 April 2007 HASTAC 23 Feb Creating a cooperative enterprise Original vision –Greater than the sum of our parts –Effort tied to the mission of our scholarly enterprise(s) –Extensible framework for content and services CIC discussions –All 13 institutions participating –Two coordinated instances of replicated content –A foundation for creating shared definition Next?

CNI 17 April 2007 HASTAC 23 Feb Transformative implications (2004) Broad, efficient, democratizing access Access as driver for … Exaggerating and resolving IP issues Creation of cooperative “universal” library Exacerbating paradox of “library as place” Facilitating “specialization” (ceding “generalist” role to Google) Freeing up resources for related issues (e.g., institutional repositories, scholarly communication)

CNI 17 April 2007 HASTAC 23 Feb Implications for users--the obvious I just came across a copy of the classic philosophy text "Principia Ethica" in your catalog of books and what I want to tell you is WOW!!!... This is an extraordinary service to the public.... What is interesting, though, is that for those books you have online that are different editions from the one I own, I am inclined to buy the books anew so as to be able to refer to the copy you have online... I know of no better method for doing textual analysis than by using your service. I am writing a paper on the history of statistics and was disheartened when I picked up Claude Bernard's Introduction to Experimental Medicine - because it did not have an index. But then I found you.

CNI 17 April 2007 HASTAC 23 Feb Use and users, more Those who “rediscover” their own collections Those who find what they would not have found otherwise Not new ways of reading, but efficiencies only imagined

CNI 17 April 2007 HASTAC 23 Feb IP Issues Orphaned works 108 The immovable object and the irresistible force What might a reasonable arrangement look like? –Let books be books?

CNI 17 April 2007 HASTAC 23 Feb Universal Library Borges’s ‘universal library’?: “This ideal, although unrealizable, has influenced and continues to influence librarians and others.” –Now it is realizable, but here we are blocked by the rights environment. Again, we should be able to find a Pareto- improving trade

CNI 17 April 2007 HASTAC 23 Feb Library as Place So, what are we going to do in the library? –So far, we seem to be plenty busy –We might even work on search tools that can be used by the ordinary scholar working outside (inside?) her own narrow field Will we attract users with expertise, with the promise of propinquity to other users?

CNI 17 April 2007 HASTAC 23 Feb Specialization Maybe we can specialize in the academic library way of doing things, and let Google take care of the other audiences. And maybe, given the nature of our business and the lives of undergraduates, not.

CNI 17 April 2007 HASTAC 23 Feb Freeing up resources Space, the final frontier… Institutional repositories Shared physical storage Support publication and scholarship in the library with library expertise

CNI 17 April 2007 HASTAC 23 Feb What’s next?  Storage - RFP  Move from storage to other UM libraries  MBooks  Collections  correcting OCR???  Rights clearance  Services for persons with disabilities  Shared effort

CNI 17 April 2007 HASTAC 23 Feb Why does the University of Michigan support this project? At its essence, digitizing books and widening their exposure is about the public good. I believe it transcends debates about snippets, and copyright, and who owns what when, and rises to the very ideal of a university—particularly a great public university like Michigan. Our work is about the social good of promoting and sharing knowledge. As a university, we have no other choice but to make this happen. - Mary Sue Coleman Scholarship and Libraries in Transition March 2006

CNI 17 April 2007 HASTAC 23 Feb For further information…  Michigan Digitization Project  –Contract –Project FAQ –Link to information about UM access?  Mirlyn (UM online catalog)   

CNI 17 April 2007 HASTAC 23 Feb Two Futures