Mass digitisation? Astrid Verheusen Projectmanager Research & Development Division National library of the Netherlands LIBER-EBLIDA Workshop on Digitisation.

Presentation transcript:

Mass digitisation? Astrid Verheusen Projectmanager Research & Development Division National library of the Netherlands LIBER-EBLIDA Workshop on Digitisation Royal Library, Copenhagen, Denmark 25 October 2007

Koninklijke Bibliotheek – National Library of the Netherlands, LIBER-EBLIDA Workshop on Digitisation

What is mass digitisation?
- Millions of books rather than millions of pages
- No selection/no collections (digitise everything!)
- Mainly books
- Exclusion of special collections
- Low quality standards
- Ignore copyright issues
- Ignore long term preservation issues

Koninklijke Bibliotheek - Digitisation in the past
- Experience with digitisation since 1995
- Webexpositions / highlights of collections
- Small-scale digitisation projects
- Mainly visually attractive images
- Emphasis on techniques / trial and error
- Exploration of possibilities
- Co-operation on a small scale


Koninklijke Bibliotheek - Digitisation
Shift in emphasis:
- From highlights to larger collections
- Project based
- (Inter)national co-operation
- Established methods and techniques
- Awareness of digital preservation
- More text material & audio/video
- Further exploration of possibilities: applications made with the digitised material

Memory of the Netherlands

Koninklijke Bibliotheek - present & future
Strategic plan: "Development of a national programme for the mass digitisation of sources for research in the humanities"
- Target audience
  - Scientific research
  - Public at large
- Development of standards and services
- Particular attention for digital preservation
- Preservation imaging
- No commercial partners for funding

Koninklijke Bibliotheek - present & future
Text digitisation
- Until recently: on a small scale
- Printed and typed sources (not handwritten)
- Issues differ from images
  - Structure / navigation
  - Conversion to full text (OCR)
  - Scanning from microfilm
  - Search & Retrieval

Koninklijke Bibliotheek - Projects

Project                                 Number of pages   Budget (M€)
Dutch parliamentary papers                                10.5
Dutch daily newspapers                                    12.5
Special collections – books before                        3.0
Radio news bulletins                                      0.5
Metamorfoze - preservation imaging      ?                 24
Atjeh                                                     0.3
Memory of the Netherlands                                 3.5
Total                                                     54.3
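The budget figures in the projects slide can be cross-checked against the stated total. The following is a minimal sketch, not part of the original presentation: the dictionary name `BUDGETS_MEUR` and the function `total_budget` are hypothetical, and the amounts are copied from the slide with decimal commas written as points.

```python
from decimal import Decimal

# Budget per project in millions of euros, as listed on the slide.
# The names below are illustrative; only the amounts come from the source.
BUDGETS_MEUR = {
    "Dutch parliamentary papers": Decimal("10.5"),
    "Dutch daily newspapers": Decimal("12.5"),
    "Special collections - books before": Decimal("3.0"),
    "Radio news bulletins": Decimal("0.5"),
    "Metamorfoze - preservation imaging": Decimal("24"),
    "Atjeh": Decimal("0.3"),
    "Memory of the Netherlands": Decimal("3.5"),
}

# Total stated on the slide, in millions of euros.
STATED_TOTAL_MEUR = Decimal("54.3")


def total_budget(budgets):
    """Sum the per-project budgets using exact decimal arithmetic."""
    return sum(budgets.values(), Decimal("0"))


if __name__ == "__main__":
    total = total_budget(BUDGETS_MEUR)
    print(f"Sum of project budgets: {total} M EUR")
    print(f"Matches stated total: {total == STATED_TOTAL_MEUR}")
```

Decimal arithmetic is used here so that amounts such as 0.3 add up exactly rather than picking up floating-point rounding.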

Koninklijke Bibliotheek - Issues
- Costs of digitisation: € 1.3 per page
- Costs of exploitation: millions per year from 2011 onwards
- Technical infrastructure
  - Storage (1 PB needed)
  - Processing 2 million files per month
- Search & retrieval is not effective enough
- Organisational infrastructure is not efficient
- The process is too slow, we want to digitise faster and more...
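The two rates on this slide (€ 1.3 per page and 2 million files per month) lend themselves to quick planning arithmetic. The sketch below is illustrative only: the function names and the example quantities are hypothetical, and it assumes the per-page cost and the monthly processing rate stay constant, which the presentation does not claim.

```python
import math
from decimal import Decimal

# Rates taken from the slide.
COST_PER_PAGE_EUR = Decimal("1.3")
FILES_PER_MONTH = 2_000_000


def digitisation_cost_eur(pages):
    """Estimated digitisation cost for a number of pages at a flat per-page rate."""
    return COST_PER_PAGE_EUR * pages


def months_to_process(files):
    """Whole months needed to process a number of files at the stated monthly rate."""
    return math.ceil(files / FILES_PER_MONTH)


if __name__ == "__main__":
    # Hypothetical quantities, chosen only to exercise the functions.
    example_pages = 1_000_000
    example_files = 5_000_000
    print(f"Cost for {example_pages} pages: EUR {digitisation_cost_eur(example_pages)}")
    print(f"Months for {example_files} files: {months_to_process(example_files)}")
```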

"We cannot slow down to make things perfect"
"The rising tide will lift all boats"
Mass Digitization: Implications for Information Policy. Report from "Scholarship and Libraries in Transition: A Dialogue about the Impacts of Mass Digitization Projects", symposium held on March 10-11, 2006, University of Michigan, Ann Arbor

Project management & Organization
Finance

Content: Selection & Preparation
Old approaches
- Much effort spent on selection
- Ignorance of copyright issues…
- Minute assessment of missing material
- Replacement of torn pages

Content: Selection & Preparation
New approaches
- Less effort on the selection process (integral collections)
- Negotiation/co-operation with publishing sector
- Limited effort on retrieving missing pages/issues
- Limited effort on restoration

Content: Digital imaging & metadata
Old approaches
- Very high quality images
- Capture as much detail from the original as possible
- Minimize damage to the original
- Master & access images
- Lossless compression (TIFF)
- Experiment with our own scanners

Content: Digital imaging & metadata
New approaches
- One format for both access and preservation
- New formats to save storage (JPEG2000)
- Outsource all imaging activities
- Consider .txt as a master…

Processing: Quality assurance
Old approaches
- High standards for quality assurance (often manual)
- Expensive Document Management System for quality control

Processing: Quality assurance
New approaches
- Not realistic to check quality for all files
- We need automatic quality assurance tools
- OCR often not involved in quality assurance

Search & Retrieval
Old approaches
- Find the best search engine
- Search in metadata
- Digitise text without OCR
- We decide what the user wants

Search & Retrieval
New approaches
- All text digitisation projects include OCR
- Search through millions of pages of text
- Experiment with tools for enhanced access & textmining
- Growing awareness that we have to involve our users

Storage
Old approaches
- Storage on CD-ROM and DVD
- Master files in e-Depot: 1 Petabyte needed
- Storage of all master files for the long term
- Access files are stored in a different system

Storage
New approaches
- Storage strategy which balances costs, access and preservation
- Alternative file formats to minimize storage costs & increase throughput for delivery and transfer
- Use one file both as master and access file

Finance
- All costs are now specified
- Division of budget
  - 30 % Staff
  - 10 % Hard- & software
  - 10 % Research & Development
  - 50 % Digitisation, OCR & metadata
- Exploitation costs are becoming 'dramatic'
- New business models
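The budget division on the Finance slide can be applied to any project total as a simple allocation. This is a sketch under stated assumptions: the names `BUDGET_SHARES_PERCENT` and `allocate` are hypothetical, the example total of 10 M€ is made up for illustration, and the slide does not say which budget the percentages apply to.

```python
from decimal import Decimal

# Percentage split from the slide; the category labels follow the slide text.
BUDGET_SHARES_PERCENT = {
    "Staff": 30,
    "Hard- & software": 10,
    "Research & Development": 10,
    "Digitisation, OCR & metadata": 50,
}


def allocate(total):
    """Split a total budget across the categories according to the slide's percentages."""
    if sum(BUDGET_SHARES_PERCENT.values()) != 100:
        raise ValueError("budget shares must add up to 100 %")
    return {
        category: total * percent / 100
        for category, percent in BUDGET_SHARES_PERCENT.items()
    }


if __name__ == "__main__":
    # Hypothetical project total in millions of euros (not a figure from the slides).
    example_total = Decimal("10")
    for category, amount in allocate(example_total).items():
        print(f"{category}: {amount} M EUR")
```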

Organisation
- All digitisation activities in R&D department
  - Involvement of other parts of the library is necessary
- Digitisation & digital preservation are separate activities
  - Integration is necessary
- Digitisation activities are all project based
  - Integration with standing organisation is necessary

Conclusion
'Holding out for an ideal solution is often not feasible; moreover, implementing less-than-perfect solutions can enable us to be flexible, modular, and nimble so that we can continue to refine our strategies as new options become available'.
Preservation in the Age of Large-scale Digitization: A White Paper, by Oya Y. Rieger, Council on Library and Information Resources

Thank you!