1 UNOG Library Digitization and Microform Unit (DMU) – December 2009.

Presentation transcript:

1 UNOG Library Digitization and Microform Unit (DMU) – December 2009

2 Digitization and Microform Unit December 2009
Our mandate:
– Digitization and online dissemination of official documents of the United Nations published before 1993
– Targeted preservation of certain UN official documents on microfiche

3 Digitization and Microform Unit
Began digitization in 2005
Employs six full-time staff, including a supervisor
Between 2005 and 2009, digitized:
– 69,563 documents containing 708,192 pages, an average of about 13,913 documents and 141,600 pages per year
It is the only entity in the United Nations system that continues to preserve official UN documents on microfiche
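The yearly averages on this slide can be rechecked from the stated totals. The sketch below assumes that 2005 through 2009 counts as five full years, which the slide does not say explicitly; the pages-per-document figure is derived here and does not appear in the presentation.

```python
# Totals as stated on the slide for 2005-2009.
TOTAL_DOCUMENTS = 69_563
TOTAL_PAGES = 708_192

# Assumption: 2005 and 2009 both count as full years, giving five years.
YEARS = 2009 - 2005 + 1

docs_per_year = TOTAL_DOCUMENTS / YEARS
pages_per_year = TOTAL_PAGES / YEARS
pages_per_document = TOTAL_PAGES / TOTAL_DOCUMENTS  # derived, not on the slide

print(round(docs_per_year))           # documents per year, rounded to a whole number
print(round(pages_per_year, -2))      # pages per year, rounded to the nearest hundred
print(round(pages_per_document, 1))   # average length of a document in pages
```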

4 Our customers
The Unit is funded by the Member States of the United Nations through their mandatory contributions
Worldwide users:
– Members of Permanent Missions and delegates at the UN and its specialized agencies
– International organization staff members
– Staff from the reference libraries of the United Nations system
– Public users of the UN Official Documents System (ODS) and of UN departmental websites

5 Our partners The Digitization Unit at the Dag Hammarskjöld Library at UN Headquarters in New York

6 Our goals
“A good document in the right place”
Documents available easily and in a timely manner
Readable and complete documents
Good optical character recognition (OCR)
By the end of December 2009: digitization and uploading to the ODS of the complete collection of Security Council documents published between 1945 and 1993 in Russian, Chinese and Arabic (general series, agendas and verbatim records)

7 Digitized documents
Parliamentary documents
Published by the various organs of the UN (Security Council, General Assembly, Economic and Social Council, Commission on Human Rights, etc.)
Almost exclusively text, graphics and tables, very few pictures
In the 6 official languages of the UN: English, French, Spanish, Russian, Chinese, and Arabic

8 Digitization equipment available
Combined scanner: flatbed and automatic document feeder (ADF)
Book scanner
Microfiche scanner

9 Software used
Software for Optical Character Recognition (OCR) working with different languages (English, Spanish, French, Russian and, recently, Chinese)
Professional software for reading and processing PDF files

10 New digitization equipment and software
Acquired a new scanner with an automatic feeder
Replacing the book scanner with one offering better resolution (300 dpi)
Acquired server-based OCR software
Acquiring image-processing software

11 Digitization
Done systematically or on demand
At a 300 dpi resolution
From paper:
– Loose leaves using the ADF scanner
– Bound volumes using the flatbed of the combined scanner or the book scanner
From microfiche using the microfiche scanner

12 Optical Character Recognition (OCR)
In English, French, Spanish, Russian and, since June 2009, Chinese: OCR followed by conversion of the TIFF files into PDF text format
In Arabic: no character recognition; TIFF files are converted into PDF image format
Result: a single output format, PDF
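The language rule on this slide amounts to a small routing decision. The following is a minimal sketch of that rule only, not the Unit's actual software; the function name `output_format` is hypothetical, and the exact start day for Chinese OCR is an assumption because the slide says only "since June 2009".

```python
from datetime import date

# Languages the slide lists as having OCR throughout.
ALWAYS_OCR = {"English", "French", "Spanish", "Russian"}

# Assumption: the slide gives only the month, so the first of June is used here.
CHINESE_OCR_START = date(2009, 6, 1)


def output_format(language: str, processed_on: date) -> str:
    """Return the PDF flavour a scanned TIFF would be converted to."""
    if language in ALWAYS_OCR:
        return "PDF text"
    if language == "Chinese" and processed_on >= CHINESE_OCR_START:
        return "PDF text"
    # Arabic, and Chinese before OCR became available: image-only PDF.
    return "PDF image"


print(output_format("Russian", date(2008, 3, 1)))
print(output_format("Chinese", date(2009, 12, 1)))
print(output_format("Arabic", date(2009, 12, 1)))
```

Either branch ends in a PDF file, which matches the slide's point that the Unit delivers a single format.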

13 Current workflow
Each of the five members of the team processes a document or a document series from A to Z, that is to say:
– Searching for the document or documents in the relevant series in all languages, preferably in second paper copies
– Preparing lists of documents in a database
– Preparing the documents physically
– Scanning all of the documents in the series
– Processing every document set separately using software to clean, recognize text and convert to PDF
– Once the entire series is processed, uploading all documents to the Official Documents System (ODS)
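The ordering constraint in this workflow (series-level preparation first, then per-document processing, then a single upload once everything is processed) can be sketched as follows. This is an illustration only: the step labels paraphrase the slide, and the function name, series name and document identifiers are hypothetical placeholders.

```python
# Steps carried out once for the whole series, in the order given on the slide.
SERIES_STEPS = ["search", "list in database", "prepare physically", "scan"]

# Steps carried out separately for each document set.
DOCUMENT_STEPS = ["clean", "recognize text", "convert to PDF"]


def process_series(series_name, document_ids):
    """Return the ordered list of (step, target) pairs for one series."""
    log = [(step, series_name) for step in SERIES_STEPS]
    for doc_id in document_ids:
        for step in DOCUMENT_STEPS:
            log.append((step, doc_id))
    # The upload happens only after every document has been processed.
    log.append(("upload to ODS", series_name))
    return log


# Hypothetical series with two placeholder document identifiers.
for step, target in process_series("example-series", ["DOC-1", "DOC-2"]):
    print(f"{step}: {target}")
```

Under the current workflow a single team member would carry out every entry in this list for a given series.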

14 Digitization process at UNOG Library: 4 phases
1. Preparation phase (35%)
2. Digitization phase (25%)
3. Loading phase (25%)
4. Control phase (15%)
[Diagram; labels: Paper scanner; Microfiche scanner; Collect documents…; Input info in e-UNOG database…; Update UNBIS…; Prepare documents; TIFF image e-files; PDF image e-files; PDF text e-files with OCR; Load to ODS web site; Connect to ODS Mother; Attach UNBIS metadata to e-file; Image quality; Effective loading; OCR random quality; Statistics or Add to collection]
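The four percentages add up to 100. Assuming they represent each phase's share of total effort, which the slide does not state, a given number of working hours could be divided as in this sketch. The 40-hour figure and the function name `split_hours` are illustrative assumptions.

```python
# Phase shares in percent, as shown on the slide.
PHASE_SHARES = {
    "preparation": 35,
    "digitization": 25,
    "loading": 25,
    "control": 15,
}


def split_hours(total_hours):
    """Divide total_hours among the phases in proportion to their shares."""
    return {phase: total_hours * pct / 100 for phase, pct in PHASE_SHARES.items()}


# Sanity check: the shares cover the whole process.
assert sum(PHASE_SHARES.values()) == 100

# Hypothetical example: a 40-hour working week.
for phase, hours in split_hours(40).items():
    print(f"{phase}: {hours} h")
```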

15 New workflow
Gradual integration of new scanners and software
Beginning in January 2010, a workflow matrix will be established in which each member of the team becomes the expert on one step of digitization while still performing a variety of tasks
Quality control at each step of the workflow
The number and severity of errors are expected to decrease and productivity to increase