Capacity Building Passing on the Experience Dr. Noha Adly World Digital Library Arab Peninsula Regional Group meeting.

Slides:



Advertisements
Similar presentations
Digital Library Service at Higher Education in India
Advertisements

1 of 18 Information Dissemination New Digital Opportunities IMARK Investing in Information for Development Information Dissemination New Digital Opportunities.
1 of 15 Information Access Internal Information © FAO 2005 IMARK Investing in Information for Development Information Access Internal Information.
doi> Digital Object Identifier: overview
Beyond the Google Book: the Future of the Digital Library Cory Snavely Library IT Core Services manager University of Michigan April 20, 2010.
WDL Technical Architecture Working Group (TAWG) June 2010 Achievements and Recommendations Co-chaired by Noha Adly, Bibliotheca Alexandrina Babak Hamidzadeh,
World Digital Library OSI | WEB SERVICES World Digital Library Arab Peninsula Regional Group Meeting Doha, Qatar, December 12-14, 2010 An Introduction.
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Pulling it all together… with thanks to Sheila Anderson.
Paperless & E-Learning Environments SCHOOL DIGITAL LIBRARY Development, Configuration, maintenance, CD ROM Publishing. E-LEARNING DRIVEN WEBSITES ENVIRONMENTS.
CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Presentation by Priyanka Sawarkar
MacKenzie Smith Associate Director for Technology MIT Libraries.
Fedora Users’ Conference Rutgers University May 14, 2005 Researching Fedora's Ability to Serve as a Preservation System for Electronic University Records.
Special collections and digital libraries: a new role for consortia? Dale Flecker Harvard University Library.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
ELPUB 2006 June Bansko Bulgaria1 Automated Building of OAI Compliant Repository from Legacy Collection Kurt Maly Department of Computer.
Depositing e-material to The National Library of Sweden.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
WMS: Democratizing Data
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Introducing Symposia : “ The digital repository that thinks like a librarian”
Digital Repository Service ___________________________ Yale University Library Audrey Novak, Head IS&P 7 March 2007.
DMS in Universities, Colleges and School Infocrew Solutions Pvt.Ltd.
Digitization Workflow Management System for Massive Digitization Projects Bibliotheca Alexandrina November 19, 2006 The 2 nd International Conference on.
Architecting an Extensible Digital Repository Anoop Kumar, Ranjani Saigal,Rob Chavez, Nikolai Schwertner Tufts University, Medford, MA.
Digital Library Architecture and Technology
Education Supported by Content Management Systems Milena Stanković, Milan Rajković, Ivan Petković, Petar Rajković Faculty of Electronic Engineering, Niš.
Trimble Connected Community
Dr. Kurt Fendt, Comparative Media Studies, MIT MetaMedia An Open Platform for Media Annotation and Sharing Workshop "Online Archives:
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
The DSpace Course Module – An introduction to DSpace.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
Communication & Web Presence David Eichmann, Heather Davis, Brian Finley & Jennifer Laskowski Background: Due to its inherently complex and interdisciplinary.
DAFv2 Hands on Lab 1. Agenda Administration Manager Administration Manager Roles, General Settings, Job-Types, Phases, Users, Workstations, Collections.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Million Book Bibliotheca Alexandrina Noha Adly 20 November 2006.
11-15 April 2011 Mauritius Institute of Health S.S.Pillai
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
The Legislative Library of Ontario’s Ontario Documents Repository Road to Partnership.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Content Management Systems Linda Fernandezlopez LIS 385T Information Architecture February 6, 2003.
Million Book Bibliotheca Alexandrina Youssef Eldakar 19 November 2006.
Introduction to metadata
This presentation describes the development and implementation of WSU Research Exchange, a permanent digital repository system that is being, adding WSU.
GPO’s Federal Digital System December 10, 2009 U.S. Government Printing Office.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
National Archives and Records Administration Status of the ERA Project RACO Chicago Meg Phillips August 24, 2010.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
1 Pioneer Investments Legal and Compliance System Assessment Weekly Status Update June 23, 2005.
Learning Objectives Understand the concepts of Information systems.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 15 Creating Collaborative Partnerships.
Radoslav Pavlov, Galina Bogdanova, Desislava Paneva- Marinova, Todor Todorov, Konstantin Rangochev
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
Digital Asset Management: Resource Assessment Anthony D. Smith Ocean Teacher Academy Training Course, 30 September - 4 October 2013, Mombasa, Kenya.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
CENTRAL/WESTERN MASSACHUSETTS AUTOMATED RESOURCE SHARING Digitization GOALS & THEIR LOGISTICS Michael J. Bennett Digital Initiatives Librarian C/WMARS,
EDUKNOWLEDGE A Framework for Educational Purposes
VI-SEEM Data Repository
VI-SEEM Data Repository
Library Technology Conference: Building Exhibits
DIGITAL LIBRARY.
Metadata to fit your needs... How much is too much?
Presentation transcript:

Capacity Building Passing on the Experience Dr. Noha Adly World Digital Library Arab Peninsula Regional Group meeting

Reaching significant milestones in digitization With a focus on Arabic content

It all started at the BA Digital Lab…

The Digital Laboratory

Well equipped for different types of media

Digital Laboratory Digitizing various media including slides in multi-formats, negatives, books, manuscripts, pictures and maps Digitizing Bibliotheca Alexandrinas valuable collections Many of the Librarys projects are highly dependant on the digital laboratory

Digital Lab Man Power 1 20 staff members Distributed over several teams Working 7 days / week 2 shifts / day Working in many collections simultaneously Workflow & Workflow Management system are essential to control and track the process

What is a Workflow ? A workflow is a well defined sequence of operations, declared as work of a [resource]* during which documents, information or tasks are passed from one resource to another for action – According to a defined procedural rules – Having an estimated time – Can be documented – Can be learned * Resource: is a person, simple or complex mechanism, group of persons, an organization of staff, or machines

Digitization Phase Scanning Hardcopy is converted into raw digital image Processing Phase Raw digital image is enhanced to realize: Better image quality Better OCR accuracy OCR Phase It extracts the text corresponding to the processed image contents Basic Digitization Workflow

For each phase, we need to: Define the specs of the output (Quality) Set the procedure of work to guarantee quality Calculate the required time Whenever possible try to Automate tasks Set Benchmarks to monitor the progress

Why Workflow Management System? 1.Automation of task handling 2.Progress tracking 3.Process Management 4.Flexibility

1. Automation of task handling Digital Assets Factory DAF (DAF is the digitization workflow management system)

2.Progress tracking – Workflow Tracking – Pending Items – Late Jobs – Employees Rates – Build Customized Report Digital Assets Factory DAF (DAF is the digitization workflow management system)

3. Process Management – Roles (Permissions) – Job Types – General Settings – Phases – Employee accounts – Workstations – Collections Digital Assets Factory DAF (DAF is the digitization workflow management system)

4. Flexibility Digital Assets Factory DAF (DAF is the digitization workflow management system)

Targeted Monthly Production Rate 5,000 books/month (1,800,000 pages) HOW to reach the target?

Daily Rates (single shift) – Scanning: 3,000 pages/person – Processing: 3,000 pages/person – Latin OCR: 4,000 pages/person – Arabic OCR: 2,100 pages/person

Monitoring Rate/user (monitored during the shift) User rate & Rate/shift report

Reporting Weekly production Monthly production

BAs digital collections are maintained within the institutions Digital Assets Repository - DAR

Digital Assets Repository Developed to facilitate the creation, use and management of the digital library collections. A repository for all types of digital material including slides in multi formats, negatives, books, manuscripts, pictures and maps, audio and video, thus preserving and archiving the digital media Provides public access to digitized collections through a web-based search and browsing facilities

Digital Assets Repository DARs core consists of 4 fundamental modules: – The Digital Assets Factory (DAF) ) Responsible for the complete automation of the digitization cycle It was developed using open source tools – The Digital Assets Metadata (DAM) Keeps a unique and intact version of the digital assets metadata Helps ensuring that cataloging, indexing, browsing, searching and retrieval are done efficiently In the latest version, DAM uses Fedora to manage the metadata. Based on METS/MODS standards – The Digital Assets Keeper (DAK) A repository for the digital assets that are either produced by DAF or are directly introduced into the repository. – Digital Assets Publishers (DAP) Components that publish and display the digital assets stored in DAK – Book viewers – Search engines

DAR is a system developed in-house using open source tools, aiming to ensure the production of high quality digitization and efficient data retrieval. The different modules of DAR manage the entire digitization workflow: Digital Assets Factory (DAF) Digital Assets Metadata (DAM) Digital Assets Keeper (DAK) Digital Assets Publishers (DAP)

Imparting Capacity Building Sharing the BAs technical expertise with external organizations

Yale University December 2007 Arabic and Middle Eastern Electronic Library Municipal Administration Modernization (MAM) program in Syria March 2009 Kuwait Institute for Science and Research KISR January 2010 ISIS has conducted capacity building workshops:

Capacity Building Scope Passing on the experience of building an institutional repository to maintain the production of high quality digital assets in terms of digitizing, processing, OCRing, encoding, archiving and publishing based on well known standards.

Capacity Building Program

The capacity building program Overviewing BA/ICT facilities (Digital Library, Internet Archive, VISTA, HPC, System infrastructure design, etc.)

The capacity building program General tour over viewing BA/ICT facilities Digitization process – Digital image parameters – Compression formats – Digitization workflow and phases

The capacity building program General tour over viewing BA/ICT facilities Digitization process Hands on Scanning and Image processing – Enhancing image and text quality – Images rendering a good OCR

The capacity building program General tour over viewing BA/ICT facilities Digitization process Hands on Scanning and Image processing Quality Assurance

The capacity building program General tour over viewing BA/ICT facilities Digitization process Hands on Scanning and Image processing Quality Assurance Digital Assets Factory (DAF) – Automation of the digitization workflow – DAF key features – Job life cycle

The capacity building program General tour over viewing BA/ICT facilities Digitization process Hands on Scanning and Image processing Quality Assurance Digital Assets Factory (DAF) OCR – Analysis of the input and classifying it to different fonts – Automating OCR procedure

The capacity building program General tour over viewing BA/ICT facilities Digitization process Hands on Scanning and Image processing Quality Assurance Digital Assets Factory (DAF) OCR Online Storage

The capacity building program General tour over viewing BA/ICT facilities Digitization process Hands on Scanning and Image processing Quality Assurance Digital Assets Factory (DAF) OCR Online Storage Library Services – VTLS including its different modules – LIS servers and DB maintenance – OPAC and WEBAC customization – In-house developed systems

The capacity building program General tour over viewing BA/ICT facilities Digitization process Hands on Scanning and Image processing Quality Assurance Digital Assets Factory (DAF) OCR Online Storage Library Services Multimedia delivery framework

Disseminating knowledge in the digital age…

Thank You