Presentation is loading. Please wait.

Presentation is loading. Please wait.

Digitization Workflow Management System for Massive Digitization Projects Bibliotheca Alexandrina November 19, 2006 The 2 nd International Conference on.

Similar presentations


Presentation on theme: "Digitization Workflow Management System for Massive Digitization Projects Bibliotheca Alexandrina November 19, 2006 The 2 nd International Conference on."— Presentation transcript:

1 Digitization Workflow Management System for Massive Digitization Projects Bibliotheca Alexandrina November 19, 2006 The 2 nd International Conference on Universal Digital Library 2006 (ICUDL 2006) Mohamed Yakout Noha Adly Magdy Nagi mohamed.yakout@bibalex.orgmohamed.yakout@bibalex.org noha.adly@bibalex.org magdy.nagi@bibalex.orgnoha.adly@bibalex.orgmagdy.nagi@bibalex.org

2 Goals Automate, track and manage the digitization workflow. Flexibility in defining digitization workflow Phases. Support dynamic evolution and deviations with a history tracking. Flexibility integration with the LIS and Library Digital Repository. Accept external partially digitized Jobs to start in the proper Phase within the digitization workflow Simultaneous management of multiple projects with a diversity of materials (books, journals, manuscripts, audio, video, slides, … etc)

3 Related Work Manual workflow management using several software packages (MS Excel, MS SharePoint, MS Project) Simple tracking workflow system with limited capabilities Several integrated digitization activities (digital capturing, image processing, OCRing, …) in one software DOCWorks from CCS. BookRestorer from i2s. OUPS Limitations: Tightly coupled with certain tools and do not allow easily other tools to be integrated. No Resources Management (e.g. Workstations and users) Lack of projects and collections management. Manual files handling between the storage server and clients. Lack of handling workflow exceptions, dynamic evolution and deviations except through manual intervention.

4 System Data Model

5 The object being digitized Book for Naguib Mahfouz Photos for an event Map for Alexandria Music sheet for Omar Khayrat

6 System Data Model All types of materials in the system BookManuscripts MapJournals AudioVideo

7 System Data Model A task that should be applied within the digitization process ScanningProcessing OCRingEncoding PublishingZipping for archiving

8 System Data Model The system users with several roles Digital lab operators Shift operators Administrator

9 System Data Model Represents logical grouping for the Jobs Nasser AlexMed AMEEL

10 System Data Model The computer used to perform the Phase

11 System Architecture

12

13

14 System Handlers XML Phases Definition Handler Pre-Phase and Post-Phase Physical section Database section Reflection Call <Folder Name="OTIFF" Create="false" ToDestination="false" NewName="OTIFF" Mode="Restircted"> <File Name="OriginalFiles" Type="tif" Count="+" ToDestination="false" Compare=""/>. <Folder Name="TXT" Create="false" ToDestination="true" NewName="TXT" Mode="Restircted"> <File Name="" Type="frf" Count="1" ToDestination="true" Compare=""/> <File Name="" Type="art" Count="1" ToDestination="true" Compare=""/>.

15 System Handlers XML Phases Definition Handler Pre-Phase and Post-Phase Physical section Database section Reflection Call <Folder Name="OTIFF" Create="false" ToDestination="false" NewName="OTIFF" Mode="Restircted"> <File Name="OriginalFiles" Type="tif" Count="+" ToDestination="false" Compare=""/>. <Folder Name="TXT" Create="false" ToDestination="true" NewName="TXT" Mode="Restircted"> <File Name="" Type="frf" Count="1" ToDestination="true" Compare=""/> <File Name="" Type="art" Count="1" ToDestination="true" Compare=""/>.

16 System Handlers XML Phases Definition Handler Pre-Phase and Post-Phase Physical section Database section Reflection Call <Folder Name="OTIFF" Create="false" ToDestination="false" NewName="OTIFF" Mode="Restircted"> <File Name="OriginalFiles" Type="tif" Count="+" ToDestination="false" Compare=""/>. <Folder Name="TXT" Create="false" ToDestination="true" NewName="TXT" Mode="Restircted"> <File Name="" Type="frf" Count="1" ToDestination="true" Compare=""/> <File Name="" Type="art" Count="1" ToDestination="true" Compare=""/>.

17 System Handlers XML Phases Definition Handler Pre-Phase and Post-Phase Physical section Database section Reflection Call <Folder Name="OTIFF" Create="false" ToDestination="false" NewName="OTIFF" Mode="Restircted"> <File Name="OriginalFiles" Type="tif" Count="+" ToDestination="false" Compare=""/>. <Folder Name="TXT" Create="false" ToDestination="true" NewName="TXT" Mode="Restircted"> <File Name="" Type="frf" Count="1" ToDestination="true" Compare=""/> <File Name="" Type="art" Count="1" ToDestination="true" Compare=""/>.

18 System Architecture

19

20

21

22 System Modules Check-In Plug-in based for integration. Creates the Job in the system Assign the Job to any Phase Check-Out Java Reflection Call section of the XML Phases Definition Ingest the Job’s digital objects into the repository

23 System Architecture

24 System Modules Phases Manager Request a new Job Download the Jobs folders and files Submit the Job back to the system to continue other Phases Reject a Job and recommend another Phase in addition to specifying reasons. Redirect a Job from the default Phase Sequence Provide information on the files level to help solving problems

25 System Modules (Contd) Reporting Workflow Tracking Pending Items Late Jobs Operators rates Build Customized Report Archiving On different Medias with different size and on online storage Administration

26 BA Digitization Workflow

27

28 Quality Assurance Supported on two different stages Maintain QA information on the files levels while moving from a Phase to another. A QA Phase is defined in the Digitization Phase Sequence as the last Phase before the Archiving

29 Achieving Flexibility Using DWMS The defined Phase Sequence for a Job Type is a guide, rather than a prescription. The list of Phases can or can not be in the Phase Sequence. The operator can assign the Job to any of all of these Phases. Jobs can be Forwarded dynamically to another Phase in the Phase Sequence. Changes in the Phase Sequence affects the current and new Jobs in the system, leading to natural process evolution

30 Job Life Cycle

31 Future Work Check-out plug-in for Fedora.. Check-in plug-ins will be implemented to support various metadata standards formats MODS, DC, VAR, etc. Enhance the software interface with graphical tools to help design and follow the digitization process.

32 Thank You mohamed.yakout@bibalex.org


Download ppt "Digitization Workflow Management System for Massive Digitization Projects Bibliotheca Alexandrina November 19, 2006 The 2 nd International Conference on."

Similar presentations


Ads by Google