Data Capture - ICR Typical Workflow


Data Capture - ICR Typical Workflow
Image Movement / Data Extraction – Processing Centre(s)

Data Capture - ICR Typical Workflow
Image Interpretation
Automated process
Background task
Page identification
De-skew
Image cleanup
Predefined areas

Data Capture - ICR Typical Workflow
Character Inspection
Tiling
High confidence
Operator decision
Field context
Tall to short

Data Capture - ICR Typical Workflow
Key Correction
Low confidence
Operator decision
Form context
External verification
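The split between character inspection (high confidence) and key correction (low confidence) can be sketched as a simple routing rule. This is only an illustration, not any supplier's workflow: the 0.90 threshold and the names `RecognizedChar` and `route` are assumptions, and tiling is taken here to mean showing characters recognized as the same value side by side.

```python
from dataclasses import dataclass

# Hypothetical cut-off; a real system would tune this per field type.
HIGH_CONFIDENCE = 0.90


@dataclass
class RecognizedChar:
    field: str         # name of the form field the character came from
    value: str         # character proposed by the ICR engine
    confidence: float  # engine confidence in the range 0.0-1.0


def route(chars):
    """Split characters into an inspection queue and a key-correction queue."""
    inspection, key_correction = [], []
    for ch in chars:
        if ch.confidence >= HIGH_CONFIDENCE:
            inspection.append(ch)
        else:
            key_correction.append(ch)
    # Group identical values together so they can be tiled for the operator.
    inspection.sort(key=lambda ch: ch.value)
    return inspection, key_correction
```

An operator would then scan the tiled inspection queue for odd ones out, and re-key everything in the key-correction queue with the rest of the form visible.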

Data Capture - ICR Typical Workflow
Data Export
ASCII file
CSV format
1 line/form
CSPro import
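A minimal sketch of the export step, assuming each corrected form is held as a dictionary of field values. The function name `export_forms` and the field names in the usage note are hypothetical, and the layout a CSPro import actually expects depends on the data dictionary in use, which the slides do not describe.

```python
import csv
import io


def export_forms(forms, field_names):
    """Return CSV text with exactly one line per form, fields in a fixed order."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for form in forms:
        # Missing fields are written as empty strings to keep columns aligned.
        writer.writerow([form.get(name, "") for name in field_names])
    return buffer.getvalue()
```

For example, `export_forms([{"form_id": "0001", "age": "34"}], ["form_id", "age"])` yields a single line for the single form. Writing the result with `open(path, "w", encoding="ascii")` would enforce the ASCII constraint by raising an error on any non-ASCII character.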

Data Capture - ICR Typical Workflow ICR

Data Capture - ICR Accuracy
This is always the first question.
Handprint:
Numeric only in isolated fields: 98%
Numeric only in semi-constrained fields: 95-96%
Alpha upper case only: 90%
Alpha lower case only: 85-87%
Alpha mixed case: 75-80%
Alpha/numeric mixed case: 50% or less
Reduce by 5% if there are special characters (anything other than a-z and 0-9).
The accuracy level after data correction (i.e. the final output accuracy) should be 100%, subject to good operators.
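To see what these rates imply for correction workload, the sketch below treats them as per-character rates with independent errors. Both are assumptions: the slide does not say how accuracy was measured, and the function names are hypothetical.

```python
def field_accuracy(char_accuracy, length):
    """Probability that every character in a field is read correctly,
    assuming each character is an independent trial."""
    return char_accuracy ** length


def expected_chars_to_correct(total_chars, char_accuracy):
    """Expected number of misread characters out of total_chars."""
    return total_chars * (1.0 - char_accuracy)
```

Under these assumptions a five-digit isolated numeric field at 98% per character is fully correct only about 90% of the time, which is why the operator correction stages matter even for the best-performing field type.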

Data Capture - ICR Accuracy Continued…
The accuracy of all modern ICR engines is pretty much comparable.
The major differences between suppliers' solutions are the methods and workflow used in each offering.
Detecting a false positive takes 10 times longer than entering a character recognized with low confidence. False positives (substitutions) are the most expensive errors.
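The 10-to-1 cost ratio can be turned into a rough effort estimate. In this sketch only the factor of 10 comes from the slide; the one-second keying time and the function name are placeholder assumptions.

```python
# Ratio quoted on the slide: finding a false positive costs ten times
# as much as keying a low-confidence character.
FALSE_POSITIVE_FACTOR = 10


def correction_seconds(low_confidence, false_positives, seconds_per_key=1.0):
    """Estimate operator time for one batch.

    seconds_per_key is a hypothetical figure for keying one
    low-confidence character; it is not taken from the slides.
    """
    keying = low_confidence * seconds_per_key
    hunting = false_positives * FALSE_POSITIVE_FACTOR * seconds_per_key
    return keying + hunting
```

With these numbers, 10 undetected substitutions cost as much operator time as 100 characters sent to key correction, so pushing more characters into the high-confidence path is not automatically cheaper.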

Data Capture - ICR Accuracy Continued…
Accuracy can be improved by:
Restricting the responses to any given question, which allows external verification
Using multiple ICR engines to 'vote', which is expensive
Training your ICR engines on local handwriting styles (if possible)
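Two of these techniques are easy to illustrate. The sketch below shows a strict-majority vote across engine outputs and a check of a captured value against a restricted list of answers. The function names and the code list in the usage note are hypothetical, and real engines may weight votes by confidence rather than counting them equally.

```python
from collections import Counter


def vote(candidates):
    """Return the value proposed by a strict majority of engines, else None."""
    if not candidates:
        return None
    value, count = Counter(candidates).most_common(1)[0]
    # No outright winner means the character goes to an operator instead.
    return value if count > len(candidates) / 2 else None


def verify_against_list(value, allowed):
    """External verification: accept only values from the restricted list."""
    return value in allowed
```

For example, `vote(["7", "7", "1"])` settles on `"7"`, while three engines that all disagree return `None`. A captured district code could be checked with `verify_against_list(code, {"01", "02", "03"})`.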

Data Capture - ICR Advantages
No specialist hardware required
An image archive of every form is produced automatically
Very high speed scanning can be achieved
Both OMR and ICR can be interpreted using ICR software
Forms designed for ICR are relatively easy to fill in
Locally printed forms can be used
Allows capture of much more complex data than with OMR alone

Data Capture - ICR Disadvantages
Significant hardware/software and trained IT staff will be required
Accuracy is dependent on manual intervention
High-calibre IT staff are required to support the ICR system
More complex cost/benefit analysis than with OMR alone

Data Capture - ICR Indicative Costs & Labour
For a census of a 65 million population (20M single-sided A4 household forms)
Processing period of 12 weeks (8 hours/day, 5 days/week)
Hardware: $800k-$1M in total
Software: $700k-$1.3M in total
Total indicative costs: $1.5M to $2.3M
No. of staff: 100-190 in total
6-10 managers
94-180 PC operators
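The figures on this slide can be cross-checked with a little arithmetic. The sketch below uses only the slide's numbers, but it assumes a single shift and that every operator works the whole period, neither of which the slide states. The function names are hypothetical.

```python
FORMS = 20_000_000                    # single-sided A4 household forms
WEEKS, DAYS_PER_WEEK, HOURS_PER_DAY = 12, 5, 8
HARDWARE_USD = (800_000, 1_000_000)   # (low, high)
SOFTWARE_USD = (700_000, 1_300_000)   # (low, high)
OPERATORS = (94, 180)                 # (low, high)


def working_hours():
    """Total processing hours in the period (single shift assumed)."""
    return WEEKS * DAYS_PER_WEEK * HOURS_PER_DAY


def forms_per_hour():
    """Forms the whole centre must clear each working hour."""
    return FORMS / working_hours()


def forms_per_operator_hour(operators):
    """Average load per operator, assuming all work the full period."""
    return forms_per_hour() / operators


def total_cost_range():
    """Sum of the hardware and software ranges as (low, high)."""
    return (HARDWARE_USD[0] + SOFTWARE_USD[0],
            HARDWARE_USD[1] + SOFTWARE_USD[1])
```

The cost ranges sum to the slide's $1.5M to $2.3M. The 12-week period gives 480 working hours, so the centre has to clear roughly 41,700 forms per hour, or somewhere between about 230 and 440 forms per operator per hour depending on staffing.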

Data Capture - ICR Summary
The single most important factor for timely and accurate data capture is to make sure 'the forms are filled in correctly and are returned in good condition'.
ICR offers considerable flexibility, at the cost of needing more highly skilled IT personnel.

Worldwide specialists in data capture from paper