Data Processing of the 2010 Population and Housing Census 15-19 September 2008, Bangkok, Thailand National Statistical Office, Thailand.

Slides:



Advertisements
Similar presentations
MICS4 Data Processing Workshop Multiple Indicator Cluster Surveys Data Processing Workshop Overview of Data Processing System.
Advertisements

Slide 1Slide Slide 1 International Conference on Establishment Surveys III Montreal June 18-21, 2007 United States Department of Agriculture National Agricultural.
Managing data using CSPro
Harvard Center for Population and Development Studies1 Census Editing and the Art of Motorcycle Maintenance Michael J. Levin Center for Population and.
INTRODUCTION ABOUT OMR. INDEX  Concept/Definition  Form Design  Scanners & Software  Storage  Accuracy  OMR Advantages  Commercial Suppliers.
Commercial Data Processing Lesson 3: Data Validation.
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
AUTOMATIC DATA CAPTURE  a term to describe technologies which aim to immediately identify data with 100 percent accuracy.
Brief Overview of Data Processing of Afghanistan Household Listing, Pilot Census Results, Population and Housing Census and NRVA Survey Brief Overview.
The 8 th ECO National Focal Points on Economic Research and Statistics ( April 2011, Baku, Azerbaijan) Country Report of the I.R. Iran Statistical.
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
UNSD Census Workshop Day 2 - Session 6 Data Capture: Optical Mark Recognition Andy Tye – International Manager DRS are Worldwide specialists in data capture.
Census Data Capture Challenge Intelligent Document Capture Solution UNSD Workshop - Minsk Dec 2008 Amir Angel Director of Government Projects.
National Statistical Office, Thailand 2-6 December 2013, Hanoi, Viet Nam Census Evaluation.
UNSD Census Workshop Day 2 - Session 6 Data Capture: Optical Mark Recognition Andy Tye – International Manager DRS are Worldwide specialists in Census.
Manual Data Processing of Census Data 2004 Population and Housing Census Statistics Sierra Leone Thekeka Moses Conteh Sierra Leone.
The Core Welfare Indicators Questionnaire: A CWIQ Option for Monitoring Poverty Reduction Strategies.
Copyright 2010, The World Bank Group. All Rights Reserved. PROCESSING, Part 1 Data capture, editing, imputation and tabulation Quality assurance for census.
1 Use of scanning technology for data capture ICR System (Intelligent Character Recognition) Information and Communication Technology Center National Statistical.
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
Sterling Chadee Director of Statistics. The processing of the data from the field enumeration began in July 2011 until September All data processors.
AS Module 2 Information; Management and Management and Manipulation or what to do with data, how to do it, and……... ensure it provides useful information.
MSS Technologies and the AIIM Grand Canyon Chapter present: Electronic Document Management System Needs Analysis.
D ATA P ROCESSING W ORKSHOP Bangkok, Thailand, 15-19, Sept 2008 By Mr. Pen Socheat, NIS, Cambodia 1.
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
IN THE MEANTIME…. INTERIM SOLUTIONS TO AUTOMATED DATA CAPTURE.
Using OCR for Census Data Capture in China National Bureau of Statistics of China.
Workshop on International Standards, Contemporary Technologies and Regional Cooperation, Noumea, New Caledonia, 04–08 February 2008 Results Generated from.
Scanning Technology and Its Application in Ethiopia Yakob Mudesir Deputy Director General Central Statistical Agency of Ethiopia
© Beta Systems Software AG Process Stages of Census Surveys Richard J. Lang, International Manager September 2008, Bangkok.
Data Capture Overview United Nations Statistics Division
UNSD Census Workshop Day 2 - Session 7 Data Capture: Intelligent Character Recognition Andy Tye – International Manager DRS are Worldwide specialists in.
Data Capture Technology Statistical Centre Of IRAN Presented by : MS. SOMAYE AHANGAR Vice – Presidency for Strategic Planning and Supervision Statistical.
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
Uganda – October 2009 Census Data Collection & Processing John Gomersall.
Copyright 2010, The World Bank Group. All Rights Reserved. ICT - a core management issue Part 1 Managing ICT resources Produced in Collaboration between.
Bharat Sharma Nepal POPULATION & HOUSING CENSUS OF NEPAL: AN EXPERIENCE OF OUTSOURCING REGIONAL WORKSHOP ON CENSUS DATA PROCESSING September, 2008.
Multi-modal of data collection for the 2010 Population and Housing Census National Statistical Office, Thailand (Daejeon, Republic of Korea, April.
Census Data Processing: Contemporary Technologies for Data Capture Bangkok, Thailand September, 2008 By Jatan Kumar Saha Systems Analyst Bangladesh.
The Dark Side of Document Imaging: ‘The Hidden Cost of Capture’
© 2006 Formic Wednesday 7th November 2007 Formic Scoop Training Mikey Desai.
UN Regional Workshop on Data Processing, Bangkok, Sep Philippines 2007 Census of Population Data Processing Philippines 2007 Census of Population.
Status of Data Capture Technology in Population and Housing Censuses in the ESCAP region Statistics Division ESCAP.
Electronic data collection System in CSB of Latvia By Karlis Zeila, Vice President, CSB of Latvia IT DG meeting, October , Eurostat.
1 BPS Statistics Indonesia New York, February 2011.
Data processing of the 1999 Vietnam Population Census.
Data processing of 2000 population and housing census of Mongolia Munkhbadar Jugder, Senior officer of Population and housing census bureau, NSC of Mongolia.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
Census Data Capture: ABS Experience 1991 to 2006 Noumea February 2008.
Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping.
Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping.
Census Data Capture with OCR Technology: Ghana’s Experience Presented at the UNSD Regional Workshop on Census Data Processing Dar es Salaam, Tanzania 9.
Use of Mobile Technology for Data Collection in Zimbabwe Experiences Gained and Lessons Learnt By Rodgers M. Sango Zimbabwe National Statistics Agency.
TIMOTHY SERVINSKY PROJECT MANAGER CENTER FOR SURVEY RESEARCH Data Preparation: An Introduction to Getting Data Ready for Analysis.
CENSUS DATA ANALYSIS TOOLS, AREAS, ISSUES & NEEDS Neena Sharma, IAS Director of Census Operations, Uttar Pradesh Office of the Registrar General & Census.
 ReadSoft 2004 Processing census forms.  ReadSoft 2004 ReadSoft Corporate Profile n Swedish company - founded1991 n Listed in Stockholm stock exchange.
United Nations Expert Group Meeting on International Standards for Civil Registration and Vital Statistics Systems, June 2011, New York Evaluation.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
1 Handbook on Population and Housing Census Editing Department of Economic and Social Development United Nations Statistics Division Studies in Methods,
UNSD Census Workshop Day 2 - Session 7 Data Capture: Intelligent Character Recognition Andy Tye – International Manager DRS are Worldwide specialists in.
Census Mobile Data Capture Using CSPro in Lesotho
UNSD Census Workshop Data Capture: Intelligent Character Recognition
Ethiopian 2007 CENSUS DATA CAPTURING AND PROCESSING
Census of Population & Housing 2001 Sri Lanka
Optical Data Capture: Optical Character Recognition (OCR)
Data Capture Process Stages
Data Capture - ICR Typical Workflow
UNSD Census Workshop Day 2 - Session 6
Validation process and the IT tools used at KAS
Optical Data Capture: Optical Mark Recognition (OMR)
Presentation transcript:

Data Processing of the 2010 Population and Housing Census September 2008, Bangkok, Thailand National Statistical Office, Thailand

 Hardware & Software of ICR System  TELEform / ABBYY Functions  Step of ICR System in NSO  Specific questionnaires for ICR System CONTENT DATA CAPTURING

NSO was firstly used ICR System to process the Population Census questionnaires in 2000 by scanning the 16 million households (16 million Forms) which spent only 8 months to process the raw data instead of 18 months by using Key in Data System. ICR for The Population Census 2000 DATA CAPTURING

TELEform Hardware & Software System in 2000 TELEform Hardware System TELEform Software System  NetServer for TELEform Server (1)  NetServer for Database Server (1)  Reader Modules Workstations (21)  Verifier Modules Workstations (55)  Scanner Control Workstations (6)  Scanner Fujitsu M4099D (6) TELEform 6.2 Elite Enterprise Edition Components :  TELEform Designer  TELEform Reader  TELEform Verifier DATA CAPTURING

ICR System in NSO (Thailand) can be divided into 2 parts : ICR System in 2003  TELEform Software System  ABBYY Software System DATA CAPTURING

NSO hired ABBYY Software to process about 25% of The Agricultural Census 2003 questionnaires that were totally 5.8 million households (24 million forms). ICR for The Agricultural Census 2003 DATA CAPTURING

TELEform Hardware & Software System in 2003 TELEform Hardware System TELEform Software System TELEform 6.2 Elite Enterprise Edition Components :  TELEform Designer  TELEform Reader  TELEform Verifier  NetServer for TELEform Server (1)  NetServer for Database Server (1)  Reader Modules Workstations (21)  Verifier Modules Workstations (30)  Scanner Control Workstations (6)  Scanner Fujitsu M4099D (6) DATA CAPTURING

ABBYY Hardware & Software System in 2003 ABBYY Hardware System ABBYY Software System ABBYY FormReader 6.0 Enterprise Edition Components:  Form Design  Administration Station  Recognition Station  Correction Station  IBM Server X Series 225 (1)  Correction Station (1)  Verifier Modules Workstations (25)  Scanner Control Workstations (4)  Scanner Fujitsu M4099D (4)  Storageflex LT707 (1) DATA CAPTURING

TELEform & ABBYY Functions

TELEform / ABBYY Designer Function To create template form by fix field boxes on questionnaire. Questionnaire Template DATA CAPTURING

 To evaluate the questionnaires  Export the corrected questionnaires to a data file  Send the unclear questionnaires to TELEform/ABBYY Verifier Function for correcting and transferring the corrected questionnaires to a data file  Store scanned images TELEform Reader / ABBYY Administration Function DATA CAPTURING

 To correct questionnaires that contain mismarked or illegible fields  The corrected questionnaires are automatically exported to a data file TELEform / ABBYY Verifier Function DATA CAPTURING

Scanning speed support A7 to A3 paper sizes  Simplex is provided 90 papers / minute. (A4 portrait)  Duplex is provided 180 images / minute.(A4) NSO questionnaires projects are mostly printed with A3 (297 x 420 mm.) paper sizes. Functions Speed FunctionsEstimated Speed (sheets/minute) Scanner 45 Reader17 Verifier 5 DATA CAPTURING

Step of ICR System in NSO

Scan and Forms Distribution : The questionnaires are scanned in each Block / Village and created Multi Page Image Files. Step of ICR System in NSO Forms Evaluation : The questionnaire images are evaluated. The corrected questionnaires which skipped Verifier Workstations and directly exported to Database server. DATA CAPTURING

Forms Verification : The unclear questionnaires are needed to review and corrected it in Verifier Workstations before transferring to Database server. Step of ICR System in NSO (cont’) Data Export :  Link a data file from Database server to IBM Mainframe System  Store Scanned image files to CD. DATA CAPTURING

Scan & Forms Distribution Questionnaire Scan Image File DATA CAPTURING

Forms Verification and Data Export Verify Storage (Images files) Export data for processing CD DATA CAPTURING

Input – Output of ICR System Questionnaire ICR DATA CAPTURING Ascii files Image files

ICR Linkage System ABBYY Software Transfers Storage (HD 880 GB) Mainframe HP Serve r COMPAQ Server Processing (Editing & Reporting) Scanners 4 unit IBM Serve r controller PC 6 unit PC 4 unit controller 30 unit Verifications 21 unit Readers Correction 1 unit station 25 unit Verifications - Administration - Export - Recognition CD Questionnaires - Backup Data - Software - Database S Scanners 6 unit Questionnaires TELEform Software

DATA CAPTURING Specific Questionnaires for ICR System

Specific questionnaires for ICR System  The questionnaires must be designed and printed in quality of paper, specific colour answer field boxes (blue, green, red)  To record the questionnaires should be used at least 2HB pencil  To distribute and collect as well as return questionnaires should be done with caution. DATA CAPTURING

ICR Benefits  Reduce Cost  Reduce Time  Efficient Data Capture  Increase Data Accuracy DATA CAPTURING

 Strictly designed questionnaires : Paper, size, color, figure and answer field boxes  Record questionnaires should be fixed pencil and handwriting  Distribution and return questionnaires should be carful Major Problems Encountered in 2000 Census DATA CAPTURING

DATA EDITING EDITING & TABULATION

DATA PROCESSING STEP Machine Edit Listing Error? Checking Error & List Data Comparing with questionnaire images Table Checking Yes No Questionnaires ICR Accept? NoYes Validate Data (Manual, Cold deck, Hot deck) Tabulation & Report Tab/Report

 Possible code: Check in each field which the out-of-range fields values is shown in asterisk (*) code.  Validity: Check characteristics of the message structure. DATA PROCESSING STEP Editing Process

 Consistency:  Imputation: Automatic editing programs. DATA PROCESSING STEP Editing Process (cont’) Check inconsistent values within record and across record. Messages are shown the related conditional codes. All error is printed in continuous paper forms to be considerated and validated by subject matter until no messages error found.

 Tabulation: DATA PROCESSING STEP Tabulation Process Report summary data which can be processed after data completely cleaned for subject matter to analyze the results of output.

DATA PROCESSING STEP Mainframe : IBM Multiprise 2000 Model Operating System - OS/390 v.2 release Compiler - PL/I 1.3 Statistic Program - Base SAS 1.4 Application Development Tools - Performance Reporter for OS/390

DATA PROCESSING STEP Personal Computer (PC) 2.1 Operating System - Windows XP 2.2 Package - MS Office MS Studio v.6 (Visual FoxPro) - SPSS - CSPro 3.3

THANK YOU FOR YOUR ATTENTION