Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping.

Presentation transcript:

Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping and data processing
Minsk, Belarus, 8-12 December 2008

Data Capture Overview
United Nations Statistics Division

Overview of Presentation
• Definition of data capture
• Methods of data capture:
  - Different methods
  - Advantages and disadvantages
• Issues to consider

What’s Data Capture?
“Data capture is the system used to convert the information obtained in the census to a format that can be interpreted by a computer.”
Source: United Nations Principles and Recommendations for Population and Housing Censuses, Rev. 2, p. 68.

Data Capture Methods
1) Keyboard data entry
2) Optical mark recognition/reading (OMR)
3) Optical character recognition/intelligent character recognition (OCR/ICR)
4) Personal digital assistant (PDA)
5) Internet
• Advantages/disadvantages/costs/impacts at both data capture and later stages
• Combination of more than one of the above methods

Keyboard Data Entry
• Response codes from the census form are manually entered into computers
• A more sophisticated version is computer-assisted key entry, where the operator selects a response from options displayed on the screen
• Use of the method is based on time and cost considerations, and on the feasibility of implementing more sophisticated technology
• The method is also used to process textual responses into classification categories

Advantages and Disadvantages of Keyboard Data Entry
Advantages
• Method requires simple software systems and low-end computing hardware
• Less costly (depending on the costs of manpower)
• There will be a large number of PCs available for other uses after the census
Disadvantages
• Requires more staff
• Task takes much longer to complete than with automated data entry
• Potential for errors during data entry
• Standardization of operations is difficult, as performance may depend on the individual operator

Data Capture Technologies
• Imaging and intelligent character recognition offer great potential and benefits for data capture
• Technology should be used to make data capture more effective and efficient, not for technology’s sake
• Be aware of the long lead times and technology infrastructure required for successful implementation of intelligent character recognition

Optical Mark Recognition/Reading (OMR)
• OMR is a form-scanning method whereby responses are read into a computer without a keyboard
• OMR technology reads responses to “tick-box” type questions on specially designed paper
• Only presence or absence of a mark is detected by the machine
• The scanned responses are transformed into codes
• Handwritten responses must be manually entered or coded using computer-assisted methods
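The presence-or-absence principle described above can be illustrated with a minimal Python sketch. It is not how any particular OMR product works: it assumes the scanner has already reduced each tick-box to a darkness value between 0 and 1, and the threshold value and the function names are hypothetical.

```python
from typing import Dict, Optional

# Hypothetical cut-off: fraction of dark pixels in a box that counts as a mark.
MARK_THRESHOLD = 0.35


def is_marked(darkness: float, threshold: float = MARK_THRESHOLD) -> bool:
    """Detect only the presence or absence of a mark in one tick-box."""
    return darkness >= threshold


def read_question(box_darkness: Dict[str, float]) -> Optional[str]:
    """Turn the tick-boxes of one single-choice question into a response code.

    box_darkness maps each response code to the darkness measured in its box.
    Returns the code of the single marked box, or None when no box or more
    than one box is marked, so that the form can be referred for manual review.
    """
    marked = [code for code, darkness in box_darkness.items() if is_marked(darkness)]
    return marked[0] if len(marked) == 1 else None
```

For example, `read_question({"1": 0.05, "2": 0.80})` yields the code `"2"`, while a question with two dark boxes yields `None` and would go to an operator.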

Advantages and Disadvantages of OMR
Advantages
• Improved data accuracy
• Data capture faster than keyboard data entry
• Equipment is relatively inexpensive
• Relatively simple to install and run
• A well-established technology that has been used in many countries
Disadvantages
• Restrictions as to form design
• Restrictions on type of paper and ink
• Precision required in printing process/cutting of sheets
• Response boxes must be correctly marked with an appropriate pen or pencil
• Does not capture textual responses

Optical Character Recognition (OCR)/Intelligent Character Recognition (ICR)
• OCR and ICR combine scanning and character recognition technology to scan the whole form and interpret the responses
• OCR technology recognizes machine-printed characters only
• ICR technology reads both machine-printed and handwritten responses in specific locations of the page and transforms the responses into codes
• For OCR, handwritten responses must be manually entered or coded using computer-assisted methods

Advantages of OCR/ICR
• Form design is not as stringent as for OMR
• Processing time can be reduced due to the automated nature of the process
• Allows for digital filing of questionnaires, making storage and retrieval of questionnaires for future use more efficient
• Some handwritten responses can be automatically coded, thereby improving data quality

Disadvantages of OCR/ICR
• Higher costs of equipment (sophisticated hardware/software required)
• High-calibre IT staff required to support the system
• Handwriting on census forms must be as close as possible to the model handwriting to avoid recognition errors
• Possibility of character substitution errors, which would affect data quality
• Tuning of the recognition engine to accurately recognize characters is critical, with a trade-off between quality and cost
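The trade-off between quality and cost in tuning can be sketched as a confidence threshold. The example below is an illustration only: it assumes the recognition engine reports a confidence between 0 and 1 for each character, and the function name and threshold values are hypothetical rather than taken from any specific product.

```python
from typing import List, Tuple


def route_field(chars: List[Tuple[str, float]], min_confidence: float) -> Tuple[str, bool]:
    """Decide whether a recognized field can be accepted automatically.

    chars holds (character, confidence) pairs as assumed engine output.
    Returns the recognized text and a flag that is True when at least one
    character falls below min_confidence, meaning the field should be sent
    to an operator for key correction.
    """
    text = "".join(char for char, _ in chars)
    needs_review = any(confidence < min_confidence for _, confidence in chars)
    return text, needs_review


# The same field under a strict and a lenient setting (values are made up).
field = [("4", 0.99), ("7", 0.80)]
strict = route_field(field, min_confidence=0.90)   # sent to an operator
lenient = route_field(field, min_confidence=0.75)  # accepted automatically
```

A stricter threshold sends more fields to operators, which raises cost but lowers the risk of undetected substitution errors; a more lenient one does the opposite.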

Personal Digital Assistant (PDA)
• Contents of the census form are stored on the PDA so that the questions appear sequentially on the screen
• Data are entered into a hand-held computer instead of onto a paper census form
• Data are then electronically transmitted to a national statistical office (NSO) database for further processing

Advantages and Disadvantages of Use of the PDA
Advantages
• Instant data capture at the point of collection, reducing manual input errors
• Immediate data validation, reducing re-verification at later stages
• Time-effective, with real-time logical validation rules that reduce logical errors
• Faster processing of census information, leading to timely availability of results
Disadvantages
• Setting up the process may take a long time, as it requires extensive testing
• Requires that enumerators are able to use the device, which may require administering a test
• Requires intensive training of enumerators on use of the device (training is more complicated)
• Need to recharge the battery, which could run out during enumeration
• Possibility of equipment failure
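Real-time validation at the point of collection can be pictured with a short Python sketch. The two rules, the field names and the age limits are hypothetical examples chosen for illustration; an actual census would define its own edit rules.

```python
from typing import Dict, List


def validate_person(record: Dict[str, object]) -> List[str]:
    """Return the validation messages for one person record (empty if clean)."""
    errors: List[str] = []
    age = record.get("age")
    # Range check (hypothetical limits): age must be a whole number from 0 to 120.
    if not isinstance(age, int) or not 0 <= age <= 120:
        errors.append("age must be a whole number from 0 to 120")
    # Logical check (hypothetical rule): children under 10 must be never married.
    elif age < 10 and record.get("marital_status") != "never_married":
        errors.append("marital status inconsistent with age")
    return errors
```

Because the check runs while the enumerator is still with the respondent, a message such as "marital status inconsistent with age" can be resolved on the spot instead of during later editing.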

Internet-based Data Collection
• Use of the Internet for census data collection is growing
  - However, the method is always complementary to other more established methods
• As with PDAs, the on-line form is not a downloadable version of the paper form
• Use of this method requires a password in order to access and fill in the form
• Development of the Internet system for data collection is generally outsourced for lack of in-house expertise

Advantages/Disadvantages of Use of the Internet
Advantages
• Reduced resources necessary for form handling and data capture
• Better opportunity to enumerate geographic areas and population groups that are difficult to reach and to enumerate
• Automatic filtering of irrelevant questions
• Better quality data due to in-built interactive verification mechanism
• Faster availability of census results through simplified data entry and editing
Disadvantages
• Requires that respondents have a computer with Internet access
• Management of responses can be problematic, e.g., ensuring that each household has responded once and only once
• Requires a high-security system to ensure safe transfer of data
• Need to build a parallel processing system, as not everyone will use the Internet
• Requires a mechanism to check for omitted and duplicate submissions
• Is costly and requires a lot of resources to set up and adequately test the system
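The check for omitted and duplicate submissions can be sketched in a few lines of Python. The sketch assumes each household is identified by a unique ID issued in advance; the ID format and the function name are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Set, Tuple


def check_submissions(expected: Set[str], submitted: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Compare received on-line forms with the list of expected households.

    Returns two sorted lists: household IDs that submitted more than once,
    and expected household IDs with no submission at all.
    """
    counts = Counter(submitted)
    duplicates = sorted(hid for hid, n in counts.items() if n > 1)
    omitted = sorted(expected - set(counts))
    return duplicates, omitted


# Made-up example: H1 responded twice, H3 has not responded yet.
duplicates, omitted = check_submissions({"H1", "H2", "H3"}, ["H1", "H2", "H1"])
```

Households in the omitted list would be followed up through the parallel, non-Internet collection channel, while duplicates need a rule for which submission to keep.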

Issues to Consider in Choosing a Method
• The method to use depends on national circumstances
• Choice of method should be part of the overall strategic objective of the census in terms of timeliness, accuracy and cost
• Choice of processing system and technology needs to be established early in the census cycle
• Enough time is required to test and implement the system
• When imaging technology is used for data capture, extensive testing is required well in advance of the census
• Possibility to outsource when the required expertise is not available in-house

Issues to Consider (cont.)
• Extensive testing of the system is also critical when data collection is either by PDA or via the Internet
• Design and paper quality of census form should be linked to method of data capture
• When imaging technology is to be used, adequate training of enumerators on how to properly fill in the forms is crucial

Thank you