1 DATA CAPTURE – PROCESSING 2006 POPULATION & HOUSING CENSUS OF NIGERIA Presented at UN Regional Workshop on Census Data Processing By Adesola Fatilewa.

Slides:



Advertisements
Similar presentations
RFID IN UNIVERSITY OF JAMMU RFID is used in libraries primarily to automate the book handling process including checkout, inventory maintenance, and check-in.
Advertisements

INTRODUCTION ABOUT OMR. INDEX  Concept/Definition  Form Design  Scanners & Software  Storage  Accuracy  OMR Advantages  Commercial Suppliers.
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
AUTOMATIC DATA CAPTURE  a term to describe technologies which aim to immediately identify data with 100 percent accuracy.
Brief Overview of Data Processing of Afghanistan Household Listing, Pilot Census Results, Population and Housing Census and NRVA Survey Brief Overview.
UNSD Census Workshop Day 2 - Session 6 Data Capture: Optical Mark Recognition Andy Tye – International Manager DRS are Worldwide specialists in data capture.
Census Data Capture Challenge Intelligent Document Capture Solution UNSD Workshop - Minsk Dec 2008 Amir Angel Director of Government Projects.
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
UNSD Census Workshop Day 2 - Session 6 Data Capture: Optical Mark Recognition Andy Tye – International Manager DRS are Worldwide specialists in Census.
1 Census Evaluation in Kenya By M.G. Obudho & J. K. Bore Kenya National Bureau of Statistics.
Manual Data Processing of Census Data 2004 Population and Housing Census Statistics Sierra Leone Thekeka Moses Conteh Sierra Leone.
The Core Welfare Indicators Questionnaire: A CWIQ Option for Monitoring Poverty Reduction Strategies.
Topics Covered: Data preparation Data preparation Data capturing Data capturing Data verification and validation Data verification and validation Data.
1 Census 1996, 2001 & Community Survey (CS) United Nations Regional Workshop on Census Data Processing Contemporary Technology from Census Data Capturing.
By Cleophas Kiio Director, ICT 15-sep-101 The Best Practices in Census Data Processing Operation: Case of 2009 Census:
DRS Census Experience Andy Tye International Manager, DRS DRS Census Experience Andy Tye International Manager, DRS Census Meeting – New Caledonia Feb.
Experience of Vietnam in Census Mapping and Household Listings General Statistics Office of Vietnam.
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
Sterling Chadee Director of Statistics. The processing of the data from the field enumeration began in July 2011 until September All data processors.
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
HOUSELISTING SCHEDULE NPR SCHEDULE HOUSEHOLD SCHEDULE.
Using OCR for Census Data Capture in China National Bureau of Statistics of China.
Workshop on International Standards, Contemporary Technologies and Regional Cooperation, Noumea, New Caledonia, 04–08 February 2008 Results Generated from.
Copyright 2010, The World Bank Group. All Rights Reserved. COVERAGE, FRAMES & GIS, Part 2 Quality assurance for census 1.
Scanning Technology and Its Application in Ethiopia Yakob Mudesir Deputy Director General Central Statistical Agency of Ethiopia
© Beta Systems Software AG Process Stages of Census Surveys Richard J. Lang, International Manager September 2008, Bangkok.
Data Capture Overview United Nations Statistics Division
2007 Population and Housing Census (Swaziland) Presented by: Muzi Dube.
Second International Workshop on Economic Census Seoul, Korea, 6 -9 July 2009 Shanker Lal Shrestha Central Bureau of Statistics Nepal Data Collection and.
I.Information Building & Retrieval Learning Objectives: the process of Information building the responsibilities and interaction of each data managing.
Mark A. Magumba Storage Management. What is storage An electronic place where computer may store data and instructions for retrieval The objective of.
UNSD Census Workshop Day 2 - Session 7 Data Capture: Intelligent Character Recognition Andy Tye – International Manager DRS are Worldwide specialists in.
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
AADAPT Workshop South Asia Goa, December 17-21, 2009 Maria Isabel Beltran 1.
Uganda – October 2009 Census Data Collection & Processing John Gomersall.
Multi-modal of data collection for the 2010 Population and Housing Census National Statistical Office, Thailand (Daejeon, Republic of Korea, April.
Census Data Processing: Contemporary Technologies for Data Capture Bangkok, Thailand September, 2008 By Jatan Kumar Saha Systems Analyst Bangladesh.
The Dark Side of Document Imaging: ‘The Hidden Cost of Capture’
UN Regional Workshop on Data Processing, Bangkok, Sep Philippines 2007 Census of Population Data Processing Philippines 2007 Census of Population.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Addis Ababa,
Statistical Expertise for Sound Decision Making Quality Assurance for Census Data Processing Jean-Michel Durr 28/1/20111Fourth meeting of the TCG - Lubjana.
Paolo Valente - UNECE Statistical Division Slide 1 Technology for census data coding, editing and imputation Paolo Valente (UNECE) UNECE Workshop on Census.
Data Processing of the 2010 Population and Housing Census September 2008, Bangkok, Thailand National Statistical Office, Thailand.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
Regional Workshop on the 2010 World Programme on Population and Housing Censuses: International standards, contemporary technologies for census mapping.
Census Data Capture with OCR Technology: Ghana’s Experience Presented at the UNSD Regional Workshop on Census Data Processing Dar es Salaam, Tanzania 9.
Use of Mobile Technology for Data Collection in Zimbabwe Experiences Gained and Lessons Learnt By Rodgers M. Sango Zimbabwe National Statistics Agency.
 ReadSoft 2004 Processing census forms.  ReadSoft 2004 ReadSoft Corporate Profile n Swedish company - founded1991 n Listed in Stockholm stock exchange.
ViciForm – Form Processing Solution Creating Info repositories from documents.
Presentation to the UN Experts Group Meeting UNSD 29 May - 1 June 2007 Alister Nairn Director - Geography Section GIS BASED CENSUS MAPPING APPROACHES -
Anatoliy Lyashchenko Research Institute of Geodesy and Cartography, Lyubov Stelmakh State Statistics Committee of Ukraine UN EGM GIS New York, 29 May –
e-marking in large-scale, high stakes assessments conference themes :  role of technology in assessments and teacher education  use of assessments for.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
Census Planning and Management for next Nigerian Census
National Population Commission (NPopC)
ZAMBIA CENSUS MAPPING PRESETATION
Session 5 – Questionnaire Checklists
UN Reg. Workshop on the 2020 World Programme on
UNSD Census Workshop Data Capture: Optical Mark Recognition
UNSD Census Workshop Data Capture: Intelligent Character Recognition
Ethiopian 2007 CENSUS DATA CAPTURING AND PROCESSING
Document Management System
Omenya Nyahul Kenya National Bureau of Statistics
Use of handheld electronic devices for data collection in GeoStat
Data Capture Process Stages
UNSD Census Workshop Day 2 - Session 6
Optical Data Capture: Optical Mark Recognition (OMR)
Population and Housing Census 2015, and Challenge
Quality assurance in population and housing Census
Quality assurance in Population and Housing Census-The case of Ghana
Presentation transcript:

1 DATA CAPTURE – PROCESSING 2006 POPULATION & HOUSING CENSUS OF NIGERIA Presented at UN Regional Workshop on Census Data Processing By Adesola Fatilewa NATIONAL POPULATION COMMISSION At Dar-es-Salaam, Tanzania 9 th -13 th June 2008

2 MAP OF NIGERIA 36 STATES AND FCT ABUJA

3 ABOUT NIGERIA NIGERIA IS THE MOST POPULATED COUNTRY ON THE AFRICAN CONTINENT AND THE 10 th BIGGEST IN THE WORLD. AN AREA OF ABOUT 9.28 MILLION SQ. KMS. POPULATION OF 140.2million BY 2006 CENSUS COMPRISES OF 36 STATES AND FEDERAL CAPITAL TERRITORY 774 LOCAL GOVERNMENT AREAS - LGA (DISTRICTS) DELINEATED INTO OVER 662,000 ENUMERATION AREAS

4 Background Since the late nineties NPopC was being inundated with proposals on various document scanning systems. As at 2005, statements were being made, suggesting that the idea of using scanning technology was utopia.

5 P rocessing Pre-test and Trial Census A scanning system was used to process the second pre-test of April Number of documents processed was about of 100,000 forms as survey covered one local government area (Lga) in each of the 36 States of the country and the Federal Capital Territory. The forms were only optical mark readable and editing was mainly to correct alignment errors.

6 Processing Pre-test and Trial Census Continued Another solution provider supplied five scanners along with two servers for the processing of the Trial Census. Trial Census which took place in April 2005 covered about 5% of the country, which translated to about 10million population. Processing was distributed between two data processing centres (DPCs); Lagos and Kano

7 Lessons learnt staff were identified for suitable roles in data processing of the main census staff gained experience on the new technology alignment and recognition problems detected and rectified decision taken on appropriate archiving system for storage and retrieval of documents need to have various reports to enable management follow progress of processing decision to completely eliminate manual coding and editing

8 Data capture 2006 census Scanning technology was fully deployed in processing Nigeria 2006 Population and Housing Census. This was achieved with 21 scanners distributed in 7DPCs located strategically across the country. Immediately after the census, OMR/ICR forms (questionnaires) used to collect data started arriving at the DPCs. Inventory control was done using an EA tracking system

9 Data capture 2006 census Documents were enveloped by EA, tied in convenient batches and stacked on labelled shelves At the end of the receiving/archiving exercise, batches were retrieved for data capture

10 Paper Preparation before Scanning STORE IN Program Print and add Batch Header cut the paper with cutting machine Otherwise: paper damaged, introduce dirt on the scanned image, reject increased NPC0x Batch Heade r Remove the envelopes Bring the envelopes with the questionnaires from the Archive room Envelope ARCHIVE Jog the paper with the supplied jogger NPC0x Batch Heade r

11 Data Processing Steps at DPCs Schematic diagram Jog Docs Scanner Server Edit Stations

12 Scanner Views Scanner Feeder Questionnaire processing

13 Scanning Sheets loaded on the feeder in batches separated by batch header went through transport system of scanners HR80 SC Scanner speed was 8000 sheets/hr barring jams and other loading difficulties. Scanning was effected by ProScan software and scanned documents were collected at the output tray. The sheets were returned into their envelopes and sent back to archive

14 SC80HC + ProSort + kEOPs NPC0x Batch Header Work Data Storage 8. DVD 5. kEOPs recognition 1. Preparation for Scanning: cut & jogg 3. PaperArchive 3. PaperArchive 4. Data + Images 6. Correction Balancing Archive Data Storage 9. TAPE 7. Export CS Pro 8. Local reports 2.Scanner MANUAL WORK HQ Carto

15 Editing Two levels of Editing: First level at DPC Second level at DVU at NPopC hq. in Abuja

16 First Level Editing XML format stored in SAN on servers networked to scanners Forms in XML loaded onto edit stations The editing system used was called KEOPs and it was designed to check geographic ids against the batch headers, check ‘mandatory fields’ Transactions or whole batches could be passed for ‘balancing correction level’ which was handled by more experienced staff designated ‘Supervisor’,

17 Typical KEOPs Edit Screen

18 EXPORT

19 Second Level Editing Data in ASCII,was encrypted, backup on cds at the DPC and sent to NPopC Hq., Abuja Data is decrypted, validated, collated and further edited at Abuja Data is then checked for completeness to ensure that each delineated EA for any local government had data associated with it CsPro package was then utilized to edit data and aggregate appropriately

20 Second Level Editing Continued Structure checks Range checks Skip pattern checks Inter-record and intra-record consistency checks Imputation methods applied for missing or invalid values: Hot deck’ and ‘Cold deck’ or a combination of both

21 Occupational Coding The only data that was not coded on the field was occupation The occupational coding was effected automatically using a computer-assisted coding system ‘Exceptional Coding’ was applied where coding clerk could not find an appropriate occupation code for an occupation

22 Ensuring that documents for particular geographic locations were archived in sections of the archive and shelves designated for them That all forms were separated before taking them for scanning Breakdown of jogger Rate of getting documents ready for scanning was slower than rate of scanning Difficulty in maintaining belts and fixing them over pulley That correct batch headers were properly placed on EA batches and that after scanning, EAs were correctly returned to their marked envelopes Challenges

23 Challenges Continued Instances of poor field work which resulted in ‘missing values’ of ‘mandatory fields’, outright wrong values for fields Difficulty in linking forms for households of greater than 8 persons Integration of the two solution providers: form design and equipment and software solutions were provided by two different companies Cleaning of blank records of data associated with them at data capture Dealing with sensitivity of Nigerians to census figures lack of reliable and uninterrupted power supply

24 Conclusion The Commission was proud that the decision to deploy a new technology for part of the processing of Nigeria 2006 Population and Housing Census was a success About 35million forms were scanned and edited using 21 scanners, over 220 edit stations and data in XML format and ASCII stored in about 76TB of SANs. All scanning and first level editing was completed within nine months of enumeration period. About 1000 Nigerians were trained and gained expertise in various aspects of the scanning technology There is a need for intensive trainings in these areas of OMR/OCR forms design and development of appropriate scanning softwares.

25 End Thank you for your attention