The Census Data Enhancement Project Glenys Bishop.

Outline
- Brief description of Census Data Enhancement Project
- Focus on Statistical Longitudinal Census Dataset (SLCD)
- Simulation to determine likely quality

Census Data Enhancement
- Formation of Statistical Longitudinal Census Dataset
  - 5% of 2006 Census linked to 2011, 2016, ...
  - augmented with 5% sample of intercensal births, immigrants
- Statistical studies
  - approved projects involving linking SLCD and other data sets
    - Births and deaths
    - Long-term migrations
    - Disease registers
- No names and addresses

Quality Studies
- Quality studies use the whole Census
  - with name and address during census processing period
  - without name and address at other times
- Two types during the 2006 census processing period
  - Assess feasibility and quality of linking without names and addresses
  - Improve ABS outputs

Quality Studies for 2006
1. Indigenous mortality
   - Linking deaths since the Census with Census records
2. Assessing automatic matching for Post Enumeration Survey 2011
   - Linking 2006 PES and Census
3. Undercoverage in Labour Force Survey
   - Linking LFS August 2006 and Census
4. Conditions of entry and settlement outcomes for immigrants
   - Linking migrant settlements database and Census
5. Simulation of SLCD formation

Mesh Blocks
- Micro-level geographical unit for statistics
- 314,369 spatial MB covering Australia
- Residential MB contain ~30 to 60 dwellings
- New building block of statistical and administrative geography
[Figure label: Canberra]

Simulated SLCD Formation
- Census Dress Rehearsal 2005, Census 2006
- assess the feasibility of forming the SLCD without names and addresses
- make defensible statements about quality of the linked data

What Linkages
Census Dress Rehearsal to Census
- Gold Standard: using name, address, mesh block and other variables
  - Names and addresses were destroyed at end of Census processing
- Silver Standard: using ~12000 hash-numbers, mesh block and other variables
  - Hash numbers were destroyed at end of Census processing
- Bronze Standard: using mesh block and other variables

Data Linking Process
[Diagram labels: 20m; File A, File B; record pair comparison; weights; upper cut-off, lower cut-off; Links, Clerical review, Non-links]
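The linking process on this slide can be sketched in a few lines of Python. This is a minimal illustration only, not the ABS implementation: the field names, the m- and u-probabilities and the log2 agreement weights are assumptions in the style of standard probabilistic linking, and the two cut-offs are arbitrary.

```python
from math import log2

# Hypothetical probabilities, for illustration only:
# m = P(field agrees | pair is a match), u = P(field agrees | pair is a non-match).
FIELD_PROBS = {
    "mesh_block": (0.90, 0.001),
    "sex": (0.99, 0.50),
    "birth_year": (0.95, 0.02),
}


def comparison_weight(rec_a, rec_b):
    """Sum agreement or disagreement weights over the compared fields."""
    total = 0.0
    for field, (m, u) in FIELD_PROBS.items():
        if rec_a[field] == rec_b[field]:
            total += log2(m / u)              # agreement adds weight
        else:
            total += log2((1 - m) / (1 - u))  # disagreement subtracts weight
    return total


def classify(weight, lower_cutoff, upper_cutoff):
    """Assign a link status from the comparison weight and the two cut-offs."""
    if weight >= upper_cutoff:
        return "link"
    if weight <= lower_cutoff:
        return "non-link"
    return "clerical review"
```

Pairs whose weight falls between the two cut-offs go to clerical review, so moving the cut-offs closer together reduces the review workload at the cost of more automatic decisions.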

Issues
- Setting cut-offs
- Clerical review very time consuming

[Figure: frequency against comparison weight, with separate curves for Matches and Non-Matches]

[Figure: frequency against comparison weight]

[Figure labels: Accept These Links; Reject These Links; Estimated Cumulative Matches Linked; Estimated Cumulative Non-Matches Linked]

Record Pairs
- incorrect non-links = P(non-link | match)
- false links = P(link | non-match)
[Diagram labels: Matches, Non-matches; Links, Non-Links]

Determining Quality of Links
Rows: Link Status (assigned by linking method). Columns: Match Status (True).

            | Matches                     | Non-matches                 | Total
Links       | True links                  | Non-matches that are linked | Total Links
Non-links   | Matches that are not linked | True Non-links              | Total Non-links
Total       | Total Matches               | Total Non-matches           | Total Record Pairs
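As a rough sketch, the four cells of this table are enough to compute the quality measures used in the rest of the talk. The function and argument names below are hypothetical, and the definitions of link accuracy (share of links that are true matches) and match-link rate (share of matches that are linked) are inferred from how the slides use those terms.

```python
def linkage_quality(true_links, false_links, missed_matches, true_non_links):
    """Summarise a link-status by match-status table (all counts must be > 0)."""
    total_links = true_links + false_links
    total_matches = true_links + missed_matches
    total_non_matches = false_links + true_non_links
    return {
        # Share of assigned links that are true matches.
        "link_accuracy": true_links / total_links,
        # Share of true matches that were linked.
        "match_link_rate": true_links / total_matches,
        # P(link | non-match): false links.
        "p_false_link": false_links / total_non_matches,
        # P(non-link | match): incorrect non-links.
        "p_incorrect_non_link": missed_matches / total_matches,
    }
```

In the simulation, the true match status would come from the Gold Standard linkage, so these measures describe Silver or Bronze relative to Gold rather than relative to an error-free truth.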

Match Status
- Gold Standard linkage uses name and address
- Use this as benchmark for Bronze and Silver Standards

Comparing Quality of Bronze and Silver Linkages

What is Important?
- High link accuracy
  - most links are correct but many matches are missed
- High match link rate
  - most matches are linked but many links are incorrect

Comparison Bronze Linkages

Silver and Bronze VL

Univariate Summary
- The higher the cut-off, the more likely some subpopulations are to be missed
  - under-represented: 0-19 yr-olds and indigenous people, employed in Agriculture, people from non-families
  - over-represented: born overseas, more highly educated, professional and clerical occupation, married
  - trends weaker or non-existent in Silver
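One simple way to quantify over- and under-representation is to compare each subgroup's share of the linked file with its share of the benchmark file. The sketch below is illustrative only: the function name and the group labels are hypothetical, and the talk does not say which measure was actually used.

```python
from collections import Counter


def representation_ratios(benchmark_groups, linked_groups):
    """Ratio of each group's share in the linked file to its benchmark share.

    Values below 1 indicate under-representation, values above 1
    indicate over-representation.
    """
    bench = Counter(benchmark_groups)
    linked = Counter(linked_groups)
    n_bench = len(benchmark_groups)
    n_linked = len(linked_groups)
    return {
        group: (linked[group] / n_linked) / (bench[group] / n_bench)
        for group in bench
    }
```

For example, if 0-19 year-olds make up 30% of the benchmark but only 25% of the linked records, their ratio is 0.25 / 0.30, or about 0.83.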

Odds Ratios
- response: employed/(unemployed, NILF [not in the labour force]) in 2006
- explanatory (all from 2005)
  - sex, indigenous status, age, tenure status, required assistance
    - moved house in previous year
  - education characteristics
  - work characteristics in 2005, income
    - volunteer, occupation labourer, sales/retail
[Table: columns Variable, Gold, Silver, Bronze; most values lost in extraction. Surviving entries: moved NS; volunteer NS; labourer NS; sales/retail 0.854, NS]
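For readers unfamiliar with the measure, an unadjusted odds ratio can be computed from a two-by-two table of counts, as in the sketch below. The odds ratios on this slide presumably come from a model that adjusts for all the listed explanatory variables, so this is a simplification, and the function and argument names are hypothetical.

```python
def odds_ratio(group_employed, group_not_employed,
               reference_employed, reference_not_employed):
    """Odds of employment in a group relative to a reference group."""
    group_odds = group_employed / group_not_employed
    reference_odds = reference_employed / reference_not_employed
    return group_odds / reference_odds
```

An odds ratio below 1 means the group has lower odds of being employed than the reference group. Comparing the same ratio across Gold, Silver and Bronze linked files shows whether linkage error changes the conclusion.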

Test Cases
- Female, aged 25-39, worked 15 hours in sales, weekly income $ , dwelling being purchased, provided child care
- Male, indigenous, weekly income $250-$399, actively seeking work
- Male, non-indigenous, married, worked 40 hours, weekly income $ , no degree, owned house, did not move in previous year

Assessing the Linked Data
- The properties of the CDR records that did not get linked to a Census record in the Gold Standard.
- Match-link rate and link accuracy of the different Silver and Bronze Standard linkages compared with Gold.
- The over- or under-representation of sub-groups in the various linked data sets compared with the Gold Standard.
- The effects of this over- or under-representation on some representative analyses and models fitted to linked data.
- Weighted analyses to counteract under-representation.
- Methods for modifying the fitted models to account for inexact linkage and disparities in the representation of sub-groups of interest.
- How well linking two files collected one year apart can represent linking two files collected five years apart.
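The weighted analyses mentioned above could take many forms. A minimal sketch, assuming simple post-stratification by subgroup, is shown below: each linked record is weighted by the ratio of its group's benchmark share to its share of the linked file. The talk does not state which weighting scheme was used, and all names here are hypothetical.

```python
from collections import Counter


def poststratification_weights(benchmark_groups, linked_groups):
    """Weight per group: benchmark share divided by linked-file share."""
    bench = Counter(benchmark_groups)
    linked = Counter(linked_groups)
    n_bench = len(benchmark_groups)
    n_linked = len(linked_groups)
    return {
        group: (bench[group] / n_bench) / (linked[group] / n_linked)
        for group in linked
    }


def weighted_mean(values, groups, weights):
    """Mean of values, with each record weighted by its group's weight."""
    total_weight = sum(weights[g] for g in groups)
    weighted_sum = sum(weights[g] * v for v, g in zip(values, groups))
    return weighted_sum / total_weight
```

Weights of this kind correct only for imbalance in the chosen grouping variables. They do not correct for false links, which is why the slide lists model modification for inexact linkage as a separate item.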

Finding Out More about CDE
- Already published papers included in handout
- Today two new papers using Indigenous Mortality QS results
  - Information Paper: Census Data Enhancement - Indigenous Mortality Quality Study (Cat. No )
  - Discussion Paper: Assessment of Methods for Developing Life Tables for Aboriginal and Torres Strait Islander Australians, 2006 (Cat. No )
- Early in 2009, several in research paper series
  - Methodological report on each QS (4)
  - Analysis of probabilistically linked data
  - Acceptance sampling & clerical review
  - Assessment of quality of SLCD