2006 Public Use Microdata File (PUMF)


Slide 1
Public Use Microdata File (PUMF)

Outline
  1. Change factors
  2. Scenarios: characteristics
  3. Analytic content: additions and losses

DLI Ontario Training, Ryerson University, Dec. 13, 2007
Martine Grenier, Mokili Mbuluyo, Jean René Boudreau, Statistics Canada

Slide 2
1. Change Factors

Improvement of the three files' analytic content for greater use at the national and international levels
Greater accessibility of census data
Data confidentiality constraints
  File size
  Limited geography
  Age variable
  Income variable
Late release of PUMFs
  Delay due to the heavy workload of selecting, certifying and deriving variables, and of quality control on the files

Slide 3
2. Scenario #1: Status Quo (2001)

Content
  1. Sample size
     Individuals: 800,000 records
     Families: 310,000 records
     Households and dwellings: 350,000 records
  2. Geography
     Provinces, Territories, CMAs
  3. Variables
     Variables extracted from the dissemination database
     Large number of derived variables
     Less detailed variables for Maritime provinces and Northern territories
     Variables repeated in the 3 files
Reduction of disclosure risks
  Substantial disclosure control by the microdata file review committee
  Confidentiality rules applied separately to each file
Production time
  3 years, expected release in 2010?
  Considerable amount of work for SM analysts to certify derived variables

Slide 4
2. Scenario #2: Single File

Content
  1. File size
     Single file: 800,000 records (individuals)
     Some persons will represent a family or a household
  2. Geography
     Canada, 5 regions, 5 CMAs with a population of at least one million
  3. Variables
     Variables extracted from the dissemination database
     Derived variables of complexity level 4 or which require the use of limited data
Reduction of disclosure risks
  Eliminate values with a Canada frequency of less than 100,000
  Collapse some or all of the age groups
  Round off or generate noise in income components
Production time
  Projected release: Summer 2009
  Reduced certification
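The disclosure-reduction steps listed for the single-file scenario can be illustrated with a minimal Python sketch. It is not Statistics Canada's procedure: the function names, the "Other" recode label, the 5-year band width, the 85+ top code and the rounding base of 1,000 are all assumptions chosen for illustration. Only the 100,000 frequency threshold comes from the slide. Noise generation, the alternative to rounding mentioned on the slide, is not shown.

```python
from collections import Counter


def suppress_rare_values(records, field, weight_field,
                         min_frequency=100_000, other_label="Other"):
    """Recode values whose weighted national frequency falls below min_frequency.

    The weighted frequency is the sum of the sampling weights of all records
    that share the value. other_label is a hypothetical catch-all category.
    """
    totals = Counter()
    for rec in records:
        totals[rec[field]] += rec[weight_field]
    return [
        {**rec, field: rec[field] if totals[rec[field]] >= min_frequency else other_label}
        for rec in records
    ]


def collapse_age(age, width=5, top_code=85):
    """Map a single year of age to a band such as '30-34', with an open top band."""
    if age >= top_code:
        return f"{top_code}+"
    lower = (age // width) * width
    return f"{lower}-{lower + width - 1}"


def round_income(amount, base=1000):
    """Round an integer income amount to the nearest multiple of base (halves round up)."""
    return (amount + base // 2) // base * base


# Hypothetical sample: each record carries a weight (persons represented).
sample = [
    {"origin": "A", "weight": 60_000, "age": 32, "income": 43_250},
    {"origin": "A", "weight": 60_000, "age": 90, "income": 43_780},
    {"origin": "B", "weight": 40_000, "age": 4, "income": -1_250},
]

protected = [
    {**rec, "age": collapse_age(rec["age"]), "income": round_income(rec["income"])}
    for rec in suppress_rare_values(sample, "origin", "weight")
]
```

In this sample, value "A" represents 120,000 persons and is kept, while "B" represents 40,000 and is recoded to the catch-all label. Each step trades detail for lower re-identification risk, which is why the slide pairs these rules with reduced geography.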

Slide 5
2. Scenario #3: Hierarchical File

Content
  1. File size
     Hierarchical file: 350,000 records on households
     All families and persons are included and identified in the household (about 800,000 persons)
  2. Geography
     Canada, regions with a population of at least 2 million
  3. Variables
     2B variables from the dissemination database
     Derived variables of complexity level 4 or which require the use of limited data
Reduction of disclosure risks
  Eliminate values with a Canada frequency of less than 100,000
  Collapse age groups
  Round off or generate noise in income components
Production time
  Reduced certification
  Projected release: Summer
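To make the hierarchical layout concrete, here is a small Python sketch of households that contain families, which in turn contain persons, together with a helper that flattens the hierarchy to one row per person. All class names, field names and values are hypothetical and do not reflect the actual PUMF record layout. As a simplification, every person is assumed to belong to a family.

```python
from dataclasses import dataclass, field


@dataclass
class Person:
    person_id: int
    age_group: str


@dataclass
class Family:
    family_id: int
    persons: list = field(default_factory=list)


@dataclass
class Household:
    household_id: int
    region: str
    families: list = field(default_factory=list)


def person_records(households):
    """Flatten households to one row per person, keeping the parent identifiers."""
    rows = []
    for hh in households:
        for fam in hh.families:
            for person in fam.persons:
                rows.append({
                    "household_id": hh.household_id,
                    "family_id": fam.family_id,
                    "person_id": person.person_id,
                    "region": hh.region,
                    "age_group": person.age_group,
                })
    return rows


# Hypothetical household with two families and three persons.
households = [
    Household(1, "Ontario", [
        Family(1, [Person(1, "30-34"), Person(2, "0-4")]),
        Family(2, [Person(3, "65-69")]),
    ]),
]
rows = person_records(households)
```

Because each row keeps its household and family identifiers, the same file supports person-level, family-level and household-level analysis, which is the advantage the hierarchical scenario claims over independent samples.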

Slide 6
3. Analytic Content: additions and losses

Content
  Size: 2.7% of the population
  Sample
    PUMF-2006 (Status Quo): Independent samples of the three universes
    PUMF-2006 (Single File): Some people represent a family or a household
    PUMF-2006 (Hierarchical File): All families and persons in households sampled are included
  Geography
    PUMF-2006 (Status Quo): Diverse geographies at the province and CMA levels
    PUMF-2006 (Single File): Geography limited to provinces and major CMAs (pop. 1 million)
    PUMF-2006 (Hierarchical File): Geography more limited, to regions
  Families and households
    PUMF-2006 (Status Quo): Families and households well represented
    PUMF-2006 (Single File): Loss of information about families and households
    PUMF-2006 (Hierarchical File): File representative of households; more varied content including all data
  Variables
    PUMF-2006 (Status Quo): Repetition of variables between the 3 universes; complex derived variables
    PUMF-2006 (Single File): Variables taken from the questionnaire so that users can create their own derived variables
    PUMF-2006 (Hierarchical File): Variables taken from the questionnaire so that users can create their own derived variables
  Analytic content
    PUMF-2006 (Status Quo): Analytic content limited to one universe at a time
    PUMF-2006 (Single File): Analytic content extended to the three universes
    PUMF-2006 (Hierarchical File): Analytic content extended to the three universes; greater potential for analysis and international comparison
Production requirements
  PUMF-2006 (Status Quo): Certification and production projected for summer 2010
  PUMF-2006 (Single File): Production projected for summer 2009
  PUMF-2006 (Hierarchical File): Production projected for summer 2009
Confidentiality
  PUMF-2006 (Status Quo): Suppression level higher than in 2001
  PUMF-2006 (Single File): Suppression level lower than in 2001 (fewer geographies)
  PUMF-2006 (Hierarchical File): Same suppression level as in 2001 (fewer geographies)

Slide 7
Thank you!