Continuous Surveys: Statistical Challenges and Opportunities Carl Schmertmann Center for Demography & Population Health Florida State University

Slides:



Advertisements
Similar presentations
1 Accessing Census Bureau Population Estimates for the United States Ryan Burson Population Division U.S. Census Bureau July 16, 2014.
Advertisements

U.S. Census and American Community Survey Overview Open a web browser and go to:
The Geography of Census October 6, What is the Census? The U.S. Census Bureau conducts many surveys but the most widely known and used is the decennial.
Changing Demographics in Texas
David A. Swanson University of California Riverside The American Community Survey: Some Considerations Regarding its use as a Substitute.
Bridging the Gaps: Dealing with Major Survey Changes in Data Set Harmonization Joint Statistical Meetings Minneapolis, MN August 9, 2005 Presented by:
2010 Census and ACS in Oregon: Results and Resources Census Data Workshops November, 2011 Charles Rynerson Census State Data Center Coordinator Population.
Texas & San Antonio: Characteristics and Trends of the Hispanic Population KVDA Telemundo November 10, 2011 San Antonio, TX.
1 Census Data for Community Research Horizons Moses Lake, Washington March 20, 2010 Rural Reflections.
11 American Community Survey Data Products. 2 What do I need to know before using ACS data and data products?
1 Commuting and Migration Data Products from the American Community Survey Journey-to-Work and Migration Statistics Branch U.S. Census Bureau State Data.
THE UNIVERSITY OF MISSISSIPPI The University of Mississippi Institute for Advanced Education in Geospatial Science Census to American Community Survey.
Your Community by the Numbers Accessing the most current and relevant Census data Alexandra Barker Data Dissemination Specialist U.S Census Bureau New.
Socio-Economic & Demographic Data Tools for Proactive Planning Robin Blakely-Armitage STATE OF NEW YORK CITIES: Creative Responses to Fiscal Stress March.
The Changing Population of Texas Government Finance Officers Association of Texas October 25, 2012 San Marcos, TX.
National Household Survey: collection, quality and dissemination Laurent Roy Statistics Canada March 20, 2013 National Household Survey 1.
The American FactFinder Florida Libraries Association Annual Conference, 2012, Orlando, Florida Jan Swanbeck, Documents Librarian, Joe Aufmuth, GIS Librarian.
Equal Employment Opportunity (EEO) Special Tabulation by Jennifer Cheeseman Day Presentation for the State Data Centers Annual Meeting October 15, 2010.
11 The American Community Survey Steve Murdock, Ph.D. Director, Hobby Center for the Study of Texas Rice University.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
Population Estimates and Projections in the U. S. John F. Long
Transportation leadership you can trust. presented to TRB Census Data for Transportation Planning Meeting presented by Kevin Tierney Cambridge Systematics,
11 The Census for School Districts: American Community Survey from the Census Bureau and School District Tabulations from the US Department of Education.
Saadia GreenbergElena Fazio Office of Performance and Evaluation Administration on Aging US Department.
The American Community Survey Texas Transportation Planning Conference Dallas, Texas July 19, 2012.
A Brief Demography of California Hans Johnson Public Policy Institute of California November 30, 2010.
Liesl Eathington Iowa Community Indicators Program Iowa State University October 2014.
DEMOGRAPHYY VITAL STATISTICS EPIDEMIOLOGY. Essential Questions b What is it? b Why is it important to public health practice? b What essential information.
Population Projections Matthew C. Harris, Ph.D. Center for Business and Economic Research Tennessee State Data Center Department of Economics Haslam College.
Economics and Statistics Administration U.S. CENSUS BUREAU U.S. Department of Commerce Research on Estimating International Migration of the Foreign-Born.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Introduction to the Public Use Microdata Sample (PUMS) File from the American Community Survey Updated February 2013.
Overview of error model for estimates of foreign-born immigration using data from the American Community Survey Mary H. Mulry U.S. Census Bureau 2011 International.
Case 5 Introduction to Demographic Research Using Aggregated ACS Data for Ecological Regression: Changes in County Poverty Katherine Curtis Adam Slez Jennifer.
Using the American Community Survey (ACS) Maryland Sate Data Center Affiliate Meeting April 4, 2007.
Methodology for producing the revised back series of population estimates for Julie Jefferies Population and Demography Division Office for.
1 Things That May Affect Estimates from the American Community Survey.
American Community Survey Maryland State Data Center Affiliate Meeting June 17, 2008.
American Community Survey (ACS) 1 Oregon State Data Center Meeting Portland State University April 14,
Update on the American Community Survey (ACS) and Geographic Products 2012 PA SDC Data User Conference September 20,2012 Noemi Mendez Eliasen Geographer.
U.S. Census Bureau’s Population Estimates Program Victoria Velkoff Population Division U.S. Census Bureau APDU 2010 Annual Conference Public Data 2010:
Mobility MATTERS! Connecting People to Life Who Rides the Bus? How Understanding Transit Demographic Can Improve Service May 7, 2015.
1 Risk Factors for Children in the U.S., States, and Metropolitan Areas: Data from the 2007 American Community Survey Robert Kominski, U.S. Census Bureau.
Current Population Survey Sponsor: Bureau of Labor Statistics Collector: Census Bureau Purpose: Monthly Data for Analysis of Labor Market Conditions –CPS.
Data on the Foreign Born in 2010: Accessing Information on Immigrants and Immigration from the U.S. Census Bureau’s American Community Survey Thomas A.
The U.S. Census Bureau Population Estimates Program Victoria A. Velkoff U.S. Census Bureau APDU Annual Conference September 25, 2008.
American Community Survey “It Don’t Come Easy”, Ringo Starr Jane Traynham Maryland State Data Center March 15, 2011.
Using Census Data to Understand Things ​ OpenGovChicago March 26, 2014.
Things that May Affect the Estimates from the American Community Survey Updated February 2013.
Shashin Amatya Yi Gao Lauren Reuther INFSYS-6833 Group B Homicide.
American Community Survey (ACS) Product Types: Tables and Maps Samples Revised
Census 2000 Supplementary Survey: An Operational Feasibility Test Nancy M. Gordon Associate Director for Demographic Programs U.S. Census Bureau July 2001.
Household Surveys: American Community Survey & American Housing Survey Warren A. Brown February 8, 2007.
Accessing Census Data through the American FactFinder Arthur Bakis Information Services Specialist Boston Regional Census Center US Census Bureau
1 Population Controls for the American Community Survey Alexa Kennedy-Puthoff Population Division Prepared for the 2009 SDC Annual Training Conference,
American Community Survey (ACS) Using Census Data by Block Group January 21, 2016 Presentation at the National Community Development Association Winter.
Community Foundation of Collier County Our Mission: To improve the quality of life in Collier County by connecting donors to community needs and providing.
Quality of Race and Hispanic Origin Reporting on Death Certificates in the US Elizabeth Arias, Ph.D. Mortality Statistics Branch Division of Vital Statistics.
U.S. Census and American Community Survey Overview Open a web browser and go to:
Jan 2016 Solar Lunar Data.
The 6 steps of data collection:
Average Monthly Temperature and Rainfall
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Text for section 1 1 Text for section 2 2 Text for section 3 3
Presentation transcript:

Continuous Surveys: Statistical Challenges and Opportunities Carl Schmertmann Center for Demography & Population Health Florida State University

Outline  CHALLENGES (long) Increased Temporal Complexity Increased Sampling Error New Weighting Problems  OPPORTUNITIES (brief, but important)

Sample Size Comparison  US CENSUS LONG FORM: % / decade  ACS ROLLING SURVEY: 2 per 1000 Households / month 24per 1000 Households / year 240per 1000 Households / decade % / decade

Sampling Differences over Decade Long FormACS Sample Size≈ 17%≈ 24% Taken on…1 day3650 days Released as…1 dataset10+ datasets Simultaneous 100% count? YESNO

1. Temporal Complexity Long FormACS Sample Size≈ 17%≈ 24% Taken on…1 day3650 days Released as…1 dataset10+ datasets Simultaneous 100% count? YESNO 1. Temporal Complexity

What is the Population?  1-Day Census Population membership is binary: {0,1} Each individual is IN or OUT  Continuous Survey Population membership is fuzzy: Individuals can be MORE IN (more person-days of residence) or MORE OUT (fewer) 1. Temporal Complexity

JFMAMJJASOND● Type A Type B ● Residents (in 000s)

1. Temporal Complexity JFMAMJJASOND● Type A Type B ● Residents (in 000s) Census Population = (83% Type A)

1. Temporal Complexity JFMAMJJASOND● Type A Type B ● Residents (in 000s) An ACS ‘Data Sandwich’ includes samples from all months

1. Temporal Complexity JFMAMJJASOND● Type A Type B ● Residents (in 000s) ACS samples from person-months Avg Population: (65% Type A)

Characteristics change over the Sampling Period  Persons Age Marital Status Employment Education  Housing Units Vacancy Number of Occupants $ Value 1. Temporal Complexity

Rolling ‘Population’ Population formed by sandwiching monthly samples is the average frame of a film, not a snapshot Individuals and housing units with changing characteristics are sampled and caught ‘in motion’. 1. Temporal Complexity

Reference Period Problems Many ‘long-form’ questions refer to retrospective periods:  Income in last 12 months  Place of residence 1 year ago  Child born in last 12 months?  Etc. 1. Temporal Complexity

Time Reference Example  ‘2004’ data from 12 monthly samples taken in Jan04…Dec04  Question on fertility in the 12 months prior to the survey, so there are 12 overlapping periods in ‘2004’ data ‘Jan04’ question covers Jan03-Jan04 ‘Feb04’ question covers Feb03-Feb04 etc. 1. Temporal Complexity

Jan 2004 x x x x x x x x x x x x ● Jan 03Jan 04 Feb x x x x x x x x x x x x ● Mar x x x x x x x x x x x x ● Apr x x x x x x x x x x x x ● May x x x x x x x x x x x x ● Jun x x x x x x x x x x x x ● Jul x x x x x x x x x x x x ●..... Aug x x x x x x x x x x x x ●.... Sep x x x x x x x x x x x x ●... Oct x x x x x x x x x x x x ●.. Nov x x x x x x x x x x x x ●. Dec x x x x x x x x x x x x ● Jan Temporal Complexity

Reference Periods for ‘Last 12 Month’ Questions in 1-year ACS Datasets

Temporal Issues Summarized ‘Data Sandwiches’ contain:  New meaning of ‘population’  Units that change over sampling period (moving targets)  Multiple reference periods for retrospective questions 1. Temporal Complexity

2. Sampling Error Long FormACS Sample Size≈ 17%≈ 24% Taken on…1 day3650 days Released as…1 dataset10+ datasets Simultaneous 100% count? YESNO 2. Sampling Error

Small Samples More overall data from continuous sampling, but… 1-, 3-, or 5-Year Sandwiches have smaller samples than the single, decennial long form survey  more sampling error in published data 2. Sampling Error

Small Samples The problem is especially acute for small areas narrow age groups rare subpopulations e.g., How many unmarried teen births per year in Sevier County, Tennessee? ACS says 0 ± Sampling Error

St. Johns County, FL Year ACS Data for Males BELOW POVERTYABOVE POVERTYPOVERTY RATE AGEEstimateMOEEstimateMOEPercentMOE* /-5623,495+/ / / /-4670+/ /-3635,401+/ / /-2922,787+/ / /-3001,342+/-4600+/ /-3001,995+/-4170+/ ,235+/-6555,387+/ / /-37110,192+/ / /-19411,558+/ / /-39912,794+/ / /-45210,679+/ / /-2005,825+/ /-3.3 *Denominators have MOE≈0 under ACS sampling and weighting design

2. Sampling Error C SEX BY OCCUPATION – Key West, Florida Data Set: American Community Survey 3-Year Estimates ( …etc

Temporal Instability Teenage Birth Rate in a County

Unfortunate Result Aggregating over 1+ years of surveys produces datasets that are often  Unfamiliar and difficult to understand  Still too noisy to be useful for planners and researchers 2. Sampling Error

3. Weighting for Non-Response Long FormACS Sample Size≈ 17%≈ 24% Taken on…1 day3650 days Released as…1 dataset10+ datasets Simultaneous 100% count? YESNO 3. Weighting Problems

Weighting Weighting from Respondents  Total Population requires Population Control Totals: (Place x Age x Sex x Race x Ethnicity x …) 3. Weighting Problems

Decennial Long Form Sample  Control Totals Measured from a simultaneous enumeration of the population (Sample & Census on same day) Only 1 set needed Sample and Population defined identically (resid. on Census Day) 3. Weighting Problems

Continuous Survey  Control Totals Must be estimated (no simultaneous census) Many sets needed (2006, 2007, , , , …) Sample and Population defined differently 3. Weighting Problems

ACS Control Totals (Persons) 3. Weighting Problems ACS responses are weighted to match official intercensal estimates by Year (1 July midpoint snapshot) County (sometimes city) Age Race Sex Hispanic Origin (yes/no)

ACS Control Totals (Persons) 3. Weighting Problems Potential Errors  Estimates are Wrong: Unanticipated internal migration Unanticipated international migration etc  Population Definition don’t match Seasonal fluctuations Different race/ethnic categories

3. Weighting Problems JFMAMJJASOND● Type A Type B ● Census Pop = (83% Type A) Average Pop= (65% Type A) If every year looks like this… Intercensal Estim= (83% Type A)

Weighting Error Example ACS weighting to estimates produces:  Popn too small (Census < Avg Pop)  Popn too “A” (seasonal Bs missed)  Overestimates of vars + correl. with A (e.g., % with college education)  Underestimates of vars - correl. with A (e.g., % single-parent families) 3. Weighting Problems

Opportunities Census Survey Continuous Survey Frequency Recency Sample Error Familiarity 4. Opportunities

Statistical models that exploit likely cell relationships (over times, ages, sexes, places, variables …) could, in principle Opportunities ACS table cells = millions of “seemingly unrelated” maximum likelihood estimates 4. Opportunities Retain frequency & recency Reduce variance of estimates Recover familiar measures

Conclusion 5. Conclusion CONTINUOUS SURVEYS like ACS create  Big Problems for producers and users Unfamiliar, temporally complex data Potentially high sample error Technical problems with weighting  Big Opportunities, IF we can develop appropriate statistical models and practices

5. Conclusion Thanks! ¡Gracias! Obrigado!