Curve fitting Session 2. Method background Disability rates are strongly linked to age However HSE disability rates for single years of age are unstable.

Slides:



Advertisements
Similar presentations
Class 6: Hypothesis testing and confidence intervals
Advertisements

Session 1: Small area estimation introduction Alan Marshall ESDS Government.
Adding geographical detail to social surveys: Estimating local disability prevalence Alan Marshall ESDS Government 15 th April 2010.
Individual level synthetic regression models Session 4.
1 Session 10 Sampling Weights: an appreciation. 2 To provide you with an overview of the role of sampling weights in estimating population parameters.
Household Projections for England Yolanda Ruiz DCLG 16 th July 2012.
Lesson 10: Linear Regression and Correlation
Objectives 10.1 Simple linear regression
Hypothesis Testing making decisions using sample data.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
Sociology 601 Class 8: September 24, : Small-sample inference for a proportion 7.1: Large sample comparisons for two independent sample means.
Objectives (BPS chapter 24)
1 Test a hypothesis about a mean Formulate hypothesis about mean, e.g., mean starting income for graduates from WSU is $25,000. Get random sample, say.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Using SPSS for Inferential Statistics UDP 520 Lab 3 Lin October 30 th, 2007.
Market-based NTA Labor Income and Consumption by Gender Gretchen Donehower Day 4, Session 1, NTA Time Use and Gender Workshop Thursday, May 24, 2012 Institute.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
1 POPULATION PROJECTIONS Session 6 - Introduction to population projections Ben Jarabi Population Studies & Research Institute University of Nairobi.
Correlation & Regression
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Diane Stockton Trend analysis. Introduction Why do we want to look at trends over time? –To see how things have changed What is the information used for?
Inference for regression - Simple linear regression
Correlation and Linear Regression
Klaas Stijnen New Zealand Society of Actuaries Conference 2010 Unemployment models The probability of unemployment © 2010 Deloitte Actuarial Services.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Stata Workshop #1 Chiu-Hsieh (Paul) Hsu Associate Professor College of Public Health
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Consumption calculations with real data Gretchen Donehower Day 3, Session 2, NTA Time Use and Gender Workshop Wednesday, May 23, 2012 Institute for Labor,
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
Module 23: Proportions: Confidence Intervals and Hypothesis Tests, Two Samples This module examines confidence intervals and hypothesis test for.
Copyright © 2010 Pearson Education, Inc. Slide
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
Data Lab # 4 June 16, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Inference ConceptsSlide #1 1-sample Z-test H o :  =  o (where  o = specific value) Statistic: Test Statistic: Assume: –  is known – n is “large” (so.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Module 25: Confidence Intervals and Hypothesis Tests for Variances for One Sample This module discusses confidence intervals and hypothesis tests.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Confidence Intervals and Hypothesis Testing Mark Dancox Public Health Intelligence Course – Day 3.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
Prediction of lung cancer mortality in Central & Eastern Europe Joanna Didkowska.
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
Stats Methods at IC Lecture 3: Regression.
Presentation for Session VI.
One-Sample Inference for Proportions
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Confidence Intervals and Hypothesis Tests for Variances for One Sample
Chapter Six Normal Curves and Sampling Probability Distributions
Chapter 25 Comparing Counts.
Module 22: Proportions: One Sample
Ch. 12 More about regression
CHAPTER 22: Inference about a Population Proportion
Chapter 10: Basics of Confidence Intervals
Weighted Interval Scheduling
Chapter 26 Comparing Counts.
Chapter 8: Estimating With Confidence
Weighted Interval Scheduling
Determining Which Method to use
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Chapter 26 Comparing Counts.
Business Statistics - QBM117
Inference Concepts 1-Sample Z-Tests.
Presentation transcript:

Curve fitting Session 2

Method background Disability rates are strongly linked to age However HSE disability rates for single years of age are unstable We can fit a curve to the disability schedule to smooth the fluctuations Model rates (national or regional)*local population totals

Source: HSE 2001 Mobility disability – England (Males)

Personal care disability – England (males) Source: HSE 2001

Dealing with sampling variability Rates are unreliable particularly where sample sizes are small Smooth fluctuations by fitting a curve

Dealing with sampling variability

What function? Lots of choices Quadratic (y=b 0 +b 1 x+b 2 x 3 +b 3 x 3 Exponential functions Estimation of mortality schedules Statistics Canada use an exponential curve to model disability schedules in Canadian territories

Exponential curve Where: D(x)= the proportion of people with a disability at age x

Practical structure Task 3 – Fit an exponential curve to (England) mobility schedules (with and without weights). Uses saved data from task 2 Task 4 – Fit curves to regional mobility schedules Task 5 – Use your model rates to calculate the number of people with a mobility disability in six districts. (Data provided)

Fitting a curve in stata nl (MO_OBS_RT=exp({a}+{b}*age)) predict pred_MO_UK

Exponential curve – parameter estimates (males) Confidence interval a b

Mobility disability schedules – observed and modelled

Analytic weights Stata treats the rates at each age as being equally reliable. Can use weights to relax this assumption If we assume our rates stem from a binomial process then: Where p x = proportion with a disability at age x and N x equals the number of people sampled at age x.

Calculating weights (task 3) Re-open the HSE data Re-calculate age specific rates (MO_OBS_RT) (as in task 2) egen mobilitycount=count(MO_OBS_RT), by (age sex) gen mobilityweight=mobilitycount/(MO_OBS_RT*(1*MO_OBS_RT))

Model weights – mobility disability

Fitting a curve in stata nl (MO_OBS_RT=exp({a}+{b}*age)) [aweight=mobilityweight] predict pred_MO_UK

Mobility schedules – observed and modelled (with weights) Better fit at youngest ages

Task 4 – regional curves Open HSE data Drop institutional residents (no gora) Are differences in regional rates of mobility disability significant? ( )

Task 4 - regional curves Calculate regional schedules of mobility disability rates by sex age gora: egen MO_num=total(mobility_w) by sex age gora: egen MO_denom=total(count_w) gen MO_OBS_RT=MO_num/MO_denom

Task 4 – regional curves Weights are the same as used for national data (task 3) Regional age patterns of weight very unstable After calculating regional rates and weights: Duplicates drop age sex gora, force

Task 4 –regional curves nl (MO_OBS_RT=exp({a}+{b}*age)) if sex==1&gora==1 [aweight=mobilityweight] predict pred_MO1_M nl (MO_OBS_RT=exp({a}+{b}*age)) if sex==1&gora==2 [aweight=mobilityweight] predict pred_MO2_M Fit curves for each region (males and females)

Task 5 Aim - generate district estimates of the numbers of people with mobility disabilities Practical 1 task 5 dataset.dta a row for each single year of age (10, 11,….84,88) for males and females in each of the six districts Contains the national and regional model rates from tasks 3 and 4 Population counts