Methods for investigating zoning effects Mark Tranmer CCSR.

Slides:



Advertisements
Similar presentations
Multilevel modelling short course
Advertisements

The Census Area Statistics Myles Gould Understanding area-level inequality & change.
Latent normal models for missing data Harvey Goldstein Centre for Multilevel Modelling University of Bristol.
Brief introduction on Logistic Regression
Objectives 10.1 Simple linear regression
Variance reduction techniques. 2 Introduction Simulation models should be coded such that they are efficient. Efficiency in terms of programming ensures.
Sampling Distributions
The controlled assessment is worth 25% of the GCSE The project has three stages; 1. Planning 2. Collecting, processing and representing data 3. Interpreting.
458 More on Model Building and Selection (Observation and process error; simulation testing and diagnostics) Fish 458, Lecture 15.
Chapter 4 Multiple Regression.
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
1 Regression Analysis Regression used to estimate relationship between dependent variable (Y) and one or more independent variables (X). Consider the variable.
Statistical Background
Topic 3: Regression.
Experimental Evaluation
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Sampling Theory Determining the distribution of Sample statistics.
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
‘Estimating with Confidence’ and hindsight: Population estimates for areas smaller than districts, revisions to levels of 1991 Census non-response Paul.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Estimation and Hypothesis Testing. The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for.
Hypothesis testing – mean differences between populations
1 1 Slide Statistical Inference n We have used probability to model the uncertainty observed in real life situations. n We can also the tools of probability.
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
Copyright © Cengage Learning. All rights reserved. 12 Simple Linear Regression and Correlation.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
CHAPTER 8 Estimating with Confidence
Statistical inference. Distribution of the sample mean Take a random sample of n independent observations from a population. Calculate the mean of these.
Investigating the Accuracy and Robustness of the Icelandic Cod Assessment and Catch Control Rule A. Rosenberg, G. Kirkwood, M. Mangel, S. Hill and G. Parkes.
AP STATISTICS LESSON 10 – 2 DAY 1 TEST OF SIGNIFICANCE.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.
The Central Limit Theorem and the Normal Distribution.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
5-1 ANSYS, Inc. Proprietary © 2009 ANSYS, Inc. All rights reserved. May 28, 2009 Inventory # Chapter 5 Six Sigma.
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
1 Land accounts in Europe – current state and outlook Land accounts 01/10/2015 Daniel Desaulty
GIS September 27, Announcements Next lecture is on October 18th (read chapters 9 and 10) Next lecture is on October 18th (read chapters 9 and 10)
Chapter 11: The ANalysis Of Variance (ANOVA)
1 Chi-square Test Dr. T. T. Kachwala. Using the Chi-Square Test 2 The following are the two Applications: 1. Chi square as a test of Independence 2.Chi.
§2.The hypothesis testing of one normal population.
Monitoring of a condition of economic systems on the basis of the analysis of dynamics of entropy A.N. Tyrsin 1, O.V. Vorfolomeeva 2 1 – Science and Engineering.
Module 25: Confidence Intervals and Hypothesis Tests for Variances for One Sample This module discusses confidence intervals and hypothesis tests.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Building Valid, Credible & Appropriately Detailed Simulation Models
STATISTICS People sometimes use statistics to describe the results of an experiment or an investigation. This process is referred to as data analysis or.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Exercise 1 Content –Covers chapters 1-4 Chapter 1 (read) Chapter 2 (important for the exercise, 2.6 comes later) Chapter 3 (especially 3.1, 3.2, 3.5) Chapter.
INTRODUCTION Despite recent advances in spatial analysis in transport, such as the accounting for spatial correlation in accident analysis, important research.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Stats Methods at IC Lecture 3: Regression.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
Inference about the slope parameter and correlation
Linear Regression with One Regression
Confidence Intervals and Hypothesis Tests for Variances for One Sample
i) Two way ANOVA without replication
S2 Chapter 6: Populations and Samples
CONCEPTS OF HYPOTHESIS TESTING
CONCEPTS OF ESTIMATION
Inferences and Conclusions from Data
More about Posterior Distributions
Stat 217 – Day 28 Review Stat 217.
Basic Practice of Statistics - 3rd Edition Introduction to Inference
Hypothesis Testing: The Difference Between Two Population Means
Regression Analysis.
Presentation transcript:

Methods for investigating zoning effects Mark Tranmer CCSR

Allowing for area effects Suppose we have some area level information Such as: aggregate information for a particular set of areal units e.g. wards; EDs; Output Areas; Districts Or individual level data with area indicators.

Allowing for these effects in our analyses Then we might say great! Ill fit a multilevel model – especially if we have individual level data with area indicators. Or we might calculate correlations etc at the area level from aggregate area level data.

But … If we calculate area level correlations because we want to make inferences about individuals that live in those areas but we only have area level data… problem: ecological fallacy So lets suppose we can actually do an analysis using individual level data with area indicators … e.g. a multilevel model. Hence simultaneously allowing for individual and area level effects. Does that solve the problem?

No, because … What do we mean by an area? Modifiable Areal Unit Problem (MAUP) Analyses that involve areas are affected by The average population size of those areas: scale effects

No, because … Once we choose a particular scale, they are also affected by the way in which those areas are defined. I.e. the choice of boundaries: Zoning effects. Also: Scope effects? What is the overall region of study? This will have implications for the extent of variation.

Zoning effects example Suppose we have a region that contains a 9 areal units of equal population, and we want to make a ward from three of these contiguous units.

Zoning effects example Ward A1

Zoning effects example Ward B1

Zoning effects example Overlay wards A1 and B1

Zoning effects example We can also do the same thing for the other wards: e.g.

Im interested in developing a statistical framework to investigate these effects I think a cross-classified multilevel model might be the way to tackle the problem What I hope to do is to find a way to assess the nature and extent of zoning effects at a particular scale.

Two level model(s)

Cross-classified model

How to test this idea Simulated data: I set up a simulation study I generated some simulated data for a normally distributed variable. Each of the 9 cells in the grid has a different (but known) mean and within each of the 9 cells I set the variance to be equal (25). So I aimed to simulate complex between-cell variation (whilst knowing the procedure I had applied to induce that variation).

I assumed these zonings

Results Two level models * Variance component estimates WardIndiv A, person B, person Cell,person Cross-classified models Estimated parameter: Var(A)Var(B)Var(A*B)Var(Indiv) A,B,cell,person

Conclusion I think we have a framework for investigating the causes of zoning effects It seems to work for simulated data, though I have yet to fully work out what these results mean Can anyone suggest to me some real data that investigate using this methodology.