INTRODUCTION Despite recent advances in spatial analysis in transport, such as the accounting for spatial correlation in accident analysis, important research.

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

Multilevel modelling short course
Methods for investigating zoning effects Mark Tranmer CCSR.
The Census Area Statistics Myles Gould Understanding area-level inequality & change.
Agent-based Modeling: A Brief Introduction Louis J. Gross The Institute for Environmental Modeling Departments of Ecology and Evolutionary Biology and.
Zone design methods for epidemiological studies Samantha Cockings, David Martin Department of Geography University of Southampton, UK Thanks to: Arne Poulstrup,
Nicky Best and Chris Jackson With Sylvia Richardson Department of Epidemiology and Public Health Imperial College, London
GIS and Spatial Statistics: Methods and Applications in Public Health
Correlation and Autocorrelation
Lecture 3: Chi-Sqaure, correlation and your dissertation proposal Non-parametric data: the Chi-Square test Statistical correlation and regression: parametric.
Clustered or Multilevel Data
Item Response Theory. Shortcomings of Classical True Score Model Sample dependence Limitation to the specific test situation. Dependence on the parallel.
Methods of Geographical Perturbation for Disclosure Control Division of Social Statistics And Department of Geography Caroline Young Supervised jointly.
Measuring local segregation in Northern Ireland Chris Lloyd, Ian Shuttleworth and David McNair School of Geography, Queen’s University, Belfast ICPG, St.
Business Statistics - QBM117 Statistical inference for regression.
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
IS415 Geospatial Analytics for Business Intelligence
Analysis of Clustered and Longitudinal Data
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Area Objects and Spatial Autocorrelation Chapter 7 Geographic Information Analysis O’Sullivan and Unwin.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
Chapter 9 Statistical Data Analysis
Adaptive Kernel Density in Demographic Analysis Richard Lycan Institute on Aging Portland State University.
CSD 5100 Introduction to Research Methods in CSD Observation and Data Collection in CSD Research Strategies Measurement Issues.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
GIScience 2000 Raster Data Pixels as Modifiable Areal Units E. Lynn Usery U.S. Geological Survey University of Georgia.
Spatial Statistics in Ecology: Area Data Lecture Four.
Health Datasets in Spatial Analyses: The General Overview Lukáš MAREK Department of Geoinformatics, Faculty.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Spatial Data Analysis Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What is spatial data and their special.
Methodology for producing the revised back series of population estimates for Julie Jefferies Population and Demography Division Office for.
Chapter 1 Introduction to Statistics. Statistical Methods Were developed to serve a purpose Were developed to serve a purpose The purpose for each statistical.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Sampling Populations Ideal situation - Perfect knowledge Not possible in many cases - Size & cost Not necessary - appropriate subset  adequate estimates.
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
Chapter 13: Correlation An Introduction to Statistical Problem Solving in Geography As Reviewed by: Michelle Guzdek GEOG 3000 Prof. Sutton 2/27/2010.
MDG data at the sub-national level: relevance, challenges and IAEG recommendations Workshop on MDG Monitoring United Nations Statistics Division Kampala,
Chapter Outline 2.1 Estimation Confidence Interval Estimates for Population Mean Confidence Interval Estimates for the Difference Between Two Population.
Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.
1 Research Methods Festival 2008 Zhiqiang Feng 1,2 and Paul Boyle 1 1 School of Geography & Geosciences University of St Andrews 2 The Centre for Census.
1. 2 To be able to determine which of the three measures(mean, median and mode) to apply to a given set of data with the given purpose of information.
GEOG3025 Geographical referencing and the modifiable areal unit problem.
Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 – 10.2.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 2: Correlation of Time Courses of Simultaneous.
Methods for point patterns. Methods consider first-order effects (e.g., changes in mean values [intensity] over space) or second-order effects (e.g.,
An ecological analysis of crime and antisocial behaviour in English Output Areas, 2011/12 Regression modelling of spatially hierarchical count data.
Reliability performance on language tests is also affected by factors other than communicative language ability. (1) test method facets They are systematic.
Copyright © 2011, 2005, 1998, 1993 by Mosby, Inc., an affiliate of Elsevier Inc. Chapter 19: Statistical Analysis for Experimental-Type Research.
GIS September 27, Announcements Next lecture is on October 18th (read chapters 9 and 10) Next lecture is on October 18th (read chapters 9 and 10)
CMSSE Summer School Dots to boxes: Do the size and shape of spatial units jeopardize economic geography estimations? A.Briant, P.-P. Combes, M. Lafourcade.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
EED 401: ECONOMETRICS COURSE OUTLINE
WorkShop 2007 Final Presentation Lu, Zhixiang July, 30, 2007.
CPH Dr. Charnigo Chap. 11 Notes Figure 11.2 provides a diagram which shows, at a glance, what a neural network does. Inputs X 1, X 2,.., X P are.
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
Spatial statistics: Spatial Autocorrelation
ESPON THE MODIFIABLE AREA UNIT PROBLEM
CHAPTER 11 Inference for Distributions of Categorical Data
Lecture 6 Implementing Spatial Analysis
Brief Introduction to Multilevel Analysis
CHAPTER 11 Inference for Distributions of Categorical Data
Why are Spatial Data Special?
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
CHAPTER 11 Inference for Distributions of Categorical Data
Regional accessibility indicators: developments and perspectives
Presentation transcript:

INTRODUCTION Despite recent advances in spatial analysis in transport, such as the accounting for spatial correlation in accident analysis, important research have not been fully addressed. Specifically, the paper examines the problems related to:  Spatial mismatching  The modifiable areal unit problem (MAUP)  Ecological fallacy  Spatial dependency Many of these problems in spatial analysis were previously investigated in ecological analysis, and they are relatively new to many transport researchers. AIM This paper intends to review and assess several data and methodological issues that affect spatial analysis in transport studies, before recommending strategies for addressing these issues in an integrated way, providing conclusions and suggesting future research directions. In the context of this paper, the term “spatial model” refers to a regression model that involves spatial data. SPATIAL MISMATCHING Geo-coded data usually come from different sources, and there may be inconsistencies in terms of the quality of spatial data among these different data sources thus causing a spatial mismatch problem. ECOLOGICAL FALLACY Closely related to the MAUP, the ecological fallacy states that results based on grouped aggregate data may not be applied to the individual units that form the aggregate dataset (Openshaw, 1984). Because of ecological fallacy, some researchers advocate analysing the data at an individual level. However, similar to ecological fallacy, a disaggregate level analysis may be subject to atomistic fallacy which refers to the fallacy of drawing inferences at aggregate level based on individual level data and is the counterpart of the ecological fallacy. SPATIAL DEPENDENCY Two types of spatial correlation exist: Spatial autocorrelation (“everything is related to everything else, but near things are more related than distant things”) “Within group” correlation as spatial units are often naturally clustered and it is expected that spatial units within the same group share similar characteristics Several spatial models that commonly employed to handle such correlation are: Simultaneous autoregressive (SAR) model Conditional autoregressive (CAR) model Spatial filtering model Multilevel modelling technique to account for “within group” correlation Any SAR model can be represented by a CAR model although the converse is not true. One problem which both CAR and SAR models encounter relates to how to specify a neighbouring structure. Different choices of neighbouring structure result in varying degrees of spatial associations, which requires future research for selecting an appropriate one. STRATEGIES OF ANALYSING SPATIAL DATA A good strategy is required to take account of the various issues involved in a spatial analysis. Table 3 below summarises the issues and recommends strategies. Chao Wang, Mohammed Quddus, Tim Ryley, Marcus Enoch, Lisa Davison Corresponding author’s Transport Studies Group, School of Civil and Building Engineering, Loughborough University, UK Spatial models in transport: a review and assessment of methodological issues ( ) Loughborough University Leicestershire, UK, LE11 3TU Tel: +44 (0) MODIFIABLE AREAL UNIT PROBLEM (MAUP) The modifiable areal unit problem (MAUP) refers to situations that occur when the boundary of zones (i.e. zone definition) used in a spatial analysis changes, the statistical inference and interpretation derived from the zones is also different (Openshaw, 1984). It is often the case that the definition of zones used in a spatial analysis is arbitrary and modifiable. For instance, zones are often defined based on political and administrative considerations (e.g. enumeration districts and electoral wards in the UK). The definition of the zones however may not have much geographic meaning, and therefore, the statistical inference based on the zones may also be questionable. MAUP can be illustrated using both real-world (Table 1) and simulated (Table 2) data: Aggregation 1Aggregation 2Aggregation 3 Coefz valueElasticityCoefz valueElasticityCoefz valueElasticity x Intercept Number of observations Pseudo R squared Table 2 The coefficients and elasticities of x under different grouping systems using simulated data (y~Poisson ( x)) LSOAWardDistrict Coefz valueCoefz valueCoefz value Population density (/km 2 ) Employment density (/km 2 ) Proportion of male IMD score Number of cars per person (for age 16-74) Proportion of people who are white Proportion of people in 0-15Reference case Proportion of people in Proportion of people in Proportion of people in Proportion of people in Proportion of people in Proportion of people who mainly work at home compared to all people aged & in employment Number of bus stops Number of train stations Number of tube stations Total area (km 2 ) Intercept Number of observations5, R squared Table 1 The coefficients under different aggregations using 2009 London Travel Demand Survey (LTDS) data and linear regression models Nature of the issueStrategies Spatial mismatching Often arises when spatial data were obtained from different sources, which compromise the data quality  Carefully check the data quality  Employ map-matching techniques such as weighting score method to correctly match spatial units MAUP The definition of zones and road segments is arbitrary and modifiable. Different definitions could lead to different statistical inference  Check the homogeneity of the data analysed  Sensitivity analysis testing parameter estimates under different data aggregations  Use uniform grid cells with a tested ‘optimal’ size  Utilise mathematical clustering scheme based on spatial homogeneity in demographic and transport characteristics Ecological fallacy Related to MAUP. Results based on aggregate data may not apply to individuals  As with MAUP, check if the homogeneity of the data  Use a better sampling/clustering scheme  Employ multilevel models for data that are explicitly defined in a hierarchical structure Spatial dependency Spatial data are correlated due to unobserved effects and data clustering  Check whether spatial data are correlated using various techniques such as Moran’s I test  Identify and employ an appropriate spatial model to control for the spatial autocorrelation, such as SAR, CAR and spatial filter models. For SAR and CAR models, different neighbouring structures need to be tested.  Employ multilevel modelling technique to account for data clustering Table 3 Issues and strategies in a spatial analysis CONCLUSIONS Several important data and methodological issues relating to spatial analyses employed in transport and future research directions have been discussed. Strategies for analysing spatial data has been offered Advanced spatial analysis should be employed in some transport research areas, such as estimating travel demand of demand responsive transport. REFERENCE Openshaw, S., The modifiable areal unit problem. Concepts and techniques in modern geography, (38).