Title Methodological considerations in fine-scale spatial analysis: point pattern investigation of discarded syringes used in public injection of illicit.

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

One-sample T-Test Matched Pairs T-Test Two-sample T-Test
Chapter 7 Hypothesis Testing
1 Analyzing HIV Prevalence Trends from Antenatal Clinic (ANC) Sentinel Surveillance Data and Serosurveillance Data from High Risk Groups* Ray Shiraishi.
Introduction to Smoothing and Spatial Regression
T-tests continued.
PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
Random Sampling and Data Description
Zakaria A. Khamis GE 2110 GEOGRAPHICAL STATISTICS GE 2110.
Spatial Statistics II RESM 575 Spring 2010 Lecture 8.
GIS and Spatial Statistics: Methods and Applications in Public Health
Hypothesis testing Week 10 Lecture 2.
Correlation and Autocorrelation
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Inferences About Process Quality
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Statistics for Managers Using Microsoft® Excel 5th Edition
Richard M. Jacobs, OSA, Ph.D.
Geographic Information Science
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
The Spatial Scan Statistic. Null Hypothesis The risk of disease is the same in all parts of the map.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Fundamentals of Hypothesis Testing: One-Sample Tests
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
RESEARCH A systematic quest for undiscovered truth A way of thinking
IS415 Geospatial Analytics for Business Intelligence Lesson 10: Geospatial Data Analysis- Point Patterns Analysis.
Adaptive Kernel Density in Demographic Analysis Richard Lycan Institute on Aging Portland State University.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Statistical Review We will be working with two types of probability distributions: Discrete distributions –If the random variable of interest can take.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Health Datasets in Spatial Analyses: The General Overview Lukáš MAREK Department of Geoinformatics, Faculty.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Chapter 4 – Distance methods
Applications of Spatial Statistics in Ecology Introduction.
New Advanced Higher Subject Implementation Events Statistics Unit Assessment at Advanced Higher.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
A Comparison of Zonal Smoothing Techniques Prof. Chris Brunsdon Dept. of Geography University of Leicester
Spatial Statistics in Ecology: Point Pattern Analysis Lecture Two.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Point Pattern Analysis Point Patterns fall between the two extremes, highly clustered and highly dispersed. Most tests of point patterns compare the observed.
Chapter 10 The t Test for Two Independent Samples
Methods for point patterns. Methods consider first-order effects (e.g., changes in mean values [intensity] over space) or second-order effects (e.g.,
Point Pattern Analysis
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.
INTRODUCTION TO HYPOTHESIS TESTING From R. B. McCall, Fundamental Statistics for Behavioral Sciences, 5th edition, Harcourt Brace Jovanovich Publishers,
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
2005 Unbinned Point Source Analysis Update Jim Braun IceCube Fall 2006 Collaboration Meeting.
Zakaria A. Khamis GE 2110 GEOGRAPHICAL STATISTICS GE 2110.
Multiple comparisons problem and solutions James M. Kilner
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
METU, GGIT 538 CHAPTER V MODELING OF POINT PATTERNS.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Lecture Slides Elementary Statistics Twelfth Edition
Density Estimation Converts points to a raster
Probability and Statistics
Spatial analysis Measurements - Points: centroid, clustering, density
Summary of Prev. Lecture
Chapter 2 Simple Comparative Experiments
By Lewis Dijkstra, PhD Deputy Head of the Economic Analysis Unit,
Discrete Event Simulation - 4
Spatial Point Pattern Analysis
Special Topics in Geo-Business Data Analysis
Presentation transcript:

Title Methodological considerations in fine-scale spatial analysis: point pattern investigation of discarded syringes used in public injection of illicit drugs Mapping and Analysis for Public Safety September 2005 Savannah, Georgia Luc de Montigny ( U.WASHINGTON.EDU ) University of Washington Urban Design and Planning

Outline Overview of the case study – the big picture Issues associated with fine-scale analysis of point data •Constrained opportunity surface •Non-observations as data Investigation of two novel approaches •Kernel Density Estimate ratios •Random Labeling (SPPA) Discussion and conclusion Luc de Montigny – MAPS – 2005

Context: the Case Study Analyze the distribution of syringes found in the most active hard-drug use neighborhood of Montréal, Canada. Luc de Montigny – MAPS – 2005 Research Questions •Where do “needle-drops” cluster? •Why are some areas more affected than others? •How effective are current interventions (drop-boxes, NEP)? Ultimate Goals •Understand public injection behavior •Educate CPTED initiatives Syringes: n=4,172

Macro-Scale Analysis Crime – like disease – is often analyzed for large areas (states, counties, cities). Large extents usually mean low resolution (big units of analysis) and aggregation of data. Discrete events are pooled; point values become area counts (points -> surface). Traditional geo/spatial statistical analyses can be used. Underlying assumptions effectively hold. Luc de Montigny – MAPS – 2005

Micro-Scale Analysis There are compelling reasons to push analysis to finer spatial resolutions. •Substantive (analysis scale = intervention scale) •Methodological (MAUP) Fine-scale analysis introduces new challenges to old tools. •Why? •What to do about it? Luc de Montigny – MAPS – 2005

Methodological Implications of Micro-Scale Analysis Crime events are not points sampled from a continuous surface; they represent observations of discrete events. This is a different type of pattern resulting from different types of processes. This distinction has implications, two of which are discussed: 1.Non-observations constitute useful data 2.The area of opportunity may not be continuous Luc de Montigny – MAPS – 2005

1) Non-Observations as Data Assuming an exhaustive sampling strategy (e.g., documentation of all police reports), units of analysis that do not host an event represent a none-event. There is a difference between “zero” and “no data.” •Problem: a useful source of information is ignored •Proposed Solution: borrow case/control approaches developed in epidemiology Luc de Montigny – MAPS – 2005

Using non-observations: Random Labeling Comparison of the spatial distribution of events (cases) to the spatial distribution of non-events (controls). •Cases: points where syringes were found •Controls: random points where syringes were not found Used to assess whether clustering in the events is greater than what is expected due to environmental heterogeneity. •Here we use Ripley’s K function to summarize the spatial point patterns. •D (d ) = K cases (d ) — K controls (d ) Luc de Montigny – MAPS – 2005

Random Labeling – Significance Luc de Montigny – MAPS – 2005 To assess the significance of the difference between K cases (d ) and K controls (d ), generate simulation envelopes: •pool the points (cases + controls); •randomly assign “case” status to n case points; •and calculate the summary function; •repeat X number of times. •The maximum and minimum values for each distance bin are taken from all iterations of the simulation. Under the null hypothesis: •K cases (d ) = K controls (d ) = K random assignment (d )

Random Labeling – Results K 1 : observations (cases) K 2 : non-observations (controls) Non-flat curve* indicates difference between spatial distribution of cases from distribution of controls: clustering over and above that of environmental heterogeneity. Peaks outside the simulation envelope should be considered significant. Luc de Montigny – MAPS – 2005 *D (d )=K cases (d ) - K controls (d ) ˆ

2) Constrained Opportunity Surface In many situations, events can occur in some spaces, but not others. •Problem: increased likelihood of type II error •Proposed Solution: constrain the opportunity surface to the area where events can be observed, i.e. explicitly define a spatial sample frame Luc de Montigny – MAPS – 2005

Delimiting the Sample Frame Where can syringes be found? •Alleys and sidewalks •Parks •Parking lots and vacant lots Sample frame ≈ 0.3 Study area Luc de Montigny – MAPS – 2005

Cluster Analysis using Kernel Density Estimates (KDE) KDE is a form of surface modeling – values are estimated for locations between data points. KDE can be extended to estimate the intensity of one type of point data relative to another. •In epidemiology, KDE are calculated for both events (cases) and for populations at risk (controls), to control for uneven distributions of population. •This approach can be adapted for use here: density of events (distribution of syringes) can be normalized by density of opportunity (distribution of sample frame). Luc de Montigny – MAPS – 2005

The “smoothed” surface represents the intensity of discarded syringes within the search radius, or bandwidth (100m) of any given location in the study area (i.e., for every grid cell). KDE – Syringe Points Luc de Montigny – MAPS – 2005 PVC lines represent the boundary of the area that contains 90% of the volume of a probability density distribution; on average 90% of the points that were used to generate the KDE are contained within the lines.

KDE – Sample Frame Here the sample frame is converted to a grid (10m), and the centroid of each cell is used for the purposes of the kernel density estimation. Luc de Montigny – MAPS – 2005

KDE – Syringe/Sample Ratio The ratio surface represents, for each grid cell, the syringe point KDE value divided by the square of the sample frame KDE value. Luc de Montigny – MAPS – 2005

KDE – Comparison A comparison of how syringe points cluster in the study area (simple density estimate), to how those same points cluster within the sample space (the ratio between the two density estimates). These results suggest that the distribution (clustering) of syringes is due to factors other than the distribution of opportunity. Luc de Montigny – MAPS – 2005

Caveats and Limitations Random Labeling •Huge departure from envelope is due to mis-specifying the null hypothesis (only “proving” the obvious) – should use different null hypothesis. •K function assumes stationarity; probably violated in this case – should use inhomogenous function. Kernel Density Estimate ratios •The intensity of opportunity (sample space density estimates) is measured in an arbitrary way. The choice of grid resolution, and bandwidth size are influential to the density estimate, yet are not grounded in theory. •Density surfaces should be “clipped” to areas within the sample frame for the purposes of visualization and analysis. Luc de Montigny – MAPS – 2005

Summary •Most events studied in criminology are the result of point processes (point patterns). •Tools designed for the analysis of surfaces may not be appropriate for criminology. •Popular analytic techniques have underlying assumptions that are violated at the micro-scale. •Ignoring the above can result in erroneous results (type II error, model mis-specification) Contact information: U.WASHINGTON.EDU Luc de Montigny – MAPS – 2005

Acknowledgements This research would not be possible without the hard work and collaboration of Spectre de rue, Montréal. Luc de Montigny – MAPS – 2005

Selected Reading & Software References •Gatrell AC, Bailey TC, Diggle PJ, Rowlingson BS (1996) Spatial point pattern analysis and its application in geographic epidemiology. Transactions of the Institute of British Geographers 21: •Walter C, McBratney AB, Viscarra Rossel RA, Markus JA (2005) Spatial point-process statistics: concepts and application to the analysis of lead contamination in urban soil. Environmetrics 16: •Beyer HL (2005) Hawth's Analysis Tools for ArcGIS. Available at •Rowlingson BS, Diggle PJ, Bivand R (2005) The splancs package for R. Available at •Baddeley A, Turner R (2005) The spatstat package for R. Available at •Lewin-Koh NJ, Bivand R (2005) The maptools package for R. Available at Luc de Montigny – MAPS – 2005

The K-function describes the degree to which there is spatial dependence in the arrangement of events K(d) = λ -1 E[number of events within d from a randomly selected event] Where λ is the intensity, and E[] the expectation Formally: Where, •R is the region (extent) •I is a binary indicator function •w is the proportion of the search radius that falls within R Appendix – Ripley’s K Summary Function Luc de Montigny – MAPS – 2005