Spatial Clusters and Pattern Analysis

Slides:



Advertisements
Similar presentations
Sources and effects of bias in investigating links between adverse health outcomes and environmental hazards Frank Dunstan University of Wales College.
Advertisements

Global Clustering Tests. Tests for Spatial Randomness H 0 : The risk of disease is the same everywhere after adjustment for age, gender and/or other covariates.
11 Pre-conference Training MCH Epidemiology – CityMatCH Joint 2012 Annual Meeting Intermediate/Advanced Spatial Analysis Techniques for the Analysis of.
Spatial Autocorrelation using GIS
Spatio – Temporal Cluster Detection Using AMOEBA
Statistical approaches for detecting clusters of disease. Feb. 26, 2013 Thomas Talbot New York State Department of Health Bureau of Environmental and Occupational.
Early Detection of Disease Outbreaks Prospective Surveillance.
SOCIOLOGY/ANTHROPOLOGY Soc 122 Introduction to Sociology General Education Requirement Category: Social Science.
Zakaria A. Khamis GE 2110 GEOGRAPHICAL STATISTICS GE 2110.
Introduction to Spatial Regression Glen Johnson, PhD Lehman College / CUNY School of Public Health
Empirical/Asymptotic P-values for Monte Carlo-Based Hypothesis Testing: an Application to Cluster Detection Using the Scan Statistic Allyson Abrams, Martin.
Local Measures of Spatial Autocorrelation
GIS and Spatial Statistics: Methods and Applications in Public Health
Correlation and Autocorrelation
 Statistical approaches for detecting unexplained clusters of disease.  Spatial Aggregation Thomas Talbot New York State Department of Health Environmental.
Geographic Information Systems (GIS) for Epidemiology and Public Health Dr. Ming-Hsiang Tsou Department of Geography, San Diego State University PPT slides:
Department of Geography, San Diego State University
Department of Geography, San Diego State University
The process of [social research theory/model/framework conceptual relationships hypotheses working hypotheses and measurement research design data collection.
Measuring local segregation in Northern Ireland Chris Lloyd, Ian Shuttleworth and David McNair School of Geography, Queen’s University, Belfast ICPG, St.
Chance Is the association causal? RR = 7 Detectives in the Classroom – Investigation 3-3: Chance.
Why Geography is important.
Innovative Uses of Geographic Information Systems Lance A. Waller Department of Biostatistics Rollins School of Public Health Emory University
Tse-Chuan Yang, Ph.D The Geographic Information Analysis Core Population Research Institute Social Science Research Institute Pennsylvania State University.
Spatial Statistics for Cancer Surveillance Martin Kulldorff Harvard Medical School and Harvard Pilgrim Health Care.
Global Measures of Spatial Autocorrelation
Geographic Information Science
Concepts and Challenges
Point Pattern Analysis
A Very spatial Presentation. ANCIENT BABYLONIAN CLAY TABLETS DEPICT THE EARTH AS A FLAT CIRCULAR DISK EARLIEST DIRECT EVIDENCE OF MAPPING COMES FROM THE.
Area Objects and Spatial Autocorrelation Chapter 7 Geographic Information Analysis O’Sullivan and Unwin.
Using ArcGIS/SaTScan to detect higher than expected breast cancer incidence Jim Files, BS Appathurai Balamurugan, MD, MPH.
The Spatial Scan Statistic. Null Hypothesis The risk of disease is the same in all parts of the map.
EUROHEIS 2 Dr Linda Beale October 2007 – September 2010.
Exploratory Analysis of Disease Data & Introduction to UNC’s GIS Reference Library Prepared originally by Kristen Hampton Updated and maintained by Ben.
Exploratory Analysis of Disease Data & Introduction to UNC’s GIS Reference Library.
Study Designs Afshin Ostovar Bushehr University of Medical Sciences Bushehr, /4/20151.
Role of Statistics in Geography
Spatial Data Analysis Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What is spatial data and their special.
Cluster Detection Comparison in Syndromic Surveillance MGIS Capstone Project Proposal Tuesday, July 8 th, 2008.
Spatial Data Mining Satoru Hozumi CS 157B. Learning Objectives Understand the concept of Spatial Data Mining Understand the concept of Spatial Data Mining.
GIS is about geography and about thinking geographically Demers,
Geographical Data and Measurement Geography, Data and Statistics.
Local Indicators of Categorical Data Boots, B. (2003). Developing local measures of spatial association for categorical data. Journal of Geographical Systems,
Point Pattern Analysis
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Spatial analysis: a roadmap David O’Sullivan University of Auckland School of Geography and Environmental Science
Descriptive study design
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Statistical Significance: Tests for Spatial Randomness.
Patterns and Trends CE/ENVE 424/524. Classroom Situation Option 1: Stay in Lopata House 22 pros: spacious room desks with chairs built in projector cons:
GIS Software Applications in Epidemiology Marcus Liscombe Brent Croft GISC GIS MANAGEMENT AND IMPLEMENTATION.
Headlines Introduction General concepts
1 Copyright © 2012 by Mosby, an imprint of Elsevier Inc. Copyright © 2008 by Mosby, Inc., an affiliate of Elsevier Inc. Chapter 24 Public Health Surveillance.
Spatial Scan Statistic for Geographical and Network Hotspot Detection C. Taillie and G. P. Patil Center for Statistical Ecology and Environmental Statistics.
GEOGRAPHIC CLUSTERS OF HEAD & NECK CANCER IN FLORIDA Recinda Sherman, MPH, CTR Florida Cancer Data Systems NAACCR Detroit, June 7, 2007.
WorkShop 2007 Final Presentation Lu, Zhixiang July, 30, 2007.
General Elliptical Hotspot Detection Xun Tang, Yameng Zhang Group
Spatial statistics Lecture 3 2/4/2008. What are spatial statistics Not like traditional, a-spatial or non-spatial statistics But specific methods that.
Key Question What is human geography? © 2012 John Wiley & Sons, Inc. All rights reserved.
Introduction to Spatial Statistical Analysis
Aldo Aviña Environmental and Occupational Health
Dept of Biostatistics, Emory University
Chapter 2: The Pitfalls and Potential of Spatial Data
The Role of Poverty in Prostate Cancer in African-Americans
Spatial Point Pattern Analysis
Epidemiology: the branch of medicine that deals with the incidence, distribution, and possible control of diseases and other factors relating to health.
Why are Spatial Data Special?
Software Cluster, Aldrich-Wayne Free, Old
Thinking Geographically
Presentation transcript:

Spatial Clusters and Pattern Analysis Chris Jochem Geog 5161 – Spring 2011

When you know ‘where’, you can start to ask ‘why’ John Snow’s map of cholera deaths in London, 1854. Need to move beyond simply mapping events and beyond general point pattern analysis. Water pump locations

Goals of Cluster Analysis Examine “unusual” groupings of events in space and/or time (Cromley and McLafferty 2002) Both confirmatory and exploratory of hypotheses Different ways to operationalize unusual or unexpected patterns using probability distributions Common Questions (Waller and Gotway 2004, 155): Do cases tend to occur near other cases? (possible infectious agent) Does a particular area within the study region seem to contain a significant excess of observed events? (possible environmental risk factor) Where are the most unusual collection of cases? (possible cluster)

Different Methodologies … for different levels of analysis Point Pattern Analysis: density and distance measurements Ex: density map of cholera cases Clustering requires different statistical tests often used sequentially or as part of a larger study to select areas for more detailed field work. Three main categories of tests: Global Local Focal

3 main Categories of Tests Global: a single test for general patterns and spatial autocorrelation over an entire study region Moran’s I Geary’s C Local: search for specific regions or areas where clustering is observed above expected levels Example: areas of high crime or terrorist attacks Local Moran’s I Getis-Ord Gi* Spatial Scan Statistic Focal: specialized statistics searching only in regions around fixed locations Example: cancers around nuclear reactors Stone’s Test Bithell Tango

Consider your Data Point Data Events (diseases crimes, conflicts, etc.) Cases/Controls Measurement locations Considerations Point level accuracy Polygon Data Census or social attributes (poverty, unemployment, income, etc.) Aggregate counts of individual-level events Considerations Modifiable areal unit problem (MAUP)

Spatial Scan Statistic As implemented in SatScan® software Input point or area data for events and background population, can vary over time Pass a circular or elliptical filter of varying radii across study area Count observed cases and test likelihood ratio against expected cases given the population or person-time Pros Cons Spatial, temporal, or space-time clusters Controls for risk factors and covariates Learning curve for set-up and interpretation No graphical output

Examples O’Loughlin, John and Frank D. W. Witmer. 2011. The Localized Geographies of Violence in the North Caucasus of Russia, 1999-2007. The Annals of the Association of American Geographers 101, no. 1 (January): 178 – 201. Using a spatial scan statistic to find local clusters of conflicts in space and time.

Examples Kulldorff M, Athas W, Feuer E, Miller B, Key C. Evaluating cluster alarms: A space-time scan statistic and brain cancer in Los Alamos. American Journal of Public Health, 1998; 88:1377-1380. See the demo! Many additional examples: http://www.satscan.org/references.html

Considerations and Critiques Must consider data limitations and accuracies How do you define a ‘cluster’?  expected outcomes Possibility of occurring by chance, especially with small numbers Based on theory or hypothesized relationships “Texas Sharpshooter Fallacy”

Considerations and Critiques Must consider underlying population at risk People are not evenly distributed Complete spatial randomness is usually not valid Difficult to link causality to clusters (Elliot et al. 2000, Elliot and Wakefield 2001) Usually requires further studies What matters is scientific, not statistical, significance (Gould 1970) See also O’Sullivan and Unwin (2003), and Harvey (1966,1967)

Resources Free Software: SatScan: http://www.satscan.org/ CrimeStat: http://www.icpsr.umich.edu/CrimeStat/ GeoDa: http://geodacenter.asu.edu/projects/opengeoda R packages: http://cran.r-project.org/web/views/Spatial.html Broad Street Cholera Data: http://www.asdar-book.org/datasets.php?dataset=4

References Anselin, Luc. 2006. How (not) to lie with spatial statistics. American Journal of Preventive Medicine 30: s3-s6. Cromley, Ellen K. and Sara L. McLafferty. 2002. GIS and Public Health. New York: Guilford Press. Elliott, Paul and Jon Wakefield. 2001. Disease clusters: Should they be investigated, and, if so, when and how? Journal of the Royal Statistical Society A 164, 1: 3-12. Elliott, Paul, Jon Wakefield, Nicola Best, and David Briggs. 2000. Spatial Epidemiology: Methods and Applications. Oxford University Press. Gould, Peter. 1970. Is statistix inferens the geographic name for a wild goose? Economic Geography 46 (June): 439-448. Harvey, David W. 1966. Geographical processes and the analysis of point patterns: Testing models of diffusion by quadrat sampling. Transactions of the Institute of British Geographers 40: 81-95. Harvey, David W. 1967. Some methodological problems in the useof Neyman type A and negative binomial distribution for the analysis of point patterns. Transactions of the Institute of British Geographers 44: 81-95.

References Kulldorff, Martin, and Neville Nagarwalla. 1995. Spatial disease clusters: Detection and inference. Statistics in Medicine 14: 799-810. Kulldorff, Martin, W. Athas, E. Feuer, B. Miller, and C. Key. 1998. Evaluating cluster alarms: A space-time scan statistic and brain cancer in Los Alamos. American Journal of Public Health 88:1377-1380. Kulldorff, Martin. 1997. A spatial scan statistic. Communications in Statistics – Theory and Methods 26, no. 6: 1481-1496. Kulldorff, Martin. and Information Management Services, Inc. SaTScanTM v8.0: Software for the spatial and space-time scan statistics. http://www.satscan.org/, 2009. O’Loughlin, John and Frank D. W. Witmer. 2011. The Localized Geographies of Violence in the North Caucasus of Russia, 1999-2007. The Annals of the Association of American Geographers 101, no. 1 (January): 178 – 201. O’Sullivan, David and David J. Unwin. 2003. Geographic Information Analysis. Hoboken, New Jersey: John Wiley and Sons. Olsen, Sjurdur F., Marco Martuzzi, and Paul Elliott. 1996. Cluster analysis and disease mapping – why, when, and how? A step by step guide. British Medical Journal 313 (October): 863-866. Waller, Lance A. and Carol A. Gotway. 2004. Applied Spatial Statistics for Public Health Data. Hoboken, New Jersey: John Wiley and Sons.

Spatial Clusters and Pattern Analysis Chris Jochem Geog 5161 – Spring 2011