Descriptive Statistics for Spatial Distributions Chapter 3 of the textbook Pages 76-115.

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

Class Session #2 Numerically Summarizing Data
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Table of Contents Exit Appendix Behavioral Statistics.
QUANTITATIVE DATA ANALYSIS
Regionalized Variables take on values according to spatial location. Given: Where: A “structural” coarse scale forcing or trend A random” Local spatial.
CHAPTER 3: Statistical Description of Data
Chapter 5: Variability and Standard (z) Scores How do we quantify the variability of the scores in a sample?
Measures of Central Tendency
Today: Central Tendency & Dispersion
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Objectives 1.2 Describing distributions with numbers
May 06th, Chapter - 7 INFORMATION PRESENTATION 7.1 Statistical analysis 7.2 Presentation of data 7.3 Averages 7.4 Index numbers 7.5 Dispersion from.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Chapter 1Chapter 2Chapter 3 Chapter 4 Chapter 6 Chapter 7Chapter 11 Chapter 13Chapter 12.
Data Handbook Chapter 4 & 5. Data A series of readings that represents a natural population parameter A series of readings that represents a natural population.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
Descriptive Statistics: Numerical Methods
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
Psyc 235: Introduction to Statistics Lecture Format New Content/Conceptual Info Questions & Work through problems.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
 The outer boundary of a circle or other curvilinear figure is…… circumference  One part or unit of each hundred or a fraction is called a…… percent.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Descriptive Statistics Descriptive Statistics describe a set of data.
Chapter 4 – Descriptive Spatial Statistics Scott Kilker Geog Advanced Geographic Statistics.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Understanding Your Data Set Statistics are used to describe data sets Gives us a metric in place of a graph What are some types of statistics used to describe.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Probability and Statistics in Geology Probability and statistics are an important aspect of Earth Science. Understanding the details, population of a data.
Central Tendency & Dispersion
What’s the Point? Working with 0-D Spatial Data in ArcGIS
Point Pattern Analysis Point Patterns fall between the two extremes, highly clustered and highly dispersed. Most tests of point patterns compare the observed.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Introduction to statistics I Sophia King Rm. P24 HWB
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
1 Day 1 Quantitative Methods for Investment Management by Binam Ghimire.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
Chapter 2 Describing and Presenting a Distribution of Scores.
Welcome to… The Exciting World of Descriptive Statistics in Educational Assessment!
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Latitude and Longitude.  A grid system that allows us to pinpoint exact areas on the Earth.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
7 th Grade Math Vocabulary Word, Definition, Model Emery Unit 4.
Exploratory Data Analysis
Statistical Methods Michael J. Watts
Different Types of Data
Statistical Methods Michael J. Watts
Measures of Central Tendency
PROGRAMME 27 STATISTICS.
COORDINATE PLANE The plane containing the "x" axis and "y" axis.
Numerical Descriptive Measures
Basic Statistical Terms
Chapter 1: Exploring Data
Ticket in the Door GA Milestone Practice Test
Biostatistics Lecture (2).
Presentation transcript:

Descriptive Statistics for Spatial Distributions Chapter 3 of the textbook Pages

Descriptive Statistics for Point Data Also called geostatistics Used to describe point data including: –The center of the points –The dispersion of the points

Descriptive spatial statistics: Centrality Assume point data. Example types of geographic centers: –U.S. physical center –U.S. population center Mean center Median center

Mean Center (Centroid) A centroid is the arithmetic mean (a.k.a. the “center of mass”) of a spatial data object or set of objects, which is calculated mathematically In the simplest case the centroid is the geographic mean of a single object I.e., imagine taking all the points making up the outer edge of of a polygon, adding up all the X values and all the Y values, and dividing each sum by the number of points. The resulting mean X and Y coordinate pair is the centroid. For example: the center of a circle or square

Mean Center (Centroid) A more complicated case is when a centroid is the geographic mean of many spatial objects This type of centroid would be calculated using the geographic mean of all the objects in one or more GIS layer I.e., the coordinates of each point and/or of each individual polygon centroid are used to calculate an overall mean For example: the center of a population

Mean Center (Centroid) in Irregular Polygons Where is the centroid for the following shapes? In these cases the true centroid is outside of the polygons

Measures of Central Tendency – Arithmetic Mean A standard geographic application of the mean is to locate the center (centroid) of a spatial distribution Assign to each member a gridded coordinate and calculating the mean value in each coordinate direction -- > Bivariate mean or mean center This measure minimizes the squared distances For a set of (x, y) coordinates, the mean center is calculated as:

Weighted Mean Center Calculated the same as the normal mean center, but with an additional Z value multiplied by the X and Y coordinates This would be used if, for example, the points indicated unequal amounts (e.g., cities with populations)

Manhattan Median The point for which half of the distribution is to the left, half to the right, half above and half below For an even number of points there is no exact solution For an odd number of points the is an exact solution The solution can change if we rotate the axes May also called the bivariate median

Manhattan Median Equation The book describes this as something created graphically (e.g., drawing lines between points) However it can be calculated by using the median X and Y values If there are an even number of points the Manhattan median is actually a range

Euclidian Median The point that minimizes aggregate distance to the center For example: if the points were people and they all traveled to the a single point (the Euclidian Median), the total distance traveled would be minimum May also called the point of Minimum Aggregate Travel (MAT) or the median center

Euclidian Median Point that minimizes the sum of distances Must be calculated iteratively Iterative calculations: –When mathematical solutions don’t exist. –Result from one calculation serves as input into next calculation. –Must determine: Starting point Stopping point Threshold used to stop iterating This may also be weighted in the same way we weight values for the mean center

Euclidian Median Equations

Measures of Central Tendency How do they differ? Mean center: –Minimizes squared distances –Easy to calculate –Affected by all points Manhattan Median: –Minimizes absolute deviations –Shortest distances when traveling only N-S and/or E-W –Easy to calculate –No exact solution for an even number of points Euclidian Median: –True shortest path –Harder to calculate (and no exact solution)

Dispersion: Standard Distance Standard distance –Analogous to standard deviation –Represented graphically as circles on a 2-D scatter plot

Dispersion (not discussed in textbook) Average distance –Often more interesting –Distances are always positive, so average distance from a center point is not 0. Relative distance –Standard distance is measured in units (i.e. meters, miles). –The same standard distance has very different meanings when the study area is one U.S. state vs. the whole U.S. –Relative distance relates the standard distance to the size of the study area.

Dispersion: Quartilides Quartilides are determined like the Manhattan median, but for only X or Y, not both Similar to quantiles (e.g., percentiles and quartiles) from chapter 2, but in 2-D Examples: Northern, Southern, Eastern, Western

Pattern Analysis This will be discussed in greater detail later in the class, but some of these measures start hinting at things like clustering

Directional Statistics Directional statistics are concerned with… Characterizing and quantifying direction is challenging, in part, because 359 and 0 degrees are only one degree apart To deal with this we often use trigonometry to make measurements easier to use For example, taking the cosine of a slope aspect measurement provides an indication of north or south facing

Directional Graphics Circular histogram –Bins typically assigned to standard directions 4 – N, S, E, W 8 – N, NE, E, SE, S, SW, W, NW 16 – N, NNE, NE, ENE, E, ESE, SE, SSE, S, SSW, SW, WSW, W, WNW, NW, NNW Rose diagram –May used radius length or area (using radius ^0.5) to indicate frequency

Directional Statistics Directional Mean –Assumes all distances are equal –Calculates a final direction angle –An additional equation is required to determine the quadrant –Derived using trigonometry Unstandardized variance –Tells the final distance, but not the direction Circular Variance –Based on the unstandardized variance –Gives a standardized measure of variance –Values range from 0 to 1, with 1 equaling a final distance of zero

Problems Associated With Spatial Data Boundary Problem Scale Problem Modifiable Units Problem Problems of Pattern

Boundary Problem Can someone give me a concise definition of the boundary problem? Which of these boundaries are “correct” and why? How can we improve the boundaries?

Scale Problem Also referred to as the aggregation problem When scaling up, detail is lost Scaling down creates an ecological fallacy

Modifiable Units Problem Also called the Modifiable Area Units Problem (MAUP) Similar to scaling problems because they also involve aggregation The take home message is that how we aggregate the input units will impact the values of the output units A real world example of this is Gerrymandering voting districts

Problems of Pattern This “problem” relates to the limitations of some statistics (e.g., LQ, CL, Lorenz Curves) Fortunately there are many other types of statistics that can be used in addition to or instead of these limited measured (e.g., pattern metrics)

For Monday Read pages