Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.

Slides:



Advertisements
Similar presentations
Quantitative Methods in HPELS 440:210
Advertisements

Measures of Dispersion
Measures of Dispersion or Measures of Variability
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Slides by JOHN LOUCKS St. Edward’s University.
Edpsy 511 Homework 1: Due 2/6.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Measures of Central Tendency
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
Chapter 2 Describing Data with Numerical Measurements
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 1 Overview and Descriptive Statistics.
Statistics Recording the results from our studies.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
CHS Statistics 2.5: Measures of Spread
Descriptive Statistics: Numerical Methods
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Describing distributions with numbers
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Page 1 Chapter 3 Variability. Page 2 Central tendency tells us about the similarity between scores Variability tells us about the differences between.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
STATISTICS “CALCULATING DESCRIPTIVE STATISTICS –Measures of Dispersion” 4.0 Measures of Dispersion.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
DATA ANALYSIS n Measures of Central Tendency F MEAN F MODE F MEDIAN.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Unit 3 Lesson 2 (4.2) Numerical Methods for Describing Data
Agenda Descriptive Statistics Measures of Spread - Variability.
Measures of Dispersion. Introduction Measures of central tendency are incomplete and need to be paired with measures of dispersion Measures of dispersion.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Numerical Measures of Variability
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
1 Measures of Center. 2 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely.
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
Measures of variability: understanding the complexity of natural phenomena.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Summary Statistics: Measures of Location and Dispersion.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
LIS 570 Summarising and presenting data - Univariate analysis.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Descriptive Statistics(Summary and Variability measures)
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Chapter 1 Lesson 7 Variance and Standard Deviation.
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores.
CHAPTER 1 Exploring Data
Midrange (rarely used)
Numerical Descriptive Measures
Numerical Descriptive Measures
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Numerical Descriptive Measures
The Mean Variance Standard Deviation and Z-Scores
The Mean Variance Standard Deviation and Z-Scores
Presentation transcript:

Chapter 5: Measures of Dispersion

Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents differ from each other. (spread is another interchangeable term) Dispersion helps us to measure the extent to which the values or responses deviate from each other. The tools used to measure the amount of variability or spread in the categories or values of a variable are called the measures of dispersion or measures of variability. Example: Which one of these data sets has more variability? A: 22, 24, 25, 28, 30B: 1, 5, 12, 20, 32 Data set B is clearly more spread out.

Measures of Dispersion for Numerical Variables Dispersion in the case of a numerical variable is the degree to which the numbers in a distribution vary from the typical, or average value. Three typical methods to measure dispersion for numerical variables are: – Range – Interquartile range – Standard deviation

Range The range is the difference between the highest value and lowest value in a data set. It is the simplest way to obtain information about dispersion. Daily temperature is an example of range as it is usually given in the form of the highest and lowest values.

Example: Compute the range for the following set of data: 12, 3, 17, 21, 11, 18, 20, 19, 6, 15 Range: 21 – 3 = 18 There are a few disadvantages to using range: 1)The range does not give any information about the distribution of the observations within its two endpoint values. 2)The range can be skewed by outliers, just as the mean can be. For example, add the value 200 to the above data set. Now the range is 197. That is not representative of the data.

Interquartile Range Interquartile range is the range of the middle 50% of a distribution after the bottom 25% and top 25% have been eliminated. Interquartile range is more resistant to outliers than the range. In order to calculate the IQR, we first have to find the quartiles of the distribution, which are the values that divide the distribution into four groups of roughly the same size.

Steps to calculate quartiles 1) Put the data in numerical order and find the median (this also happens to be the second quartile) 2) Find the median of the values whose position in the ordered list is to the left of the median. This value is the first quartile. 3) Find the median of the values whose position in the ordered list is to the right of the median. This value is the third quartile. Note: When the number of observations is odd, don’t include the median value in the calculations in steps 2 and 3.

If the position of a quartile is between two numbers, average those numbers as you would to find a median. Subtract the first quartile from the third quartile to get the IQR. When dealing with frequency distributions, there are useful formulas to help us figure out the position of the quartiles. If the position is a whole number, then the quartile lies on that exact one observation. If the position is a decimal, then the quartile is the average.

Example: Find the interquartile range. 5, 6, 5, 10, 11, 16, 15, 13 Order the data:5, 5, 6, 10, 11, 13, 15, 16 The median is 10.5 Q1 is 5.5. Q3 is 14 IQR=14-5.5=8.5

Example: Find the IQR for the following data.

Standard Deviation Standard deviation measures the variability in a distribution using the squared deviations from the mean. A deviation is the difference between the score and the mean of the data set. Deviations can be positive, negative, or 0.

Example: Find the deviations and the sum of the deviations for the following data set: 25, 26, 30. Since the sum of the deviations from the mean is always zero, their average will always be zero. We need a different way to measure the average deviations from the mean. To counteract that problem, we will square each deviation and then find the average of the squared deviations.

The Variance The variance is the average of the squared deviations from the mean. Standard deviation is the square root of the variance. The variance (and as a result standard deviation) is calculated differently for the population and the sample.

Notation

Steps to find the variance. 1)Find the sum of your data set. 2)Find the mean. 3)Calculate the deviations. 4)Square the deviations. 5)Sum the squared deviations. 6)Divide the sum of the squared deviations by n if working with the population or n-1 if working with the sample. To find the standard deviation, take the square root of step 6.

Example: Find the variance and the standard deviation for the sample data: 25, 26, 30.

You try. Find the variance and standard deviation for the population data: 2, 4, 8, 14.

Short-cut Formula for Variance and SD There are two quantities in the short-cut formula that we need to clarify.

Note: For the population variance and standard deviation, divide by n stead of n-1

Let’s retry the previous example of the sample of 25, 26, and 30. We know the variance should come out to be 7 and the sd 2.65.

Now try the example with 2, 4, 8, and 14. Remember we are dealing with a population!