Excursions in Modern Mathematics, 7e: 14.4 - 2Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.

Slides:



Advertisements
Similar presentations
Probabilistic & Statistical Techniques
Advertisements

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
CHAPTER 1 Exploring Data
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 16 Mathematics of Normal Distributions 16.1Approximately Normal.
Measures of Dispersion or Measures of Variability
1.2 Describing Distributions with Numbers. Center and spread are the most basic descriptions of what a data set “looks like.” They are intuitively meant.
§ 14.3 Numerical Summaries of Data
Describing Distributions Numerically
Measures of Central Tendency
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Describing distributions with numbers
Objectives 1.2 Describing distributions with numbers
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables.
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
Chapter 3 Descriptive Measures
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
Slide 1 Statistics Workshop Tutorial 6 Measures of Relative Standing Exploratory Data Analysis.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation As A Ruler And The Normal Model.
Describing distributions with numbers
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Objectives The student will be able to: find the variance of a data set. find the standard deviation of a data set.
Numerical Measures of Variability
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 5, Slide 1 Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Measures of variability: understanding the complexity of natural phenomena.
Numerical descriptors BPS chapter 2 © 2006 W.H. Freeman and Company.
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Numerical descriptors BPS chapter 2 © 2006 W.H. Freeman and Company.
Describing Distributions with Numbers Chapter 2. What we will do We are continuing our exploration of data. In the last chapter we graphically depicted.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 16 Mathematics of Normal Distributions 16.1Approximately Normal.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
Numerical descriptions of distributions
Slide 3-1 Copyright © 2008 Pearson Education, Inc. Chapter 3 Descriptive Measures.
Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Copyright © 2016 Brooks/Cole Cengage Learning Intro to Statistics Part II Descriptive Statistics Intro to Statistics Part II Descriptive Statistics Ernesto.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 6- 1.
CHAPTER 4 NUMERICAL METHODS FOR DESCRIBING DATA What trends can be determined from individual data sets?
Descriptive Statistics ( )
Statistics for Managers Using Microsoft® Excel 5th Edition
Please take out Sec HW It is worth 20 points (2 pts
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Summary (Week 1) Categorical vs. Quantitative Variables
The Standard Deviation as a Ruler and the Normal Model
Summary (Week 1) Categorical vs. Quantitative Variables
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 14 Descriptive Statistics 14.1Graphical Descriptions of Data 14.2Variables 14.3 Numerical Summaries 14.4Measures of Spread

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. An obvious approach to describing the spread of a data set is to take the difference between the highest and lowest values of the data. This difference is called the range of the data set and usually denoted by R. Thus, R = Max – Min. The range of a data set is a useful piece of information when there are no outliers in the data. In the presence of outliers the range tells a distorted story. The Range

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. For example, the range of the test scores in the Stat 101 exam is 24 – 1 = 23 points, an indication of a big spread within the scores (i.e., a very heterogeneous group of students). True enough, but if we discount the two outliers, the remaining 73 test scores would have a much smaller range of 16 – 6 = 10 points. The Range

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. To eliminate the possible distortion caused by outliers, a common practice when measuring the spread of a data set is to use the interquartile range, denoted by the acronym IQR. The interquartile range is the difference between the third quartile and the first quartile (IQR = Q3 – Q1), and it tells us how spread out the middle 50% of the data values are. For many types of real-world data, the interquartile range is a useful measure of spread. The Interquartile Range

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The five-number summary for the 2007 SAT math scores was Min = 200 (yes, there were a few jokers who missed every question!), Q 1 = 430, M = 590, Max = 800 (there are still a few geniuses around!). It follows that the 2007 SAT math scores had a range of 600 points (800 – 200 = 600) and an interquartile range of 160 points (IQR = 590 – 430 = 160). Example SAT Math Scores: Part 3

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The most important and most commonly used measure of spread for a data set is the standard deviation. The key concept for understanding the standard deviation is the concept of deviation from the mean. If A is the average of the data set and x is an arbitrary data value, the difference x – A is x’s deviation from the mean. The deviations from the mean tell us how “far” the data values are from the average value of the data. The idea is to use this information to figure out how spread out the data is. Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The deviations from the mean are themselves a data set, which we would like to summarize. One way would be to average them, but if we do that, the negative deviations and the positive deviations will always cancel each other out so that we end up with an average of 0. This, of course, makes the average useless in this case. The cancellation of positive and negative deviations can be avoided by squaring each of the deviations. Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The squared deviations are never negative, and if we average them out, we get an important measure of spread called the variance, denoted by V. Finally, we take the square root of the variance and get the standard deviation, denoted by the Greek letter  (and sometimes by the acronym SD). The following is an outline of the definition of the standard deviation of a data set. Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. ■ Let A denote the mean of the data set. For each number x in the data set, compute its deviation from the mean (x – A) and square each of these numbers. These numbers are called the squared deviations. ■ Find the average of the squared deviations. This number is called the variance V. ■ The standard deviation is the square root of the variance THE STANDARD DEVIATION OF A DATA SET

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. Over the course of the semester, Angela turned in all of her homework assignments. Her grades in the 10 assignments (sorted from lowest to highest) were 85, 86, 87, 88, 89, 91, 92, 93, 94, and 95. Our goal in this example is to calculate the standard deviation of this data set the old-fashioned way (i.e., doing our own grunt work). The first step is to find the mean A of the data set. It’s not hard to see that A = 90. Example 14.19Calculation of a SD

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The second step is to calculate the deviations from the mean and then the squared deviations. When we average the squared deviations, we get 11. This means that the variance is V = 11 and thus the standard deviation (rounded to one decimal place) is Example 14.19Calculation of a SD

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. It is clear from just a casual look at Angela’s homework scores that she was pretty consistent in her homework, never straying too much above or below her average score of 90 points. The standard deviation is, in effect, a way to measure this degree of consistency (or lack thereof). A small standard deviation tells us that the data are consistent and the spread of the data is small, as is the case with Angela’s homework scores. Interpreting the Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The ultimate in consistency within a data set is when all the data values are the same (like Angela’s friend Chloe, who got a 20 in every homework assignment). When this happens the standard deviation is 0. Interpreting the Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. On the other hand, when there is a lot of inconsistency within the data set, we are going to get a large standard deviation. This is illustrated by Angela’s other friend, Tiki, whose homework scores were 5, 15, 25, 35, 45, 55, 65, 75, 85, and 95. We would expect the standard deviation of this data set to be quite large–in fact, it is almost 29 points. Interpreting the Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The standard deviation is arguably the most important and frequently used measure of data spread.Yet it is not a particularly intuitive concept. Here are a few basic guidelines that recap our preceding discussion: Summary of the Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. ■ The standard deviation of a data set is measured in the same units as the original data. For example, if the data are points on a test, then the standard deviation is also given in points. Conversely, if the standard deviation is given in dollars, then we can conclude that the original data must have been money–some prices, salaries, or something like that. For sure, the data couldn’t have been test scores on an exam. Summary of the Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. ■ It is pointless to compare standard deviations of data sets that are given in different units. Even for data sets that are given in the same units–say, for example, test scores–the underlying scale should be the same. We should not try to compare standard deviations for SAT scores measured on a scale of 200–800 points with standard deviations of a set of homework assignments measured on a scale of 0–100 points. Summary of the Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. ■ For data sets that are based on the same underlying scale, a comparison of standard deviations can tell us something about the spread of the data. If the standard deviation is small, we can conclude that the data points are all bunched together– there is very little spread. As the standard deviation increases, we can conclude that the data points are beginning to spread out. Summary of the Standard Deviation

Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. The more spread out they are, the larger the standard deviation becomes. A standard deviation of 0, means that all data values are the same. Summary of the Standard Deviation As a measure of spread, the standard deviation is particularly useful for analyzing real-life data.