We think you have liked this presentation. If you wish to download it, please recommend it to your friends in any social system. Share buttons are a little bit lower. Thank you!
Presentation is loading. Please wait.
Published byCamren Stell
Modified about 1 year ago
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.5 Using Measures of Position to Describe Variability
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 3 The th percentile is a value such that percent of the observations fall below or at that value. Percentile
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 4 Quartiles Figure 2.14 The Quartiles Split the Distribution Into Four Parts. 25% is below the first quartile (Q1), 25% is between the first quartile and the second quartile (the median, Q2), 25% is between the second quartile and the third quartile (Q3), and 25% is above the third quartile. Question: Why is the second quartile also the median?
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 5 SUMMARY: Finding Quartiles Arrange the data in order. Consider the median. This is the second quartile, Q2. Consider the lower half of the observations (excluding the median itself if n is odd). The median of these observations is the first quartile, Q1. Consider the upper half of the observations (excluding the median itself if n is odd). Their median is the third quartile, Q3.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 6 Consider the sodium values for the 20 breakfast cereals. What are the quartiles for the 20 cereal sodium values? From Table 2.3, the sodium values, in ascending order, are: The median of the 20 values is the average of the 10th and 11 th observations, 180 and 180, which is Q2 = 180 mg. The first quartile Q1 is the median of the 10 smallest observations (in the top row), which is the average of 130 and 140, Q1 = 135mg. The third quartile Q3 is the median of the 10 largest observations (in the bottom row), which is the average of 200 and 210, Q3 = 205 mg. Example: Cereal Sodium Data
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 7 The interquartile range is the distance between the third quartile and first quartile: IQR = Q3 Q1 IQR gives spread of middle 50% of the data The Interquartile Range (IQR) In Words: If the interquartile range of U.S. music teacher salaries equals $16,000, this means that for the middle 50% of the distribution of salaries, $16,000 is the distance between the largest and smallest salaries.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 8 Examining the data for unusual observations, such as outliers, is important in any statistical analysis. Is there a formula for flagging an observation as potentially being an outlier? The 1.5 x IQR Criterion for Identifying Potential Outliers An observation is a potential outlier if it falls more than 1.5 x IQR below the first quartile or more than 1.5 x IQR above the third quartile. Detecting Potential Outliers
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 9 The 5-number summary is the basis of a graphical display called the box plot, and consists of Minimum value First Quartile Median Third Quartile Maximum value 5-Number Summary of Positions
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 10 A box goes from Q1 to Q3. A line is drawn inside the box at the median. A line goes from the lower end of the box to the smallest observation that is not a potential outlier and from the upper end of the box to the largest observation that is not a potential outlier. The potential outliers are shown separately. SUMMARY: Constructing a Box Plot
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 11 Figure 2.15 shows a box plot for the sodium values. Labels are also given for the five-number summary of positions. Example: Boxplot for Cereal Sodium Data Figure 2.15 Box Plot and Five-Number Summary for 20 Breakfast Cereal Sodium Values. The central box contains the middle 50% of the data. The line in the box marks the median. Whiskers extend from the box to the smallest and largest observations, which are not identified as potential outliers. Potential outliers are marked separately. Question: Why is the left whisker drawn down only to 50 rather than to 0?
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 12 Comparing Distributions A box plot does not portray certain features of a distribution, such as distinct mounds and possible gaps, as clearly as does a histogram. Box plots are useful for identifying potential outliers. Figure 2.16 Box Plots of Male and Female College Student Heights. The box plots use the same scale for height. Question: What are approximate values of the quartiles for the two groups?
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 13 The z-score also identifies position and potential outliers. The z-score for an observation is the number of standard deviations that it falls from the mean. A positive z-score indicates the observation is above the mean. A negative z- score indicates the observation is below the mean. For sample data, the z -score is calculated as: An observation from a bell-shaped distribution is a potential outlier if its z-score +3 (3 standard deviation criterion). Z-Score
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 5- 1.
Chapter 3, Numerical Descriptive Measures Data analysis is objective –Should report the summary measures that best meet the assumptions about the data.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
AP Stat Day Days until AP Exam Describing Data with Numbers Box and Whisker Plots Variance and Standard Deviation.
Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights.
Chapter 1 Section 1.2 Describing Distributions with Numbers.
Agenda Informationer –Bedst på Nettet –Forslag til eksamensopg. Opsamling fra sidst –Spørgsmålstyper –Svaralternativer –Skalaer –Gruppeopgaven Deskriptiv.
Introduction Our daily lives often involve a great deal of data, or numbers in context. It is important to understand how data is found, what it means,
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edwards University © 2002 South-Western /Thomson Learning.
Agenda Informationer –G-mails til Klaus –Brug supplerende litt. –Åben Excel (Opsamling fra sidst) Deskriptiv statistik –Teori –Små øvelser i Excel Videre.
Chapter 4 Exploring Numerical Data Objectives Students will be able to: 1)graph the distribution of a numerical variable 2) calculate summary statistics.
Probability and Statistics Representation of Data Measures of Center for Data Simple Analysis of Data.
Copyright © 2010 Pearson Education, Inc. Slide Suppose that 30% of the subscribers to a cable television service watch the shopping channel at least.
Rubric Unit Plan Univariate Bivariate Examples Resources Curriculum Statistically Thinking A study of univariate and bivariate statistics.
Copyright © 2010 Pearson Education, Inc. Slide
Looking at Data-Distributions 1.1-Displaying Distributions with Graphs.
Chapter 3 - Part A Descriptive Statistics: Numerical Methods Measures of Location Measures of Variability x x % %
Descriptive Statistics (Part 2) Standardized Data Standardized Data Percentiles and Quartiles Percentiles and Quartiles Box Plots Box Plots Grouped Data.
UNIT – 1 Data Preprocessing. Data Preprocessing Learning Objectives Understand why preprocess the data. Understand how to clean the data. Understand how.
1 Study question: distribution of IQ The IQ has an approximately normal distribution, with a mean of 100 and a standard deviation of 15. If 1000 people.
Box plot Edexcel S1 Mathematics 2003 (or box and whisker plot)
AP Statistics More on Describing Distributions. On the Docket… Homework Questions Notes: Numerical measures of center and spread Boxplots Finish Life.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edwards University.
A bar chart of a quantitative variable with only a few categories (called a discrete variable) communicates the relative number of subjects with each of.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
BUS 220: ELEMENTARY STATISTICS Chapter 4: Numerical Measures.
Chapter Four Making and Describing Graphs of Quantitative Variables Center, Shape. and Spread.
FPP 3-6 Exploratory Data Analysis: One Variable. Plan of attack Distinguish different types of variables Summarize data numerically Summarize data graphically.
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., Chapter Three Describing Data: Measures of Central Tendency GOALS When you have completed.
CCGPS Coordinate Algebra UNIT QUESTION: How can I represent, compare, and interpret sets of data? Standard: MCC9-12.S.ID.1-3, 5-9, SP.5 Todays Question:
© 2016 SlidePlayer.com Inc. All rights reserved.