Understanding and Comparing Distributions

Slides:



Advertisements
Similar presentations
Copyright © 2010 Pearson Education, Inc. Slide
Advertisements

F IND THE S TANDARD DEVIATION FOR THE FOLLOWING D ATA OF GPA: 4, 3, 2, 3.5,
Understanding and Comparing Distributions
Understanding and Comparing Distributions 30 min.
CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Slide 3- 1 Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 4- 1.
Chapter 4: Displaying Quantitative Data
Chapter 5: Understanding and Comparing Distributions
Understanding and Comparing Distributions
Understanding and Comparing Distributions
Copyright © 2010, 2007, 2004 Pearson Education, Inc.
CHAPTER 7: Exploring Data: Part I Review
Copyright © 2009 Pearson Education, Inc. Chapter 5 Understanding and Comparing Distributions.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
1 Chapter 4: Describing Distributions 4.1Graphs: good and bad 4.2Displaying distributions with graphs 4.3Describing distributions with numbers.
Slide 4-1 Copyright © 2004 Pearson Education, Inc. Dealing With a Lot of Numbers… Summarizing the data will help us when we look at large sets of quantitative.
Chapter 4 Displaying Quantitative Data. Quantitative variables Quantitative variables- record measurements or amounts of something. Must have units or.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Chapter 5 Understanding and Comparing Distributions Math2200.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Understanding and Comparing Distributions.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Understanding and Comparing Distributions
Displaying Quantitative Data AP STATS NHS Mr. Unruh.
Understanding and Comparing Distributions Ch. 5 Day 1 Notes AP Statistics EQ: How do we make boxplots and why? How do we compare distributions?
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Understanding & Comparing Distributions Chapter 5.
Copyright © 2009 Pearson Education, Inc. Slide 4- 1 Practice – Ch4 #26: A meteorologist preparing a talk about global warming compiled a list of weekly.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 6- 1.
AP Statistics. Chapter 1 Think – Where are you going, and why? Show – Calculate and display. Tell – What have you learned? Without this step, you’re never.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 4- 1.
Chapter 2 The Normal Distributions. Section 2.1 Density curves and the normal distributions.
UNIT ONE REVIEW Exploring Data.
Exploratory Data Analysis
Understanding and Comparing Distributions
Chapter 4 Understanding and Comparing Distributions.
1.3 Measuring Center & Spread, The Five Number Summary & Boxplots
Chapter 5 : Describing Distributions Numerically I
Describing Distributions Numerically
Understanding and Comparing Distributions
Understanding and Comparing Distributions
Objective: Given a data set, compute measures of center and spread.
Describing Distributions Numerically
AP Statistics CH. 4 Displaying Quantitative Data
How could data be used in an EPQ?
Displaying Quantitative Data
Warm – Up 1. Find the IQR and Variance of the following data set. Are there any outliers in the data set? How do you know? Grades: 86, 78, 97, 96, 89,
The histograms represent the distribution of five different data sets, each containing 28 integers from 1 through 7. The horizontal and vertical scales.
Understanding and Comparing Distributions
Displaying and Summarizing Quantitative Data
DAY 3 Sections 1.2 and 1.3.
Chapter 5: Describing Distributions Numerically
A Modern View of the Data
Histograms: Earthquake Magnitudes
Warmup What five numbers need to be mentioned in the complete sentence you write when the data distribution is skewed?
Displaying and Summarizing Quantitative Data
Understanding and Comparing Distributions
Drill Construct a Histogram to represent the data of test score averages in 20 cities using 5 Bars. Test Averages {62, 68, 72, 58, 83, 91, 70, 82, 68,
Understanding and Comparing Distributions
Displaying and Summarizing Quantitative Data
Displaying and Summarizing Quantitative Data
Organizing Data AP Stats Chapter 1.
AP Statistics Day 4 Objective: The students will be able to describe distributions with numbers and create and interpret boxplots.
Understanding and Comparing Distributions
Describing Distributions Numerically
Honors Statistics Review Chapters 4 - 5
Understanding and Comparing Distributions
Warm-Up 4 87, 90, 95, 78, 75, 90, 92, 90, 80, 82, 77, 81, 95, Find the 5-Number Summary for the data 2. Address every type of measure of spread.
Describing Location in a Distribution
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Understanding and Comparing Distributions Chapter 5 Understanding and Comparing Distributions

Comparing Groups It is almost always more interesting to compare groups. With histograms, note the shapes, centers, and spreads of the two distributions. What does this graphical display tell you? A: Both distributions are skewed to the right, but summer is skewed more strongly. Summer has a much lower center than winter. The spread of winter is much less variable than summer.

Comparing Groups (cont.) Boxplots offer an ideal balance of information and simplicity, hiding the details while displaying the overall summary information. We often plot them side by side for groups or categories we wish to compare. What do these boxplots tell you? The asterisk indicates outliers that are 3 or more IQR’s from the quartiles. This is a common convention and will be used throughout the text. However, the AP test has yet to use this notation.

What About Outliers? If there are any clear outliers and you are reporting the mean and standard deviation, report them with the outliers present and with the outliers removed. The differences may be quite revealing. Note: The median and IQR are not likely to be affected by the outliers.

Timeplots: Order, Please! For some data sets, we are interested in how the data behave over time. In these cases, we construct timeplots of the data.

A time plot of interest

*Re-expressing Skewed Data to Improve Symmetry When the data are skewed it can be hard to summarize them simply with a center and spread, and hard to decide whether the most extreme values are outliers or just part of a stretched out tail. How can we say anything useful about such data?

*Re-expressing Skewed Data to Improve Symmetry (cont.) One way to make a skewed distribution more symmetric is to re-express or transform the data by applying a simple function (e.g., logarithmic function). Note the change in skewness from the raw data (previous slide) to the transformed data (right):

What Can Go Wrong? (cont.) Avoid inconsistent scales, either within the display or when comparing two displays. Label clearly so a reader knows what the plot displays. Good intentions, bad plot:

What Can Go Wrong? (cont.) Beware of outliers Be careful when comparing groups that have very different spreads. Consider these side-by-side boxplots of cotinine levels: Re-express . . .

What have we learned? We’ve learned the value of comparing data groups and looking for patterns among groups and over time. We’ve seen that boxplots are very effective for comparing groups graphically. We’ve experienced the value of identifying and investigating outliers. We’ve graphed data that has been measured over time against a time axis and looked for long-term trends both by eye and with a data smoother.

AP Tips When comparing distributions, make sure to compare the center, shape and spread. All three are required for full credit. When comparing center and spread, use comparison words. Examples: “The mean was 34 mpg in group A, while the group B mean was 30 mpg.” (NOT full credit) “Group A had a larger mean of 34 mpg compared to group B’s mean of 30 mpg.” (full credit)