Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Chapter 2 Exploring Data with Graphs and Numerical Summaries
Descriptive Measures MARE 250 Dr. Jason Turner.
Descriptive Statistics
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Displaying & Summarizing Quantitative Data
It’s an outliar!.  Similar to a bar graph but uses data that is measured.
Descriptive Statistics
Numerical Representation of Data Part 3 – Measure of Position
Section 2.5 Measures of Position.
Statistics: Use Graphs to Show Data Box Plots.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Section 2.5 Measures of Position Larson/Farber 4th ed.
Descriptive Statistics
Chapter Descriptive Statistics 1 of © 2012 Pearson Education, Inc. All rights reserved.
Measures of Central Tendency
Graphing Quantitative Data Sets
Descriptive Statistics
Section 2.5 Measures of Position.
Section 2.5 Measures of Position Larson/Farber 4th ed. 1.
Descriptive Statistics
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Chapter 6 1. Chebychev’s Theorem The portion of any data set lying within k standard deviations (k > 1) of the mean is at least: 2 k = 2: In any data.
Objectives Describe the central tendency of a data set.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
1 of 96 Chapter Outline 2.1 Frequency Distributions and Their Graphs 2.2 More Graphs and Displays 2.3 Measures of Central Tendency 2.4 Measures of Variation.
Statistics Numerical Representation of Data Part 1 – Measures of Central Tendency.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter Descriptive Statistics 2.
Descriptive Statistics. Frequency Distributions and Their Graphs What you should learn: How to construct a frequency distribution including midpoints,
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter Descriptive Statistics 2.
Chapter 2 Section 5 Notes Coach Bridges
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Measures of Central Tendency A statistic is a characteristic or measure obtained by using the data values from a sample. A parameter is a characteristic.
Section 2.3 Measures of Central Tendency 1 of 149 © 2012 Pearson Education, Inc. All rights reserved.
Measures of Position. Determine the quartiles of a data set Determine the interquartile range of a data set Create a box-and-whisker plot Interpret.
Chapter Descriptive Statistics 1 of © 2012 Pearson Education, Inc. All rights reserved.
2.3 Measures of Central Tendency Coach Bridges NOTES.
1 Never let time idle away aimlessly.. 2 Chapters 1, 2: Turning Data into Information Types of data Displaying distributions Describing distributions.
2.3: Measures of Central Tendency Chapter 2: Descriptive Statistics Objectives... Determine the mean, median, and mode of a population and of a sample.
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
Section 2.3 Measures of Central Tendency. Section 2.3 Objectives Determine the mean, median, and mode of a population and of a sample (and which to use.
Section 2.5 Measures of Position.
Chapter Descriptive Statistics 1 of © 2012 Pearson Education, Inc. All rights reserved.
Chapter Descriptive Statistics 1 of © 2012 Pearson Education, Inc. All rights reserved.
Standard Deviation by Hand 9 – Step Process See 4.2 WS.
Chapter 4 Measures of Central Tendency Measures of Variation Measures of Position Dot Plots Stem-and-Leaf Histograms.
Do Now Identify the w’s and specify each variable as categorical or quantitative. Scientists at a major pharmaceutical firm conducted an experiment to.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Frequency Polygons. Graphs of Frequency Distributions Frequency Polygon A line graph that emphasizes the continuous change in frequencies. Use the same.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Measures of Center.
Algebra II Descriptive Statistics 1 Larson/Farber 4th ed.
Statistics Test # 2 Review
Chapter 2 Descriptive Statistics.
2 Chapter Descriptive Statistics © 2012 Pearson Education, Inc.
Descriptive Statistics
Statistics Unit Test Review
Laugh, and the world laughs with you. Weep and you weep alone
2 Chapter Descriptive Statistics © 2012 Pearson Education, Inc.
2 Chapter Descriptive Statistics © 2012 Pearson Education, Inc.
2 Chapter Descriptive Statistics © 2012 Pearson Education, Inc.
Chapter 2 Descriptive Statistics.
Descriptive Statistics
Displaying and Summarizing Quantitative Data
Honors Statistics Review Chapters 4 - 5
Descriptive Statistics
What is Statistics? Statistics The science of collecting, organizing, analyzing, and interpreting data in order to make decisions. Larson/Farber 4th ed.
Presentation transcript:

Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position

Frequency Histogram A bar graph that represents the frequency distribution. The horizontal scale is quantitative and measures the data values. The vertical scale measures the frequencies of the classes. Consecutive bars must touch. data values frequency

Solution: Frequency Histogram (using class boundaries) You can see that more than half of the GPS navigators are priced below $

Relative Frequency Histogram Has the same shape and the same horizontal scale as the corresponding frequency histogram. The vertical scale measures the relative frequencies, not frequencies (Percents). data values relative frequency

Solution: Relative Frequency Histogram From this graph you can see that 27% of GPS navigators are priced between $ and $

Graphing Quantitative Data Sets Stem-and-leaf plot Each number is separated into a stem and a leaf. Similar to a histogram. Still contains original data values. Data: 21, 25, 25, 26, 27, 28, 30, 36, 36,

Example: Constructing a Stem-and-Leaf Plot The following are the numbers of text messages sent last week by the cellular phone users on one floor of a college dormitory. Display the data in a stem-and-leaf plot

Solution: Constructing a Stem-and-Leaf Plot The data entries go from a low of 78 to a high of 159. Use the rightmost digit as the leaf.  For instance, 78 = 7 | 8 and 159 = 15 | 9 List the stems, 7 to 15, to the left of a vertical line. For each data entry, list a leaf to the right of its stem

Solution: Constructing a Stem-and-Leaf Plot Include a key to identify the values of the data. From the display, you can conclude that more than 50% of the cellular phone users sent between 110 and 130 text messages.

CW1: WS 4.1 – Stem & Leaf Plot

Graphing Quantitative Data Sets Dot plot Each data entry is plotted, using a point, above a horizontal axis. Data: 21, 25, 25, 26, 27, 28, 30, 36, 36,

Example: Constructing a Dot Plot Use a dot plot organize the text messaging data. So that each data entry is included in the dot plot, the horizontal axis should include numbers between 70 and 160. To represent a data entry, plot a point above the entry's position on the axis. If an entry is repeated, plot another point above the previous point

Solution: Constructing a Dot Plot From the dot plot, you can see that most values cluster between 105 and 148 and the value that occurs the most is 126. You can also see that 78 is an unusual data value

CW2: WS 4.1- Stem & Leaf Plot Create Dot Plots per Directions given in class

Shapes - Unimodal

Multimodal 3 or more Peaks Bimodal 2 Peaks

Uniform Distribution All entries or classes in the distribution have approximately equal frequencies.

Symmetric Distribution A vertical line can be drawn through the middle of a graph of the distribution and the resulting halves are approximately mirror images.

Tails – Skewed to Thinner End Skewed Left Mean is to the LEFT of the Median Skewed Right Mean is to the RIGHT of the Median

GAPS Multiple Modes Different Sources?

Outliers – Informative or Error?

Practice: Describe the Following Shapes We will do this as a group Nothing is required on paper

Median – A Measure of Center The value that lies in the middle of the data when the data set is ordered. Measures the center of an ordered data set by dividing it into two equal parts. If the data set has an  odd number of entries: median is the middle data entry.  even number of entries: median is the mean of the two middle data entries.

Example: Finding the Median The prices (in dollars) for a sample of roundtrip flights from Chicago, Illinois to Cancun, Mexico are listed. Find the median of the flight prices © 2012 Pearson Education, Inc. All rights reserved.

Solution: Finding the Median First order the data There are seven entries (an odd number), the median is the middle, or fourth, data entry. The median price of the flights is $427. © 2012 Pearson Education, Inc. All rights reserved.

Example: Finding the Median The flight priced at $432 is no longer available. What is the median price of the remaining flights? © 2012 Pearson Education, Inc. All rights reserved.

Solution: Finding the Median First order the data There are six entries (an even number), the median is the mean of the two middle entries. The median price of the flights is $412. © 2012 Pearson Education, Inc. All rights reserved.

CW 3 – WS Median

Quartiles – Measure of Spread Fractiles are numbers that partition (divide) an ordered data set into equal parts. Quartiles approximately divide an ordered data set into four equal parts.  First quartile, Q 1 : About one quarter of the data fall on or below Q 1.  Second quartile, Q 2 : About one half of the data fall on or below Q 2 (median).  Third quartile, Q 3 : About three quarters of the data fall on or below Q 3. © 2012 Pearson Education, Inc. All rights reserved.

Example: Finding Quartiles The number of nuclear power plants in the top 15 nuclear power-producing countries in the world are listed. Find the first, second, and third quartiles of the data set Solution: Q 2 divides the data set into two halves Q2Q2 Lower half Upper half © 2012 Pearson Education, Inc. All rights reserved.

Solution: Finding Quartiles The first and third quartiles are the medians of the lower and upper halves of the data set Q2Q2 Lower half Upper half Q1Q1 Q3Q3 About one fourth of the countries have 10 or fewer nuclear power plants; about one half have 18 or fewer; and about three fourths have 31 or fewer. © 2012 Pearson Education, Inc. All rights reserved.

Interquartile Range Interquartile Range (IQR) The difference between the third and first quartiles. IQR = Q 3 – Q 1 © 2012 Pearson Education, Inc. All rights reserved.

Example: Finding the Interquartile Range Find the interquartile range of the data set Recall Q 1 = 10, Q 2 = 18, and Q 3 = 31 Solution: IQR = Q 3 – Q 1 = 31 – 10 = 21 The number of power plants in the middle portion of the data set vary by at most 21. © 2012 Pearson Education, Inc. All rights reserved.

Box-and-Whisker Plot Exploratory data analysis tool. Highlights important features of a data set. Requires (five-number summary) © 2012 Pearson Education, Inc. All rights reserved.

Drawing a Box-and-Whisker Plot 1.Find the five-number summary of the data set. 2.Construct a horizontal scale that spans the range of the data. 3.Plot the five numbers above the horizontal scale. 4.Draw a box above the horizontal scale from Q 1 to Q 3 and draw a vertical line in the box at Q 2. 5.Draw whiskers from the box to the minimum and maximum entries. Whisker Maximum entry Minimum entry Box Median, Q 2 Q3Q3 Q1Q1 © 2012 Pearson Education, Inc. All rights reserved.

Example: Drawing a Box-and-Whisker Plot Draw a box-and-whisker plot that represents the data set Min = 6, Q 1 = 10, Q 2 = 18, Q 3 = 31, Max = 104, Solution: About half the data values are between 10 and 31. By looking at the length of the right whisker, you can conclude 104 is a possible outlier. © 2012 Pearson Education, Inc. All rights reserved.

Range The difference between the maximum and minimum data entries in the set. The data must be quantitative. Range = (Max. data entry) – (Min. data entry) © 2012 Pearson Education, Inc. All rights reserved.

Example: Finding the Range A corporation hired 10 graduates. The starting salaries for each graduate are shown. Find the range of the starting salaries. Starting salaries (1000s of dollars) © 2012 Pearson Education, Inc. All rights reserved.

Solution: Finding the Range Ordering the data helps to find the least and greatest salaries Range = (Max. salary) – (Min. salary) = 47 – 37 = 10 The range of starting salaries is 10 or $10,000. minimum maximum © 2012 Pearson Education, Inc. All rights reserved.

CW 4 – WS Quartiles

CW 5 – WS 4.4 – Box & Wiskers

Median and IQR Based on POSITION of data Appropriate for any data

Mean and Standard Deviation Based on Size of data Not as effective when data is strongly skewed or outliers are present

CW 6 – WS 4.5 – Mean (The Basics)

Mean Mean (average) The sum of all the data entries divided by the number of entries. Sigma notation: Σx = add all of the data entries (x) in the data set. Population mean: Sample mean: © 2012 Pearson Education, Inc. All rights reserved.

Mean – Notation for this Class & Book Our book uses The book uses y to represent variables we want to summarize, model or predict

Example: Finding a Sample Mean The prices (in dollars) for a sample of round-trip flights from Chicago, Illinois to Cancun, Mexico are listed. What is the mean price of the flights? © 2012 Pearson Education, Inc. All rights reserved.

Solution: Finding a Sample Mean The sum of the flight prices is Σx = = 3695 To find the mean price, divide the sum of the prices by the number of prices in the sample The mean price of the flights is about $ © 2012 Pearson Education, Inc. All rights reserved.

Comparing the Mean and Median Both describe a typical entry of a data set. Advantage of using the mean:  The mean is a reliable measure because it takes into account every entry of a data set. Disadvantage of using the mean:  Greatly affected by outliers (a data entry that is far removed from the other entries in the data set). © 2012 Pearson Education, Inc. All rights reserved.

Example: Comparing the Mean and Median Find the mean and median of the sample ages of a class shown. Which best describes a typical entry of this data set? Are there any outliers? Ages in a class © 2012 Pearson Education, Inc. All rights reserved.

Solution: Comparing the Mean and Median Mean: Median: Ages in a class © 2012 Pearson Education, Inc. All rights reserved.

Solution: Comparing the Mean and Median Mean ≈ 23.8 years Median = 21.5 years The mean takes every entry into account, but is influenced by the outlier of 65. The median also takes every entry into account, and it is not affected by the outlier. © 2012 Pearson Education, Inc. All rights reserved.

5 Number Summary  Min Q1Q1  Median Q3Q3  Max  Q 3  Median  Q 1  Min

See 4.6 WS 8 Steps to Find the Standard Deviation Okay, technically 9 Steps, but step 6 is a do over.