Section 2.1 Frequency Distributions and Their Graphs.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Descriptive Statistics
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
1 Chapter 1: Sampling and Descriptive Statistics.
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 1 & 3.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Numerical Representation of Data Part 3 – Measure of Position
Descriptive Statistics
12.3 – Measures of Dispersion
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics
Describing Data: Numerical
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 1 Exploring Data
Copyright © 2004 Pearson Education, Inc.
Descriptive Statistics
2 Chapter Descriptive Statistics 2012 Pearson Education, Inc.
Chapter 2 Summarizing and Graphing Data
Descriptive Statistics
MM207-Statistics Unit 2 Seminar-Descriptive Statistics Dr Bridgette Stevens AIM:BStevensKaplan (add me to your Buddy list) 1.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Statistics Chapter 9. Day 1 Unusual Episode MS133 Final Exam Scores
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Chapter 2 Describing Data.
Describing distributions with numbers
Lecture 3 Describing Data Using Numerical Measures.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
1 of 96 Chapter Outline 2.1 Frequency Distributions and Their Graphs 2.2 More Graphs and Displays 2.3 Measures of Central Tendency 2.4 Measures of Variation.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
Descriptive Statistics. Frequency Distributions and Their Graphs What you should learn: How to construct a frequency distribution including midpoints,
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Sect. 2-1 Frequency Distributions and Their graphs Objective SWBAT construct a frequency distribution including limits,boundaries, midpoints, relative.
+ CHAPTER 2 Descriptive Statistics SECTION 2.1 FREQUENCY DISTRIBUTIONS.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.
What is Statistics?. Statistics 4 Working with data 4 Collecting, analyzing, drawing conclusions.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Chapter 2 Summarizing and Graphing Data  Frequency Distributions  Histograms  Statistical Graphics such as stemplots, dotplots, boxplots, etc.  Boxplots.
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
ALL ABOUT THAT DATA UNIT 6 DATA. LAST PAGE OF BOOK: MEAN MEDIAN MODE RANGE FOLDABLE Mean.
The Normal Distributions.  1. Always plot your data ◦ Usually a histogram or stemplot  2. Look for the overall pattern ◦ Shape, center, spread, deviations.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Graphs. Types of Graphs  Bar Graphs  Pie Charts  Dotplots  Stem and Leaf Plots  Histograms  Box Plots  Scatter Plots  Normal Curves.
Descriptive Statistics
Exploratory Data Analysis
Descriptive Statistics
Statistical Reasoning
Chapter 3 Describing Data Using Numerical Measures
Chapter 2: Descriptive Statistics
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Descriptive Statistics
Presentation transcript:

Section 2.1 Frequency Distributions and Their Graphs

How to construct the following graphs from frequency distributions: frequency histograms relative frequency histograms frequency polygons ogives

Definition: A table that shows the classes or intervals of data entries with the number of entries in each class. Example: ClassFrequency

ClassƒBoundaries ClassƒMidpoint

Time on Phone minutes Relative frequency ClassFrequencyRelative Frequency Cumulative Frequency

Time on Phone minutes ClassƒMidpoint How to Construct a Frequency Polygon Plot the midpoint and frequency. Connect consecutive midpoints. Extend the frequency polygon to the axis by one class

Time on Phone minutes

ClassFrequencyRelative Frequency Cumulative Frequency Minutes Minutes on Phone Cumulative frequency

ClassFrequencyRelative Frequency Cumulative Relative Frequency Cumulative Relative Frequency Minutes Minutes on Phone

 In the texbook turn to page 31. Using the data set do the following:  Fill in the following frequency distribution:  On a single piece of graph paper construct:  A Histogram  A Frequency Polygon  An Ogive ClassFrequencyRelative F.Cumulative F.MidpointsBoundaries 0 – 9 10 – – – – – – 69

Section 2.2 More Graphs and Displays

quantitative How to graph and interpret quantitative data sets using stem-and-leaf plots and dot plots qualitative How to graph and interpret qualitative data sets using pie charts and Pareto charts paired data How to graph and interpret paired data sets (bivariate data) using scatter plots

STEM LEAF Key: 6 | 7 means Minutes Spent on the Phone

Stem Leaf st line digits nd line digits

Example: Suppose the following stem-and-leaf plot set represents the scores on a statistics quiz (out of 50). a) How many students took the quiz? b) How many students scored a perfect score of 50? c) What is the lowest score on the quiz? d) What score occurs the most frequently (the “mode”) e) What advantage(s) does a stem-and-leaf plot have compared to a histogram?

Example: Suppose the following stem-and-leaf plot set represents the scores on a statistics quiz (out of 50). a) How many students took the quiz? 31 b) How many students scored a perfect score of 50? 2 c) What is the lowest score on the quiz? 6 d) What score occurs the most frequently (the “mode”) 17 e) What advantage(s) does a stem-and-leaf plot have compared to a histogram? Retains the data values

 What is the highest grade a man earned on the quiz?  How many women were in the class?  What percent of the class is male?  If a passing grade is 35 or higher, what percent of the females passed?  Overall do you think women or men did better on this quiz?

 What is the highest grade a man earned on the quiz? 50  How many women were in the class? 12  What percent of the class is male? 19/31 = 61.3%  If a passing grade is 35 or higher, what percent of the females passed? 3/12 = 25%  Overall do you think women or men did better on this quiz? Men

Time Spent on Phone minutes Minutes Spent on the Phone Duplicates: 87, 99, 103, 105

The following is a dotplot for a different group of students taking the 50-point statistics quiz a) How many students took the quiz? b) How many students scored a perfect score of 50? c) What is the lowest score on the quiz? d) What score represents the mode?

The following is a dotplot for a different group of students taking the 50-point statistics quiz a) How many students took the quiz? 26 b) How many students scored a perfect score of 50? 0 c) What is the lowest score on the quiz? 4 d) What score represents the mode? 27

 A Bar chart is constructed by labeling each category of data on either the horizontal or vertical axis and the frequency (or relative frequency) on the other axis.  A Pareto chart is a bar graph whose bars are drawn in decreasing order

A pie chart is a circle divided into sectors. Each sector represents a category of data. The area of each sector is proportional to the frequency of the category.  What percentage of the M&M’s are green?  What color was the most common M&M color?  Suppose 17% of the M&Ms are blue. If you have a bag with 400 M&M’s. How many of them would be blue?

A pie chart is a circle divided into sectors. Each sector represents a category of data. The area of each sector is proportional to the frequency of the category.  What percentage of the M&M’s are green? ≈12%  What color was the most common M&M color? Orange  Suppose 17% of the M&Ms are blue. If you have a bag with 400 M&M’s. How many of them would be blue? 68 M&Ms

Central Angle for each segment = Category Budget Degrees Human Space Flight Technology Mission Support Total: # in category total number X 360 degrees

AbsencesGrade Number of Absences x y Final Grade Do the absences depend on the grade? OR Do the grades depend on the absences? The Impact of Absences on Grades

Section 2.3 Measures of Central Tendency

Discover how outliers affects the mean, median, and mode How to describe the shape of a distribution as symmetric, uniform, or skewed How to find a weighted mean and the mean of a frequency distribution

Given the following set determine the mean, median, and mode Mean: Median: Mode:

Given the following set determine the mean, median, and mode Mean: Median: Mode:

Remove the outlier and determine the mean, median, mode Mean: Median: Mode: Outlier: An individual value that falls outside the overall pattern

Remove the outlier and determine the mean, median, mode Mean: 23.75New Mean: Median: 21.5New Median: 21 Mode: 20New Mode: 20

Uniform Symmetric Bell-shaped Symmetric Skewed RightSkewed Left Mean = Median Mean > MedianMean < Median Mean = Median When describing the center of a data set use the mean for symmetrical distributions and the median for skewed distributions

Weighted Mean: The mean of a data set whose entries have varying weights. Example: You are taking AP Biology and your grade is determined by five sources: 50% test, 15% midterm, 20% final exam, 10% lab work, and 5% homework. You scores are 86 (test), 96 (midterm), 82 (final), 98 (lab work), and 100 (homework. What is your weighted mean? SourceXWXW Tests Midterm Final Labs Homework

Weighted Mean: The mean of a data set whose entries have varying weights. Example: You are taking AP Biology and your grade is determined by five sources: 50% test, 15% midterm, 20% final exam, 10% lab work, and 5% homework. You scores are 86 (test), 96 (midterm), 82 (final), 98 (lab work), and 100 (homework. What is your weighted mean? SourceXWXW Tests860.5 Midterm Final820.2 Labs980.1 Homework Σw = 1

Weighted Mean: The mean of a data set whose entries have varying weights. Example: You are taking AP Biology and your grade is determined by five sources: 50% test, 15% midterm, 20% final exam, 10% lab work, and 5% homework. You scores are 86 (test), 96 (midterm), 82 (final), 98 (lab work), and 100 (homework. What is your weighted mean? SourceXWXW Tests Midterm Final Labs Homework Σw = 1Σxw = 88.6

The average starting salaries (by degree attained) for 20 employees at a company are given below: 2 with a doctorate: $105,000 7 with a masters degree: $63, with a bachelors degree: $41,000 What is the mean starting salary for these employees? SourceXWXW Σw = Σxw =

The average starting salaries (by degree attained) for 20 employees at a company are given below: 2 with a doctorate: $105,000 7 with a masters degree: $63, with a bachelors degree: $41,000 What is the mean starting salary for these employees? SourceXWXW Doctorate105,0002 Masters63,0007 Bachelors41,00011 Σw = 20 = 20 =

The average starting salaries (by degree attained) for 20 employees at a company are given below: 2 with a doctorate: $105,000 7 with a masters degree: $63, with a bachelors degree: $41,000 What is the mean starting salary for these employees? SourceXWXW Doctorate105, ,000 Masters63, ,000 Bachelors41, ,000 Σw = 20 Σxw = 1,102,000 = 1,102, = $55,100

Mean of a Frequency Distribution: Example: Use the frequency distribution to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent sessions. xfxf n = 50 *If you are given a class, use the midpoint for x.

Mean of a Frequency Distribution: Example: Use the frequency distribution to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent sessions. xfxf n = 50 Σ xf = *If you are given a class, use the midpoint for x.

Example: Use the frequency distribution to approximate the mean age of the residents of Bow, Wyoming. *If you are given a class, use the midpoint for x. Agexfxf 0 – – – – – – – – – 898 n = Σ xf =

Example: Use the frequency distribution to approximate the mean age of the residents of Bow, Wyoming. Agexfxf 0 – – – – – – – – – n = 389 Σ xf = 13,620.5 *If you are given a class, use the midpoint for x. 13, years

1. Suppose John wants to calculate his grade in his statistics class with the grading scale listed below. There are three exams each worth 15%, a final exam worth 25% and homework worth 30%. John has received a 81%, 77%, and 65% on his tests. His homework percentage is 84.6%. If John gets a 78% on the final exam, calculate his grade in the class. 2. Suppose that Ashley is took 4 courses this semester. She got a 4.0 in her 3 credit statistics class (of course), a 3.0 in her 5 credit chemistry class, a 3.5 in her 2 credit fitness class and a 2.5 in her 4 credit English class. What is her GPA for the semester?

Section 2.4 Finding and Interpreting Measures of Variation (Day 1)

How to find the range, variance, and standard deviation of a data set.

Closing prices for two stocks were recorded on ten successive Fridays. Calculate the mean, median, and mode for each DIS The Walt Disney Company WMT Wal-Mart Stores

Both stocks have the same mean, median, and mode. Does this mean the data are the same? How are the two stocks different? DIS Mean = 61.5 Median = 62 Mode = 67 WMT Mean = 61.5 Median = 62 Mode = WMT DIS

Both stocks have the same mean, median, and mode. Does this mean the data are the same? How are the two stocks different? DIS Mean = 61.5 Median = 62 Mode = 67 WMT Mean = 61.5 Median = 62 Mode = WMT DIS RANGE = 11 RANGE = 57

The deviation for each value x is the difference between the value of x and the mean of the data set. To learn to calculate measures of variation that use every value in the data set, you first want to know about deviations. In a population, the deviation for each value x is: In a sample, the deviation for each value x is:

– 5.5 – 4.5 – 3.5 – – – 61.5 DIS Deviation The sum of the deviations is always zero. Mean: 61.5

Population Variance: The sum of the squares of the deviations, divided by N. Sample Variance: The sum of the squares of the deviations, divided by n – 1.

x - x – 5.5 – 4.5 – 3.5 – x (x – x)

Population Standard Deviation: The square root of the population variance. The standard deviation is $4.58 for the Disney Stock. Sample Standard Deviation: The square root of the sample variance.

Now, let’s find the standard deviation for the Wal-Mart stock prices. x – x x (x – x) 2

Now, let’s find the standard deviation for the Wal-Mart stock prices. x – x x (x – x) x

The population standard deviation for DIS stock is $4.58. The population standard deviation for WMT stock is $ The data for the WMT stock is much more deviated (spread out) than DIS stock.

Describes the standard deviation as a percent of the mean. Used to compare data with different units. Example: On average students spend 80 minutes studying per night with a standard deviation of 15 minutes. Also student height averages at 66 inches with a standard deviation of 3.5 inches. Which is more variable, height or study time? CV study = 15/80 * 100 = CV height = 3.5/66 * 100 = 5.303

Range = Maximum value – Minimum value Sample Standard Deviation Sample Variance Population Standard Deviation Population Variance Coefficient of Variation

Section 2.4 Finding and Interpreting Measures of Variation (Day 2)

How to approximate the sample standard deviation for grouped data

Sample Grouped Standard Deviation Sample Grouped Variance

f x x xf (x- ) (x- ) 2 (x- ) 2 f

f x x xf (x- ) (x- ) 2 (x- ) 2 f

f x x xf (x- ) (x- ) (x- ) 2 f

f x x xf (x- ) (x- ) (x- ) 2 f

f f Class xf (x- ) (x- ) 2 (x- ) 2 f x

f f Class xf (x- ) (x- ) 2 (x- ) 2 f x n = 27∑xf =

S=17.18 f f Class xf (x- ) (x- ) (x- ) 2 f n = 27∑xf = ∑ (x- ) 2 f = 7674 x

In studying the behavior of Old Faithful geyser in Yellowstone National Park, geologists collect data for the time (in minutes) between eruptions. The table below summarizes actual data that were obtained. (a) What is the mean time between eruptions? (b) What is the standard deviation for the time between eruptions?

Section 2.4 Finding and Interpreting Measures of Variation (Day 3)

How to use the Empirical to interpret standard deviation

Data with symmetric bell-shaped distribution have the following characteristics. About 68% of the data lies within 1 standard deviation of the mean About 99.7% of the data lies within 3 standard deviations of the mean About 95% of the data lies within 2 standard deviations of the mean –4–3–2–101234

The mean value of homes on a street is $125 thousand with a standard deviation of $5 thousand. The data set has a bell shaped distribution. Estimate the percent of homes between $120 and $135 thousand. $120 thousand is 1 standard deviation below the mean 81.5% of the homes have a value between $120,000 and $135, $135 thousand is 2 standard deviations above the mean.

The distribution of heights of young women aged 18 to 24 is approximately normal with mean 64.5 inches and standard deviation 2.5 inches. (a) What percent of the young women are taller than 64.5 inches? (b) What percent of young women are between 64.5 and 69.5 inches? (c) What percent of heights are less than 59.5 inches?

Suppose it takes you, on average, 20 minutes to drive to school with a standard deviation of 2 minutes. Suppose a Normal model is appropriate for your driving times. (a)How often will you arrive in less than 22 minutes? (b) How often will it take you more than 26 minutes? (c) How often will it take you between 22 and 28 minutes?

Each portion of the SAT reasoning test is designed to be approximately normal and have an overall mean of 500 and standard deviation of 100. (a) What percent of students will score above 500? (b) What percent of students will score below 400? (c) What percent of students will score between 600 and 800?

Section 2.5 Measures of Position

How to find the quartiles and inter-quartile range of a data set How to represent the data using a box and whisker plot How to interpret other fractiles, such as percentiles How to calculate z-scores

3 quartiles Q 1, Q 2 and Q 3 divide the data into 4 equal parts.  Q 2 is the same as the median.  Q 1 is the median of the data below Q 2.  Q 3 is the median of the data above Q 2.

You are managing a store. The average sale for each of 27 randomly selected days in the last year is given. Find Q 1, Q 2, and Q 3. The data in ranked order (n = 27) are: The median = Q2 = 42. Q1 is 30 & Q3 is 45. The Interquartile Range is Q 3 – Q 1 = 45 – 30 = 15. Lower half Upper half

A box and whisker plot uses 5 key values to describe a set of data. Q 1, Q 2 and Q 3, the minimum value and the maximum value. 5 Number Summary Minimum value First Quartile Q 1 The median Q 2 Third Quartile Q 3 Maximum value Interquartile Range = 45 – 30 = 15

The data in ranked order (n = 26) are: Lower half Upper half Number Summary Minimum value: First Quartile Q 1 : The median Q 2 : Third Quartile Q 3 : Maximum value: Interquartile Range =____________ 100

The data in ranked order (n = 26) are: Lower half Upper half Number Summary Minimum value: 51 First Quartile Q 1 : 73 The median Q 2 : 87 Third Quartile Q 3 : 93 Maximum value: Interquartile Range = 93 – 73 =

Percentiles divide the data into 100 parts. There are 99 percentiles: P 1, P 2, P 3 …P 99. Example: If a ten year old boy is at the 75 th percentile, that means the boys weight is greater than 75% of all ten year old boys. P 50 =Q 2 =median P 25 =Q 1 P 75 =Q 3

The standard score or z-score, represents the number of standard deviations that a data value, x, falls from the mean. Example: The test scores for a civil service exam have a mean of 152 and standard deviation of 7. Find the standard z-score for a person with a score of: (a) 161 (b) 148 (c) 152

(c) 152 (a) 161 (b) 148 A value of x = 161 is 1.29 standard deviations above the mean. A value of x = 148 is 0.57 standard deviations below the mean. A value of x = 152 is equal to the mean. Mean: 152 and standard deviation: 7

What does a z-score of 1.29 mean? This means that 90.15% of the people scored a 161 or below on their civil service exam. OR This means that 9.85% of the people scored a 161 or higher on their civil service exam.