Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Descriptive Statistics
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Descriptive Statistics
Descriptive Statistics
Measures of Variation Section 2.4 Statistics Mrs. Spitz Fall 2008.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Measures of Dispersion
Descriptive Statistics
Measures of Central Tendency Section 2.3 Statistics Mrs. Spitz Fall 2008.
Descriptive Statistics
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
More Graphs and Displays
Descriptive Statistics
Frequency Distributions and Their Graphs
Descriptive Statistics
Describing Data: Numerical
Chapter 2 Describing Data with Numerical Measurements
Describing Data Using Numerical Measures
Department of Quantitative Methods & Information Systems
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Copyright © 2004 Pearson Education, Inc.
Descriptive Statistics
Chapter 2 Summarizing and Graphing Data
Descriptive Statistics
MM207-Statistics Unit 2 Seminar-Descriptive Statistics Dr Bridgette Stevens AIM:BStevensKaplan (add me to your Buddy list) 1.
Descriptive Statistics
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Review Measures of central tendency
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Chapter 2 Describing Data.
Frequency Distributions and Their Graphs Section 2.1.
Lecture 3 Describing Data Using Numerical Measures.
Descriptive Statistics
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
Descriptive Statistics. Frequency Distributions and Their Graphs What you should learn: How to construct a frequency distribution including midpoints,
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Sect. 2-1 Frequency Distributions and Their graphs Objective SWBAT construct a frequency distribution including limits,boundaries, midpoints, relative.
Section 2.1 Frequency Distributions and Their Graphs.
Symbol Description It would be a good idea now to start looking at the symbols which will be part of your study of statistics.  The uppercase Greek letter.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Chapter 2 Descriptive Statistics
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
Sect.2.4 Measures of variation Objective: SWBAT find the range of a data set Find the variance and standard deviation of a population and of a sample How.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Chapter 2 Descriptive Statistics.
Chapter 2 Descriptive Statistics.
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
Chapter 2: Descriptive Statistics
Section 2.4 Measures of Variation.
Descriptive Statistics
Two Data Sets Stock A Stock B
Frequency Distributions
Chapter 2 Describing, Exploring, and Comparing Data
Presentation transcript:

Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics

Frequency Distributions and Their Graphs Section 2.1

Larson/Farber Ch 2 3 Frequency Distributions Make a frequency distribution table with five classes. Minutes Spent on the Phone Key values: Minimum value = Maximum value =

Larson/Farber Ch Mark a tally | in appropriate class for each data value. Steps to Construct a Frequency Distribution 1. Choose the number of classes 2. Calculate the Class Width 3. Determine Class Limits Should be between 5 and 15. (For this problem use 5) Find the range = maximum value – minimum. Then divide this by the number of classes. Finally, round up to a convenient number. ( ) / 5 = 11.6 Round up to 12 The lower class limit is the lowest data value that belongs in a class and the upper class limit it the highest. Use the minimum value as the lower class limit in the first class. (67) After all data values are tallied, count the tallies in each class for the class frequencies.

Larson/Farber Ch 2 5  f = Do all lower class limits first. Class Limits Tally f Construct a Frequency Distribution Minimum = 67, Maximum = 125 Number of classes = 5 Class width = 12

Larson/Farber Ch 2 6 Boundaries Frequency Histogram Time on Phone minutes f Class 67 – f

Larson/Farber Ch 2 7 Frequency Polygon Time on Phone minutes f Class f Mark the midpoint at the top of each bar. Connect consecutive midpoints. Extend the frequency polygon to the axis.

Larson/Farber Ch Midpoint: (lower limit + upper limit) / 2 Relative frequency: class frequency/total frequency Cumulative frequency: Number of values in that class or in lower. Midpoint Relative frequency Class f Other Information Cumulative Frequency (67+ 78)/23/30

Larson/Farber Ch 2 9 Relative Frequency Histogram Time on Phone minutes Relative frequency Relative frequency on vertical scale

Larson/Farber Ch 2 10 Ogive An ogive reports the number of values in the data set that are less than or equal to the given value, x Cumulative Frequency minutes Minutes on Phone

More Graphs and Displays Section 2.2

Larson/Farber Ch 2 12 Stem-and-Leaf Plot 6 | 7 | 8 | 9 | 10| 11| 12| Lowest value is 67 and highest value is 125, so list stems from 6 to StemLeaf To see complete display, go to next slide.

Larson/Farber Ch | 7 7 | | | | | | Stem-and-Leaf Plot Key: 6 | 7 means 67

Larson/Farber Ch 2 14 Stem-and-Leaf with t wo lines per stem 6 | 7 7 | 1 7 | 8 8 | 2 8 | | 2 9 | | | | 2 11 | | | 5 Key: 6 | 7 means 67 1st line digits nd line digits st line digits nd line digits

Larson/Farber Ch 2 15 Dotplot Phone minutes

Larson/Farber Ch 2 16 NASA budget (billions of $) divided among 3 categories. Pie Chart §Used to describe parts of a whole §Central Angle for each segment Construct a pie chart for the data.

Larson/Farber Ch 2 17 NASA Budget (Billions of $) Mission Support 19% Technology 41% Human Space Flight 40% Total Pie Chart Billions of $ Human Space Flight5.7 Technology5.9 Mission Support Degrees

Larson/Farber Ch 2 18 Scatter Plot xx x y AbsencesGrade x Final Grade Absences

Measures of Central Tendency Section 2.3

Larson/Farber Ch 2 20 Measures of Central Tendency Mean: The sum of all data values divided by the number of values. For a population: For a sample : Median: The point at which an equal number of values fall above and fall below Mode: The value with the highest frequency

Calculate the mean, the median, and the mode n = 9 Mean: Median: Sort data in order The middle value is 3, so the median is 3.3. Mode: The mode is 2 since it occurs the most times. An instructor recorded the average number of absences for his students in one semester. For a random sample the data are:

Calculate the mean, the median, and the mode n =8 Mean: Median: Sort data in order The middle values are 2 and 3, so the median is 2.5. Mode: The mode is 2 since it occurs the most. Suppose the student with 40 absences is dropped from the course. Calculate the mean, median and mode of the remaining values. Compare the effect of the change to each type of average

Larson/Farber Ch 2 23 Uniform Symmetric Skewed rightSkewed left Mean = Median Mean > MedianMean < Median Shapes of Distributions

Measures of Variation Section 2.4

Larson/Farber Ch 2 25 Closing prices for two stocks were recorded on ten successive Fridays. Calculate the mean, median and mode for each. Mean = 61.5 Median =62 Mode= 67 Mean = 61.5 Median =62 Mode= Stock AStock B Two Data Sets

Larson/Farber Ch 2 26 Range for A = = $11 Range = Maximum value - Minimum value Range for B = = $57 The range is easy to compute but only uses 2 numbers from a data set. Measures of Variation

Larson/Farber Ch 2 27 The deviation for each value x is the difference between the value of x and the mean of the data set. In a population, the deviation for each value x is: x -  In a sample, the deviation for each value x is: Measures of Variation To learn to calculate measures of variation that use each and every value in the data set, you first want to know about deviations.

Larson/Farber Ch Deviations µ =  ( x - µ) = 0 Stock ADeviation The sum of the deviations is always zero.

Larson/Farber Ch 2 29 Population Variance: The sum of the squares of the deviations, divided by N. x Sum of squares Population Variance

Larson/Farber Ch 2 30 Population Standard Deviation Population Standard Deviation The square root of the population variance. The population standard deviation is $4.34

Larson/Farber Ch 2 31 Sample Variance and Standard Deviation To calculate a sample variance divide the sum of squares by n-1. The sample standard deviation, s is found by taking the square root of the sample variance.

Larson/Farber Ch 2 32 Summary Population Standard Deviation Sample Variance Sample Standard Deviation Range = Maximum value - Minimum value Population Variance

Larson/Farber Ch 2 33 Data with symmetric bell-shaped distribution has the following characteristics. About 68% of the data lies within 1 standard deviation of the mean About 99.7% of the data lies within 3 standard deviations of the mean About 95% of the data lies within 2 standard deviations of the mean % 2.35% 68% Empirical Rule ( %)

Larson/Farber Ch 2 34 The mean value of homes on a street is $125 thousand with a standard deviation of $5 thousand. The data set has a bell shaped distribution. Estimate the percent of homes between $120 and $135 thousand Using the Empirical Rule 68% 13.5% 68% $120 thousand is 1 standard deviation below the mean and $135 thousand is 2 standard deviation above the mean. 68% % = 81.5% So, 81.5% have a value between $120 and $135 thousand %

Larson/Farber Ch 2 35 Chebychev’s Theorem For k = 3, at least 1-1/9 = 8/9= 88.9% of the data lies within 3 standard deviation of the mean. For any distribution regardless of shape the portion of data lying within k standard deviations (k >1) of the mean is at least 1 - 1/k 2.  = 6  = 3.84 For k = 2, at least 1-1/4 = 3/4 or 75% of the data lies within 2 standard deviation of the mean.

Larson/Farber Ch 2 36 Chebychev’s Theorem The mean time in a women’s 400-meter dash is 52.4 seconds with a standard deviation of 2.2 sec. Apply Chebychev’s theorem for k = A 2 standard deviations At least 75% of the women’s 400- meter dash times will fall between 48 and 56.8 seconds. Mark a number line in standard deviation units.

Measures of Position Section 2.5

Larson/Farber Ch 2 38 You are managing a store. The average sale for each of 27 randomly selected days in the last year is given. Find Q 1, Q 2 and Q quartiles Q 1, Q 2 and Q 3 divide the data into 4 equal parts. Q 2 is the same as the median. Q 1 is the median of the data below Q 2 Q 3 is the median of the data above Q 2 Quartiles

Larson/Farber Ch 2 39 The data in ranked order (n (n = 27) are: Median rank (27 +1)/2 = 14. The median = Q2 = 42. There are 13 values below the median. Q1 rank= 7. Q1 is 30. Q3 is rank 7 counting from the last value. Q3 is 45. The Interquartile Range is Q 3 - Q 1 = = 15 Finding Quartiles

Larson/Farber Ch 2 40 Box and Whisker Plot A box and whisker plot uses 5 key values to describe a set of data. Q 1, Q 2 and Q 3, the minimum value and the maximum value. Q 1 Q 2 = the median Q 3 Minimum value Maximum value Interquartile Range = 45-30=15

Larson/Farber Ch 2 41 Percentiles Percentiles divide the data into 100 parts. There are 99 percentiles: P 1, P 2, P 3 …P 99. A 63nd percentile score indicates that score is greater than or equal to 63% of the scores and less than or equal to 37% of the scores. P 50 = Q2 Q2 = the median P 25 = Q1Q1 P 75 = Q3Q3

Larson/Farber Ch 2 42 Percentiles falls on or above 25 of the 30 values. 25/30 = So you can approximate 114 = P 83. Cumulative distributions can be used to find percentiles.

Larson/Farber Ch 2 43 Standard Scores The standard score or z-score, represents the number of standard deviations that a data value, x falls from the mean. The test scores for a civil service exam have a mean of 152 and standard deviation of 7. Find the standard z-score for a person with a score of: (a) 161 (b) 148 (c) 152

Larson/Farber Ch 2 44 (c) (a) (b) A value of x =161 is 1.29 standard deviations above the mean. A value of x =148 is 0.57 standard deviations below the mean. A value of x =152 is equal to the mean. Calculations of z-scores