Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.

Slides:



Advertisements
Similar presentations
Chapter 3, Numerical Descriptive Measures
Advertisements

Chapter 4: Descriptive Statistical Measures
Measures of Dispersion
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
Basic Business Statistics 10th Edition
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Statistics for Managers.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Numerical Descriptive Techniques
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
STATISTICS I COURSE INSTRUCTOR: TEHSEEN IMRAAN. CHAPTER 4 DESCRIBING DATA.
4 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 2 Describing Data.
Describing distributions with numbers
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Lecture 3 Describing Data Using Numerical Measures.
Skewness & Kurtosis: Reference
Variation This presentation should be read by students at home to be able to solve problems.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
Welcome to MM305 Unit 2 Seminar Dr. Bob Statistical Foundations for Quantitative Analysis.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Chapter 2 Describing Data: Numerical
COMPLETE BUSINESS STATISTICS
Descriptive Statistics ( )
Statistics for Managers Using Microsoft® Excel 5th Edition
Business and Economics 6th Edition
Analysis and Empirical Results
Chapter 3 Describing Data Using Numerical Measures
Averages and Variation
Descriptive Statistics
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Descriptive Statistics
Honors Statistics Review Chapters 4 - 5
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
St. Edward’s University
Business and Economics 7th Edition
Presentation transcript:

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-2 Excel Support Excel statistical functions Analysis Toolpak tools PHStat tools and procedures

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-3 Descriptive Statistics Frequency distributions and histograms Measures of central tendency Measures of dispersion Measures of shape Data profiles Coefficient of variation Correlation

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-4 Excel Descriptive Statistics Tool

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-5 Facebook Survey Results

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-6 Terminology and Notation Parameter – a measurable characteristic of a population:  is a parameter,  x is not x i represents the i th observation  indicates the operation of addition N is the size of the population; n is the size of the sample f i is the number of observations in cell i of a frequency distribution

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-7 Arithmetic Mean Population Sample Excel function AVERAGE(range)

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-8 Properties of the Mean Meaningful for interval and ratio data All data used in the calculation Unique for every set of data Affected by unusually large or small observations (outliers) The only measure of central tendency where the sum of the deviations of each value from the measure is zero; i.e.,  ( x i – x ) = 0

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-9 Median Middle value when data are ordered from smallest to largest. This results in an equal number of observations above the median as below it. Unique for each set of data Not affected by extremes Meaningful for ratio, interval, and ordinal data Excel function MEDIAN(range)

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-10 Mode Observation that occurs most frequently; for grouped data, the midpoint of the cell with the largest frequency (approximate value) Useful when data consist of a small number of unique values

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-11 Midrange Average of the largest and smallest observations Useful for very small samples, but extreme values can distort the result

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-12 Measures of Dispersion Dispersion – the degree of variation in the data. E.g., {48, 49, 50, 51, 52} and {10, 30, 50, 70, 90} Range – difference between the maximum and minimum observations Same issues as with midrange

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-13 Variance Population Sample Excel functions VARP, VAR

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-14 Standard Deviation Population Sample The standard deviation has the same units of measurement as the original data, unlike the variance Excel functions STDEVP, STDEV

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-15 Grouped Data: Calculation of Mean Sample Population In a frequency distribution, replace x i with a representative value (e.g., midpoint)

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-16 Grouped Data: Calculation of Variance Sample Population

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-17 Chebyshev’s Theorem For any set of data, the proportion of values that lie within k standard deviations of the mean is at least 1 – 1/k 2, for any k > 1 For k = 2, at least ¾ of the data lie within 2 standard deviations of the mean For k = 3, at least 8/9, or 89% lie within 3 standard deviations of the mean For k = 10, at least 99/100, or 99% of the data lie within 10 standard deviations of the mean

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-18 Example Mean = 28.87; standard deviation =  about the mean: [-36.9, 94.6] 2  about the mean: [-15.0, 72.7]

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-19 Coefficient of Variation CV = Standard Deviation / Mean CV is dimensionless, and therefore is useful when comparing data sets that are scaled differently.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-20 Frequency Distribution Tabular summary showing the frequency of observations in each of several non- overlapping (mutually exclusive) classes, or cells Relative frequency – fraction or proportion of observations that fall within a cell Cumulative frequency – proportion or percentage of observations that fall below the upper limit of a cell

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-21 Example: Facebook Friends

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-22 Histogram Column chart representing a frequency distribution

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-23 Excel Tool: Histogram Excel Menu >Tools > Data Analysis > Histogram Specify range of data Define and specify bin range (recommended) Select output options (always check Chart Output

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-24 Good Practice Guidelines Cell intervals should be of equal width. Choose the width using the formula (largest value – smallest value)/number of cells but round to reasonable values (e.g., 97 to 100) Choose somewhere between 5 to 15 cells to provide a useful picture of the data

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-25 Examples from Facebook Survey

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-26 Excel Frequency Function Define bins Select a range of cells adjacent to the bin range (if continuous data, add one empty cell below this range as an overflow cell) Enter the formula =FREQUENCY(range of data, range of bins) and press Ctrl-Shift-Enter simultaneously. Construct a histogram using the Chart Wizard for a column chart.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-27 Skewness Coefficient of skewness (CS) -0.5 < CS < 0.5 indicates relative symmetry Relatively SymmetricPositively skewed

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-28 Kurtosis Refers to the peakedness or flatness of a distribution. Coefficient of kurtosis (CK) CK < 3: more flat with wide degree of dispersion CK >3 more peaked with less dispersion The higher the kurtosis, the more area in the tails of the distribution

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-29 Data Profiles (Fractiles) Describe the location and spread of data over its range Quartiles – a division of a data set into four equal parts; shows the points below which 25%, 50%, 75% and 100% of the observations lie (25% is the first quartile, 75% is the third quartile, etc.) Deciles – a division of a data set into 10 equal parts; shows the points below which 10%, 20%, etc. of the observations lie Percentiles – a division of a data set into 100 equal parts; shows the points below which “k” percent of the observations lie

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-30 Statistical Relationships Correlation – a measure of strength of linear relationship between two variables Sample correlation coefficient Covariance – average of the products of the deviations of each variable from its mean; describes how two variables move together Sample covariance

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-31 Examples of Correlation

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-32 Excel Tool: Correlation Excel menu > Tools > Data Analysis > Correlation

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-33 Colleges and Universities Data

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-34 Box and Whisker Plots Display minimum, first quartile (Q 1 ), median, third quartile (Q 3 ), and maximum values graphically min 1 st quartile median 3 rd quartile max

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-35 PHStat Tool: Box and Whisker Plot PHStat menu > Descriptive Statistics > Box and Whisker Plot Enter data range Choose type of data set Check box for five number summary

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-36 Stem and Leaf Display Each number is divided into two parts: x  y x = stem, and y = leaf Stem = cell; leaf = value within cell 11   46 Stem and leaf display aggregates and sorts all leaves within the same stem: Number Stem  Leaf     6

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-37 Stem and Leaf Stem unit is a power of 10; the higher the stem unit, the more aggregation of data

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-38 PHStat Tool: Stem and Leaf Display PHStat menu > Descriptive Statistics > Stem and Leaf Display Enter data range Select stem unit or autocalculation Check Summary Statistics box

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-39 Dot Scale Diagram PHStat menu > Descriptive Statistics > Dot Scale Diagram

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-40 Categorical Data: Proportions Proportion - fraction of data that has a certain characteristic Use the Excel function COUNTIF(data range, criteria) to count observations meeting a criterion to compute proportions.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-41 Cross-Tabulation (Contingency Table)