CHAPTER 2: Basic Summary Statistics

Slides:



Advertisements
Similar presentations
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Advertisements

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Biostatistics Unit 2 Descriptive Biostatistics 1.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
2.3. Measures of Dispersion (Variation):
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
MGQ 201 WEEK 4 VICTORIA LOJACONO. Help Me Solve This Tool.
Descriptive Statistics Anwar Ahmad. Central Tendency- Measure of location Measures descriptive of a typical or representative value in a group of observations.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Why statisticians were created Measure of dispersion FETP India.
Variability. Statistics means never having to say you're certain. Statistics - Chapter 42.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
Descriptive Statistics: Numerical Methods
Describing Data Statisticians describe a set of data in two general ways. Statisticians describe a set of data in two general ways. –First, they compute.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
According to researchers, the average American guy is 31 years old, 5 feet 10 inches, 172 pounds, works 6.1 hours daily, and sleeps 7.7 hours. These numbers.
INVESTIGATION 1.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Introduction to Statistics Santosh Kumar Director (iCISA)
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data Descriptive Statistics: Central Tendency and Variation.
Statistics 1: Introduction to Probability and Statistics Section 3-2.
Applied Quantitative Analysis and Practices LECTURE#07 By Dr. Osman Sadiq Paracha.
CHAPTER 2: Basic Summary Statistics
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Chapter 3 Descriptive Statistics: Numerical Methods.
Bio-Statistic KUEU 3146 & KBEB 3153 Bio-Statistic Data grouping and presentations Part II: Summarizing Data.
Lectures' Notes STAT –324 Probability Probability and Statistics for Engineers First Semester 1431/1432 and 5735 Teacher: Dr. Abdel-Hamid El-Zaid Department.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
© 1999 Prentice-Hall, Inc. Chap Measures of Central Location Mean, Median, Mode Measures of Variation Range, Variance and Standard Deviation Measures.
Chapter 3 Numerical Descriptive Measures. 3.1 Measures of central tendency for ungrouped data A measure of central tendency gives the center of a histogram.
Descriptive Statistics ( )
Statistics in Management
Statistics for Managers Using Microsoft® Excel 5th Edition
Business and Economics 6th Edition
Descriptive Statistics
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
Descriptive Statistics: Overview
Introductory Mathematics & Statistics
Central Tendency and Variability
Numerical Measures: Centrality and Variability
Summary descriptive statistics: means and standard deviations:
Numerical Descriptive Measures
Lectures' Notes STAT – 324  Probability and Statistics for Engineers First Semester 1431/1432 and 5735 Teacher: Dr. Abdel-Hamid El-Zaid Department.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Summary descriptive statistics: means and standard deviations:
Statistics 1: Introduction to Probability and Statistics
Numerical Descriptive Measures
Numerical Descriptive Measures
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
St. Edward’s University
2.3. Measures of Dispersion (Variation):
Lectures' Notes STAT – 324  Probability and Statistics for Engineers First Semester 1431/ and 5735 Teacher: Dr. Abdel-Hamid El-Zaid Department.
Business and Economics 7th Edition
Numerical Descriptive Measures
Presentation transcript:

CHAPTER 2: Basic Summary Statistics ·       Measures of Central Tendency (or location) §       Mean – mode – median ·       Measures of Dispersion (or Variation) §       Variance – standard deviation – coefficient of variation 2.1. Introduction: For the population of interest, there is a population of values of the variable of interest.

Let x1,x2, …, xn be the sample values (these values are known) Let X1,X2, …, XN be the population values (in general, they are unknown) of the variable of interest. The population size = N Let x1,x2, …, xn be the sample values (these values are known) The sample size = n  (i)              A parameter is a measure (or number) obtained from the population values X1,X2, …, XN (parameters are unknown in general)               A statistic is a measure (or number) obtained from the sample values x1,x2, …, xn (statistics are known in general)

2.2. Measures of Central Tendency: (Location) ·       The values of a variable often tend to be concentrated around the center of the data. Some of these measures are: the mean, mode, median ·       These measures are considered as representatives (or typical values) of data. : Mean: (1) Population mean : If X1,X2, …, XN are the population values of the variable of interest , then the population mean is: (unit)

·       The population mean is a parameter (it is usually unknown) (2) Sample mean : If are the sample values, then the sample mean is (unit)

Consider the following population values: ·       The sample mean is a statistic (it is known) ·  The sample mean is used to approximate (estimate) the population mean . Example: Consider the following population values: Suppose that the sample values obtained are:

Then: (unit) (unit) Notes: ·       The mean is simple to calculate. ·       There is only one mean for a given sample data. ·       The mean can be distorted by extreme values. The mean can only be found for quantitative variables Median: The median of a finite set of numbers is that value which divides the ordered set into two equal parts. Let x1,x2, …, xn be the sample values . We have two cases:

(1) If the sample size, n, is odd: ·       The median is the middle value of the ordered observations. The middle observation is the ordered observation The median = The order observation. * 1 2 … Middle value= MEDIAN n Ordered set (smallest to largest) Rank (or order)

n = 5 (odd number) Example: Find the median for the sample values: 10, 54, 21, 38, 53. Solution: n = 5 (odd number) The rank of the middle value (median) = = (5+1)/2 = 3 10 21 38 53 54 1 2 = 3 4 5 Ordered set Rank (or order) The median =38 (unit)

(2) If the sample size, n, is even: ·       The median is the mean (average) of the two middle values of the ordered observations. · The middle two values are the ordered and observations. ·       The median = Ordered set   Rank (or order)

Example: Find the median for the sample values: 10, 35, 41, 16, 20, 32 Solution: .n = 6 (even number) The rank of the middle values are = 6 / 2 = 3 = (6 / 2) + 1 = 4

The median is simple to calculate. Ordered set Rank (or order) The median (unit) Note: The median is simple to calculate. There is only one median for given data. The median is not affected too much by extreme values. The median can only be found for quantitative variables

· The mode is simple to calculate but it is not “good”. The mode of a set of values is that value which occurs with the highest frequency. ·If all values are different or have the same frequency, there is no mode. · A set of data may have more than one mode. Example: Note: ·  The mode is simple to calculate but it is not “good”. ·  The mode is not affected too much by extreme values. ·  The mode may be found for both quantitative and qualitative variables.

2.3. Measures of Dispersion (Variation): The variation or dispersion in a set of values refers to how spread out the values are from each other.      The variation is small when the values are close together.  There is no variation if the values are the same.  

Range: Some measures of dispersion: Range = Max  Min Smaller variation   Larger variation Some measures of dispersion: Range – Variance – Standard deviation Coefficient of variation Range: Range is the difference between the largest (Max) and smallest (Min) values. Range = Max  Min

Example: Find the range for the sample values: 26, 25, 35, 27, 29, 29.   Solution: Range = 35  25 = 10 (unit) Note: The range is not useful as a measure of the variation since it only takes into account two of the values. (it is not good) Variance:   The variance is a measure that uses the mean as a point of reference. The variance is small when all values are close to the mean. The variance is large when all values are spread out from the mean.   deviations from the mean:

Deviations from the mean: (1) Population variance:  Let X1,X2, …, XN be the population values. The population variance is defined by where is the population mean

(2) Sample Variance: Let be the sample values. The sample variance is defined by: where is the sample mean.

Example: We want to compute the sample variance of the following sample values: 10, 21, 33, 53, 54.  Solution: n=5

Another method: Calculating Formula for S2:

Note: To calculate S2 we need: ·       n = sample size ·       The sum of the values ·       The sum of the squared values

Standard Deviation: ·       The standard deviation is another measure of variation. ·       It is the square root of the variance. (1) Population standard deviation is: (unit) (2) Sample standard deviation is: (unit)

Coefficient of Variation (C.V.): ·       The variance and the standard deviation are useful as measures of variation of the values of a single variable for a single population (or sample). ·       If we want to compare the variation of two variables we cannot use the variance or the standard deviation because: 1.    The variables might have different units. 2.    The variables might have different means.

C.V. = (free of unit or unit less) ·       We need a measure of the relative variation that will not depend on either the units or on how large the values are. This measure is the coefficient of variation (C.V.) which is defined by: C.V. = (free of unit or unit less)   Mean St.dev. C.V. 1st data set 2nd data set

·       The relative variability in the 1st data set is larger than the relative variability in the 2nd data set if C.V1> C.V2 (and vice versa).  Example: 1st data set: 66 kg, 4.5 kg 2nd data set: 36 kg, 4.5 kg   Since , the relative variability in the 2nd data set is larger than the relative variability in the 1st data set.

Notes: (Some properties of , S, and S2:   Sample values are : x1,x2, …, xn a and b are constants Sample Data Sample mean Sample st.dev Sample Variance Absolute value:

Example: Sample mean St..dev. Variance 1,3,5 3 2 4 (1) (2) (3)   Sample mean St..dev. Variance 1,3,5 3 2 4 (1) (2) (3) -2, -6, -10 11, 13, 15 8, 4, 0 -6 13 16 Data (1) (a = 2) (2) (b = 10)   (3) (a = 2, b = 10)