1 Tendencia central y dispersión de una distribución.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Chapter 3, Numerical Descriptive Measures
Class Session #2 Numerically Summarizing Data
© 2002 Prentice-Hall, Inc.Chap 3-1 Basic Business Statistics (8 th Edition) Chapter 3 Numerical Descriptive Measures.
Calculating & Reporting Healthcare Statistics
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Numerical Descriptive Techniques
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Intro to Descriptive Statistics
Basic Business Statistics 10th Edition
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Measures of Central Tendency
Describing Data: Numerical
Numerical Descriptive Techniques
1 Descriptive Statistics: Numerical Methods Chapter 4.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
Review of Measures of Central Tendency, Dispersion & Association
Economics 173 Business Statistics Lecture 2 Fall, 2001 Professor J. Petry
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Statistics for Managers.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Numerical Descriptive Techniques
Chapter 3 – Descriptive Statistics
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
Descriptive Statistics Descriptive Statistics describe a set of data.
QBM117 Business Statistics Descriptive Statistics Numerical Descriptive Measures.
Descriptive Statistics Roger L. Brown, Ph.D. Medical Research Consulting Middleton, WI Online Course #1.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Descriptive Statistics: Numerical Methods
Review of Measures of Central Tendency, Dispersion & Association
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Variation This presentation should be read by students at home to be able to solve problems.
1 Economics 173 Business Statistics Lectures 1 & 2 Summer, 2001 Professor J. Petry.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
1 Measures of Center. 2 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely.
Summary Statistics: Measures of Location and Dispersion.
Economics 173 Business Statistics Lectures 1 Fall, 2001 Professor J. Petry.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
EXPECTATION, VARIANCE ETC. - APPLICATION 1. 2 Measures of Central Location Usually, we focus our attention on two types of measures when describing population.
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
© 1999 Prentice-Hall, Inc. Chap Measures of Central Location Mean, Median, Mode Measures of Variation Range, Variance and Standard Deviation Measures.
Descriptive Statistics ( )
Business and Economics 6th Edition
Numerical Descriptive Techniques
Chapter 4 Describing Data (Ⅱ ) Numerical Measures
Ch 4 實習.
Numerical Measures: Centrality and Variability
Midrange (rarely used)
Descriptive Statistics
Numerical Descriptive Measures
Numerical Descriptive Measures
Numerical Descriptive Measures
Numerical Descriptive Statistics
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Business and Economics 7th Edition
Numerical Descriptive Measures
Presentation transcript:

1 Tendencia central y dispersión de una distribución

2 Review Topics Measures of Central Tendency Mean, Median, Mode Quartile Measures of Variation The Range, Variance and Standard Deviation, Coefficient of variation Shape Symmetric, Skewed

3 Important Summary Measures Central Tendency Mean Median Mode Quartile One sample Summary Measures Variation Variance Standard Deviation Coefficient of Variation Range

4 Measures of Central Tendency Central Tendency Mean MedianMode Data: You can access practice sample data on HMO premiums here.here

5 With one data point clearly the central location is at the point itself. But if the third data point appears on the left hand-side of the midrange, it should “pull” the central location to the left. Measures of Central Location (Tendency) Usually, we focus our attention on two aspects of measures of central location: – Measure of the central data point (the average). – Measure of dispersion of the data about the average. With two data points, the central location should fall in the middle between them (in order to reflect the location of both of them). If the third data point appears exactly in the middle of the current range, the central location should not change (because it is currently residing in the middle).

6 – This is the most popular and useful measure of central location Sum of the measurements Number of measurements Mean = Sample meanPopulation mean Sample sizePopulation size § Arithmetic mean

7 Example 4.1 The mean of the sample of six measurements 7, 3, 9, -2, 4, 6 is given by Example 4.2 Suppose the telephone bills of example 2.1 represent population of measurements. The population mean is

8 26,26,28,29,30,32,60,31 Odd number of observations 26,26,28,29,30,32,60 Example 4.4 Seven employee salaries were recorded (in 1000s) : 28, 60, 26, 32, 30, 26, 29. Find the median salary. – The median of a set of measurements is the value that falls in the middle when the measurements are arranged in order of magnitude. Suppose one employee’s salary of $31,000 was added to the group recorded before. Find the median salary. Even number of observations 26,26,28,29, 30,32,60,31 There are two middle values! First, sort the salaries. Then, locate the value in the middle First, sort the salaries. Then, locate the value s in the middle 26,26,28,29, 30,32,60, , § The median

9 – The mode of a set of measurements is the value that occurs most frequently. – Set of data may have one mode (or modal class), or two or more modes. The modal class For large data sets the modal class is much more relevant than the a single- value mode. § The mode

10 Example 4.6 A professor of statistics wants to report the results of a midterm exam, taken by 100 students. The data appear in file XM Find the mean, median, and mode, and describe the information they provide. The mean provides information about the over-all performance level of the class. The Median indicates that half of the class received a grade below 81%, and half of the class received a grade above 81%. The mode must be used when data is qualitative. If marks are classified by letter grade, the frequency of each grade can be calculated.Then, the mode becomes a logical measure to compute. Excel Results

11 Relationship among Mean, Median, and Mode If a distribution is symmetrical, the mean, median and mode coincide If a distribution is non symmetrical, and skewed to the left or to the right, the three measures differ. A positively skewed distribution (“skewed to the right”) Mean Median Mode

12 ` If a distribution is symmetrical, the mean, median and mode coincide If a distribution is non symmetrical, and skewed to the left or to the right, the three measures differ. A positively skewed distribution (“skewed to the right”) Mean Median Mode Mean Median Mode A negatively skewed distribution (“skewed to the left”)

13 Measures of Variation Variation VarianceStandard DeviationCoefficient of Variation Population Variance Sample Variance Population Standard Deviation Sample Standard Deviation Range Interquartile Range

14 Measures of variability (Looking beyond the average) Measures of central location fail to tell the whole story about the distribution. A question of interest still remains unanswered: How typical is the average value of all the measurements in the data set? How much spread out are the measurements about the average value? or

15 Observe two hypothetical data sets The average value provides a good representation of the values in the data set. Low variability data set High variability data set The same average value does not provide as good presentation of the values in the data set as before. This is the previous data set. It is now changing to...

16 – The range of a set of measurements is the difference between the largest and smallest measurements. – Its major advantage is the ease with which it can be computed. – Its major shortcoming is its failure to provide information on the dispersion of the values between the two end points. ? ? ? But, how do all the measurements spread out? Smallest measurement Largest measurement The range cannot assist in answering this question Range § The range

17 – This measure of dispersion reflects the values of all the measurements. – The variance of a population of N measurements x 1, x 2,…,x N having a mean  is defined as – The variance of a sample of n measurements x 1, x 2, …,x n having a mean is defined as § The variance

18 Consider two small populations: Population A: 8, 9, 10, 11, 12 Population B: 4, 7, 10, 13, = = = = = = = = +6 Sum = 0 The mean of both populations is …but measurements in B are much more dispersed then those in A. Thus, a measure of dispersion is needed that agrees with this observation. Let us start by calculating the sum of deviations A B The sum of deviations is zero in both cases, therefore, another measure is needed.

= = = = = = = = +6 Sum = 0 A B The sum of deviations is zero in both cases, therefore, another measure is needed. The sum of squared deviations is used in calculating the variance. See example next.

20 Let us calculate the variance of the two populations Why is the variance defined as the average squared deviation? Why not use the sum of squared deviations as a measure of dispersion instead? After all, the sum of squared deviations increases in magnitude when the dispersion of a data set increases!!

21 – Example 4.8 Find the mean and the variance of the following sample of measurements (in years). 3.4, 2.5, 4.1, 1.2, 2.8, 3.7 – Solution A shortcut formula =[ … ]-[(17.7) 2 /6] = (years) 2

22 Sample Standard Deviation For the Sample : use n - 1 in the denominator. Data: s = n = 8 Mean =16 = s

23 Interpreting Standard Deviation The standard deviation can be used to – compare the variability of several distributions – make a statement about the general shape of a distribution. The empirical rule: If a sample of measurements has a mound-shaped distribution, the interval

24 Comparing Standard Deviations s = = = Value for the Standard Deviation is larger for data considered as a Sample. Data : N= 8 Mean =16

25 Comparing Standard Deviations Mean = 15.5 s = Data B Data A Mean = 15.5 s = Mean = 15.5 s = 4.57 Data C

26 Measures of Association Two numerical measures are presented, for the description of linear relationship between two variables depicted in the scatter diagram. – Covariance - is there any pattern to the way two variables move together? – Correlation coefficient - how strong is the linear relationship between two variables

27  x (  y ) is the population mean of the variable X (Y) N is the population size. n is the sample size. § The covariance

28 – This coefficient answers the question: How strong is the association between X and Y. § The coefficient of correlation

29 COV(X,Y)=0  or r = +1 0 Strong positive linear relationship No linear relationship Strong negative linear relationship or COV(X,Y)>0 COV(X,Y)<0