Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 1 The mean, the number of observations, the variance and the standard deviation.

Similar presentations


Presentation on theme: "Chapter 1 The mean, the number of observations, the variance and the standard deviation."— Presentation transcript:

1

2 Chapter 1 The mean, the number of observations, the variance and the standard deviation

3 Some definitions zData - observations, measurements, scores zStatistics - a series of rules and methods that can be used to organize and interpret data. zDescriptive Statistics - methods to summarize large amounts of data with just a few numbers. zInferential Statistics - mathematical procedures to make statements of a population based on a sample.

4 More Definitions zParameter - a number that summarizes or describes some aspect of a population. zSampling Error - the difference between a statistic and its parameter. zNon-parametric Statistics - statistics for observations that are discrete, mutually exclusive, and exhaustive.

5 Where we are going

6 Descriptive Statistics zNumber of Observations zMeasures of Central Tendency zMeasures of Variability

7 Observations zEach score is represented by the letter X. zThe total number of observations is represented by N.

8 Measures of Central Tendency Finding the most typical score ymedian - the middle score ymode - the most frequent score ymean - the average score

9 zGreek letters are used to represent population parameters. z z (mu) is the mathematical symbol for the mean. z z is the mathematical symbol for summation. zFormula -  = (  X) / N zEnglish: To calculate the mean, first add up all the scores, then divide by the number of scores you added up. Calculating the Mean

10 The mode, the median and the mean 60 63 45 63 65 70 55 60 65 63 Ages of people retiring from Rutgers this year. 45 55 60 63 65 70  X = 609 N = 10 Mean  = 60.90 Mode is 63.Median is 63.

11 Measures of Variability zRange - the distance from the highest to the lowest score. zInter-quartile Range - the distance from the top 25% to the bottom 25%. zSum of Squares (SS) - the distance of each score from the mean, squared and then summed. zVariance (  2 )- the average squared distance of scores from mu (SS/N) zStandard Deviation (  )- the square root of the variance.

12 Computing the variance and the standard deviation Scores on a 10 question Psychology quiz Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X -  +1.00 +2.00 -3.00 +1.00  (X-  ) = 0.00 (X -  ) 2 1.00 4.00 9.00 1.00  (X-  ) 2 = SS = 16.00  2 = SS/N = 3.20  = = 1.79

13 The variance is our most basic and important measure of variability zThe variance ( =sigma squared) is the average squared distance of individual scores from the population mean. zOther indices of variation are derived from the variance. zThe average unsquared distance of scores from mu is the standard deviation. To find it you compute the square root of the variance.

14 Other measures of variability derived from the variance zWe can randomly choose scores from a population to form a random sample and then find the mean of such samples. zEach score you add to a sample tends to correct the sample mean back toward the population mean, mu. zThe average squared distance of sample means from the population mean is the variance divided by n, the size of the sample. zTo find the average unsquared distance of sample means from mu divide the variance by n, then take the square root. The result is called the standard error of the sample mean or, more briefly, the standard error of the mean. We’ll see more of this in Ch. 4.

15 Making predictions (1) zWithout any other information, the population mean (mu) is the best prediction of each and every person’s score. zSo you should predict that everyone will score precisely at the population mean. zWhy? Because the mean is an unbiased predictor or estimate (that is, the deviations around the mean sum to zero).

16 Making predictions (2) zThe mean is precisely the number that is the smallest squared distance on the average from the other numbers in the distribution. zThus, the mean is your best prediction, because it is a least squares, unbiased predictor.

17 What happens if we make a prediction other than mu. Scores on a Psychology quiz (mu = 6.00) What happens if we predict everyone will score 5.50? Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X - 5.5 +1.50 +2.50 -2.50 -0.50 +1.50  (X- ?) = 2.50 (X -  ) 2 2.25 6.25 0.25 2.25  (X- ?) 2 = SS = 17.25  2 = SS/N = 3.45  = = 1.86 X - 5.50(X - 5.50) 2

18 Compare that to predicting that everyone will score right at the mean (mu). Scores on a 10 question Psychology quiz Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X -  +1.00 +2.00 -3.00 +1.00  (X-  ) = 0.00 (X -  ) 2 1.00 4.00 9.00 1.00  (X-  ) 2 = SS = 16.00  2 = SS/N = 3.20  = = 1.79

19 But when you predict that everyone will score at the mean, you will be wrong. In fact, it is often the case that no one will score precisely at the mean. zIn statistics, we don’t expect our predictions to be precisely right. zWe want to make predictions that are wrong in a particular way. zWe want our predictions to be as close to the high scores as to the low scores in the population. zThe mean is the only number that is an unbiased predictor, it is the only number around which deviations sum to zero.

20 We want to be wrong by the least amount possible zIn statistics, we consider error to be the squared distance between a prediction and the actual score. zThe mean is the least average squared distance from all the scores in the population. zThe number that is the least average squared distance from the scores in the population is the prediction that is least wrong, the least in error. zThus, saying that everyone will score at the mean (even if no one does!) is the prediction that gives you the smallest amount of error.

21 So the mean is the best prediction of everyones’ score because it is a least squares, unbiased predictor for all the scores in the population.

22 Why doesn’t everyone score right at the mean? zSources of Error yIndividual differences yMeasurement problems If we predict that everyone will score right at the mean, how much error do you make on the average? To find out, find the distance of each score from the mean, square that distance and divide by the number of scores to find the average error. WHOOPS: THAT’S SIGMA 2. Mean square for error = variance

23 Questions and answers – the mean. zWHAT QUALITIES OF THE MEAN (MU) MAKE IT THE BEST PREDICTION YOU CAN MAKE OF WHERE EVERYONE WILL SCORE? zThe mean is an unbiased predictor or estimate, because the deviations around the mean always sum to zero. zThe mean is a least squares predictor because it is the smallest squared distance on the average from all the scores in the population.

24 Q & A: the mean zWHY WOULD YOU PREDICT THAT EVERYONE WILL SCORE AT THE MEAN WHEN, IN FACT, OFTEN NO ONE CAN POSSIBLY SCORE PRECISELY AT THE MEAN? zIn statistics, we don’t expect our predictions to be precisely right. zWe want to make predictions that are close and wrong in a particular way. zWe want least squares, unbiased predictors.

25 Q & A: The variance zWHAT ARE THE OTHER NAMES FOR THE VARIANCE? zSigma 2 and the mean square for error. zWHAT OTHER MEASURES OF VARIABILITY CAN BE EASILY COMPUTED ONCE YOU KNOW THE VARIANCE? zThe standard deviation and the standard error of the sample mean.

26 How do you compute zTHE VARIANCE? Find the distance of each score from the mean, square it, sum them up and divide by the number of scores in the population. zTHE STANDARD DEVIATION? Compute the square root of the variance. zTHE STANDARD ERROR OF THE SAMPLE MEAN? Divide the variance by n, the size of the sample, and then take a square root.


Download ppt "Chapter 1 The mean, the number of observations, the variance and the standard deviation."

Similar presentations


Ads by Google