STANDARD SCORES AND THE NORMAL DISTRIBUTION

Slides:



Advertisements
Similar presentations
Statistical Reasoning for everyday life
Advertisements

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Measures of Dispersion or Measures of Variability
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Calculating & Reporting Healthcare Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slides by JOHN LOUCKS St. Edward’s University.
1.2: Describing Distributions
Describing Distributions Numerically
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 4: The Normal Distribution and Z-Scores.
12.3 – Measures of Dispersion
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Measures of Central Tendency
Measures of Central Tendency
STAT 13 -Lecture 2 Lecture 2 Standardization, Normal distribution, Stem-leaf, histogram Standardization is a re-scaling technique, useful for conveying.
Basic Statistics Standard Scores and the Normal Distribution.
Statistics Used In Special Education
Objective To understand measures of central tendency and use them to analyze data.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Describing Data from One Variable
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Data Analysis and Statistics. When you have to interpret information, follow these steps: Understand the title of the graph Read the labels Analyze pictures.
Chapter 3 – Descriptive Statistics
1.3 Psychology Statistics AP Psychology Mr. Loomis.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation As A Ruler And The Normal Model.
Copyright © 2009 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
The Standard Deviation as a Ruler and the Normal Model
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Data and Variation.
Chapter 6 The Normal Distribution. 2 Chapter 6 The Normal Distribution Major Points Distributions and area Distributions and area The normal distribution.
Chapter 2 Modeling Distributions of Data Objectives SWBAT: 1)Find and interpret the percentile of an individual value within a distribution of data. 2)Find.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
Numerical descriptions of distributions
Normal Distributions (aka Bell Curves, Gaussians) Spring 2010.
Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Chapter 2 Describing and Presenting a Distribution of Scores.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Introduction We learned from last chapter that histogram can be used to summarize large amounts of data. We learned from last chapter that histogram can.
The Normal Approximation for Data. History The normal curve was discovered by Abraham de Moivre around Around 1870, the Belgian mathematician Adolph.
Chapter 3.3 – 3.4 Applications of the Standard Deviation and Measures of Relative Standing.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Making Sense of Statistics: A Conceptual Overview Sixth Edition PowerPoints by Pamela Pitman Brown, PhD, CPG Fred Pyrczak Pyrczak Publishing.
The Normal Distributions.  1. Always plot your data ◦ Usually a histogram or stemplot  2. Look for the overall pattern ◦ Shape, center, spread, deviations.
Exploratory Data Analysis
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
The Normal Distribution
Normal Distribution Z-distribution.
Chapter 1 Warm Up .
Summary (Week 1) Categorical vs. Quantitative Variables
Summary (Week 1) Categorical vs. Quantitative Variables
Presentation transcript:

STANDARD SCORES AND THE NORMAL DISTRIBUTION Handout #8

Test Scores Suppose you (together with many other students) take tests in three subjects. On each test, the range of possible scores runs from 0 to 100 points. The table below shows your score in each of the three subjects In which subject did you do best? In which subject did you do best relative to other students? The answer to the second question obviously depends on the overall frequency distribution of score.

Test Scores (cont.) Indeed, the picture looks different when we look at your scores relative to overall distribution of scores and, in particular, to its summary statistics. Let’s compare each of your scores to the mean score in each subject. Now what seems to be your strongest subject?

Your Deviations from the Mean While you got the highest score in ENGL, this score was actually slightly below (by 1 point) the mean. In fact, it is likely (but not certain) that you scored in the bottom half of all students taking the test. On the other hand, you scored well above average in both of the other subjects (10 points in Math and 15 in POLI). Note that the magnitudes that we have just referred to here are your deviations from mean in each subject. Since your deviation from the mean is greatest with respect to POLI, this may appear to be your strongest subject. But this may not be the case.

Your Deviations from the Mean Compared with Other Deviations from the Mean While you have a deviation from the mean in each subject, so does every other student who took the test. Consider the distribution of POLI scores. Almost certainly quite a few students scored close the mean, but probably quite a few others scored well above the mean (like you) and others well below. On the one hand, but if most students scored very close to the mean (so the dispersion in test scores is small), your score of 72 would make you an outlier, scoring higher than almost all other students. On the other hand, if many students scored well above the mean (and — since we know that the sum of all deviations from the mean must sum to zero — many other students scored well below the mean, so the dispersion of test scores is large), your score of 72, while certainly good, would be less outstanding.

Your Deviations Compared with Other Deviations (cont.) Thus whether your score is outstanding or merely good depends not just on your score compared with the mean score but also on your deviation from the mean compared with other deviations from the mean, i.e., the dispersion of scores. Recall from Handout #7 that the standard measure of dispersion — the standard deviation — itself is directly based on the deviations from the mean. Recall also that the SD of scores (though precisely defined as the square root of the average of all squared deviations) is approximately the same as (though usually somewhat greater than) the average of the absolute deviations from the mean.

Your Deviations from the Mean Compared with the Standard Deviation from the Mean Thus, to get a sense of how outstanding your POLI and MATH scores are, we should look at how big your deviation from the mean is compared with the standard (“average”) deviation from the mean, by calculating the ratio of your deviation to the standard deviation. The result of this calculation is called your standard score.

Standard Scores So in terms of your standard score, i.e., how your deviation from the mean compares with the standard deviation from the mean, it is evident that your best performance was actually in MATH (where you scored two standard deviations above the mean), compared with POLI (where you scored only one standard deviation above the mean). In ENGL you scored 1/8 of a standard deviation below the mean.

Other Variants of Standard Scores Approximately half of the people who take any test necessarily get negative standard scores. This unavoidable arithmetical fact apparently is regarded as demoralizing, so standard scores are commonly converted into so-called T-scores, which are all positive. By convention, T-scores are calculated by multiplying standard scores by 10 and then adding 50. In turn, SAT scores are equal to T-scores multiplied by 10. IQ scores are also derived from standard scores, calculated by multiplying standard scores by 15 and then adding 100. The table below shows how you performed in the three subjects in terms of each of these scoring systems (where, as is conventional, all derived scores have been rounded to the nearest whole point).

Your Percentile Rank in Each Subject? While it is extremely likely that your percentile rank among all students taking each test is highest in MATH and lowest in ENGL, we do not know this for sure in the absence of knowing the full frequency distribution of scores (as opposed to knowing only the two summary statistics: the mean and the SD). Much data — particularly including tests scores, many other interval measures, and many types of sample statistics — is (at least approximately) normally distributed. However, a lot of other data (especially ratio measures), such as weight, income (as we have seen), wealth, house prices, and many other ratio variables, is skewed with longer thin tails in the direction of (much) higher values while there is a zero-limit on the minimum value.

The Normal Distribution A normal distribution is a continuous frequency density that is a particular type of symmetric bell-shaped curve. Because the curve has a single peak and is symmetric about this peak, its mode, median, and mean values coincide at this peak. Most observed values lie relatively “close” (in way that is made more specific below) to the center of distribution, and their density falls off on either side of peak.

A Normal Curve

The Mean and SD of the Normal Distribution The mean of a normal distribution determines its location on the horizontal scale. The mean value of the distribution (here equal to the mode) is simply the value (point on the horizontal scale) of the variable that lies under the highest point on the curve. For example, if a constant amount is added to (or subtracted from) every value of the variable, the normal curve slides upwards (or downwards) by that constant amount. The standard deviation of a normal distribution determines how “spread out” the distribution is. Once the horizontal scale is fixed, if the SD is small, the curve has a high peak with sharp slopes on either side; if the SD is large, the curve it has a low peak with gentle slopes on either side.

Finding the SD of a Normal Curve There is a precise connection between the shape of a normal curve and its SD. The two points of maximum steepness on either side of the peak are called the inflection points of the (normal) curve. It turns out (as a mathematical theorem) that horizontal distance from the mean to each inflection point is identical to the standard deviation of the normal curve.

Finding the SD of a Normal Curve (cont.) Here is another method for eyeballing the magnitude of the SD of a normal distribution. Put two vertical lines on either side of, and equidistant from, the peak and then draw them apart or bring them closer together (keeping them equidistant from the peak) until it appears that just about two-thirds of the areas under the curve lies in the interval between the two vertical lines. The horizontal distance from the mean to either line is equal to (a very good approximation of) the standard deviation of the distribution.

The 68%-95%-99.7% Rule More generally, we can state what is called the [approximate] 68%-95%-99.7% rule of the normal distribution. The rule is this: about 68% of all observed values lie within one SD of the mean, about 95% lie within two SDs of the mean, and about 99.7% (that is, virtually all) lie within three SDs of the mean. This is why no SAT scores below 200 [3 standard deviations below the mean] or above 800 [3 standard deviations above the mean] are reported. And here is another useful rule: in a normal distribution, half the cases have observed values that lie within about 2/3 of the SD of the mean, i.e., the first and third quartiles lie at just about 2/3 of a SD below and above the mean respectively, so In a normal distribution, the interquartile range is equal to about 1.33 SDs. All this is illustrated in the following chart, which shows a standardized normal curve, i.e., a normal curve in which the mean is set at 0 and the SD is set at 1. Put otherwise, the units on the horizontal scale shows standard scores.

Standard Scores and Percentile Ranks If test scores are normally distributed, we know from the 68-95-99.7% and 50% rules the percentile ranks associated with the following standard scores [and SATs]: If your Standard Score is then your Percentile rank is about SAT -3 0.15 200 -2 2.5 300 -1 16 400 -0.67 25 433 0 50 500 +0.67 75 567 +1 84 600 +2 97.5 700 +3 99.85 800

Your Percentile Ranks (if Test Scores are Normally Distributed) Subject Stan. Score Percentile ENGL -0.125 45 MATH +2.0 97.5 POLI +1.0 84

Most scores are “mediocre” Note that, in a normal distribution, most cases are “packed” into a relatively narrow interval quite close to the mean. Therefore, in this range of “mediocrity” (literally, in the vicinity of the median), a small change in one’s score can produce a big change one’s percentile rank. For example, if you get a score of 460 when you first take the SAT and then get a score of 540 when you take it a second time, you have made a nice but not spectacular improvement (80 points), but it still jumps you from the 33rd percentile to the 67th (i.e., it jumps you over one-third of all SAT takers). But to jump above the remaining third of SAT takers still above you (i.e., to the 99th percentile or better), your score would have to go from 540 to 800 (260 points).

Complete Table of the Normal Distribution How do we know that an SAT score of 460 puts you at the 33rd percentile (and likewise for other scores)? You can integrate the “Gaussian equation” for the normal curve (see below) over the relevant range. You can look in a statistical table (or use an scientific calculator). You can use a statistical applet such as is found on the course webpage => X is value of variable; Y is height of the normal curve.

Normal Density Curve Applet