Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum.

Similar presentations


Presentation on theme: "Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum."— Presentation transcript:

1

2 Chapter 3 The Normal Curve

3 Where have we been?

4 To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum them (SS), divide by N (  2 ) and take a square root(  ). Example: Scores on a Psychology quiz Student John Jennifer Arthur Patrick Marie X78357X78357  X = 30 N = 5  = 6.00 X -  +1.00 +2.00 -3.00 +1.00  (X-  ) = 0.00 (X -  ) 2 1.00 4.00 9.00 1.00  (X-  ) 2 = SS = 16.00  2 = SS/N = 3.20  = = 1.79

5 The variance and standard deviation are numbers that describe how far, on the average, scores are from their mean, mu. zBut we often want additional detail about how scores will fall around their mean. zWe may also wish to theorize about how scores should fall around their mean.

6 Describing and theorizing about how scores fall around their mean. zFrequency distributions zStem and leaf displays zBar graphs and histograms zTheoretical frequency distributions

7 Frequency distributions # of acdnts 0 1 2 3 4 5 6 7 8 9 10 11 Absolute Frequency 117 157 158 115 78 44 21 7 6 1 3 1 708 Cumulative Frequency 117 274 432 547 625 669 690 697 703 704 707 708 Cumulative Relative Frequency.165.387.610.773.883.945.975.983.993.994.999 1.000 Cumulative frequencies show number of scores at or below each point. Calculate by adding all scores below each point. Cumulative relative frequencies show the proportion of scores at or below each point. Calculate by dividing cumulative frequencies by N at each point.

8 Stem and Leaf Display zReading time data Reading Time 2.9 2.8 2.7 2.6 2.5 Leaves 5,5,6,6,6,6,8,8,9 0,0,1,2,3,3,3 5,5,5,5,5,6,6,6,7,7,7,7,7,7,7,8,9,9,9,9 0,0,1,2,3,3,3,3,4,4,4 5,5,5,5,6,6,6,8,9,9 0,0,0,1,2,3,3,3,4,4 5,6,6,6 0,1,1,1,2,3,3,4 6,6,8,8,8,8,8,9,9,9 0,1,1,1,2,2,2,4,4,4,4 i =.05 #i = 10

9 Transition to Histograms 999977777776665555999977777776665555 988666655988666655 33321003332100 4443333210044433332100 99866655559986665555 44333210004433321000 66656665 4332111043321110 4444222111044442221110 99988888669998888866

10 Histogram of reading times 20 18 16 14 12 10 8 6 4 2 0 Reading Time (seconds) FrequencyFrequency

11 Normal Curve

12 Principles of theoretical frequency distributions zExpected frequency = Theoretical relative frequency X N zExpected frequencies are your best estimates because they are closer, on the average, than any other estimate when we square the error. zLaw of Large Numbers - The more observations that we have, the closer the relative frequencies should come to the theoretical distribution.

13 Using the theoretical frequency distribution known as the normal curve

14 The Normal Curve zDescribed mathematically by Gauss in 1851. So it is also called the “Gaussian”distribution. It looks something like a bell, so it is also called a “bell shaped” curve. zThe normal curve is a figural representation of a theoretical frequency distribution. zThe frequency distribution represented by the normal curve is symmetrical. yThe mean (mu) falls exactly in the middle. y68.26% of scores fall within 1 standard deviation of the mean. y95.44% of scores fall within 2 standard deviations of the mean. y99.74% of scores fall within 3 standard deviations of mu.

15 NOTE: Since the curve is symmetrical around the mean, whatever happens on one side of the curve is exactly mirrored on the other side. This also means that the mean, the median and the mode are all the same in a normal distribution

16 The normal curve and Z scores zThe normal curve is the theoretical relative frequency distribution that underlies most variables that are of interest to psychologists. zA Z score expresses the number of standard deviations that a score is above or below the mean in a normal distribution. zAny point on a normal curve can be referred to with a Z score

17 The Z table and the curve zThe Z table shows the normal curve in tabular form as a cumulative relative frequency distribution. zThat is, the Z table lists the proportion of a normal curve between the mean and points further and further from the mean. zThe Z table shows only the cumulative proportion in one half of the curve. The highest proportion possible on the Z table is therefore.5000

18 IMPORTANT CONCEPT: The proportion of the curve between any two points on the curve represents the theoretical relative frequency (TRF) of scores between those points.

19 Area of the curve between two points = proportion of scores between those points zFor example, if the area of the curve between two points is 46.32% of the curve, we would expect to find a proportion of.4632 of the scores between those two points.

20 With a little arithmetic, using the Z table, we can determine: The proportion of the curve above or below any Z score. Which equals the proportion of the scores we can expect to find above or below any Z score. The proportion of the curve between any two Z scores. Which equals the proportion of the scores we can expect to find between any two Z scores.

21 Normal Curve – Basic Geography FrequencyFrequency Measure The mean One standard deviation |--------------49.87-----------------|------------------49.87------------| |--------47.72----------|----------47.72--------| -3.00 -2.00 -1.00 0.00 1.00 2.00 3.00 Z scores |---34.13--|--34.13---| Percentages 3 2 1 0 1 2 3 Standard deviations

22 The z table The Z table contains pairs of columns: columns of Z scores coordinated with columns of proportions from mu to Z. The columns of proportions show the proportion of the scores that can be expected to lie between the mean and any other point on the curve. The Z table shows the cumulative relative frequencies for half the curve. Z Score 0.00 0.01 0.02 0.03 0.04. 1.960 2.576. 3.90 4.00 4.50 5.00 Proportion mu to Z.0000.0040.0080.0120.0160..4750.4950..49995.49997.499997.4999997

23 Another important concept: Most scores are close to the mean! So if you have two equal sized intervals, the one closer to mu contains a higher proportion of scores What proportion of scores falls in the interval between Zs of -.50 to +.50 (an interval of one standard deviation right around the mean)?.1915 +.1915 =.3830 (almost 40%) Note: this is the one-standard-deviation-wide interval with the highest proportion anywhere on the curve. Note that almost 40% of a population should score within half a standard deviation from the mean. Proportion =.1293 +.1293 =.2586Proportion =.1293 +.1293 =.2586

24 Intervals further from mu What proportion of scores falls in the interval between Zs of 0.00 to +1.00 (an interval of one standard deviation starting at the mean, but not right around it)? This one can be read directly from the table -.3413 (It is a little over a third) What proportion of scores falls in the interval between Zs of +.50 to +1.50 (an interval of one standard deviation a little further from the mean)?..4332 -.1915 =.2417 This time we are down to less than a quarter of the population.

25 Common Z scores – memorize these scores and proportions Z Proportion Score mu to Z 0.00.0000 3.00.4987 2.00.4772 1.00.3413 1.960.4750 2.576.4950 (x 2 = 99% between Z= –2.576 and Z= + 2.576) ( x 2 = 95% between Z= –1.960 and Z= +1.960)

26 Areas between two points on the curve

27 470 USING THE Z TABLE - Proportion of the scores between a specific Z score and the mean. FrequencyFrequency score. 3 2 1 0 1 2 3 Standard deviations Proportion mu to Z for -0.30 =.1179 Proportion score to mean =.1179

28 470 USING THE Z TABLE - Proportion of the scores in a population between two Z scores that are identical in size, but have opposite signs. FrequencyFrequency score. 3 2 1 0 1 2 3 Standard deviations Proportion mu to Z for -0.30 =.1179 Proportion between +Z and -Z =.1179 +.1179 =.2358 530

29 The critical values of the normal curve zCritical values of a distribution show which symmetrical interval around mu contains 95% and 99% of the curve. zIn the Z table, the critical values are starred and shown to three decimal places z95% (a proportion of.9500) is found between Z scores of –1.960 and +1.960 z99% (a proportion of.9900) is found between Z scores of –2.576 and +2.576

30 -1.06 USING THE Z TABLE - Proportion of scores between a two different Z scores on opposite sides of the mean. (ADD THE TWO PROPORTIONS!). FrequencyFrequency Percent between two scores. -3.00 -2.00 -1.00 0.00 1.00 2.00 3.00 Z scores +0.37 Proportion mu to Z for -1.06 =.3554 Proportion mu to Z for.37 =.1443 Area Area Add/Sub Total Per Z 1 Z 2 mu to Z 1 mu to Z 2 Z 1 to Z 2 Area Cent -1.06 +0.37.3554.1443 Add.4997 49.97 %

31 +1.50 USING THE Z TABLE - Proportion of scores between two Z scores on the same side of the mean. (Subtract the smaller proportion from the larger one.) FrequencyFrequency Percent between two scores. -3.00 -2.00 -1.00 0.00 1.00 2.00 3.00 Z scores +1.12 Proportion mu to Z for 1.12 =.3686 Proportion mu to Z for 1.50 =.4332 Area Area Add/Sub Total Per Z 1 Z 2 mu to Z 1 mu to Z 2 Z 1 to Z 2 Area Cent +1.50 +1.12.4332.3686 Sub.0646 6.46 %

32 Expected Frequencies

33 Obtaining expected frequencies (EF) from the normal curve. zBasic rule: To find an expected frequency, multiply the proportion of scores expected in the part of the curve by the total N. Expected frequency = theoretical relative frequency x N.

34 Expected frequencies are another least squared, unbiased prediction. Expected frequencies usually must be wrong, as they are routinely written to two decimal place. For example, it is impossible to actually find 65 hundreths of a score anywhere. So, expected frequencies are another set of least squared, unbiased predictions. Such predictions can be expected to be wrong, but close.

35 EF=TRF x N zIn the examples that I’ve solved that follow, let’s assume we have a population of size 300 (N=300) zTo find the expected frequency, compute the proportion of the curve between two specific Z scores, just as we have been doing. zThen multiply that proportion (also called the theoretical relative frequency or TRF) by N.

36 Expected frequency = theoretical relative frequency x number of participants (EF=TRF*N). TRF from mean to Z = -.30 =.1179. If N = 300: EF=.1179*300 = 35.37.. 470 FrequencyFrequency 3 2 1 0 1 2 3 Standard deviations Proportion mu to Z for -0.30 =.1179 EF=.1179x300 = 35.37

37 Expected frequencies below a specific Z score

38 EF below a score. zThis is the opposite of expected frequencies above a score. It is like asking the EF between your Z score and the entire half the curve (50% or.5000) that lies below the mean. If Z is above mu, TRF is between two Z scores on opposite sides of the mean. To get TRF, add half of the curve (.5000) to the area from mu to Z. To get EF, then multiply TRF by N. zIf Z is below mu, TRF is between two Z scores on the same side of the mean. To get TRF, subtract the area from mu to Z from half of the curve (.5000). To get EF, then multiply TRF by Z

39 If N = 300, what is the EF of scores below a Z of 1.00. zExpected frequency below a score: If Z is above mu, to get TRF, add half of the curve (.5000) to the area from mu to Z. TRF below Z = +1.00 is.3413 +.5000 =.8413.

40 If N = 300: EF=.8413 x 300 = 252.39. FrequencyFrequency inches Proportion =.5000 up to mean 3 2 1 0 1 2 3 Standard deviations +.3413 for 1 SD =.8413

41 Percentile rank

42 Percentile rank is the proportion of the population you score as well as or better than times 100. The proportion you score as well as or better than is shown by the part of the curve below (to the left of) your score.

43 Computing percentile rank yAbove the mean, add the proportion of the curve from mu to Z to.5000. yBelow the mean, subtract the proportion of the curve from mu to Z from.5000. yIn either case, then multiply by 100 and round to the nearest integer (if 1 st to 99 th ). yFor example, a Z score of –2.10 yProportion mu to Z =.4821 yProportion at or below Z =.5000 -.4821 =.0179 yPercentile =.0179 x 100 = 1.79 = 2 nd percentile

44 Percentile Rank is the percent of the population you score as well as or better = Theoretical Relative Frequency below your Z score times 100. What is the percentile rank of someone with a Z score of +1.00 FrequencyFrequency inches Percentile:.5000 up to mean 3 2 1 0 1 2 3 Standard deviations +.3413 =.8413.8413 x 100 =84.13 =84 th percentile

45 A rule about rounding percentile rank zBetween the 1 st and 99 th percentiles, you round off to the nearest integer. zBelow the first percentile and above the 99 th, use as many decimal places as necessary to express percentile rank. zFor example, someone who scores at Z=+1.00 is at the 100(.5000+.3413) = 84.13 = 84 th percentile. zAlternatively, someone who scores at Z=+3.00 is at the 100(.5000+.4987)=99.87= 99.87 th percentile. Above 99 and below 1, don’t round to integers. zWe never say that someone is at the 0 th or 100 th percentile.

46 Calculate percentiles Z Area Add to.5000 (if Z > 0) Proportion Percentile Score mu to Z Sub from.5000 (if Z < 0) at or below -2.22.4868.5000 -.4868.0132 1st -0.68.2517.5000 -.2517.2483 25th +2.10.4821.5000 +.4821.9821 98th +0.33.1293.5000 +.1293.6293 63rd +0.00.0000.5000 +-.0000.5000 50th

47 Below the 1 st percentile and above the 99 th : Don’t round! zWhat percentile are you at if your Z score is +3.04? zArea mu to Z =.4988. zSince Z is above the mean, add proportion mu to Z to.5000 zPercentile = (.4988+.5000)*100 = 99.88 zAbove 99 th percentile, DON”T ROUND! zThe answer is the 99.88 th percentile


Download ppt "Chapter 3 The Normal Curve Where have we been? To calculate SS, the variance, and the standard deviation: find the deviations from , square and sum."

Similar presentations


Ads by Google