Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 3 Fundamental statistical characteristics I: Measures of central tendency.

Similar presentations


Presentation on theme: "Chapter 3 Fundamental statistical characteristics I: Measures of central tendency."— Presentation transcript:

1 Chapter 3 Fundamental statistical characteristics I: Measures of central tendency

2 2 Fundamental statistical characteristics Group indexes  Central tendency  Variability (Dispersion)  Bias (Asymmetry)  Skewness (Kurtosis) Individual indexes  Position Centiles (C i ) Percentiles (P i ) Quartiles (Q i )  Raw scores (X i )  Differentials scores (x i )  Standard scores (Z i )

3 3 Which value represents the whole? Around which value are the majority of the data? Central tendency indexes ModeMeanMedian

4 4 How is the data arranged with respect to the distribution center? How far or together are the data from each other? Variability or dispersion indexes AtS2S2 C.VQS

5 5 How are the data arranged with respect to the rest? Are data piled at one end? Bias or Asymmetry indexes g1

6 6 Which form is the distribution? Is it flattened or sharp? Skewness or Kurtosis indexes g2

7 7 Central tendency indexes

8 8 To describe a data distribution we need at least two statistics:  1. One that reflects the central tendency: value which represents the whole. Value around which the majority of the data is placed.  2. Another that reflects the dispersion around this value, if the data are far apart or close together with respect to the central value.

9 9 Central tendency measure  Is a brief description of a mass of data, usually obtained from a sample.  Serves to describe, indirectly, the population from which the sample was extracted.  Representative sample; the average of their values will say a lot of the average we would get on the population they represent.

10 10 1. Which is the value most often repeated? Mo

11 11 Mode (Mo)  “The most often repeated value”.  “The value most frequently observed in a sample or population”.  “The variable value with the highest absolute frequency”.  It is symbolized by Mo (Fechner y Pearson)

12 12 Type I distributions: Small data set a) Unimodal distribution:  Data: [8 – 8 – 11 – 11 – 15 – 15 – 15 – 15 – 15 – 17 – 17 – 17 – 19 - 19] Mo = 15

13 13 b) Amodal distribution:  Data: [8 – 8 – 8 – 11 – 11 – 11 – 15 – 15 – 15 – 17 – 17 – 17 – 19 – 19 –19] Without mode

14 14 c) Bimodal distribution:  Data: [8 – 9 – 9 – 10 – 10 – 10 – 10 – 11 – 11 – 13 – 13 – 13 – 13 – 15] Mo 1 = 10 Mo 2 = 13

15 15 d) Multimodal distribution:  Data: [8 – 8 – 9 – 9 – 9 – 10 – 11 – 11 – 11 – 12 – 12 – 13 – 13 – 13 – 14 – 15 - 15] Mo 1 = 9 Mo 3 = 13 Mo 2 = 11

16 16 Type II distributions: Big data set XiXi fifi 123 25 1610 185 14 Frequency table a) Unimodal distribution: Mo = 14 MOST OFTEN REPEATED VALUE

17 17 XiXi fifi 10 45 87 2 6 b) Bimodal distribution: Mo 1 = 2 y Mo 2 = 6 MOST OFTEN REPEATED VALUES

18 18 Complete the table if you know that the modes are: -2, -1 y 5 and that f 3 = f 4 XiXi fifi fr i %i%i -25 0O,12 2 520 6

19 19 2. What is the average score in motivation?

20 20 Arithmetic mean  It is the central tendency index most commonly used  Definition: “It is the sum of all observed values divided by the total number of them”.

21 21 Type I distributions: Small data set  Example: The following are 10 numbers remembered by 10 children in a immediate memory task 6 – 5 – 4 – 7 – 5 – 7 – 8 – 6 – 7 - 8

22 22 6 – 5 – 4 – 7 – 5 – 7 – 8 – 6 – 7 - 8 4 5 67 8 6.3

23 23 In the following serie, the “center of gravity” is: 3 – 10 – 8 – 4 – 7 – 6 – 9 – 12 – 2 – 4

24 24 23456789101112 6.5

25 25 Type II distributions: Big data set MEAN FREQUENCY TABLE Possibility 1:

26 26 XiXi fifi fiXifiXi 033 * 00 166 * 16 277 * 214 333 * 39 411 * 44 Frequency tables

27 27 01 2 34 1.65

28 28

29 29

30 30 Clearances

31 31 XiXi fifi fr i fr i X i 030,150,00 160,30 270,350,70 330,150,45 410,050,20 Possibility 2 (derived from possibility 1) :

32 32 3. Which is the value exceeded by half of the subjects? Mdn

33 33 Median (Mdn)  Definitions: It is the distribution point that divides it into 2 equal parts. It is the value with the property that the number of observations smaller than itself is equal to the number of observations higher than itself. It is the value that holds the central point of an ordered series of data. 50% of the values are above and the other 50% is below the central value.

34 34 Graphic representation  It is defined as a point (a value), not like a data or particular measure. A point whose value does not necessarily have to match any observed values.

35 35 ODD data set: [7 – 11 – 6 – 5 – 7 – 12 – 9 – 8 – 10 – 6 – 9] 1º) Data is sorted from the lowest to the highest: [5 – 6 – 6 – 7 – 7 – 8 – 9 – 9 – 10 – 11 – 12] 2º) Central value is obtained: Type I distributions: Small data set

36 36 Mdn = 8 1º5 2º6 3º6 4º7 5º7 6º8 7º 9 8º9 9º10 10º11 11º12 [5 – 6 – 6 – 7 – 7 – 8 – 9 – 9 – 10 – 11 – 12]

37 37 EVEN data set: [23 – 35 – 43 – 29 – 34 – 41 – 33 – 38 – 38 – 32] 1º) Data is sorted from the lowest to the highest: [ 23 – 29 – 32 – 33 – 34 – 35 – 38 – 38 – 41 – 43] 2º)

38 38 1º2º3º4º5º6º7º8º9º10º 23293233343538 4143 [23 – 29 – 32 – 33 – 34 – 35 – 38 – 38 – 41 – 43]

39 39 Frequency tables Example:  n= 36  To be even, there are 2 central data  36/2=18. Central point between 18 and 19 (18’5)  x 18 =x 19 =10; x 18’5 =10 Type II distributions: Big data set XiXi fifi FiFi 233 369 5514 10620 151232 23436

40 40 Comparison between measures of central tendency  If there aren’t arguments against, we always prefer the mean: Other statistics are based on the mean. It's the best estimator of their parameter.  We prefer the median: When the variable is ordinal. When there exists very extreme data. When there exist open intervals.  We prefer the mode: When the variable is qualitative or nominal. When the open interval matches the median.

41 41 Degree of agreement to consider "shouting" as a sign of aggression XiXi fifi 13 26 35 46 512

42 42 Number of rituals that students do before an exam XiXi fifi 21 38 410 521 619 79 82 91

43 43 Position measures

44 44  Central tendency measures: used to indicate around which particular value a concrete data set is placed.  Position measures: used to provide information about the relative position in which a case is with respect to the data set which it belongs to. Are used to interpret specific data.

45 45 Quantiles MdnQuartilesPercentilesDeciles

46 46 Quantiles  Mdn: divides the distribution in 2 parts :  Quartiles (Q k ): divide the distribution in 4 parts : Q 1, Q 2, Q 3 : i/k = 1/4  Deciles (D k ): divide the distribution in 10 parts : D 1, D 2,..., D 9 : i/k = 1/10  Percentiles (P k ): divides the distribution in 100 parts : P 1, P 2,..., P 99 : i/k = 1/100  They divide the distribution in K parts with the same amount of data. i/k = ½

47 47 Quantiles graphic representation

48 48 Calculating the value that corresponds to a particular quantile  1. Translate the position measurement to an absolute position  2. Find out the value for the data that occupies the absolute position of our interest  The question is: What value takes the position...? QuantilPositionFiFi Value iF.A.X E + D(X E+1 - X E )

49 49  E.g. 7th decile corresponds to the position 20;  Which value is shown by the data that takes the absolute position 20?

50 50 XiXi fifi FiFi 233 31114 41529 53059 62382 71294 8498 9199 Example: Q 3

51 51 XiXi fifi FiFi Positions 2331 – 3 311144 – 14 4152915 – 29 5305930 – 59 6238260 – 82 7129483 – 94 849895 – 98 9199 Example: Q 3  1.  2. Value = 6

52 52 XiXi fifi FiFi 211 389 41019 52140 61959 7968 8270 9171 Example: D 6

53 53 XiXi fifi FiFi Positions 2111 3892 – 9 4101910 – 19 5214020 – 40 6195941 – 59 796860 – 68 827069 – 70 9171 Example: D 6  1.  2. Value = 6

54 54 XiXi fifi FiFi Positions 2111 3892 – 9 4101910 – 19 5214020 – 40 6195941 – 59 796860 – 68 827069 – 70 9171 Example: P 13  1.  2. Value = 3.36 (see next) INTERPOLATE

55 55 Interpolate  X E+D =X E + D(X E+1 - X E ) X E+D = Value which corresponds to the quantile X E = Entire position value D= Decimal part of the position X E+1 = Previous position value  P 13 = 3 + 0.36(4 - 3)=3.36 3892 – 9 4101910 – 19

56 56 1ºCentile i 2ºPosition i/k(n + 1) 3ºFiFi F.A. 4ºValue X E + D(X E+1 - X E ) PERCENTILE Percentile: It is the value that leaves beneath a percentage of the subjects, as passed by the percentage of the remaining subjects CALCULATION ABSTRACT:

57 57 Xfifi FiFi 130 21545 32570 4676 5480 PERCENTILES Examples C 50 = P 50 = Q 2 = D 5 = Mdn CentilePosition Cumulative Frequency Value 5040,5Between 40 and 412+0,5(2-2) = 2

58 58 Centile 25 75 38 90 95 Xfifi 130 215 325 46 54 Calculate the following centiles

59 59 CentilePosition Cumulative frenquency Value 2520,25Between 20 and 211+0,25(1-1) = 1 7560,75Between 60 and 613+0,75(3-3) = 3 3830,78Between 30 and 311+0,78(2-1) = 1,78 9072,90Between 72 and 734+0,90(4-4) = 4 9576,95Between 76 and 774+0,95(5-4) = 4,95

60 60 A very common example  In babies case, we use the percentile to evaluate the growth of babies. The most used are for the weight and size of baby.  Imagine Juanita’s case, our cousin.  What’s meaning that Juanita is in size 25 percentil? This means that for each 100 babies, 70 weighed more than her (24 weighed less than her).  What if she was in the high 80 percentile? This means that for each 100 babies, 20 taller than her (79 were smaller than her)

61 61 Complete the table, knowing that the distribution is bimodal and the mean and the median have the same value XiXi fifi FiFi 156 188 21 22236

62 62 WomenMan XiXi fifi XiXi fifi 1925204 15218 62410 2252612 251278 298 8 Calculate 84 percentile in the man’s sample and the 3 quartile in the women sample Results obtained from a sample of women and a man in the province of Seville in a psychological test are contained in the following tables:


Download ppt "Chapter 3 Fundamental statistical characteristics I: Measures of central tendency."

Similar presentations


Ads by Google