
Design and Data Analysis in Psychology I English group (A) Salvador Chacón Moscoso Susana Sanduvete Chaves Milagrosa Sánchez Martín School of Psychology.


1 Design and Data Analysis in Psychology I. English group (A). Salvador Chacón Moscoso, Susana Sanduvete Chaves, Milagrosa Sánchez Martín. School of Psychology, Dpt. Experimental Psychology.

2 Lesson 5. Sampling and sampling distribution

3 1. Introduction. Statistical inference has two branches. Estimation theory (lesson 6): given an index in the sample, the aim is to infer the value of the corresponding index in the population. There are two kinds of estimation: point estimation, which provides a single value, and interval estimation, which provides a range of values. Decision theory (lesson 8): a procedure for making decisions in the field of statistical inference.

4 1. Introduction. [Diagram relating estimation theory, statistics and parameters.]

5 2. Phases of the inferential process. 1. Obtain a random sample. 2. Calculate the statistics (indexes in the sample). 3. Construct a sampling distribution (of means or proportions: the possible results that can be found by taking different samples). 4. Choose a probability model (e.g., if we roll a die, there are six possible results, and they are equiprobable). The most widely used model in psychology is the normal distribution. 5. Estimate the corresponding parameters (indexes in the population) from the statistics.

6 3. Sampling error. How close the value of the statistic is to the value of the parameter depends on how representative the sample is. For example, it depends on the sample size, the similarity or difference between participants, and the sampling procedure. Nevertheless, there will always be some discrepancy between statistic and parameter. This is the sampling error. Solution: the precise value of the sampling error is unknown, but inference lets us state with a certain confidence that this error does not exceed a given limit.

7 3. Sampling error. Calculation. Sample: statistics (Latin letters) X̄, p, S. Population: parameters (Greek letters) μ, π, σ.

8 3. Sampling error. The sampling error (e) is the difference between a statistic and its corresponding parameter.
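
As a sketch of this definition for the three statistic and parameter pairs used in the lesson (the sign convention, statistic minus parameter, is an assumption; some texts use the absolute value):

```latex
e_{\bar{X}} = \bar{X} - \mu, \qquad
e_{p} = p - \pi, \qquad
e_{S} = S - \sigma
```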

9 3. Sampling error. There are two main concepts related to the sampling error: 1. Accuracy: the precision with which a statistic represents the parameter. 2. Reliability: the constancy of a statistic when it is calculated for several samples of the same type and size.

10 3. Sampling error. Accuracy: example. Which estimator is more accurate? Parameter value: 50.

11 3. Sampling error. Accuracy: example. The estimator whose value is closest to the parameter is more accurate.

12 3. Sampling error. Reliability: example. Which group of means is more reliable?

13 3. Sampling error. Reliability: example. The first group of means is more reliable because the variation between them is lower.

14 3. Sampling error. The lower the sampling error, the more likely it is that the estimator in a sample matches the value of the parameter.

15 4. Sampling distribution. Definition: a theoretical probability distribution that relates each possible value of a statistic, computed on a sample of size n, to the probability of obtaining that value, across all the possible samples of size n drawn from a particular population. The construction of a sampling distribution has three phases:

16 4. Sampling distribution. PHASE 1. Collect all the samples of the same size n, extracted randomly from the population under study. [Diagram: samples S1, S2, S3, ..., Sk drawn from the population.]

17 4. Sampling distribution. PHASE 2. Calculate the same estimator in each sample (S1, S2, S3, ..., Sk). We will find different values of the estimator (e.g., the mean) in the different samples.

18 4. Sampling distribution. PHASE 3. Group these measures in a new distribution. [Diagram: distribution of the sample means, centered on the mean of means.]
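
The three phases can be sketched as a small simulation. This is only an illustration under stated assumptions: the population is hypothetical (10,000 simulated scores), and instead of collecting all possible samples of size n it draws k random samples, which approximates the theoretical sampling distribution. The function name `sampling_distribution_of_mean` is made up for this sketch.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is reproducible

# Hypothetical population (assumption): 10,000 scores from a normal model.
population = [random.gauss(50, 10) for _ in range(10_000)]


def sampling_distribution_of_mean(pop, n, k):
    """Phase 1: draw k random samples of size n.
    Phase 2: compute the mean of each sample.
    Phase 3: return the means grouped in a new distribution (a list)."""
    return [statistics.fmean(random.sample(pop, n)) for _ in range(k)]


n = 25
means = sampling_distribution_of_mean(population, n=n, k=2_000)

mean_of_means = statistics.fmean(means)          # should sit close to the population mean
empirical_se = statistics.pstdev(means)          # spread of the sample means
theoretical_se = statistics.pstdev(population) / n ** 0.5

print(round(statistics.fmean(population), 2), round(mean_of_means, 2))
print(round(empirical_se, 2), round(theoretical_se, 2))
```

The mean of the simulated means should land close to the population mean, and their standard deviation close to the population standard deviation divided by the square root of n.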

19 4. Sampling distribution. In general, the sampling distribution will differ from the distribution of the population. The variance of the statistic measures the dispersion of the particular sample values around the expected value of the statistic, considering all the possible samples of size n. The standard deviation of the sampling distribution is called the standard error of the estimator. We are only going to study the sampling distribution of two statistics: 4.1. The mean. 4.2. The proportion.

20 4.1. Sampling distribution of the mean. Mean or expected value; standard error.
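
A sketch of the standard results these two labels refer to, assuming a known population standard deviation σ and an infinite population (or sampling with replacement):

```latex
E(\bar{X}) = \mu, \qquad
\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}
```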

21 4.1. Sampling distribution of the mean. [Figure: distribution of the population and sampling distribution.]

22 4.1. Sampling distribution of the mean. Characteristics. 1. The statistics obtained in the samples are grouped around the population parameter. 2. The bigger n is, the closer the statistics are to the parameter. 3. In large samples, the graphic representation has the following characteristics:

23 4.1. Sampling distribution of the mean. Characteristics. a) It is symmetric. The central vertical axis is the parameter μ. b) The bigger n is, the narrower the bell-shaped curve is. c) It takes the form of the normal curve.

24 4.1. Sampling distribution of the mean. Characteristics. 4. Its mean matches the true mean of the population. 5. It is more or less variable. If its variability is small (i.e., it has a small sigma, that is, a small standard error), the means differ little from each other, and the estimator is very reliable.

25 4.1. Sampling distribution of the mean. Standardization. [Figure: sample, sampling distribution and population.]

26 4.1. Sampling distribution of the mean. Standardization. Standardization makes it possible to calculate probabilities, provided the probability model of the distribution is known. The sampling distribution of the mean can be treated as normal when n ≥ 30.
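
As a sketch of what standardization means here, assuming the usual Z notation and a known σ: an individual raw score is standardized with the population standard deviation, while a sample mean is standardized with the standard error of the mean.

```latex
Z = \frac{X - \mu}{\sigma}
\quad \text{(individual scores)}, \qquad
Z = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}}
\quad \text{(sample means)}
```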

27 4.1. Sampling distribution of the mean. [Table: formulas for means, based on σ or based on S, for N = ∞ and for N ≠ ∞ (with correction).]
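
The table's formulas are not reproduced here. As a hedged sketch, the textbook standard error based on σ, with the usual finite population correction for a population of size N, is shown below; it is an assumption that this is the correction the table refers to, and the variant based on S depends on how S is defined, so it is not shown.

```latex
\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}
\quad (N = \infty), \qquad
\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} \sqrt{\frac{N - n}{N - 1}}
\quad (N \neq \infty)
```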

28 4.1. Sampling distribution of the mean. Example 1. We applied a test to a population and obtained a mean (μ) of 18 points and a standard deviation (σ) of 3 points. Assuming that the variable is normally distributed in the population: a) Which raw scores delimit the central 95% of the participants of that population? b) Which raw scores delimit the central 99% of the mean scores in random samples of 225 participants?

29 4.1. Sampling distribution of the mean. Example 1. a) Which raw scores delimit the central 95% of the participants of that population? The central 95% leaves an area of 0.475 on each side of the mean, so Z1 = -1.96 and Z2 = 1.96.

32 4.1. Sampling distribution of the mean. Example 1. X1 = 12.12, X2 = 23.88. The raw scores that delimit the central 95% of the participants are 12.12 and 23.88.
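
Part a) can be checked with a short sketch using Python's standard library. It assumes the transformation X = μ + Zσ for individual scores; the variable names are illustrative.

```python
from statistics import NormalDist

mu, sigma = 18, 3      # population mean and standard deviation from the example
central = 0.95         # central area to delimit

# Z value that leaves (1 - central) / 2 in each tail (about 1.96 for 95%).
z = NormalDist().inv_cdf(0.5 + central / 2)

lower = mu - z * sigma
upper = mu + z * sigma
print(round(z, 2), round(lower, 2), round(upper, 2))  # 1.96 12.12 23.88
```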

33 4.1. Sampling distribution of the mean. Example 1. b) Which raw scores delimit the central 99% of the mean scores in random samples of 225 participants? The central 99% leaves an area of 0.495 on each side of the mean, so Z1 = -2.58 and Z2 = 2.58.

36 4.1. Sampling distribution of the mean. Example 1. The scores 17.484 and 18.516 delimit the central 99% of the mean scores in samples of 225 participants.
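
A sketch of part b), assuming the limits are μ ± Z·σ/√n and using the table value Z = 2.58 shown on the slide (the exact quantile, about 2.576, would give roughly 17.485 and 18.515):

```python
from math import sqrt

mu, sigma, n = 18, 3, 225
z = 2.58                            # table value used on the slide for the central 99%

standard_error = sigma / sqrt(n)    # 3 / 15 = 0.2
lower = mu - z * standard_error
upper = mu + z * standard_error
print(round(lower, 3), round(upper, 3))  # 17.484 18.516
```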

37 4.1. Sampling distribution of the mean. Example 2. Calculate the probability of drawing a sample of 81 participants with a mean equal to or lower than 42 from a population whose mean (μ) is 40 and whose standard deviation (σ) is 9.

38 4.1. Sampling distribution of the mean. Example 2. [Figure: normal curve with one area of 0.5 and one unknown area marked "?".]
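
The worked solution is not part of the transcript. As a sketch, assuming the sample mean is standardized with the standard error σ/√n, the probability can be computed as follows:

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 40, 9, 81
sample_mean = 42

standard_error = sigma / sqrt(n)            # 9 / 9 = 1
z = (sample_mean - mu) / standard_error     # 2.0
probability = NormalDist().cdf(z)           # P(sample mean <= 42)
print(round(probability, 4))                # 0.9772
```

The area splits into the 0.5 below the mean plus about 0.4772 between the mean and Z = 2.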

39 4.1. Sampling distribution of the mean. Example 3. In a sampling distribution of means with samples of 49 participants, the means of the central 90% of the samples are between 47 and 53 points. Calculate: a) The raw scores that delimit the central 95% of the means. b) The standard deviation of the population (σ). c) The raw scores that delimit the central 95% of the means, when the sample size is 81.

40 4.1. Sampling distribution of the mean. Example 3. a) The raw scores that delimit the central 95% of the means. [Figure: central 90% of the curve, 0.45 on each side of the mean.]

41 4.1. Sampling distribution of the mean. Example 3. [Figure: central 95% of the curve, 0.475 on each side of the mean; Z1 = -1.96, Z2 = 1.96.]

42 4.1. Sampling distribution of the mean. Example 3. b) The standard deviation of the population (σ).

43 4.1. Sampling distribution of the mean. Example 3. c) The raw scores that delimit the central 95% of the means, when the sample size is 81. [Figure: central 95% of the curve, between Z = -1.96 and Z = 1.96.]
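
The numerical solutions to Example 3 are not in the transcript. The following is a sketch of one way to work through the three parts, assuming a symmetric normal sampling distribution and exact normal quantiles (a Z table rounded to two decimals gives slightly different figures):

```python
from math import sqrt
from statistics import NormalDist

std_normal = NormalDist()
n = 49
low90, high90 = 47, 53                       # limits of the central 90% of the means

center = (low90 + high90) / 2                # mean of the sampling distribution: 50
z90 = std_normal.inv_cdf(0.95)               # about 1.645
z95 = std_normal.inv_cdf(0.975)              # about 1.96

# Standard error for n = 49, recovered from the 90% limits.
se_49 = (high90 - center) / z90

# a) Limits of the central 95% of the means (n = 49).
limits_a = (center - z95 * se_49, center + z95 * se_49)

# b) Population standard deviation: sigma = standard error * sqrt(n).
sigma = se_49 * sqrt(n)

# c) Limits of the central 95% of the means when n = 81.
se_81 = sigma / sqrt(81)
limits_c = (center - z95 * se_81, center + z95 * se_81)

print([round(v, 2) for v in limits_a], round(sigma, 2), [round(v, 2) for v in limits_c])
```

Under these assumptions the results are roughly 46.43 and 53.57 for a), a σ of about 12.77 for b), and roughly 47.22 and 52.78 for c).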

45 4.2. Sampling distribution of proportions. p = x/n, where x is the number of participants who present a characteristic and n is the sample size. The sampling distribution can be treated as normal when nπ ≥ 5 and n(1 - π) ≥ 5.

46 4.2. Sampling distribution of proportions. Mean or expected value; standard error.
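
A sketch of the standard results for a sample proportion p, assuming an infinite population and writing π for the population proportion, together with the corresponding standardization:

```latex
E(p) = \pi, \qquad
\sigma_{p} = \sqrt{\frac{\pi (1 - \pi)}{n}}, \qquad
Z = \frac{p - \pi}{\sigma_{p}}
```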

47 4.2. Sampling distribution of proportions. Standardization

48 4.2. Sampling distribution of proportions. [Table: formulas for proportions, based on σ or based on S, for N = ∞ and for N ≠ ∞ (with correction).]

49 4.2. Sampling distribution of proportions. Example 1. In a population, the proportion of smokers is 0.60. If we draw a sample of n = 200 from this population, what is the probability of finding 130 or fewer smokers in that sample?

50 4.2. Sampling distribution of proportions. Example 1. Can we treat these data as normally distributed?
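
The worked solution is not in the transcript. As a sketch, the normality condition from this lesson and the requested probability can be checked as follows, assuming the normal approximation without a continuity correction:

```python
from math import sqrt
from statistics import NormalDist

pi_, n, x = 0.60, 200, 130    # population proportion, sample size, number of smokers

# Normality condition from the lesson: n*pi >= 5 and n*(1 - pi) >= 5.
normal_ok = n * pi_ >= 5 and n * (1 - pi_) >= 5

p = x / n                                     # sample proportion: 0.65
standard_error = sqrt(pi_ * (1 - pi_) / n)    # about 0.0346
z = (p - pi_) / standard_error                # about 1.44
probability = NormalDist().cdf(z)             # P(p <= 0.65)
print(normal_ok, round(z, 2), round(probability, 2))
```

Here nπ = 120 and n(1 - π) = 80, so the condition holds, and the probability comes out at about 0.93.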

52 4.2. Sampling distribution of proportions. Example 2. In a presidential election, a candidate obtained 45% of the votes. If you randomly and independently selected a sample of 100 voters, what is the probability that more than 50% of them voted for the candidate?
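
The worked solution is not in the transcript. A sketch under the same assumptions as before (normal approximation, no continuity correction):

```python
from math import sqrt
from statistics import NormalDist

pi_, n = 0.45, 100            # population proportion of votes, sample size
threshold = 0.50

standard_error = sqrt(pi_ * (1 - pi_) / n)    # about 0.0497
z = (threshold - pi_) / standard_error        # about 1.005
probability = 1 - NormalDist().cdf(z)         # P(p > 0.50)
print(round(z, 3), round(probability, 3))
```

Under these assumptions the probability is about 0.157.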

54 4.2. Sampling distribution of proportions. Example 3. 30% of the students in Seville passed a particular test. Drawing samples of 100 students from this population, calculate: a) The values that delimit the central 99% of the proportions of these samples. b) The percentage of samples in which the proportion of students who passed the test is equal to or higher than 0.35.

55 4.2. Sampling distribution of proportions. Example 3. a) Calculate the values that delimit the central 99% of the proportions of these samples. [Figure: central 99% of the curve, 0.495 on each side of the mean, between Z = -2.58 and Z = 2.58.]

57 4.2. Sampling distribution of proportions. Example 3. b) The percentage of samples in which the proportion of students who passed the test is equal to or higher than 0.35. 13.35% of the samples have a proportion equal to or higher than 0.35.
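
A sketch of both parts of Example 3, assuming the normal approximation and the table value Z = 2.58 for part a). Note that with an unrounded standard error part b) comes out at about 13.8%; the 13.35% on the slide is consistent with rounding the standard error to 0.045 (Z = 1.11), which is an assumption about how the slide was computed.

```python
from math import sqrt
from statistics import NormalDist

pi_, n = 0.30, 100
standard_error = sqrt(pi_ * (1 - pi_) / n)     # about 0.0458

# a) Limits of the central 99% of the sample proportions (table value Z = 2.58).
z99 = 2.58
lower = pi_ - z99 * standard_error
upper = pi_ + z99 * standard_error

# b) Share of samples with a proportion of 0.35 or higher.
z_b = (0.35 - pi_) / standard_error            # about 1.09
share_b = 1 - NormalDist().cdf(z_b)

# Same calculation with the standard error rounded to 0.045 (assumed slide rounding).
z_rounded = round((0.35 - pi_) / 0.045, 2)     # 1.11
share_rounded = 1 - NormalDist().cdf(z_rounded)

print(round(lower, 3), round(upper, 3), round(share_b, 4), round(share_rounded, 4))
```

Under these assumptions the limits in a) are roughly 0.182 and 0.418.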

