

Sampling distributions:

In Psychology we generally make inferences about populations on the basis of samples. We therefore need to know what relationship exists between samples and populations. A population of scores has a mean (μ), a standard deviation (σ) and a shape (e.g., normal distribution).

Concrete but mundane example: the height of all adult women in England. [Figure: frequency distribution of raw scores, µ = 63 in., σ = 2 in.; x-axis: height (inches), marked at 59 and 67; y-axis: frequency of raw scores, low to high.]

[Figure: frequency distribution of raw scores for a sample of 100 adult women from England, with 64.2 in. marked; s = 2.5 in., N = 100; y-axis: frequency of raw scores, low to high.]

If we take repeated samples, each sample has a mean height, a standard deviation (s), and a shape. Due to random fluctuations, each sample is different - from other samples and from the parent population. Fortunately, these differences are predictable - and hence we can still use samples in order to make inferences about their parent populations.

Taking multiple samples (of a given size) gives rise to a sampling distribution. [Figure: sampling distribution of the mean; x-axis: mean sample heights (inches), centred on 63.0; y-axis: frequency (how many means have a given value).]

What are the properties of the distribution of sample means?
(a) The mean of the sample means is the same as the mean of the population of raw scores: $\mu_{\bar{X}} = \mu$.
(b) The sampling distribution of the mean also has a standard deviation. The standard deviation of the sample means is called the "standard error". It is NOT the same as the standard deviation of the population (of raw scores).

The standard error is smaller than the population s.d. (for any sample of more than one score): the bigger the sample size, the smaller the standard error, i.e., variation between samples decreases as sample size increases. (This is because producing a sample mean reduces the influence of any extreme raw scores in a sample.)

Suppose we take samples of N = 100. [Figure: sampling distribution for N = 100; x-axis: mean sample heights (inches), centred on µ = 63 in.; y-axis: frequency.]

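How do we calculate the standard deviation of a sampling distribution of the mean (known as the standard error)? Divide the population standard deviation by the square root of the sample size: $\sigma_{\bar{X}} = \sigma / \sqrt{N}$. For N = 100, with σ = 2 in.: $\sigma_{\bar{X}} = 2 / \sqrt{100} = 0.2$ in. Suppose N = 16 instead of 100: $\sigma_{\bar{X}} = 2 / \sqrt{16} = 0.5$ in.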
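The standard error formula can be checked numerically. The sketch below is illustrative only: it assumes the height population from the earlier example (µ = 63 in., σ = 2 in.) is normal, and the function names, the number of repeated samples and the random seed are arbitrary choices, not part of the original slides.

```python
import math
import random
import statistics


def standard_error(sigma, n):
    """Standard error of the mean: population s.d. divided by sqrt(N)."""
    return sigma / math.sqrt(n)


def simulated_standard_error(mu, sigma, n, n_samples=5000, seed=0):
    """Draw many samples of size n and return the s.d. of their means.

    Assumes a normal population; n_samples and seed are arbitrary.
    """
    rng = random.Random(seed)
    means = [
        statistics.fmean([rng.gauss(mu, sigma) for _ in range(n)])
        for _ in range(n_samples)
    ]
    return statistics.stdev(means)


se_100 = standard_error(2.0, 100)  # formula value for N = 100
se_16 = standard_error(2.0, 16)    # formula value for N = 16
sim_16 = simulated_standard_error(63.0, 2.0, 16)

print(f"formula, N=100: {se_100:.2f} in.")
print(f"formula, N=16:  {se_16:.2f} in.")
print(f"simulated, N=16: {sim_16:.2f} in.")
```

The simulated value should land close to the formula value for N = 16, and both are larger than the value for N = 100: smaller samples give more variable means.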

(c) The distribution of sample means is approximately normal, whatever the shape of the original distribution of raw scores in the population, provided the samples are large enough. Example: annual income of American citizens. This distribution is positively skewed: many people are in the lower and medium income brackets; very few are ultra rich. Suppose we take many samples of size N = 50. The sampling distribution of the mean will be approximately normal. This is due to the Central Limit Theorem. As a rule of thumb, the approximation is considered adequate for sample sizes of about 30 and greater; the more skewed the population, the larger the samples need to be.
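A small simulation can illustrate the Central Limit Theorem for a skewed population. This is a sketch under stated assumptions: an exponential distribution stands in for a positively skewed variable such as income (it is not real income data), and the helper `skewness`, the sample counts and the seed are hypothetical choices made for the illustration.

```python
import random
import statistics


def skewness(values):
    """Population skewness: the mean cubed z-score (0 for a symmetric shape)."""
    m = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return statistics.fmean([((v - m) / sd) ** 3 for v in values])


rng = random.Random(1)

# Raw scores from a positively skewed stand-in population (exponential).
raw = [rng.expovariate(1.0) for _ in range(20000)]

# Means of many samples of size N = 50 drawn from the same population.
means = [
    statistics.fmean([rng.expovariate(1.0) for _ in range(50)])
    for _ in range(4000)
]

raw_skew = skewness(raw)
means_skew = skewness(means)

print(f"skewness of raw scores:   {raw_skew:.2f}")
print(f"skewness of sample means: {means_skew:.2f}")
```

The raw scores are strongly skewed, while the sample means are much closer to symmetric. A little skew remains at N = 50, which is why the normal shape is best described as an approximation that improves with sample size.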

Given that the distribution is normal, we can use the properties of the normal distribution to do interesting things. Known proportions of scores fall within certain limits of the mean (i.e., about 68% fall within the range of the mean +/- 1 standard deviation; about 95% within +/- 2 standard deviations, etc.).
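These proportions can be computed from the standard normal distribution. The sketch below uses `statistics.NormalDist` from the Python standard library; the helper name `proportion_within` is a hypothetical choice for this illustration.

```python
from statistics import NormalDist

std_normal = NormalDist()  # mean 0, standard deviation 1


def proportion_within(k):
    """Proportion of a normal distribution within +/- k s.d. of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)


within_1 = proportion_within(1)
within_2 = proportion_within(2)

print(f"within +/- 1 s.d.: {within_1:.4f}")
print(f"within +/- 2 s.d.: {within_2:.4f}")
```

The figure for +/- 2 standard deviations is slightly above 95%, which is why the 68% and 95% values are usually quoted as approximations.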

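Using z-scores, we can represent a given score in terms of how different it is from the mean of the group of scores. Quick reminder of how to calculate a z-score: $z = (X_i - \mu) / \sigma$. For example, with $X_i = 64$, μ = 63 and the σ = 2 of the height example: $z = (64 - 63) / 2 = 0.5$.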

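Relationship of a sample mean to the population mean: We can do the same with sample means: (a) we obtain a particular sample mean; (b) we can represent this in terms of how different it is from the mean of its parent population, using the standard error in place of the standard deviation: $z = (\bar{X} - \mu) / \sigma_{\bar{X}}$. [Figure: sampling distribution of the mean centred on μ = 63, with a sample mean of 64 marked.]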
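The contrast between a raw score of 64 and a sample mean of 64 can be sketched in a few lines. The population values (μ = 63, σ = 2) come from the height example; the sample size N = 100 and the use of a one-tailed normal probability are assumptions made for this illustration, not something the slides specify at this point.

```python
import math
from statistics import NormalDist


def z_score(value, mean, sd):
    """How many standard deviations `value` lies from `mean`."""
    return (value - mean) / sd


mu, sigma = 63.0, 2.0  # population of raw scores (height example)

# A single raw score of 64 in., measured against the population s.d.
z_raw = z_score(64.0, mu, sigma)

# A sample mean of 64 in., measured against the standard error.
n = 100  # assumed sample size
standard_error = sigma / math.sqrt(n)
z_mean = z_score(64.0, mu, standard_error)

# Chance of a sample mean at least this far above mu, if the sample
# really comes from this population (upper tail of the normal curve).
p_upper = 1 - NormalDist().cdf(z_mean)

print(f"z for a raw score of 64:   {z_raw:.1f}")
print(f"z for a sample mean of 64: {z_mean:.1f}")
print(f"upper-tail probability:    {p_upper:.1e}")
```

A height of 64 in. is unremarkable for an individual, but under these assumptions a mean of 64 in. across 100 women would be an extremely rare outcome, because sample means vary far less than raw scores.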

If we obtain a sample mean that is much higher or lower than the population mean, there are two possible reasons: (a) our sample mean is a rare "fluke" (a quirk of sampling variation); (b) our sample has not come from the population we thought it did, but from some other, different, population. The greater the difference between the sample and population means, the more plausible (b) becomes.

Take another example: The mean I.Q. of the human population is 100. A random sample of people has a mean I.Q. of 170. There are two explanations: (a) the sample is a fluke: by chance our random sample contained a large number of highly intelligent people. (b) the sample does not come from the population we thought it did: our sample was actually from a different population - e.g., aliens masquerading as humans.

[Figure: relationship between population mean and sample mean. Frequency distribution of sample means, marking the population mean I.Q. (100) and the sample mean I.Q. (170); y-axis: frequency of sample means, low to high.]

This logic can be extended to the difference between two samples from the same population: A common experimental design: We compare two groups of people: an experimental group and a control group. Experimental group get a "wolfman" drug. Control group get a harmless placebo. Dependent Variable: number of dog-biscuits consumed. At the start of the experiment, they are two samples from the same population ("humans").

At the end of the experiment, are they: (a) still two samples from the same population (i.e., still two samples of "humans" - our experimental treatment has left them unchanged); OR (b) now samples from two different populations - one from the "population of humans" and one from the "population of wolfmen"?

We can decide between these alternatives as follows: The differences between any two sample means from the same population are approximately normally distributed, around a mean difference of zero. Most differences will be relatively small, since the sampling distribution of the mean tells us that most samples will have similar means to the population mean (and hence similar means to each other). If we obtain a very large difference between our sample means, it could have occurred by chance, but this is very unlikely - it is more likely that the two samples come from different populations.
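The behaviour of differences between two sample means from the same population can be simulated directly. This is a minimal sketch, assuming a normal population with the height example's values (μ = 63, σ = 2) and a hypothetical group size of N = 25; the function name, number of pairs and seed are arbitrary. The expression used for the standard deviation of the difference, $\sqrt{2}\,\sigma / \sqrt{N}$, is the standard result for two independent samples of equal size and is not stated in the slides.

```python
import math
import random
import statistics


def mean_differences(mu, sigma, n, n_pairs=4000, seed=2):
    """Differences (mean A minus mean B) for pairs of samples of size n
    drawn from the same normal population."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_pairs):
        mean_a = statistics.fmean([rng.gauss(mu, sigma) for _ in range(n)])
        mean_b = statistics.fmean([rng.gauss(mu, sigma) for _ in range(n)])
        diffs.append(mean_a - mean_b)
    return diffs


sigma, n = 2.0, 25  # n is an assumed group size
diffs = mean_differences(63.0, sigma, n)

# Theoretical s.d. of the difference between two independent sample means.
se_diff = math.sqrt(2) * sigma / math.sqrt(n)

mean_diff = statistics.fmean(diffs)
sd_diff = statistics.stdev(diffs)
# Share of differences larger than 2 standard errors in either direction.
share_large = sum(abs(d) > 2 * se_diff for d in diffs) / len(diffs)

print(f"mean difference:          {mean_diff:.3f}")
print(f"s.d. of differences:      {sd_diff:.3f} (theory {se_diff:.3f})")
print(f"share beyond 2 std. err.: {share_large:.3f}")
```

The differences cluster around zero and only a small share are large, which is the basis for treating a very large observed difference as evidence that the two samples no longer come from the same population.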

Possible differences between two sample means: [Figure: frequency distributions of raw scores for sample A and sample B, showing a big difference between the mean of sample A and the mean of sample B; x-axis: sample means, low to high; y-axis: frequency of raw scores, low to high.]

Possible differences between two sample means (cont.): [Figure: frequency distributions of raw scores for sample A and sample B, showing a small difference between the mean of sample A and the mean of sample B; x-axis: sample means, low to high; y-axis: frequency of raw scores, low to high.]

Frequency distribution of differences between sample means: [Figure: x-axis: size of difference between sample means ("sample mean A" minus "sample mean B"), running from "mean A smaller than mean B" through "no difference" to "mean A bigger than mean B"; y-axis: frequency of differences between sample means, low to high.]

Frequency distribution of differences between sample means: [Figure: the same distribution of differences, with a small difference between sample means marked near "no difference" (quite likely to occur by chance).]

Frequency distribution of differences between sample means: [Figure: the same distribution of differences, with a large difference between sample means marked (unlikely to occur by chance).]