The problem of sampling error It is often the case—especially when making point predictions—that what we observe differs from what our theory predicts.

Slides:



Advertisements
Similar presentations
Testing Theories: Three Reasons Why Data Might not Match the Theory.
Advertisements

1 G Lect 2a G Lecture 2a Thinking about variability Samples and variability Null hypothesis testing.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Statistics: Purpose, Approach, Method. The Basic Approach The basic principle behind the use of statistical tests of significance can be stated as: Compare.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Chapter 10: Hypothesis Testing
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
t scores and confidence intervals using the t distribution
Sample size computations Petter Mostad
Point and Confidence Interval Estimation of a Population Proportion, p
Topic 2: Statistical Concepts and Market Returns
Don’t spam class lists!!!. Farshad has prepared a suggested format for you final project. It will be on the web
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Parameters and Statistics Probabilities The Binomial Probability Test.
Discrete Probability Distributions
Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Today Concepts underlying inferential statistics
T scores and confidence intervals using the t distribution.
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Inferential Statistics
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Hypothesis Testing II The Two-Sample Case.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Theory testing Part of what differentiates science from non-science is the process of theory testing. When a theory has been articulated carefully, it.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in heads half the time (i.e.,
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Testing Theories: Three Reasons Why Data Might not Match the Theory Psych 437.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Exam 1 Median: 74 Quartiles: 68, 84 Interquartile range: 16 Mean: 74.9 Standard deviation: 12.5 z = -1: 62.4 z = -1: 87.4 z = -1z = +1 Worst Question:
Inferential Statistics 2 Maarten Buis January 11, 2006.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Associate Professor Arthur Dryver, PhD School of Business Administration, NIDA url:
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Testing Hypotheses about Differences among Several Means.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Testing Theories: The Problem of Sampling Error. The problem of sampling error It is often the case—especially when making point predictions—that what.
Approximate letter grade assignments ~ D C B 85 & up A.
Inferential Statistics A Closer Look. Analyze Phase2 Nature of Inference in·fer·ence (n.) “The act or process of deriving logical conclusions from premises.
1 Since everything is a reflection of our minds, everything can be changed by our minds.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Experimental Psychology PSY 433 Appendix B Statistics.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Inference: Probabilities and Distributions Feb , 2012.
© Copyright McGraw-Hill 2004
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Please hand in homework on Law of Large Numbers Dan Gilbert “Stumbling on Happiness”
SPSS Problem and slides Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Introduction to Hypothesis Testing: The Binomial Test
Introduction to Hypothesis Testing: The Binomial Test
Randomization Tests PSU /2/14.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Introduction to Hypothesis Testing: The Binomial Test
Making Data-Based Decisions
Introducing Hypothesis Tests
Presentation transcript:

The problem of sampling error It is often the case—especially when making point predictions—that what we observe differs from what our theory predicts. There are at least three reasons why we may observe something other than what our theory predicts. –the theory is incomplete or dead wrong –the data are imprecise –sampling error Today we’re going to spend some time discussing the problem of sampling error, and some of the ways in which psychologists deal with it in their research.

Sampling error Sampling error occurs in research because we’re working with a subset of the population of interest. –Sometimes the population is sociological (e.g., US citizens) –Sometimes it is an abstract-mathematical value (e.g., p =.5 coin flip). In either case, it is necessary to know how much sampling error may be present in our data, and to take those errors into account when empirically evaluating our theories.

The problem Why is sampling error a problem in psychological research?

The problem Why is sampling error a problem in psychological research? Imagine the following research scenario –We are interested in the extra-sensory perception (ESP), and we are trying to determine whether people can correctly guess which of four cards someone is thinking about. –Our sample of 25 people gets 28% of their guesses correct. –Is this evidence for extra-sensory abilities?

The problem It is important to note that, even if people lack ESP and are simply guessing, sometimes they’ll be correct. In fact, we expect them to be correct 25% of the time, on average. More importantly, sometimes people will do better than 25%--even if they are simply guessing.

The problem How often will this occur? What distribution of correct guesses should we expect if, in fact, the probability of a correct guess is 25% (p =.25) and we test 23 people? binomial theorem: gives us the expected number of “hits” (a sample value) for a given number of trials and a given probability (a population value) for getting a hit.

sampling distribution

How likely are the various outcomes? In this case we see that it would not be unusual for people to be right 30% of the time, even if they were simply guessing. In fact, in this situation, approximately 95% of the time we would expect people to be right between 9% and 43% of the time.

Dealing with sampling error How can we deal with the problem of sampling error in psychological research? The most straight-forward way of controlling sampling error is by using large sample sizes.

In this scenario, the sample size is 20. Notice that there is a lot of variation in the sample correlations. In this scenario, the sample size is 200. Notice that there is much less of variation in the sample correlations. population r =.20 ~95% fall between -.25 and.60~95% fall between.06 and.33

Sampling distributions and point predictions It is often necessary to take into account sampling distributions when testing point predictions. For example, if our theory predicts that the correlation between television viewing and obesity should be zero when we hold exercise constant, we know that we might not observe a correlation of exactly zero due to sampling error--even if our theory is correct.

Sampling distributions and point predictions By constructing a sampling distribution, we can determine what range of correlations we’re likely to observe. –If the correlation we observe in our study is within the range that we might expect given sampling error, then we might not count this as a problem for the theory. –If, however, the correlation is much bigger than what we might expect given sampling error, we might count it against the theory.

Correlation Density If we were to observe a correlation of.30, we may begin to question our theory since, if our theory were true, we would expect to observe correlations falling roughly between -.20 and.20.

Sampling distributions and directional predictions When our theory makes a directional prediction, it is very difficult to generate the appropriate sampling distribution. –If ESP works, the population value of p could be anywhere between.26 and As a result, when a directional prediction is made, researchers often test what is sometimes called the “null hypothesis”—the hypothesis that there is no association between variables (r = 0, mean difference = 0, p =.25, etc.)

Significance tests Basic logic of a null-hypothesis significance test: –The theoretically derived hypothesis (people have ESP abilities) is that p .25. (directional) –The null hypothesis (people don’t have ESP abilities) is that p =.25. (mutually exclusive) –If we can show that the proportion correct that we observe in our study is unlikely if the null hypothesis is true, then that indirectly lends credence to the theoretical hypothesis.

Additional example We assume that a certain drug may improve people’s memory We randomly assign people to a drug condition or a placebo condition –theoretical hypothesis (directional): the drug group will have better memory performance than the other group (M drug – M no drug > 0) –null hypothesis (point prediction): the drug group will have the same performance as the control group (M drug – M no drug = 0) –If we can show that the observed difference is too large to be explained by the null hypothesis, then that may be taken as indirect support for our theoretical hypothesis.

Significance tests Some commonly used null-hypothesis significance tests –t-tests –analysis of variance (ANOVA) –z-tests –chi-squares The use of null-hypothesis significance tests is very common in psychology, but hotly debated.

Summary Any time we draw a sample from a population, there is likely to be some degree of sampling error present. Sampling error can interfere with our ability to test a theory because it will lead the values we observe to deviate from those expected even if the theory is true. One way to handle sampling error is to minimize it by using large sample sizes. When researchers cannot make point predictions, they often use significance tests and test the null hypothesis instead.