Inferential statistics by example Maarten Buis Monday 2 January 2005.

Slides:



Advertisements
Similar presentations
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Advertisements

Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Inference Sampling distributions Hypothesis testing.
Topic 6: Introduction to Hypothesis Testing
Hypothesis testing Week 10 Lecture 2.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Stat Day 16 Observations (Topic 16 and Topic 14)
BCOR 1020 Business Statistics Lecture 22 – April 10, 2008.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Stat 217 – Day 15 Statistical Inference (Topics 17 and 18)
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
CHAPTER 23 Inference for Means.
Statistics for the Social Sciences
Descriptive Statistics: Maarten Buis Lecture 1: Central tendency, scales of measurement, and shapes of distributions.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Hypothesis Testing – Introduction
Mid-semester feedback In-class exercise. Chapter 8 Introduction to Hypothesis Testing.
8 - 1 © 2003 Pearson Prentice Hall Chi-Square (  2 ) Test of Variance.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Claims about a Population Mean when σ is Known Objective: test a claim.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
The Probability of a Type II Error and the Power of the Test
Inferential Statistics 2 Maarten Buis January 11, 2006.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 20 Testing Hypothesis about proportions
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 9 Testing a Claim
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Introduction to Hypothesis Testing
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Ex St 801 Statistical Methods Part 2 Inference about a Single Population Mean (HYP)
9.3 Hypothesis Tests for Population Proportions
CHAPTER 9 Testing a Claim
STA 291 Spring 2010 Lecture 18 Dustin Lueker.
CHAPTER 9 Testing a Claim
Hypothesis Testing – Introduction
CONCEPTS OF HYPOTHESIS TESTING
CHAPTER 9 Testing a Claim
Chapter 9: Hypothesis Tests Based on a Single Sample
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
STA 291 Spring 2008 Lecture 18 Dustin Lueker.
CHAPTER 9 Testing a Claim
STA 291 Summer 2008 Lecture 18 Dustin Lueker.
Chapter 9: Significance Testing
CHAPTER 9 Testing a Claim
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
CHAPTER 9 Testing a Claim
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Inferential statistics by example Maarten Buis Monday 2 January 2005

Two statistics courses Descriptive Statistics (McCall, part 1) Inferential Statistics (McCall, part 2 and 3)

Course Material McCall: Fundamental Statistics for Behavioral Sciences. SPSS (available from Surfspot.nl) Lectures: 2 x a week computer labs: 1 x a week. course website

setup of lectures Recap of material assumed to be known New Material Student Recap

How to pass this course Read assigned portions of McCall before each lecture Do the exercises Do the computer lab assignments, and hand them in before Tuesday 17:00! come to the computer lab come to the lectures ask questions: during class or to the course mailing list

What is inference? Drawing general conclusions from partial information Based on your observations some conclusions are more plausible than others. Compare with logic

Sources of uncertainty in inference Sample Measurement Model Typos when typing the data into SPSS Inference, as discussed here, assumes that random sampling error is by far the most dominant source of uncertainty.

How is inference done? If a null hypothesis is true than the probability of observing the data is so small that either we have drawn a very weird sample or the null hypothesis is false. (Ronald Fisher) We use a “good” procedure to choose between two hypotheses, whereby “good” means that you draw the right conclusion in 95% of the times you use that procedure. (Jerzy Neyman and Egon Pearson)

PrdV New populist party, wanted to participate in the next election if 41% of the Dutch population thought that “the PrdV would be an asset to Dutch politics”. This was asked to a sample of 2,598 people between, and on 16 December only 31% agreed. Peter R. de Vries decided not to participate in the next election.

The Inference Problem The 31% people approving is 31% of the people in the sample. Peter R. de Vries doesn’t care about what people in the sample think, he cares about what all the people in the Netherlands think. Could it be that he has drawn a “weird” sample, and that in the Netherlands 41% or more really think he would be an asset to Dutch politics?

Two hypotheses H 0 : 41% or more support PrdV H A : less than 41% support PrdV

A thought experiment (1) If support for PrdV in the Netherlands is 41% and we draw 100 random samples of 2598 persons, than we get 100 estimates of the support for PrdV, some of them a bit too high, some of them a bit too low. We would expect that 5 samples would show a support for PrdV of 39% or less. If we find a support for PrdV of 39% or less and reject H 0, than we have followed a procedure that would result in taking the right decision in 95% of the times we used that procedure.

What does that 39% mean? We propose the following procedure: If we find a support for PrdV of less than x% than reject H 0 We choose x in such a way that the probability of rejecting H 0 when we shouldn’t is only 5% The reason for mistakenly rejecting H 0 is drawing a ‘weird’ sample.

Where does that 39% come from? If H 0 is true, than we draw a sample from a population in which the support for PrdV is 41% We can let the computer draw many (100,000) samples and calculate the mean in each sample. 50,000 or 5% of these samples have a mean of 39% or less. So if we reject H 0 when we find a support of 39% or less, than the probability of making a mistake is 5%

Where did that 39% come from? If we draw many random samples, and compute the mean in each sample, than the distribution of these means will be approximately normally distributed with a mean of.41 and a standard deviation of Remember that the sample size is 2598, and the SD of a proportion is, so the Standard Deviation of the distribution of means is 5% of the samples has a support for PrdV of less than 39%

Neyman Pearson hypothesis testing This procedure is the Neyman Pearson hypothesis testing approach Note that it tells us something quality of the procedure we use to make a decision, not about the strength of evidence against H 0

Thought experiment (2) If the H 0 is true, than the probability of drawing a sample of size 2598 with a support for PrdV of 31% or less is x This is so small that we think it is safe to reject H 0.

Where did that x come from? In the 100,000 samples that were drawn from the population if H 0 were true none were lees than.31% So the probability of drawing this or a more extreme sample when H 0 is true is less than 1/100,000. Remember that if H 0 is true, the distribution of means obtained from many samples is normal with a mean of.41 and a standard deviation of.0096 The proportion of samples with a mean less than.31 is x

Fisher hypothesis testing This procedure is Fisher hypothesis testing. Note that it gives us a measure of evidence against H 0, but it does not give us an indication of how likely we are to make the wrong decision.

Fisher vs. Neyman Pearson You will draw the same conclusion whichever method you use. However, it really helps to choose one approach when writing your results down.

Limits to inference More importantly, both assume random sampling, and we almost never have that. Testing is more helpful to determine whether the data is ‘screaming’ or whispering’ at us. Knowing the reasoning behind statistical inference will help you determine the weight you should assign to conclusions derived from statistical tests.

Terminology (1) Distribution means obtained from different samples is the sampling distribution of the mean. The standard deviation of the sampling distribution is the standard error. Proportion of samples that wrongly reject the H 0 is the significance level or  or Type I error rate. Proportion of samples that wrongly fail to reject H 0 is the Type II error rate or . Proportion of samples that will rightly reject H 0 is the power.

Terminology (2) The probability of the data given that H 0 is true is the p-value. Maximum p-value that will cause you to reject H 0 is also the level of significance.

What to do before Wednesday? Read Chapter 8 Do exercises of chapter 8