Stat 512 Day 9: Confidence Intervals (Ch 5) Open Stat 512 Java Applets page.


Last Time – Tests of Significance
0. Define the parameter of interest
1. Check technical conditions
2. State competing null and alternative hypotheses about the population parameter of interest (in symbols and in words)
   H0: parameter = value
   Ha: parameter <, >, or ≠ value (statement of research conjecture)
   parameter = population mean, μ; population proportion, π

PP
(a) Let π represent the proportion of the population who prefer to hear bad news first.
    H0: π = .5 (equally likely to prefer bad news as good news)
    Ha: π > .5 (majority of population prefer to hear bad news first)
(b) Let μ represent the average time (in hours) third and fourth graders spend watching television.
    H0: μ = 2 (spend 2 hours per day on average)
    Ha: μ > 2 (the population average is more than 2 hours per day)
Don’t forget the parameter!
H0: always the equality
Ha: reflects the actual research conjecture

PP
(c) Let π1 represent the proportion of all parolees in the population who receive a literature course who commit a crime within 30 months of parole. Let π2 represent the proportion of all parolees in the population without a literature course who commit a crime within 30 months of parole.
    H0: π1 = π2 (no difference in the rate at which these populations commit a subsequent crime)
    Ha: π1 ≠ π2 (there is a difference in the subsequent crime rate in these two populations)

Last Time – Tests of Significance
0. Define the parameter of interest
1. Check technical conditions
2. State competing null and alternative hypotheses about the population parameter of interest (in symbols and in words)
   Assume H0 is true, sketch a picture of the sampling distribution
3. Calculate test statistic
4. Determine p-value
5. State conclusion (reject or fail to reject the null hypothesis), translate back into English

Interpreting p-value
P-value = probability of observing sample data at least this extreme when the null hypothesis is true, due to “chance” (random sampling) alone
  How often would we get data “like this” if the null were true?
  “Like this” is determined by the alternative
If the p-value is small: evidence against H0
If the p-value is large: lack of evidence against H0
  We don’t get to support the null!
Guilty! Not Guilty!
Can we prove the dice are fair?

Example?
I love all chocolate ice cream
  I have yet to find a chocolate ice cream I don’t like…
Data are behaving as expected based on that initial belief… I have no reason to doubt it…
  What if I find one that I don’t like?
Specify the alternative as what you are hoping to show…

Determining the p-value (PP)
If Ha: parameter > value, the p-value is the probability above, P(Z > z)
If Ha: parameter ≠ value, the p-value is 2P(Z > |z|), “two-sided”
If Ha: parameter < value, the p-value is the probability below, P(Z < z)
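The three cases above can be sketched in a few lines of Python. This is only an illustration: the function names and the "greater" / "less" / "two-sided" labels are made up for this sketch, and the normal probabilities come from the standard library's math.erfc rather than from Minitab or the applets.

```python
import math

def upper_tail(z):
    """P(Z > z) for a standard normal Z."""
    return 0.5 * math.erfc(z / math.sqrt(2))

def p_value_from_z(z, alternative):
    """p-value for a z statistic; `alternative` is a label chosen for this sketch."""
    if alternative == "greater":      # Ha: parameter > value
        return upper_tail(z)
    if alternative == "less":         # Ha: parameter < value
        return upper_tail(-z)         # by symmetry, P(Z < z) = P(Z > -z)
    if alternative == "two-sided":    # Ha: parameter != value
        return 2 * upper_tail(abs(z))
    raise ValueError("alternative must be 'greater', 'less', or 'two-sided'")

print(round(p_value_from_z(1.96, "two-sided"), 3))
```

The two-sided case simply doubles the one-sided tail beyond |z|, which is why a two-sided alternative doubles the p-value.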

State hypotheses before you see the data
Social science research has established that the expression “absence makes the heart grow fonder” is generally true. Do you find this result surprising?
  Surprising: 4    Not Surprising: 17
“Out of sight, out of mind”…
  Surprising: 0    Not Surprising: 22

Special cases:
When you want to know if the parameter differs from the hypothesized value, use a two-sided Ha
  Doubles the p-value
From now on, when working with quantitative data, we will use the t distribution to find the p-value
  df = n – 1 (technology)
Level of significance: you may decide from the very start how low the p-value will have to be to convince you, e.g., .01, .05
  Then if p-value < α, say the result is statistically significant at that level, e.g., .05 or 5%

Example 4
Let μ represent the mean ratio used by American Indians
H0: μ = .618 (used same ratio on average)
Ha: μ ≠ .618 (ratio used by American Indians differs)
t = 2.05 with df = 19
P-value = .054
Weak evidence against H0: not overwhelmingly convincing that the mean ratio used by American Indians differs from .618.
BUT this procedure is probably not valid with these data, since the sample size was small (n = 20 < 30) and it does not appear that the population distribution follows a normal distribution. Would need another analysis tool…
If μ = .618, how often would we find a sample mean at least as extreme as .661 in a random sample?
[Figure: sampling distribution centered at μ = .618, with the observed sample mean x̄ = .661 marked]
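As a rough check on the p-value in Example 4, the sketch below integrates the t density numerically using only the Python standard library. The function names and the choice of Simpson's rule are assumptions of this sketch; in class the value would come from technology such as Minitab.

```python
import math

def t_density(x, df):
    """Density of the t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def two_sided_t_p_value(t, df, steps=2000):
    """2 * P(T > |t|), using Simpson's rule on [0, |t|] (steps must be even)."""
    b = abs(t)
    h = b / steps
    total = t_density(0.0, df) + t_density(b, df)
    for i in range(1, steps):
        weight = 4 if i % 2 else 2
        total += weight * t_density(i * h, df)
    area_0_to_t = total * h / 3
    # The density is symmetric, so the area between -|t| and |t| is twice this.
    return 1 - 2 * area_0_to_t

# Example 4: t = 2.05 with df = n - 1 = 19
print(round(two_sided_t_p_value(2.05, 19), 3))
```

The result agrees with the reported p-value of about .054, but it says nothing about whether the t procedure is valid for these data.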

The next question:
People turn to the right more than half the time… the average healthy temperature of an American adult is not 98.6 °F… less than 51% of Brown athletes are women…
Tests of significance have only told us what the parameter value is not. So what is it?

Example 1: Kissing the Right Way
If more than half the population turns to the right, how often is it? 2/3 of the time? 3/4 of the time? 70% of the time?

Plausible values for π
H0: π = .50    two-sided p-value = .0012
H0: π = 2/3    two-sided p-value = .5538
H0: π = .70    two-sided p-value = .1814
H0: π = .75    two-sided p-value = .0069
[Figure: sampling distribution of sample proportions, with the observed sample proportion marked]

What are the plausible values of π?
The plausible values of π are those for which the two-sided p-value > .05, that is, those with z between -2 and 2
Empirical rule: 95% of sample proportions are within 2 standard deviations of π
In 95% of samples, π should be within 2 standard deviations of the sample proportion p̂
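The idea of keeping every hypothesized value that the data do not reject can be sketched as a small search. The kissing study's sample counts do not appear in this transcript, so the sketch borrows the NCAA figures from Example 2 (sample proportion .634, n = 12651); the candidate grid, the function names, and the use of 1.96 via an exact normal p-value rather than the rule-of-thumb 2 are assumptions of this sketch.

```python
import math

def two_sided_p_value(p_hat, pi_0, n):
    """Two-sided z-test p-value for H0: pi = pi_0, with the SD computed under H0."""
    sd_null = math.sqrt(pi_0 * (1 - pi_0) / n)
    z = (p_hat - pi_0) / sd_null
    return math.erfc(abs(z) / math.sqrt(2))   # equals 2 * P(Z > |z|)

def plausible_values(p_hat, n, candidates, cutoff=0.05):
    """Keep the hypothesized values that are not rejected at the cutoff."""
    return [pi_0 for pi_0 in candidates if two_sided_p_value(p_hat, pi_0, n) > cutoff]

# Candidate values .600, .601, ..., .670 (a hypothetical grid for illustration)
candidates = [i / 1000 for i in range(600, 671)]
plausible = plausible_values(0.634, 12651, candidates)
print(min(plausible), max(plausible))   # 0.626 0.642
```

The surviving values run from .626 to .642, the same range as the 95% confidence interval reported in Example 2.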

General Strategy
To estimate the population proportion π, calculate the sample proportion p̂ and look 2 standard deviations in each direction:
  p̂ ± 2 sqrt(p̂(1 - p̂)/n)
sqrt(p̂(1 - p̂)/n) is the standard “error”
This is an approximate “95% confidence interval for π”

In general
A C% confidence interval uses the critical value z* that leaves (100 - C)/2 percent in each tail:
  C%     tail area    z*
  90%    .05          1.645
  99%    .005         2.58
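A minimal sketch of how z* follows from the confidence level, assuming Python 3.8 or later for statistics.NormalDist. The function name z_star is chosen for this sketch.

```python
from statistics import NormalDist

def z_star(confidence_percent):
    """Critical value leaving (100 - C)/2 percent in each tail of the standard normal."""
    tail_area = (100 - confidence_percent) / 2 / 100
    return NormalDist().inv_cdf(1 - tail_area)

for level in (90, 95, 99):
    print(level, round(z_star(level), 3))
```

For 95% confidence this gives about 1.96, which the general strategy rounds to 2.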

Example 2: NCAA Gambling
π = proportion of all male NCAA athletes who participate in some type of gambling behavior
Standard error = sqrt(.634(1 - .634)/12651)
We are 95% confident that π, the probability that an NCAA Div I athlete gambles, is between .626 and .642.
Between 62.6% and 64.2% of Div I athletes gamble.
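The interval in Example 2 can be reproduced with a short sketch. The function name is hypothetical, and z* = 1.96 is an assumption: it is the multiplier that reproduces the reported endpoints, whereas the rule-of-thumb value 2 gives a very slightly wider interval.

```python
import math

def proportion_confidence_interval(p_hat, n, z_star=1.96):
    """Approximate confidence interval for a population proportion."""
    standard_error = math.sqrt(p_hat * (1 - p_hat) / n)
    margin_of_error = z_star * standard_error
    return p_hat - margin_of_error, p_hat + margin_of_error

low, high = proportion_confidence_interval(0.634, 12651)
print(round(low, 3), round(high, 3))   # 0.626 0.642
```

With n = 12651 the standard error is only about .0043, which is why the interval is so narrow.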

Example 3: Body Temperatures
Let μ = average body temperature of a healthy adult
We are 95% confident that the mean body temperature of healthy adults is between ___ °F and ___ °F (assuming this sample was representative)
What if we only asked women?

Example 4: NCAA Gambling cont
What sample size is needed if we want the margin of error to be .01, with 95% confidence?
  With qualitative data, if you don’t have a prior guess for π, then use .5; this guarantees the margin of error is not larger…
  If n is a non-integer, always round up to the nearest integer
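A sketch of the sample-size calculation, solving margin = z* sqrt(π(1 - π)/n) for n and rounding up. The function name is hypothetical, z* = 1.96 is assumed for 95% confidence, and using .634 from Example 2 as the prior guess is an assumption of this sketch. Exact fractions are used so that rounding up is not thrown off by floating-point error.

```python
import math
from fractions import Fraction

def sample_size_for_proportion(margin, z_star="1.96", guess="0.5"):
    """Smallest n with z* * sqrt(guess * (1 - guess) / n) <= margin.

    Arguments are decimal strings so they convert to exact fractions.
    """
    m, z, p = Fraction(margin), Fraction(z_star), Fraction(guess)
    n_exact = (z / m) ** 2 * p * (1 - p)
    return math.ceil(n_exact)          # always round up

print(sample_size_for_proportion("0.01"))                 # no prior guess: use .5
print(sample_size_for_proportion("0.01", guess="0.634"))  # prior guess from Example 2
```

The .5 guess gives the larger sample size because π(1 - π) is largest at π = .5, which is what guarantees the margin of error is not larger than planned.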

Example 5: What do we mean by “confidence”?
Determine a confidence interval using your sample proportion of orange candies…
Did everyone obtain the same interval?
Will everyone’s interval capture π?

Interpretation of confidence
What is the reliability of this procedure?
  Assuming the Central Limit Theorem applies, how often will a C% confidence interval succeed in capturing the population parameter?
  To explore this, we have to pretend we know the population parameter

Interpretation of confidence
What if I put all of these confidence intervals into a bag and randomly selected one?
The confidence level is a probability statement about the method, not individual intervals
  If we had millions of intervals…
  Sample size?
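The "bag of intervals" idea can be explored with a small simulation, in the spirit of the applet. Everything specific here is an assumption of the sketch: the pretend population proportion, the sample size, the number of repetitions, the random seed, and the function names.

```python
import math
import random

def wald_interval(successes, n, z_star=1.96):
    """Approximate 95% interval: sample proportion +/- z* standard errors."""
    p_hat = successes / n
    standard_error = math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - z_star * standard_error, p_hat + z_star * standard_error

def coverage_rate(true_pi, n, reps, seed=512):
    """Fraction of simulated intervals that capture the pretend population proportion."""
    rng = random.Random(seed)
    captured = 0
    for _ in range(reps):
        # Draw one random sample of size n and count the "successes".
        successes = sum(rng.random() < true_pi for _ in range(n))
        low, high = wald_interval(successes, n)
        if low <= true_pi <= high:
            captured += 1
    return captured / reps

# Pretend we know pi = .5, then see how often the method succeeds.
print(coverage_rate(true_pi=0.5, n=100, reps=2000))
```

Each individual interval either captures π or it does not; the roughly 95% figure describes how often the method succeeds across many samples.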

For Next Time
Finish “surveys” on BB
PP using applet
HW 5
  Problems 2 and 3…
Review sheet and suggested problems have been posted online