Unit 4 – Inference from Data: Principles

Slides:



Advertisements
Similar presentations
Comparing Two Proportions
Advertisements

Chapter 7 Sampling and Sampling Distributions
1 BA 275 Quantitative Business Methods Statistical Inference: Hypothesis Testing Type I and II Errors Power of a Test Hypothesis Testing Using Statgraphics.
Hypothesis Testing W&W, Chapter 9.
Chapter 7, Sample Distribution
Chapter 9 Introduction to the t-statistic
Chapter 4 Inference About Process Quality
Please enter data on page 477 in your calculator.
Statistical Inferences Based on Two Samples
Chapter 8 Estimation Understandable Statistics Ninth Edition
Topics
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Multiple Regression and Model Building
6.2 Confidence Intervals for the Mean (Small Samples) Statistics Mrs. Spitz Spring 2009.
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Chapter 16 Inferential Statistics
Unit 4 – Inference from Data: Principles
Chapter 8 Hypothesis Test.
Review Unit 7: Hypothesis Testing. Things to consider: What is the p-value? What is more serious – Type I error or Type II? How do we choose the level.
Chapter 9 Hypothesis Testing.
Statistics 270– Lecture 25. Cautions about Z-Tests Data must be a random sample Outliers can distort results Shape of the population distribution matters.
CHAPTER 23 Inference for Means.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.3a Tests About a Population Mean.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics 101 Chapter 10 Section 2. How to run a significance test Step 1: Identify the population of interest and the parameter you want to draw conclusions.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Tests of Significance: Stating Hypothesis; Testing Population Mean.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
T tests comparing two means t tests comparing two means.
Hypothesis Testing Involving One Population Chapter 11.4, 11.5, 11.2.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Part 1: Chapters 7 to 9. 95% within 2 standard deviations 68% within 1 standard deviations 99.7% within 3 standard deviations.
Review of Power of a Test
CHAPTER 9 Testing a Claim
Topics
Unit 4 – Inference from Data: Principles
More on Inference.
CHAPTER 9 Testing a Claim
Chapter 9 Hypothesis Testing.
Inference about Two Means - Independent Samples
Topics
STA 291 Spring 2010 Lecture 18 Dustin Lueker.
Warm Up Check your understanding P. 586 (You have 5 minutes to complete) I WILL be collecting these.
Lecture Slides Elementary Statistics Twelfth Edition
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Inference for Distributions
Chapter 8 Hypothesis Testing with Two Samples.
Hypothesis Tests for a Population Mean in Practice
More on Inference.
CHAPTER 9 Testing a Claim
Statistical Inference about Regression
STAT Z-Tests and Confidence Intervals for a
Statistical Inference for Managers
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
STA 291 Spring 2008 Lecture 18 Dustin Lueker.
Chapter 11: Testing a Claim
CHAPTER 9 Testing a Claim
Lecture Slides Elementary Statistics Twelfth Edition
CHAPTER 9 Testing a Claim
Carrying Out Significance Tests
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

Unit 4 – Inference from Data: Principles Topics 19 - 20 Unit 4 – Inference from Data: Principles

Topic 19 Confidence Intervals: Means

Topic 19 - Confidence Interval: Mean, σ is unknown The purpose of confidence intervals is to use the sample statistic to construct an interval of values that you can be reasonably confident contains the actual, though unknown, parameter. The estimated standard deviation of the sample statistic X-bar is called the standard error Confidence Interval for a population proportion : where n >= 30 t * is calculated based on level of confidence When running for example 95% Confidence Interval: 95% is called Confidence Level and we are allowing possible 5% for error, we call this alpha (α )= 5% where α is the significant level

Topic 19 - Confidence Interval: Mean, σ is unknown Use if the sample data is given, use the Stat, Edit and enter data in the calculator before running the Confidence Interval L1 is where data is entered by you C-Level: is the level you are running the Confidence Interval Use if the information about sample data is given. X-Bar mean of sample data Sx is Standard deviation of the sample n is sample size C-Level: is the level you are running the Confidence Interval

Activity 19-3: M&M Consumption

Travel time to work.  A study of commuting times reports the travel times to work of a random sample of 20 employed adults in New York State. The mean is   = 31.25 minutes and the standard deviation is s = 21.88 minutes. What is the standard error of the mean? s/√n = 21.88/√20 = 4.8925 minutes.

Ancient air.  The composition of the earth’s atmosphere may have changed overtime. To try to discover the nature of the atmosphere long ago, we can examine the gas in bubbles inside ancient amber. Amber is tree resin that has hardened and been trapped in rocks. The gas in bubbles within amber should be a sample of the atmosphere at the time the amber was formed. Measurements on specimens of amber from the late Cretaceous era (75 to 95 million years ago) give these percents of nitrogen: 63.4 65.0 64.4 63.3 54.8 64.5 60.8 49.1 51.0 Assume (this is not yet agreed on by experts) that these observations are an SRS from the late Cretaceous atmosphere. Use a 90% confidence interval to estimate the mean percent of nitrogen in ancient air.

Ancient air. Enter data for L1. 95% confidence Interval: Using TI83, under Stat, TEST, Choose option 8:TInterval Mean of the sample = 59.6 Standard deviation = 6.26 Degree of freedom = df= 8 Confidence interval for mean percent of nitrogen is between 54.8 and 64.4.

Exercise 19-15 Page 414 Exercise 19-23 Page 417 Exercise 19-24 Page 417

Topic 20 Test of Significance: Means

Topic 20 – Test of Significant: Mean The purpose of Test of Significant is when we do know the population Parameter but we do not necessary agree with it or we have question about it. To do the test we need to run a sample and we use the statistic to test its validity. Step 1: Identify and define the parameter. Step 2: we initiate hypothesis regarding the question – we can not run test of significant without establishing the hypothesis Step 3: Decide what test we have to run, in case of proportion, we use t-test

Topic 20 – Test of Significant: Mean Step 4: Run the test from calculator Step 5: From the calculator write down the p-value T-test Step 6: Compare your p-value with α – alpha – Significant Level If p-value is smaller than α we “reject” the null hypothesis, then it is statistically significant based on data. If p-value is greater than the α we “Fail to reject” the null hypothesis, then it is not statistically significant based on data. Last step: we write conclusion based on step 6 at significant level α p- value > 0.1: little or no evidence against H0 • 0.05 < p- value <= 0.10: some evidence against H0 • 0.01 < p- value <= 0.05: moderate evidence against H0 • 0.001 < p- value <= 0.01: strong evidence against H0 • p- value <= 0.001: very strong evidence against H0

Few Possible cases to look at: A teacher suspects that the mean for older students is higher than 115 Higher than means (> 115) The opposite of higher than is less than or equal to 115 ( 115) Comparing the two, null hypothesis is the comparison that includes equality (=) Ho: µ = 115 Ha: µ > 115 One-sided alternative A teacher suspects that the mean for older students is same or more than 115 Same or more than means (> 115) The opposite of same or more than is less than 115 (< 115) Ho: µ = 115 Ha: µ < 115 One-sided alternative A teacher suspects that the mean for older students is also 115 Same means (= 115) The opposite of same is not equal to 115 ( 115) Ho: µ = 115 Ha: µ  115 Two-sided alternative

Fuel economy.  According to the Environmental Protection Agency (EPA), the Honda Civic hybrid car gets 51 miles per gallon (mpg) on the highway. The EPA ratings often overstate true fuel economy. Larry keeps careful records of the gas mileage of his new Civic hybrid for 3000 miles of highway driving. His result is x-bar= 47.2 mpg. Larry wonders whether the data show that his true long-term average highway mileage is less than 51 mpg. What are his null and alternative hypotheses? Answer Larry wonders whether the data show that his true long-term average highway mileage is less than 51 mpg. H0: µ = 51 mpg; Ha: µ < 51 mpg.

Problem If a researcher is interested in testing whether the mean is different from some claimed value, 55, then the null and alternative are test the hypotheses H0: μ = 55, Ha: μ ≠ 55

H0 : x-bar = 1000 grams Ha : x-bar < 1000 grams Stating hypotheses.  In planning a study of the birth weights of babies whose mothers did not see a doctor before delivery, a researcher states the hypotheses as H0 : x-bar = 1000 grams Ha : x-bar < 1000 grams What’s wrong with this? Hypotheses should be stated in terms of µ, not x-bar  .

Topic 20 – Test of Significant: Mean, σ is unknown Use if the sample data is given, use the Stat, Edit and enter data in the calculator before running the T-test µ0 is mean–value in question List: L1 where the raw data is entered by you µ: is the alternative hypothesis Use if the information about sample data is given. µ0 is mean–value in question X-bar is sample mean Sx is Sample Standard deviation n is sample size µ: is the alternative hypothesis

Improving your SAT score. We suspect that on the average students will score higher on their second attempt at the SAT mathematics exam than on their first attempt. Suppose we know that the changes in score (second try minus first try) follow a Normal distribution. Here are the results for 46 randomly chosen high school students: Do these data give good evidence that the mean change in the population is greater than zero? −30 24 47 70 −62 55 −41 −32 128 −11 −43 122 −10 56 32 −28 −19 1 17 57 −14 −58 77 27 −33 51 −67 29 94 2 12 −53 −49 49 8 −24 96 120 −2 −39 99

Activity 20- 2: Sleeping Times The null hypothesis is that the mean sleep time of the population is 7 hours. In symbols, the null hypothesis is H0 : µ = 7.0 hours. The alternative hypothesis is that the mean sleep time of the population is not 7 hours. In symbols, the alternative hypothesis is Ha : µ ≠ 7.0 hours. Sample Number Sample Size Sample Mean Sample SD Test Statistic p- value 1 10 6.6 0.825 2 1.597 3 30 4

Exercise 20-8: UFO Sighters’ Personality – Page 432 Exercise 20-10: Credit Card Usage - Page 433 Exercise 20-21: Pet Ownership - Page 436 Exercise 20-14: Age Guesses – Page 434

EXTRA Problems

Problem Assume that you are conducting a test of significance using a significance level of α = 0.10. If your test yields a P-value of 0.08, what is the appropriate conclusion? P-value = 0.08 < 0.10 Reject Null, It is statistically significant

Problem The nicotine content in cigarettes of a certain brand is normally distributed with mean (in milligrams) μ and standard deviation σ = 0.1. The brand advertises that the mean nicotine content of their cigarettes is 1.5, but measurements on a random sample of 400 cigarettes of this brand gave a mean of x = 1.52. Is this evidence that the mean nicotine content is actually higher than advertised? at significance level α = 0.01. You conclude Is this evidence that the mean nicotine content is actually higher than advertised? State the hypothesis test the hypotheses H0: μ = 1.5, Ha: μ > 1.5

Problem A researcher wants to know if the average time in jail for robbery has increased from what it was several years ago when the average sentence was 7 years. He obtains data on 400 more recent robberies and finds an average time served of 7.5 years. If we assume the standard deviation of sample is 3 years, what is the p-value of the test? at significance level α = 0.05. You conclude