Lecture 6 Outline: Tue, Sept 23 Review chapter 2.2 –Confidence Intervals Chapter 2.3 –Case Study 2.1.1 –Two sample t-test –Confidence Intervals Testing.

Slides:



Advertisements
Similar presentations
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
Advertisements

Lecture 6 Outline – Thur. Jan. 29
CHAPTER 24: Inference for Regression
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Objectives (BPS chapter 24)
Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Lecture 23: Tues., Dec. 2 Today: Thursday:
Lecture 7 Outline Levene’s test for equality of variances (4.5.3) Interpretation of p-values (2.5.1) Robustness and resistance of t-tools ( )
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Lecture 10 Outline: Tue, Oct 7 Resistance of two sample t-tools (Chapter 3.3) Practical strategies for two-sample problem (Chapter 3.4) Review Office hours:
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Lecture 23: Tues., April 6 Interpretation of regression coefficients (handout) Inference for multiple regression.
Lecture 7 Outline – Thur, Sep 25
BCOR 1020 Business Statistics
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
Lecture 5 Outline: Thu, Sept 18 Announcement: No office hours on Tuesday, Sept. 23rd after class. Extra office hour: Tuesday, Sept. 23rd from 12-1 p.m.
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Two Population Means Hypothesis Testing and Confidence Intervals With Unknown Standard Deviations.
Chapter 11: Inference for Distributions
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
5-3 Inference on the Means of Two Populations, Variances Unknown
1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.
Week 9 October Four Mini-Lectures QMM 510 Fall 2014.
7.1 Lecture 10/29.
Statistical Inference for Two Samples
Inference for regression - Simple linear regression
Hypothesis testing – mean differences between populations
Estimation and Confidence Intervals
Chapter 19: Two-Sample Problems STAT Connecting Chapter 18 to our Current Knowledge of Statistics ▸ Remember that these formulas are only valid.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
More About Significance Tests
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Comparing Two Population Means
Lecture 14 Sections 7.1 – 7.2 Objectives:
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
June 25, 2008Stat Lecture 14 - Two Means1 Comparing Means from Two Samples Statistics 111 – Lecture 14 One-Sample Inference for Proportions and.
Essential Statistics Chapter 131 Introduction to Inference.
1 10 Statistical Inference for Two Samples 10-1 Inference on the Difference in Means of Two Normal Distributions, Variances Known Hypothesis tests.
7. Comparing Two Groups Goal: Use CI and/or significance test to compare means (quantitative variable) proportions (categorical variable) Group 1 Group.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Confidence intervals and hypothesis testing Petter Mostad
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Chapter 10 Inference for Regression
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Inference for distributions: - Comparing two means.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Lecture 22 Dustin Lueker.  Similar to testing one proportion  Hypotheses are set up like two sample mean test ◦ H 0 :p 1 -p 2 =0  Same as H 0 : p 1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Review Statistical inference and test of significance.
AP Statistics Chapter 24 Comparing Means. Objectives: Two-sample t methods Two-Sample t Interval for the Difference Between Means Two-Sample t Test for.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Presentation transcript:

Lecture 6 Outline: Tue, Sept 23 Review chapter 2.2 –Confidence Intervals Chapter 2.3 –Case Study –Two sample t-test –Confidence Intervals Testing for equal variances (Chapter 4.5.3)

Notes on Homework #1. Finding quartiles: –The stem and leaf plot provides an ordered list of the observations in increasing order –The first quartile is the median of the observations whose position in the ordered list is to the left (to the top in stem-and-leaf plot) of the overall median –The third quartile is the median of the observations whose position in the ordered list is to the right (to the bottom in stem-and-leaf plot) of the overall median. #3. Unemployment Spells.

One-sample t-tools and paired t-test Testing hypotheses about the mean difference in pairs is equivalent to testing hypotheses about the mean of a single population Probability model: Simple random sample with replacement from population. Test statistic:

p-value Fact: If H 0 is true, then t has the Student’s t-distribution with n-1 degrees of freedom Can look up quantiles of t-distribution in Table A.2. The (2-sided) p-value is the proportion of random samples with absolute value of t >= observed test statistic T o =|t o | if H 0 is true. Schizophrenia example: t o =3.23, p-value = Prob>|t| = The reliability of the p-value (as the probability of observing as extreme a test statistic as the one actually observed if H 0 is true) is only guaranteed if the probability model of random sampling is correct – if the data is collected haphazardly rather than through random sampling, the p-value is not reliable.

One-sided tests One-sided alternatives: H 1 :, H 1 :, Two- sided alternative: H 1 : Choice of one-sided or two-sided depends on how specifically the researcher can pinpoint the alternative. Always report whether p-value is one-or-two- sided. One-sided test: H 1 : –Test statistic: –For schizophrenia example, t=3.21, p-value (1-sided) =.003

p-value animation

Confidence Interval for A confidence interval is a range of “plausible values” for a statistical parameter (e.g., the population mean) based on the data. It conveys the precision of the sample mean as an estimate of the population mean. A confidence interval typically takes the form: point estimate margin of error The margin of error depends on two factors: –Standard error of the estimate –Degree of “confidence” we want.

CI for population mean If the population distribution of Y is normal (* we will study the if part later) 95% CI for mean of single population: For schizophrenia data:

Interpretation of CIs A 95% confidence interval will contain the true parameter (e.g., the population mean) 95% of the time if repeated random samples are taken. It is impossible to say whether it is successful or not in any particular case, i.e., we know that the CI will usually contain the true mean under random sampling but we do not know for the schizophrenia data if the CI (0.067cm 3,0.331cm 3 ) contains the true mean difference. The accuracy of the confidence interval is only guaranteed if the probability model is correct – if the data is collected haphazardly rather than through random sampling, the confidence interval is not reliable

Matched Pairs in JMP Click Analyze, Matched Pairs, put two columns (e.g., affected and unaffected) into Y, Paired Response. Can also use one-sample t-test. Click Analyze, Distribution, put difference into Y, columns. Then click red triangle under difference and click test mean. For both methods of doing paired t-test (Analyze, Matched Pairs or Analyze, Distribution), the 95% confidence intervals for the mean are shown on the output.

Case Study Background: During a severe winter storm in New England, 59 English sparrows were found freezing and brought to Bumpus’ laboratory – 24 died and 35 survived. Broad question: Did those that perish do so because they lacked physical characteristics enabling them to withstand the intensity of this episode of selective elimination? Specific questions: Do humerus (arm bone) lengths tend to be different for survivors than for those that perished? If so, how large is the difference?

Structure of Data Two independent samples Observational study – cannot infer a causal relationship between humerus length and survival Sparrows were not collected randomly. Fictitious probability model: Independent simple random samples with replacement from two populations (sparrows that died and sparrows that survived). See Display 2.7

Two-sample t-test Population parameters: H 0 :, H 1 : Equal spread model: (call it ) Statistics from samples of size n 1 and n 2 from pops. 1 and 2: For Bumpus’ data:

Sampling Distribution of (equal spread model) Pooled estimate of : See Display 2.8

Two sample t-test H 0 :, H 1 : Test statistic: T= If population distributions are normal with equal, then if H 0 is true, the test statistic t has a Student’s t distribution with degrees of freedom. p-value equals probability that T would be greater than observed |t| under random sampling model if H 0 is true; calculated from Student’s t distribution. For Bumpus data, two-sided p-value =.0809, suggestive but inconclusive evidence of a difference

One-sided p-values If H 1:, test statistic is If H 1:, test statistic is p-value is probability that T would be >= observed T 0 if H 0 is true

Confidence Interval for 100(1- )% confidence interval for : For 95% confidence interval, Factors affecting width of confidence interval: –Sample size –Population standard deviation –Level of confidence

Two sample tests and CIs in JMP Click on Analyze, Fit Y by X, put Group variable in X and response variable in Y, and click OK Click on red triangle next to Oneway Analysis and click Means/ANOVA/t-test To see the means and standard deviations themselves, click on Means and Std Dev under red triangle

Bumpus’ Data Revisited Bumpus concluded that sparrows were subjected to stabilizing selection – birds that were markedly different from the average were more likely to have died. Bumpus (1898): “The process of selective elimination is most severe with extremely variable individuals, no matter in what direction the variations may occur. It is quite as dangerous to be conspicuously above a certain standard of organic excellence as it is to be conspicuously below the standard. It is the type that nature favors.” Bumpus’ hypothesis is that the variance of physical characteristics in the survivor group should be smaller than the variance in the perished group

Testing Equal Variances Two independent samples from populations with variances and H 0 : vs. H 1 : Levene’s Test – Section In JMP, Fit Y by X, under red triangle next to Oneway Analysis of humerus by group, click Unequal Variances. Use Levene’s test. p-value =.4548, no evidence that variances are not equal, thus no evidence for Bumpus’ hypothesis.