1 Session 8 Tests of Hypotheses. 2 By the end of this session, you will be able to set up, conduct and interpret results from a test of hypothesis concerning.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Introductory Mathematics & Statistics for Business
Overview of Lecture Parametric vs Non-Parametric Statistical Tests.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
1 Session 7 Standard errors, Estimation and Confidence Intervals.
SADC Course in Statistics Common Non- Parametric Methods for Comparing Two Samples (Session 20)
The Poisson distribution
SADC Course in Statistics Introduction to Non- Parametric Methods (Session 19)
SADC Course in Statistics Tests for Variances (Session 11)
Assumptions underlying regression analysis
SADC Course in Statistics Basic principles of hypothesis tests (Session 08)
SADC Course in Statistics Meaning and use of confidence intervals (Session 05)
SADC Course in Statistics Comparing two proportions (Session 14)
SADC Course in Statistics Linking tests to confidence intervals (and other issues) (Session 10)
SADC Course in Statistics (Session 09)
STATISTICAL INFERENCE ABOUT MEANS AND PROPORTIONS WITH TWO POPULATIONS
Chapter 7 Sampling and Sampling Distributions
Chapter 17/18 Hypothesis Testing
Hypothesis Test II: t tests
You will need Your text Your calculator
Elementary Statistics
HYPOTHESIS TESTING. Purpose The purpose of hypothesis testing is to help the researcher or administrator in reaching a decision concerning a population.
Chapter 10: The t Test For Two Independent Samples
Lecture 14 chi-square test, P-value Measurement error (review from lecture 13) Null hypothesis; alternative hypothesis Evidence against null hypothesis.
The t-distribution William Gosset lived from 1876 to 1937 Gosset invented the t -test to handle small samples for quality control in brewing. He wrote.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
9.4 t test and u test Hypothesis testing for population mean Example : Hemoglobin of 280 healthy male adults in a region: Question: Whether the population.
6. Statistical Inference: Example: Anorexia study Weight measured before and after period of treatment y i = weight at end – weight at beginning For n=17.
Chi-Square and Analysis of Variance (ANOVA)
Active Learning Lecture Slides For use with Classroom Response Systems Comparing Groups: Analysis of Variance Methods.
Hypothesis Tests: Two Independent Samples
Chapter 4 Inference About Process Quality
Comparison of 2 Population Means Goal: To compare 2 populations/treatments wrt a numeric outcome Sampling Design: Independent Samples (Parallel Groups)
Small differences. Two Proportion z-Interval and z-Tests.
Lecture Unit Multiple Regression.
Chapter 15 ANOVA.
Comparing Two Population Parameters
Module 16: One-sample t-tests and Confidence Intervals
Module 17: Two-Sample t-tests, with equal variances for the two populations This module describes one of the most utilized statistical tests, the.
McGraw-Hill, Bluman, 7th ed., Chapter 9
Putting Statistics to Work
Statistical Inferences Based on Two Samples
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
Analysis of Variance Chapter 12 . McGraw-Hill/Irwin
Chapter Thirteen The One-Way Analysis of Variance.
1 Chapter 20: Statistical Tests for Ordinal Data.
Simple Linear Regression Analysis
Multiple Regression and Model Building
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
4/4/2015Slide 1 SOLVING THE PROBLEM A one-sample t-test of a population mean requires that the variable be quantitative. A one-sample test of a population.
Adapted by Peter Au, George Brown College McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited.
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
PSY 307 – Statistics for the Behavioral Sciences
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Chapter 9 Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.
AM Recitation 2/10/11.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
AP Statistics Section 11.1 B More on Significance Tests.
© Copyright McGraw-Hill 2004
Testing a Single Mean Module 16. Tests of Significance Confidence intervals are used to estimate a population parameter. Tests of Significance or Hypothesis.
Chapter 10 Two Sample Tests
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
Chapter 9 Hypothesis Testing.
Defining the null and alternative hypotheses
Chapter 12: Comparing Independent Means
Essential Statistics Two-Sample Problems - Two-sample t procedures -
Presentation transcript:

1 Session 8 Tests of Hypotheses

2 By the end of this session, you will be able to set up, conduct and interpret results from a test of hypothesis concerning a population mean explain how means from two populations may be compared, and state assumptions associated with the independent samples t-test interpret computer output from one or two-sample t- tests, present and write up conclusions resulting from such tests explain the difference between statistical significance and an important result Learning Objectives

3 Farmers growing maize in a certain area were getting average yields of 2900 kg/ha. A new Integrated Pest Management (IPM) approach was attempted with 16 farmers. Objective: To determine if the new approach results in an increase in maize yields. Yields from these 16 farmers (after using IPM) gave mean = 3454 kg/ha, with standard deviation = 672 kg/ha hence s.e. = 168. Can we determine whether IPM has really increased maize yields? An illustrative example

4 In above example, clearly the sample mean of 3454 kg/ha is greater than 2990 kg/ha But the question of interest is does this result indicate a significant increase in the yield or might it just be a result of the usual random variation of yield Hypothesis testing seeks to answer such questions by looking at the observed change relative to the noise, i.e. the standard error in the sample estimate Is the yield increase real?

5 Null hypothesis H 0 : = 2900 where is the true mean yield of farmers in the area using the new approach The promoters of the new approach are confident that yields with the new approach cannot possibly decrease. Hence the above null hypothesis needs to be tested against the alternative hypothesis H 1 : > 2900 Null H 0 & Alternative H 1

6 Testing the hypothesis Compute the t test statistic t = ( - )/(s/ n) = (3454 – 2900)/(168) = 3.30 which follows a t-distribution with n-1=15 degrees of freedom. Use values of the t-distribution to find the probability of getting a result, which is as extreme, or more extreme than the one (3.30) observed, given H 0 is true. The smaller this probability value, the greater is the evidence against the null hypothesis. This probability is called the p-value or significance level of the test

7 Analysis in Stata Type db ttesti or look for the One-sample mean comparison calculator on the menu

8 Results Result from the one-sided test done here t-probabilities from formulae or table t-value

9 Interpretation and conclusions It is clear from t-tables that the p-value is smaller than Using statistical software, we get the exact p-value as This p-value is so small, there is sufficient evidence to reject H 0. Conclusion: Use of the new IPM technology has led to an increase in maize yields (p-value=0.0024)

10 AgricNon-agric As part of a health survey, cholesterol levels of men in a small rural area were measured, including those working in agriculture and those employed in non- agricultural work. Aim: To see if mean cholesterol levels were different between the two groups. An example: Comparing 2 means

11 Begin with summarising each column of data. AgricNon-agric Mean= Std. dev. = Variance = There appears to be a substantial difference between the two means. Our question of interest is: Is this difference showing a real effect, or could it merely be a chance occurrence? Summary statistics

12 To answer the question, we set up: Null hypothesis H 0 : no difference between the two groups (in terms of mean response), i.e. 1 = 2 Alternative hypothesis H 1 : there is a difference, i.e. 1 2 The resulting test will be two-sided since the alternative is not equal to. Setting up the hypotheses

13 Use a two-sample (unpaired) t-test - appropriate with 2 independent samples Assumptions - normal distributions for each sample - constant variance (so test uses a pooled estimate of variance) - observations are independent Procedure - assess how large the difference in means is, relative to the noise in this difference, i.e. the std. error of the difference. Test for comparing means

14 where s 2, the pooled estimate of variance, is given by The test statistic is: Test Statistic

15 The pooled estimate of variance, is : = Hence the t-statistic is: = 41.7/ (2x1279.5/10) = 2.61, based on 18 d.f. Comparing with tables of t 18, this result is significant at the 2% level, so reject H 0. Note: The exact p-value = Numerical Results

16 Difference of means: 41.7 Standard error of difference: % confidence interval for difference in means: (8.09, 75.3). Conclusions: There is some evidence (p=0.018) that the mean cholesterol levels differ between those working in agriculture and others. The difference in means is 42 mg/dL with 95% confidence interval (8.1, 75.3). Results and conclusions

17 Analysis in Stata Input the data and do a t-test Or complete the dialogue as shown below Or type ttesti

18 Results This was a 2-sided test

19 Take care to report results according to size of p-value. For example, evidence of an effect is : almost conclusive if p-value < and could be said to be strong if p-value < If 0.01< p-value < 0.05, results indicate some evidence of an effect. If p-value > 0.05, but close to 0.05, it may indicate something is going on, but further confirmatory study is needed. General reporting the results

20 e.g. Farmers report that using a fungicide increased crop yields by 2.7 kg ha -1, s.e.m.=0.41 This gave a t-statistic of 6.6 (p-value<0.001) Recall that the p-value is the probability of rejecting the null hypothesis when it is true. i.e. it is the chance of error in your conclusion that there is an effect due to fungicide! Significance: further comments

21 In relation to the example on the previous slide, we may find one of the following situations for different crops. Mean yields: with and without fungicide Not an important finding! Very important finding! It is likely that in the first of these results, either too much replication or the incorrect level of replication had been used (e.g. plant level variation, rather than plot level variation used to compare means). How important are sig. tests?

22 e.g. There was insufficient evidence in the data to demonstrate that using a fungicide had any effect on plant yields (p=0.128). Mean yields: with and without fungicide This difference may be an important finding, but the statistical analysis was unable to pick up this difference as being statistically significant. HOW CAN THIS HAPPEN? Too small a sample size? High variability in the experimental material? One or two outliers? All sources of variability not identified? What does non-significance tell us?

23 Statistical significance alone is not enough. Consider whether the result is also scientifically meaningful and important. When a significant result if found, report the finding in terms of the corresponding estimates, their standard errors and C.I.s (as is done by Stata) Significance – Key Points