RDPStatistical Methods in Scientific Research - Lecture 11 Lecture 1 Interpretation of data 1.1 A study in anorexia nervosa 1.2 Testing the difference.

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.
AP Statistics Section 11.2 B. A 95% confidence interval captures the true value of in 95% of all samples. If we are 95% confident that the true lies in.
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
CHAPTER 24: Inference for Regression
Introduction to Statistics
Significance Testing Chapter 13 Victor Katch Kinesiology.
PSY 307 – Statistics for the Behavioral Sciences
Evaluating Hypotheses
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 22 = More of Chapter “Confidence Interval Estimation”
8 - 10: Intro to Statistical Inference
Chapter 8 Estimation: Single Population
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
Lecture 6. Hypothesis tests for the population mean  Similar arguments to those used to develop the idea of a confidence interval allow us to test the.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Statistical Inference for Two Samples
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Overview Definition Hypothesis
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Hypothesis testing – mean differences between populations
1 G Lect 5b G Lecture 5b A research question involving means The significance test approach »The problem of s 2 »Student’s t distribution.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
More About Significance Tests
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Adapted by Peter Au, George Brown College McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.2.
Stat 1510 Statistical Inference: Confidence Intervals & Test of Significance.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
CHAPTER 9 Testing a Claim
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
10.1: Confidence Intervals Falls under the topic of “Inference.” Inference means we are attempting to answer the question, “How good is our answer?” Mathematically:
RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
BPS - 3rd Ed. Chapter 141 Tests of significance: the basics.
3-1 MGMG 522 : Session #3 Hypothesis Testing (Ch. 5)
Chapter 8 Parameter Estimates and Hypothesis Testing.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
1 Probability and Statistics Confidence Intervals.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
More on Inference.
Chapter 9 Hypothesis Testing.
More on Inference.
Chapter 9 Hypothesis Testing.
Presentation transcript:

RDPStatistical Methods in Scientific Research - Lecture 11 Lecture 1 Interpretation of data 1.1 A study in anorexia nervosa 1.2 Testing the difference between the samples 1.3 Confidence intervals for treatment effects

RDPStatistical Methods in Scientific Research - Lecture A study in anorexia nervosa Ben-Tovim, Whitehead and Crisp (1979) “Sufferers from anorexia nervosa, even those whose bodies have become severely emaciated, often maintain that their bodily dimensions are quite normal”  Are anorexics able to judge their own bodily dimensions?  Are they worse at doing so than healthy controls? 8 anorexics and 11 controls participated in a study of these questions

RDPStatistical Methods in Scientific Research - Lecture 13 The apparatus Two lights on a horizontal beam Move them together: “Say stop when the distance apart is the same as the width of your waist” Repeat while they move apart, and then average the two measurements to give the perceived width

RDPStatistical Methods in Scientific Research - Lecture 14 Body perception index Let  A  = mean BPI for anorexics  C  = mean BPI for controls Null hypothesis is H 0 :  A  =  C

RDPStatistical Methods in Scientific Research - Lecture 15 The data Anorexics: 130, 194, 160, 120, 152, 144, 120, 141 Controls: 202, 140, 168, 160, 147, 133, 229, 172, 130, 206, 153 Summary: Overall (n = 19): mean = , standard deviation = Anorexics (n A = 8): mean = , standard deviation = Controls (n C = 11): mean = , standard deviation =

RDPStatistical Methods in Scientific Research - Lecture 16 Formulae mean: standard deviation (a measure of the spread of the data):

RDPStatistical Methods in Scientific Research - Lecture 17 Notes  Here we have means for anorexics and for controls and standard deviations S A for anorexics and S C for controls  These are sample means and sample standard deviations: they vary from sample to sample  The population means are  A for anorexics and  C for controls and the population standard deviations are  A for anorexics and  C for controls: these are fixed truths that will never be known precisely  and are estimates of  A and  C S A and S C are estimates of  A and  C

RDPStatistical Methods in Scientific Research - Lecture Testing the difference between the samples The two group means are different from one another Are they significantly different? Or might the difference just be due to chance? We will use a t-test to find out

RDPStatistical Methods in Scientific Research - Lecture 19 The t-statistic where

RDPStatistical Methods in Scientific Research - Lecture 110 Notes  We begin with an estimate of the difference between the means:  This is standardised by dividing by S: S 2 is a weighted average of  Standardisation ensures that t is unit-free  Division by is a matter of convention, but it does ensure that values are not too greatly affected by sample sizes

RDPStatistical Methods in Scientific Research - Lecture 111 Calculation so that

RDPStatistical Methods in Scientific Research - Lecture 112 Theory Suppose that  the BPIs of anorexics follow the normal distribution with mean  A and standard deviation   the BPIs of controls follow the normal distribution with mean  C and the same standard deviation  Then, if  A =  C, the statistic t follows Student’s t-distribution on 17 degrees of freedom (17 = 19 – 2 = n  # parameters)  a similar shape (slightly fatter) centred on 0

RDPStatistical Methods in Scientific Research - Lecture 113 The t-distribution The probability that a random variable following Student’s t-distribution on 17 degrees of freedom is   is 0.061

RDPStatistical Methods in Scientific Research - Lecture 114 Interpretation If the null hypothesis H 0 :  A  =  C is true (and the populations have the same standard deviation), then t is unusually negative The chance of it being so negative (or even more so) is This is the p-value against the one-sided alternative H 1 :  A  <  C The value is not so small that one would wish to reject H 0 and conclude that there is a significant difference – it shows a trend, but does not constitute strong evidence

RDPStatistical Methods in Scientific Research - Lecture 115 Caution! The investigators sought evidence that anorexics had a poorer perception of their bodily dimensions than controls – that  A  >  C The trend is in the opposite direction! “So maybe the anorexics have a better perception, being so obsessed by their bodies” Investigators are going to wish to interpret the data, whichever direction the difference, so use a two-sided p-value

RDPStatistical Methods in Scientific Research - Lecture 116 Two-sided p-value Double the one-sided p-value to give the two-sided p-value: p = 0.122

RDPStatistical Methods in Scientific Research - Lecture 117 Convention  A two-sided p-value  0.05 is usually taken to represent strong evidence of an effect  This goes back to Fisher in the 1930s  It is rather arbitrary, but it is a useful yardstick  A one-sided p-value  is usually taken to represent strong evidence of an effect – this avoids “cheating” by choosing the direction of the difference once the data have been observed

RDPStatistical Methods in Scientific Research - Lecture Confidence intervals for treatment effects We have used  A to denote the population mean of the BPIs for anorexics and  A to denote their population standard deviations These are estimated by the sample mean = and by the sample standard deviation S A = respectively How good an estimate of  A is ? How big or small might  A actually be? A confidence interval will answer this question

RDPStatistical Methods in Scientific Research - Lecture 119 Another t-distributed random variable Let Note that you cannot calculate t A as it depends on the unknown  A If the BPI observations are normally distributed, then t A follows Student’s t-distribution with (n A – 1) df Now, a t 7 random variable lies between  and with probability 0.95

RDPStatistical Methods in Scientific Research - Lecture 120 A confidence interval for  A It follows that, with probability 0.95, which is

RDPStatistical Methods in Scientific Research - Lecture 121 A confidence interval for  A So, with probability 0.95, We say that (124.73, ) is a 95% confidence interval for  A The upper and lower limits are random, while  A is fixed The limits capture the true value of  A with probability 0.95 It could well be that the true mean BPI for anorexics is as low as , it could also be as high as

RDPStatistical Methods in Scientific Research - Lecture 122 A confidence interval for  C For the controls, n C = 11, = and S C = The 97.5% point of the t distribution on 10 df is Hence, the 95% confidence interval for  C is (   /  11)  (145.49, ) Note that the confidence intervals for  A and  C overlap What about a 95% confidence interval for  =  A  C ?

RDPStatistical Methods in Scientific Research - Lecture 123 A confidence interval for  =  A  C Now follows Student’s t-distribution with (n – 2) df The 97.5% point of the t distribution on 17 df is 2.110

RDPStatistical Methods in Scientific Research - Lecture 124 A confidence interval for  =  A  C It follows that, with probability 0.95 so that the 95% confidence interval for  is

RDPStatistical Methods in Scientific Research - Lecture 125 Interpretation 0 lies within the confidence interval  consistent with lack of significant evidence against H 0 :  = 0 at the 5% level (2-sided), as found from the t-test The mean BPI for anorexics could be substantially lower that that for controls (by more than 50), or slightly higher Larger sample sizes would reduce the width of the confidence intervals, and make it easier to determine whether there really is a difference