Randomized Trial of Preoperative Chemoradiation Versus Surgery Alone in Patients with Locoregional Esophageal Carcinoma, Ursa et al. Statistical Methods:

Slides:



Advertisements
Similar presentations
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Advertisements

Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
Issues About Statistical Inference Dr R.M. Pandey Additional Professor Department of Biostatistics All-India Institute of Medical Sciences New Delhi.
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Inference Sampling distributions Hypothesis testing.
Last Time (Sampling &) Estimation Confidence Intervals Started Hypothesis Testing.
STAT 135 LAB 14 TA: Dongmei Li. Hypothesis Testing Are the results of experimental data due to just random chance? Significance tests try to discover.
Introduction to Statistics
Find the Joy in Stats ? ! ? Walt Senterfitt, Ph.D., PWA Los Angeles County Department of Public Health and CHAMP.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Introduction to Hypothesis Testing
Business Statistics - QBM117
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Introduction to Hypothesis Testing
Stat 301 – Day 21 Large sample methods. Announcements HW 4  Updated solutions Especially Simpson’s Paradox  Should always show your work and explain.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Chapter 25 Asking and Answering Questions About the Difference Between Two Population Means: Paired Samples.
Sample size calculations
Sample Size Determination
Sample Size Determination Ziad Taib March 7, 2014.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 11 Introduction to Hypothesis Testing.
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
1. Statistics: Learning from Samples about Populations Inference 1: Confidence Intervals What does the 95% CI really mean? Inference 2: Hypothesis Tests.
1 Economics 173 Business Statistics Lectures 3 & 4 Summer, 2001 Professor J. Petry.
Fundamentals of Hypothesis Testing: One-Sample Tests
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Hypothesis Testing.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Confidence Intervals Elizabeth S. Garrett Oncology Biostatistics March 27, 2002 Clinical Trials in 20 Hours.
Health and Disease in Populations 2001 Sources of variation (2) Jane Hutton (Paul Burton)
Confidence Intervals Elizabeth Garrett-Mayer
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
Chapter 4 Introduction to Hypothesis Testing Introduction to Hypothesis Testing.
Chapter 8 Introduction to Hypothesis Testing
CONFIDENCE INTERVALS DR.S.SHAFFI AHAMED ASST. PROFESSOR DEPT. OF F & CM.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
CHAPTER 17: Tests of Significance: The Basics
Therapeutic Equivalence & Active Control Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
Chapter 8 Delving Into The Use of Inference 8.1 Estimating with Confidence 8.2 Use and Abuse of Tests.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Information Technology and Decision Making Information Technology and Decision Making Example 10.1 Experimenting with a New Pizza Style at the Pepperoni.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Lecture & tutorial material prepared by: Dr. Shaffi Shaikh Tutorial presented by: Dr. Rufaidah Dabbagh Dr. Nurah Al-Amro.
CONFIDENCE INTERVALS DR.S.SHAFFI AHAMED ASST. PROFESSOR DEPT. OF F & CM.
Probability & Significance Everything you always wanted to know about p-values* *but were afraid to ask Evidence Based Chiropractic April 10, 2003.
Statistical Analysis II Lan Kong Associate Professor Division of Biostatistics and Bioinformatics Department of Public Health Sciences December 15, 2015.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
Lesson Use and Abuse of Tests. Knowledge Objectives Distinguish between statistical significance and practical importance Identify the advantages.
Statistical Inference By Using Confidence Intervals.
1 BLA Sipuleucel-T (APC-8015) FDA Statistical Review and Findings Bo-Guang Zhen, PhD Statistical Reviewer, OBE, CBER March 29, 2007 Cellular, Tissue.
Chapter 20 Testing Hypotheses About Proportions. confidence intervals and hypothesis tests go hand in hand:  A confidence interval shows us the range.
Section 9.1 First Day The idea of a significance test What is a p-value?
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
DSCI 346 Yamasaki Lecture 1 Hypothesis Tests for Single Population DSCI 346 Lecture 1 (22 pages)1.
P-values.
Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations Greenland et al (2016)
Significance Tests: The Basics
Presentation transcript:

Randomized Trial of Preoperative Chemoradiation Versus Surgery Alone in Patients with Locoregional Esophageal Carcinoma, Ursa et al. Statistical Methods: –Kaplan-Meier Curves –Log Rank Test –Confidence Intervals –Cox Proportional Hazards Model Focus on: –hypothesis testing and pvalues –confidence intervals

Hypothesis Tests “Although the survival rate at 3 years was somewhat longer for those randomly assigned to the multimodality arm (30%) than to the surgery alone arm (16%), this difference was not significant (p = 0.15).” “Patients randomized to the multimodality arm have a 31% lower risk of death from any cause over the study (p = 0.09) after adjusting for the other prognostic factors.” èWhich (if any) should be concluded? ÊSurgery and multimodality treatment have same response rate. ËMultimodality treatment did not prove to be better and it should be abandoned. ÌThere is a clinically interesting difference here, but the sample size is not sufficient to conclude strong evidence in favor of multimodality treatment.

What is a pvalue? The pvalue is the probability of observing a result as extreme as you did if the null hypothesis is true. Null hypothesis?! –“no association” (I.e. between survival and arm) –3 year survival rate in arm I is the same as in arm II So, in our case, the pvalue is the probability of observing a 30% 3 year survival in arm II and a 16% 3 year survival in arm I if, in truth, there is no difference in 3 year survival in the two treatment modalities. Drawback: hypothesis testing conclusion relies heavily on sample size to make conclusion. When interpreting pvalues, people assume we have a yes/no result and do not always consider what the magnitude of the difference is. What is the difference between p = 0.04 and p= 0.08? Not unusual to see –statistically significant, clinically insignificant result –statistically insignificant, clinically significant result

Confidence Intervals Hazard Ratio95% CI Chemo v surgery Arm 1Arm II %95%CI%95%CI 1 year survival year survival What about the confidence interval for the 1 year and 3 year difference?

Convincing, but could be more so Why not provide confidence intervals for... –median survival –1 year survival –3 year survival Would give readers a “reasonable range” of values to consider for treatment effect that are intuitive. What is remembered? –P = 0.09 which means insignificant result –But, can anyone remember the treatment effect?

Confidence Intervals for Reporting Results of Clinical Trials, Simon “[Hypothesis tests] are sometimes overused and their results misinterpreted.” “Confidence intervals are of more than philosophical interest, because their broader use would help eliminate misinterpretations of published results “Frequently, a significance level or pvalue is reduced to a “significance test” by saying that if the level is greater than 0.05, then the difference is “not significant” and the null hypothesis is “not rejected….The distinction between statistical significance and clinical significance should not be confused.”

Caveats “They should not be interpreted as reflecting the absence of a clinically important difference in true response probabilities.”

Other Confidence Intervals of Interest Response Rate X-year survival rate Median survival Hazard ratio Odds ratios Means of continuous variables

How “Power” fits in -- should you report power? “’Power’ represents the the probability of obtaining a statistically significant results if a difference in efficacy of a specified magnitude actually exists.” “Power takes no account of the actual results obtained.” Example: –Two treatment groups, 50 patients in each. –Observed response rates of 40% in group A and 50% in group B. –The power to detect a difference of 45% to 65% is only 0.57 –But, 95% confidence interval for true difference is -10% to 30% –CAN conclude: unlikely that you will see an improvement of more than 10% for A versus B. –Power calculation (if published with results) would confuse matters.

Survival Example Assume that in an oncology randomized clinical trial where outcome is time to death, median survival in the two arms (new treatment and old treatment) is found to be the same: 6 months. 20 deaths per treatment. Are treatments essentially equivalent, or is trial too small to conclude? A 90% confidence interval for the ratio of median survival in new versus old is 0.59 to So, we can say that, because median survival in old treatment group is 6 months, the possible improvement is about 4 months in the new treatment (6 x 1.68 = 10.1). Conclusion: this small trial is rather conclusive that any improvement in survival associated with new treatment is likely to be small.

Conclusions A difference that is not statistically significant is taken to mean that no real difference exists. The best way to ensure that a nonsignificant difference will be obtained is to use an inadequate sample size. (and vice versa) Opportunities for misinterpretation can be reduced if investigators use confidence intervals in reporting results and interpreting their findings.