BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.

Slides:



Advertisements
Similar presentations
Validity and Reliability of Analytical Tests. Analytical Tests include both: Screening Tests Diagnostic Tests.
Advertisements

Sample size estimation
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Statistical Issues in Research Planning and Evaluation
Statistical Decision Making
Azita Kheiltash Social Medicine Specialist Tehran University of Medical Sciences Diagnostic Tests Evaluation.
CRITICAL APPRAISAL Dr. Cristina Ana Stoian Resident Journal Club
1. Estimation ESTIMATION.
Assessing and Comparing Classification Algorithms Introduction Resampling and Cross Validation Measuring Error Interval Estimation and Hypothesis Testing.
Epidemiology in Medicine Sandra Rodriguez Internal Medicine TTUHSC.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
PSY 307 – Statistics for the Behavioral Sciences
Statistics for the Social Sciences Psychology 340 Fall 2006 Review For Exam 1.
Darlene Goldstein 29 January 2003 Receiver Operating Characteristic Methodology.
Statistical Fridays J C Horrow, MD, MSSTAT
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Statistics for Health Care
The t Tests Independent Samples.
Inferential Statistics
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
Descriptive statistics Inferential statistics
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Chapter 8 Introduction to Hypothesis Testing
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
SPH 247 Statistical Analysis of Laboratory Data May 19, 2015SPH 247 Statistical Analysis of Laboratory Data1.
OKU 9 Chapter 15: ORTHOPAEDIC RESEARCH Brian E. Walczak.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Basic statistics 11/09/13.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Indices of Performances of CPRs Nicola.
Statistics for Health Care Biostatistics. Phases of a Full Clinical Trial Phase I – the trial takes place after the development of a therapy and is designed.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Instructor Resource Chapter 5 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
No criminal on the run The concept of test of significance FETP India.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
MEASURES OF TEST ACCURACY AND ASSOCIATIONS DR ODIFE, U.B SR, EDM DIVISION.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Ch 10 – Intro To Inference 10.1: Estimating with Confidence 10.2 Tests of Significance 10.3 Making Sense of Statistical Significance 10.4 Inference as.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
Diagnostic Test Characteristics: What does this result mean
© Copyright McGraw-Hill 2004
Screening.  “...the identification of unrecognized disease or defect by the application of tests, examinations or other procedures...”  “...sort out.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Intro to Psychology Statistics Supplement. Descriptive Statistics: used to describe different aspects of numerical data; used only to describe the sample.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Laboratory Medicine: Basic QC Concepts M. Desmond Burke, MD.
1 Probability and Statistics Confidence Intervals.
Chapter 13 Understanding research results: statistical inference.
CHAPTER 3 Key Principles of Statistical Inference.
Uses of Diagnostic Tests Screen (mammography for breast cancer) Diagnose (electrocardiogram for acute myocardial infarction) Grade (stage of cancer) Monitor.
More on Inference.
Probability and Statistics
Understanding Results
Randomized Trials: A Brief Overview
Measuring Success in Prediction
More on Inference.
Chapter 9: Hypothesis Tests Based on a Single Sample
Patricia Butterfield & Naomi Chaytor October 18th, 2017
Interpreting Epidemiologic Results.
Statistical Power.
Presentation transcript:

BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012

OUTLINE Definition Why do we need to know “statistics”? – Validity- external and internal – Reliability – Sensitivity – Specificity – Positive and negative predictive values ROC curves

STATISTICS “the science that deals with the collection, classification, analysis, and interpretation of numerical facts or data, and that, by use of mathematical theories of probability, imposes order and regularity on aggregates of more or less disparate elements”

BUT WHY?

To effectively conduct research To read and evaluate journals To develop critical and analytical thinking To be an informed consumer To know your limits

Population Sample Treatment group A vs Treatment group B Inference Sampling

POPULATION Target Population Eg smokers Study Population (eligible) Eg Ages yrs, ≥ 30 pack years Study Sample (enrolled) Eg geographic area, site, consent Target Population Study Population Study Sample

Mean, Median and Mode

Measures of dispersion Range Standard deviation- Bell’s curve Others

NORMAL DISTRIBUTION

VALIDITY AND RELIABILITY-quality of measurements

VALIDITY AND RELIABILITY

GENERALIZABILITY Also “External Validity” how representative of the target population is the study population and study sample? Based on inclusion/exclusion criteria and standard of care Where When Who (subjects) What (treatments, outcomes) How (assessments, data collection)

INTERNAL VALIDITY or no “bias” Estimated effect in study sample is accurate reflection of true effect in underlying study population Can be in jeopardy due “Systematic Error” – Selection or allocation bias – Information or recall bias or treatment related bias – Confounding – Cross over or lost to follow up Deal with this by creating a good study design and statistics Also- randomize, blind, quality control, adjust for co- variates

RANDOMIZATION Balances groups on known and unknown risk factors in the long run Provides statistical basis for estimation and testing Effort, time, cost Ethics – patient wants “best” treatment – physician wants to tailor treatment to individual patient Clinical Equipoise?

HYPOTHESIS TESTING

H 0 : Null Hypothesis--There is no association between an exposure and outcome H 1 : Alternative Hypothesis--There is an association between an exposure and outcome In general, epidemiologists attempt to reject the null One-sided (higher or lower) vs two-sided (different)

H 0 True (No Association) H 1 True (There is an Association) Do Not Reject H 0 Correct Decision Type II Error ( β ) Reject H 0 Type I Error ( α ) Correct Decision TRUTH DECISIONDECISION

Type I error (false positive, α) probability of rejecting null hypothesis, when it is true Type II error (false negative, β) probability of not rejecting null hypothesis, when it is false Power (1-β) probability of rejecting null hypothesis, when it is false

POWER AND SAMPLE SIZE Ensure study worth doing and feasible Have to consider – study design – outcome (distribution, variability, etc.) – hypothesis to be tested (superiority, equivalence, etc.) – statistical analysis to be performed – clinically relevant/meaningful effect size to be detected Larger the effect size, smaller the sample size needed More subjects more power!

Measures of diagnostic test performance

SENSITIVITY A test’s ability to identify diseased individuals Probability of a positive test among those with disease

SPECIFICITY A test’s ability to rule out disease Probability of a negative test among those without disease

SENSITIVITY AND SPECIFICITY ARE FIXED ASPECTS OF A TEST DO NOT CHANGE BASED ON DISEASE PREVALENCE

Disease +Disease -Total Test +TPFPTP+FP Test -FNTNFN+TN TotalTP+FNFP+TN TP+FP+FN+TN Sensitivity = True Positive /Those with disease = TP/(TP+FN) Specificity = True Negative/Those without disease = TN/(TN+FP)

EXAMPLE 200 patients D-dimer positive in 50 CT angio positive in 60 CT angio and d-dimer negative in 120 Construct a 2 X 2 table and calculate sensitivity and specificity.

PENo PE D-dimer positive50 D-dimer negative

PENo PE D-dimer positive D-dimer negative

PENo PE D-dimer positive D-dimer negative Sensitivity= TP/TP+FN= 30/30+30=50% Specificity= TN/TN+FP= 120/120+20= 85%

POSITIVE PREDICTIVE VALUE The probability of having disease given a positive test: P(D+|T+) ‘How likely is it that this patient has the disease given that the test result is positive?’ Calculated as: (TRUE POSITIVE)/(TRUE POSITIVE + FALSE POSITIVE) Sensitivity is the probability of test positive given disease: P(T+|D+)

Disease +Disease -Total Test Test Total What is the PPV?

NEGATIVE PREDICTIVE VALUE The probability of not having disease given a negative test: P(D-|T-) ‘How likely is it that this patient does not have the disease given that the test result is negative?’ Calculated as: (TRUE NEGATIVE)/(TRUE NEGATIVE + FALSE NEGATIVE) Specificity is the probability of testing negative given no disease: P(T-|D-)

Disease +Disease -Total Test Test Total What is the NPV?

Sensitivity and specificity are characteristics of tests themselves Predictive values (positive and negative) are functions of both test characteristics and prevalence

EFFECT OF PREVALENCE 4000 people, of which 2000 are ill and 2000 well A screening test with 99% sensitivity and 99% specificity will yield 1980 TP and 1980 TN and only 20 each for FP and FN Hence PPV is 99%. What if the number of ill people is only 200? Then well people are 3800 and FP increase to 38 and PPV falls to 84%

Disease +Disease - Test Test Disease +Disease - Test Test

ROC Receiver operator characteristic curves are a plot of false positives against true positives for all cut-off values The area under the curve of a perfect test is 1.0 and that of a useless test, no better than tossing a coin, is 0.5