CT image testing. What is a CT image? CT= computed tomography CT= computed tomography Examines a person in “slices” Examines a person in “slices” Creates.

Slides:



Advertisements
Similar presentations
Power and sample size.
Advertisements

Diagnostic Metrics Week 2 Video 3. Different Methods, Different Measures  Today we’ll continue our focus on classifiers  Later this week we’ll discuss.
Evaluating Classifiers
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Chapter 4 Pattern Recognition Concepts: Introduction & ROC Analysis.
Evaluation of segmentation. Example Reference standard & segmentation.
Diagnostic Tests Patrick S. Romano, MD, MPH Professor of Medicine and Pediatrics Patrick S. Romano, MD, MPH Professor of Medicine and Pediatrics.
Receiver Operating Characteristic (ROC) Curves
The University of Texas at Austin, CS 395T, Spring 2008, Prof. William H. Press 1 Computational Statistics with Application to Bioinformatics Prof. William.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Evaluation of Screening and Diagnostic Tests.
Multiple testing Justin Chumbley Laboratory for Social and Neural Systems Research Institute for Empirical Research in Economics University of Zurich With.
GerstmanChapter 41 Epidemiology Kept Simple Chapter 4 Screening for Disease.
Evaluating Classifiers Lecture 2 Instructor: Max Welling Read chapter 5.
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Darlene Goldstein 29 January 2003 Receiver Operating Characteristic Methodology.
Decision Theory Naïve Bayes ROC Curves
ROC Curves.
Chapter 9.2 ROC Curves How does this relate to logistic regression?
ROC Curve and Classification Matrix for Binary Choice Professor Thomas B. Fomby Department of Economics SMU Dallas, TX February, 2015.
Chapter 9.2 ROC Curves How does this relate to logistic regression?
DATASET INTRODUCTION 1. Dataset: Urine 2 From Cleveland Clinic
Computer Vision Lecture 8 Performance Evaluation.
Evaluating Classifiers
AM Recitation 2/10/11.
BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.
SPH 247 Statistical Analysis of Laboratory Data May 19, 2015SPH 247 Statistical Analysis of Laboratory Data1.
Medical decision making. 2 Predictive values 57-years old, Weight loss, Numbness, Mild fewer What is the probability of low back cancer? Base on demographic.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Overview Basics of Hypothesis Testing
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Indices of Performances of CPRs Nicola.
Performance measurement. Must be careful what performance metric we use For example, say we have a NN classifier with 1 output unit, and we code ‘1 =
StAR web server tutorial for ROC Analysis. ROC Analysis ROC Analysis: This module allows the user to input data for several classifiers to be tested.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Sensitivity Sensitivity answers the following question: If a person has a disease, how often will the test be positive (true positive rate)? i.e.: if the.
Diagnostic Imaging Tests  MRI  CT  Ultrasound  PET scan  Mammography  X-ray.
Experiments in Machine Learning COMP24111 lecture 5 Accuracy (%) A BC D Learning algorithm.
CpSc 810: Machine Learning Evaluation of Classifier.
Sensitivity & Specificity Sam Thomson 8/12/10. Sensitivity Proportion of people with the condition who have a positive test result Proportion of people.
Likelihood 2005/5/22. Likelihood  probability I am likelihood I am probability.
Fitting probability models to frequency data. Review - proportions Data: discrete nominal variable with two states (“success” and “failure”) You can do.
Evaluating Results of Learning Blaž Zupan
1 Statistics and Image Quality Evaluation III Oleh Tretiak Medical Imaging Systems Fall, 2002.
Covariance matrices for all of the classes are identical, But covariance matrices are arbitrary.
Pattern Classification an Diagnostic Decision Yongnan Ji.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
INTRODUCTION TO HYPOTHESIS TESTING From R. B. McCall, Fundamental Statistics for Behavioral Sciences, 5th edition, Harcourt Brace Jovanovich Publishers,
Session 8: Paired Samples (Zar, Chapter 9,24). General: One population of subjects: x 1, x 2, …, x n, but a pair of data points on each. Examples: Before.
Evaluating Classification Performance
Professor William H. Press, Department of Computer Science, the University of Texas at Austin1 Opinionated in Statistics by Bill Press Lessons #50 Binary.
Laboratory Medicine: Basic QC Concepts M. Desmond Burke, MD.
Objective Evaluation of Intelligent Medical Systems using a Bayesian Approach to Analysis of ROC Curves Julian Tilbury Peter Van Eetvelt John Curnow Emmanuel.
Evaluation of Diagnostic Tests & ROC Curve Analysis PhD Özgür Tosun.
ROC curve estimation. Index Introduction to ROC ROC curve Area under ROC curve Visualization using ROC curve.
SCREENING FOR DISEASE. Learning Objectives Definition of screening; Principles of Screening.
Chapter 9, Part E. VII. Calculating the Probability of Type II Errors A common decision in business is whether to accept a shipment or not, based upon.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Sensitivity, Specificity, and Receiver- Operator Characteristic Curves 10/10/2013.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Biostatistics Class 3 Probability Distributions 2/15/2000.
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Accuracy, sensitivity and specificity analysis
7. Performance Measurement
Classification Evaluation And Model Selection
Machine Learning Week 10.
Accuracy, sensitivity and specificity analysis
Experiments in Machine Learning
Patricia Butterfield & Naomi Chaytor October 18th, 2017
More on Maxent Env. Variable importance:
Evidence Based Diagnosis
Presentation transcript:

CT image testing

What is a CT image? CT= computed tomography CT= computed tomography Examines a person in “slices” Examines a person in “slices” Creates density-based images Creates density-based images

Three different window densities Bone- densest only bone shows detail, the rest is too dark to make out Bone- densest only bone shows detail, the rest is too dark to make out Soft tissue- moderate density, bone is white, soft tissue is detailed, and “air space” is too dark Soft tissue- moderate density, bone is white, soft tissue is detailed, and “air space” is too dark Lung- low density, bone and soft tissue white, and “air space” is detailed Lung- low density, bone and soft tissue white, and “air space” is detailed

Bone Window

Soft Tissue Window

Lung Window

What a radiologist has to look at

How diagnoses are decided Look for “abnormalities” in density Look for “abnormalities” in density

Why is this cumbersome?

Our project- dare to compare Test the new “unified LSU CT image” Test the new “unified LSU CT image”

Hypothesis Unified LSU CT image: Unified LSU CT image: Equally accurate Equally accurate Faster Faster

Methods Our test Our test ROC Analysis ROC Analysis ROCKit Software ROCKit Software

ROC Receiver Operating Characteristics Receiver Operating Characteristics Developed in 1950’s Developed in 1950’s Statistical decision theory Statistical decision theory Used in business, economics, etc Used in business, economics, etc Goal: Is to be a meaningful tool for judging the performance of a diagnostic imaging system

Example X  Population X  Population y  healthy z  diseased Gaussian distribution Gaussian distribution Decision boundary Decision boundary Interested in optimum decision boundary Interested in optimum decision boundary

EXAMPLE Non-diseased cases Diseased cases Threshold

EXAMPLE Non-diseased cases Diseased cases more typically:

DECISION BOUNDARY TRUE NEGATIVE TRUE NEGATIVE TRUE POSITIVE TRUE POSITIVE FALSE NEGATIVE FALSE NEGATIVE FALSE POSITIVE FALSE POSITIVE What happens when you place the decision boundary? What happens when you place the decision boundary?

DECISION BOUNDARY Threshold Non- diseased cases Diseased cases FALSE NEGATIVE IN RED TRUE POSITVE IN GREEN

DECISION BOUNDARY Threshold Non-diseased cases Diseased cases FALSE POSITIVE IN WHITE TRUE NEGATIVE IN WHITE

SENSITIVITY TPF Threshold Non-diseased cases Diseased cases SENSITIVITY = TP / TP + FN GREEN /GREEN+RED FNF = 1-TPF

SPECIFICITY TNF Threshold Non-diseased cases Diseased cases SPECIFICITY = TN / TN + FP LARGE WHITE / LARGE WHITE + SMALL WHITE FPF = 1-TNF

PREVALENCE One of the important properties of ROC is that it is prevalence independent One of the important properties of ROC is that it is prevalence independent Low prevalence is rare Low prevalence is rare Example Example ROC IS NOT PREVALENCE DEPENDENT! ROC IS NOT PREVALENCE DEPENDENT!

ROC CURVE TPF, sensitivity FPF, 1-specificity Entire ROC curve

COMPARING CURVES

ROC EXPERIMENT Datasets Datasets Two options Two options BINARY BINARY MULT-SCALE RATING MULT-SCALE RATING Fitting curves Fitting curves

Rockit Who developed Rockit? -Charles E. of Chicago What is Rockit? -curve fitting software calculates points for a Roc curve -calculates max probability of two Gaussian distributions What is a Roc Curve? -describes how good the imaging systems is and accuracy of the given data and viewer’s results

Example patients, 40 negative, 60 positive 100 patients, 40 negative, 60 positiveGiven PPNPNNP… Confidence Rating Confidence Rating 1. Definitely Negative 2. Possibly Negative 3. Not sure 4. Possibly Positive 4. Possibly Positive 5. Definitely Positive This is the viewer’s results …. Now, from the confidence rating we consider 1=N and 2, 3, 4, 5=P to interpret view’s results. Interpretation of viewer’s result PPPPNPP… Comparing given data and interpretation TPTNFPFN (a) TP- disease is present, diagnosed correctly (b) TN-disease is not present, diagnosed correctly (c) FP-person been diagnosed not having the disease, but the disease is present (d) FN-person been diagnosed having the disease, but the disease is not present. Now, we compute the FPF and TPF. In order to compute the FPF, you need this formula FPF=1-TP/TP+FN and TPF=1-TN/TN+FP. Now, we compute the FPF and TPF. In order to compute the FPF, you need this formula FPF=1-TP/TP+FN and TPF=1-TN/TN+FP. Therefore, in our case FPF=1-45/45+13=.224 and TPF=1-25/25+0=0 Therefore, in our case FPF=1-45/45+13=.224 and TPF=1-25/25+0=0 So, our first point is (.224, 0). So, our first point is (.224, 0).

Example 2 Now we can compute a second point on the Roc curve. Now we can compute a second point on the Roc curve. We consider a different confidence rating. We consider a different confidence rating. 1, 2=N and 3, 4, 5=P 1, 2=N and 3, 4, 5=P Results based on viewer’s answer from above PPNPNNNN…. The new Truth table is: The new Truth table is: TPTNFPFN Now the FPF=1-46 / =.245 and TPF= 1-28/ 28+0 = 0 Now the FPF=1-46 / =.245 and TPF= 1-28/ 28+0 = 0 Now, we have our second point (.245, 0) Now, we have our second point (.245, 0) Based upon the viewer’s results, a third and forth point can be created by changing the confidence ratings. Based upon the viewer’s results, a third and forth point can be created by changing the confidence ratings. 1, 2, 3=N and 4, 5=P->third point 1, 2, 3, 4=N and 5=P->forth point Therefore, five confidence ratings give you four points on the Roc curve. Therefore, five confidence ratings give you four points on the Roc curve.

Rockit input For Condition 1: d For Condition 1: d Enter the Total Number of Actually-Negative Cases (an integer): 40 Beginning with category 1 and separated by blanks, Enter (on one line, integers only) the number of responses to Actually- Negative cases in each category: For Condition 1: d For Condition 1: d Enter the Total Number of Actually-Positive Cases (an integer): 60 Beginning with category 1 and separated by blanks, Beginning with category 1 and separated by blanks, Enter (on one line, integers only) the number of responses to Actually- Enter (on one line, integers only) the number of responses to Actually- Positive cases in each category: Positive cases in each category:

Rockit a, b parameters a, b parameters - a represents separation of the Gaussian distributions -b represents width of two Gaussian distributions Open excel template to enter a, b parameters Open excel template to enter a, b parameters Roc curve is produced Roc curve is produced