Midterm Review Goodness of Fit and Predictive Accuracy

Slides:



Advertisements
Similar presentations
1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
Advertisements

Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
TRIM Workshop Arco van Strien Wildlife statistics Statistics Netherlands (CBS)
1 Hypothesis testing. 2 A common aim in many studies is to check whether the data agree with certain predictions. These predictions are hypotheses about.
Lecture 16: Logistic Regression: Goodness of Fit Information Criteria ROC analysis BMTRY 701 Biostatistical Methods II.
Lecture 22: Evaluation April 24, 2010.
Correlation and regression
Analysis of frequency counts with Chi square
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 25, Slide 1 Chapter 25 Comparing Counts.
PSY 340 Statistics for the Social Sciences Chi-Squared Test of Independence Statistics for the Social Sciences Psychology 340 Spring 2010.
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
An Illustrative Example of Logistic Regression
Example of Simple and Multiple Regression
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.5 Small Sample.
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
How Can We Test whether Categorical Variables are Independent?
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 19 Chi-Squared Test of Independence.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Statistics for clinical research An introductory course.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Correlation.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
Experimental Design making causal inferences Richard Lambert, Ph.D.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. 1.. Section 11-2 Goodness of Fit.
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Slide 1 The SPSS Sample Problem To demonstrate these concepts, we will work the sample problem for logistic regression in SPSS Professional Statistics.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 12 The Analysis of Categorical Data and Goodness-of-Fit Tests.
Hypothesis Testing Hypothesis Testing Topic 11. Hypothesis Testing Another way of looking at statistical inference in which we want to ask a question.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Approximate letter grade assignments ~ D C B 85 & up A.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Testing Hypothesis That Data Fit a Given Probability Distribution Problem: We have a sample of size n. Determine if the data fits a probability distribution.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
1 Risk Assessment Tests Marina Kondratovich, Ph.D. OIVD/CDRH/FDA March 9, 2011 Molecular and Clinical Genetics Panel for Direct-to-Consumer (DTC) Genetic.
Copyright © 2010 Pearson Education, Inc. Slide
© Department of Statistics 2012 STATS 330 Lecture 23: Slide 1 Stats 330: Lecture 23.
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
© Copyright McGraw-Hill 2004
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Intermediate Applied Statistics STAT 460 Lecture 23, 12/08/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu
1 Chi-square Test Dr. T. T. Kachwala. Using the Chi-Square Test 2 The following are the two Applications: 1. Chi square as a test of Independence 2.Chi.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Nonparametric Statistics
BPS - 5th Ed. Chapter 231 Inference for Regression.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Stats Methods at IC Lecture 3: Regression.
BINARY LOGISTIC REGRESSION
Analysis of Covariance (ANCOVA)
Geology Geomath Chapter 7 - Statistics tom.h.wilson
Presentation transcript:

Midterm Review Goodness of Fit and Predictive Accuracy October 28, 2008

Assessing the fit of the model Summary measures of goodness of fit: Pearson chi-square statistic Deviance Akaike criterion Hosmer-Lemeshow statistic (can be used both when data can and cannot be grouped by covariate combinations). Comparing observed values to expected (predicted) values on the individual and grouped level not useful in this setting. Valid only if number of observations large compared to number of covariate patterns.

Assessing the fit of the model Summary goodness of fit measures give an overall indication of the fit of the model. Small value does not rule out possibility of substantial and, thus, interesting deviation from fit for a few subjects, BUT large value is clear indication of substantial problem with model.

Hosmer-Lemeshow goodness of fit test H0: Model fits data well. Ha: Model does not fit data well. To calculate test statistic: Order the fitted values. Group the fitted values into g classes of roughly equal size. Calculate the observed and expected number in each class. Perform a Χ2 goodness of fit test.

Example (H&L, page 150)

Example (H&L, page 150) How are the data grouped in this case? Deciles of estimated probabilities. What is the observed frequency in the drug free group for the fifth decile? 16 How is this value obtained? As the sum of the observed outcomes for the 58 subjects in this group. What is the corresponding estimated expected frequency? 12.7 How is this value obtained? As the sum of the 58 estimated probabilities for these subjects. When fitted logistic model is the correct model, the distribution of the H-L statistic is well-approximated by the chi-square distribution with 8 degrees of freedom. The H-L statistic = 4.39 with p-value 0.820. What can we conclude about the fit of the model? The model seems to fit the data well.

Predictive accuracy What do we mean by predictive accuracy? What are some commonly used measures of predictive accuracy? Sensitivity Specificity Area under the ROC curve (AUC) Why might one prefer to look at ROC curve rather than sensitivity/specificity? Sensitivity and specificity rely on a single cutpoint to classify a test result as positive. ROC curves plot sensitivity versus 1-specificity for all possible cutpoints.

Example: predictive accuracy Dr. C uses logistic regression to model the relationship between esophageal cancer risk and tobacco use, adjusting for known confounders. Dr. C calculates the area under the ROC curve (AUC) for the final model using the complete data. Dr. C speaks to a colleague about her analysis, who suggests that (in the future) she report AUC based on a different sample than was used to fit the model. Do you agree or disagree with her colleague’s suggestion? Why? Agree. We would expect AUC calculated based on the sample used to fit the model to be too optimistic.

Example: predictive accuracy Worried as to the effect this might have had on her results, Dr. C randomly divides the data into two samples of equal size. She re-fits her model using the first half of the data. She then calculates the AUC for both the first half of the data (the estimation sample) and the second half of the data (the validation sample) and obtains the results on the following slide. Can you tell which plot was obtained using the estimation data and which plot was obtained using the validation data? Why or why not? We would expect the AUC to be smaller for the validation sample than for the estimation sample in expectation. This is not guaranteed, however, for an individual dataset. Therefore, we cannot be sure which plot corresponds to the estimation sample and which corresponds to the validation sample.

Example: predictive accuracy

Example: predictive accuracy I tell you that the plot on the right was obtained using the validation sample. Comment on the predictive accuracy of Dr. C’s final model. With AUC = 0.7787, we conclude that the ability of the model to discriminate between subjects who experience the outcome (esophageal cancer) and subjects who do not experience the outcome is acceptable [see general rule on page 162 of H&L].