Intermediate Applied Statistics STAT 460 Lecture 23, 12/08/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu

Slides:



Advertisements
Similar presentations
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Advertisements

1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Chapter 8 – Logistic Regression
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Midterm Review Goodness of Fit and Predictive Accuracy
Stat 217 – Day 27 Chi-square tests (Topic 25). The Plan Exam 2 returned at end of class today  Mean.80 (36/45)  Solutions with commentary online  Discuss.
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Chapter 8 Introduction to Hypothesis Testing
1 Chapter 20 Two Categorical Variables: The Chi-Square Test.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Chapter 13: Inference in Regression
Chapter 10 Hypothesis Testing
Confidence Intervals and Hypothesis Testing - II
Fundamentals of Hypothesis Testing: One-Sample Tests
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Psy B07 Chapter 8Slide 1 POWER. Psy B07 Chapter 8Slide 2 Chapter 4 flashback  Type I error is the probability of rejecting the null hypothesis when it.
The schedule moving forward… Today – Evaluation Research – Remaining time = start on quantitative data analysis Thursday – Methodology section due – Quantitative.
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
CHAPTER 14 MULTIPLE REGRESSION
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory variables.
Chapter 26 Chi-Square Testing
Chi-Square Procedures Chi-Square Test for Goodness of Fit, Independence of Variables, and Homogeneity of Proportions.
Section 9-1: Inference for Slope and Correlation Section 9-3: Confidence and Prediction Intervals Visit the Maths Study Centre.
Analyze Improve Define Measure Control L EAN S IX S IGMA L EAN S IX S IGMA Chi-Square Analysis Chi-Square Analysis Chi-Square Training for Attribute Data.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Intermediate Applied Statistics STAT 460 Lecture 17, 11/10/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.
© Department of Statistics 2012 STATS 330 Lecture 20: Slide 1 Stats 330: Lecture 20.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Intermediate Applied Statistics STAT 460 Lecture 18, 11/10/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu
Intermediate Applied Statistics STAT 460 Lecture 20, 11/19/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/20/12 Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 13- Inference For Tables: Chi-square Procedures Section Test for goodness of fit Section Inference for Two-Way tables Presented By:
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Chapter 13 Understanding research results: statistical inference.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
A.P. STATISTICS EXAM REVIEW TOPIC #2 Tests of Significance and Confidence Intervals for Means and Proportions Chapters
The Chi-Square Distribution  Chi-square tests for ….. goodness of fit, and independence 1.
AP Stats Check In Where we’ve been… Chapter 7…Chapter 8… Where we are going… Significance Tests!! –Ch 9 Tests about a population proportion –Ch 9Tests.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Test of Goodness of Fit Lecture 41 Section 14.1 – 14.3 Wed, Nov 14, 2007.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
CHAPTER 29: Multiple Regression*
Hand in your Homework Assignment.
Chapter 9 Hypothesis Testing: Single Population
Presentation transcript:

Intermediate Applied Statistics STAT 460 Lecture 23, 12/08/2004 Instructor: Aleksandra (Seša) Slavković TA: Wang Yu

Revised schedule Nov 8 lab on 2-way ANOVANov 10 lecture on two-way ANOVA and blocking Post HW9 Nov 12 lecture repeated measure and review Nov 15 lab on repeated measuresNov 17 lecture on categorical data/logistic regression HW9 due Post HW10 Nov 19 lecture on categorical data/logistic regression Nov 22 lab on logistic regression & project II introduction No class Thanksgiving No class Thanksgiving Nov 29 labDec 1 lecture HW10 due Post HW11 Dec 3 lecture and Quiz Dec 6 labDec 8 lecture HW 11 due Dec 10 lecture & project II due Dec 13 Project II due

Project II: extension of due data  DUE by Monday, Dec. 13 by 11am  Location: 412 Thomas Building (my office, there will be an envelope/box marked for drop off)  (1) You MUST send your data file by Wed. Dec. 8, 2004 via to TA and me  (2) You MUST turn in TWO HARD copies of your report  (3) You are welcome to turn in the project earlier. If you do so, but wish to submit a newer version by the final deadline, make sure that you clearly mark the most recent version  (4) IMPORTANT: I will NOT accept any late projects. Deadline is 11am! At 11:10am the projects will be collected and by 11:30am you will get an notifying you ONLY IF I DO NOT have a copy of your project indicating that you will receive zero points (so please do NOT wait until 10:55am to print the final version as something always goes wrong the last minute :-) -- so plan ahead!)  (5) I hope to have project graded and final grades assigned by Wed. Dec. 15.

This Lecture  Review: Model Fit Significance of the coefficients Model Selection  Prediction  HW10 back  HW11 turn-in  Course Evaluation

Model fit  Read notes in HW 11  Chapters 20 and 21 textbook  Lecture notes handouts from the course website

The deviance goodness-of-fit statistics  For testing the overall fit (adequacy) of the model  The deviance statistics has an approximate chi-square distribution with n-p degrees of freedom Think of n is the number of cells in a table (or number of observations), and p the number of parameters in the model  Null hypothesis: the model we are testing fits the data well  Alternative hypothesis: a different/more structure is needed to adequately model the outcome  Large p-value indicates that the model is adequate

The deviance goodness-of-fit statistics  SAS: Regression/Logistic/Statistics/Goodness-of-it  If the proportions are too small (e.g. counts per group less than 5) this measure could be misleading  CAUTION: when have continuous explanatory variables the number of groups is very large (every unique value of the continues variable will create a new cell) so the above condition is rarely met  Then, obtain Hosmer-Lemeshow Statistics (large p-value indicates good model) Or take the difference of log likelihoods (-2 Log L in the output under BETA=0 in SAS) for two models. This difference is approximately chi-squared with degrees of freedom equal to the difference in the number of parameters of the two models.

Significance of coefficients  For each single coefficient the software gives an estimate of the coefficient with its standard error and the p-value.  Null hypothesis: there is no relationship between the outcome and the explanatory variable  Alternative hypothesis: there is a strong relationship  Low p-value indicates that the predictor is significant (keep it in the model)

Model comparison  Nested models Take a difference of the Deviances and the degrees of freedom The new statistics also follows chi-square distribution Low p-value indicates a significant difference between the two models; and typically want the simpler model  Non-nested models In SAS look at AIC for example The lower value indicates a better model  In SAS: Logistic/Model/Selection

Prediction/Classification Tests  Handouts  ap39/sect49.html  The purpose of prediction test/classification is to determine if a person/unit/object belongs to the group with a specific characteristic.  Some applications: Drug use Exposure to a disease Pre-employment polygraph testing Survival or not

Prevalence  Widespread or a dominance of persons with a specific character in a tested population  Example: prevalence of persons with a specific character in a population of interest.  For example, let D represent a class of people with a character (or a disease)  For example, Let S denote a membership to the group D, and Š a non-membership, as indicated by the test result.   =P(D)

Accuracy  Sensitivity The probability that a person with the specific characteristic is correctly classified  =P[S|D]  Specificity The probability that a person who does NOT have a specific characteristic is correctly classified  =P[Š|Ď]

 Predictive value of a positive test (PVP) The conditional probability that a person whom test indicates belongs to a certain class/group actually does. P[D|S]  Predictive value of a negative test (PVN) The conditional probability that a person whom test indicates does NOT belong to a certain group actually does NOT belong. P[Ď | Š]

 False positive Mistakenly classify someone as with the characteristic P[Ď|S] 1- PVP = 1- P[D|S]  False negative Mistakenly identify someone without the characteristic P[D|Š] 1- PVN = 1 – P[Ď| Š]

ROC = Receiver Operating Characteristics  Handout_Accuracy.pdf  Handout_ROC_Titanic.doc  In SAS Logistic/Statistics/Classification Table  E.g for a single table with cutoff probability at 0.5 enter: From 0.5 to 0.5 Logistic/Plot/ROC curve Logistic/Prediction/Predict New Data

Commands in SAS  To create contingency tables, calculate chi-square statistic, etc… Statistics/Table Analysis  To run the logistic regression Statistics/Regression/Logistic

Lessons from the course  Overview/summary Lecture22.pdf  You’ve learned something if you understand some basic principles/concepts of statistics (e.g. difference between sample and population, statistics and parameters,…) If you understand the use of some methods we covered in class If you understand that the data analysis / interpretation is done within a context of a problem(s)/question(s) If you can pick up a book and learn partial (or fully) on your own how to apply a method not covered in class

Next Lecture  Presentation by Deet and Bill  Course wrap-up  Quiz grades  Project II questions/turn-in