Chapter 13 Multiple Regression

Slides:

Advertisements

Similar presentations

Chapter 6 Confidence Intervals.

Advertisements

Lesson 10: Linear Regression and Correlation

1 9. Logistic Regression ECON 251 Research Methods.

CHAPTER 6 CONTINUOUS RANDOM VARIABLES AND THE NORMAL DISTRIBUTION Prem Mann, Introductory Statistics, 8/E Copyright © 2013 John Wiley & Sons. All rights.

1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.

Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.

1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.

Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.2 Estimating Differences.

Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)

Chapter 6 Confidence Intervals.

Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.

Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)

1 1 Slide © 2016 Cengage Learning. All Rights Reserved. The equation that describes how the dependent variable y is related to the independent variables.

1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.

1 1 Slide © 2003 Thomson/South-Western Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination.

1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.

1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.

1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.

1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.

Chapter 11: Applications of Chi-Square. Count or Frequency Data Many problems for which the data is categorized and the results shown by way of counts.

CHAPTER 14 MULTIPLE REGRESSION

© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.

Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.

Chapter 10 Lecture 2 Section: We analyzed paired data with the goal of determining whether there is a linear correlation between two variables.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12: Analyzing the Association Between Quantitative Variables: Regression Analysis Section.

1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.

Linear Regression Analysis 5E Montgomery, Peck & Vining 1 Chapter 8 Indicator Variables.

Logistic Regression Applications Hu Lunchao. 2 Contents 1 1 What Is Logistic Regression? 2 2 Modeling Categorical Responses 3 3 Modeling Ordinal Variables.

Active Learning Lecture Slides For use with Classroom Response Systems Association: Contingency, Correlation, and Regression.

© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.

Chapter 13 Multiple Regression

Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.3 Two-Way ANOVA.

Active Learning Lecture Slides For use with Classroom Response Systems Association: Contingency, Correlation, and Regression.

Copyright © 2009 Pearson Education Active Learning Lecture Slides For use with Classroom Response Systems Chapter 3: Association: Contingency, Correlation,

1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.

Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.

Properties of the Binomial Probability Distributions 1- The experiment consists of a sequence of n identical trials 2- Two outcomes (SUCCESS and FAILURE.

Active Learning Lecture Slides For use with Classroom Response Systems Association: Contingency, Correlation, and Regression.

MBF1413 | Quantitative Methods Prepared by Dr Khairul Anuar 8: Time Series Analysis & Forecasting – Part 1

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12: Analyzing the Association Between Quantitative Variables: Regression Analysis Section.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.1 Using Several Variables to Predict a Response.

Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.1 Independence.

Chapter 13 Comparing Groups: Analysis of Variance Methods

Chapter 7. Classification and Prediction

Regression Analysis Module 3.

8. Association between Categorical Variables

Active Learning Lecture Slides For use with Classroom Response Systems

CONTINUOUS RANDOM VARIABLES AND THE NORMAL DISTRIBUTION

John Loucks St. Edward’s University . SLIDES . BY.

Correlation and Regression

Business Statistics Multiple Regression This lecture flows well with

Lecture Slides Elementary Statistics Thirteenth Edition

M248: Analyzing data Block D.

Topic 5: Exploring Quantitative data

Cost estimation and behaviour

Chi Square Two-way Tables

Section 8.1 Day 4.

MBF1413 | Quantitative Methods Prepared by Dr Khairul Anuar

CONTINUOUS RANDOM VARIABLES AND THE NORMAL DISTRIBUTION

Analyzing the Association Between Categorical Variables

CHAPTER 14 MULTIPLE REGRESSION

Honors Statistics The Standard Deviation as a Ruler and the Normal Model Chapter 6 Part 3.

Correlation and Regression Lecture 1 Sections: 10.1 – 10.2

Regression and Categorical Predictors

Presentation transcript:

Chapter 13 Multiple Regression Section 13.6 Modeling a Categorical Response

Modeling a Categorical Response Variable The regression models studied so far are designed for a quantitative response variable y. When y is categorical, a different regression model applies, called logistic regression.

Examples of Logistic Regression A voter’s choice in an election (Democrat or Republican), with explanatory variables: annual income, political ideology, religious affiliation, and race. Whether a credit card holder pays their bill on time (yes or no), with explanatory variables: family income and the number of months in the past year that the customer paid the bill on time.

The Logistic Regression Model Denote the possible outcomes for y as 0 and 1. Use the generic terms failure (for outcome = 0), and success (for outcome =1). The population mean of the scores equals the population proportion of ‘1’ outcomes (successes). That is, The proportion, p, also represents the probability that a randomly selected subject has a successful outcome.

The Logistic Regression Model The straight-line model is usually inadequate when there are multiple explanatory variables. A more realistic model has a curved S-shape instead of a straight-line trend. The regression equation that best models this S- shaped curve is known as the logistic regression equation.

The Logistic Regression Model Figure 13.10 Two Possible Regressions for a Probability p of a Binary Response Variable. A straight line is usually less appropriate than an S-shaped curve. Question: Why is the straight-line regression model for a binary response variable often poor?

The Logistic Regression Model A regression equation for an S-shaped curve for the probability of success p is: This equation for p is called the logistic regression equation. Logistic regression is used when the response variable has only two possible outcomes (it’s binary).

Example: Travel Credit Cards An Italian study with 100 randomly selected Italian adults considered factors that are associated with whether a person possesses at least one travel credit card. The table 13.12 on the next slide shows results for the first 15 people on this response variable and on the person’s annual income (in thousands of euros).

Example: Travel Credit Cards Table 13.12 Annual Income (in thousands of euros) and Whether Possess a Travel Credit Card. The response y equals 1 if a person has a travel credit card and equals 0 otherwise.

Example: Travel Credit Cards Let x = annual income and let y = whether the person possesses a travel credit card (1 = yes, 0 = no). Table 13.13 shows what software provides for conducting a logistic regression analysis. Table 13.13 Results of Logistic Regression for Italian Credit Card Data

Example: Travel Credit Cards Substituting the and estimates into the logistic regression model formula yields:

Example: Travel Credit Cards Find the estimated probability of possessing a travel credit card at the lowest and highest annual income levels in the sample, which were x = 12 and x = 65.

Example: Travel Credit Cards For x = 12 thousand euros, the estimated probability of possessing a travel credit card is:

Example: Travel Credit Cards For x = 65 thousand euros, the estimated probability of possessing a travel credit card is:

Example: Travel Credit Cards Insight: Annual income has a strong positive effect on having a credit card. The estimated probability of having a travel credit card changes from 0.09 to 0.97 as annual income changes over its range.

Example: Estimating Proportion of Students Who’ve Used Marijuana A three-variable contingency table from a survey of senior high-school students is shown on the next slide. The students were asked whether they had ever used: alcohol, cigarettes or marijuana. We’ll treat marijuana use as the response variable and cigarette use and alcohol use as explanatory variables.

Example: Estimating Proportion of Students Who’ve Used Marijuana Table 13.14 Alcohol, Cigarette, and Marijuana Use for High School Seniors

Example: Estimating Proportion of Students Who’ve Used Marijuana Let y indicate marijuana use, coded: (1 = yes, 0 = no) Let be an indicator variable for alcohol use, coded (1 = yes, 0 = no) Let be an indicator variable for cigarette use, coded (1 = yes, 0 = no)

Example: Estimating Proportion of Students Who’ve Used Marijuana Table 13.15 MINITAB Output for Estimating the Probability of Marijuana Use Based on Alcohol Use and Cigarette Use

Example: Estimating Proportion of Students Who’ve Used Marijuana The logistic regression prediction equation is:

Example: Estimating Proportion of Students Who’ve Used Marijuana For those who have not used alcohol or cigarettes, . For them, the estimated probability of marijuana use is

Example: Estimating Proportion of Students Who’ve Used Marijuana For those who have used alcohol and cigarettes, . For them, the estimated probability of marijuana use is

Example: Estimating Proportion of Students Who’ve Used Marijuana SUMMARY: The probability that students have tried marijuana seems to depend greatly on whether they’ve used alcohol and/or cigarettes.