Lecture 20 – Tues., Nov. 18th Multiple Regression: –Case Studies: Chapter 9.1 –Regression Coefficients in the Multiple Linear Regression Model: Chapter.

Slides:



Advertisements
Similar presentations
Lecture 17: Tues., March 16 Inference for simple linear regression (Ch ) R2 statistic (Ch ) Association is not causation (Ch ) Next.
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Class 16: Thursday, Nov. 4 Note: I will you some info on the final project this weekend and will discuss in class on Tuesday.
July 1, 2008Lecture 17 - Regression Testing1 Testing Relationships between Variables Statistics Lecture 17.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Lecture 15 – Tues., Oct. 28 Review example of one-way layout Simple Linear Regression: –Simple Linear Regression Model, 7.2 –Least Squares Regression Estimation,
Chapter 10 Simple Regression.
Lecture 23: Tues., Dec. 2 Today: Thursday:
Lecture 22: Thurs., April 1 Outliers and influential points for simple linear regression Multiple linear regression –Basic model –Interpreting the coefficients.
Class 15: Tuesday, Nov. 2 Multiple Regression (Chapter 11, Moore and McCabe).
Chapter 12 Simple Regression
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Lecture 23: Tues., April 6 Interpretation of regression coefficients (handout) Inference for multiple regression.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Class 3: Thursday, Sept. 16 Reliability and Validity of Measurements Introduction to Regression Analysis Simple Linear Regression (2.3)
Lecture 6 Notes Note: I will homework 2 tonight. It will be due next Thursday. The Multiple Linear Regression model (Chapter 4.1) Inferences from.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Lecture 21 – Thurs., Nov. 20 Review of Interpreting Coefficients and Prediction in Multiple Regression Strategy for Data Analysis and Graphics (Chapters.
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Correlation and Regression Analysis
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Regression and Correlation Methods Judy Zhong Ph.D.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Linear Trend Lines = b 0 + b 1 X t Where is the dependent variable being forecasted X t is the independent variable being used to explain Y. In Linear.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Introduction to Linear Regression
Applied Quantitative Analysis and Practices LECTURE#22 By Dr. Osman Sadiq Paracha.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Statistical Methods Statistical Methods Descriptive Inferential
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Lecture 10: Correlation and Regression Model.
ECON 338/ENVR 305 CLICKER QUESTIONS Statistics – Question Set #8 (from Chapter 10)
Week 101 ANOVA F Test in Multiple Regression In multiple regression, the ANOVA F test is designed to test the following hypothesis: This test aims to assess.
Correlation & Regression Analysis
Chapter 10: Determining How Costs Behave 1 Horngren 13e.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
Stat 112 Notes 6 Today: –Chapter 4.1 (Introduction to Multiple Regression)
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
Chapter 11: Linear Regression and Correlation Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Stat 112 Notes 8 Today: –Chapters 4.3 (Assessing the Fit of a Regression Model) –Chapter 4.4 (Comparing Two Regression Models) –Chapter 4.5 (Prediction.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Lecture 6 Feb. 2, 2015 ANNOUNCEMENT: Lab session will go from 4:20-5:20 based on the poll. (The majority indicated that it would not be a problem to chance,
The simple linear regression model and parameter estimation
Chapter 11: Linear Regression and Correlation
Simple Linear Regression
Simple Linear Regression - Introduction
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
CHAPTER 29: Multiple Regression*
Rainfall Example The data set contains cord yield (bushes per acre) and rainfall (inches) in six US corn-producing states (Iowa, Nebraska, Illinois, Indiana,
Indicator Variables Often, a data set will contain categorical variables which are potential predictor variables. To include these categorical variables.
Product moment correlation
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
Presentation transcript:

Lecture 20 – Tues., Nov. 18th Multiple Regression: –Case Studies: Chapter 9.1 –Regression Coefficients in the Multiple Linear Regression Model: Chapter 9.2 –JMP Output: Chapter Office Hours: Today and Thursday after class, tomorrow (Wednesday) instead of 1:30-2:30 or by appointment.

Midterm 2 Scores

Multiple Regression Multiple Regression: Seeks to estimate the mean of Y given multiple explanatory variables X 1,…,X p, denoted by Examples: –Y=1 st year GPA, X 1 =Math SAT, X 2 =Verbal SAT, X 3 =High School GPA –Y=Sales price of house, X 1 =Square Footage, X 2 =Number of Rooms Uses of Regression Analysis –Describe association between mean of Y and X 1,…,X p ; describe association between mean of Y and X 1 after taking into account X 2,…,X p. –Passive Prediction: Predict Y based on X 1,…,X p. –Control: Predict what Y will be if you change X 1,…,X p.

Multiple Linear Regression Model There is a normally distributed subpopulation of responses for each combination of the explanatory variables with The observations are independent of one another.

Case Study Meadowfoam is a small plant found growing in moist meadows in Northwest. Researchers conducted a randomized experiment to find out how to elevate meadowfoam production In a controlled growth chamber, they focused on the effects of two-light related factors: light intensity and timing of onset of light treatment. Light intensity levels: 150,300,450,600,750,900 Timing of onset: Early, Late

Case Study Cont. Variables: –Y = average number of flowers per meadowfoam plant –X 1 =light intensity –X 2 =1 if late timing, 0 if early timing Multiple Linear Regression Model:

Interpretation of Coefficients = the change in the mean of y that is associated with a one unit increase in where is held fixed. = mean of y when

Coefficients in Meadowfoam Study For meadowfoam study: – = change in mean flowers per plant associated with 1 increase in light intensity for fixed time of onset – = change in mean flowers per plant associated with switching from late to early onset for fixed light intensity.

Estimation of Multiple Linear Regression Model The coefficients are estimated by choosing to make the sum of squared prediction errors as small as possible, i.e., choose to minimize Predicted value of y given x 1,…,x p : = SD(Y|X 1,…,X p ), estimated by = root mean square error

Multiple Linear Regression in JMP Analyze, Fit Model Put response variable in Y Click on explanatory variables and then click Add under Construct Model Effects Click Run Model.

JMP Output from Meadowfoam Study

Reading JMP Output Estimated multiple linear regression model:. Approximately 95% of flowers per plant will lie within 2*6.44 =12.88 flowers per plant of p-values for coefficients indicate that there is strong evidence that higher light intensity is associated with less flowers per plant on average for fixed time onset and that early time onset is associated with more flowers per plant on average for fixed light intensity.

Case Study What characteristics are associated with bigger brain size after accounting for body size, i.e., what characteristics are associated with bigger brain size holding body size fixed? Y=brain weight, X 1 =body weight, X 2 =gestation period, X 3 =litter size Multiple Linear Regression Model

Interpretation in Randomized Experiments vs. Obs. Studies Randomized Experiments: Interpretation of an “effect” of an explanatory variable is straightforward and causation is implied. Example: “A 1-unit increase in light intensity causes the mean number of flowers to increase by “ Observational Studies: Cannot make causal conclusions from statistical association. “For any subpopulation of mammal species with the same body weight and litter size, a 1-day increase in the species’ gestation length is associated with a - gram increase in mean brain weight.” Interpretation is only useful if subpopulation of mammals with fixed values of body weight and litter size, but varying gestation lengths, exist.

Interpreting Coefficients Interpretation depends on what other X’s are included. – measures rates of change in mean brain weight with changes in gestation length in population of all mammal species where body size is variable – measures the rate of change in mean brain weight with changes in gestation length within subpopulations of fixed body size.