Class 27 Example: Height and Weight

Slides:



Advertisements
Similar presentations
Class 28 Get Ready…..
Advertisements

Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Simple Regression Model
By: Jacob Kemble Matt Kelly Taylor Shannon  We realize that alcohol can play a huge role on the performance of a college student. We conducted a survey.
Linear Regression Using Excel 2010 Linear Regression Using Excel ® 2010 Managerial Accounting Prepared by Diane Tanner University of North Florida Chapter.
Overview Correlation Regression -Definition
Multiple Regression Analysis
1 Regression Analysis Modeling Relationships. 2 Regression Analysis Regression Analysis is a study of the relationship between a set of independent variables.
Chapter 15 Multiple Regression. Regression Multiple Regression Model y =  0 +  1 x 1 +  2 x 2 + … +  p x p +  Multiple Regression Equation y = 
Chapter 13 Multiple Regression
Chapter 14 Introduction to Linear Regression and Correlation Analysis
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
St. Louis City Crime Analysis 2015 Homicide Prediction Presented by: Kranthi Kancharla Scott Manns Eric Rodis Kenneth Stecher Sisi Yang.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Chapter 12 Multiple Regression
1 An example. 2 AirlinePercentage on time Complaints Southwest Continental Northwest US Airways United American
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation Analysis
Linear Regression Example Data
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Business Statistics: Communicating with Numbers By Sanjiv Jaggia.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
 Independent X – variables that take on only a limited number of values are termed categorical variables, dummy variables, or indicator variables. 
Quantitative Demand Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Extending that Line into the Future St. Louis CMG February 12, 2008 Wayne Bell – UniGroup, Inc.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Measures of relationship Dr. Omar Al Jadaan. Agenda Correlation – Need – meaning, simple linear regression – analysis – prediction.
You want to examine the linear dependency of the annual sales of produce stores on their size in square footage. Sample data for seven stores were obtained.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Regression. Population Covariance and Correlation.
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
Class 23 The most over-rated statistic The four assumptions The most Important hypothesis test yet Using yes/no variables in regressions.
ANOVA for Regression ANOVA tests whether the regression model has any explanatory power. In the case of simple regression analysis the ANOVA test and the.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Discussion of time series and panel models
Regression Analysis Relationship with one independent variable.
Welcome to Econ 420 Applied Regression Analysis Study Guide Week Four Ending Wednesday, September 19 (Assignment 4 which is included in this study guide.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Tutorial 4 MBP 1010 Kevin Brown. Correlation Review Pearson’s correlation coefficient – Varies between – 1 (perfect negative linear correlation) and 1.
Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations10 ANOVA dfSSMSF Regression
Inference with computer printouts. Coefficie nts Standard Errort StatP-value Lower 95% Upper 95% Intercept
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Simple Linear Regression In the previous lectures, we only focus on one random variable. In many applications, we often work with a pair of variables.
Lecture 10: Correlation and Regression Model.
Linear Discriminant Analysis (LDA). Goal To classify observations into 2 or more groups based on k discriminant functions (Dependent variable Y is categorical.
 Input parameters 1, 2, …, n  Values of each denoted X 1, X 2, X n  For each setting of X 1, X 2, X n observe a Y  Each set (X 1, X 2, X n,Y) is one.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Correlation. Up Until Now T Tests, Anova: Categories Predicting a Continuous Dependent Variable Correlation: Very different way of thinking about variables.
Yvette Garcia, PharmD, BCPS 1 st Year Executive Administration Program.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Business Statistics, 4e by Ken Black Chapter 14 Multiple Regression Analysis.
Class 22. Understanding Regression EMBS Part of 12.7 Sections 1-3 and 7 of Pfeifer Regression note.
Real Estate Sales Forecasting Regression Model of Pueblo neighborhood North Elizabeth Data sources from Pueblo County Website.
CHAPTER 2: Basic Summary Statistics
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Regression Modeling Applications in Land use and Transport.
Inference with Computer Printouts. Leaning Tower of Pisa Find a 90% confidence interval. Year Lean
Tutorial 5 Thursday February 14 MBP 1010 Kevin Brown.
Samantha Bellah Adv. Stats Final Project Real Estate Forecasting Regression Model Market: Highland Park Neighborhood Data Sources: Zillow.com E:\PuebloRESales2014Q1Q2.xlsx.
Chard Appliances Becca Carlson Bethany Haefner Jack Lim Liang Wen.
Simple linear regression and correlation Regression analysis is the process of constructing a mathematical model or function that can be used to predict.
REGRESSION REVISITED. PATTERNS IN SCATTER PLOTS OR LINE GRAPHS Pattern Pattern Strength Strength Regression Line Regression Line Linear Linear y = mx.
Relationship with one independent variable
Simple Linear Regression
Relationship with one independent variable
CHAPTER 2: Basic Summary Statistics
Presentation transcript:

Class 27 Example: Height and Weight Case: Colonial Broadcasting (HBS: 9-894-011)

Heights and Weights of n=30 11-year-old girls CM Inches KG 135 53 26 146 57 33 153 60 55 154 61 50 139 32 131 52 25 149 59 44 137 54 31 143 56 36 35 141 28 136 151 48 155 133 34 164 65 47 37 46 147 58 152 140 42 148 29 30 144.800 57.067 36.167 Al used the regression of KG on CM to forecast the weight of a girl 144.8 cm tall. Al’s point forecast was _______________ Sample Means

Any three can be used to find the fourth. The regression line 𝑌 = 𝑎 + 𝑏 𝑋 𝑌 = 𝑎 + 𝑏 𝑋 Always goes thru 𝑋 , 𝑌 Any three can be used to find the fourth.

Heights and Weights of n=30 11-year-old girls CM Inches KG 135 53 26 146 57 33 153 60 55 154 61 50 139 32 131 52 25 149 59 44 137 54 31 143 56 36 35 141 28 136 151 48 155 133 34 164 65 47 37 46 147 58 152 140 42 148 29 30 144.800 57.067 36.167 Bo regressed KG on inches. Which model will be the better predictor of KG? Al’s Bo’s They should give identical results. Sample Means

Regression Statistics Multiple R 0.742 R Square 0.551 Adj R Square AL   Regression Statistics Multiple R 0.742 R Square 0.551 Adj R Square 0.535 Standard Error 5.248 Observations 30 ANOVA df SS MS F Sig F Regression 1 946.892 34.376 0.000003 Residual 28 771.274 27.546 Total 29 1718.167 Coefficients t Stat P-value Intercept -71.371 18.366 -3.886 0.001 CM 0.743 0.127 5.863 BO   Regression Statistics Multiple R 0.720 R Square 0.518 Adj R Square 0.501 Standard Error 5.439 Observations 30 ANOVA df SS MS F Sig F Regression 1 889.874 30.082 0.000007 Residual 28 828.293 29.582 Total 29 1718.167 Coefficients t Stat P-value Intercept -66.701 18.782 -3.551 0.001 Inches 1.803 0.329 5.485

Regression Statistics What if we use both?? SUMMARY OUTPUT   Regression Statistics Multiple R 0.760 R Square 0.577 Adj R Square 0.546 Standard Error 5.187 Observations 30 ANOVA df SS MS F Sig F Regression 2 991.748 495.874 18.431 8.967E-06 Residual 27 726.419 26.904 Total 29 1718.167 Coefficients t Stat P-value Intercept -72.836 18.187 -4.005 0.0004 CM 2.180 1.121 1.946 0.0621 Inches -3.623 2.806 -1.291 0.2076

Which Girl was most over(under)weight?

How would you use these data to estimate the number of CM per inch?

Colonial Broadcasting Company Three Networks ABN, BBS, CBC Data from 88 made-for-TV movies (1992) CBC wants to know what factors affect the movie’s Rating. (the percent of US households with TVs tuned into a program) CBC needs to forecast the rating of a proposed movie.

. Obs Network Month Day Rating Fact Stars Prev Rating Competition 1 BBS 15.6 14.2 14.5 2 7 10.8 15.3 17.2 . 19 11 14.4 12.1 20 13.6 11.4 11.9 21 ABN 14.6 19.3 22 16.3 15.2 57 12 12.8 12.0 58 16.8 15.7 10.1 59 CBC 14.0 8.2 14.8 60 11.3 13.0 13.2 87 11.2 16.4 88 19.1 12.6 15.4 Average 5.88 4.25 13.82 0.41 13.77 14.06 Stdev 3.91 2.85 2.54 0.49 0.54 3.23 2.29 median 4 14.05 13.65 14.1 mode 13.8 min 8.9 5.3 max 19.5 24.7 20.3

1a. Rank the networks based on average 1992 rating. StatTools (Core Analysis Pack)   Analysis: Regression 1. Dependent Variable: RATING Performed By: PEP Date: Thursday, May 04, 2006 Updating: Static Multiple R-Square Adjusted StErr of Summary R Estimate 0.3380 0.1143 0.0934 2.4212 Degrees of Sum of Mean of F-Ratio p-Value ANOVA Table Freedom Squares Explained 2 64.2912 32.14560148 5.4833 0.0058 Unexplained 85 498.3060 5.862423013 Coefficient Standard t-Value Lower Upper Regression Table Error Limit Constant 13.3633 0.4421 30.2299 < 0.0001 12.4844 14.2423 ABN 1.3972 0.5913 2.3627 0.0204 0.2214 2.5729 BBS -0.6483 0.6990 -0.9276 0.3563 -2.0380 0.7414 1b. How big was the ratings gap between the top and bottom ranked networks?

2a. What is the average rating of fact based movies? StatTools (Core Analysis Pack)   Analysis: Regression 2. Dependent Variable: RATING Performed By: PEP Date: Thursday, May 04, 2006 Updating: Static Multiple R-Square Adjusted StErr of Summary R Estimate 0.2724 0.0742 0.0635 2.461 Degrees of Sum of Mean of F-Ratio p-Value ANOVA Table Freedom Squares Explained 1 41.7582 6.8950 0.0102 Unexplained 86 520.8390 6.0563 Coefficient Standard t-Value Lower Upper Regression Table Error Limit Constant 13.24615 0.34127 38.8141 < 0.0001 12.568 13.925 Fact 1.40107 0.53357 2.6258 0.340 2.462 2a. What is the average rating of fact based movies? 2b. Is the difference in fact and fiction ratings statistically significant?

fact-based movies had fewer stars (than fictional movies) StatTools (Core Analysis Pack)   Analysis: Regression 3. Dependent Variable: RATING Performed By: PEP Date: Thursday, May 04, 2006 Updating: Static Multiple R-Square Adjusted StErr of Summary R Estimate 0.3733 0.1394 0.1191 2.387 Degrees of Sum of Mean of F-Ratio p-Value ANOVA Table Freedom Squares Explained 2 78.420 39.210 6.8836 0.0017 Unexplained 85 484.177 5.696 Coefficient Standard t-Value Lower Upper Regression Table Error Limit Constant 12.568 0.425 29.550 < 0.0001 11.72 13.41 Fact 1.799 0.541 3.327 0.0013 0.72 2.87 Stars 1.259 0.496 2.537 0.0130 0.27 2.24 3. Which is most true? fact-based movies had fewer stars (than fictional movies) Fact-based movies had more stars. Fact-based movies had the same number of stars. Cannont be determined.

StatTools (Core Analysis Pack)   Analysis: Regression 5. Dependent Variable: RATING Performed By: PEP Date: Thursday, May 04, 2006 Updating: Static Multiple R-Square Adjusted StErr of Summary R Estimate 0.7387 0.5456 0.4799 1.834 Degrees of Sum of Mean of F-Ratio p-Value ANOVA Table Freedom Squares Explained 11 306.964 27.906 8.2964 < 0.0001 Unexplained 76 255.634 3.364 Coefficient Standard t-Value Lower Upper Regression Table Error Limit Constant 12.87691 2.01203 6.3999 8.870 16.884 Fact 1.89451 0.44028 4.3029 1.018 2.771 Stars 0.74425 0.42113 1.7673 0.0812 -0.095 1.583 Prev Rating 0.18571 0.10872 1.7081 0.0917 -0.031 0.402 Competition -0.29356 0.11035 -2.6602 0.0095 -0.513 -0.074 ABN 1.07497 1.03428 1.0393 0.3019 -0.985 3.135 BBS -1.04990 0.59970 -1.7507 0.0840 -2.244 0.145 OCT -1.54061 0.68598 -2.2458 0.0276 -2.907 -0.174 DEC 1.39816 0.72802 1.9205 0.0585 -0.052 2.848 APR-MAY -1.40377 0.56574 -2.4813 0.0153 -2.531 -0.277 MON 2.52860 1.00136 2.5252 0.0136 0.534 4.523 SUN 1.52567 0.70636 2.1599 0.0339 0.119 2.933 4. On Sunday night, CBC usually airs “Josette and Yvette” at 8 pm followed by the Sun night movie. “J&Y” typical get a 17.5 rating. If they replace “J&Y” with a rock concert expected to get a rating of 20, what is the expected change in the movie rating?

StatTools (Core Analysis Pack)   Analysis: Regression 5. Dependent Variable: RATING Performed By: PEP Date: Thursday, May 04, 2006 Updating: Static Multiple R-Square Adjusted StErr of Summary R Estimate 0.7387 0.5456 0.4799 1.834 Degrees of Sum of Mean of F-Ratio p-Value ANOVA Table Freedom Squares Explained 11 306.964 27.906 8.2964 < 0.0001 Unexplained 76 255.634 3.364 Coefficient Standard t-Value Lower Upper Regression Table Error Limit Constant 12.87691 2.01203 6.3999 8.870 16.884 Fact 1.89451 0.44028 4.3029 1.018 2.771 Stars 0.74425 0.42113 1.7673 0.0812 -0.095 1.583 Prev Rating 0.18571 0.10872 1.7081 0.0917 -0.031 0.402 Competition -0.29356 0.11035 -2.6602 0.0095 -0.513 -0.074 ABN 1.07497 1.03428 1.0393 0.3019 -0.985 3.135 BBS -1.04990 0.59970 -1.7507 0.0840 -2.244 0.145 OCT -1.54061 0.68598 -2.2458 0.0276 -2.907 -0.174 DEC 1.39816 0.72802 1.9205 0.0585 -0.052 2.848 APR-MAY -1.40377 0.56574 -2.4813 0.0153 -2.531 -0.277 MON 2.52860 1.00136 2.5252 0.0136 0.534 4.523 SUN 1.52567 0.70636 2.1599 0.0339 0.119 2.933 5. A high-ranking CBC exec argued that network programming does not affect total size of network audience, only the relative share each network receives. Does the regression support or refute this assertion?

StatTools (Core Analysis Pack)   Analysis: Regression 4. Dependent Variable: RATING Performed By: PEP Date: Thursday, May 04, 2006 Updating: Static Multiple R-Square Adjusted StErr of Summary R Estimate 0.5342 0.2854 0.2510 2.2008 Degrees of Sum of Mean of F-Ratio p-Value ANOVA Table Freedom Squares Explained 4 160.5680 40.1420 8.2874 < 0.0001 Unexplained 83 402.0291 4.8437 Coefficient Standard t-Value Lower Upper Regression Table Error Limit Constant 12.1471 0.4857 25.0104 11.181 13.113 Fact 2.0818 0.5044 4.1271 1.079 3.085 Stars 1.3464 0.4730 2.8466 0.0056 0.406 2.287 ABN 1.2635 0.5485 2.3036 0.0237 0.173 2.354 BBS -1.2135 0.6559 -1.8500 0.0679 -2.518 0.091 6. BBS’s new movie is fiction- based with 2 stars. We don’t know when it will be aired. Will it’s rating exceed the 1992 average for BBS movies?