Lecture 5 Regression. Homework Issues…past 1.Bad Objective: Conduct an experiment because I have to for this class 2.Commas – ugh  3.Do not write out.

Slides:



Advertisements
Similar presentations
Objectives 10.1 Simple linear regression
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Chapter 14, part D Statistical Significance. IV. Model Assumptions The error term is a normally distributed random variable and The variance of  is constant.
Inference for Regression
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
4.3 Confidence Intervals -Using our CLM assumptions, we can construct CONFIDENCE INTERVALS or CONFIDENCE INTERVAL ESTIMATES of the form: -Given a significance.
Design of Experiments and Data Analysis. Let’s Work an Example Data obtained from MS Thesis Studied the “bioavailability” of metals in sediment cores.
Objectives (BPS chapter 24)
Inference for Regression 1Section 13.3, Page 284.
Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Statistics: Data Analysis and Presentation Fr Clinic II.
Data Freshman Clinic II. Overview n Populations and Samples n Presentation n Tables and Figures n Central Tendency n Variability n Confidence Intervals.
The Simple Regression Model
SIMPLE LINEAR REGRESSION
ASSESSING THE STRENGTH OF THE REGRESSION MODEL. Assessing the Model’s Strength Although the best straight line through a set of points may have been found.
T-test.
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Simple Linear Regression Analysis
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
BCOR 1020 Business Statistics
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 24 – April 17, 2008.
Hypothesis Testing Using The One-Sample t-Test
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 12 Section 1 Inference for Linear Regression.
Relationships Among Variables
Lecture 5 Correlation and Regression
Active Learning Lecture Slides
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Correlation and Linear Regression
II.Simple Regression B. Hypothesis Testing Calculate t-ratios and confidence intervals for b 1 and b 2. Test the significance of b 1 and b 2 with: T-ratios.
STA291 Statistical Methods Lecture 27. Inference for Regression.
Hypothesis Testing in Linear Regression Analysis
4.2 One Sided Tests -Before we construct a rule for rejecting H 0, we need to pick an ALTERNATE HYPOTHESIS -an example of a ONE SIDED ALTERNATIVE would.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
1 Lecture 4 Main Tasks Today 1. Review of Lecture 3 2. Accuracy of the LS estimators 3. Significance Tests of the Parameters 4. Confidence Interval 5.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
June 30, 2008Stat Lecture 16 - Regression1 Inference for relationships between variables Statistics Lecture 16.
Week 13a Making Inferences, Part III t and chi-square tests.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Data Analysis, Presentation, and Statistics
Example x y We wish to check for a non zero correlation.
Using Microsoft Excel to Conduct Regression Analysis.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
Chapter 9 Minitab Recipe Cards. Contingency tests Enter the data from Example 9.1 in C1, C2 and C3.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Simple Linear Regression and Correlation (Continue..,) Reference: Chapter 17 of Statistics for Management and Economics, 7 th Edition, Gerald Keller. 1.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Comparing Models.
Lecture #25 Tuesday, November 15, 2016 Textbook: 14.1 and 14.3
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
SIMPLE LINEAR REGRESSION
Presentation transcript:

Lecture 5 Regression

Homework Issues…past 1.Bad Objective: Conduct an experiment because I have to for this class 2.Commas – ugh  3.Do not write out symbols (‘pi’), use the symbol (‘  ’) 4.Summarize results (don’t give me everything and then some) 5.Report: mean ± std. dev.

Homework Issues…past 1.A confidence interval should be reported as an interval, e.g., 1.2 – Define abbreviations when first used, e.g., CI 3.However, there were too many conjunctive adverbs at the start of sentences! 4.Equation formatting

Homework Issues…present? 1.Do not show 27 digits of accuracy 2.UNITS!!! UNITS!!! INCLUDE UNITS!!! 3.Every table and figure should have a caption and be referred to in the text. 4.A section (e.g., results) should be more than just a table and a figure.

On to the lecture…

In Excel… three ways to perform a linear regression: 1.Built-in functions SLOPE() and INTERCEPT() -- no details 2.Adding a trendline to a chart, and showing the regression equation on the chart (simplest) 3.Regression analysis using the Data Analysis Toolkit (best option – more information)

Option 3 in Excel

Excel Results Recall that we forced the intercept = 0

Interpretation of results… Excel reports the Standard Error, not the standard deviation. They are not equal. See next slide. The P-value is the probability that the observed result could be explained by random chance. The tiny P-value for the slope (1.91 x ) indicates that there is a miniscule probability that the observed result can be explained by random chance. That is, you REALLY NEED the slope term to explain the data.

Interpretation of results… The 95% confidence interval for the true value of the slope (true value of π in this example) is presented in the output table. In this example, with 95% confidence, the true value of π is somewhere between and The 90% confidence interval is to , which does not contain the true value!! Measurement bias – not small, random, additive error?

Calculating std. dev. Slope se = Slope sd = · sqrt(20) = Our experimental results are: – “The experimental value of π was found to be 3.22 ± ” – “The 95% confidence interval for true value of π ranges from to ”

Multivariable Regression Fit this data to an equation of the form:

Plot

Multivariable Regression y is the response variable. Order of the other columns does not matter.

In Excel…

Results… (bug?)

Interpretation… The coefficients ± s are: b 0 = 5.53 ± b 1 = 2.12 ± 8.54 b 2 = 3.98 ± 0.78 Standard deviations are significantly larger than the mean values for b 0 and b 1. p-values for these coefficients are 0.42 and These p-values are well over 0.05, so these terms are statistically insignificant (at 5%.) We can regress this data nearly as well with:

p-value? Recall: The lower the p-value, the less likely the result, assuming the null hypothesis, so the more "significant" the result, in the sense of statistical significance. The null hypothesis here is, simplistically, that the coefficient is zero.

t-Test on a Regression Slope Comparison of b 1 from regression with another value, . The t-test is a hypothesis test. Here are the hypotheses for this t-test. – H0 (null hypothesis) – The slope, b 1, is equal to the known value, β. – H1 (test hypothesis) – The slope, b 1, is not equal to the known value, β.

t-Statistic The appropriate t-statistic for this case is calculated as where The t statistic is always positive; you may have to use (β-b 1 ) to get a positive value.

Critical t Value If t stat > t crit – Reject the null hypothesis that the slope, b 1, is equal to the known value, β. If t stat ≤ t crit – Fail to reject the null hypothesis. Get t crit from a t-Table or Excel (see example). degrees of freedom, DOF = N-2

Example We are comparing b 1 = 3.22 (first example in lecture) to  = . Get SSE = from regression output. Calculate: t stat = Choose α = DOF = 20 – 2 = 18. In Excel, calculate TINV(α,DOF), which returns the value t crit =2.101 when α = 0.05 and DOF = 18 Since t stat ≤ t crit (0.952 < 2.101) we fail to reject the null hypothesis. Conclusion? We cannot say with 95% confidence that b 1 is not equal to 

Example Choose α = DOF = 20 – 2 = 18. In Excel, calculate TINV(α,DOF), which returns the value t crit =0.86 when α = 0.40 and DOF = 18 Since t cirt ≤ t stat we reject the null hypothesis. Conclusion? We can say with 60% confidence that b 1 is not equal to  Hmmm…that’s a coin flip.