Stor 155, Section 2, Last Time 2-way Tables –Sliced populations in 2 different ways –Look for independence of factors –Chi Square Hypothesis test Simpson’s.

Slides:



Advertisements
Similar presentations
Inference for Regression
Advertisements

Objectives (BPS chapter 24)
Inference for Regression 1Section 13.3, Page 284.
Lecture 23: Tues., Dec. 2 Today: Thursday:
The Simple Regression Model
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
The Basics of Regression continued
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Simple Linear Regression Analysis
Ch. 14: The Multiple Regression Model building
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Least squares line Interpreting coefficients Prediction Cautions The formal model Section 2.6, 9.1, 9.2 Professor Kari Lock Morgan.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Correlation and Linear Regression
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Linear Regression and Correlation
Stat 31, Section 1, Last Time T distribution –For unknown, replace with –Compute with TDIST & TINV (different!) Paired Samples –Similar to above, work.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Lecture 14 Multiple Regression Model
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Stor 155, Section 2, Last Time Inference for Regression –Least Square Fits –Sampling distrib’ns for slope and intercept Regression Tool –Gave many useful.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Chapter 15 Inference for Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Stat 31, Section 1, Last Time 2-way tables –Testing for Independence –Chi-Square distance between data & model –Chi-Square Distribution –Gives P-values.
Forecasting Techniques: Single Equation Regressions Su, Chapter 10, section III.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Stat 31, Section 1, Last Time Inference for Proportions –Hypothesis Tests 2 Sample Proportions Inference –Skipped 2-way Tables –Sliced populations in 2.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Inference for Regression Chapter 14. Linear Regression We can use least squares regression to estimate the linear relationship between two quantitative.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Stat 155, Section 2, Last Time Binomial Distribution –Normal Approximation –Continuity Correction –Proportions (different scale from “counts”) Distribution.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Chapter 26: Inference for Slope. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other.
BPS - 5th Ed. Chapter 231 Inference for Regression.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
CHAPTER 12 More About Regression
Chapter 4 Basic Estimation Techniques
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Stat 31, Section 1, Last Time Sampling Distributions
CHAPTER 12 More About Regression
Statistical Methods For Engineers
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
CHAPTER 12 More About Regression
CHAPTER 12 More About Regression
Linear Regression and Correlation
Presentation transcript:

Stor 155, Section 2, Last Time 2-way Tables –Sliced populations in 2 different ways –Look for independence of factors –Chi Square Hypothesis test Simpson’s Paradox –Aggregating can give opposite impression Inference for Regression –Sampling Distributions – TDIST & TINV

Reading In Textbook Approximate Reading for Today’s Material: Pages & Review Approximate Reading for Next Class: Pages & Review

Inference for Regression Chapter 10 Recall: Scatterplots Fitting Lines to Data Now study statistical inference associated with fit lines E.g. When is slope statistically significant?

Recall Scatterplot For data (x,y) View by plot: (1,2) (3,1) (-1,0) (2,-1)

Recall Linear Regression Idea: Fit a line to data in a scatterplot To learn about “basic structure” To “model data” To provide “prediction of new values”

Recall Linear Regression Given a line,, “indexed” by Define “residuals” = “data Y” – “Y on line” = Now choose to make these “small”

Recall Linear Regression Make Residuals > 0, by squaring Least Squares: adjust to Minimize the “Sum of Squared Errors”

Least Squares in Excel Computation: 1.INTERCEPT (computes y-intercept a) 2.SLOPE (computes slope b) Revisit Class Example 14

Inference for Regression Idea: do statistical inference on: –Slope a –Intercept b Model: Assume: are random, independent and

Inference for Regression Viewpoint: Data generated as: y = ax + b Y i chosen from X i Note: a and b are “parameters”

Inference for Regression Parameters and determine the underlying model (distribution) Estimate with the Least Squares Estimates: and (Using SLOPE and INTERCEPT in Excel, based on data)

Inference for Regression Distributions of and ? Under the above assumptions, the sampling distributions are: Centerpoints are right (unbiased) Spreads are more complicated

Inference for Regression Formula for SD of : Big (small) for big (small, resp.) –Accurate data  Accurate est. of slope Small for x’s more spread out –Data more spread  More accurate Small for more data –More data  More accuracy

Inference for Regression Formula for SD of : Big (small) for big (small, resp.) –Accurate data  Accur’te est. of intercept Smaller for –Centered data  More accurate intercept Smaller for more data –More data  More accuracy

Inference for Regression One more detail: Need to estimate using data For this use: Similar to earlier sd estimate, Except variation is about fit line is similar to from before

Inference for Regression Now for Probability Distributions, Since are estimating by Use TDIST and TINV With degrees of freedom =

Inference for Regression Convenient Packaged Analysis in Excel: Tools  Data Analysis  Regression Illustrate application using: Class Example 32, Old Text Problem 10.12

Inference for Regression Class Example 32, Old Text Problem Utility companies estimate energy used by their customers. Natural gas consumption depends on temperature. A study recorded average daily gas consumption y (100s of cubic feet) for each month. The explanatory variable x is the average number of heating degree days for that month. Data for October through June are:

Inference for Regression Data for October through June are: MonthX = Deg. DaysY = Gas Cons’n Oct Nov Dec Jan Feb Mar Apr May Jun01.1

Inference for Regression Class Example 32, Old Text Problem Excel Analysis: Good News: Lots of things done automatically Bad News: Different language, so need careful interpretation

Inference for Regression Excel Glossary: ExcelStor 155 R2R2 r 2 = Prop’n of Sum of Squares Explained by Line interceptIntercept b X VariableSlope a CoefficientEstimates &.

Inference for Regression Excel Glossary: ExcelStor 155 Standard Errors Estimates of &. (recall from Sampling Dist’ns) T – Stat.(Est. – mean) / SE, i.e. put on scale of T – distribution P-valueFor 2-sided test of:

Inference for Regression Excel Glossary: ExcelStor 155 Lower 95% Upper 95% Ends of 95% Confidence Interval for a and b (since chose 0.95 for Confidence level) Predicted.Points on line at, i.e..

Inference for Regression Excel Glossary: ExcelStor 155 Residual for. Recall: gave useful information about quality of fit (useful to plot) Standard Residuals: on standardized scale

Inference for Regression Some useful variations: Class Example 33, Text Problems Excel Analysis:

Inference for Regression Class Example 33, (10.23 – 10.25) Engineers made measurements of the Leaning Tower of Pisa over the years 1975 – “Lean” is the difference between a points position if the tower were straight, and its actual position, in tenths of a meter, in excess of 2.9 meters. The data are:

Inference for Regression Class Example 33, (10.23 – 10.25) The data are: YearLean

Inference for Regression Class Example 33, (10.23 – 10.25) : (a)Plot the data, does the trend in lean over time appear to be linear? (b)What is the equation of the least squares fit line? (c)Give a 95% confidence interval for the average rate of change of the lean.

Inference for Regression HW: b,c (using log base 10, for part c: Est’d slope: Est'd intercept: % CI for slope: [0.186, 0.202])

And Now for Something Completely Different Graphical Displays: Important Topic in Statistics Has large impact Need to think carefully to do this Watch for attempts to fool you

And Now for Something Completely Different Graphical Displays: Interesting Article: “How to Display Data Badly” Howard Wainer The American Statistician, 38, Internet Available:

And Now for Something Completely Different Main Idea: Point out 12 types of bad displays With reasons behind Here are some favorites…

And Now for Something Completely Different Hiding the data in the scale

And Now for Something Completely Different The eye perceives areas as “size”:

And Now for Something Completely Different Change of Scales in Mid- Axis Really trust the Post???

Review Slippery Issues Major Confusion: Population Quantities Vs. Sample Quantities

Review Slippery Issues Population Quantities: Parameters Will never know But can think about Sample Quantities: Estimates (of parameters) Numbers we work with Contain info about parameters

Review Slippery Issues Population Mathematical Notation: (fixed & unknown) Sample Mathematical Notation : (summaries of data, have numbers)

Review Slippery Issues Sampling Distributions: Measurement Error: Counting / Proportions:

Review Slippery Issues Confidence Intervals: Based on margin of error: Measurement Error: brackets 95% of time Counting / Proportions: brackets95% of time

Review Slippery Issues Hypothesis Testing: Statement of Hypotheses: Actual Test: P-value = P{What saw or m.c. | Bdry}

Hypothesis Testing from 3/22 Other views of hypothesis testing: View 2: Z-scores Idea: instead of reporting p-value (to assess statistical significance) Report the Z-score A different way of measuring significance

Hypothesis Testing – Z scores E.g. Fast Food Menus: Test Using P-value = P{what saw or m.c.| H 0 & H A bd’ry}

Hypothesis Testing – Z scores P-value = P{what saw or or m.c.| H 0 & H A bd’ry}