S ECTION 4.1 – T RANSFORMING R ELATIONSHIPS Linear regression using the LSRL is not the only model for describing data. Some data are not best described.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Residuals.
4.1: Linearizing Data.
Math 243 Exam 2 Preparation In regression what is considered an influential point? Any ordered pair is considered influential, if when removed, the result.
Copyright © 2011 Pearson Education, Inc. Curved Patterns Chapter 20.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 20 Curved Patterns.
Scatter Diagrams and Linear Correlation
Chapter Four: More on Two- Variable Data 4.1: Transforming to Achieve Linearity 4.2: Relationships between Categorical Variables 4.3: Establishing Causation.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Lesson Quiz: Part I 1. Change 6 4 = 1296 to logarithmic form. log = 4 2. Change log 27 9 = to exponential form = log 100,000 4.
Chapter 7 Scatterplots, Association, and Correlation Stats: modeling the world Second edition Raymond Dahlman IV.
C HAPTER 3: E XAMINING R ELATIONSHIPS. S ECTION 3.3: L EAST -S QUARES R EGRESSION Correlation measures the strength and direction of the linear relationship.
Descriptive Methods in Regression and Correlation
+ Hw: pg 788: 37, 39, 41, Chapter 12: More About Regression Section 12.2b Transforming using Logarithms.
Chapter 12-2 Transforming Relationships Day 2
Lesson Transforming to Achieve Linearity. Knowledge Objectives Explain what is meant by transforming (re- expressing) data. Tell where y = log(x)
Transforming to achieve linearity
Chapter 4: More on Two-Variable (Bivariate) Data.
The Practice of Statistics
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Examining Relationships
Bivariate Data When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
Transforming Relationships Chapter 4.1: Exponential Growth and Power Law Models Part A: Day 1: Exponential Growth.
Chapter 5 Residuals, Residual Plots, Coefficient of determination, & Influential points.
“Numbers are like people… torture them long enough and they’ll tell you anything...” Linear transformation.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
4.1 Transforming Relationships. Transforming (reexpressing) -Applying a function such as the logarithmic or square root to a quantitative variable -Because.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
8.7 M ODELING WITH E XPONENTIAL & P OWER F UNCTIONS p. 509.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Chapter 8 Linear Regression HOW CAN A MODEL BE CREATED WHICH REPRESENTS THE LINEAR RELATIONSHIP BETWEEN TWO QUANTITATIVE VARIABLES?
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.2 Least-Squares Regression.
AP Statistics Section 4.1 A Transforming to Achieve Linearity.
1. Plot the data. What kind of growth does it exhibit? (plot by hand but you may use calculators to confirm answers.) 2. Use logs to transform the data.
Section 4.1 Transforming Relationships AP Statistics.
4.1 Warm Up Describe a monotonic function. Enter the given data in L1 and L2 Create a scatter plot for the data. Determine if a linear model is a good.
Residuals, Influential Points, and Outliers
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Chapter 10 Notes AP Statistics. Re-expressing Data We cannot use a linear model unless the relationship between the two variables is linear. If the relationship.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Chapter 4 More on Two-Variable Data. Four Corners Play a game of four corners, selecting the corner each time by rolling a die Collect the data in a table.
HW 16 Key. 20:31 Wal-Mart. 20:31 a a.Scatterplot. Linear? No. It grows at a growing rate.
12.2 TRANSFORMING TO ACHIEVE LINEARITY To use transformations involving powers, roots, and logarithms to find a power or exponential model that describes.
CHAPTER 3 Describing Relationships
Ch. 10 – Linear Regression (Day 2)
Linear Regression Special Topics.
CHAPTER 3 Describing Relationships
Active Learning Lecture Slides
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Chapter 8 Part 2 Linear Regression
Least-Squares Regression
residual = observed y – predicted y residual = y - ŷ
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Residuals, Influential Points, and Outliers
Section 4.1 Exponential Modeling
Ch 4 : More on Two-Variable Data
CHAPTER 3 Describing Relationships
Homework: pg. 180 #6, 7 6.) A. B. The scatterplot shows a negative, linear, fairly weak relationship. C. long-lived territorial species.
Math 243 Exam 2 Preparation.
Homework: pg. 276 #5, 6 5.) A. The relationship is strong, negative, and curved. The ratios are all Since the ratios are all the same, the exponential.
Chapters Important Concepts and Terms
Homework: PG. 204 #30, 31 pg. 212 #35,36 30.) a. Reading scores are predicted to increase by for each one-point increase in IQ. For x=90: 45.98;
Review of Chapter 3 Examining Relationships
Presentation transcript:

S ECTION 4.1 – T RANSFORMING R ELATIONSHIPS Linear regression using the LSRL is not the only model for describing data. Some data are not best described linearly. In some cases the removal of outliers from data may cause a drop in the correlation so that linear no longer does a satisfactory job of describing the data. In some cases the bulk of the data may not be linear at all. Non-linear relationships between two quantitative variables can sometimes be changed into linear relationships by transforming one or both variables.

S ECTION 4.1 – T RANSFORMING R ELATIONSHIPS Transforming can be thought of as re-expressing the data. We may want to transform either the explanatory variable x, or the response variable y in a scatter plot, or maybe even both. We will call the transformed variable " t " when talking about the transforming in general. Many variables take only 0 or positive values, so we are particularly interested in how functions behave for positive values of t.

The following models are common functions in which you should be familiar with their shape and equation. These are models in which t > 0.

The following scatterplot represents brain weight against body weight for 96 species of mammals.

The scatterplot is not very satisfactory since most mammals are so small relative to elephants and hippos The lower left corner of the plot shows that most of the species overlap forming a “blob” The correlation with all 96 species is r =.86, but removing the elephant, r =.50 To get a closer look at the observations that are in the lower-left corner, the 4 outliers were removed

This scatterplot represents the 92 observations with the 4 outliers removed. Instead of a linear relationship, you can see that as body weight increases, the graph bends to the right which is representative of a logarithmic function.

The following plot includes the original 96 observations, but instead of plotting the y-value against the x -value, the logarithm of the brain weights ( y -value) were plotted against the logarithm of the body weights ( x -value). There are no longer any extreme outliers or very influential observations and the pattern is very linear with r =.96

The ladder of power functions is in the form: linear reciprocal square root logarithmic inverse square

C ONCAVITY OF P OWER F UNCTIONS

E XPONENTIAL G ROWTH

T RANSFORMING DATA

R EVIEW P ROPERTIES OF L OGARITHMS 13

E XAMPLE 1 – G ROWTH OF C ELL P HONE U SE The cell phone industry enjoyed substantial growth in the 1990’s. One way to measure cell phone growth is to look at the number of subscribers. Find a linear model to predict the number of subscribers in the year Year Subscribers (thousands) ,00924,13433,78644,04355,31269,20986,047 There is an increasing trend, but the overall pattern is not linear. The pattern looks like an exponential curve. Is this exponential growth?

While the curve may appear to be exponential growth, we can’t simply depend on what our eyes see. If you suspect exponential growth, first calculate the ratios of consecutive terms to see if they are the same fixed percentage of the previous total. To avoid overflow in the calculator it is good practice to code the years (let 1990 = 1) Don’t use 0 since you can’t take the log of 0 E XAMPLE 1 – G ROWTH OF C ELL P HONE U SE YearSubscribersRatioslog(y) 15, , , , , , , , Means that the # of subscribers in 1994 is 151% of or 1.51 times the # of subscribers in Could also say that it’s a 51% increase. On average, subscribers were increasing around 35% each year.

E XAMPLE 1 – G ROWTH OF C ELL P HONE U SE

Homework: p #’s 6-8