Ch. 10 – Scatterplots, Association and Correlation (Day 1)


Scatterplots
So far, all of our analysis has looked at one variable at a time. In this chapter, we will look at the relationship between two variables. If the variables are quantitative, we can start with a graph called a scatterplot.

Scatterplots
Ex: Use the following data to examine the relationship between the amount of fertilizer (lbs per acre) used on plots of land in a particular farming region and the number of bushels per acre of grain produced.

Fertilizer   Bushels
31           43
33           44
40           45
50           54
59           53
68           59
80           63

THINK: How will we draw the graph?
To decide which variable goes on which axis, think about what you are trying to learn. Do the variables have an explanatory/response relationship? In this case, we are wondering how the amount of fertilizer used affects the amount of grain produced.
Fertilizer is the explanatory variable.
Bushels produced is the response variable.
In a scatterplot, the explanatory variable goes on the x-axis and the response variable goes on the y-axis. If the variables do not have an explanatory/response relationship, either variable can go on either axis.

SHOW: Draw the scatterplot
[Figure: scatterplot with Bushels on the y-axis and Lbs of Fertilizer on the x-axis]
Don’t forget about labels and scale!
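
As a rough illustration of the SHOW step, here is a minimal text-mode sketch of the scatterplot using only the Python standard library. The function name `text_scatterplot` and the axis ranges are hypothetical choices, and the data pair (68, 59) is an assumption because the slide's table is garbled at that point. In practice a plotting library would normally be used instead.

```python
# Fertilizer example data. The pair (68, 59) is an assumption: the slide's
# table is garbled at that point.
fertilizer = [31, 33, 40, 50, 59, 68, 80]  # explanatory variable (x), lbs per acre
bushels = [43, 44, 45, 54, 53, 59, 63]     # response variable (y), bushels per acre

def text_scatterplot(xs, ys, x_min=30, x_max=80, y_min=40, y_max=65, x_step=2):
    """Return text rows (largest y first) with '*' marking each (x, y) point.

    Assumes integer y values; each column covers x_step units of x.
    """
    n_cols = (x_max - x_min) // x_step + 1
    rows = []
    for y in range(y_max, y_min - 1, -1):
        cells = [" "] * n_cols
        for px, py in zip(xs, ys):
            if py == y:
                cells[(px - x_min) // x_step] = "*"
        rows.append(f"{y:>3} |" + "".join(cells))
    return rows

rows = text_scatterplot(fertilizer, bushels)
print("Bushels per acre")          # y-axis label
print("\n".join(rows))
print("    +" + "-" * 26)
print("     Lbs of fertilizer per acre, 30 (left edge) to 80 (right edge)")
```

The explanatory variable runs left to right and the response variable runs bottom to top, matching the axis rule from the THINK step.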

TELL: What does a scatterplot show us?
In most of our previous graphs, we were looking for the center, shape, and spread of a single quantitative variable. This time we are looking at the relationship between two quantitative variables. If the two variables seem related, this is referred to as an association. Specifically, we are looking at the form, direction, and strength of the association.

Form: Is it linear?
Our eventual goal is to create a model for the data. To decide which calculations to use, we first need to look at the form (shape) the pattern follows. A scatterplot has a linear form if a straight line could be used to describe it reasonably well. For now, we will simply describe form as linear or nonlinear.
[Figure: two example scatterplots, labeled Linear and Nonlinear]

Direction: Positive, Negative or No Association?
Once we decide that the form is linear, we turn to direction.
If y increases as x increases, this is a positive association.
If y decreases as x increases, this is a negative association.
[Figure: three example scatterplots, labeled Positive association, Negative association, and No association]
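
The direction rule can be checked numerically with a small sketch. The function `direction` is a hypothetical helper, not something from the slides: it looks at the sign of the sum of products of deviations from the means, which is positive when y tends to rise with x and negative when y tends to fall. The fertilizer data include the assumed pair (68, 59).

```python
from statistics import mean

def direction(xs, ys):
    """Classify direction by the sign of sum((x - x_bar) * (y - y_bar))."""
    x_bar, y_bar = mean(xs), mean(ys)
    total = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    if total > 0:
        return "positive"
    if total < 0:
        return "negative"
    return "none"

# Fertilizer example; the pair (68, 59) is an assumption (garbled table).
fertilizer = [31, 33, 40, 50, 59, 68, 80]
bushels = [43, 44, 45, 54, 53, 59, 63]

print(direction(fertilizer, bushels))            # y rises with x
print(direction([1, 2, 3, 4], [9, 7, 4, 2]))     # y falls as x rises
```

A sum that is close to zero but not exactly zero is still labeled positive or negative here, so the scatterplot remains the better guide for weak patterns.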

Strength: Strong, Moderate, Weak?
The last thing we should address is the strength of the relationship. The conclusions we draw about strength are highly subjective, especially if they are based strictly on looking at the scatterplot.
[Figure: three example scatterplots, labeled Strong association, Moderate association, and Weak association]

Correlation Coefficient
r = correlation coefficient for linear relationships. It measures the strength and direction of a linear relationship between two quantitative variables.
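
The slides do not print a formula for r. One standard form, consistent with the later remark that r is based on z-scores, is given here as a reference. The symbols are assumptions not defined in the slides: n is the number of data pairs, \bar{x} and \bar{y} are the sample means, and s_x and s_y are the sample standard deviations.

```latex
r = \frac{1}{n-1} \sum_{i=1}^{n}
    \left( \frac{x_i - \bar{x}}{s_x} \right)
    \left( \frac{y_i - \bar{y}}{s_y} \right)
```

Each factor in the sum is a z-score, so r is the sum of the products of paired z-scores divided by n - 1.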

Calculating r
[Figure: scatterplot of Bushels (y-axis) against Lbs of Fertilizer (x-axis), shown next to the data table]
r = .9782
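
The calculation can be reproduced with a short sketch that uses z-scores. The function name `correlation` is hypothetical, and the data pair (68, 59) is an assumption, since the slide's table is garbled at that point. With that pair the result agrees with the value on the slide.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Correlation coefficient r: sum of products of z-scores over (n - 1)."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)  # sample standard deviations
    products = (((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys))
    return sum(products) / (n - 1)

# Fertilizer example; the pair (68, 59) is an assumption (garbled table).
fertilizer = [31, 33, 40, 50, 59, 68, 80]
bushels = [43, 44, 45, 54, 53, 59, 63]

r = correlation(fertilizer, bushels)
print(round(r, 4))  # 0.9782
```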

What does r tell us?
Close to +1 = strong, positive linear association
Close to -1 = strong, negative linear association
Close to 0 = weak or no linear association
r = 1 or r = -1 means a perfect linear correlation

Properties of r
r is a number between -1 and 1.
Since r is based on z-scores, it is not affected by shifting or re-scaling, and it has no units.
The correlation of x with y is the same as the correlation of y with x (it doesn’t matter which variable is used as x or y; the correlation stays the same).
Remember that r only works for linear associations of quantitative variables.
r is very sensitive to outliers, so be careful!
Even though we have this numerical calculation, strength is still subjective: a value such as 0.68 that is considered strong for one set of data might be considered weak for another.
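
Several of these properties can be checked with a small sketch. The helper `correlation` and the unit conversions are illustrative assumptions, and the data pair (68, 59) is assumed because the slide's table is garbled. The re-scaling here uses a positive factor; multiplying one variable by a negative constant would flip the sign of r while keeping its magnitude.

```python
import math
from statistics import mean, stdev

def correlation(xs, ys):
    """Correlation coefficient r: sum of products of z-scores over (n - 1)."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    total = sum(((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys))
    return total / (len(xs) - 1)

# Fertilizer example; the pair (68, 59) is an assumption (garbled table).
fertilizer = [31, 33, 40, 50, 59, 68, 80]
bushels = [43, 44, 45, 54, 53, 59, 63]
r = correlation(fertilizer, bushels)

# Re-scaling x (pounds to kilograms) and shifting y leave r unchanged.
kilograms = [x * 0.4536 for x in fertilizer]
shifted = [y + 100 for y in bushels]
r_transformed = correlation(kilograms, shifted)

# Swapping the roles of x and y leaves r unchanged.
r_swapped = correlation(bushels, fertilizer)

# r only describes linear association: a perfect curve can still give r near 0.
xs = [-2, -1, 0, 1, 2]
r_curve = correlation(xs, [x ** 2 for x in xs])

print(math.isclose(r, r_transformed), math.isclose(r, r_swapped), abs(r_curve) < 1e-12)
```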

Outliers
A scatterplot can also show us outliers. In this context, an outlier is a point that doesn’t seem to fit within the pattern formed by the rest of the data.
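
To see how much a single outlier can matter, the sketch below recomputes r after adding one made-up point that breaks the pattern. The point (75, 41) is hypothetical and not part of the original data, the helper `correlation` is an illustrative name, and the pair (68, 59) is an assumption because the slide's table is garbled.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Correlation coefficient r: sum of products of z-scores over (n - 1)."""
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    total = sum(((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys))
    return total / (len(xs) - 1)

# Fertilizer example; the pair (68, 59) is an assumption (garbled table).
fertilizer = [31, 33, 40, 50, 59, 68, 80]
bushels = [43, 44, 45, 54, 53, 59, 63]
r_without = correlation(fertilizer, bushels)

# Hypothetical outlier: a lot of fertilizer but a very low yield.
r_with = correlation(fertilizer + [75], bushels + [41])

print(round(r_without, 2), round(r_with, 2))
```

One unusual plot of land is enough to pull a very strong correlation down to a moderate one, which is why the scatterplot should be checked before r is interpreted.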

Homework
Pg. 542 # 12, 14, 16
Directions: Make a scatterplot of the data. Calculate the correlation coefficient and interpret what it means.