Lesson 3 - 1 Scatterplots and Correlation. Knowledge Objectives Explain the difference between an explanatory variable and a response variable Explain.

Slides:



Advertisements
Similar presentations
AP Statistics Section 3.1B Correlation
Advertisements

Scatterplots and Correlation
3.1b Correlation Target Goal: I can determine the strength of a distribution using the correlation. D2 h.w: p 160 – 14 – 18, 21, 26.
Lesson Diagnostics on the Least- Squares Regression Line.
Scatter Diagrams and Linear Correlation
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
Chapter 3: Examining Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
AP STATISTICS LESSON 3 – 1 EXAMINING RELATIONSHIPS SCATTER PLOTS.
A P STATISTICS LESSON 3 – 2 CORRELATION.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Stat 1510: Statistical Thinking and Concepts Scatterplots and Correlation.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Examining Relationships Prob. And Stat. 2.2 Correlation.
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Objectives (IPS Chapter 2.1)
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Section 4.1 Scatter Diagrams and Correlation. Definitions The Response Variable is the variable whose value can be explained by the value of the explanatory.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
+ Warm Up Tests 1. + The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots.
The Practice of Statistics
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Lesson Scatter Diagrams and Correlation. Objectives Draw and interpret scatter diagrams Understand the properties of the linear correlation coefficient.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Unit 3: Describing Relationships
Independent Dependent Scatterplot Least Squares
C HAPTER 3: E XAMINING R ELATIONSHIPS. 3.2: C ORRELATION Measures the direction and strength of a linear relationship between two variables. Usually written.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
3.1 Scatterplots and Correlation Objectives SWBAT: IDENTIFY explanatory and response variables in situations where one variable helps to explain or influences.
Scatterplots and Correlation Section 3.1 Part 2 of 2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore.
Balloon Activity Thoughts Did you discover a relationship between the circumference of the balloons and the time it took for them to descend? What were.
Unit 3 – Association: Contingency, Correlation, and Regression Lesson 3-2 Quantitative Associations.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
Lesson Scatterplots and Correlation. Objectives Describe why it is important to investigate relationships between variables Identify explanatory.
3.1 Scatterplots and Correlation
Describing Bivariate Relationships
A P STATISTICS LESSON 3 – 2 CORRELATION.
Scatterplots and Correlation
Do Now Create a scatterplot following these directions
Review of Chapter 3 Examining Relationships
Do Now Describe the following relationships. Do Now Describe the following relationships.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 4 - Scatterplots and Correlation
Chapter 3 Scatterplots and Correlation.
A P STATISTICS LESSON 3 – 2 CORRELATION.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Describing Bivariate Relationships
AP Stats Agenda Text book swap 2nd edition to 3rd Frappy – YAY
CHAPTER 3 Describing Relationships
Section 11.1 Correlation.
Bivariate Data Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change.
Correlation Coefficient
Review of Chapter 3 Examining Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Lesson Scatterplots and Correlation

Knowledge Objectives Explain the difference between an explanatory variable and a response variable Explain what it means for two variables to be positively or negatively associated Define the correlation r and describe what it measures List the four basic properties of the correlation r that you need to know in order to interpret any correlation List four other facts about correlation that must be kept in mind when using r

Construction Objectives Given a set of bivariate data, construct a scatterplot. Explain what is meant by the direction, form, and strength of the overall pattern of a scatterplot. Explain how to recognize an outlier in a scatterplot. Explain how to add categorical variables to a scatterplot. Use a TI-83/84/89 to construct a scatterplot. Given a set of bivariate data, use technology to compute the correlation r.

Vocabulary Bivariate data – Categorical Variables – Correlation (r) – Negatively Associated – Outlier – Positively Associated – Scatterplot – Scatterplot Direction – Scatterplot Form – Scatterplot Strength –

Scatter Plots Shows relationship between two quantitative variables measured on the same individual. Each individual in the data set is represented by a point in the scatter diagram. Explanatory variable plotted on horizontal axis and the response variable plotted on vertical axis. Do not connect the points when drawing a scatter diagram.

Drawing Scatter Plots by Hand Plot the explanatory variable on the x-axis. If there is no explanatory-response distinction, either variable can go on the horizontal axis. Label both axes Scale both axes (but not necessarily the same scale on both axes). Intervals must be uniform. Make your plot large enough so that the details can be seen easily. If you have a grid, adopt a scale so that you plot uses the entire grid

TI-83 Instructions for Scatter Plots Enter explanatory variable in L1 Enter response variable in L2 Press 2 nd y= for StatPlot, select 1: Plot1 Turn plot1 on by highlighting ON and enter Highlight the scatter plot icon and enter Press ZOOM and select 9: ZoomStat

Interpreting Scatterplots Just like distributions had certain important characteristics (Shape, Outliers, Center, Spread) Scatter plots should be described by –Direction positive association (positive slope left to right) negative association (negative slope left to right) –Form linear – straight line, curved – quadratic, cubic, etc, exponential, etc –Strength of the form weak moderate (either weak or strong) strong –Outliers (any points not conforming to the form) –Clusters (any sub-groups not conforming to the form)

Response Explanatory Response Explanatory Response Explanatory Response Explanatory Response Explanatory Strong Negative Quadratic AssociationWeak Negative Linear Association No RelationStrong Positive Linear Association Strong Negative Linear Association Example 1

Example 2 Describe the scatterplot below Colorado Mild Negative Exponential Association One obvious outlier Two clusters > 50% < 50%

Example 3 Describe the scatterplot below Mild Positive Linear Association One mild outlier

Adding Categorical Variables Use a different plotting color or symbol for each category

Associations Remember the emphasis in the definitions on above and below average values in examining the definition for linear correlation coefficient, r

Where x is the sample mean of the explanatory variable s x is the sample standard deviation for x y is the sample mean of the response variable s y is the sample standard deviation for y n is the number of individuals in the sample Linear Correlation Coefficient, r (x i – x) s x (y i – y) s y 1 r = n – 1 Σ

Equivalent Form for r Easy for computers (and calculators) r = x i y i x i y i – n Σ Σ Σ √ x i x i 2 – n Σ ( Σ ) 2 y i y i 2 – n Σ ( Σ ) 2 = s xy √s xx √s yy

Important Properties of r Correlation makes no distinction between explanatory and response variables r does not change when we change the units of measurement of x, y or both Positive r indicates positive association between the variables and negative r indicates negative association The correlation r is always a number between -1 and 1

Linear Correlation Coefficient Properties The linear correlation coefficient is always between -1 and 1 If r = 1, then the variables have a perfect positive linear relation If r = -1, then the variables have a perfect negative linear relation The closer r is to 1, then the stronger the evidence for a positive linear relation The closer r is to -1, then the stronger the evidence for a negative linear relation If r is close to zero, then there is little evidence of a linear relation between the two variables. R close to zero does not mean that there is no relation between the two variables The linear correlation coefficient is a unitless measure of association

TI-83 Instructions for Correlation Coefficient With explanatory variable in L1 and response variable in L2 Turn diagnostics on by –Go to catalog (2 nd 0) –Scroll down and when diagnosticOn is highlighted, hit enter twice Press STAT, highlight CALC and select 4: LinReg (ax + b) and hit enter twice Read r value (last line)

Example 4 Draw a scatter plot of the above data Compute the correlation coefficient x y r = y x

Example 5 Match the r values to the Scatterplots to the left 1)r = )r = )r = )r = 0 5)r = 0.5 6)r = 0.9 A B CF E D F E A C B D

Cautions to Heed Correlation requires that both variables be quantitative, so that it makes sense to do the arithmetic indicated by the formula for r Correlation does not describe curved relationships between variables, not matter how strong they are Like the mean and the standard deviation, the correlation is not resistant: r is strongly affected by a few outlying observations Correlation is not a complete summary of two-variable data

Observational Data Reminder If bivariate (two variable) data are observational, then we cannot conclude that any relation between the explanatory and response variable are due to cause and effect Remember Observational versus Experimental Data

Summary and Homework Summary –Scatter plots can show associations between variables and are described using direction, form, strength and outliers –Correlation r measures the strength and direction of the linear association between two variables –r ranges between -1 and 1 with 0 indicating no linear association Homework –3.7, 3.8, 3.13 – 3.16, 3.21