Chapter 7 Scatterplots, Association, and Correlation Stats: modeling the world Second edition Raymond Dahlman IV.

Slides:



Advertisements
Similar presentations
Correlation and Linear Regression
Advertisements

 Objective: To look for relationships between two quantitative variables.
Unit 4: Linear Relations Minds On 1.Determine which variable is dependent and which is independent. 2.Graph the data. 3.Label and title the graph. 4.Is.
Scatterplots By Wendy Knight. Review of Scatterplots  Scatterplots – Show the relationship between 2 quantitative variables measured on the same individual.
CHAPTER 12 More About Regression
Relationship of two variables
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 6, Slide 1 Chapter 6 Scatterplots, Association and Correlation.
How do scientists show the results of investigations?
Correlation with a Non - Linear Emphasis Day 2.  Correlation measures the strength of the linear association between 2 quantitative variables.  Before.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Copyright © 2010 Pearson Education, Inc. Unit 2: Chapter 7 Scatterplots, Association, and Correlation.
Scatterplots, Association,
Scatterplots, Associations, and Correlation
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
New Seats – Block 1. New Seats – Block 2 Warm-up with Scatterplot Notes 1) 2) 3) 4) 5)
Slide 7-1 Copyright © 2004 Pearson Education, Inc.
1 Chapter 7 Scatterplots, Association, and Correlation.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Objectives (IPS Chapter 2.1)
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Sec 1.5 Scatter Plots and Least Squares Lines Come in & plot your height (x-axis) and shoe size (y-axis) on the graph. Add your coordinate point to the.
Scatterplots are used to investigate and describe the relationship between two numerical variables When constructing a scatterplot it is conventional to.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
“Numbers are like people… torture them long enough and they’ll tell you anything...” Linear transformation.
Aim: Review for Exam Tomorrow. Independent VS. Dependent Variable Response Variables (DV) measures an outcome of a study Explanatory Variables (IV) explains.
Scatterplots and Correlation Section 3.1 Part 1 of 2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore.
Chapter 7 Scatterplots, Association, and Correlation.
Correlations: Relationship, Strength, & Direction Scatterplots are used to plot correlational data – It displays the extent that two variables are related.
Bivariate Data AS (3 credits) Complete a statistical investigation involving bi-variate data.
 Describe the association between two quantitative variables using a scatterplot’s direction, form, and strength  If the scatterplot’s form is linear,
Correlation The apparent relation between two variables.
Scatterplots Chapter 7 Definition and components Describing Correlation Correlation vs. Association.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 7 Scatterplots, Association, and Correlation.
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
DO NOW 1.Given the scatterplot below, what are the key elements that make up a scatterplot? 2.List as many facts / conclusions that you can draw from this.
Linear Regression Day 1 – (pg )
AP Statistics Section 4.1 A Transforming to Achieve Linearity.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Statistics: Analyzing 2 Quantitative Variables MIDDLE SCHOOL LEVEL  Session #2  Presented by: Dr. Del Ferster.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
UNIT 4 Bivariate Data Scatter Plots and Regression.
Chapter 7 Scatterplots, Association, and Correlation.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
6.7 Scatter Plots. 6.7 – Scatter Plots Goals / “I can…”  Write an equation for a trend line and use it to make predictions  Write the equation for a.
Module 11 Scatterplots, Association, and Correlation.
Chapter 4 Scatterplots – Descriptions. Scatterplots Graphical display of two quantitative variables We plot the explanatory (independent) variable on.
Chapter 5 Lesson 5.1a Summarizing Bivariate Data 5.1a: Correlation.
Scatter Plots. Standard: 8.SP.1 I can construct and interpret scatterplots.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 7 Scatterplots, Association, and Correlation.
Statistics 7 Scatterplots, Association, and Correlation.
Unit 3 – Association: Contingency, Correlation, and Regression Lesson 3-2 Quantitative Associations.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 7- 1.
Going Crackers! Do crackers with more fat content have greater energy content? Can knowing the percentage total fat content of a cracker help us to predict.
Chapter 8 Part I Answers The explanatory variable (x) is initial drop, measured in feet, and the response variable (y) is duration, measured in seconds.
Scatterplots, Association, and Correlation
4-5 Scatter Plots and Lines of Best Fit
CHAPTER 7 LINEAR RELATIONSHIPS
Two Quantitative Variables
Ch. 10 – Scatterplots, Association and Correlation (Day 1)
Chapter 7: Scatterplots, Association, and Correlation
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Basic Practice of Statistics - 3rd Edition
Scatterplots and Correlation
Correlation.
Examining Relationships
Scatterplots, Association and Correlation
Chapters Important Concepts and Terms
Presentation transcript:

Chapter 7 Scatterplots, Association, and Correlation Stats: modeling the world Second edition Raymond Dahlman IV

Scatterplots Scatter plots are an effective way of displaying data, and lets us to visibly see if there is an association between 2 quantitative sets of variables. When looking at a scatter plot, we look for the direction, form, strength, and any unusual features like outliers in the data

Features of Scatterplots Direction The direction of data determines whether the variables have a positive or negative correlation

Features of Scatterplots Form The form, or the overall shape of the data, tells us how the two data sets are related, either linearly or by a non-linear function.

Features of Scatterplots Strength The strength of the data is the relationship of each data point to the overall expected value, the line of best fit.

Determining Variables When presented with 2 sets of data, it is necessary to assign each variable to its respective category. The data set that you believe that determines the other is the independent, or explanatory variable. The other variable, is the dependent, or response variable. When graphing, the explanatory variable is the x-axis, while the response variable is the y-axis.

Correlation Correlation is the measurement of the linear strength between 2 quantitative sets of data. To find the measure of correlation, or 𝑟 , we use the correlation coefficient. 𝑟= 𝑧 𝑥 𝑧 𝑦 𝑛−1

Correlation Conditions Because the correlation coefficient is only useful for linear trends, the data must follow these conditions. The Quantitative Variables Condition: The 2 variables must be quantitative data in the appropriate units The Straight Enough Condition: The data must have a linear trend. If it does not, there are methods to straighten the data. The Outlier Condition: Be aware of outliers that could potentially distort the correlation coeficant.

Correlation Properties Correlation is always −1≤𝑟≤1 with the sign of 𝑟 determining if is a negative or positive correlation. When r is equal to 1 or -1, all the data points fall along a single straight line, a rare occurrence. A correlation near 0 corresponds to a weak linear association. Correlation treats the variables symmetrically, where x:y is equal to y:x Correlation only depends on z-scores, and will not change unless the individual z-scores are changed.

Straightening Scatterplots When a scatterplot shows a bent from that consistently increases or decreases, we can often straighten the form of the plot by re- expressing one or both variables, enabling us to apply the correlation coefficient.

Problem #33 People who responded to a July 2004 Discovery Channel poll named the 10 best roller coasters in the United States. The table below shows the length of the initial drop (in feet) and the duration of the ride (in seconds). What do these data indicate about the heght of a roller coaster and the length of the ride you can expect?

Make a Scatterplot

Drop (ft) Duration (s) 𝜇 171 133 𝜎 74.75 51.33 After making a scatterplot, we must determine the mean and standard deviation. Then we will take these values and change them into z-scores using z = 𝑥−𝜇 𝜎 -0.88299 0.038964 1.123809 -0.545490 0.915644 0.419048 2.084551 0.321088 -0.253263 -0.40136 -1.324761 0.575283 0.136372 -1.01678 -0.837717 -0.84286 0.526008 -1.13719 Now, we can plug this into our correlation coefficient equation, and we come up with a r value of ~0.35 The overall all trend is a weak positive correlation between the drop length and the duration of the ride.

Problem #35 Is there any pattern to the locations of the planets in our solar system? The table shows the average distance of each of the nine planets from the sun.   a) Make a scatterplot and describe the association. (Remember: direction, form, and strength!)  b) Why would you not want to talk about the correlation between planet position and distance from the sun?  c) Make a scatterplot showing the logarithm of distance vs. position. What is better about this scatterplot? 

A: The relation between the position and distance is non-linear with a positive association. There is very little scatter in the data. B: The relation is not linear.

C: The relation between the position and the log of the distance appears to be roughly linear.