Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.

Slides:



Advertisements
Similar presentations
Scatterplots and Correlation
Advertisements

3.1b Correlation Target Goal: I can determine the strength of a distribution using the correlation. D2 h.w: p 160 – 14 – 18, 21, 26.
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
+ Scatterplots and Correlation Displaying Relationships: ScatterplotsThe most useful graph for displaying the relationship between two quantitative variables.
CHAPTER 3 Describing Relationships
AP STATISTICS LESSON 3 – 1 EXAMINING RELATIONSHIPS SCATTER PLOTS.
Examining Relationships Prob. And Stat. CH.2.1 Scatterplots.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Warm-Up A trucking company determines that its fleet of trucks averages a mean of 12.4 miles per gallon with a standard deviation of 1.2 miles per gallon.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
The Big Picture Where we are coming from and where we are headed…
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Unit 3: Describing Relationships
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
Chapter 4 Describing Relationships. Section 4.1 Scatterplots and Correlation.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
Chapter 14 STA 200 Summer I Scatter Plots A scatter plot is a graph that shows the relationship between two quantitative variables measured on the.
Lecture 4 Chapter 3. Bivariate Associations. Objectives (PSLS Chapter 3) Relationships: Scatterplots and correlation  Bivariate data  Scatterplots (2.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
3. Relationships Scatterplots and correlation
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Daniela Stan Raicu School of CTI, DePaul University
The Practice of Statistics in the Life Sciences Fourth Edition
Chapter 3: Describing Relationships
Daniela Stan Raicu School of CTI, DePaul University
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Examining Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 4 - Scatterplots and Correlation
Chapter 3 Scatterplots and Correlation.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3 Examining Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Presentation transcript:

Chapter 4 Scatterplots and Correlation

Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots Adding categorical variables to scatterplots Measuring linear association: correlation r Facts about correlation

Explanatory and Response Variables Response variable measures an outcome of a study. An explanatory variable explains, influences or cause changes in a response variable. Independent variable and dependent variable. Be careful!! The relationship between two variables can be strongly influenced by other variables that are lurking in the background.

Explanatory and response variables Note: There is not necessary to have a cause-and- effect relationship between explanatory and response variables. Example 4.1(P. 80) Example. Cigarette smoking and lung cancer Example. Sales of personal computers and athletic shoes

Displaying relationships: Scatterplots –A scatterplot displays the relationship between two quantitative variables measured on the same individuals. –It is the most common way to display the relation between two quantitative variables. –It displays the form, direction, and strength of the relationship between two quantitative variables. –The values of one variable appear on the horizontal axis, and the values of the other variable appear on the vertical axis. Each individual in the data appears as the point in the plot fixed by the values of both variables for that individual.

Example 4.3 ( P.82 )

Interpreting scatterplots How to examine a scatterplot: –An overall pattern showing: The form, direction, and strength of the relationship –Outliers or other deviations from this pattern.

Interpreting scatterplots Overall Pattern –Form: Linear relationships, where the points show a straight-line pattern, are an important form of relationship between two variables. Curved relationships and clusters (a number of similar individuals that occur together) are other forms to watch for. –Directions: If the relationship has a clear direction, we speak of either positive association (the more the x, the more the y) or negative association (the more the x, the less the y). –Strength: The strength of a relationship is determined by how close the points in the scatterplot lie to a line.

Example 4.5 ( P.84 )

Adding categorical variables to scatterplots

Scatterplot & Correlation Scatterplots provide a visual tool for looking at the relationship between two variables. Unfortunately, our eyes are not good tools for judging the strength of the relationship. Changes in the scale or the amount of white space in the graph can easily change our judgment of the strength of the relationship. Correlation is a numerical measure we use to show the strength of linear association.

Measuring linear association: correlation r (The Pearson Product-Moment Correlation Coefficient or Correlation Coefficient) The correlation r measures the strength and direction of the linear association between two quantitative variables, usually labeled X and Y.

Facts about correlation What kind of variables do we use? –1. No distinction between explanatory and response variables. –2. Both variables should be quantitative Numerical properties –1. –2. r>0: positive association between variables –3. r<0: negative association between variables –4. If r =1or r = - 1, it indicates perfect linear relationship –5. As |r| is getting close to 1, much stronger relationship –6. Effected by a few outliers  not resistant. –7. It doesn’t describe curved relationships –8. Not easy to guess the value of r from the appearance of a scatter plot