Relationships Can Be Deceiving Statistics lecture 5.

Slides:



Advertisements
Similar presentations
The Question of Causation YMS3e 4.3:Establishing Causation AP Statistics Mr. Molesky.
Advertisements

Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Can Be Deceiving Chapter 11.
Aim: How do we establish causation?
Section 7.2 ~ Interpreting Correlations Introduction to Probability and Statistics Ms. Young ~ room 113.
AP Statistics Causation & Relations in Categorical Data.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Correlation: Relationships Can Be Deceiving. The Impact Outliers Have on Correlation An outlier that is consistent with the trend of the rest of the data.
LSP 121 Introduction to Correlation. Correlation The news is filled with examples of correlation – If you eat so many helpings of tomatoes… – One alcoholic.
Correlation: Relationships Can Be Deceiving. An outlier is a data point that does not fit the overall trend. Speculate on what influence outliers have.
Research Methods How adolescent development and behavior is studied.
Scatterplots By Wendy Knight. Review of Scatterplots  Scatterplots – Show the relationship between 2 quantitative variables measured on the same individual.
Describing Relationships: Scatterplots and Correlation
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Research Methods Case studies Correlational research
Chapter 4 Section 3 Establishing Causation
The Question of Causation
HW#9: read Chapter 2.6 pages On page 159 #2.122, page 160#2.124,
1 10. Causality and Correlation ECON 251 Research Methods.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Can Be Deceiving Chapter 11.
Section 7.3 ~ Best-Fit Lines and Prediction Introduction to Probability and Statistics Ms. Young.
STEVE DOIG CRONKITE SCHOOL OF JOURNALISM Statistics for Science Journalists.
Chapter 151 Describing Relationships: Regression, Prediction, and Causation.
Copyright © 2014 Pearson Education, Inc. All rights reserved Chapter 4 Regression Analysis: Exploring Associations between Variables.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Warm-Up Aug. 28th Grab a bias worksheet off the stool and begin reading the scenarios. Also make sure to grab other materials (workbook, spiral.
Chapter 15 Describing Relationships: Regression, Prediction, and Causation Chapter 151.
Essential Statistics Chapter 41 Scatterplots and Correlation.
Introduction to Correlation.  Correlation – when a relationship exists between two sets of data  The news is filled with examples of correlation ◦ If.
Chapter 151 Describing Relationships: Regression, Prediction, and Causation.
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113.
Does Association Imply Causation? Sometimes, but not always! What about: –x=mother's BMI, y=daughter's BMI –x=amt. of saccharin in a rat's diet, y=# of.
Chapter 4 Scatterplots and Correlation. Explanatory and Response Variables u Interested in studying the relationship between two variables by measuring.
More about Correlation
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Get out your Residuals Worksheet! You will be able to distinguish between correlation and causation. Today’s Objectives:
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Non-Experimental Design Where are the beakers??. What kind of research is considered the “gold standard” by the Institute of Education Sciences? A.Descriptive.
Research Methods How adolescent development and behavior is studied.
Chapter 4 Day Six Establishing Causation. Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming.
Cautions About Correlation and Regression Section 4.2.
4.1 Statistics Notes Should We Experiment or Should We Merely Observe?
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 8 (3-4), 9 More about Correlation. Today’s Lecture l SD Line l Calculating r l correlation vs causation.
Prediction and Causation How do we predict a response? Explanatory Variables can be used to predict a response: 1. Prediction is based on fitting a line.
CORRELATION RESEARCH / STUDIES. Correlation and Research In correlation studies, researchers observe or measure a relationship between variables in which.
Copyright ©2011 Brooks/Cole, Cengage Learning Gathering Useful Data for Examining Relationships Observation VS Experiment Chapter 6 1.
The Question of Causation 4.2:Establishing Causation AP Statistics.
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
Chapter 5: 02/17/ Chapter 5 Regression. 2 Chapter 5: 02/17/2004 Objective: To quantify the linear relationship between an explanatory variable (x)
Essential Statistics Chapter 41 Scatterplots and Correlation.
2.7 The Question of Causation
Cautions About Correlation and Regression Section 4.2
CHAPTER 3 Describing Relationships
Given random data, we look for order and meaningful patterns.
7.2 Interpreting Correlations
Cautions about Correlation and Regression
Daniela Stan Raicu School of CTI, DePaul University
7.2 Interpreting Correlations
Chapter 2 Looking at Data— Relationships
Basic Practice of Statistics - 3rd Edition
7.2 Interpreting Correlations
Daniela Stan Raicu School of CTI, DePaul University
The Question of Causation
Examining Relationships
Basic Practice of Statistics - 3rd Edition Regression
Essential Statistics Scatterplots and Correlation
Does Association Imply Causation?
OVERALL LEADING CAUSES OF DEATH IN THE USA
Section 6.2 Establishing Causation
Basic Practice of Statistics - 3rd Edition Lecture Powerpoint
Presentation transcript:

Relationships Can Be Deceiving Statistics lecture 5

Goals for Lecture 5 Recognize when correlation can be misleading Realize reasons why two variables may be related, without cause-and-effect Understand non-statistical considerations that can help establish a causal relationship

Thought Question 1 For each of these, is the correlation higher or lower than it would have been without the outlier?

Thought Question 2 There is a strong correlation in Lisbon between weekly sales of hot castanhas and weekly sales of tecidos para espirra. Does this mean that castanhas cause people to espirrar?

Thought Question 3 Research has found that countries with higher average fat intake tend to have higher breast cancer rates. Does this provide evidence that dietary fat is a contributing cause of breast cancer?

Problems with Correlations Outliers can inflate or deflate correlations Groups combined inappropriately may mask relationships

With Outliers

Without Outliers

Hours Worked vs. Annual Earnings r = +.53

Hours Worked vs. Annual Earnings r = +.53

Hours Worked vs. Annual Earnings r = +.39

Combining Groups Can Deceive Class correlation of weight to height: r =.69 Men’s correlation of weight to height: r =.58 Women’s correlation of weight to height: r =.21

Combining groups

More combining groups

Remember! Correlation does not imply causation. (Igrejas and liquor stores, shoe size and reading ability)

Correlation of variables When considering relationships between measurement variables, there are two kinds: Explanatory (or independent) variable: The variable that attempts to explain or is purported to cause (at least partially) differences in the… Response (or dependent or outcome) variable Often, chronology is a guide to distinguishing them (examples: baldness and heart attacks, poverty and test scores)

Some reasons why two variables could be related The explanatory variable is the direct cause of the response variable

Some reasons why two variables could be related The explanatory variable is the direct cause of the response variable Example: pollen counts and percent of population suffering allergies, intercourse and babies

Some reasons two variables could be related The response variable actually is causing a change in the explanatory variable

Some reasons two variables could be related The response variable is causing a change in the explanatory variable Example: hotel occupancy and advertising spending, divorce and alcohol abuse

Some reasons two variables could be related The explanatory variable is a contributing -- but not sole -- cause

Some reasons two variables could be related The explanatory variable is a contributing -- but not sole -- cause Example: birth complications and violence, gun in home and homicide, hours studied and grade, diet and cancer

Some reasons two variables could be related Confounding variables may exist

Some reasons two variables could be related Confounding variables may exist Example: happiness and heart disease, traffic deaths and speed limits

Some reasons two variables could be related Both variables may result from a common cause

Some reasons two variables could be related Both variables may result from a common cause Example: SAT score and GPA, hot chocolate and tissues, storks and babies, fire losses and firefighters, WWII fighter opposition and bombing accuracy

Some reasons two variables could be related Both variables are changing over time

Some reasons two variables could be related Both variables are changing over time Example: divorces and drug offenses, divorces and suicides

Some reasons two variables could be related The association may be nothing more than coincidence

Some reasons two variables could be related The association may be nothing more than coincidence Example: clusters of disease, brain cancer from cell phones

So how can we confirm causation? The only way to confirm is with a designed experiment. But non-statistical evidence of a possible connection may include: A reasonable explanation of cause and effect. A connection that happens under varying conditions. Potential confounding variables ruled out.

Why? Orchestra conductors tend to live long lives. Fewer accidents after speed limits were lowered in 1973 due to the oil embargo. In the week before the 1994 Northridge earthquake, 149 were admitted for heart attacks. In the week after there were 201.

PERGUNTAS?