# Correlation and Causation

## Presentation on theme: "Correlation and Causation"— Presentation transcript:

Correlation and Causation
By Melissa Storm

Correlation A statistical measurement of a linear relationship between two variables Correlation is considered positive when the values increase together Correlation is negative when as one value increases the other decreases Storm

Causation Causation is the process in which the change in one variable causes a change in another variable Can also be considered as a cause and effect relationship Example: When the variable “hours worked” increases, the variable “income earned” also increases. This is a direct relationship where as the “hours worked” changes, the “income earned” will always change with it. Storm

Correlation vs Causation
Correlation does not imply causation Two events may occur together but that does not mean that one event causes the other Example: Research shows a strong positive correlation between ice cream sales and crime. Does this mean that buying ice cream causes crime? Or is there some hidden factor, like warm weather, that perhaps could be causing both? Google Correlate Storm

Correlation vs Causation
Storm

Correlation vs Causation
Storm

Coincidence Coincidence occurs when two variables are correlated but one does not cause the other This situation happens when no confounding variable can be found that explains the correlation and the two variables just happened to be correlated randomly Example: Global average temperature and number of pirates are positively correlated. This, however, does not imply that the number of pirates cause temperature rises or that temperature rises cause an increase in the number of pirates. This is merely a coincidence. Storm

Confounding Variable A confounding variable is a variable that may not have been considered in the model when two variables are found to be correlated. A confounding variable will be correlated with both of the other variables Going back to the ice cream/crime example, warm weather would be considered the confounding variable. As the weather gets warmer, both crime and ice cream sales increase. Storm

Chain Reaction A chain reaction occurs when X causes Y indirectly. Usually this implies that there is another variable, or several variables, involved in a chain reaction between X and Y Example: Amount of precipitation might be highly correlated with the price of produce. This is due to the chain reaction where the amount of precipitation affects the quality and amount of produce that is produced therefore affecting the price of the produce. Storm

Reverse Causation Reverse causation occurs when a change in the response variable causes a change in the independent variable Example: If the response variable is the crime rate and one of the independent variables is the number of police officers, it would be incorrect to assume that as the number of police officers increases so does the crime rate. In this situation, the number of police officers increases because the crime rate increases. Storm

Bidirectional Causation
Bidirectional Causation occurs when variable A effects B, and B also effects A Example: A job interview (A) in the afternoon will effect the anxiety level in the morning (B), and also the level of anxiety in the morning (B) will effect the job interview in the afternoon (A). Storm

Control Group A control group is used to reduce the impact of unknown external variables making it easier to identify causation Example: In clinical trials, there is always a treatment group and a control group. If both groups get better, we can assume that some external factor is causing the improvement, and not the treatment under consideration. If only the treatment group gets better, it is safe to assume that the treatment is what is causing the patients to get better. Storm

Double Blinding Double blinding is used to repress the placebo effect in clinical trials The placebo effect is where patients believe they are getting better, so they actually do In double blinding, neither the patient nor the doctor administering the treatment know whether they are receiving the treatment or the control. Double blinding helps researchers to pinpoint the cause of symptom relief Storm

Example: Correlation in Law
A man that is bald uses a treatment that promises to make him grow hair. When the man does not grow any hair, he decides to conduct a study by collecting information about the amount of hair men have and whether or not they have used the treatment. His results show a correlation between baldness and use of the treatment. He decides to take this to court and sue the company for selling a product that does not work. The people who sell the treatment use correlation vs. causation as their argument. People who use the treatment have less hair to begin with which is why they are using the treatment in the first place. Storm

Example: Correlation in Sports
In basketball, it has been found that there is a high correlation between winning percentage and free throw attempts. This does not, however, imply that free throw attempts lead to winning the game. This is an example of reverse causation, when a team is losing the game they are more likely to foul the other team in order to attempt to get the ball back at the end of a game. Storm

Example: Correlation in Health
People who are overweight are more likely to be ingesting diet foods which use sugar replacements in order to make them fat free. A study would show that there is a strong positive correlation between the amount of diet food a person eats and how overweight they are. This does not however imply that the sugar replacements in the diet foods are making people overweight. People who are overweight are just more likely to be eating diet foods. Storm

Example: Correlation in Politics
There is a strong positive correlation between the GDP and government spending. Democrats insist that increased government spending leads to a higher GDP because increased government spending leads to an increase in the demand for goods and services. Republicans disagree and state that a higher GDP is what leads to more government spending because when GDP increases, the government takes in more tax revenue. Storm

Correlation vs. Causation in the News
Storm

Pearson r Correlation Measure of the linear correlation between X and Y giving some value between 1 and -1 Assumptions: Normally distributed Linearity Homoscedasticity Storm

Kendall Rank Correlation
A non-parametric test that measures the strength of dependence between two variables Concordant: when a pair of observations ( 𝑥 𝑖 , 𝑦 𝑖 ) and ( 𝑥 𝑗 , 𝑦 𝑗 ) have the same rank (i.e. 𝑥 𝑖 < 𝑥 𝑗 𝑎𝑛𝑑 𝑦 𝑖 < 𝑦 𝑗 𝑜𝑟 𝑥 𝑖 > 𝑥 𝑗 𝑎𝑛𝑑 𝑦 𝑖 > 𝑦 𝑗 ) Discordant: when a pair of observations have different ranks (i.e. 𝑥 𝑖 < 𝑥 𝑗 𝑎𝑛𝑑 𝑦 𝑖 > 𝑦 𝑗 𝑜𝑟 𝑥 𝑖 > 𝑥 𝑗 𝑎𝑛𝑑 𝑦 𝑖 < 𝑦 𝑗 ) Storm

Spearman Rank Correlation
Non-parametric test that measures the degree of association between two variables No assumptions about the distribution Ordinal data Storm

Correlation in Software
SAS: proc corr data=DATA pearson spearman kendall; var VARS; run; R: cor(DATA, use=all.obs/complete.obs/pairwise.complete.obs, method=pearson/spearman/kendall) SPSS: Analyze->Correlate->Bivariate->Pearson/Kendall/Spearman Minitab: Stat->Basic Stats->Correlation->enter variables Storm

Sources https://www.mathsisfun.com/data/correlation.html
spearman/ questions/10-correlations-that-are-not-causations.htm Storm

The End! Storm