10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.

Slides:



Advertisements
Similar presentations
Multivariate Relationships Goal: Show a causal relationship between two variables (X  Y) Elements of a cause-and-effect relationship: –Association between.
Advertisements

Three or more categorical variables
Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Soc 3306a Lecture 6: Introduction to Multivariate Relationships Control with Bivariate Tables Simple Control in Regression.
Correlation & Regression Chapter 10. Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section.
Agresti/Franklin Statistics, 1 of 52 Chapter 3 Association: Contingency, Correlation, and Regression Learn …. How to examine links between two variables.
Measures of Association Quiz
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Correlation AND EXPERIMENTAL DESIGN
Chapter 6: Correlational Research Examine whether variables are related to one another (whether they vary together). Correlation coefficient: statistic.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.4 Cautions in Analyzing.
AP Statistics Causation & Relations in Categorical Data.
2.4 Cautions about Correlation and Regression. Residuals (again!) Recall our discussion about residuals- what is a residual? The idea for line of best.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Correlation: Relationships Can Be Deceiving. The Impact Outliers Have on Correlation An outlier that is consistent with the trend of the rest of the data.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Designing Experiments In designing experiments we: Manipulate the independent.
Correlation: Relationships Can Be Deceiving. An outlier is a data point that does not fit the overall trend. Speculate on what influence outliers have.
BPSChapter 61 Two-Way Tables. BPSChapter 62 To study associations between quantitative variables  correlation & regression (Ch 4 & Ch 5) To study associations.
Nemours Biomedical Research Statistics April 2, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Review of Statistics 101 We review some important themes from the course 1.Introduction Statistics- Set of methods for collecting/analyzing data (the art.
Association vs. Causation
Presentation 12 Chi-Square test.
Chapter 15 – Elaborating Bivariate Tables
Chapter 9 Comparing Two Groups
Categorical Variables, Relative Risk, Odds Ratios STA 220 – Lecture #8 1.
Soc 3306a Lecture 10: Multivariate 3 Types of Relationships in Multiple Regression.
Simple Linear Regression
Multiple Regression. In the previous section, we examined simple regression, which has just one independent variable on the right side of the equation.
1 10. Causality and Correlation ECON 251 Research Methods.
The Practice of Statistics
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Moderation & Mediation
Today's topics ● Causal thinking, theories, hypotheses ● Independent and dependent variables; forms of relationships ● Formulating hypothesis; hypothesis.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.1 Using Several Variables to Predict a Response.
Soc 3306a Multiple Regression Testing a Model and Interpreting Coefficients.
CH. 3 Day 4 AP Statistics EQ: What is Simpson’s Paradox?
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
HW#8: Chapter 2.5 page Complete three questions on the last two slides.
October 15. In Chapter 19: 19.1 Preventing Confounding 19.2 Simpson’s Paradox 19.3 Mantel-Haenszel Methods 19.4 Interaction.
7. Comparing Two Groups Goal: Use CI and/or significance test to compare means (quantitative variable) proportions (categorical variable) Group 1 Group.
Section 7.4 ~ The Search for Causality Introduction to Probability and Statistics Ms. Young.
Chapter 4 More on Two-Variable Data “Each of us is a statistical impossibility around which hover a million other lives that were never destined to be.
Chapter 2 Notes Math Math 1680 Assignments Look over Chapter 1 and 2 before Wednesday Assignment #2: Chapter 2 Exercise Set A (all, but #7, 8, and.
1 Introduction to Research Methods How we come to know about crime.
Professor B. Jones University of California, Davis.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Feb. 13 Chapter 12, Try 1-9 Read Ch. 15 for next Monday No meeting Friday.
© Department of Statistics 2012 STATS 330 Lecture 30: Slide 1 Stats 330: Lecture 30.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Chapter 3: Descriptive Study of Bivariate Data. Univariate Data: data involving a single variable. Multivariate Data: data involving more than one variable.
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.
1 ES9 A random sample of registered voters was selected and each was asked his or her opinion on Proposal 129, a property tax reform bill. The distribution.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 3 Multivariate analysis.
RESEARCH METHODS Lecture 32. The parts of the table 1. Give each table a number. 2. Give each table a title. 3. Label the row and column variables, and.
2.7 The Question of Causation
Presentation 12 Chi-Square test.
Two-Way Tables and The Chi-Square Test*
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Chapter 2 Looking at Data— Relationships
Chapter 13 Multiple Regression
I. Introduction and Data Collection C. Conducting a Study
Chapter 4: More on Two-Variable Data
Multivariate Relationships
Presentation transcript:

10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory variables have an influence on any particular response variable. The effect of an explanatory variable on a response variable may change when we take into account other variables.

Example: Y = whether admitted into grad school at U. California, Berkeley (for the 6 largest departments) X = gender Whether admitted Gender Yes No Total %yes Female Male Difference of sample proportions = … There is very strong evidence of a higher probability of admission for men than for women.

Now let X 1 = gender and X 2 = department to which the person applied. e.g., for Department A, Whether admitted Gender Yes No Total %yes Female % Male % Now,  2 = (df = 1), but difference is …. T he strong evidence is that there is a higher probability of being admitted for than. What happens with other departments?

Female Male Difference of Dept. Total %admitted Total %admitted proportions  2 A % % B 25 68% % C % % D % % E % % F 341 7% 273 6% Total % % There are 6 “partial tables,” which summed give the original “bivariate” table. How can the partial table results be so different from the bivariate table?

Partial tables – display association between two variables at fixed levels of a “control variable.” Example: Previous page shows results from partial tables relating gender to whether admitted, controlling for (keeping constant) department When control variable X 2 is kept constant, association between Y and X 1 is not due to the association of each of them with X 2. Note: When each pair of variables is associated, then a bivariate association for two variables may differ from its partial association, controlling for the other variable.

Example: Y = whether admitted is associated with X 1 = gender, but each of these itself associated with X 2 = department. Department associated with gender: Department associated with whether admitted: Moral: Association does not imply causation! This is true for quantitative and categorical variables.

With observational data, effect of X on Y may be partly due to association of X and Y with lurking variables. (e.g., association between X = whether use marijuana and Y = GPA; other variables?) Causation difficult to assess with observational studies, unlike experimental studies that can control potential lurking variables (by randomization). When X 1 and X 2 both have effects on Y but are also associated with each other, there is said to be confounding. It’s difficult to determine whether either truly causes Y, because a variable’s effect could be partly due to its association with the other variable.

Simpson’s paradox It is possible for the (bivariate) association between two variables to be positive, yet be negative at each fixed level of a third variable. (see scatterplot) Example: Florida countywide data There is a positive correlation between crime rate and education! There is a negative correlation between crime rate and education at each level of urbanization

Types of Multivariate Relationships Spurious association: Y and X 1 both depend on X 2 and association disappears after controlling X 2 Example:

Multiple causes – A variety of factors have influences on the response (most common in practice) In observational studies, usually all (or nearly all) explanatory variables have associations among themselves as well as with response var. Effect of any one changes depending on which other var’s are controlled (statistically), often because it has a direct effect and also indirect effects through other variables. Example: What causes Y = juvenile delinquency?

Statistical interaction – Effect of X 1 on Y changes as the level of X 2 changes. Example: Effect of whether a smoker (yes, no) on whether have lung cancer (yes, no) changes as value of age changes Example: U.S. median annual income by race and gender Race Gender Black White Female Male

The difference in median income between whites and blacks is: $ for females, $ for males i.e., the effect of race on income depends on gender (and the effect of gender on income depends on race), so there is interaction between race and gender in their effects on income. Example (p. 311): X = number of years of education Y = annual income (1000’s of dollars) Suppose E(Y) = for men E(Y) = for women