Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Probability SECTION 11.1 Events and, or, if Disjoint Independent Law of total.

Slides:



Advertisements
Similar presentations
Warm-up Define the sample space of each of the following situations…
Advertisements

Chapter 5: Probability: What are the Chances?
Psychology 10 Analysis of Psychological Data February 26, 2014.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTION 10.3 Categorical variables Variable selection.
Probability Sample Space Diagrams.
Section 5.2 Probability Rules
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
1 Chapter 6: Probability— The Study of Randomness 6.1The Idea of Probability 6.2Probability Models 6.3General Probability Rules.
5.2B TWO-WAY TABLES, GENERAL ADDITION RULE AND VENN DIAGRAMS
Multiple Regression III 4/16/12 More on categorical variables Missing data Variable Selection Stepwise Regression Confounding variables Not in book Professor.
Chapter 15: Probability Rules
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Probability SECTIONS 11.1 Probability (11.1) Odds, odds ratio (not in book)
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Multiple Regression SECTIONS 10.1, 10.3 (?) Multiple explanatory variables.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 6: Probability: What are the Chances? Section 6.3 Conditional Probability.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/4/12 Bayesian Inference SECTION 11.1, 11.2 More probability rules (11.1)
HOMEWORK QUESTIONS?. 5.2 TWO-WAY TABLES PROBABILITY MODELS A probability model describes chance behavior by listing the possible outcomes in the sample.
Bayesian Inference I 4/23/12 Law of total probability Bayes Rule Section 11.2 (pdf)pdf Professor Kari Lock Morgan Duke University.
Section 5.3 Conditional Probability and Independence
CHAPTER 12: General Rules of Probability Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Copyright © 2010 Pearson Education, Inc. Chapter 15 Probability Rules!
Copyright © 2010 Pearson Education, Inc. Chapter 6 Probability.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTION 10.3 Variable selection Confounding variables.
Probability and Simulation Rules in Probability. Probability Rules 1. Any probability is a number between 0 and 1 0 ≤ P[A] ≤ 1 0 ≤ P[A] ≤ 1 2. The sum.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Probability SECTIONS 11.1, 11.2 Probability (11.1, 11.2) Odds, Odds Ratio.
Determine a method to simulate each of the following events:  If 15% of all AP statistics students get a “5” on the AP exam, how many would we need to.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/29/12 Probability SECTION 11.1 Events and, or, if Disjoint Independent.
+ Chapter 5 Probability: What Are the Chances? 5.1Randomness, Probability, and Simulation 5.2Probability Rules 5.3Conditional Probability and Independence.
Slide Slide 1 Survey 1)Locate the following countries on your map: Venezuela, Hungary, Syria, Vietnam, North Korea, Egypt, Pakistan, Guatemala, Germany,
Notes and Questions on Chapters 13,14,15. The Sample Space, denoted S, of an experiment is a listing of all possible outcomes.  The sample space of rolling.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Multiple Regression SECTIONS 10.1, 10.3 Multiple explanatory variables (10.1,
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 5: Probability: What are the Chances? Section 5.2 Probability Rules.
Probability 4/18/12 Probability Conditional probability Disjoint events Independent events Section 11.1 (pdf)pdf Professor Kari Lock Morgan Duke University.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 5: Probability: What are the Chances? Section 5.3 Conditional Probability.
AP Statistics Chapter 6 Section 3. Review of Probability Rules 1.0 < P(A) < 1 2.P(S) = 1 3.Complement Rule: For any event A, 4.Addition Rule: If A and.
+ Section 5.2 Probability Rules After this section, you should be able to… DESCRIBE chance behavior with a probability model DEFINE and APPLY basic rules.
AP Statistics From Randomness to Probability Chapter 14.
Probability Class Homework Check Assignment: Chapter 7 – Exercise 7.20, 7.29, 7.47, 7.48, 7.53 and 7.57 Reading: Chapter 7 – p
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Lecture 3 Probability By Aziza Munir.
Independent and Dependent Events
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Probability Models Section 6.2.
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 6: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Section 3.3: The Addition Rule
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 6: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Chapter 5: Probability: What are the Chances?
Presentation transcript:

Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Probability SECTION 11.1 Events and, or, if Disjoint Independent Law of total probability

Statistics: Unlocking the Power of Data Lock 5 Confounding Variables Once GDP is accounted for, electricity use is no longer a significant predictor of life expectancy. Even after accounting for GDP, cell phone subscriptions per capita is still a significant predictor of life expectancy. Once GDP is accounted for, electricity use is no longer a significant predictor of life expectancy.

Statistics: Unlocking the Power of Data Lock 5 Confounding Variables (review) Multiple regression is one potential way to account for confounding variables This is most commonly used in practice across a wide variety of fields, but is quite sensitive to the conditions for the linear model (particularly linearity) You can only account for confounding variables that you have data on, so it is still very hard to make true causal conclusions without a randomized experiment

Statistics: Unlocking the Power of Data Lock 5 Event An event is something that either happens or doesn’t happen, or something that either is true or is not true Examples: A randomly selected card is a Heart The response variable Y > 90 A randomly selected person is male It rains today

Statistics: Unlocking the Power of Data Lock 5 Probability The probability of event A, P(A), is the probability that A will happen Probability is always between 0 and 1 Probability always refers to an event P(A) = 1 means A will definitely happen P(A) = 0 means A will definitely not happen

Statistics: Unlocking the Power of Data Lock 5 Probability Examples Y = number of siblings. P(Y = 1) = (based on survey data) Y: final grade in STAT 101. P(Y > 90) = (based on last year’s class) P(Gender = male) = (for Duke students, P(it rains today) = 0.3 (

Statistics: Unlocking the Power of Data Lock 5 Sexual Orientation What are the sexual orientation demographics of American adults? We need data! Data collected in 2009 on a random sample of American adults (National Survey of Sexual Health and Behavior)

Statistics: Unlocking the Power of Data Lock 5 MaleFemaleTotal Heterosexual Homosexual Bisexual Other Total Sexual Orientation Herbenick D, Reece M, Schick V, Sanders SA, Dodge B, and Fortenberry JD (2010). Sexual behavior in the United States: Results from a national probability sample of men and women ages 14–94. Journal of Sexual Medicine;7(suppl 5):255–265.

Statistics: Unlocking the Power of Data Lock 5 MaleFemaleTotal Heterosexual Homosexual Bisexual Other Total Sexual Orientation What is the probability that an American adult is homosexual? a)128/5042 = b)128/4673 = c)105/2521 = 0.04 d)I got a different answer

Statistics: Unlocking the Power of Data Lock 5 Two Events P(A and B) is the probability that both events A and B will happen P(A or B) is the probability that either event A or event B will happen

Statistics: Unlocking the Power of Data Lock 5 B A and B A A or B: all color Two Events

Statistics: Unlocking the Power of Data Lock 5 Venn Diagram

Statistics: Unlocking the Power of Data Lock 5 Venn Diagram

Statistics: Unlocking the Power of Data Lock 5 MaleFemaleTotal Heterosexual Homosexual Bisexual Other Total Sexual Orientation What is the probability that an American adult is male and homosexual? a)105/128 = 0.82 b)105/2521 = 0.04 c)105/5042 = d)I got a different answer

Statistics: Unlocking the Power of Data Lock 5 MaleFemaleTotal Heterosexual Homosexual Bisexual Other Total Sexual Orientation What is the probability that an American adult is female or bisexual? a)2679/5042 = b)2587/5042 = c)92/2521 = d)I got a different answer

Statistics: Unlocking the Power of Data Lock 5 P(A or B) B A

Statistics: Unlocking the Power of Data Lock 5 MaleFemaleTotal Heterosexual Homosexual Bisexual Other Total Sexual Orientation What is the probability that an American adult is not heterosexual? a)369/5042 = b)2587/5042 = c)92/2521 = d)I got a different answer

Statistics: Unlocking the Power of Data Lock 5 P(not A)

Statistics: Unlocking the Power of Data Lock 5 Caffeine Based on last year’s survey data, 52% of students drink caffeine in the morning, 48% of students drink caffeine in the afternoon, and 37% drink caffeine in the morning and the afternoon. What percent of students do not drink caffeine in the morning or the afternoon? a)63% b)37% c)100% d)50% P( not(morning or afternoon) = 1 – P(morning or afternoon) = 1 – [P(morning)+P(afternoon) – P(morning and afternoon)] = 1 – [ – 0.37] = 1 – 0.63 = 0.37

Statistics: Unlocking the Power of Data Lock 5 Conditional Probability P(A if B) is the probability of A, if we know B has happened This is read in multiple ways: “probability of A if B” “probability of A given B” “probability of A conditional on B” You may also see this written as P(A | B)

Statistics: Unlocking the Power of Data Lock 5 MaleFemaleTotal Heterosexual Homosexual Bisexual Other Total Sexual Orientation What is the probability that an American adult male is homosexual? a)105/128 = 0.82 b)105/2521 = 0.04 c)105/5042 = d)I got a different answer P(homosexual if male)

Statistics: Unlocking the Power of Data Lock 5 MaleFemaleTotal Heterosexual Homosexual Bisexual Other Total Sexual Orientation What is the probability that an American adult homosexual is male? a)105/128 = 0.82 b)105/2521 = 0.04 c)105/5042 = d)I got a different answer P(male if homosexual)

Statistics: Unlocking the Power of Data Lock 5 Conditional Probability P(homosexual if male) = 0.04 P(male if homosexual) = 0.82

Statistics: Unlocking the Power of Data Lock 5 Conditional Probability A B

Statistics: Unlocking the Power of Data Lock 5 Caffeine Based on last year’s survey data, 52% of students drink caffeine in the morning, 48% of students drink caffeine in the afternoon, and 37% drink caffeine in the morning and the afternoon. What percent of students who drink caffeine in the morning also drink caffeine in the afternoon? a)77% b)37% c)71% P( afternoon if morning) = P(afternoon and morning)/P(morning) = 0.37/0.52 = 0.71

Statistics: Unlocking the Power of Data Lock 5 Helpful Tip If the table problems are easier for your than the sentence problems, try to first convert what you know into a table. 52% of students drink caffeine in the morning, 48% of students drink caffeine in the afternoon, and 37% drink caffeine in the morning and the afternoon Caffeine Afternoon No Caffeine Afternoon Total Caffeine Morning No Caffeine Morning Total P( afternoon if morning) = 37/52 = 0.71

Statistics: Unlocking the Power of Data Lock 5 P(A and B)

Statistics: Unlocking the Power of Data Lock 5 Duke Rank and Experience 60% of STAT 101 students rank their Duke experience as “Excellent,” and Duke was the first choice school for 59% of those who ranked their Duke experience as excellent. What percentage of STAT 101 students had Duke as a first choice and rank their experience here as excellent? a)60% b)59% c)35% d)41% P( first choice and excellent) = P(first choice if excellent)P(excellent) = 0.59  0.60 = 0.354

Statistics: Unlocking the Power of Data Lock 5 Summary

Statistics: Unlocking the Power of Data Lock 5 Disjoint Events Events A and B are disjoint or mutually exclusive if only one of the two events can happen Think of two events that are disjoint, and two events that are not disjoint.

Statistics: Unlocking the Power of Data Lock 5 Disjoint Events If A and B are disjoint, then a)P(A or B) = P(A) + P(B) b)P(A and B) = P(A)P(B) P(A or B) = P(A) + P(B) – P(A and B) If A and B are disjoint, then both cannot happen, so P(A and B) = 0.

Statistics: Unlocking the Power of Data Lock 5 P(A or B) B A B A

Statistics: Unlocking the Power of Data Lock 5 Independence Events A and B are independent if P(A if B) = P(A). Intuitively, knowing that event B happened does not change the probability that event A happened. Think of two events that are independent, and two events that are not independent.

Statistics: Unlocking the Power of Data Lock 5 Independent Events If A and B are independent, then a)P(A or B) = P(A) + P(B) b)P(A and B) = P(A)P(B) P( A and B) = P(A if B)P(B) If A and B are independent, then P(A if B) = P(A), so P(A and B) = P(A)P(B)

Statistics: Unlocking the Power of Data Lock 5 P(A and B)

Statistics: Unlocking the Power of Data Lock 5 Disjoint and Independent Assuming that P(A) > 0 and P(B) > 0, then disjoint events are a)Independent b)Not independent c)Need more information to determine whether the events are also independent If A and B are disjoint, then A cannot happen if B has happened, so P(A if B) = 0. If P(A) > 0, then P(A if B) ≠ P(A) so A and B are not independent.

Statistics: Unlocking the Power of Data Lock 5 To Do Read 11.1 Read The Bayesian Heresy for next classThe Bayesian Heresy Do Project 9 (due Wednesday, 4/23) Do Homework 9 (due Wednesday, 4/23)