Introduction to Probability Theory Rong Jin. Outline  Basic concepts in probability theory  Bayes’ rule  Random variable and distributions.

Slides:



Advertisements
Similar presentations
Bayes rule, priors and maximum a posteriori
Advertisements

Lecture Discrete Probability. 5.3 Bayes’ Theorem We have seen that the following holds: We can write one conditional probability in terms of the.
Lecture 4A: Probability Theory Review Advanced Artificial Intelligence.
Review of Probability. Definitions (1) Quiz 1.Let’s say I have a random variable X for a coin, with event space {H, T}. If the probability P(X=H) is.
Acknowledgement: Thanks to Professor Pagano
Week11 Parameter, Statistic and Random Samples A parameter is a number that describes the population. It is a fixed number, but in practice we do not know.
Lecture (7) Random Variables and Distribution Functions.
ฟังก์ชั่นการแจกแจงความน่าจะเป็น แบบไม่ต่อเนื่อง Discrete Probability Distributions.
Random Variables and Expectation. Random Variables A random variable X is a mapping from a sample space S to a target set T, usually N or R. Example:
Lec 18 Nov 12 Probability – definitions and simulation.
Chapter 4 Discrete Random Variables and Probability Distributions
Chapter 1 Probability Theory (i) : One Random Variable
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 4-1 Introduction to Statistics Chapter 5 Random Variables.
1 Midterm Review Econ 240A. 2 The Big Picture The Classical Statistical Trail Descriptive Statistics Inferential Statistics Probability Discrete Random.
Machine Learning CMPT 726 Simon Fraser University CHAPTER 1: INTRODUCTION.
Probability Distributions
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
Basic Probability and Statistics CIS 8590 – Fall 2008 NLP 1.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Parameters and Statistics Probabilities The Binomial Probability Test.
Discrete Probability Distributions
1 Discrete Probability Hsin-Lung Wu Assistant Professor Advanced Algorithms 2008 Fall.
Relevance Feedback Users learning how to modify queries Response list must have least some relevant documents Relevance feedback `correcting' the ranks.
Chapter 9 Introducing Probability - A bridge from Descriptive Statistics to Inferential Statistics.
PPT ON BUSINESS STATISTICS
Chapter 5 Discrete Probability Distribution I. Basic Definitions II. Summary Measures for Discrete Random Variable Expected Value (Mean) Variance and Standard.
Prof. SankarReview of Random Process1 Probability Sample Space (S) –Collection of all possible outcomes of a random experiment Sample Point –Each outcome.
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
Welcome To B ASIC C ONCEPT O F P ROBABILITY D EPARTMENT OF S TATISTICS D R. M D. M ESBAHUL A LAM, A SSOCIATE P ROFESSOR Tel:
Sets, Combinatorics, Probability, and Number Theory Mathematical Structures for Computer Science Chapter 3 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesProbability.
Sets, Combinatorics, Probability, and Number Theory Mathematical Structures for Computer Science Chapter 3 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesProbability.
1 Intro to Probability Zhi Wei 2 Outline  Basic concepts in probability theory  Random variable and probability distribution  Bayes ’ rule.
PBG 650 Advanced Plant Breeding
CSE 531: Performance Analysis of Systems Lecture 2: Probs & Stats review Anshul Gandhi 1307, CS building
ENGG 2040C: Probability Models and Applications Andrej Bogdanov Spring Random variables part two.
Engineering Probability and Statistics Dr. Leonore Findsen Department of Statistics.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
COMP 170 L2 L17: Random Variables and Expectation Page 1.
Chapter 5: Probability Analysis of Randomized Algorithms Size is rarely the only property of input that affects run time Worst-case analysis most common.
Week 21 Conditional Probability Idea – have performed a chance experiment but don’t know the outcome (ω), but have some partial information (event A) about.
Math b (Discrete) Random Variables, Binomial Distribution.
Probability Review-1 Probability Review. Probability Review-2 Probability Theory Mathematical description of relationships or occurrences that cannot.
Sixth lecture Concepts of Probabilities. Random Experiment Can be repeated (theoretically) an infinite number of times Has a well-defined set of possible.
IE 300, Fall 2012 Richard Sowers IESE. 8/30/2012 Goals: Rules of Probability Counting Equally likely Some examples.
Probability (outcome k) = Relative Frequency of k
Probability Theory, Bayes’ Rule & Random Variable Lecture 6.
Welcome to MM305 Unit 3 Seminar Prof Greg Probability Concepts and Applications.
1 Part Three: Chapters 7-9 Performance Modeling and Estimation.
Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Random Variables Lecture Lecturer : FATEN AL-HUSSAIN.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate accuracy.
Conditional Probability 423/what-is-your-favorite-data-analysis-cartoon 1.
4. Overview of Probability Network Performance and Quality of Service.
Introduction to Probability Theory
Welcome to MM305 Unit 3 Seminar Dr
Introduction to Probability Theory
Review of Probabilities and Basic Statistics
Econ 113 Lecture Module 2.
Discrete Probability Distributions
Daniela Stan Raicu School of CTI, DePaul University
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Probability Probability underlies statistical inference - the drawing of conclusions from a sample of data. If samples are drawn at random, their characteristics.
I flip a coin two times. What is the sample space?
CS590I: Information Retrieval
Daniela Stan Raicu School of CTI, DePaul University
Sets, Combinatorics, Probability, and Number Theory
Presentation transcript:

Introduction to Probability Theory Rong Jin

Outline  Basic concepts in probability theory  Bayes’ rule  Random variable and distributions

Definition of Probability  Experiment: toss a coin twice  Sample space: possible outcomes of an experiment S = {HH, HT, TH, TT}  Event: a subset of possible outcomes A={HH}, B={HT, TH}  Probability of an event : an number assigned to an event Pr(A) Axiom 1: Pr(A)  0 Axiom 2: Pr(S) = 1 Axiom 3: For every sequence of disjoint events Example: Pr(A) = n(A)/N: frequentist statistics

Joint Probability  For events A and B, joint probability Pr(AB) stands for the probability that both events happen.  Example: A={HH}, B={HT, TH}, what is the joint probability Pr(AB)?

Independence  Two events A and B are independent in case Pr(AB) = Pr(A)Pr(B)  A set of events {A i } is independent in case

Independence  Two events A and B are independent in case Pr(AB) = Pr(A)Pr(B)  A set of events {A i } is independent in case  Example: Drug test WomenMen Success20019 Failure18001 A = {A patient is a Women} B = {Drug fails} Will event A be independent from event B ?

Independence  Consider the experiment of tossing a coin twice  Example I: A = {HT, HH}, B = {HT} Will event A independent from event B?  Example II: A = {HT}, B = {TH} Will event A independent from event B?  Disjoint  Independence  If A is independent from B, B is independent from C, will A be independent from C?

 If A and B are events with Pr(A) > 0, the conditional probability of B given A is Conditioning

 If A and B are events with Pr(A) > 0, the conditional probability of B given A is  Example: Drug test Conditioning WomenMen Success20019 Failure18001 A = {A patient is a Women} B = {Drug fails} Pr(B|A) = ? Pr(A|B) = ?

 If A and B are events with Pr(A) > 0, the conditional probability of B given A is  Example: Drug test  Given A is independent from B, what is the relationship between Pr(A|B) and Pr(A)? Conditioning WomenMen Success20019 Failure18001 A = {Patient is a Women} B = {Drug fails} Pr(B|A) = ? Pr(A|B) = ?

Which Drug is Better ?

Simpson’s Paradox: View I Drug IDrug II Success Failure A = {Using Drug I} B = {Using Drug II} C = {Drug succeeds} Pr(C|A) ~ 10% Pr(C|B) ~ 50% Drug II is better than Drug I

Simpson’s Paradox: View II Female Patient A = {Using Drug I} B = {Using Drug II} C = {Drug succeeds} Pr(C|A) ~ 20% Pr(C|B) ~ 5%

Simpson’s Paradox: View II Female Patient A = {Using Drug I} B = {Using Drug II} C = {Drug succeeds} Pr(C|A) ~ 20% Pr(C|B) ~ 5% Male Patient A = {Using Drug I} B = {Using Drug II} C = {Drug succeeds} Pr(C|A) ~ 100% Pr(C|B) ~ 50%

Simpson’s Paradox: View II Female Patient A = {Using Drug I} B = {Using Drug II} C = {Drug succeeds} Pr(C|A) ~ 20% Pr(C|B) ~ 5% Male Patient A = {Using Drug I} B = {Using Drug II} C = {Drug succeeds} Pr(C|A) ~ 100% Pr(C|B) ~ 50% Drug I is better than Drug II

Conditional Independence  Event A and B are conditionally independent given C in case Pr(AB|C)=Pr(A|C)Pr(B|C)  A set of events {A i } is conditionally independent given C in case

Conditional Independence (cont’d)  Example: There are three events: A, B, C Pr(A) = Pr(B) = Pr(C) = 1/5 Pr(A,C) = Pr(B,C) = 1/25, Pr(A,B) = 1/10 Pr(A,B,C) = 1/125 Whether A, B are independent? Whether A, B are conditionally independent given C?  A and B are independent  A and B are conditionally independent

Outline  Important concepts in probability theory  Bayes’ rule  Random variables and distributions

Bayes’ Rule  Suppose that B 1, B 2, … B k form a partition of S: Suppose that Pr(A) > 0. Then

Bayes’ Rule  Suppose that B 1, B 2, … B k form a partition of S: Suppose that Pr(A) > 0. Then  Example: Pr(W|R)R RR W WW R: The weather rains W: The grass is wet Pr(R|W) = ? Pr(R) = 0.8

Bayes’ Rule: More Complicated  Suppose that B 1, B 2, … B k form a partition of S: Suppose that Pr(B i ) > 0 and Pr(A) > 0. Then

Bayes’ Rule: More Complicated  Suppose that B 1, B 2, … B k form a partition of S: Suppose that Pr(B i ) > 0 and Pr(A) > 0. Then

Bayes’ Rule: More Complicated  Suppose that B 1, B 2, … B k form a partition of S: Suppose that Pr(A) > 0. Then

Bayes’ Rule R RR W WW R: The weather rains W: The grass is wet RW Information Pr(W|R) Inference Pr(R|W)

Bayes’ Rule R RR W WW R: The weather rains W: The grass is wet Hypothesis H Evidence E Information: Pr(E|H) Inference: Pr(H|E) PriorLikelihoodPosterior

A More Complicated Example RThe weather rains WThe grass is wet UPeople bring umbrella Pr(UW|R)=Pr(U|R)Pr(W|R) Pr(UW|  R)=Pr(U|  R)Pr(W|  R) R WU Pr(W|R)R RR W WW Pr(U|R)R RR U UU Pr(U|W) = ? Pr(R) = 0.8

Outline  Important concepts in probability theory  Bayes’ rule  Random variable and probability distribution

Random Variable and Distribution  A random variable X is a numerical outcome of a random experiment  The distribution of a random variable is the collection of possible outcomes along with their probabilities: Discrete case: Continuous case:

Random Variable: Example  Let S be the set of all sequence of three rolls of a die. Let X be the sum of the number of dots on the three rolls.  What are the possible values for X?  Pr(X = 6) = ?, Pr(X = 10) = ?

Expectation  A random variable X~Pr(X=x). Then, its expectation is In an empirical sample, x1, x2,…, xN,  Continuous case:  Expectation of sum of random variables

Expectation: Example  Let S be the set of all sequence of three rolls of a die. Let X be the sum of the number of dots on the three rolls.  What is E(X)?  Let S be the set of all sequence of three rolls of a die. Let X be the product of the number of dots on the three rolls.  What is E(X)?

Variance  The variance of a random variable X is the expectation of (X-E[x]) 2 :

Bernoulli Distribution  The outcome of an experiment can either be success (i.e., 1) and failure (i.e., 0).  Pr(X=1) = p, Pr(X=0) = 1-p, or  E[X] = p, Var(X) = p(1-p)

Binomial Distribution  n draws of a Bernoulli distribution X i ~Bernoulli(p), X=  i=1 n X i, X~Bin(p, n)  Random variable X stands for the number of times that experiments are successful.  E[X] = Np, Var(X) = Np(1-p)

Plot of Binomial Distribution

Poisson Distribution  Coming from Binomial distribution Fix the expectation =np Let the number of trials n  A Binomial distribution will become a Poisson distribution  E[X] =, Var(X) =

Plot of Poisson Distribution

Normal (Gaussian) Distribution  X~N( ,  )  E[X]= , Var(X)=  2  If X 1 ~N(  1,  1 ) and X 2 ~N(  2,  2 ), X= X 1 + X 2 ?

Example  In China, people usually favor boys.  So, many people will not stop bearing children unless they get a boy.  Will the male population be larger than the female’s?

Statistical Inference  Problem:  Likelihood function  Approach: Maximum likelihood estimation (MLE), or maximize log-likelihood

Example I: Flip Coins

Example I: Flip Coins (cont’d)

Example II: Normal Distribution