Lecture 5a: Bayes’ Rule Class web site: DEA in Bioinformatics: Statistics Module Box 1Box 2Box 3.

Slides:



Advertisements
Similar presentations
Probability in genetics. Gregor Mendel (1822 – 1884), experimented on peas Mendelian inheritance, single (or very few) genes controlling certain expressed.
Advertisements

Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
Psychology 290 Special Topics Study Course: Advanced Meta-analysis April 7, 2014.
Statistics review of basic probability and statistics.
COUNTING AND PROBABILITY
CHAPTER 13: Binomial Distributions
Chapter 7 Probability 7.1 Experiments, Sample Spaces, and Events
5.1 Sampling Distributions for Counts and Proportions (continued)
Sampling Distributions
DATA ANALYSIS Module Code: CA660 Lecture Block 2.
Chapter 4 Probability The description of sample data is only a preliminary part of a statistical analysis. A major goal is to make generalizations or inferences.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Probabilistic Information Retrieval.
Chapter 4: Probability (Cont.) In this handout: Total probability rule Bayes’ rule Random sampling from finite population Rule of combinations.
Today Today: Some more counting examples; Start Chapter 2 Important Sections from Chapter 1: ; Please read Reading: –Assignment #2 is up.
1 Basic Probability Statistics 515 Lecture Importance of Probability Modeling randomness and measuring uncertainty Describing the distributions.
Example 3.2.6: Sampling Fruitflies In a Drosophila population, 30% of the flies are black and 70% are gray. Suppose that two flies are randomly chosen.
Sampling Theory Determining the distribution of Sample statistics.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Bayesian Inference SECTION 11.1, 11.2 Bayes rule (11.2) Bayesian inference.
CHAPTER 10: Introducing Probability
Fall 2013Biostat 5110 (Biostatistics 511) Week 7 Discussion Section Lisa Brown Medical Biometry I.
Chapter 6 Probability.
Probability, Bayes’ Theorem and the Monty Hall Problem
Math 3680 Lecture #11 The Central Limit Theorem for Sample Counts and Sample Proportions.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
6 Probability Chapter6 p Operations on events and probability An event is the basic element to which probability can be applied. Notations Event:
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Sampling Theory Determining the distribution of Sample statistics.
Probability and Sample space
5.5 Distributions for Counts  Binomial Distributions for Sample Counts  Finding Binomial Probabilities  Binomial Mean and Standard Deviation  Binomial.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
1 Sampling Distributions Lecture 9. 2 Background  We want to learn about the feature of a population (parameter)  In many situations, it is impossible.
Nor Fashihah Mohd Noor Institut Matematik Kejuruteraan Universiti Malaysia Perlis ІМ ќ INSTITUT MATEMATIK K E J U R U T E R A A N U N I M A P.
Chapter 1 Probability Spaces 主講人 : 虞台文. Content Sample Spaces and Events Event Operations Probability Spaces Conditional Probabilities Independence of.
“PREDICTIVE MODELING” CoSBBI, July Jennifer Hu.
CPSC 531: Probability Review1 CPSC 531:Probability & Statistics: Review Instructor: Anirban Mahanti Office: ICT Class.
5.3 Random Variables  Random Variable  Discrete Random Variables  Continuous Random Variables  Normal Distributions as Probability Distributions 1.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
1 TABLE OF CONTENTS PROBABILITY THEORY Lecture – 1Basics Lecture – 2 Independence and Bernoulli Trials Lecture – 3Random Variables Lecture – 4 Binomial.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics;
Bayes’ Theorem -- Partitions Given two events, R and S, if P(R  S) =1 P(R  S) =0 then we say that R and S partition the sample space. More than 2 events.
Week 21 Conditional Probability Idea – have performed a chance experiment but don’t know the outcome (ω), but have some partial information (event A) about.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
Machine Learning Chapter 5. Evaluating Hypotheses
How likely is it that…..?. The Law of Large Numbers says that the more times you repeat an experiment the closer the relative frequency of an event will.
QR 32 Section #6 November 03, 2008 TA: Victoria Liublinska
Random Variables Example:
Section Conditional Probability Objectives: 1.Understand the meaning of conditional probability. 2.Learn the general Multiplication Rule:
1 Chapter 4, Part 1 Basic ideas of Probability Relative Frequency, Classical Probability Compound Events, The Addition Rule Disjoint Events.
Week 21 Rules of Probability for all Corollary: The probability of the union of any two events A and B is Proof: … If then, Proof:
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Sampling Theory Determining the distribution of Sample statistics.
AP STATS: WARM UP I think that we might need a bit more work with graphing a SAMPLING DISTRIBUTION. 1.) Roll your dice twice (i.e. the sample size is 2,
2.2 Discrete Random Variables 2.2 Discrete random variables Definition 2.2 –P27 Definition 2.3 –P27.
HL2 Math - Santowski Lesson 93 – Bayes’ Theorem. Bayes’ Theorem  Main theorem: Suppose we know We would like to use this information to find if possible.
Section 7.3. Why we need Bayes?  How to assess the probability that a particular event occurs on the basis of partial evidence.  The probability p(F)
CHAPTER 6 Random Variables
Chapter 7 Review.
Probability Axioms and Formulas
Chapter 4 Section 4.4 Sampling Distribution Models
CHAPTER 6 Random Variables
What is Probability? Quantification of uncertainty.
Probability and Sample space
Example Human males have one X-chromosome and one Y-chromosome,
Sampling Distributions
COUNTING AND PROBABILITY
Exam 2 - Review Chapters
Conditional Probability, Total Probability Theorem and Bayes’ Rule
Conditional Probability, Total Probability Theorem and Bayes’ Rule
Presentation transcript:

Lecture 5a: Bayes’ Rule Class web site: DEA in Bioinformatics: Statistics Module Box 1Box 2Box 3

Outline (A) CLT – Review and Example Conditional probability (Review) Partition (Review) Which Box? Bayes’ Rule

Central Limit Theorem (CLT) The CLT says that if we –repeat the sampling process many times –compute the sample mean (or proportion) each time –make a histogram of all the means (or proportions) then that histogram of sample means (or proportions) should look like the normal distribution Of course, in practice we only get one sample from the population The CLT provides the basis for making confidence intervals and hypothesis tests for means or proportions

Normal approximation to the binomial distribution One important application of the CLT is when the RV X ~ Bin(n, p) when n is large X is a sum of independent Bernoulli(p) RVs Then X is exactly binomial, but approximately normal with  = np and  =  np(1-p) How large should n be? Large enough so that both np and n(1-p) are at least about 10 Possible to modify the approximation if np or n(1-p) is between 5 and 10 (continuity correction)

Example A pair of dice is rolled approximately 180 times an hour at a craps table in Las Vegas a)Write an exact expression for the probability that 25 or more rolls have a sum of 7 during the first hour b)What is the approximate probability that 25 or more rolls have a sum of 7 during the first hour c)What is the approximate probability that between 700 and 750 rolls have a sum of 7 during 24 hours?

Conditional probability Formally, P(A | B) = P(A AND B) / P(B) P(A | B) is not defined if P(B) = 0 Example: Mendel and peas –Given that a seed is yellow, what is the probability it is a heterozygote?

Mendel and peas Heterozygotes y/g, g/y: y/y, y/g, g/y  g/g  pollen parentseed parent y g y g yy yg gy gg ¼ ¼ ¼ ¼ yellow heterozygote P(yellow and heterozygote) = ?? P(yellow) = ?? P(H|Y) = ??

Partition A partition divides the sample space into disjoint subsets –no gaps –no overlaps

Law of total probability For a partition A, B of the sample space and an event R, P(R) = P(R AND A) + P(R AND B) Can have any number of events in the partition

Which box? Suppose there are 3 similar boxes: 1, 2, 3 Box i contains i white balls and 1 black ball I choose a box at random and then pick 1 ball at random from that box (and show you the ball) – it is white Which box do you guess the ball came from, and what is your chance of getting it right?

Which box? (cont) Box 1Box 2Box 3

Bayes’ Rule For a partition B 1, B 2,..., B n of all possible outcomes, P(B i | A) = P(A | B i ) * P(B i ) / P(A) = P(A | B i ) * P(B i ) P(A | B i ) * P(B i ) P(A | B n ) * P(B n )

Bayes’ rule uses both expressions for P(A AND B) Let’s go back to the definition of conditional probability for a moment: P(A | B) = P(A AND B) / P(B) We can also write P(B | A) = P(A AND B) / P(A) This gives us two ways to express P(A AND B): P(A AND B) = P(A | B) P(B) P(A AND B) = P(B | A) P(A)

Example (from last week) Suppose that 5% of men and 0.25% of women are color blind. A person is chosen at random. What is the probability that the person is color blind assuming... –there are an equal number of males and females –there are twice as many males as females in the population

Example (cont) Suppose that 5% of men and 0.25% of women are color blind. A color blind person is chosen at random. What is the probability that the person is male assuming... –there are an equal number of males and females –there are twice as many males as females in the population

Disease screening Suppose a blood test for a particular disease results in either a + or – reading Assume that 95% of people with the disease produce +, and 2% of people without the disease also produce + Also assume that 1% of the population has the disease What is the chance that a person chosen at random from the population has the disease, given that the test produces +?

Huntington’s disease Rare, autosomal dominant disorder which starts to appear in middle age (40s or 50s) Incidence in the general population is about 1/10,000, which means that the probability of carrying the defective gene is about.01% But for individuals with an affected parent, the probability of carrying the defective gene is 50% There is a test for HD, assume that the false positive rate is 2%, and the false negative rate is 1%

Huntington’s disease (cont) Suppose a person with an affected parent takes the test –What is the chance the person is at risk if the test is negative? –What is the chance the person is at risk if the test is positive?

Huntington’s disease (cont) Now suppose that a person without an affected parent is considering taking the test –What is the chance the person is at risk if the test is negative? –What is the chance the person is at risk if the test is positive?

Bayesian inference The Bayesian probability framework uses a different interpretation of probability than the long-run frequentist interpretation In this version of probability, we have P(parameters|data) = c * P(data|pars) * P(pars) In words, the posterior distribution for the parameters is proportional to the prior times the likelihood In the Bayesian framework, inference is based on the posterior distribution

(BREAK)