Bayesian Statistics: Asking the Right Questions Michael L. Raymer, Ph.D.

Slides:



Advertisements
Similar presentations
Probability.
Advertisements

Fundamentals of Probability
Bayes rule, priors and maximum a posteriori
Section 10.2: Tests of Significance
Juror Understanding of Random Match Probabilities Dale A. Nance Case Western Reserve University August, 2007.
Genophiler: A starting point for reviewing DNA testing results Michael L. Raymer, Ph.D. Travis Doom, Ph.D.
Elementary Statistics for Lawyers References Evett and Weir, Interpreting DNA evidence. Balding, Weight-of-evidence for forensic DNA profiles.
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Database Searches Non-random samples of N individuals Typically individuals convicted of some crime Maryland, people arrested but not convicted.
Inferring the Number of Contributors to Mixed DNA Profiles David Paoletti.
Introductory Mathematics & Statistics for Business
How strong is DNA evidence?
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
CS CS1512 Foundations of Computing Science 2 Lecture 23 Probability and statistics (4) © J.
CS1512 Foundations of Computing Science 2 Week 3 (CSD week 32) Probability © J R W Hunter, 2006, K van Deemter 2007.
Probability of Independent and Dependent Events and Review Probability & Statistics 1.0 Students know the definition of the notion of independent events.
Chapter 4: Basic Estimation Techniques
Chapter 7 Hypothesis Testing
Chapter 13: Chi-Square Test
Contingency Tables Prepared by Yu-Fen Li.
Business and Economics 6th Edition
Chris Morgan, MATH G160 January 8, 2012 Lecture 13
Chapter 2.3 Counting Sample Points Combination In many problems we are interested in the number of ways of selecting r objects from n without regard to.
Bayes’s Theorem and the Weighing of Evidence by Juries Philip Dawid University College London.
Comparing Two Population Parameters
Solving Linear Equations
Warm Up Express the indicated degree of likelihood as a probability value: “There is a 40% chance of rain tomorrow.” A bag contains 6 red marbles, 3 blue.
Putting Statistics to Work
Statistical Inferences Based on Two Samples
Chapter 11: The t Test for Two Related Samples
Testing Hypotheses About Proportions
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 14 From Randomness to Probability.
Basics of Statistical Estimation
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Intro to Bayesian Learning Exercise Solutions Ata Kaban The University of Birmingham 2005.
What is Statistical Modeling
Processing physical evidence discovering, recognizing and examining it; collecting, recording and identifying it; packaging, conveying and storing it;
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 6.1 Chapter Six Probability.
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Visual Recognition Tutorial
Assuming normally distributed data! Naïve Bayes Classifier.
Classification and risk prediction
1 Chapter 12 Probabilistic Reasoning and Bayesian Belief Networks.
Introduction to Bayesian Learning Bob Durrant School of Computer Science University of Birmingham (Slides: Dr Ata Kabán)
Probability and Statistics of DNA Fingerprinting.
Bayesian Classification with a brief introduction to pattern recognition Modified from slides by Michael L. Raymer, Ph.D.
Jeff Howbert Introduction to Machine Learning Winter Classification Bayesian Classifiers.
Overview Definition Hypothesis
Chapter 1 Basics of Probability.
Machine Learning Queens College Lecture 3: Probability and Statistics.
Bayes for Beginners Presenters: Shuman ji & Nick Todd.
 Basic Concepts in Probability  Basic Probability Rules  Connecting Probability to Sampling.
One-class Training for Masquerade Detection Ke Wang, Sal Stolfo Columbia University Computer Science IDS Lab.
Thinking About DNA Database Searches William C. Thompson Dept. of Criminology, Law & Society University of California, Irvine.
Learning Theory Reza Shadmehr Linear and quadratic decision boundaries Kernel estimates of density Missing data.
Bayesian vs. frequentist inference frequentist: 1) Deductive hypothesis testing of Popper--ruling out alternative explanations Falsification: can prove.
Chapter 20 Testing Hypothesis about proportions
Optimal Bayes Classification
1 Chapter 12 Probabilistic Reasoning and Bayesian Belief Networks.
Lecture: Forensic Evidence and Probability Characteristics of evidence Class characteristics Individual characteristics  features that place the item.
Review: Probability Random variables, events Axioms of probability Atomic events Joint and marginal probability distributions Conditional probability distributions.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Bayes Theorem. Prior Probabilities On way to party, you ask “Has Karl already had too many beers?” Your prior probabilities are 20% yes, 80% no.
Copyright ©2004 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 4-1 Probability and Counting Rules CHAPTER 4.
BAYESIAN LEARNING. 2 Bayesian Classifiers Bayesian classifiers are statistical classifiers, and are based on Bayes theorem They can calculate the probability.
HL2 Math - Santowski Lesson 93 – Bayes’ Theorem. Bayes’ Theorem  Main theorem: Suppose we know We would like to use this information to find if possible.
Section Testing a Proportion
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Lecture: Forensic Evidence and Probability Characteristics of evidence
Logistic Regression [Many of the slides were originally created by Prof. Dan Jurafsky from Stanford.]
Presentation transcript:

Bayesian Statistics: Asking the Right Questions Michael L. Raymer, Ph.D.

8/29/03M. Raymer – WSU, FBS2 Statistical Games The defendants DNA is consistent with the evidentiary sample, and the defendants DNA type occurs with a frequency of one in 10,000,000,000. Only about 0.1% of wife batterers actually murder their wives. Therefore, evidence of abuse and battering should not be admissible in a murder trial.

8/29/03M. Raymer – WSU, FBS3 The Question Given the evidentiary DNA type and the defendants DNA type, what is the probability that the evidence sample contains the defendants DNA? Information available: How common is each allele in a particular population? CPI, RMP etc.

8/29/03M. Raymer – WSU, FBS4 An Example Problem Suppose the rate of breast cancer is 1% Mammograms detect breast cancer in 80% of cases where it is present 10% of the time, mammograms will indicate breast cancer in a healthy patient If a woman has a positive mammogram result, what is the probability that she has breast cancer?

8/29/03M. Raymer – WSU, FBS5 Results 75% % % -- 2 <10% -- a lot

8/29/03M. Raymer – WSU, FBS6 Determining Probabilities Counting all possible outcomes If you flip a coin 4 times, what is the probability that you will get heads twice? TTTTTHTTHTTTHHTT TTTHTHTHHTTHHHTH TTHTTHHTHTHTHHHT TTHHTHHHHTHHHHHH P(2 heads) = 6/16 = 0.375

8/29/03M. Raymer – WSU, FBS7 Statistical Preliminaries Frequency and Probability We can guess at probabilities by counting frequencies: P(heads) = 0.5 The law of large numbers: the more samples we take the closer we will get to 0.5.

8/29/03M. Raymer – WSU, FBS8 Distributions Counting frequencies gives us distributions Binomial Distribution (Discrete) Gaussian Distribution (Continuous)

8/29/03M. Raymer – WSU, FBS9 Density Estimation Parametric Assume a Gaussian (e.g.) distribution. Estimate the parameters (, ). Non-parametric Histogram sampling Bin size is critical Gaussian smoothing can help

8/29/03M. Raymer – WSU, FBS10 Combining Probabilities Non-overlapping outcomes: Possible Overlap: Independent Events: The Product Rule

8/29/03M. Raymer – WSU, FBS11 Product Rule Example P(Engine > 200 H.P.) = 0.2 P(Color = red) = 0.3 Assuming independence: P(Red & Fast) = 0.2 × 0.3 = /4 * 1/10 * 1/6 * 1/8 * 1/5 1/10,000

8/29/03M. Raymer – WSU, FBS12 Statistical Decision Making One variable: A ring was found at the scene of the crime. The ring is size 11. The defendants ring size is also 11. If a random ring were left at the crime scene, what is the probability that it would have been size 11?

8/29/03M. Raymer – WSU, FBS13 Multiple Variables Assume independence: Note what happens to significant digits! The ring is size 11, and also made of platinum.

8/29/03M. Raymer – WSU, FBS14 Which Question? If a fruit has a diameter of 4, how likely is it to be an apple? Apples 4 Fruit

8/29/03M. Raymer – WSU, FBS15 Inverting the question Given an apple, what is the probability that it will have a diameter of 4? Given a 4 diameter fruit, what is the probability that it is an apple?

8/29/03M. Raymer – WSU, FBS16 Forensic DNA Evidence Given alleles (17, 17), (19, 21), (14, 15.1), what is the probability that a DNA sample belongs to Bob? Find all (17,17), (19,21), (14,15.1) individuals, how many of them are Bob? How common are 17, 19, 21, 14, and 15.1 in the population?

8/29/03M. Raymer – WSU, FBS17 Conditional Probabilities For related events, we can express probability conditionally: Statistical Independence:

8/29/03M. Raymer – WSU, FBS18 Bayesian Decision Making Terminology We have an object, and we want to decide if it belongs to a class Is this fruit a type of apple? Does this DNA come from a Caucasian American? Is this car a sports car? We measure features of the object (evidence): Size, weight, color Alleles at various loci

8/29/03M. Raymer – WSU, FBS19 Bayesian Notation Feature/Evidence Vector: Classes & Posterior Probability:

8/29/03M. Raymer – WSU, FBS20 A Simple Example You are given a fruit with a diameter of 4 – is it a pear or an apple? To begin, we need to know the distributions of diameters for pears and apples.

8/29/03M. Raymer – WSU, FBS21 Maximum Likelihood P(x)P(x) Class- Conditional Distributions

8/29/03M. Raymer – WSU, FBS22 A Key Problem We based this decision on (class conditional) What we really want to use is (posterior probability) What if we found the fruit in a pear orchard? We need to know the prior probability of finding an apple or a pear!

8/29/03M. Raymer – WSU, FBS23 Prior Probabilities Prior probability + Evidence Posterior Probability Without evidence, what is the prior probability that a fruit is an apple? What is the prior probability that a DNA sample comes from the defendant?

8/29/03M. Raymer – WSU, FBS24 The heart of it all Bayes Rule

8/29/03M. Raymer – WSU, FBS25 Bayes Rule or

8/29/03M. Raymer – WSU, FBS26 Example Revisited Is it an ordinary apple or an uncommon pear?

8/29/03M. Raymer – WSU, FBS27 Bayes Rule Example

8/29/03M. Raymer – WSU, FBS28 Bayes Rule Example

8/29/03M. Raymer – WSU, FBS29 Posing the question 1.What are the classes? 2.What is the evidence? 3.What is the prior probability? 4.What is the class-conditional probability?

8/29/03M. Raymer – WSU, FBS30 An Example Problem Suppose the rate of breast cancer is 1% Mammograms detect breast cancer in 80% of cases where it is present 10% of the time, mammograms will indicate breast cancer in a healthy patient If a woman has a positive mammogram result, what is the probability that she has breast cancer?

8/29/03M. Raymer – WSU, FBS31 Practice Problem Revisited Classes: healthy, cancer Evidence: positive mammogram (pos), negative mammogram (neg) If a woman has a positive mammogram result, what is the probability that she has breast cancer?

8/29/03M. Raymer – WSU, FBS32 A Counting Argument Suppose we have 1000 women 10 will have breast cancer 8 of these will have a positive mammogram 990 will not have breast cancer 99 of these will have a positive mammogram Of the 107 women with a positive mammogram, 8 have breast cancer 8/ = 7.5%

8/29/03M. Raymer – WSU, FBS33 Solution

8/29/03M. Raymer – WSU, FBS34 An Example Problem Suppose the chance of a randomly chosen person being guilty is.001 When a person is guilty, a DNA sample will match that individual 99% of the time of the time, a DNA will exhibit a false match for an innocent individual If a DNA test demonstrates a match, what is the probability of guilt?

8/29/03M. Raymer – WSU, FBS35 Solution

8/29/03M. Raymer – WSU, FBS36 Marginal Distributions

8/29/03M. Raymer – WSU, FBS37 Combining Marginals Assuming independent features: If we assume independence and use Bayes rule, we have a Naïve Bayes decision maker (classifier).

8/29/03M. Raymer – WSU, FBS38 Bayes Decision Rule Provably optimum when the features (evidence) follow Gaussian distributions, and are independent.

8/29/03M. Raymer – WSU, FBS39 Forensic DNA Classes: DNA from defendant, DNA not from defendant Evidence: Allele matches at various loci Assumption of independence Prior Probabilities? Assumed equal (0.5) What is the true prior probability that an evidence sample came from a particular individual?

8/29/03M. Raymer – WSU, FBS40 The Importance of Priors

8/29/03M. Raymer – WSU, FBS41 Likelihood Ratios When deciding between two possibilities, we dont need the exact probabilities. We only need to know which one is greater. The denominator for all the classes is always equal. Can be eliminated Useful when there are many possible classes

8/29/03M. Raymer – WSU, FBS42 Likelihood Ratio Example

8/29/03M. Raymer – WSU, FBS43 Likelihood Ratio Example

8/29/03M. Raymer – WSU, FBS44 From alleles to identity: It is relatively easy to find the allele frequencies in the population Marginal probability distributions Independence assumption Class conditional probabilities Equal prior probabilities Bayesian posterior probability estimate

8/29/03M. Raymer – WSU, FBS45 Thank you.

8/29/03M. Raymer – WSU, FBS46 A Key Advantage The oldest citation: T. Bayes. An essay towards solving a problem in the doctrine of chances. Phil. Trans. Roy. Soc., 53, 1763.