Lecture 8 Probabilities and distributions Probability is the quotient of the number of desired events k through the total number of events n. If it is.

Slides:



Advertisements
Similar presentations
Random Variables ECE460 Spring, 2012.
Advertisements

Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.
Introduction to Probability The problems of data measurement, quantification and interpretation.
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
STA291 Statistical Methods Lecture 13. Last time … Notions of: o Random variable, its o expected value, o variance, and o standard deviation 2.
COUNTING AND PROBABILITY
Statistical Issues in Research Planning and Evaluation
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 6.1 Chapter Six Probability.
From: Probabilistic Methods for Bioinformatics - With an Introduction to Bayesian Networks By: Rich Neapolitan.
Lecture 1 Bayesian inference and maximum likelihood (Idż na całość)
What is the probability that of 10 newborn babies at least 7 are boys? p(girl) = p(boy) = 0.5 Lecture 10 Important statistical distributions Bernoulli.
Body size distribution of European Collembola Lecture 9 Moments of distributions.
Maximum likelihood (ML) and likelihood ratio (LR) test
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
Point and Confidence Interval Estimation of a Population Proportion, p
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
Maximum likelihood (ML)
Chapter 6 Probability.
Lecture II.  Using the example from Birenens Chapter 1: Assume we are interested in the game Texas lotto (similar to Florida lotto).  In this game,
Hamid R. Rabiee Fall 2009 Stochastic Processes Review of Elementary Probability Lecture I.
Probability, Bayes’ Theorem and the Monty Hall Problem
Engineering Probability and Statistics
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
Body size distribution of European Collembola Lecture 9 Moments of distributions.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
Chapter 1 Basics of Probability.
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Short Resume of Statistical Terms Fall 2013 By Yaohang Li, Ph.D.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
1 Probability and Statistics  What is probability?  What is statistics?
Statistical Decision Theory
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Random Sampling, Point Estimation and Maximum Likelihood.
Permutations & Combinations and Distributions
Chapter 8 Probability Section R Review. 2 Barnett/Ziegler/Byleen Finite Mathematics 12e Review for Chapter 8 Important Terms, Symbols, Concepts  8.1.
Lecture 8 Probabilities and distributions Probability is the quotient of the number of desired events k through the total number of events n. If it is.
Dr. Gary Blau, Sean HanMonday, Aug 13, 2007 Statistical Design of Experiments SECTION I Probability Theory Review.
1 Sampling Distributions Lecture 9. 2 Background  We want to learn about the feature of a population (parameter)  In many situations, it is impossible.
Theory of Probability Statistics for Business and Economics.
Engineering Probability and Statistics Dr. Leonore Findsen Department of Statistics.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Ch Counting Techniques Product Rule If the first element or object of an ordered pair can be used in n 1 ways, and for each of these n1 ways.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Stats Probability Theory Summary. The sample Space, S The sample space, S, for a random phenomena is the set of all possible outcomes.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Confidence Interval & Unbiased Estimator Review and Foreword.
IE 300, Fall 2012 Richard Sowers IESE. 8/30/2012 Goals: Rules of Probability Counting Equally likely Some examples.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Engineering Probability and Statistics Dr. Leonore Findsen Department of Statistics.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Computer Performance Modeling Dirk Grunwald Prelude to Jain, Chapter 12 Laws of Large Numbers and The normal distribution.
Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
 Occupancy Model Extensions. Number of Patches or Sample Units Unknown, Single Season So far have assumed the number of sampling units in the population.
Probability Distributions ( 확률분포 ) Chapter 5. 2 모든 가능한 ( 확률 ) 변수의 값에 대해 확률을 할당하는 체계 X 가 1, 2, …, 6 의 값을 가진다면 이 6 개 변수 값에 확률을 할당하는 함수 Definition.
Copyright © Cengage Learning. All rights reserved. 3 Discrete Random Variables and Probability Distributions.
Biostatistics Class 2 Probability 2/1/2000.
Discrete Random Variables and Probability Distributions
Review of Probability and Estimators Arun Das, Jason Rebello
Discrete Event Simulation - 4
POINT ESTIMATOR OF PARAMETERS
Statistical NLP: Lecture 4
Comparing two Rates Farrokh Alemi Ph.D.
Significant models of claim number Introduction
Applied Statistics and Probability for Engineers
Presentation transcript:

Lecture 8 Probabilities and distributions Probability is the quotient of the number of desired events k through the total number of events n. If it is impossible to count k and n we might apply the stochastic definition of probability. The probability of an event j is approximately the frequency of j during n observations.

What is the probability to win in Duży Lotek? The number of desired events is 1. The number of possible events comes from the number of combinations of 6 numbers out of 49. We need the number of combinations of k events out of a total of N events Bernoulli distribution

What is the probability to win in Duży Lotek? Wrong! Hypergeometric distribution P = N K=n+k n We need the probability that of a sample of K elements out of a sample universe of N exactly n have a desired probability and k not.

In Multi Lotek 20 numbers are taken out of a total of 80. What is the probability that you have exactly 10 numbers correct? N = 80 K = 20 n = 10 k = 10

Assessing the number of infected persons Assessing total population size Capture – recapture methods The frequency of marked animals should equal the frequency wothin the total population Assumption: Closed population Random catches Random dispersal Marked animals do not differ in behaviour N real = 38 We take a sample of animals/plants and mark them We take a second sample and count the number of marked individuals

The two sample case You take two samples and count the number of infected persons in the first sample m 1, in the second sample m 2 and the number of infected persons noted in both samples k. How many persons have a certain infectuous desease?

m species l species k species In ecology we often have the problem to compare the species composition of two habitats. The species overlap is measured by the Soerensen distance metric. We do not know whether S is large or small. To assess the expectation we construct a null model. Both habitats contain species of a common species pool. If the pool size n is known we can estimate how many joint species k contain two random samples of size m and l out of n. n species Common species pool Habitat A Habitat B The expected number of joint species. Mathematical expectation The probability to get exactly k joint species. Probability distribution.

Ground beetle species of two poplar plantations and two adjacent wheet fields near Torun (Ulrich et al. 2004, Annales Zool. Fenn.) Pool size 90 to 110 species. There are much more species in common than expected just by chance. The ecological interpretation is that ground beetles colonize fields and adjacent seminatural habitats in a similar manner. Ground beetles do not colonize according to ecological requirements (niches) but according to spatial neighborhood.

Bayesian inference and maximum likelihood (Idż na całość)

The law of dependent propability Theorem of Bayes Thomas Bayes ( ) Abraham de Moivre ( )

Total probability Idż na całość Assume we choose gate 1 (G1) at the first choice. We are looking for the probability p(G1|M3) that the car is behind gate 1 if we know that the moderator opened gate 3 (M3). A B3 B2 B1 N P(B1) P(B3) P(B2) P(A|B1) P(A|B3) P(A|B2)

Calopteryx spelendens We study the occurrence of the damselfly Calopteryx splendens at small rivers. We know from the literature that C. splendens occurs at about 10% of all rivers. Occurrence depends on water quality. Suppose we have five quality classes that occur in 10% (class I), 15% (class II), 27% (class III), 43% (class IV), and 5% (class V) of all rivers. The probability to find Calopteryx in these five classes is 1% (class I), 7% (class II), 14% (class III), 31% (class IV), and 47% (class V). To which class belongs probably a river if we find Calopteryx? p(class II|A) = 0.051, p(class III|A) = 0.183, p(class IV|A) = 0.647, p(class V|A) = Indicator values

Bayes and forensic False positive fallacy Error of the prosecutor Let’s take a standard DNA test for identifying persons. The test has a precision of more than 99%. What is the probability that we identify the wrong person? The forensic version of Bayes theorem

The error of the advocate In the process against the basketball star E. O. Simpson, one of his advocates (a Harvard professor) argued that Simpson sometimes has beaten his wife. However, only very few man who beat their wives later murder them (about 0.1%).

Maximum likelihoods Suppose you studied 50 patients in a clinical trial and detected at 30 of them the presence of a certain bacterial disease. What is the most probable frequency of this disease in the population? We look for the maximum value of the likelihood function

log likelihood estimator ln(Lp)

Home work and literature Refresh: Probability Permutations, variations, combinations Bernoulli event Pascal triangle, binomial coefficients Dependent probability Independent probability Derivative, integral of power functions Prepare to the next lecture: Arithmetic, geometric, harmonic mean Cauchy inequality Statistical distribution Probability distribution Moments of distributions Error law of Gauß Literature: