Chapter 5 Special Distributions Weiqi Luo ( 骆伟祺 ) School of Data & Computer Science Sun Yat-Sen University :

Slides:



Advertisements
Similar presentations
Discrete Random Variables and Probability Distributions
Advertisements

Chapter 3 Some Special Distributions Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
MOMENT GENERATING FUNCTION AND STATISTICAL DISTRIBUTIONS
Statistics review of basic probability and statistics.
Random Variable A random variable X is a function that assign a real number, X(ζ), to each outcome ζ in the sample space of a random experiment. Domain.
Chapter 5 Discrete Random Variables and Probability Distributions
Flipping an unfair coin three times Consider the unfair coin with P(H) = 1/3 and P(T) = 2/3. If we flip this coin three times, the sample space S is the.
Chapter 4 Discrete Random Variables and Probability Distributions
Chapter 1 Probability Theory (i) : One Random Variable
Probability Densities
Review.
Chapter 6 Continuous Random Variables and Probability Distributions
Probability Distributions
1 Review of Probability Theory [Source: Stanford University]
A random variable that has the following pmf is said to be a binomial random variable with parameters n, p The Binomial random variable.
Continuous Random Variables and Probability Distributions
Today Today: Chapter 8 Assignment: 5-R11, 5-R16, 6-3, 6-5, 8-2, 8-8 Recommended Questions: 6-1, 6-2, 6-4, , 8-3, 8-5, 8-7 Reading: –Sections 8.1,
Discrete Random Variables and Probability Distributions
Chapter 5 Continuous Random Variables and Probability Distributions
Class notes for ISE 201 San Jose State University
Discrete Random Variables and Probability Distributions
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Copyright © Cengage Learning. All rights reserved. 3.5 Hypergeometric and Negative Binomial Distributions.
Chapter 4 Continuous Random Variables and Probability Distributions
Previous Lecture: Sequence Database Searching. Introduction to Biostatistics and Bioinformatics Distributions This Lecture By Judy Zhong Assistant Professor.
Discrete Distributions
Standard Statistical Distributions Most elementary statistical books provide a survey of commonly used statistical distributions. The reason we study these.
Short Resume of Statistical Terms Fall 2013 By Yaohang Li, Ph.D.
Continuous Random Variables and Probability Distributions
MOMENT GENERATING FUNCTION AND STATISTICAL DISTRIBUTIONS
Lesson 8 – R Taken from And modified slightlyhttp://
Chapter 5 Statistical Models in Simulation
Winter 2006EE384x1 Review of Probability Theory Review Session 1 EE384X.
Dept of Bioenvironmental Systems Engineering National Taiwan University Lab for Remote Sensing Hydrology and Spatial Modeling STATISTICS Random Variables.
Random Variables & Probability Distributions Outcomes of experiments are, in part, random E.g. Let X 7 be the gender of the 7 th randomly selected student.
Random Sampling, Point Estimation and Maximum Likelihood.
Chapter 4 DeGroot & Schervish. Variance Although the mean of a distribution is a useful summary, it does not convey very much information about the distribution.
Chapter 5.6 From DeGroot & Schervish. Uniform Distribution.
1 Topic 3 - Discrete distributions Basics of discrete distributions Mean and variance of a discrete distribution Binomial distribution Poisson distribution.
Chapter 2 Random variables 2.1 Random variables Definition. Suppose that S={e} is the sampling space of random trial, if X is a real-valued function.
Expectation for multivariate distributions. Definition Let X 1, X 2, …, X n denote n jointly distributed random variable with joint density function f(x.
STA347 - week 31 Random Variables Example: We roll a fair die 6 times. Suppose we are interested in the number of 5’s in the 6 rolls. Let X = number of.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
Chapter 01 Probability and Stochastic Processes References: Wolff, Stochastic Modeling and the Theory of Queues, Chapter 1 Altiok, Performance Analysis.
Stats Probability Theory Summary. The sample Space, S The sample space, S, for a random phenomena is the set of all possible outcomes.
Lesson 6 - R Discrete Probability Distributions Review.
Expectation. Let X denote a discrete random variable with probability function p(x) (probability density function f(x) if X is continuous) then the expected.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
Topic 3 - Discrete distributions Basics of discrete distributions - pages Mean and variance of a discrete distribution - pages ,
Chapter 4-5 DeGroot & Schervish. Conditional Expectation/Mean Let X and Y be random variables such that the mean of Y exists and is finite. The conditional.
Chapter 2. Conditional Probability Weiqi Luo ( 骆伟祺 ) School of Data & Computer Science Sun Yat-Sen University :
Review of Chapter
Topic 5: Continuous Random Variables and Probability Distributions CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally from the class text,
Chapter 4. Random Variables - 3
Continuous Random Variables and Probability Distributions
Copyright © Cengage Learning. All rights reserved. 3 Discrete Random Variables and Probability Distributions.
Chapter 6 Large Random Samples Weiqi Luo ( 骆伟祺 ) School of Data & Computer Science Sun Yat-Sen University :
PROBABILITY AND STATISTICS WEEK 5 Onur Doğan. The Binomial Probability Distribution There are many experiments that conform either exactly or approximately.
3/7/20161 Now it’s time to look at… Discrete Probability.
THE NORMAL DISTRIBUTION
Copyright © Cengage Learning. All rights reserved. 3 Discrete Random Variables and Probability Distributions.
Handout Ch5(1) 實習.
Discrete Random Variables and Probability Distributions
Chapter 7: Sampling Distributions
PROBABILITY AND STATISTICS
Discrete Random Variables and Probability Distributions
Geometric Poisson Negative Binomial Gamma
Review of Chapter 8 Discrete PDFs Binomial and Geometeric
Presentation transcript:

Chapter 5 Special Distributions Weiqi Luo ( 骆伟祺 ) School of Data & Computer Science Sun Yat-Sen University :

School of Data & Computer Science  5.1 Introduction  5.2 The Bernoulli and Binomial Distributions  5.3 The Hypergeometric Distributions  5.4 The Poisson Distributions  5.5 The Negative Binomial Distributions  5.6 The Normal Distributions  5.7 The Gamma Distributions  5.8 The Beta Distributions  5.9 The Multinomial Distributions  5.10 The Bivariate Normal Distributions 2 Chapter 5: Special Distributions

School of Data & Computer Science  Define and discuss several special families of distributions that are widely used in applications of probability and statistics  Discrete Cases: Bernoulli, binomial, hypergeometric, Poisson, negative binomial, and geometric distributions.  Continuous Cases: normal, lognormal, gamma, exponential, and beta distributions 5.1 Introduction 3

School of Data & Computer Science  Definition Bernoulli Distribution. A random variable X has the Bernoulli distribution with parameter p (0 ≤ p ≤ 1) if X can take only the values 0 and 1 and the probabilities are Pr(X = 1) = p and Pr(X = 0) = 1− p. The p.f. of X can be written as follows: To verify that this p.f. f (x|p) actually does represent the Bernoulli distribution, it is simply necessary to note that f (1|p) = p and f (0|p) = 1− p. 5.2 The Bernoulli and Binomial Distributions 4

School of Data & Computer Science  Definition (Cont) If X has the Bernoulli distribution with parameter p, then X 2 and X are the same random variable. It follows that E(X) = 1. p + 0. (1− p) = p, E(X 2 ) = E(X) = p, Var(X) = E(X 2 ) − [E(X)] 2 = p(1− p). Furthermore, the m.g.f. of X is ψ(t) = E(e tX ) = pe t + (1− p) for −∞< t <∞. 5.2 The Bernoulli and Binomial Distributions 5

School of Data & Computer Science  Definition Binomial Distribution. A random variable X has the binomial distribution with parameters n and p if X has a discrete distribution for which the p.f. is as follows: In this distribution, n must be a positive integer, and p must lie in the interval 0 ≤ p ≤ The Bernoulli and Binomial Distributions 6

School of Data & Computer Science  Theorem If the independent random variables X 1,..., X n form n Bernoulli trials with parameter p, and if X = X X n, then X has the binomial distribution with parameters n and p. Note: The Bernoulli random variables must be mutually independent, and they must all have the same parameter (i.i.d.) 5.2 The Bernoulli and Binomial Distributions 7

School of Data & Computer Science  Theorem If X 1,..., X k are independent random variables, and if X i has the binomial distribution with parameters n i and p (i = 1,..., k), then the sum X X k has the binomial distribution with parameters n = n n k and p. 5.2 The Bernoulli and Binomial Distributions 8

School of Data & Computer Science  Homework Ex. 2, 5, 9, The Bernoulli and Binomial Distributions 9

School of Data & Computer Science  Example Sampling without Replacement. Suppose that a box contains A red balls and B blue balls. Suppose also that n ≥ 0 balls are selected at random from the box without replacement, and let X denote the number of red balls that are obtained.  Theorem Probability Function. The distribution of X in Example has the p.f. 5.3 The Hypergeometric Distributions 10 For max{0, n − B} ≤ x ≤ min{n, A}, and f (x|A, B, n) = 0 otherwise.

School of Data & Computer Science 5.3 The Hypergeometric Distributions 11 # of outcomes having X=x Step 1: Choosing x elements from subset S Step 2: Choosing n-x elements from subset F A B N=A+B SF n # of Possible outcomes:

School of Data & Computer Science  The range of rv X 5.3 The Hypergeometric Distributions 12 A B SF n N= A+B X = the number of S’s in a randomly selected sample of size n Max(0, n-B) Min(n, A) x ≤≤

School of Data & Computer Science  Definition Hypergeometric Distribution. Let A, B, and n be nonnegative integers with n ≤ A + B. If a random variable X has a discrete distribution with p.f. as follows, then it is said that X has the hypergeometric distribution with parameters A, B, and n. 5.3 The Hypergeometric Distributions 13

School of Data & Computer Science  Theorem Mean and Variance. Let X have a hypergeometric distribution with strictly positive parameters A, B, and n. Then 5.3 The Hypergeometric Distributions 14

School of Data & Computer Science  Homework Ex. 2, 3, The Hypergeometric Distributions 15

School of Data & Computer Science  Definition Poisson Distribution. Let λ > 0. A random variable X has the Poisson distribution with mean λ if the p.f. of X is as follows: 5.4 The Poisson Distributions 16

School of Data & Computer Science  Mean: λ (refer to Theorem for the proof)  Var : λ (refer to Theorem for the proof)  Moment Generating Function. for all real t. (refer to Theorem for the proof) 5.4 The Poisson Distributions 17

School of Data & Computer Science  Theorem If the random variables X 1,...,X k are independent and if X i has the Poisson distribution with mean λ i (i = 1,..., k), then the sum X X k has the Poisson distribution with mean λ λ k. Refer to p.290 for the proof. 5.4 The Poisson Distributions 18

School of Data & Computer Science  Example Customer Arrivals. A store owner believes that customers arrive at his store at a rate of 4.5 customers per hour on average. He wants to find the distribution of the actual number X of customers who will arrive during a particular one- hour period later in the day. He models customer arrivals in different time periods as independent of each other. As a first approximation, he divides the one-hour period into 3600 seconds and thinks of the arrival rate as being 4.5/3600 = per second. He then says that during each second either 0 or 1 customers will arrive, and the probability of an arrival during any single second is He then tries to use the binomial distribution with parameters n = 3600 and p = for the distribution of the number of customers who arrive during the one-hour period later in the day. Refer to p. 288 for more details. 5.4 The Poisson Distributions 19

School of Data & Computer Science  Theorem Closeness of Binomial and Poisson Distributions. For each integer n and each 0<p<1,let f (x|n, p) denote the p.f. of the binomial distribution with parameters n and p. Let f (x|λ) denote the p.f. of the Poisson distribution with mean λ. Let {p n } ∞ n=1 be a sequence of numbers between 0 and 1 such that lim n→∞ np n = λ. Then for all x = 0, 1, The Poisson Distributions 20

School of Data & Computer Science  Example Approximating a Probability. Suppose that in a large population the proportion of people who have a certain disease is 0.01.We shall determine the probability that in a random group of 200 people at least four people will have the disease. In this example, we can assume that the exact distribution of the number of people having the disease among the 200 people in the random group is the binomial distribution with parameters n = 200 and p = Therefore, this distribution can be approximated by the Poisson distribution for which the mean is λ = np = 2. As a rule, this approximation can be applied if n ≥ 100, and p ≤ 0.01 and np ≤ The Poisson Distributions 21

School of Data & Computer Science  Homework Ex. 1, 4, 7, The Poisson Distributions 22

School of Data & Computer Science  Theorem Sampling until a Fixed Number of Successes. Suppose that an infinite sequence of Bernoulli trials with probability of success p are available. The number X of failures that occur before the r-th success has the following p.d.f.: 5.5 The Negative Binomial Distributions 23

School of Data & Computer Science  Definition Negative Binomial Distribution. A random variable X has the negative binomial distribution with parameters r and p (r = 1, 2,... and 0 < p <1) if X has a discrete distribution for which the p.f. f (x|r, p) is as specified by 5.5 The Negative Binomial Distributions 24

School of Data & Computer Science  Definition Geometric Distribution. A random variable X has the geometric distribution with parameter p (0<p<1) if X has a discrete distribution for which the p.f. f (x|1, p) is as follows: 5.5 The Negative Binomial Distributions 25

School of Data & Computer Science  Example Triples in the Lottery. A common daily lottery game involves the drawing of three digits from 0 to 9 independently with replacement and independently from day to day. Lottery watchers often get excited when all three digits are the same, an event called triples. If p is the probability of obtaining triples, and if X is the number of days without triples before the first triple is observed, then X has the geometric distribution with parameter p. In this case, it is easy to see that p = 0.01, since there are 10 different triples among the 1000 equally likely daily numbers. 5.5 The Negative Binomial Distributions 26

School of Data & Computer Science  Theorem If X 1,..., X r are i.i.d. random variables and if each Xi has the geometric distribution with parameter p, then the sum X X r has the negative binomial distribution with parameters r and p. 5.5 The Negative Binomial Distributions 27

School of Data & Computer Science  Theorem Mean and Variance. If X has the negative binomial distribution with parameters r and p, the mean and the variance of X must be The mean and variance of the geometric distribution with parameter p are the special case with r = The Negative Binomial Distributions 28

School of Data & Computer Science  Homework Ex The Negative Binomial Distributions 29

School of Data & Computer Science  Definition A random variable X has the normal distribution with mean μ and variance σ 2 (−∞ 0) if X has a continuous distribution with the following p.d.f.: 5.6 The Normal Distributions 30

School of Data & Computer Science  Theorem The function defined in Eq. (5.6.1) is a p.d.f. Proof : the function is nonnegative. We must also show that If we let y = (x − μ)/σ, then We shall now let 5.6 The Normal Distributions 31

School of Data & Computer Science  Theorem (cont) We shall now change the variables in this integral from y and z to the polar coordinates r and θ by letting y = rcos θ and z = r sin θ. Then, since y 2 + z 2 = r 2, 5.6 The Normal Distributions 32

School of Data & Computer Science  Theorem Moment Generating Function. The m.g.f. of the Gaussian distribution is Refer to the proof in p The Normal Distributions 33

School of Data & Computer Science  Theorem Mean and Variance. The mean and variance of the distribution with p.d.f. given by are μ and σ 2, respectively. 5.6 The Normal Distributions 34

School of Data & Computer Science  The Shapes of Normal Distributions 5.6 The Normal Distributions Symmetric with respect to the line x = μ. 2. Its maximum value at the point x = μ 3. points of inflection at x = μ + σ and at x = μ − σ.

School of Data & Computer Science  Theorem If X has the normal distribution with mean μ and variance σ 2 and if Y = aX + b, where a and b are given constants and a ≠ 0, then Y has the normal distribution with mean aμ + b and variance a 2 σ 2. See the proof in P The Normal Distributions 36

School of Data & Computer Science  Definition Standard Normal Distribution. The normal distribution with mean 0 and variance 1 is called the standard normal distribution. The p.d.f. of the standard normal distribution is usually denoted by the symbol φ, and the c.d.f. is denoted by the symbol Φ. Thus, Table of Standard normal distribution. 5.6 The Normal Distributions 37

School of Data & Computer Science  Theorem Consequences of Symmetry. For all x and all 0<p <1, Φ(-x) = 1- Φ(x) & Φ -1 (p) = -Φ -1 (1-p) 5.6 The Normal Distributions 38

School of Data & Computer Science  Theorem Converting Normal Distributions to Standard Let X have the normal distribution with mean μ and variance σ 2. Let F be the c.d.f. of X. Then Z = (X − μ)/σ has the standard normal distribution, and, for all x and all 0<p <1, 5.6 The Normal Distributions 39

School of Data & Computer Science  Example Determining Probabilities for a Normal Distribution. Suppose that X has the normal distribution with mean 5 and standard deviation 2. We shall determine the value of Pr(1<X <8). If we let Z = (X − 5)/2, then Z will have the standard normal distribution and Furthermore, Refer to the Table in the P The Normal Distributions 40

School of Data & Computer Science  Comparisons of Normal Distributions 5.6 The Normal Distributions 41 Why?

School of Data & Computer Science 5.6 The Normal Distributions 42  Theorem If the random variables X 1,..., X k are independent and if X i has the normal distribution with mean μi and variance σ 2 i (i = 1,..., k), then the sum X X k has the normal distribution with mean μ μ k and variance σ σ σ 2 k

School of Data & Computer Science  Corollary If the random variables X 1,..., X k are independent, if X i has the normal distribution with mean μ i and variance σ 2 i (i = 1,..., k), and if a 1,..., a k and b are constants for which at least one of the values a 1,..., a k is different from 0, then the variable a 1 X a k X k + b has the normal distribution with mean a 1 μ a k μ k + b and variance 5.6 The Normal Distributions 43

School of Data & Computer Science  Example Heights of Men and Women. Suppose that the heights, in inches, of the women in a certain population follow the normal distribution with mean 65 and standard deviation 1, and that the heights of the men follow the normal distribution with mean 68 and standard deviation 3. Suppose also that one woman is selected at random and, independently, one man is selected at random. We shall determine the probability that the woman will be taller than the man. Let W denote the height of the selected woman, and let M denote the height of the selected man. Then the difference W −M has the normal distribution with mean 65 − 68=−3 and variance = 10. Therefore, if we let 5.6 The Normal Distributions 44

School of Data & Computer Science  Example (cont) Then Z has the standard normal distribution. It follows that Thus, the probability that the woman will be taller than the man is The Normal Distributions 45

School of Data & Computer Science  Definition Sample Mean. Let X 1,..., X n be random variables. The average of these n random variables, is called their sample mean and is commonly denoted 5.6 The Normal Distributions 46

School of Data & Computer Science  Corollary Suppose that the random variables X 1,...,X n form a random sample from the normal distribution with mean μ and variance σ 2, and let X n denote their sample mean. Then X n has the normal distribution with mean μ and variance σ 2 /n. 5.6 The Normal Distributions 47

School of Data & Computer Science  Example Determining a Sample Size. Suppose that a random sample of size n is to be taken from the normal distribution with mean μ and variance 9. We shall determine the minimum value of n for which It is known from Corollary that the sample mean X n will have the normal distribution for which the mean is μ and the standard deviation is 3/n 1/2. Therefore, if we let For each positive number x, it will be true that Pr(|Z| ≤ x) ≥ 0.95 if and only if 1− Ф(x) = Pr(Z > x) ≤ Thus, n>34.6, n should be at least The Normal Distributions 48

School of Data & Computer Science  Definition Lognormal Distribution. If log(X) has the normal distribution with mean μ and variance σ 2, we say that X has the lognormal distribution with parameters μ and σ The Normal Distributions 49

School of Data & Computer Science  Homework Ex. 3, 5, 10, 11, The Normal Distributions 50

School of Data & Computer Science  Definition The Gamma Function. For each positive number α, let the value Г(α) be defined by the following integral: The function Г(α) for α >0 is called the gamma function. 5.7 The Gamma Distributions 51

School of Data & Computer Science  Theorem If α >1, then  Theorem For every positive integer n, 5.7 The Gamma Distributions 52

School of Data & Computer Science  Definition Gamma Distributions. Let α and β be positive numbers. A random variable X has the gamma distribution with parameters α and β if X has a continuous distribution for which the p.d.f. is 5.7 The Gamma Distributions 53

School of Data & Computer Science 5.7 The Gamma Distributions 54

School of Data & Computer Science  Definition Exponential Distributions. Let β >0. A random variable X has the exponential distribution with parameter β if X has a continuous distribution with the p.d.f. 5.7 The Gamma Distributions 55

School of Data & Computer Science  Theorem The exponential distribution with parameter β is the same as the gamma distribution with parameters α = 1 and β. If X has the exponential distribution with parameter β, then 5.7 The Gamma Distributions 56

School of Data & Computer Science  Example Mean and Variance of Lifetime of a Light Bulb. Suppopse that we model the lifetime of a light bulb as a continuous random variable with the following p.d.f.: If we wish to compute the mean and variance of such a lifetime, we need to compute the following integrals: 5.7 The Gamma Distributions 57

School of Data & Computer Science  Definition The Beta Function. For each positive α and β, define The function B is called the beta function. 5.8 The Beta Distributions 58

School of Data & Computer Science  Theorem For all α, β > 0, The relationship between beta and gamma function. 5.8 The Beta Distributions 59

School of Data & Computer Science  Definition Beta Distributions. Let α, β > 0 and let X be a random variable with p.d.f. Then X has the beta distribution with parameters α and β. 5.8 The Beta Distributions 60

School of Data & Computer Science  Mean and Variance 5.8 The Beta Distributions 61

School of Data & Computer Science  Definition Multinomial Distributions. A discrete random vector X = (X 1,..., X k ) whose p.f. is as follows has the multinomial distribution with parameters n and p =(p 1,..., p k ). 5.9 The Multinomial Distributions 62

School of Data & Computer Science  Example Attendance at a Baseball Game. Suppose that 23 percent of the people attending a certain baseball game live within 10 miles of the stadium, 59 percent live between 10 and 50 miles from the stadium, and 18 percent live more than 50 miles from the stadium. Suppose also that 20 people are selected at random from the crowd attending the game. We shall determine the probability that seven of the people selected live within 10 miles of the stadium, eight of them live between 10 and 50 miles from the stadium, and five of them live more than 50 miles from the stadium. 5.9 The Multinomial Distributions 63

School of Data & Computer Science  Theorem Suppose that the random vector X = (X 1, X 2 ) has the multinomial distribution with parameters n and p = (p 1, p 2 ).Then X 1 has the binomial distribution with parameters n and p 1, and X 2 = n − X The Multinomial Distributions 64

School of Data & Computer Science  Theorem Means, Variances, and Covariances. Let the random vector X have the multinomial distribution with parameters n and p. The means and variances of the coordinates of X are 5.9 The Multinomial Distributions 65 Note: Negative Covariance Is Natural for Multinomial Distributions