Presentation is loading. Please wait.

Presentation is loading. Please wait.

Known Probability Distributions

Similar presentations


Presentation on theme: "Known Probability Distributions"— Presentation transcript:

1 Known Probability Distributions
Engineers frequently work with data that can be modeled as one of several known probability distributions. Being able to model the data allows us to: model real systems design predict results Key discrete probability distributions include: binomial negative binomial hypergeometric Poisson EGR

2 Discrete Uniform Distribution
Simplest of all discrete distributions All possible values of the random variable have the same probability, i.e., f(x; k) = 1/ k, x = x1 , x2 , x3 , … , xk Expectations of the discrete uniform distribution draw the uniform distribution EGR

3 Binomial & Multinomial Distributions
Bernoulli Trials Inspect tires coming off the production line. Classify each as defective or not defective. Define “success” as defective. If historical data shows that 95% of all tires are defect-free, then P(“success”) = 0.05. Signals picked up at a communications site are either incoming speech signals or “noise.” Define “success” as the presence of speech. P(“success”) = P(“speech”) Administer a test drug to a group of patients with a specific condition. P(“success”) = ___________ Bernoulli Process n repeated trials the outcome may be classified as “success” or “failure” the probability of success (p) is constant from trial to trial repeated trials are independent. EGR

4 Binomial Distribution
Example: Historical data indicates that 10% of all bits transmitted through a digital transmission channel are received in error. Let X = the number of bits in error in the next 4 bits transmitted. Assume that the transmission trials are independent. What is the probability that Exactly 2 of the bits are in error? At most 2 of the 4 bits are in error? more than 2 of the 4 bits are in error? The number of successes, X, in n Bernoulli trials is called a binomial random variable. remember, YOU define what a “success” is … e.g., votes, defects, errors EGR

5 Binomial Distribution
The probability distribution is called the binomial distribution. b(x; n, p) = , x = 0, 1, 2, …, n where p = _________________ q = _________________ For our example, b(x; n, p) = _________________ p = probability of success q = probability of failure = 1-p b(x; n, p) = (4 choose x)(0.1)x(0.9)4-x , x = 0,1,2,3,4 EGR

6 For Our Example … What is the probability that exactly 2 of the bits are in error? At most 2 of the 4 bits are in error? b(x; n, p) = (4 choose x)(0.1)x(0.9)4-x , x = 0,1,2,3,4 P(X = 2) = (4 choose 2) (0.1)2(0.9)2 = P(X < 2) = P(0) + P(1) + P(2) = (4 choose 0) (0.1)0(0.9)4 + (4 choose 1) (0.1)1(0.9)3 + (4 choose 2) (0.1)2(0.9)2 = EGR

7 Your turn … What is the probability that more than 2 of the 4 bits are in error? b(x; n, p) = (4 choose x)(0.1)x(0.9)4-x , x = 0,1,2,3,4 P(X > 2) = P(3) + P(4) = (4 choose 3) (0.1)3(0.9)1 + (4 choose 4) (0.1)4(0.9)0 = P(X < 2) = P(0) + P(1) + P(2) = (4 choose 0) (0.1)0(0.9)4 + (4 choose 1) (0.1)1(0.9)3 + (4 choose 2) (0.1)2(0.9)2 = EGR

8 Expectations of the Binomial Distribution
The mean and variance of the binomial distribution are given by μ = np σ2 = npq Suppose, in our example, we check the next 20 bits. What are the expected number of bits in error? What is the standard deviation? μ = ___________ σ2 = __________ , σ = __________ μ = np = 20(0.1) = 2 σ2 = npq = 20(0.1)(0.9)= 1.8, σ = 1.34 EGR

9 Another example A worn machine tool produces 1% defective parts. If we assume that parts produced are independent, what is the mean number of defective parts that would be expected if we inspect 25 parts? What is the expected variance of the 25 parts? μ = np = 25(0.01) = 0.25 σ2 = npq = 25(0.01)(0.99)= ** NOTE: ≠ 0.25 EGR

10 Helpful Hints … Sometimes it helps to draw a picture.
Suppose we inspect the next 5 parts … P(at least 3)  P(2 ≤ X ≤ 4)  P(less than 4)  Appendix Table A.1 (pp ) lists Binomial Probability Sums, ∑rx=0b(x; n, p) EGR

11 Your turn … Use Table A.1 to determine
1. b(x; 15, 0.4) , P(X ≤ 8) = ______________ 2. b(x; 15, 0.4) , P(X < 8) = ______________ 3. b(x; 12, 0.2) , P(2 ≤ X ≤ 5) = ___________ 4. b(x; 4, 0.1) , P(X > 2) = ______________ 0.9050 0.7869 – = 1 – = EGR

12 Multinomial Experiments
What if there are more than 2 possible outcomes? (e.g., acceptable, scrap, rework) That is, suppose we have: n independent trials k outcomes that are mutually exclusive (e.g., ♠, ♣, ♥, ♦) exhaustive (i.e., ∑all kpi = 1) Then f(x1, x2, …, xk; p1, p2, …, pk, n) = EGR

13 Example Look at problem 22, pg. 126
f( __, __, __; ___, ___, ___, __) =_________________ = __________________________________ x1 = _______ p1 = x2 = p2 = n = _____ x3 = p3 = f( 5,2,1; 0.5, 0.25, 0.25, 8) = (8 choose 5,2,1)(0.5)5(0.25)2(0.25)1 = 8!/(5!2!1!)* )(0.5)5(0.25)2(0.25)1 = 21/256 or EGR

14 Hypergeometric Distribution
Example*: Automobiles arrive in a dealership in lots of 10. Five out of each 10 are inspected. For one lot, it is know that 2 out of 10 do not meet prescribed safety standards. What is probability that at least 1 out of the 5 tested from that lot will be found not meeting safety standards? *from Complete Business Statistics, 4th ed (McGraw-Hill) Note the difference between binomial (assumes sampling “with replacement”) and hypergeometric (sampling “without replacement”) Also, binomial assumes independence, while hypergeometric does not. EGR

15 This example follows a hypergeometric distribution:
A random sample of size n is selected without replacement from N items. k of the N items may be classified as “successes” and N-k are “failures.” The probability associated with getting x successes in the sample (given k successes in the lot.) Where, k = number of “successes” = 2 n = number in sample = 5 N = the lot size = 10 x = number found = 1 or 2 EGR

16 Hypergeometric Distribution
In our example, = _____________________________ P(X > 1) = = 0.778 EGR

17 Expectations of the Hypergeometric Distribution
The mean and variance of the hypergeometric distribution are given by What are the expected number of cars that fail inspection in our example? What is the standard deviation? μ = ___________ σ2 = __________ , σ = __________ μ = nk/N = 5*2/10 = 1 σ2 = (5/9)(5*2/10)(1-2/10) = 0.444 σ = 0.667 EGR

18 Your turn … A worn machine tool produced defective parts for a period of time before the problem was discovered. Normal sampling of each lot of 20 parts involves testing 6 parts and rejecting the lot if 2 or more are defective. If a lot from the worn tool contains 3 defective parts: What is the expected number of defective parts in a sample of six from the lot? What is the expected variance? What is the probability that the lot will be rejected? N = 20 n = 6 k = 3 μ = nk/N = 6*3/20 =18/20=0.9 σ2 = (14/19)(6*3/20)(1-3/20) = P(X>2) = 1 – [P(0)+P(1)] = =0.2018 = P(2)+P(3) = [3 choose 2]*[17 choose 4] / [20 choose 6] + [3 choose 3]*[17 choose 3] / [20 choose 6] = = EGR

19 Binomial Approximation
Note, if N >> n, then we can approximate this with the binomial distribution. For example: Automobiles arrive in a dealership in lots of out of each 100 are inspected. 2 /10 (p=0.2) are indeed below safety standards. What is probability that at least 1 out of 5 will be found not meeting safety standards? Recall: P(X ≥ 1) = 1 – P(X < 1) = 1 – P(X = 0) Hypergeometric distribution Binomial distribution h(0;100,5,20) = (20 choose 0)(80 choose 5)/(100 choose 5) = 1-P(0) = = From Table A1, n=5, p=0.2 b(0;5,0.2) = 1-P(0) = = NOTE: If N = 200, then the hypergeometric distribution yields P(X > 1) = 0.676 Comparing to example 5.14, we can see that the binomial approximation gets very close as N gets very large relative to n. (Compare to example 5.14, pg. 129) EGR

20 Negative Binomial Distribution
Example: Historical data indicates that 30% of all bits transmitted through a digital transmission channel are received in error. An engineer is running an experiment to try to classify these errors, and will start by gathering data on the first 10 errors encountered. What is the probability that the 10th error will occur on the 25th trial? EGR

21 This example follows a negative binomial distribution:
Repeated independent trials. Probability of success = p and probability of failure = q = 1-p. Random variable, X, is the number of the trial on which the kth success occurs. The probability associated with the kth success occurring on trial x is given by, Where, k = “success number” = 10 x = trial number on which k occurs = 25 p = probability of success (error) = 0.3 q = 1 – p = 0.7 EGR

22 Negative Binomial Distribution
In our example, = _____________________________ b*(15;10,0.1) = (24 choose 9)(.3)10(.7)15 = 0.037 EGR

23 Geometric Distribution
Example: In our example, what is the probability that the 1st bit received in error will occur on the 5th trial? This is an example of the geometric distribution, which is a special case of the negative binomial in which k = 1. The probability associated with the 1st success occurring on trial x is given by = __________________________________ (0.3)(0.7)4 = 0.072 EGR

24 Your turn … A worn machine tool produces 1% defective parts. If we assume that parts produced are independent: What is the probability that the 2nd defective part will be the 6th one produced? What is the probability that the 1st defective part will be seen before 3 are produced? How many parts can we expect to produce before we see the 1st defective part? (Hint: see Theorem 5.4, pg. 135) b*(6:2,0.01) = (5 choose 1)(.01)2(.99)4 = P(X<3) = P(1)+P(2) = (0.01)*(0.99)1-1 + (0.01)*(0.99)2-1 = μ = 1/p = 1/0.2 = 5 EGR

25 Poisson Process The number of occurrences in a given interval or region with the following properties: “memoryless” P(occurrence) during a very short interval or small region is proportional to the size of the interval and doesn’t depend on number occurring outside the region or interval. P(X>1) in a very short interval is negligible memoryless  number in one interval is independent of the number in a different interval EGR

26 Poisson Process Examples: Number of bits transmitted per minute.
Number of calls to customer service in an hour. Number of bacteria in a given sample. Number of hurricanes per year in a given region. memoryless  number in one interval is independent of the number in a different interval EGR

27 Poisson Process Example
An average of 2.7 service calls per minute are received at a particular maintenance center. The calls correspond to a Poisson process. To determine personnel and equipment needs to maintain a desired level of service, the plant manager needs to be able to determine the probabilities associated with numbers of service calls. What is the probability that fewer than 2 calls will be received in any given minute? EGR

28 Poisson Distribution The probability associated with the number of occurrences in a given period of time is given by, Where, λ = average number of outcomes per unit time or region = 2.7 t = time interval or region = 1 minute EGR

29 Our Example The probability that fewer than 2 calls will be received in any given minute is … P(X < 2) = P(X = 0) + P(X = 1) = __________________________ The mean and variance are both λt, so μ = _____________________ Note: Table A.2, pp , gives Σt p(x;μ) P(x=0) = e /0! + e /1! = μ = 2.7 EGR

30 Poisson Distribution If more than 6 calls are received in a 3-minute period, an extra service technician will be needed to maintain the desired level of service. What is the probability of that happening? μ = λt = _____________________ P(X > 6) = 1 – P(X < 6) = _____________________ μ = 2.7*3 = 8.1 ≈ 8 see page 668 , with μ = 8 and r = 6, P(X < 6) = P = = EGR

31 Poisson Distribution EGR

32 Poisson Distribution The effect of λ on the Poisson distribution
EGR


Download ppt "Known Probability Distributions"

Similar presentations


Ads by Google