3 Hypergeometric Distribution With the binomial distribution you sample with replacement and count the number of successes after a set number of trials. But what if you sample without replacement?Now the trials are no longer independent and we can no longer use the binomial to model this situation. We need to look for another type of distribution that will describe these problems.
4 Hypergeometric Distribution - Given a population with N members- We are interested in an outcome that can be classified as a success or a failureLet the probability of success in the population be p- Sample from this population without replacement of size n
5 Hypergeometric Distribution Examples:–The probability of a full house in a poker hand.–The probability of 3 brown M&M’s in a selection of 5 M&M’s from a bag with 20 brown M&M’s and 30 other colors.- The probability of selecting 3 out of 5 defective parts on a moving conveyor belt containing 100 parts
6 Hypergeometric Distribution Notation: X ~ Hyp(N, n, p)PMF: where: x = the # of successesn = the # of trials, or how manywe’re choosingf(x) = the probability of x success in n trialsN = # of elements in populationm = # of elements in populationlabeled a successp = probability of success in ENTIRE population (m/N)
7 Hypergeometric Distribution Expectation and Variance:E(X) =Var(X) =
8 Hypergeometric Distribution PMF:is the number of ways m elements can be selected from a population of size Nis the number of ways that x successes can be selected from a total of r successes in the populationis the number of ways that m-x failures can be selected from a total ofN-n failures in the population
9 Hypergeometric Distribution Notation: X ~ Hyp(N, n, p)PMF: where: x = the # of successesn = the # of trials, or how manywe’re choosingf(x) = the probability of x success in n trialsN = # of elements in populationr = # of elements in populationlabeled a successp = probability of success in ENTIRE population
10 Hypergeometric Distribution Expectation and Variance:E(X) =Var(X) =
11 Hypergeometric Distribution PMF:is the number of ways n elements can be selected from a population of size Nis the number of ways that x successes can be selected from a total of r successes in the populationis the number of ways that n-x failures can be selected from a total ofN-r failures in the population
12 Hypergeometric Example #1 A bag of Skittles has 20 reds and 80 pieces of other colors. Find the probability that you randomly select 4 reds in a handful of 10 Skittles…a) With replacementb) Without replacement
13 Hypergeometric Example #1a With replacement:X ~ Bin(n = 4, p = 0.2)P(X=4) =
14 Hypergeometric Example #1b b) Without replacement:X ~ Hyp(N=100, n=10, p=0.2)P(X=4) =
15 Hypergeometric Example #1c c) What is the expected number of red Skittles in a handful of 20 pieces?With replacement:E(X) = np = 20(0.2) = 4Without replacement:E(X) = n(r/N) = 20(20/100) =4
16 Hypergeometric Example #1c c) What is the variance of number of red Skittles in a handful of 20 pieces?With replacement:Var(X) = np(1-p) = 20(0.2)(0.8) = 3.2Without replacement:Var(X) =
17 Hypergeometric Example #2 In a jar there are 20,000 coins, 500 of which are quarters. You select 5 coins randomly. What is the probability that you get exactly 2 quarters?a)With replacement?b)Without replacement?
18 Hypergeometric Example #3 A marine biologist has been tracking manatees in the Miami region. There are a total of 200 manatees in the region, and 80 of them have been tagged with their information recorded. Each day he will take a random sample of 12 manatees (without replacement) and will continue to record information on those that have been tagged. Let T be the number of manatees that have been tagged in your sample.T ~ Hyp(N=200, n=12, p=80/200=0.4)
19 Hypergeometric Example #3a N=number on POPULATIONn=how many we’re choosingr=number of “desirable” objects or “success” objects in populationp=probability of success in ENTIRE population = r/NWhat is the probability you choose exactly 3 tagged manatees?
20 Hypergeometric Example #3b Given you have less than 4 tagged manatees, what is the probability you have exactly 3 tagged manatees?
21 Hypergeometric Example #3c If he continues this same procedure for 4 days (all days independent of one another), what is the probability that he has exactly 3 tagged manatees in his sample all 4 days?How many tagged manatees you expect to see in a sample of 12?
22 Hypergeometric Example #4 How often do we REALLY know the population size? Collecting records from all the marine biologists in Florida, we have a total of 500 tagged manatees. How do we estimate the population size?If we sample from this LARGE population, say there are a total of 2,000 manatees (and 500 tagged), we select 10 of them.
23 Hypergeometric Example #4a 1. What is the exact distribution for the number of tagged manatees in our sample?2. What is the exact probability we have exactly 4 tagged manatees in our sample?
24 Hypergeometric Example #4b 3. What is a good approximation for the number of tagged manatees in our sample?4. What is the approximate probability we select 4 tagged manatees in the sample?
25 Hypergeometric Example #5 Little Johnny has a jar containing 10 blue marbles and 12 red marbles. He reaches into the jar and selects 5 marbles without replacement. Let X denote the number of red marbles he obtains.a) Identify the distribution and parameters corresponding to the random variable X.
26 Hypergeometric Example #5 b) What is the probability Johnny obtains exactly 3 red marbles?c) If Johnny repeats this experiment a large number of times, on average how many red marbles can he expect to obtain?
27 Hypergeometric Example #6a In a certain mid-west town consisting of 100 residents, 60% are in favor of raising the local sales tax rate while the other 40% are opposed. Suppose a sample of 10 residents is taken without replacement. Let X denote the number of residents who are in favor of raising taxes.a) Identify the distribution and parameters corresponding to the random variable X.
28 Hypergeometric Example #6b b) What is the probability of obtaining at least 9 residents who are in favor of raising taxes?c) What number of residents in favor of raising taxes can we expect to obtain?
29 Hypergeometric Example #7a Axline Computers manufactures personal computers at two plants, one in Texas and the other in Hawaii. The Texas plant has 40 employees; the Hawaii plant has 20. A random sample of 10 employees is to be asked to fill out a benefits questionnaire. Let X be a worker from Hawaii.a) What is the probability that none of the employees in the sample are from the Hawaii plant?
30 Hypergeometric Example #7b b) What is the probability that one of the employees in the sample works at the plant in Hawaii?c) What is the probability that two or more of the employees in the sample work at the plant in Hawaii?d) What is the expected number of employees from the Hawaii plant to be included in the sample?
31 Approximating Hypergeometric We can approximate the hypergeometric distribution by the binomial distribution if: N > 20*nThis is because N is so big there is very little chance of getting the same object; so even though HG is without replacement and Bin is with replacement, which such a large N it is as if the binomial distribution is now without replacement because the chance of grabbing the same object twice is so small.
32 Approximation Example #1 You roll two 20-sided dice 400 times. Let X be the number of double 20’s you see. Find the approximate probability you see 2 double 20’s.
33 Approximation Example #2 A shoe store has 2,000 pairs of shoes, 800 are men’s shoes, and the rest are women’s. You randomly select 4 pairs of shoes without replacement. What is the approximate probability you select exactly 2 pairs of men’s shoes?
34 Approximation Example #3 A Chicago baseball convention has 5,000 attendees consisting of 3,500 Cubs fans and 1,500 White Soxs fans. Ten people are randomly chosen to participate in a contest to win World Series tickets. What is the approximate probability exactly 7 Cubs fans are selected?
35 Approximation Example #4 Suppose an earthquake will occur somewhere in California each day with probability Assuming earthquake occurrences are weakly dependent from day to day find the approximate probability Californians will experience no earthquakes this year. How many earthquakes can Californians expect to experience in 2010?