Presentation on theme: "Discrete random variables Learning outcomes This work will help you to learn about probability distributions for discrete random variables how to calculate."— Presentation transcript:
Discrete random variables Learning outcomes This work will help you to learn about probability distributions for discrete random variables how to calculate and use E(X), the expectation (mean) how to calculate and use E[g(X)], the expectation of a simple function of X how to calculate and use Var(X), the variance of X about the cumulative distribution function F(x) about the binomial and Poisson distributions
When a variable is discrete, it is possible to specify or describe all its possible numerical variables, for example the number of females in a group of four students: the possible values are 0, 1, 2, 3, 4, the amount gained, in pence, in a game where the entry fee is 10 p and the prizes are 50 p and £1: the possible values are 10, 40, 90, the number of times you throw a die until a six appears: the possible values are 1, 2, 3, 4, 5, … to infinity. Probability Distributions Consider this situation: By mistake, three faulty fuses are put into a box containing two good fuses. The faulty and good fuses become mixed up and are indistinguishable by sight. You take two fuses from the box. What is the probability that you take a.no faulty fuses, b.one faulty fuse, c.two faulty fuses.
F F F F’ 2 faulty 1 faulty 0 faulty a.P(no faulty fuses) = 0.1 b.P(one faulty fuse) = 0.6 c.P(two faulty fuses) = 0.3 Probability Outcome
The variable being considered here is ‘the number of faulty fuses’ and it can be denoted by X. then the answers to the previous questions can be written as or placed in a table x012 P(X = x) then X is a discrete random variable. For a discrete random variable, the sum of the probabilities is 1, The function that is responsible for allocating probabilities, P(X = x) is known as the probability density function of X. (p.d.f of X).
Two tetrahedral dice, each with faces labelled 1, 2, 3 and 4 are thrown and score noted, where the score is the sum of the two numbers on which the dice land. Find the probability density function (p.d.f.) of X, where X is the random variable ‘the score when the two dice are thrown’. The p.d.f. of a discrete random variable Y is given by P(Y = y) = cy 2, for y = 0, 1, 2, 3, 4. Given that c is a constant, find the value of c. The discrete random variable W has probability distribution as shown w P(W = w) d Find a. the value of d e. the mode
Expectation of X, E(X) E(X) is read as ‘E of X’ and it gives an average or typical value of X, known as the expected value or expectation of X. This is comparable with the mean in descriptive statistics. Experimental approach The frequency distribution shows the results when an unbiased die is thrown 120 times. Score, x Frequency, f Total 120 The mean score
You could write this out in a different way These are the relative frequencies of the scores of 1, 2, 3, 4, 5, 6 respectively Notice that they are close to
Theoretical approach When an unbiased die is thrown the probability of obtaining a particular value is. Score, x P(X = x) The expected mean or expectation of X, is obtained by multiplying each score by its probability, then summing. It is written E(X) so
A random variable X has probability distribution as shown. Find the expectation, E(X) x-2 12 P(X = x) Find the expected number of sixes when three fair dice are thrown. x12345 P(X = x) Find the expectation, E(X)
A fruit machine consists of three windows which operate independently. Each window shows pictures of fruits: lemons, apples, cherries or bananas. The probability that a window shows a particular fruit is as follows. P(lemon) = 0.4 P(cherries) = 0.2 P(apple) = 0.1 P(banana) = 0.3 The rules for playing the game on the fruit machine are: You win £1 You win 50p You win 40p You win 80p In any order Find the expected gain or loss if you play the game.
Expectation of any function of X, E[g(X )] The definition of expectation can be extended to any function of X, In general, if g(X) is any function of the discrete random variable X, then For example
x123 P(X = x) Calculate a. E(X), b. E(3), c. E(5X) d.E(5X+3) In general for constants a and b,
A six-sided die has faces marked with numbers 1, 3, 5, 7, 9 and 11. It is biased so that the probability of obtaining the number R in a single roll of the die is proportional to R. a. Show that the probability distribution of R is given by b.The die is to be rolled and a rectangle drawn with sides of length 6 cm and R cm. Calculate the expected value of the area of the rectangle. c.The die is to be rolled again and a square with sides of length 24R -1 cm. Calculate the expected value of the perimeter of the square. r P(R = r)k3k3k5k5k7k7k9k9k11k r P(R = r)
X is the number of heads obtained when two coins are tossed find a.The expected number of heads, b. E(X 2 ), c. E(X 2 – X). x012 P(X = x)
Variance of X, Var(X ) Remember that variance = (standard deviation) 2 Experimental approach For a frequency distribution with mean the variance s 2 is given by Theoretical approach
The random variable X has probability distribution as shown in the table: x12345 P(X = x) Find
First box Second box Two boxes each contain three cards,. The first box contains cards labelled 1, 3, and 5; the second box contains cards labelled 2, 6 and 8. In a game, a player draws one card at random from each box and his score, X, is the sum of the numbers on the two cards. a.Obtain the six possible values of X and find the corresponding probabilities. b.Calculate E(X), E(X 2 ) and the variance of X x P(X = x)
The following results relating to variance are useful. If a and b are any constants, For example
The cumulative distribution function, F(x) In a frequency distribution, the cumulative frequencies are obtained by summing all the frequencies up to a particular value. In the same way, in a particular distribution, the probabilities u to certain values are summed to give the cumulative probability. The cumulative probability function is written F(x). Consider the following probability distribution. x
x12345 F(x)F(x) The cumulative distribution function is In general, for the discrete random variable X, The cumulative distribution function F(x) where
The discrete random variable X has cumulative distribution function Write out the probability distribution and suggest what X represents. x F(x)F(x) x P(X = x)
For a discrete random variable X the cumulative distribution function F(x) is shown x12345 F(x)F(x)
The binomial distribution In a particular population, 10% of people have blood type B. If three people are selected at random from the population, what is the probability that exactly two of them have blood type B? B B B B’B’ B’B’ B’B’ B B’B’ B B’B’ B B’B’ B B’B’
Now consider the situation when eight people are selected. What is the probability that exactly two of the eight people will have blood type B? Can you find the probability that exactly two have blood type B in a randomly selected group of 12 people.?
Conditions for binomial model For a situation to be described using a binomial model a finite number, n, trials are carried out, the trials are independent the outcome of each trial is deemed either a success or a failure, the probability, p of successful outcome is the same for each trial. The discrete random variable, X is the number of successful outcomes in n trials. Then X is said to follow a binomial distribution NOTE: The number of trials n and the probability of success p, are both needed in order to describe the distribution completely.
At Sellitall Supermarket, 60% of the customers pay by credit card. Find the probability that in a randomly selected sample of ten customers. a.Exactly two pay by credit card b.More than seven pay by credit card. Five independent trials of an experiment are carried out. The probability of a successful outcome p and the probability of failure is 1 – p = q Write out the probability distribution of X, where X is the number of successful outcomes in five trials. Comment on your answer. The random variable X is distributed B(7, 0.2). Find correct to three decimal places a.P(X = 3), b.P(1 < X ≤ 4) c.P(X > 1)
A box contains a large number of pens. The probability that a pen is faulty is 0.1. How many pens would you need to select to be more than 95% certain of picking at least one faulty one? Expectation and variance of the binomial distribution It can be shown that The random variable X is B(4, 0.8). Construct the probability distribution for X and find the expectation and variance. x01234 P(X = x)
The probability that it will be a fine day is 0.4. Find the expected number of fine days in a week and also the standard deviation. X is B(n, p) with mean 5 and standard deviation 2. Find the values of n and p. The mode of the binomial distribution The mode is the value of X that is most likely to occur. Consider the following probability sketches.
when p = 0.5 and n is odd, there are two modes, otherwise the distribution has one mode The mode can be found by calculating all the probabilities and find the value of X with the highest probability. This without a GDC can be very tedious; it is only usually necessary to consider the probabilities of values of X close to the mean np. For the probabilities too small to illustrate For the probabilities too small to illustrate
The probability that a student is awarded a distinction in the Mathematics examination is In a randomly selected group of 50 students, what is the most likely number of students awarded a distinction?
The Poisson distribution Consider these random variables the number of emergency calls received by an ambulance control in an hour, the number of vehicles approaching a expressway toll bridge in a five minute interval, the number of flaws in a metre length of material the number of white corpuscles on a slide. Assuming these occur randomly, they are all examples of variables that can be modelled using a Poisson distribution. Conditions for Poisson model Events occur singly at random in a given interval of time or space. λ, the mean number of occurrences in the given interval, is known and is finite. The variable X is the number of occurrences in the given interval. If the above conditions are satisfied, X is said to follow a Poisson distribution written
A student finds that the average number of amoebas in 10 ml of pond water from a particular pond is four. Assuming that the number of amoebas follows a Poisson distribution, find the probability that in 10 ml sample a.there are exactly five amoebas, b.there are no amoebas, c.there are fewer than three amoebas. These two results are useful in general Unit interval Care must be taken to specify the interval being considered. In the previous example the mean number of amoebas in 10 ml of pond water from a particular pond is 4 so the number in 10 ml is distributed Po(4). Now suppose you want to find a probability relating to the number of amoebas in 5 ml of water from the same pond. The mean number of amoebas in 5 ml is two, so the number 5 ml is distributed Po(2) Similarly, the number of amoebas in 1 ml of pond water is distributed Po(0.4)
On average the school photocopier breaks down eight times during the school week. (Monday to Friday). Assuming that the number of breakdowns can be modelled by a Poisson distribution, find the probability that is breaks down a.five times in a given week, b.once on Monday, c.eight times in a fortnight. Mean and variance of the Poisson distribution X follows a Poisson distribution with standard deviation 1.5. Find P(X ≥ 3)
Notice for small values of λ, the distribution is very skew, but it becomes more symmetrical as λ increases
The mode of the Poisson distribution The mode is the value of X that is the most likely to occur, i.e. with the greatest probability. From the diagrams, we can see that when λ = 1, there are two modes, 0 and 1, when λ = 2, there are two modes, 1 and 2, when λ = 1, there are two modes, 2 and 3, In general, if λ is an integer, there are two modes, λ – 1 and λ. For example, if X ~ Po(8), the modes are 7 and 8. Notice also that when λ = 1.6, the mode is 1, when λ = 2.2, the mode is 2, when λ = 3.8, the mode is 3, In general, if λ is not an integer, mode is the integer below λ. For example, if X ~ Po(4.9), the mode is 4.