Presentation is loading. Please wait.

Presentation is loading. Please wait.

SADC Course in Statistics Probability Distributions (Session 04)

Similar presentations


Presentation on theme: "SADC Course in Statistics Probability Distributions (Session 04)"— Presentation transcript:

1 SADC Course in Statistics Probability Distributions (Session 04)

2 To put your footer here go to View > Header and Footer 2 Learning Objectives At the end of this session you will be able to: solve basic problems concerning real- valued probability distributions. distinguish between discrete and continuous random variables (r.v.s). explain what is meant by a probability distribution. calculate the population mean and variance of a given distribution.

3 To put your footer here go to View > Header and Footer 3 Session Contents In this session you will be introduced to the theory of probability distributions. be shown how to build a firm foundation of the theory of probability distributions in preparation for applications in statistical inference (Module H2). strengthen the mathematical skills that are required to deal correctly with probability ideas.

4 To put your footer here go to View > Header and Footer 4 Random variables In the previous two sessions we dealt with probabilities of events. In practice events of interest are those generated by random variables. A random variable is a variable that associates outcomes in the sample space with numerical values.

5 To put your footer here go to View > Header and Footer 5 An example – birth of a baby Girl Boy 0 1 Line showing numerical scale X Sample space The figure above depicts a random variable X defined as X = 0if outcome is a boy X = 1if outcome is a girl

6 To put your footer here go to View > Header and Footer 6 Often the outcomes are actual measurements. Thus, we could have: a random variable Y which records measurements of weights (of say maize cobs) into numbers with kilograms as units. Outcomes of any experiment can be recorded as real numbers by defining an appropriate random variable. We do this because it is easier to work with numbers. A second example:

7 To put your footer here go to View > Header and Footer 7 Types of random variables A random variable is said to be discrete if the set of possible values is countable. Examples of discrete random variables are those that records events on gender, family size, number of traffic offenses,... A random variable is said to be continuous if the set of possible values is not countable. Examples of continuous random variables are those that record events such as weight, height, time, etc...

8 To put your footer here go to View > Header and Footer 8 Continuous random variables can be mapped into discrete random variables by grouping. For example, age X is a continuous random variable since it is a measure of time since birth. We can define a discrete random variable Y as Y = 1 if 0X<5. = 2 if 5X<10 = 3 if 10X<15 = etc. You cannot convert a discrete random variable into a continuous one. Continuous to discrete?

9 To put your footer here go to View > Header and Footer 9 Probability distributions A probability distribution is a table, a function or a graph that presents possible outcomes of a trial, say E (e.g. throw of a die), together with their corresponding probabilities. Note that the outcome probabilities must sum to 1 since occurrence of E results in exactly one outcome.

10 To put your footer here go to View > Header and Footer 10 An example The following is an example of a probability distribution for the gender of a new born child: Outcome Values (x) of a random variable X P(x) Male00.5 Female10.5 Total1

11 To put your footer here go to View > Header and Footer 11 A probability distribution can sometimes be specified using a function f called a probability (mass/density) function. The function f must satisfy the following conditions: Probability mass/density function

12 To put your footer here go to View > Header and Footer 12 The function P(x) of the slide 10 is a probability mass function since it satisfies the two conditions above. Point 1 of slide 11 satisfies the first law of probability, as it must since P(x) represents a probability. Point 2 of slide 11 indicates that the sum is used if the set of values x is countable; otherwise the integral applies. Points to note:

13 To put your footer here go to View > Header and Footer 13 Expected values The weighted centre of a probability distribution is called the expected value written E(X). More formally the expected value of a random variable X is defined as: in the discrete case. in the continuous case. E(X) is also called the population mean and is usually denoted by.

14 To put your footer here go to View > Header and Footer 14 Example (i) If f(x) is given by then E(X) = 0(0.5) + 1(0.5) = 0.5 xf(x) = Prob(x) 00.5 1 Total1

15 To put your footer here go to View > Header and Footer 15 Example (ii) Let f(x) = 2x, for 0 x 1

16 To put your footer here go to View > Header and Footer 16 Moments The k-th moment of a random variable X is defined as: in the discrete case. in the continuous case. The moments of a distribution characterize the shape of a distribution. The notation k is often used to denote the k-th moment.

17 To put your footer here go to View > Header and Footer 17 Class exercise Suppose a coin is tossed twice. (a)Write down the possible values for the random variable X defined as: X = number of heads that occur (b)Prepare a table showing the probability distribution function of X (c)Use this table to determine the expected value of X

18 To put your footer here go to View > Header and Footer 18 Measures of spread The variance of a random variable X is defined as Notice that E(X 2 ) is the second moment of X. The variance of X is also called the population variance and is denoted by 2. The square root of the variance is called the standard deviation of X. It is denoted by.

19 To put your footer here go to View > Header and Footer 19 Patterns for differing variances Note that the bigger the variance, the larger is the spread.

20 To put your footer here go to View > Header and Footer 20 Skewness and kurtosis If the probability distribution is not symmetrical about the mean it is said to be skew. The distribution has a positive skewness if the tail of high values is longer than the tail of low values, and negative skewness if the reverse is true. Kurtosis is a measure of the peakness of a probability distribution. It is usually used as a comparison with the normal distribution (see later sessions) since a kurtosis of more than 3 indicates that the distribution has a higher peak than the normal distribution.

21 To put your footer here go to View > Header and Footer 21 Cumulative probability distribution In many applications we want to calculate probabilities of the type P(Xk) or P(X>k) instead of P(X=k). The probabilities P(Xk) for k = 0, 1, 2,.. provide an example of what is called the cumulative distribution of a random variable X. Here, the random variable X is discrete.

22 To put your footer here go to View > Header and Footer 22 P(X>k) = 1 – P(X k). This is a direct result of the probability result that P(A c ) = 1 – P(A). Similar results can be obtained for continuous random variables. That is, if a < b then the event {X a} is a sub-event of the event {X b}. Hence P(X a) < P(X b). Some results

23 To put your footer here go to View > Header and Footer 23 The cumulative distribution at x, denoted F( x ), is formally defined as: in the discrete case for a positive random variable. in the continuous case By definition, cumulative distribution is an increasing function having certain properties. These are shown below. Definition of F(x)

24 To put your footer here go to View > Header and Footer 24 F (– ) = 0. F (+ ) = 1. This says that the total area under the probability density function is 1. F(a) < F(b) for a { "@context": "http://schema.org", "@type": "ImageObject", "contentUrl": "http://images.slideplayer.com/3/794451/slides/slide_24.jpg", "name": "To put your footer here go to View > Header and Footer 24 F (– ) = 0.", "description": "F (+ ) = 1. This says that the total area under the probability density function is 1. F(a) < F(b) for a

25 To put your footer here go to View > Header and Footer 25 An example using F(x) - discrete A discrete r.v. X, representing the number of girls in families with 5 children, has the foll: dist n : X = No. of girlsP(X=x)F(x) 00.03125 10.15625 20.31250 3 40.15625 50.03125 What is the probability of 4 children or less? Complete the table with values of F(x)

26 To put your footer here go to View > Header and Footer 26 An example using F(x) - continuous A continuous random variable r.v. X, has probability density function given by What is its cumulative distribution function? Answer:

27 To put your footer here go to View > Header and Footer 27 Practical work follows to ensure learning objectives are achieved…


Download ppt "SADC Course in Statistics Probability Distributions (Session 04)"

Similar presentations


Ads by Google