Presentation is loading. Please wait.

Presentation is loading. Please wait.

Office 310 320.... Probability-Related Concepts How to Assign Probabilities to Experimental Outcomes Probability Rules Discrete Random Variables Continuous.

Similar presentations


Presentation on theme: "Office 310 320.... Probability-Related Concepts How to Assign Probabilities to Experimental Outcomes Probability Rules Discrete Random Variables Continuous."— Presentation transcript:

1 Office 310 320...

2 Probability-Related Concepts How to Assign Probabilities to Experimental Outcomes Probability Rules Discrete Random Variables Continuous Random Variables Probability Distribution & Functions Topics Covered

3 Concepts An event – Any phenomenon you can observe that can have more than one outcome (e.g., flipping a coin) An outcome – Any unique condition that can be the result of an event (e.g., flipping a coin: heads or tails), a.k.a simple event or sample points Sample space – The set of all possible outcomes associated with an event Probability is a measure of the likelihood of each possible outcome

4 Probability Distributions The usual application of probability distributions is to find a theoretical distribution Reflects a process that explains what we see in some observed sample of a geographic phenomenon Compare the form of the sampled information and theoretical distribution through a test of significance Geography: discrete random events in space and time (e.g. how often will a tornado occur?)

5 Discrete Probability Distributions Discrete probability distributions –The Uniform Distribution –The Binomial Distribution –The Poisson Distribution Each is appropriately applied in certain situations and to particular phenomena

6 The Uniform Distribution Source: http://davidmlane.com/hyperstat/A12237.html

7 The Uniform Distribution Describes the situation where the probability of all outcomes is the same n outcomes  P(x i ) = 1/n e.g. flipping a coin: P(x heads ) = 1/2 = P(x tails ) 0 0.25 0.50 P(xi)P(xi) heads xixi tails A uniform probability mass function

8 a<=x<=b otherwise x < a a<=x<=b x>b Source: http://en.wikipedia.org/wiki/Uniform_distribution_(discrete)

9 The Uniform Distribution A little simplistic and perhaps useless But actually well applied in two situations 1. The probability of each outcome is truly equal (e.g. the coin toss) 2. No prior knowledge of how a variable is distributed (i.e. complete uncertainty), the first distribution we should use is uniform (no assumptions about the distribution)

10 The Uniform Distribution However, truly uniformly distributed geographic phenomena are somewhat rare We often encounter the situation of not knowing how something is distributed until we sample it When we are resisting making assumptions we usually apply the uniform distribution as a sort of null hypothesis of distribution

11 The Uniform Distribution Example – Predict the direction of the prevailing wind with no prior knowledge of the weather system’s tendencies in the area We would have to begin with the idea that P(x North ) = 1/4 P(x East ) = 1/4 P(x South ) = 1/4 P(x West ) = 1/4 Until we had an opportunity to sample and find out some tendency in the wind pattern based on those observations 0 0.25 P(xi)P(xi) N E SW 0.125

12 (Lillesand et al. 2004) (Binford 2005) Remote Sensing Supervised classification

13 The Binomial Distribution Provides information about the probability of the repetition of events when there are only two possible outcomes, –e.g. heads or tails, left or right, success or failure, rain or no rain … –Events with multiple outcomes may be simplified as events with two outcomes (e.g., forest or non-forest) Characterizing the probability of a proportion of the events having a certain outcome over a specified number of events

14 The Binomial Distribution A binomial distribution is produced by a set of Bernoulli trials (Jacques Bernoulli) The law of large numbers for independent trials – at the heart of probability theory Given enough observed events, the observed probability should approach the theoretical values drawn from probability distributions –e.g. enough coin tosses should approach the P = 0.5 value for each outcome (heads or tails)

15 How to test the law? A set of Bernoulli trials is the way to operationally test the law of large numbers using an event with two possible outcomes: (1) N independent trials of an experiment (i.e. an event like a coin toss) are performed (2) Every trial must have the same set of possible outcomes (e.g., heads and tails)

16 Bernoulli Trials (3) The probability of each outcome must be the same for all trials, i.e. P(x i ) must be the same each time for both x i values (4) The resulting random variable is determined by the number of successes in the trials (successes  one of the two outcomes) p = the probability of success in a trial q = (p –1) as the probability of failure in a trial p + q = 1

17 Bernoulli Trials Suppose on a series of successive days, we will record whether or not it rains in Chapel Hill We will denote the 2 outcomes using R when it rains and N when it does not rain nPossible Outcomes# of Rain DaysP(# of Rain Days) 1R 1p p N0 (1 - p) q 2RR2p 2 p 2 RN NR1 2[p*(1 – p)] 2pq NN0 (1 – p) 2 q 2 3RRR 3 p 3 p 3 RRN RNR NRR 2 3[p 2 *(1 – p)] 3p 2 q NNR NRN RNN 1 3[p*(1 – p) 2 ]3pq 2 NNN0 (1 – p) 3 q 3

18 Bernoulli Trials If we have a value for P(R) = p, we can substitute it into the above equations to get the probability of each outcome from a series of successive samples (e.g. p=0.2) nPossible Outcomes# of Rain DaysP(# of Rain Days) 1R 1 p=0.2 N0 q=0.8 2RR2 p 2 =0.04 RN NR1 2pq=0.32 NN0 q 2 =0.64 3RRR 3 p 3 =0.008 RRN RNR NRR 2 3p 2 q=0.096 NNR NRN RNN 1 3pq 2 =0.384 NNN0 q 3 =0.512

19 Bernoulli Trials 1 event,  = (p + q) 1 = p + q 2 events,  = (p + q) 2 = p 2 + 2pq + q 2 3 events,  = (p + q) 3 = p 3 + 3p 2 q + 3pq 2 + q 3 4 events,  = (p + q) 4 = p 4 + 4p 3 q + 6p 2 q 2 + 4pq 3 + q 4 Source: Earickson, RJ, and Harlin, JM. 1994. Geographic Measurement and Quantitative Analysis. USA: Macmillan College Publishing Co., p. 132. A graphical representation: probability # of successes The sum of the probabilities can be expressed using the binomial expansion of (p + q) n, where n = # of events

20 The Binomial Distribution A general formula for calculating the probability of x successes (n trials & a probability p of success: where C(n,x) is the number of possible combinations of x successes and (n –x) failures: P(x) = C(n,x) * p x * (1 - p) n - x C(n,x) = n! x! * (n – x)!

21 The Binomial Distribution – Example e.g., the probability of 2 successes in 4 trials, given p=0.2 is: P(x) = 4! 2! * (4 – 2)! * (0.2) 2 *(1 – 0.2) 4 - 2 P(x) = 24 2 * 2 * (0.2) 2 * (0.8) 2 P(x) = 6 * (0.04)*(0.64) = 0.1536

22 The Binomial Distribution – Example Calculating the probabilities of all possible number of rain days out of four days (p = 0.2): The chance of having one or more days of rain out of four: P(1) + P(2) + P(3) + P(4) = 0.5904 xP(x)C(n,x)p x (1 – p) n – x 0P(0)1(0.2) 0 (0.8) 4 =0.4096 1P(1)4(0.2) 1 (0.8) 3 =0.4096 2P(2)6(0.2) 2 (0.8) 2 =0.1536 3P(3)4(0.2) 3 (0.8) 1 =0.0256 4P(4)1(0.2) 4 (0.8) 0 =0.0016

23 The Binomial Distribution – Example Naturally, we can plot the probability mass function produced by this binomial distribution: x i P(x i ) 00.4096 10.4096 20.1536 30.0256 40.0016 0 0.25 0.50 P(xi)P(xi) 1234 xixi 0

24 Source: http://www.itl.nist.gov/div898/handbook/eda/section3/eda366i.htm The following is the plot of the binomial probability density function for four values of p and n = 100

25 Source: http://home.xnet.com/~fidler/triton/math/review/mat170/probty/p-dist/discrete/Binom/binom1.htm

26 Source: http://www.mpimet.mpg.de/~vonstorch.jinsong/stat_vls/s3.pdf

27 Rare Discrete Random Events Some discrete random events in question happen rarely (if at all), and the time and place of these events are independent and random (e.g., tornados) The greatest probability is zero occurrences at a certain time or place, with a small chance of one occurrence, an even smaller chance of two occurrences, etc. heavily peaked and skewed: 0 0.25 0.5 P(xi)P(xi) 1234 xixi 0

28 The Poisson Distribution In the 1830s, S.D. Poisson described a distribution with these characteristics Describing the number of events that will occur within a certain area or duration (e.g. # of meteorite impacts per state, # of tornados per year, # of hurricanes in NC) Poisson distribution’s characteristics: 1. It is used to count the number of occurrences of an event within a given unit of time, area, volume, etc. (therefore a discrete distribution)

29 The Poisson Distribution 2. The probability that an event will occur within a given unit must be the same for all units (i.e. the underlying process governing the phenomenon must be invariant) 3. The number of events occurring per unit must be independent of the number of events occurring in other units (no interactions) 4. The mean or expected number of events per unit (λ) is found by past experience (observations)

30 The Poisson Distribution Poisson formulated his distribution as follows: P(x) = e - * x x! To calculate a Poisson distribution, you must know λ where e = 2.71828 (base of the natural logarithm) λ = the mean or expected value x = 1, 2, …, n – 1, n # of occurrences x! = x * (x – 1) * (x – 2) * … * 2 * 1

31 The Poisson Distribution Poisson distribution P(x) = e - * x x! The shape of the distribution depends strongly upon the value of λ, because as λ increases, the distribution becomes less skewed, eventually approaching a normal-shaped distribution as it gets quite large  increases, the distribution becomes less skewed We can evaluate P(x) for any value of x, but large values of x will have very small values of P(x)

32 λ

33 The Poisson Distribution Poisson distribution P(x) = e - * x x! The shape of the distribution depends strongly upon the value of λ, because as λ increases, the distribution becomes less skewed, eventually approaching a normal-shaped distribution as l gets quite large increases, the distribution becomes less skewed We can evaluate P(x) for any value of x, but large values of x will have very small values of P(x)

34 The Poisson Distribution Poisson distribution P(x) = e - * x x! The Poisson distribution can be defined as the limiting case of the binomial distribution: constant

35 The Poisson Distribution The Poisson distribution is sometimes known as the Law of Small Numbers, because it describes the behavior of events that are rare We can observe the frequency of some rare phenomenon, find its mean occurrence, and then construct a Poisson distribution and compare our observed values to those from the distribution (effectively expected values) to see the degree to which our observed phenomenon is obeying the Law of Small Numbers:

36 Murder rates in Fayetteville Fitting a Poisson distribution to the 24-hour murder rates in Fayetteville in a 31-day month (to ask the question ‘Do murders randomly occur in time?’) # of Murders Days (Frequency) 017 19 23 31 41 Total 31 days

37 Murder rates in Fayetteville = mean murders per day = 22 / 31 = 0.71 # of Murders Days# of Murders* # of Days 0170 199 236 313 41 4 Total 31 days22

38 Murder rates in Fayetteville = mean murders per day = 22 / 31 = 0.71

39 Murder rates in Fayetteville x (# of Murders)Obs. Frequency (F obs ) x*F obs F exp 017 015.2 19 910.9 23 63.7 31 30.9 41 40.2 Total 31 2230.9 = mean murders per day = 22 / 31 = 0.71 We can compare F obs to F exp using a X 2 test to see if observations do match Poisson Dist.

40 Murder rates in Fayetteville

41 The Poisson Distribution Procedure for finding Poisson probabilities and expected frequencies: (1) Set up a table with five columns as on the previous slide (2) Multiply the values of x by their observed frequencies (x * F obs ) (3) Sum the columns of F obs (observed frequency) and x * F obs (4) Compute λ = Σ (x * F obs ) / Σ F obs (5) Compute P(x) values using the equation or a table (6) Compute the values of Fexp = P(x) * Σ F obs

42 Source: http://www.mpimet.mpg.de/~vonstorch.jinsong/stat_vls/s3.pdf


Download ppt "Office 310 320.... Probability-Related Concepts How to Assign Probabilities to Experimental Outcomes Probability Rules Discrete Random Variables Continuous."

Similar presentations


Ads by Google