Presentation is loading. Please wait.

Presentation is loading. Please wait.

The role of probability in statistics In statistical inference, we want to make general statements about the population based on measurements taken from.

Similar presentations


Presentation on theme: "The role of probability in statistics In statistical inference, we want to make general statements about the population based on measurements taken from."— Presentation transcript:

1 The role of probability in statistics In statistical inference, we want to make general statements about the population based on measurements taken from a sample. How will all suspension systems produced with the new composite behave? How will the radar navigation system perform in all aircraft? What speed improvements will we obtain for all roofing applications using the new product? To answer these questions, we ___________ from the population and hope to generalize the results. ETM 620 - 09U 1

2 Probability See chapter 1 of Hines et al. for more The probability of an event, A is the likelihood of that event given the entire sample space of possible events. 0 ≤ P(A) ≤ 1 P(ø) = 0 P(S) = 1 For mutually exclusive events, P(A 1 U A 2 U … U A k ) = P(A 1 ) + P(A 2 ) + … P(A k ) ETM 620 - 09U 2

3 Applications of probability Example: An appliance manufacturer has learned of an increased incidence of short circuits and fires in a line of ranges sold over a 5 month period. A review of the FMEA data indicates the probabilities that if a short circuit occurs, it will be at any one of several locations is as follows (assume these are independent, mutually exclusive events): LocationP House Junction0.46 Oven/MW junction0.14 Thermostat0.09 Oven coil0.24 Electronic controls0.07 ETM 620 - 09U 3

4 Applications of probability The probability that the short circuit does not occur at the house junction is … The probability that the short circuit occurs at either the Oven/MW junction or the oven coil is … The probability that both the electronic controls and the thermostat short circuit simultaneously is … ETM 620 - 09U 4

5 Serial and parallel systems For increased safety and reliability, systems are often designed with redundancies. A typical system might look like the following: ETM 620 - 09U 5

6 Serial and parallel systems What is the probability that: Segment 1 works? Segment 2 works? The entire system works? 1 2 ETM 620 - 09U 6

7 Conditional probability If segment 2 works, what is the probability that component C does not work? ETM 620 - 09U 7

8 Conditional probability If the system works, what is the probability that component D does not work? ETM 620 - 09U 8

9 Defining probabilities: Random variables To learn more, refer to chapter 2 of Hines et al. Examples: Out of 100 heart catheterization procedures performed at a local hospital each year, the probability that more than five of them will result in complications is __________ Drywall anchors are sold in packs of 50 at the local hardware store. The probability that no more than 3 will be defective is __________ In general, ___________ ETM 620 - 09U 9

10 10 Functions of random variables Chapter 3 of Hines et al. Example: Probability of X, the number of cars passing through a car wash in one hour on a sunny Friday afternoon, is given by Let g(X) = 2X -1 represent the amount of money paid to the attendant by the manager. What can the attendant expect to earn during this hour on any given sunny Friday afternoon? E[g(X)] = Σ g(x) f(x) = ____________________ = _______________________________ x456789 P(X = x)1/12 1/4 1/6

11 ETM 620 - 09U 11 Mathematical expectation Example: Repair costs for a particular machine are represented by the following probability distribution: What is the expected value of the repairs? – That is, over time what do we expect repairs to cost on average? x$50200350 P(X = x)0.30.20.5

12 ETM 620 - 09U 12 Expected value μ = E(X) μ = mean of the probability distribution For discrete variables, μ = E(X) = ∑ x f(x) So, for our example, E(X) = ________________________

13 ETM 620 - 09U 13 Your turn … By investing in a particular stock, a person can take a profit in a given year of $4000 with a probability of 0.3 or take a loss of $1000 with a probability of 0.7. What is the investor’s expected gain on the stock?

14 ETM 620 - 09U 14 Variability Recall our example: Repair costs for a particular machine are represented by the following probability distribution: What is the variance in the repair cost? – That is, how might we define the spread of costs? x$50200350 P(X = x)0.30.20.5

15 ETM 620 - 09U 15 Variance Remember  μ = E(X) = $230 Then, for discrete variables, σ 2 = E [(X - μ ) 2 ] = ∑ (x - μ ) 2 f(x) = E (X 2 ) - μ 2 So, for our example, σ 2 = ________________________

16 ETM 620 - 09U 16 Your turn … By investing in a particular stock, a person can take a profit in a given year of $4000 with a probability of 0.3 or take a loss of $1000 with a probability of 0.7. What are the variance and standard deviation of the investor’s gain on the stock?

17 Known probability distributions Engineers frequently work with data that can be modeled as one of several known probability distributions. Being able to model the data allows us to: model real systems design predict results Key discrete probability distributions (Chapter 5) include: binomial negative binomial (incl. Pascal) hypergeometric Poisson ETM 620 - 09U 17

18 Bernoulli trials Examples: Inspect tires coming off the production line. Classify each as defective or not defective. Define “success” as defective. If historical data shows that 95% of all tires are defect-free, then P(“success”) = 0.05. Signals picked up at a communications site are either incoming speech signals or “noise.” Define “success” as the presence of speech. P(“success”) = P(“speech”) Administer a test drug to a group of patients with a specific condition. P(“success”) = ___________ Bernoulli Process n repeated trials the outcome may be classified as “success” or “failure” the probability of success (p) is constant from trial to trial repeated trials are independent. ETM 620 - 09U 18

19 Binomial distribution Example: Historical data indicates that 10% of all bits transmitted through a digital transmission channel are received in error. Let X = the number of bits in error in the next 4 bits transmitted. Assume that the transmission trials are independent. What is the probability that Exactly 2 of the bits are in error? At most 2 of the 4 bits are in error? more than 2 of the 4 bits are in error? The number of successes, X, in n Bernoulli trials is called a binomial random variable. ETM 620 - 09U 19

20 Binomial distribution The probability distribution is called the binomial distribution. b(x; n, p) =, x = 0, 1, 2, …, n where p = _________________ q = _________________ For our example, b(x; n, p) = _________________ ETM 620 - 09U 20

21 For our example … What is the probability that exactly 2 of the bits are in error? At most 2 of the 4 bits are in error? ETM 620 - 09U 21

22 Your turn … What is the probability that more than 2 of the 4 bits are in error? ETM 620 - 09U 22

23 Expectations of the binomial distribution The mean and variance of the binomial distribution are given by µ = np σ 2 = npq Suppose, in our example, we check the next 20 bits. What are the expected number of bits in error? What is the standard deviation? μ = ___________ σ 2 = __________, σ = __________ ETM 620 - 09U 23

24 Another example A worn machine tool produces 1% defective parts. If we assume that parts produced are independent, what is the mean number of defective parts that would be expected if we inspect 25 parts? What is the expected variance of the 25 parts? ETM 620 - 09U 24

25 Helpful hints … Sometimes it helps to draw a picture. Suppose we inspect the next 5 parts … P(at least 3)  P(2 ≤ X ≤ 4)  P(less than 4)  In Excel, the function =binomdist(number_s,trials,probability_s,cumulative) gives the cumulative distribution function (when cumulative = “true” or 1) and the probability mass function (when cumulative = “false” or 0). ETM 620 - 09U 25

26 Multinomial experiments What if there are more than 2 possible outcomes? (e.g., acceptable, scrap, rework) That is, suppose we have: n independent trials k outcomes that are mutually exclusive (e.g., ♠, ♣, ♥, ♦ ) exhaustive (i.e., ∑ all k p i = 1) Then f(x 1, x 2, …, x k ; p 1, p 2, …, p k, n) = ETM 620 - 09U 26

27 Example According to genetic theory, a certain cross of guinea pig will result in red, black, and white offspring in the ratio 8:4:4. What is the probability that among 8 offspring, 5 will be red, 2 black, and 1 white? f( __, __, __; ___, ___, ___, __) =_________________ = __________________________________ x 1 =_______p 1 =_______ x 2 =_______p 2 =_______n = _____ x 3 =_______p 3 =_______ ETM 620 - 09U 27

28 ETM 620 - 09U 28 Hypergeometric distribution Example*: Automobiles arrive in a dealership in lots of 10. Five out of each 10 are inspected. For one lot, it is know that 2 out of 10 do not meet prescribed safety standards. What is probability that at least 1 out of the 5 tested from that lot will be found not meeting safety standards? * from Complete Business Statistics, 4th ed (McGraw-Hill)

29 ETM 620 - 09U 29 This example follows a hypergeometric distribution: A random sample of size n is selected without replacement from N items. k of the N items may be classified as “successes” and N-k are “failures.” The probability associated with getting x successes in the sample (given k successes in the lot.) Where, k = number of “successes” = 2n = number in sample = 5 N = the lot size = 10x = number found = 1 or 2

30 Hypergeometric distribution In our example, = _____________________________ ETM 620 - 09U 30

31 ETM 620 - 09U 31 Expectations of the hypergeometric The mean and variance of the hypergeometric distribution are given by What are the expected number of cars that fail inspection in our example? What is the standard deviation? μ = ___________ σ 2 = __________, σ = __________

32 ETM 620 - 09U 32 Your turn … A worn machine tool produced defective parts for a period of time before the problem was discovered. Normal sampling of each lot of 20 parts involves testing 6 parts and rejecting the lot if 2 or more are defective. If a lot from the worn tool contains 3 defective parts: 1. What is the expected number of defective parts in a sample of six from the lot? 2. What is the expected variance? 3. What is the probability that the lot will be rejected?

33 ETM 620 - 09U 33 Binomial approximation Note, if N >> n, then we can approximate this with the binomial distribution. For example: Automobiles arrive in a dealership in lots of 100. 5 out of each 100 are inspected. 2 /10 (p=0.2) are indeed below safety standards. What is probability that at least 1 out of 5 will be found not meeting safety standards? Recall: P(X ≥ 1) = 1 – P(X < 1) = 1 – P(X = 0) Hypergeometric distributionBinomial distribution

34 ETM 620 - 09U 34 Negative binomial (Pascal) distribution Example: Historical data indicates that 30% of all bits transmitted through a digital transmission channel are received in error. An engineer is running an experiment to try to classify these errors, and will start by gathering data on the first 10 errors encountered. What is the probability that the 10 th error will occur on the 25 th trial?

35 ETM 620 - 09U 35 This example follows a negative binomial distribution: Repeated independent trials. Probability of success = p and probability of failure = q = 1-p. Random variable, X, is the number of the trial on which the r th success occurs. Note: when r is an integer value this is referred to as a Pascal distribution. The probability associated with the r th success occurring on trial x is given by, Where, r = “success number” = 10 x = trial number on which r occurs = 25 p = probability of success (error) = 0.3 q = 1 – p = 0.7

36 ETM 620 - 09U 36 Negative Binomial Distribution In our example, = _____________________________

37 ETM 620 - 09U 37 Poisson process The number of occurrences in a given interval or region with the following properties: “memoryless” P(occurrence) during a very short interval or small region is proportional to the size of the interval and doesn’t depend on number occurring outside the region or interval. P(X>1) in a very short interval is negligible Examples: Number of bits transmitted per minute. Number of calls to customer service in an hour. Number of bacteria in a given sample. Number of hurricanes per year in a given region.

38 ETM 620 - 09U 38 Poisson Process Example An average of 2.7 service calls per minute are received at a particular maintenance center. The calls correspond to a Poisson process. To determine personnel and equipment needs to maintain a desired level of service, the plant manager needs to be able to determine the probabilities associated with numbers of service calls. What is the probability that fewer than 2 calls will be received in any given minute?

39 ETM 620 - 09U 39 Poisson Distribution The probability associated with the number of occurrences in a given period of time is given by, Where, λ = average number of outcomes per unit time or region = 2.7 t = time interval or region = 1 minute

40 ETM 620 - 09U 40 Our example The probability that fewer than 2 calls will be received in any given minute is … P(X < 2) = P(X = 0) + P(X = 1) = __________________________ The mean and variance are both λ t, so μ = _____________________ Note: Table I, pp. 598-600, gives Σ t p(x; λt ) In Excel, the function =POISSON(x,mean,cumulative) gives the cumulative distribution function (when cumulative = “true” or 1) and the probability mass function (when cumulative = “false” or 0).

41 ETM 620 - 09U 41 Poisson distribution If more than 6 calls are received in a 3-minute period, an extra service technician will be needed to maintain the desired level of service. What is the probability of that happening? μ = λ t = _____________________ P(X > 6) = 1 – P(X < 6) = _____________________

42 ETM 620 - 09U 42 Poisson distribution

43 ETM 620 - 09U 43 Poisson Distribution The effect of λ on the Poisson distribution


Download ppt "The role of probability in statistics In statistical inference, we want to make general statements about the population based on measurements taken from."

Similar presentations


Ads by Google