Presentation on theme: "Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm."— Presentation transcript:
Your 4 th homework is assigned. It is due on 12 th of Feb, 11:59 pm.
a) Adult not working during summer vacation. b) The experiment consist of 10 identical trials. A trial for this experiment is an individual. There are only two possible outcomes: work or do not work The probability remains same for each individual (trial) Individuals are independent c) 0.35 d) e)
Thinking Challenge The communications monitoring company Postini has reported that 91% of messages are spam. Suppose your inbox contains 25 messages. What are the mean and standard deviation of the number of real messages you should expect to find in your inbox? What is the probability that you will find only 1 or 2 real messages?
Thinking Challenge What are the mean and standard deviation of the number of real messages you should expect to find in your inbox? (2.25, 1.43) What is the probability that you will find only 1 or 2 real messages? (P(X=1)+P(X=2)= )
Content 1.Two Types of Random Variables 2.Probability Distributions for Discrete Random Variables 3.The Binomial Distribution 4.Hypergeometric Distributions 5.Probability Distributions for Continuous Random Variables 6.The Normal Distribution 7.Uniform Distribution
4.4 Hypergeometric Distribution
Characteristics of a Hypergeometric Random Variable 1.The experiment consists of randomly drawing n elements without replacement from a set of N elements, r of which are Ss (for success) and (N – r) of which are Fs (for failure). 2.The hypergeometric random variable x is the number of Ss in the draw of n elements.
Hypergeometric Probability Distribution Function where... [x = Maximum [0, n – (N – r), …, Minimum (r, n)]
Hypergeometric Probability Distribution Function N = Total number of elements r = Number of Ss in the N elements n = Number of elements drawn x = Number of Ss drawn in the n elements
a. Hypergeometric b. Binomial
Thinking Challenge A carton of 12 eggs has 4 rotten eggs and 8 good eggs. Three eggs are chosen at random from the carton to make a three-egg omelet. Let X = the number of rotten eggs chosen. What is the probability that the sample will consist of one rotten egg and two good eggs, that is, what is P(X = 1)? (a) 81/220 (b) 192/220 (c) 112/220 (d) 56/220
Thinking Challenge 5 cards are picked from a deck of 52 cards without replacement. What is the probability that 2 of the selected cards will be Ace? a) b) c) d)
4.5 Probability Distributions for Continuous Random Variables
Continuous Probability Density Function The graphical form of the probability distribution for a continuous random variable x is a smooth curve that might appear as below:
Continuous Probability Density Function This curve, a function of x, is denoted by the symbol f(x) and is variously called a probability density function (pdf), a frequency function, or a probability distribution. The areas under a probability distribution correspond to probabilities for x. The area A beneath the curve between two points a and b is the probability that x assumes a value between a and b.
Continuous Probability Density Function P(a
4.6 The Normal Distribution
Importance of Normal Distribution 1.Describes many random processes or continuous phenomena 2.Can be used to approximate discrete probability distributions Example: binomial 3.Basis for classical statistical inference
Normal Distribution 1.Bell-shaped & symmetrical 2.Mean, median, mode are equal x f(x) Mean Median Mode
Probability Density Function where µ = Mean of the normal random variable x = Standard deviation π = e = P(x < a) is obtained from a table of normal probabilities
Effect of Varying Parameters ( & )
Normal Distribution Probability cd x f(x) Probability is area under curve!
Standard Normal Distribution The standard normal distribution is a normal distribution with µ = 0 and = 1. A random variable with a standard normal distribution, denoted by the symbol z, is called a standard normal random variable.
The Standard Normal Table: P(0 < z < 1.96) z = 0 = Z Standard Normal Probability Table (Portion) Probabilities.4750
The Standard Normal Table: P(–1.26 z 1.26) z = 0 = 1 –1.26 Standard Normal Distribution P(–1.26 z 1.26) = =.7924
The Standard Normal Table: P(z > 1.26) z Standard Normal Distribution 1.26 P(z > 1.26) =.5000 –.3962 = = 1 = 0.500
The Standard Normal Table: P(–2.78 z –2.00) –2.78 z – Standard Normal Distribution P(–2.78 z –2.00) =.4973 –.4772 =.0201 = 1 = 0
The Standard Normal Table: P(z > –2.13) z –2.13 Standard Normal Distribution P(z > –2.13) = = = 1 = 0
The rest of the quizzes will be held in the lecture on Wednesdays. The updated dates for quizzes are as below. I will take attendance and will give extra credit to the students who have attended at least 8 of the lectures till the end of the semester.
Non-standard Normal Distribution x f(x) Normal distributions differ by mean & standard deviation. Each distribution would require its own table. Thats an infinite number of tables!
Property of Normal Distribution If x is a normal random variable with mean μ and standard deviation, then the random variable z, defined by the formula has a standard normal distribution. The value z describes the number of standard deviations between x and µ.
Standardize the Normal Distribution Normal Distribution x One table! = 1 z Standard Normal Distribution = 0
Finding a Probability Corresponding to a Normal Random Variable 1.Sketch normal distribution, indicate mean, and shade the area corresponding to the probability you want. 2.Convert the boundaries of the shaded area from x values to standard normal random variable z Show the z values under corresponding x values. 3.Use Table in Appendix D to find the areas corresponding to the z values. Use symmetry when necessary.
Of course, the TI does it all: normalcdf(a,b,μ,σ) returns the probability P(a < x < b) with x distributed N(μ, σ).
Non-standard Normal μ = 5, σ = 10: P(5 < x < 6.2) z = 1.12 Standard Normal Distribution.0478 Normal Distribution x = 5 = = 0
Non-standard Normal μ = 5, σ = 10: P(3.8 x 5) z = Standardized Normal Distribution Normal Distribution x = = 10 = 1
Non-standard Normal μ = 5, σ = 10: P(2.9 x 7.1) z.21 Standard Normal Distribution x Normal Distribution = 10 = 1
Non-standard Normal μ = 5, σ = 10: P(x 8) x = 5 = 10 8 Normal Distribution = 1 z = 0.30 Standard Normal Distribution
Non-standard Normal μ = 5, σ = 10: P(7.1 X 8) = 1.30 z.21 Standard Normal Distribution = x Normal Distribution = 5 = 0
Normal Distribution Thinking Challenge You work in Quality Control for GE. Light bulb life has a normal distribution with = 2000 hours and = 200 hours. Whats the probability that a bulb will last A. between 2000 and 2400 hours? B. less than 1470 hours?
Solution* P(2000 x 2400) Standard Normal Distribution z = Normal Distribution x = 2000 = =
Solution* P(x 1470) z = 0 = 1 –2.65 Standard Normal Distribution x = 2000 = Normal Distribution
Finding z-Values for Known Probabilities What is z, given P(z) =.1217? z = 0 = 1 ?.1217 Standard Normal Probability Table (Portion) Z
Finding x Values for Known Probabilities Normal Distribution x = 5 = 10 ?.1217 Standard Normal Distribution z = 0 =
Using a TI: invNorm(p, μ, σ) returns the 100pth percentile of N(μ, σ).
Normal Distribution Thinking Challenge At one university, the students are given z-scores at the end of each semester rather than the traditional GPAs. The mean and the standard deviation of all students cumulative GPAs, on which the z-scores are based, are 2.7 and.5, respectively. a) Translate each of the following z-scores to the corresponding GPA: z=2.0, z=-1, z=0.5, z=-2.5. b) Students with z-scores below -1.6 are put on probation. What is the corresponding probationary GPA? c) The president of the university wishes to graduate the top 20% of the students with cum laude honors and the top 2.5% with summa cum laude honors. Under the assumption that the distribution is exactly normal, by using the Table Z in the appendix, determine the limits be set in terms of original GPAs.
Solution a) 3.7, 2.2, 2.95, 1.45 b) 1.9 c)P(z > 0.84)=0.20 P(z > 1.96)=0.025 So, for cum laude: 0.84=(x-2.7)/0.5 x= 3.12; for summa cum laude: 1.96=(x-2.7)/0.5 x= 3.68
4.7 Descriptive Methods for Assessing Normality
Determining Whether the Data Are from an Approximately Normal Distribution 1.Construct either a histogram or stem-and-leaf display for the data and note the shape of the graph. If the data are approximately normal, the shape of the histogram or stem-and-leaf display will be similar to the normal curve.
Determining Whether the Data Are from an Approximately Normal Distribution 2.Compute the intervals and determine the percentage of measurements falling in each. If the data are approximately normal, the percentages will be approximately equal to 68%, 95%, and 100%, respectively; from the Empirical Rule (68%, 95%, 99.7%).
Determining Whether the Data Are from an Approximately Normal Distribution 3.Find the interquartile range, IQR, and standard deviation, s, for the sample, then calculate the ratio IQR/s. If the data are approximately normal, then IQR/s 1.3.
Determining Whether the Data Are from an Approximately Normal Distribution 4.Examine a normal probability plot for the data. If the data are approximately normal, the points will fall (approximately) on a straight line. Observed value Expected z–score
Normal Probability Plot A normal probability plot for a data set is a scatterplot with the ranked data values on one axis and their corresponding expected z-scores from a standard normal distribution on the other axis. [Note: Computation of the expected standard normal z-scores are beyond the scope of this lecture. Therefore, we will rely on available statistical software packages to generate a normal probability plot.]
Using Stat Cranch for Normal probability plot
4.8 Other Continuous Distributions: Uniform
Uniform Distribution Continuous random variables that appear to have equally likely outcomes over their range of possible values possess a uniform probability distribution. Suppose the random variable x can assume values only in an interval c x d. Then the uniform frequency function has a rectangular shape.
Probability Distribution for a Uniform Random Variable x Mean: Probability density function: Standard Deviation:
Uniform Distribution Example Youre production manager of a soft drink bottling company. You believe that when a machine is set to dispense 12 oz., it really dispenses between 11.5 and 12.5 oz. inclusive. Suppose the amount dispensed has a uniform distribution. What is the probability that less than 11.8 oz. is dispensed? SODA
Uniform Distribution Solution P(11.5 x 11.8)= (Base)/(Height) = (11.8 – 11.5)/(1) = f(x)f(x) x
Consider a powder specimen that has exactly 10 anthrax spores. Suppose that the number of anthrax spores in the sample detected by the new method follows an approximate uniform distribution between 0 and 10. a. Find the probability that 8 or fewer anthrax spores are detected in the powder specimen. b. Find the probability that between 2 and 5 anthrax spores are detected in the powder specimen. c. Find the expected number of anthrax spores. d. Find the standard deviation of anthrax spores. Review Q1
Review Q2 The output from a statistical computer program indicates that the mean and standard deviation of a data set consisting of 200 measurements are $1500 and $300, respectively. Suppose the frequency distribution of the data set has normal distribution. What of percentage of data will be between $750 and $1200?
Review Q3 Almost all companies utilize some type of year-end performance review for their employees. Human Resources(HR) at the university of Texas health Science Center provides guidelines for supervisors rating their subordinates. For example raters are advised to examine their ratings for a tendency to be too lenient or too harsh. According to HR, if you have this tendency, consider using a normal distribution- 10% of employees (rated) exemplary, 20% distinguished, 40% competent, 20% marginal, and 10% unacceptable. Suppose you are rating an employees performance on a scale of 1(lowest) to 100(highest). Also assume the ratings follow normal distribution with a mean of 50 and a standard deviation of 15. a)What is the lowest rating you should give to an exemplary employee if you follow the Univ. of Texas HR guidelines? b) What is the lowest rating you should give to an competent employee if you follow the Univ. of Texas HR guidelines?
Review Q4 Suppose that 5 out of 13 liver transplants done at a hospital will fail within a year. Consider a random sample of 4 of these 13 patients. What is the probability that 1 of these patients will result in failed transplants?
Key Ideas Properties of Probability Distributions Discrete Distributions 1.p(x) 0 2. Continuous Distributions 1.P(x = a) = 0 2. P(a < x < b) = area under curve between a and b
Key Ideas Methods for Assessing Normality 1. Histogram
Key Ideas Methods for Assessing Normality 2. Stem-and-leaf display
Key Ideas Methods for Assessing Normality 3. (IQR)/S Normal probability plot