Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-1 Lesson 5: Continuous Probability Distributions.

Slides:



Advertisements
Similar presentations
Continuous Random Variables and the Central Limit Theorem
Advertisements

Normal Probability Distributions 1 Chapter 5. Chapter Outline Introduction to Normal Distributions and the Standard Normal Distribution 5.2 Normal.
Chapter 5 Discrete Random Variables and Probability Distributions
© 2003 Prentice-Hall, Inc.Chap 5-1 Business Statistics: A First Course (3 rd Edition) Chapter 5 Probability Distributions.
Chapter 4 Discrete Random Variables and Probability Distributions
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson5-1 Lesson 5: Continuous Probability Distributions.
Chapter 6 Continuous Random Variables and Probability Distributions
Ka-fu Wong © 2003 Chap 7- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
The Normal Probability Distribution
Chapter 6 The Normal Distribution and Other Continuous Distributions
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson4-1 Lesson 4: Discrete Probability Distributions.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Continuous Random Variables and Probability Distributions
Chapter 5 Continuous Random Variables and Probability Distributions
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Continuous Probability Distributions Chapter 7.
TOPIC 5 Normal Distributions.
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 2000 LIND MASON MARCHAL 1-1 Chapter Six The Normal Probability Distribution GOALS When you have completed.
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson6-1 Lesson 6: Sampling Methods and the Central Limit Theorem.
Continuous Probability Distributions
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Chapter 4 Continuous Random Variables and Probability Distributions
Distributions Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Continuous Probability Distributions
© Copyright McGraw-Hill CHAPTER 6 The Normal Distribution.
Ch 7 Continuous Probability Distributions
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8 Continuous.
Chapter 6: Probability Distributions
Business Statistics: Communicating with Numbers
Continuous Random Variables and Probability Distributions
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
1 Normal Random Variables In the class of continuous random variables, we are primarily interested in NORMAL random variables. In the class of continuous.
Theory of Probability Statistics for Business and Economics.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
The Gaussian (Normal) Distribution: More Details & Some Applications.
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
Chapter 6.1 Normal Distributions. Distributions Normal Distribution A normal distribution is a continuous, bell-shaped distribution of a variable. Normal.
Modular 11 Ch 7.1 to 7.2 Part I. Ch 7.1 Uniform and Normal Distribution Recall: Discrete random variable probability distribution For a continued random.
7- 1 Chapter Seven McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Managerial Decision Making Facilitator: René Cintrón MBA / 510.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
The Normal Distribution
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter The Normal Probability Distribution 7.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved THE Normal PROBABILITY DISTRIBUTION.
7- 1 Chapter Seven McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
INTRODUCTORY MATHEMATICAL ANALYSIS For Business, Economics, and the Life and Social Sciences  2011 Pearson Education, Inc. Chapter 16 Continuous Random.
Ka-fu Wong © 2003 Chap 6- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Basic Business Statistics
SESSION 37 & 38 Last Update 5 th May 2011 Continuous Probability Distributions.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
5 - 1 © 1998 Prentice-Hall, Inc. Chapter 5 Continuous Random Variables.
Continuous Random Variables and Probability Distributions
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
© 1999 Prentice-Hall, Inc. Chap Statistics for Managers Using Microsoft Excel Chapter 6 The Normal Distribution And Other Continuous Distributions.
5 - 1 © 2000 Prentice-Hall, Inc. Statistics for Business and Economics Continuous Random Variables Chapter 5.
1 Math 10 Part 4 Slides Continuous Random Variables and the Central Limit Theorem © Maurice Geraghty, 2015.
Chap 5-1 Discrete and Continuous Probability Distributions.
THE NORMAL DISTRIBUTION
Normal Distribution 1. Objectives  Learning Objective - To understand the topic on Normal Distribution and its importance in different disciplines. 
Chapter Normal Probability Distributions 1 of 25 5  2012 Pearson Education, Inc. All rights reserved.
Normal Probability Distributions
Continuous Probability Distributions
Chapter 6 Created by Bethany Stubbe and Stephan Kogitz.
Normal Probability Distributions
Analysis of Economic Data
The Normal Distribution
Chapter 5 Normal Probability Distribution
Presentation transcript:

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-1 Lesson 5: Continuous Probability Distributions

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-2 Outline Continuous probability distributions Features of univariate probability distribution Features of bivariate probability distribution Marginal density and Conditional density Expectation, Variance, Covariance and Correlation Coefficient Importance of normal distribution The normal approximation to the binomial

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-3 Types of Probability Distributions Number of random variablesJoint distribution 1Univariate probability distribution 2Bivariate probability distribution 3Trivariate probability distribution …… nMultivariate probability distribution Probability distribution may be classified according to the number of random variables it describes.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-4 Continuous Probability Distributions The curve f(x) is the continuous probability distribution (or probability curve or probability density function) of the random variable X if the probability that X will be in a specified interval of numbers is the area under the curve f(x) corresponding to the interval. Properties of f(x) 1.f(x)  0 for all x 2.The total area under the curve of f(x) is equal to 1

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-5 Features of a Univariate Continuous Distribution Let X be a random variable that takes any real values in an interval between a and b. The number of possible outcomes are by definition infinite. The main features of a probability density function f(x) are: f(x)  0 for all x and may be larger than 1. The probability that X falls into an subinterval (c,d) is and lies between 0 and 1. P(X  (a,b)) = 1. P(X = x) = 0.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-6 The Univariate Uniform Distribution If c and d are numbers on the real line, the random variable X ~ U(c,d), i.e., has a univariate uniform distribution if The mean and standard deviation of a uniform random variable x are

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-7 The Uniform Density

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-8 The Normal Probability Distribution The random variable X ~ N( ,  2 ), i.e., has a univariate normal distribution if for all x on the real line (- ,+  )  and  are the mean and standard deviation,  = … and e = is the base of natural or Naperian logarithms.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-9 Learning exercise 4: Part-time Work on Campus A student has been offered part-time work in a laboratory. The professor says that the work will vary from week to week. The number of hours will be between 10 and 20 with a uniform probability density function, represented as follows: How tall is the rectangle? What is the probability of getting less than 15 hours in a week? Given that the student gets at least 15 hours in a week, what is the probability that more than 17.5 hours will be available?

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-10 Learning exercise 4: Part-time Work on Campus How tall is the rectangle? (20-10)*h = 1 h=0.1 What is the probability of getting less than 15 hours in a week? 0.1*(15-10) = 0.5 Given that the student gets at least 15 hours in a week, what is the probability that more than 17.5 hours will be available? 0.1*( ) = /0.5 = 0.5 P(hour>17.5)/P(hour>15)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-11 Features of a Bivariate Continuous Distribution Let X 1 and X 2 be a random variables that takes any real values in a region (rectangle) of (a,b,c,d). The number of possible outcomes are by definition infinite. The main features of a probability density function f(x 1,x 2 ) are: f(x 1,x 2 )  0 for all (x 1,x 2 ) and may be larger than 1. The probability that (X 1,X 2 ) falls into a region (rectangle) or (p,q,r,s) is and lies between 0 and 1. P((X 1,X 2 )  (a,b,c,d)) = 1. P((X 1,X 2 ) = (x 1,x 2 ) ) = 0.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-12 The Bivariate Uniform Distribution If a, b, c and d are numbers on the real line,, the random variable (X 1,X 2 ) ~ U(a,b,c,d), i.e., has a bivariate uniform distribution if

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-13 The Marginal Density The marginal density functions are:

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-14 The Conditional Density The conditional density functions are:

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-15 The Expectation (Mean) of Continuous Probability Distribution For univariate probability distribution, the expectation or mean E(X) is computed by the formula: For bivariate probability distribution, the the expectation or mean E(X) is computed by the formula:

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-16 Conditional Mean of Bivariate Discrete Probability Distribution For bivariate probability distribution, the conditional expectation or conditional mean E(X|Y) is computed by the formula: Unconditional expectation or mean of X, E(X)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-17 Expectation of a linear transformed random variable If a and b are constants and X is a random variable, then E(a) = a E(bX) = bE(X) E(a+bX) = a+bE(X)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-18 The Variance of a Continuous Probability Distribution For univariate continuous probability distribution If a and b are constants and X is a random variable, then V(a) = 0 V(bX) = b 2 V(X) V(a+bX) = b 2 V(X)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-19 The Covariance of a Bivariate Discrete Probability Distribution Covariance measures how two random variables co-vary. If a and b are constants and X is a random variable, then C(a,b) = 0 C(a,bX) = 0 C(a+bX,Y) = bC(X,Y)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-20 Variance of a sum of random variables If a and b are constants and X and Y are random variables, then V(X+Y) = V(X) + V(Y) + 2C(X,Y) V(aX+bY) =a 2 V(X) + b 2 V(Y) + 2abC(X,Y)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-21 Correlation coefficient The strength of the dependence between X and Y is measured by the correlation coefficient:

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-22 Importance of Normal Distribution 1.Describes many random processes or continuous phenomena 2.Basis for Statistical Inference

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-23 Characteristics of a Normal Probability Distribution 1.bell-shaped and single-peaked (unimodal) at the exact center of the distribution.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-24 Characteristics of a Normal Probability Distribution 2.Symmetrical about its mean. The arithmetic mean, median, and mode of the distribution are equal and located at the peak. Thus half the area under the curve is above the mean and half is below it.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-25 Characteristics of a Normal Probability Distribution The normal probability distribution is asymptotic. That is the curve gets closer and closer to the X-axis but never actually touches it.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-26 N(0,  2 ) Symmetric Mean=median = mode Unimodal Bell-shaped Asymptotic

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-27 N( ,  2 ) x x x (a) (b) (c)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-28 Normal Distribution Probability Probability is the area under the curve! c d X f(X)f(X) A table may be constructed to help us find the probability

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-29 Infinite Number of Normal Distribution Tables Normal distributions differ by mean & standard deviation. Each distribution would require its own table. X f(X)f(X)

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data The Standard Normal Probability Distribution -- N(0,1) The standard normal distribution is a normal distribution with a mean of 0 and a standard deviation of 1. It is also called the z distribution. A z-value is the distance between a selected value, designated X, and the population mean , divided by the population standard deviation, . The formula is:

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Transform to Standard Normal Distribution -- A numerical example Any normal random variable can be transformed to a standard normal random variable x x-  (x-  )/σ x/σx/σ Mean std

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data The Standard Normal Probability Distribution Any normal random variable can be transformed to a standard normal random variable Suppose X ~ N(µ,  2 ) Z=(X-µ)/  ~ N(0,1) P(X<k) = P [(X-µ)/  < (k-µ)/  ]

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-33 Standardize the Normal Distribution  Z = 0  z = 1 Z Because we can transform any normal random variable into standard normal random variable, we need only one table! Normal Distribution Standardized Normal Distribution X  

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-34 Standardizing Example Z  Z = 0  Z = 1.12 Normal Distribution Standardized Normal Distribution X  = 5  =

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-35 Obtaining the Probability Z  Z = 0  Z = Z Standardized Normal Probability Table (Portion) Probabilities Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-36 Example P(3.8  X  5) Z  Z = 0  Z = Normal Distribution Standardized Normal Distribution Shaded Area Exaggerated X  = 5  =

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-37 Example (2.9  X  7.1) 0  Z = Z.21 Normal Distribution Standardized Normal Distribution 5  = X Z X Z X               Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-38 Example (2.9  X  7.1) 0  Z = Z.21 Normal Distribution Standardized Normal Distribution 5  = X Z X Z X               Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-39 Example P(X  8) Z  Z = 0  Z = 1.30 Normal Distribution Standardized Normal Distribution Z X        X  = 5  = 10 8 Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-40 Example P(7.1  X  8)  z = 0  Z = 1.30 Z.21 Normal Distribution Standardized Normal Distribution Z X Z X                = 5  = X Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-41 Normal Distribution Thinking Challenge You work in Quality Control for GE. Light bulb life has a normal distribution with µ= 2000 hours &  = 200 hours. What’s the probability that a bulb will last between 2000 & 2400 hours? less than 1470 hours?

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-42 Solution P(2000  X  2400) Z  Z = 0  Z = Normal Distribution.4772 Standardized Normal Distribution Z X        X  = 2000  = P(2000<X<2400) = P [(2000-µ)/  <(X-µ)/  < (2400-µ)/  ] = P[(X-µ)/  < (2400-µ)/  ] – P [(X-µ)/  < (2000-µ)/  ] = P[(X-µ)/  < (2400-µ)/  ] – 0.5 Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-43 Solution P(X  1470) Z  Z = 0  Z = Normal Distribution Standardized Normal Distribution Z X        X  = 2000  = P(X<1470) = P [(X-µ)/  < (1470-µ)/  ] Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-44 Finding Z Values for Known Probabilities Z Z  Z = 0  Z = Standardized Normal Probability Table (Portion) What Is Z Given P(Z) = ? Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-45 Finding X Values for Known Probabilities Z  Z = 0  Z = 1.31 X  = 5  = 10 ? Normal DistributionStandardized Normal Distribution.1217 Shaded Area Exaggerated

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-46 EXAMPLE 1 The bi-monthly starting salaries of recent MBA graduates follows the normal distribution with a mean of $2,000 and a standard deviation of $200. What is the z-value for a salary of $2,400?

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-47 EXAMPLE 1 continued A z-value of 2 indicates that the value of $2,400 is one standard deviation above the mean of $2,000. A z-value of – 1.50 indicates that $1,900 is 1.5 standard deviation below the mean of $2000. What is the z-value of $1,900 ?

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-48 Areas Under the Normal Curve About 68 percent of the area under the normal curve is within one standard deviation of the mean.  ±  P(  -  < X <  +  ) = About 95 percent is within two standard deviations of the mean.  ± 2  P(  - 2  < X <  + 2  ) = Practically all is within three standard deviations of the mean.  ± 3  P(  - 3  < X <  + 3  ) =

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-49 EXAMPLE 2 The daily water usage per person in New Providence, New Jersey is normally distributed with a mean of 20 gallons and a standard deviation of 5 gallons. About 68 percent of those living in New Providence will use how many gallons of water? About 68% of the daily water usage will lie between 15 and 25 gallons.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-50 EXAMPLE 2 continued What is the probability that a person from New Providence selected at random will use between 20 and 24 gallons per day? P(20<X<24) =P[(20-20)/5 < (X-20)/5 < (24-20)/5 ] =P[ 0<Z<0.8 ] The area under a normal curve between a z-value of 0 and a z-value of 0.80 is We conclude that percent of the residents use between 20 and 24 gallons of water per day.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-51 How do we find P(0<z<0.8) P(0<z<0.8) = P(z<0.8) – P(z<0) = – 0.5 = P(z<c) c P(0<z<c) c 0 P(0<z<0.8) =

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-52 EXAMPLE 2 continued What percent of the population use between 18 and 26 gallons of water per day? Suppose X ~ N(µ,  2 ) Z=(X-µ)/  ~ N(0,1) P(X<k) = P [(X-µ)/  < (k-µ)/  ]

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-53 How do we find P(-0.4<z<1.2) P(z<c) c P(0<z<c) c 0 P(-0.4<z<1.2) = P(-0.4<z<0) + P(0<z<1.2) =P(0<z<0.4) + P(0<z<1.2) = = P(-0.4<z<1.2) = P(z<1.2) - P(z<-0.4) = P(z 0.4) = P(z<1.2) – [1- P(z<0.4)] = – [ ] = P(-0.4<z<0) =P(0<z<0.4) because of symmetry of the z distribution.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-54 EXAMPLE 3 Professor Mann has determined that the scores in his statistics course are approximately normally distributed with a mean of 72 and a standard deviation of 5. He announces to the class that the top 15 percent of the scores will earn an A. What is the lowest score a student can earn and still receive an A?

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-55 Example 3 continued To begin let k be the score that separates an A from a B. 15 percent of the students score more than k, then 35 percent must score between the mean of 72 and k. Write down the relation between k and the probability: P(X>k) = 0.15 and P(X k) = 0.85 Transform X into z: P[(X-72)/5) < (k-72)/5 ] = P[z < (k-72)/5] P[0<z < s] = = 0.35 Find s from table: P[0<z<1.04]=0.35 Compute k: (k-72)/5=1.04 implies K=77.2 Those with a score of 77.2 or more earn an A.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-56 The Normal Approximation to the Binomial The normal distribution (a continuous distribution) yields a good approximation of the binomial distribution (a discrete distribution) for large values of n. The normal probability distribution is generally a good approximation to the binomial probability distribution when n  and n(1-  ) are both greater than 5. Why can we approximate binomial by normal? Because of the Central Limit Theorem.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-57 The Normal Approximation continued Recall for the binomial experiment: There are only two mutually exclusive outcomes (success or failure) on each trial. A binomial distribution results from counting the number of successes. Each trial is independent. The probability is fixed from trial to trial, and the number of trials n is also fixed.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-58 The Normal Approximation normal binomial

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-59 Continuity Correction Factor Because the normal distribution can take all real numbers (is continuous) but the binomial distribution can only take integer values (is discrete), a normal approximation to the binomial should identify the binomial event "8" with the normal interval "(7.5, 8.5)" (and similarly for other integer values). The figure below shows that for P(X > 7) we want the magenta region which starts at 7.5.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-60 Continuity Correction Factor Example: If n=20 and p=.25, what is the probability that X is greater than or equal to 8? The normal approximation without the continuity correction factor yields z=(8-20 ×.25)/(20 ×.25 ×.75) 0.5 = 1.55, P(X ≥ 8) is approximately.0606 (from the table). The continuity correction factor requires us to use 7.5 in order to include 8 since the inequality is weak and we want the region to the right. z = ( ×.25)/(20 ×.25 ×.75) 0.5 = 1.29, P(X ≥ 7.5) is The exact solution from binomial distribution function is The continuity correct factor is important for the accuracy of the normal approximation of binomial. The approximation is quite good.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-61 EXAMPLE 4 A recent study by a marketing research firm showed that 15% of American households owned a video camera. For a sample of 200 homes, how many of the homes would you expect to have video cameras? What is the variance? What is the standard deviation? What is the mean?

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-62 What is the probability that less than 40 homes in the sample have video cameras? “ Less than 40 ” means “ less or equal to 39 ”. We use the correction factor, so X is The value of z is EXAMPLE 5 continued

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-63 Example 4 continued From Standard Normal Table the area between 0 and 1.88 on the z scale is So the area to the left of 1.88 is = The likelihood that less than 40 of the 200 homes have a video camera is about 97%.

Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson END - Lesson 5: Continuous Probability Distributions