The Gaussian (Normal) Distribution: More Details & Some Applications.

Slides:



Advertisements
Similar presentations
T HE ‘N ORMAL ’ D ISTRIBUTION. O BJECTIVES Review the Normal Distribution Properties of the Standard Normal Distribution Review the Central Limit Theorem.
Advertisements

©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin The Normal Probability Distribution Chapter 7.
Normal Distribution Sampling and Probability. Properties of a Normal Distribution Mean = median = mode There are the same number of scores below and.
Chapter 7 Introduction to Sampling Distributions
Ka-fu Wong © 2003 Chap 7- 1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
The Normal Probability Distribution
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 2000 LIND MASON MARCHAL 1-1 Chapter Six The Normal Probability Distribution GOALS When you have completed.
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson5-1 Lesson 5: Continuous Probability Distributions.
Discrete and Continuous Random Variables Continuous random variable: A variable whose values are not restricted – The Normal Distribution Discrete.
12.3 – Measures of Dispersion
The Normal Distribution
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Today: Central Tendency & Dispersion
BPT 2423 – STATISTICAL PROCESS CONTROL.  Frequency Distribution  Normal Distribution / Probability  Areas Under The Normal Curve  Application of Normal.
© Copyright McGraw-Hill CHAPTER 6 The Normal Distribution.
Normal Curves and Sampling Distributions
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Ch 7 Continuous Probability Distributions
The normal distribution
8.5 Normal Distributions We have seen that the histogram for a binomial distribution with n = 20 trials and p = 0.50 was shaped like a bell if we join.
Business Statistics: Communicating with Numbers
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
Theory of Probability Statistics for Business and Economics.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Learning Objectives for Section 11.5 Normal Distributions The student will be able to identify what is meant.
Chapter 11 Data Descriptions and Probability Distributions Section 5 Normal Distribution.
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
Continuous Random Variables Continuous Random Variables Chapter 6.
Normal Distribution Section 2.2. Objectives  Introduce the Normal Distribution  Properties of the Standard Normal Distribution  Use Normal Distribution.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
Some probability distribution The Normal Distribution
Modular 11 Ch 7.1 to 7.2 Part I. Ch 7.1 Uniform and Normal Distribution Recall: Discrete random variable probability distribution For a continued random.
7- 1 Chapter Seven McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Ch. 22 in the book by Giancoli
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling Distributions.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 6 Probability Distributions Section 6.2 Probabilities for Bell-Shaped Distributions.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
© 2010 Pearson Prentice Hall. All rights reserved. CHAPTER 12 Statistics.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
MATB344 Applied Statistics Chapter 6 The Normal Probability Distribution.
Central Tendency & Dispersion
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved THE Normal PROBABILITY DISTRIBUTION.
7- 1 Chapter Seven McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Introduction to Probability and Statistics Thirteenth Edition Chapter 6 The Normal Probability Distribution.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
Continuous Probability Distribution By: Dr. Wan Azlinda Binti Wan Mohamed.
The Normal Distribution (Gaussian Distribution) Honors Analysis Learning Target: I can analyze data using the normal distribution.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Continuous Probability Distributions Chapter 7.
Copyright © 2009 Pearson Education, Inc. Chapter 24: Gauss’s Law.
THE NORMAL DISTRIBUTION
Normal Distribution 1. Objectives  Learning Objective - To understand the topic on Normal Distribution and its importance in different disciplines. 
Theoretical distributions: the Normal distribution.
Chapter 6 Continuous Random Variables Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
13-5 The Normal Distribution
Chapter 6 Created by Bethany Stubbe and Stephan Kogitz.
Normal Distribution.
The Normal Distribution
Analysis of Economic Data
Chapter 12 Statistics 2012 Pearson Education, Inc.
Gaussian (Normal) Distribution
Elementary Statistics: Picturing The World
Gaussian (Normal) Distribution
Chapter 5 Normal Probability Distribution
Chapter 12 Statistics.
Presentation transcript:

The Gaussian (Normal) Distribution: More Details & Some Applications

The Gaussian (Normal) Distribution The Gaussian Distribution is one of the most used distributions in all of science. It is also called the “bell curve” or the Normal Distribution. If this is the “Normal Distribution”, logically, shouldn’t there also be an “Abnormal Distribution”?

Johann Carl Friedrich Gauss (1736–1806, Germany) Mathematician, Astronomer & Physicist. Sometimes called the “Prince of Mathematics" (?) A child prodigy in math. (Do you have trouble believing some of the following? I do!) Age 3: He informed his father of a mistake in a payroll calculation & gave the correct answer!! Age 7: His teacher gave the problem of summing all integers to his class to keep them busy. Gauss quickly wrote the correct answer 5050 on his slate!! Whether or not you believe all of this, it is 100% true that he Made a HUGE number of contributions to Mathematics, Physics, & Astronomy!!

Johann Carl Friedrich Gauss A Genius! He made a HUGE number of contributions to Mathematics, Physics, & Astronomy 1. Proved The Fundamental Theorem of Algebra, that every polynomial has a root of the form a+bi. 2. Proved The fundamental Theorem of Arithmetic, that every natural number can be represented as a product of primes in only one way. 3. Proved that every number is the sum of at most 3 triangular numbers. 4. Developed the method of least squares fitting & many other methods in statistics & probability. 5. Proved many theorems of integral calculus, including the divergence theorem (when applied to the E field, it is what is called Gauss’s Law). 6. Proved many theorems of number theory. 7. Made many contributions to the orbital mechanics of the solar system. 8. Made many contributions to Non-Euclidean geometry 9. One of the first to rigorously study the Earth’s magnetic field

x f ( x ral itrbuion:  =0,  = 1 Characteristics of a Normal or Gaussian Distribution a It is Symmetric It’s Mean, Median, & Mode are Equal

A 2-Dimensional Gaussian

Gaussian or Normal Distribution It is a symmetrical, bell-shaped curve. It has a point of inflection at a position 1 standard deviation from mean. Formula: f (X ) x 

The Normal Distribution Note the constants:  = e = This is a bell shaped curve with different centers and spreads depending on  and 

There are only 2 variables that determine the curve, the mean  & the variance . The rest are constants. For “z scores” (  = 0,  = 1), the equation becomes: The negative exponent means that big |z| values give small function values in the tails.

Normal Distribution It’s a probability function, so no matter what the values of  and , it must integrate to 1!

The Normal Distribution is Defined by its Mean & Standard Deviation.  =  2 = Standard Deviation =  l

Normal Distribution Can take on an infinite number of possible values. The probability of any one of those values occurring is essentially zero. Curve has area or probability = 1

A normal distribution with a mean of 0 and a standard deviation of 1 is called the standard normal distribution. Z Value: The distance between a selected value, designated X, and the population mean, divided by the population standard deviation, 7-6

Example 1 The monthly incomes of recent MBA graduates in a large corporation are normally distributed with a mean of $2000 and a standard deviation of $200. What is the Z value for an income of $2200? An income of $1700? For X = $2200, Z= ( )/200 = 1. For X = $1700, Z = ( )/200 = -1.5 A Z value of 1 indicates that the value of $2200 is 1 standard deviation above the mean of $2000, while a Z value of $1700 is 1.5 standard deviation below the mean of $

Probabilities Depicted by Areas Under the Curve Total area under the curve is 1 The area in red is equal to p(z > 1) The area in blue is equal to p(-1< z <0) Since the properties of the normal distribution are known, areas can be looked up on tables or calculated on a computer.

Probability of an Interval

Cumulative Probability

A table will give this probability Given any positive value for z, the corresponding probability can be looked up in standard tables. Given positive z The probability found using a table is the probability of having a standard normal variable between 0 & the given positive z.

Areas Under the Standard Normal Curve

Areas and Probabilities The Table shows cumulative normal probabilities. Some selected entries: zF(z)z z About 54 % of scores fall below z of.1. About 46 % of scores fall below a z of -.1 (1-.54 =.46). About 14% of scores fall between z of 1 and 2 ( ).

Areas Under the Normal Curve About 68 percent of the area under the normal curve is within one standard deviation of the mean. About 95 percent is within two standard deviations of the mean percent is within three standard deviations of the mean. 7-9

x f ( x ral itrbuion:  =0,  = 1 Areas Under the Normal Curve Between: % % % Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc.,

Key Areas Under the Curve For normal distributions + 1  ~ 68% + 2  ~ 95% + 3  ~ 99.9%

“ Rule” 68% of the data 95% of the data 99.7% of the data

Rule For a Normally distributed variable: 1.> 68.26% of all possible observations lie within one standard deviation on either side of the mean (between  and  2.  > 95.44% of all possible observations lie within two standard deviations on either side of the mean (between  and  3.  > 99.74% of all possible observations lie within two standard deviations on either side of the mean (between  and 

Using the unit normal (z), we can find areas and probabilities for any normal distribution. Suppose X = 120,  =100,  =10. Then z = ( )/10 = 2. About 98 % of cases fall below a score of 120 if the distribution is normal. In the normal, most (95%) are within 2  of the mean. Nearly everybody (99%) is within 3  of the mean.

Rule

Rule in Math terms…

Example 2 The daily water usage per person in New Providence, New Jersey is normally distributed with a mean of 20 gallons and a standard deviation of 5 gallons. About 68% of the daily water usage per person in New Providence lies between what two values? That is, about 68% of the daily water usage will lie between 15 and 25 gallons. 7-11

Normal Approximation to the Binomial Using the normal distribution (a continuous distribution) as a substitute for a binomial distribution (a discrete distribution) for large values of n seems reasonable because as n increases, a binomial distribution gets closer and closer to a normal distribution. The normal probability distribution is generally deemed a good approximation to the binomial probability distribution when n and n(1- ) are both greater than

Binomial Distribution for n = 3 & n =

Flip coin N times Each outcome has an associated random variable X i (= 1, if heads, otherwise 0) Number of heads: N H is a random variable N H = x 1 + x 2 + …. + x N Central Limit Theorem

Coin flip problem. Probability function of N H –P(Head) = 0.5 (fair coin) N = 5N = 10N = 40

Central Limit Theorem The distribution of the sum of N random variables becomes increasingly Gaussian as N grows. Example: N uniform [0,1] random variables.

% % Probability / % Normal Distribution

Why are normal distributions so important? Many dependent variables are commonly assumed to be normally distributed in the population If a variable is approximately normally distributed we can make inferences about values of that variable Example: Sampling distribution of the mean So what? Remember the Binomial distribution –With a few trials we were able to calculate possible outcomes and the probabilities of those outcomes Now try it for a continuous distribution with an infinite number of possible outcomes. Yikes! The normal distribution and its properties are well known, and if our variable of interest is normally distributed, we can apply what we know about the normal distribution to our situation, and find the probabilities associated with particular outcomes.

Since we know the shape of the normal curve, we can calculate the area under the curve The percentage of that area can be used to determine the probability that a given value could be pulled from a given distribution. The area under the curve tells us about the probability- in other words we can obtain a p-value for our result (data) by treating it as a normally distributed data set.