SADC Course in Statistics Importance of the normal distribution (Session 09)

Slides:



Advertisements
Similar presentations
SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)
Advertisements

SADC Course in Statistics Simple Linear Regression (Session 02)
The Poisson distribution
SADC Course in Statistics Further ideas concerning confidence intervals (Session 06)
SADC Course in Statistics Introduction to Non- Parametric Methods (Session 19)
SADC Course in Statistics Tests for Variances (Session 11)
Assumptions underlying regression analysis
SADC Course in Statistics Basic principles of hypothesis tests (Session 08)
SADC Course in Statistics Meaning and use of confidence intervals (Session 05)
SADC Course in Statistics The binomial distribution (Session 06)
SADC Course in Statistics Sampling weights: an appreciation (Sessions 19)
SADC Course in Statistics Revision of key regression ideas (Session 10)
SADC Course in Statistics Confidence intervals using CAST (Session 07)
SADC Course in Statistics Sample size determinations (Session 11)
SADC Course in Statistics Decomposing a time series (Session 03)
SADC Course in Statistics Comparing two proportions (Session 14)
SADC Course in Statistics Review and further practice (Session 10)
SADC Course in Statistics Revision using CAST (Session 04)
SADC Course in Statistics Introduction to Statistical Inference (Session 03)
SADC Course in Statistics (Session 09)
SADC Course in Statistics Modelling ideas in general – an appreciation (Session 20)
SADC Course in Statistics Revision on tests for means using CAST (Session 17)
Probability Distributions
Assumptions and Conditions
Estimation of Means and Proportions
Assumptions for Z Confidence Intervals
Central Limit Theorem. So far, we have been working on discrete and continuous random variables. But most of the time, we deal with ONE random variable.
Sta220 - Statistics Mr. Smith Room 310 Class #14.
Section 5.3 ~ The Central Limit Theorem Introduction to Probability and Statistics Ms. Young ~ room 113.
STA305 week 31 Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for.
Sampling Distributions
Copyright © 2010 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Sampling Distributions
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Standard Normal Distribution
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
REGRESSION MODEL ASSUMPTIONS. The Regression Model We have hypothesized that: y =  0 +  1 x +  | | + | | So far we focused on the regression part –
Chapter 7: Variation in repeated samples – Sampling distributions
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
SADC Course in Statistics Paddy results: a discussion (Session 17)
Business Statistics - QBM117 Statistical inference for regression.
The Lognormal Distribution
Sample Distribution Models for Means and Proportions
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
SADC Course in Statistics The normal distribution (Session 08)
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Chapter 6: Probability Distributions
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Lecture 15: Statistics and Their Distributions, Central Limit Theorem
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Sampling Distribution Models Chapter 18. Toss a penny 20 times and record the number of heads. Calculate the proportion of heads & mark it on the dot.
1 Topic 5 - Joint distributions and the CLT Joint distributions –Calculation of probabilities, mean and variance –Expectations of functions based on joint.
Confidence Interval Estimation For statistical inference in decision making:
Normal approximation of Binomial probabilities. Recall binomial experiment:  Identical trials  Two outcomes: success and failure  Probability for success.
SESSION 37 & 38 Last Update 5 th May 2011 Continuous Probability Distributions.
CHAPTER 5 CONTINUOUS PROBABILITY DISTRIBUTION Normal Distributions.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Sampling distributions BPS chapter 10 © 2006 W. H. Freeman and Company.
Introduction to Probability & Statistics The Central Limit Theorem
Warmup Normal Distributions.
Sampling Distributions
Presentation transcript:

SADC Course in Statistics Importance of the normal distribution (Session 09)

To put your footer here go to View > Header and Footer 2 Learning Objectives At the end of this session you will be able to: discuss reasons why the normal probability distribution is important state the Central Limit Theorem and its value in approximating Binomial and Poisson probabilities by normal probabilities explain how the assumption of normality for a given random variable can be checked

To put your footer here go to View > Header and Footer 3 Importance of Normal Distribution Many measurements can be closely approximated by the normal distribution since many variables show normal variation as a resultant of many minor influences up and down Data which are not normal, can often be transformed into a normal random variable The normal distribution underpins a lot of inference ideas. We have seen that probability statements about any normally distributed variable can be done via N(0,1)

To put your footer here go to View > Header and Footer 4 The Central Limit Theorem (CLT) One of the key reasons why the normal distribution is important is because of the Central Limit Theorem (CLT). This theorem states that the sample mean of any random variable has an approximate normal distribution, provided that the sample size is sufficiently large.

To put your footer here go to View > Header and Footer 5 Consequences of the Central Limit Theorem Many statistical techniques are based on the assumption that the mean of the distribution follows a normal distribution As a consequence of the Central Limit Theorem, the above assumption is not invalidated as long as the sample size is large enough, e.g. say > about 30. The CLT also implies that the binomial and Poisson probabilities approach the normal probabilities as n becomes large (see below).

To put your footer here go to View > Header and Footer 6 Normal approximation to the binomial distribution Recall that the form of the binomial distribution for p=0.5 closely resembles the normal distribution This is because the binomial probabilities are symmetric when p=0.5 However, even with p0.5, the normal approximation holds for large n because a binomial random variable is the mean of several Bernoulli random variables and then the CLT applies

To put your footer here go to View > Header and Footer 7 Normal approximation to the Poisson distribution Recall from previous session (slides 8-12) that as the Poisson parameter becomes large, the shape of the Poisson distribution becomes bell-shaped and symmetrical This is again a consequence of the CLT since is the mean of the Poisson distribution

To put your footer here go to View > Header and Footer 8 More formally… has a normal distribution with mean 0 and variance 1 (standard normal) when the sample size n is large. Note that = r/n, where r=number of successes in n trials, i.e. r is a binomial random variable. If is an average of a series of n Bernoulli random variables (0,1 variables), then

To put your footer here go to View > Header and Footer 9 The same result is true for the Poisson average, i.e. Z defined below can be approximated by the standard normal distribution for large values of. and further …

To put your footer here go to View > Header and Footer 10 Thus the normal distribution plays an important role in statistics. Most of the techniques covered in Modules H2 and H8 are based on assuming that the key response of interest follows a normal distribution. We therefore need to be able to check whether measurements on a given random variable follows a normal distribution. This is done by producing a normal probability plot. Checking for normality

To put your footer here go to View > Header and Footer 11 Statistics software packages generally have a facility for producing this plot. Below is the plot for maize cob weights. In this plot, the Y-axis corresponds to values you would expect from an actual normal distribution. The X-axis corresponds to your data. This implies that a straight line indicates the normality assumption is valid. What do you deduce from graph below? Normal Probability Plot

To put your footer here go to View > Header and Footer 12