HIM 3200 Normal Distribution Biostatistics Dr. Burton
Progression of a histogram into a continuous distribution z
Area under the curve: 50% lies below the mean and 50% lies above it
Areas under the curve relating to z scores: 34.1% from -1 to 0; 34.1% from 0 to +1
Areas under the curve relating to z scores: 13.6% from -2 to -1; 13.6% from +1 to +2
Central limit theorem: In reasonably large samples (25 or more), the distribution of the means of many samples is approximately normal even though the data in individual samples may show skewness, kurtosis, or unevenness. Therefore, a t-test may be computed on almost any set of continuous data if the observations can be considered random and the sample size is reasonably large.
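The central limit theorem can be checked with a small simulation. The sketch below is illustrative only: the exponential population, the seed, and the number of repeated samples are assumptions chosen for the demonstration, not values from the lecture. It draws many samples of size 25 from a strongly right-skewed population and compares the skewness of the raw data with the skewness of the sample means.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is repeatable


def skewness(values):
    """Moment-based skewness: 0 for a symmetric distribution."""
    m = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - m) / sd) ** 3 for v in values) / len(values)


SAMPLE_SIZE = 25    # the "reasonably large" sample size quoted above
NUM_SAMPLES = 2000  # hypothetical number of repeated samples

# Exponential data with mean 1 are strongly right-skewed (theoretical skewness 2).
samples = [
    [random.expovariate(1.0) for _ in range(SAMPLE_SIZE)]
    for _ in range(NUM_SAMPLES)
]
raw_values = [v for sample in samples for v in sample]
sample_means = [statistics.fmean(sample) for sample in samples]

print("skewness of raw data:    ", round(skewness(raw_values), 2))
print("skewness of sample means:", round(skewness(sample_means), 2))
print("mean of sample means:    ", round(statistics.fmean(sample_means), 3))
print("SD of sample means:      ", round(statistics.stdev(sample_means), 3))
```

The sample means should cluster around the population mean of 1 with a spread near 1/sqrt(25) = 0.2, and their skewness should be much smaller than that of the raw data, although at n = 25 it is not yet zero.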
Areas under the curve relating to z scores: 13.6% from -2 to -1; 13.6% from +1 to +2; 95.4% from -2 to +2
Areas under the curve relating to z scores: 2.1% from -3 to -2; 2.1% from +2 to +3
Areas under the curve relating to z scores: 99.7% from -3 to +3
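The band areas on these slides can be reproduced from the standard normal cumulative distribution function. A minimal sketch using Python's standard-library `statistics.NormalDist`; the helper name `area_between` is introduced here for illustration only.

```python
from statistics import NormalDist

std_normal = NormalDist(mu=0.0, sigma=1.0)  # standard normal: mean 0, SD 1


def area_between(z_low, z_high):
    """Proportion of the area under the curve between two z scores."""
    return std_normal.cdf(z_high) - std_normal.cdf(z_low)


# Bands of one standard deviation, then the symmetric ranges around the mean.
for z_low, z_high in [(0, 1), (1, 2), (2, 3), (-1, 1), (-2, 2), (-3, 3)]:
    pct = 100 * area_between(z_low, z_high)
    print(f"{z_low:+d} to {z_high:+d}: {pct:.1f}%")
```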
Areas under the curve relating to +z scores (one-tailed tests): cutoff at z = +1; acceptance area = 84.1%; critical area = 15.9%
Areas under the curve relating to +z scores (one-tailed tests): cutoff at z = +2; acceptance area = 97.7%; critical area = 2.3%
Areas under the curve relating to +z scores (one-tailed tests): cutoff at z = +3; acceptance area = 99.8%; critical area = 0.2% from the rounded band areas (0.13% when computed directly)
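For a one-tailed test in the upper tail, the critical area is the area to the right of the cutoff z score. A short sketch, assuming cutoffs at z = +1, +2, and +3 (these cutoffs are inferred from the critical areas on the slides); `upper_tail_area` is a name made up for this example.

```python
from statistics import NormalDist


def upper_tail_area(z):
    """Area under the standard normal curve to the right of z."""
    return 1.0 - NormalDist().cdf(z)


for z in (1, 2, 3):
    critical = 100 * upper_tail_area(z)
    acceptance = 100 - critical
    print(f"z = +{z}: acceptance area {acceptance:.2f}%, critical area {critical:.2f}%")
```

Computed directly, the area beyond z = +3 is about 0.13%; adding up the one-decimal band areas instead leaves 0.2%.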
Asymmetric Distributions: positively skewed (tail to the right); negatively skewed (tail to the left)
Distributions (Kurtosis): flat curve = higher level of deviation from the mean; high curve = smaller deviation from the mean
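Skewness and kurtosis can also be expressed as numbers. The sketch below uses the simple moment-based definitions (one of several conventions in use) and small made-up data sets; the function name `shape` is hypothetical.

```python
import statistics


def shape(values):
    """Return (skewness, excess kurtosis) from standardized moments.

    Skewness > 0 means a longer right tail; < 0 means a longer left tail.
    Excess kurtosis is 0 for a normal curve, negative for flatter curves,
    and positive for more peaked, heavier-tailed curves.
    """
    m = statistics.fmean(values)
    sd = statistics.pstdev(values)
    n = len(values)
    skew = sum(((v - m) / sd) ** 3 for v in values) / n
    excess_kurt = sum(((v - m) / sd) ** 4 for v in values) / n - 3.0
    return skew, excess_kurt


symmetric = [1, 2, 3, 4, 5]      # made-up, evenly spread data
right_skewed = [1, 1, 1, 2, 10]  # made-up data with one large value

print("symmetric:   ", shape(symmetric))
print("right-skewed:", shape(right_skewed))
```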
Distributions (Bimodal Curve)
[Figure: Theoretical normal distribution with standard deviations marked as z scores from -3 to +3. Table: probability (% of area in the tail or tails) for the upper tail and for two tails.]
What is the z score for 0.05 probability? (one-tailed test) 1.645
What is the z score for 0.05 probability? (two-tailed test) 1.96
What is the z score for 0.01 probability? (one-tailed test) 2.326
What is the z score for 0.01 probability? (two-tailed test) 2.576
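These critical values come from the inverse of the standard normal cumulative distribution function. A minimal sketch using the standard library; the helper `critical_z` and its `tails` parameter are names introduced here for illustration.

```python
from statistics import NormalDist


def critical_z(alpha, tails=1):
    """z score that leaves a total probability of alpha in the tail(s).

    For a two-tailed test, alpha is split evenly between the two tails.
    """
    if tails not in (1, 2):
        raise ValueError("tails must be 1 or 2")
    return NormalDist().inv_cdf(1.0 - alpha / tails)


for alpha in (0.05, 0.01):
    for tails in (1, 2):
        print(f"alpha = {alpha}, {tails}-tailed: z = {critical_z(alpha, tails):.3f}")
```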
The Relationship Between Z and X: population mean $\mu = 100$, standard deviation $\sigma = 15$. For $X = 130$: $Z = (X - \mu)/\sigma = (130 - 100)/15 = 2$
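The conversion between raw scores and z scores is a single line of arithmetic in each direction. A small sketch using the slide's population mean of 100 and standard deviation of 15; the function names are illustrative.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / sd


def x_from_z(z, mean, sd):
    """Raw score that corresponds to a given z score."""
    return mean + z * sd


POP_MEAN = 100  # population mean from the slide
POP_SD = 15     # population standard deviation from the slide

print(z_score(130, POP_MEAN, POP_SD))  # X = 130 sits two SDs above the mean
print(x_from_z(-1, POP_MEAN, POP_SD))  # z = -1 corresponds to X = 85
```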
Standard deviation: $s = \sqrt{\dfrac{\sum (x - \bar{x})^2}{n - 1}}$. Student’s t distribution: $t = \dfrac{\bar{x} - \mu}{s / \sqrt{n}}$
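Both formulas can be written out directly. The sketch below is illustrative: the data values and the hypothesized mean of 4 are made up for the example, and `sample_sd` and `one_sample_t` are hypothetical helper names. The standard library's `statistics.stdev` uses the same n - 1 denominator, so it serves as a cross-check.

```python
import math
import statistics


def sample_sd(values):
    """Sample standard deviation with the n - 1 denominator."""
    n = len(values)
    mean = statistics.fmean(values)
    return math.sqrt(sum((x - mean) ** 2 for x in values) / (n - 1))


def one_sample_t(values, mu0):
    """t = (sample mean - hypothesized mean) / (s / sqrt(n))."""
    n = len(values)
    return (statistics.fmean(values) - mu0) / (sample_sd(values) / math.sqrt(n))


data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up observations

print("s =", round(sample_sd(data), 4))
print("matches statistics.stdev:", math.isclose(sample_sd(data), statistics.stdev(data)))
print("t =", round(one_sample_t(data, mu0=4), 4))
```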
Standard Error of the Mean: $SE = s/\sqrt{N}$. Sample: $N = 15$, $\bar{X}$ = , $s = 34.1$, $s_{\bar{x}} = 8.8$. $SE = 34.1/\sqrt{15} = 34.1/3.87 = 8.8$. = = 30.2
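The arithmetic on this slide can be verified in a few lines. A minimal sketch using the slide's values (s = 34.1, N = 15); the function name is illustrative.

```python
import math


def standard_error(s, n):
    """Standard error of the mean: sample SD divided by the square root of n."""
    return s / math.sqrt(n)


se = standard_error(34.1, 15)
print("sqrt(15) =", round(math.sqrt(15), 2))
print("SE =", round(se, 1))
```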
Confidence Intervals The sample mean is a point estimate of the population mean. With the additional information provided by the standard error of the mean, we can estimate the limits (interval) within which the true population mean probably lies. Source: Osborn
Confidence Intervals This is called the confidence interval, which gives a range of values that might reasonably contain the true population mean. The confidence interval is represented as $a \le \mu \le b$, with a certain degree of confidence, usually 95% or 99%. Source: Osborn
Confidence Intervals Before calculating the range of the interval, one must specify the desired probability that the interval will include the unknown population parameter - usually 95% or 99%. After determining the values for a and b, probability becomes confidence. The process has generated an interval that either does or does not contain the unknown population parameter; this is a confidence interval. Source: Osborn
Confidence Intervals To calculate the confidence interval (CI): $CI = \bar{X} \pm z\,(s/\sqrt{n})$ Source: Osborn
Confidence Intervals In the formula, $z$ is equal to 1.96 or 2.58 (from the standard normal distribution), depending on the level of confidence required: CI 95, $z$ = 1.96; CI 99, $z$ = 2.58. Source: Osborn
Confidence Intervals Given a mean of and a standard error of 8.8, the CI 95 is calculated: $CI_{95} = \bar{X} \pm 1.96\,(s/\sqrt{n}) = \bar{X} \pm 1.96\,(8.8)$ = 97.7, Source: Osborn
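The interval calculation can be sketched as a small function. The standard error of 8.8 and the multipliers 1.96 and 2.58 come from the slides; the sample mean of 100.0 below is a hypothetical placeholder, because the mean used in the worked example did not survive in this text.

```python
def confidence_interval(mean, se, z=1.96):
    """Return (lower, upper) limits: mean minus and plus z times the standard error."""
    margin = z * se
    return mean - margin, mean + margin


HYPOTHETICAL_MEAN = 100.0  # placeholder only, not the value from the slide
SE = 8.8                   # standard error of the mean from the slides

low95, high95 = confidence_interval(HYPOTHETICAL_MEAN, SE, z=1.96)
low99, high99 = confidence_interval(HYPOTHETICAL_MEAN, SE, z=2.58)

print(f"CI 95: {low95:.1f} to {high95:.1f}")
print(f"CI 99: {low99:.1f} to {high99:.1f}")
```

The 99% interval is wider than the 95% interval because a larger multiplier is applied to the same standard error.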