Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,

Slides:



Advertisements
Similar presentations
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
Advertisements

Sampling Distributions (§ )
Chapter 6 Continuous Random Variables and Probability Distributions
Chapter 4 Probability Distributions
Definitions Uniform Distribution is a probability distribution in which the continuous random variable values are spread evenly over the range of possibilities;
Estimates and sample sizes Chapter 6 Prof. Felix Apfaltrer Office:N763 Phone: Office hours: Tue, Thu 10am-11:30.
Sampling Distributions
CHAPTER 6 Statistical Analysis of Experimental Data
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Estimating a Population Proportion
7-2 Estimating a Population Proportion
Lecture Slides Elementary Statistics Twelfth Edition
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Fall 2013Biostat 5110 (Biostatistics 511) Week 7 Discussion Section Lisa Brown Medical Biometry I.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Chapter 6 The Normal Probability Distribution
Chapter 10 Introduction to Statistics Frequency Distributions; Measures Of Central Tendency Population, variable, sample Example 1 (p. 572 – 573)
Chapter 6: Probability Distributions
Probability & the Normal Distribution
Chapter 7: Sampling Distributions
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Dan Piett STAT West Virginia University
Chapter 6: Probability Distributions
5.5 Distributions for Counts  Binomial Distributions for Sample Counts  Finding Binomial Probabilities  Binomial Mean and Standard Deviation  Binomial.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chapter 6. Continuous Random Variables Reminder: Continuous random variable.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Biostat. 200 Review slides Week 1-3. Recap: Probability.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 6 Normal Probability Distributions 6-1 Review and Preview 6-2 The Standard Normal.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
Estimating a Population Proportion
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Statistical estimation, confidence intervals
Biostatistics, statistical software III. Population, statistical sample. Probability, probability variables. Important distributions. Properties of the.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
Thursday August 29, 2013 The Z Transformation. Today: Z-Scores First--Upper and lower real limits: Boundaries of intervals for scores that are represented.
Introduction to Statistics Chapter 6 Feb 11-16, 2010 Classes #8-9
The Normal Distribution
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter The Normal Probability Distribution 7.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chapter 6 Continuous Random Variables.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
MM207 Statistics Welcome to the Unit 6 Seminar Wednesday, March 7, to 9 PM ET.
SESSION 37 & 38 Last Update 5 th May 2011 Continuous Probability Distributions.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 4-6 Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor © 2013.
Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions 5-4 Mean, Variance and Standard Deviation.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions.
THE NORMAL DISTRIBUTION
The Role of Probability Chapter 5. Objectives Understand probability as it pertains to statistical inference Understand the concepts “at random” and “equally.
Theoretical distributions: the Normal distribution.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Chapter Five The Binomial Probability Distribution and Related Topics
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Normal Probability Distributions
Chapter 6. Continuous Random Variables
The Normal Distribution
Descriptive and inferential statistics. Confidence interval
Lecture Slides Elementary Statistics Twelfth Edition
Presentation transcript:

Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida, College of Nursing Professor, College of Public Health Department of Epidemiology and Biostatistics Associate Member, Byrd Alzheimer’s Institute Morsani College of Medicine Tampa, FL, USA 1

SECTION 2.1 Module Overview and Introduction Probability theory and discrete and continuous sampling distributions

SECTION 2.4 Bayes Theorem

Procedure for updating a probability based on new information. Rule can be used to compute a conditional probability based on specific, available information (i.e. links degree of belief in a proposition before and after accounting for evidence). Can represent a subjective degree of belief that changes over time to account for new evidence. Often used in meta analyses and synthesis of evidence P(B|A) P(A) P(A|B) = P(B) Bayes Theorem

5 Question: What is the probability that Daphne’s next male child will be affected by the X-linked disorder? Aaron Barbara Bart Carlos Cathy Desmond Daphne EarlJoseph Naïve: 1/2 1/4 1/8 1/16 Bayesian Methods (example)

6 Question: What is the probability that Daphne’s next male child will be affected by the X-linked disorder? Aaron Barbara Bart Carlos Cathy Desmond Daphne EarlJoseph Cathy carriernon-carrier 1/2 1/2 1/3 2/3 Daphne carriernon-carrier 1/6 5/6 1/21 20/21 Thus, the probability that Daphne’s next male child will be affected is: 1/2 x 1/21 = 1/42. Prior: Posterior: Prior: Posterior:

SECTION 2.5 Binomial Distribution Model

Probability Model: Mathematical equation or formula used to generate probabilities based on certain assumptions about the process. Very important for statistical inference. Binomial Model: Two possible outcomes – often labeled as “success” or “failure”, or as “disease” or “no disease”. Allows computation of observing a specified number of responses (e.g. successes) when the process is repeated a specific number of times (e.g. among a set of patients). For the binomial model with a set number of trials: p = probability of success, and q = 1 – p

Binomial distribution model n = number of times process is repeated (e.g. # of patients) x = number of successes (outcomes) p = probability of outcome for any individual (i.e. independent) ! = factorial n! P(x outcomes) = p x (1-p) n-x x!(n-x)! Example: Assume that a medication is effective 80% (0.80) of the time (i.e. p=0.8) Assume that the medication will be given to 10 patients(i.e. n=10) What is the probability the medication will be effective in exactly 7 patients? (x=7) (i.e. if we had to guess, we would think it was most likely that the medication would be effective in 8 patients) 10! P(7 successes) = (1-0.80) !(10-7)!

10! P(7 successes) = (1-0.80) !(10-7)! 10! 10(9)(8)(7)(6)(5)(4)(3)(2) = !(10-7)! [7(6)(5)(4)(3)(2)(1)][(3)(2)(1)] 10(9)(8) = =120 3(2) P(7 successes) = (120)(0.80 7) )(1-0.80) 10-7 ) P(7 successes) = (120)(0.2097)(0.008) = n! P(x outcomes) = p x (1-p) n-x x!(n-x)! Binomial distribution model

Many times we do not want to know a probability of an outcome in an exact number of persons, but rather a certain number or more persons. For example, using our previous scenario, what is the probability that the medication will be effective at least 7 of the 10 patients? (i.e. P(> 7 successes)) To do this, we need to compute individual probabilities for all the combinations. P(7 successes)= P(8 successes)= P(9 successes)= P(10 successes)= P(>7 successes)= =

Binomial distribution model (practice) n! P(x outcomes) = p x (1-p) n-x x!(n-x)! Assume that a drug is effective 90% of the time Assume that the medication will be given to 12 patients What is the probability the drug will be effective in exactly 10 patients? Complete the formula below: P(10 successes) =

Binomial distribution model (practice) n! P(x outcomes) = p x (1-p) n-x x!(n-x)! Assume that a drug is effective 90% of the time Assume that the medication will be given to 12 patients What is the probability the drug will be effective in exactly 10 patients? Complete the formula below: 12! P(10 successes) = (1-0.90) !(12-10)!

Binomial distribution model (practice) Complete the formula below: 12! P(10 successes) = (1-0.90) !(12-10)! 12! 12(11)(10)(9)(8)(7)(6)(5)(4)(3)(2) = !(12-10)! [10(9)(8)7(6)(5)(4)(3)(2)(1)][(2)(1)] 12(11) = =66 (2) P(10 successes) = (66)( ) )(1-0.90) ) P(10 successes) = (66)(0.3487)(0.01) =

Binomial distribution model (calculating standard deviation) Our Example: Assume a medication is effective 80% (0.80) of the time (i.e. p=0.8) Assume the medication will be given to 10 patients (i.e. n=10) What is the probability the medication will be effective in exactly 7 patients? (i.e. x=7) The expected number of outcomes of a binomial population is: µ = np So, in our example, µ = (10 x 0.8) = 8 The standard deviation (σ) = sqrt[(n(p))(1-p)] So, in our example, σ = sqrt[(10 x 0.8) x (1-0.8)] σ = sqrt[(8 x 0.2)] σ = sqrt[1.6] σ = 1.265

Binomial distribution model (practice) (calculating standard deviation) Example: Assume that a medication is effective 90% of the time Assume that the medication will be given to 20 patients What is the expected number of outcomes: µ = np So, in this example, µ = ______________ Calculate the standard deviation (σ) = sqrt[(n(p))(1-p)] So, in this example, σ = _______________________

Binomial distribution model (practice) (calculating standard deviation) Example: Assume that a medication is effective 90% of the time Assume that the medication will be given to 20 patients What is the expected number of outcomes: µ = np So, in this example, µ = (20 x 0.9) = 18 Calculate the standard deviation (σ) = sqrt[(n(p))(1-p)] So, in this example, σ = sqrt[(20 x 0.9) x (1-0.9)] σ = sqrt[(18 x 0.1)] σ = sqrt[1.8] σ = 1.34

SECTION 2.6 Poisson Distribution Model

A discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time (i.e. X = 0,1,2,3, ….) Usually associated with rare events Approximates the binomial distribution when N is large (i.e. >100) and p is small (i.e. <0.01) Requirements for the Poisson Distribution a)Length of time period is fixed in advance; b)Events occur at a constant average rate; c)Events can be counted in whole numbers d)Number of events occurring in disjoint intervals are statistically independent.

Poisson Distribution Model Illustration: Assume deaths from typhoid fever in a given population are Poisson distributed with a mean of 2.3 deaths per year. What is the probability distribution of deaths in this population? Pr(X = k) k

Poisson Distribution Model Poisson Formula Suppose we conduct a Poisson study in which the average number of health events within a given time period is μ. Then, the Poisson probability is: P(x; μ) = (e -μ ) (μ x ) / x! where x is the actual number of events that result from the study, and e is approximately equal to

Poisson Formula Example: The average number of colds for toddlers in day care is 2 per year. Knowing this, what is the estimated probability that a new toddler to day care will have exactly 3 colds during the following year? P(x; μ) = (e -μ ) (μ x ) / x! μ = 2; since 2 colds per year, on average. x = 3; since we want to find the likelihood that 3 colds will occur in the next year. e = ; since e is a constant equal to approximately Plug these values into the Poisson formula: P(3; 2) = ( ) (2 3 ) / 3! P(3; 2) = ( ) (8) / 6 P(3; 2) = Thus, the probability of a toddler having 3 colds in the next year is

Poisson Formula (Practice) Assume that the average number of cases of tuberculosis within a nursing home is 4 per year. Knowing this, what is the estimated probability that exactly 4 new cases of TB will occur during the following 6-months? P(x; μ) = (e -μ ) (μ x ) / x! μ = ________ x = ________ e = ________ Plug these values into the Poisson formula:

Poisson Formula (Practice) Assume that the average number of cases of tuberculosis within a nursing home is 4 per year. Knowing this, what is the estimated probability that exactly 4 new cases of TB will occur during the following 6-months? P(x; μ) = (e -μ ) (μ x ) / x! μ = 2; since 4 cases of TB per year = 2 cases of TB per 6 months, on average. x = 4; since we want to find the likelihood that 4 cases will occur in next 6 months. e = ; since e is a constant equal to approximately Plug these values into the Poisson formula: P(4; 2) = ( ) (2 4 ) / 4! P(4; 2) = ( ) (16) / 24 P(4; 2) = Thus, the probability of exactly 4 new cases of TB in the next 6-months is 0.09.

SECTION 2.7 Properties of the Normal Distribution

Normal Distribution Appropriate for a continuous outcome if:  Mean = median = mode  Symmetric around the mean  P(x > µ) = p(x < µ) where x is continuous variable and µ is mean  ~68% of all values fall between the mean and one 1 SD (i.e. P(µ - σ < x < P(µ + σ) = 0.68)  ~95% of all values fall between the mean and 2 SD (i.e. P(µ - 2σ < x < P(µ + 2σ) = 0.95)  ~99% of all values fall between the mean and 3 SD (i.e. P(µ - 3σ < x < P(µ + 3σ) = 0.99)

Normal Distribution

~68% of values between mean and one 1 SD ~95% - ~68% = 27% / 2 = ~13.6% of all values between -1 to -2 SD and +1 to +2SD ~99% - ~95% = 4% / 2 = ~2.0% of all values between -2 to -3 SD and +2 to +3SD

Normal Distribution (Practice) Assume that BMI is normally distributed with µ = 29.4 and σ = Put in the appropriate values on the normal distribution curve for BMI values plus or minus 1, 2, and 3 SD from the mean 2. What is the median BMI value? _______ 3. Approximately what percentage of the population has a BMI > 34? _________ 4. Approximately what percentage of the population has a BMI between 15.6 and 20.2? __________ 5. What is the approximate minimum and maximum BMI in the population? Min: _________ Max: __________ 29.4 ? ? ? Body Mass Index (BMI) µ

Normal Distribution (Practice) Assume that BMI is normally distributed with µ = 29.4 and σ = Put in the appropriate values on the normal distribution curve for BMI values plus or minus 1, 2, and 3 SD from the mean 2. What is the median BMI value? _29.4__ 3. Approximately what percentage of the population has a BMI > 34? __16.0%___ (i.e. 13.6% + 2.2% + 0.2%) 4. Approximately what percentage of the population has a BMI between 15.6 and 20.2? ____2.2%______ 5. What is the approximate minimum and maximum BMI in the population? Min: ____~15_____ Max: ____~44____ 29.4 Body Mass Index (BMI) µ µ-3σµ-2σµ-1σ µ+1σ µ+2σ µ+3σ 34% 13.6% 34%13.6% 0.2% 2.2% 0.2%

Normal Distribution Question: What do we if we want to calculate a probability when the value of interest is not the mean or a multiple of the standard deviation? Answer: Compute a z-score and use a table of probabilities for a “standard” normal distribution with mean of 0 and standard deviation of 1. Standard Normal Distribution; µ=0, σ=1 µ

Normal Distribution x - µ Z = σ Standardized Score (z-score) Where: x is the value of interest µ is the mean σ is the standard deviation Example: Body Mass Index x is 35 µ is 29.4 σ is – 29.4 Z = =

29.4 Body Mass Index (BMI) % 13.6% 34%13.6% 0.2% 2.2% 0.2% Body Mass Index µ is 29.4 σ is 4.6 Body Mass Index (BMI) 35.0 If x = 34, then z = 1 i.e. 34 – = Thus, P(z > 1 = ( = 0.16) If x = 35, then z = 1.17 i.e. 35 – = Thus, P(z > 1.17 = (?????) Refer to Appendix Table 1 P(z > 1.17 = 0.121) P Area under the curve

Body Mass Index (BMI) 35.0 In Appendix Table 1: The probability value of (and 1 – = 0.121) is determined by first looking at the left column for z to 1 decimal place, and then across the top row for z to the second decimal place. P Can also get the exact probability for z = 1.17 Cumulative probability: P(Z < 1.17) = Thus, probability: P(Z > 1.17) = ( ) = Area under the curve

Body Mass Index (BMI) 37.0 P Area under the curve Standard Normal Distribution (Practice) A.What is the probability that a person in the population will have a BMI > 37.0? z = _______________P(Z > ???) = __________ B.What is the probability that a person in the population will have a BMI < 27.0? z = _______________P(Z < ???) = __________ Body Mass Index µ is 29.4 σ is P Area under the curve x - µ Z = σ After calculating z for questions A and B, refer to Appendix Table 1

Body Mass Index (BMI) 37.0 P Area under the curve Standard Normal Distribution (Practice) Body Mass Index µ is 29.4 σ is P Area under the curve x - µ Z = σ After calculating z for questions A and B, refer to Appendix Table 1 A.What is the probability that a person in the population will have a BMI > 37.0? z = (37 – 29.4) / 4.6 = 1.65 P(Z > 1.65) = B.What is the probability that a person in the population will have a BMI < 27.0? z = ( ) / 4.6 = P(Z < -0.52) =

Percentiles from Standard Normal Distribution Standard normal distribution can also be used to compute percentiles. x = µ + zσ Example: Calculate the 90 th percentile for BMI: Body Mass Index (BMI) 90 th percentile Body Mass Index µ is 29.4 σ is In Appendix Table 1: The z value for the 90 th percentile is found by looking in the body of the table for a value of 0.90, or the nearest value. In this case, it is which corresponds to a z value of 1.28 (the actual z value is 1.282) So, x = (1.282 x 4.6) = 35.3 µ

Percentiles from Standard Normal Distribution (Practice) x = µ + zσ Calculate the 33 rd percentile for BMI: z = _________so, x = _________________________ Body Mass Index (BMI) 33 rd percentile Body Mass Index µ is 29.4 σ is µ

Percentiles from Standard Normal Distribution (Practice) x = µ + zσ Calculate the 33 rd percentile for BMI: z = -0.44so, x = (-0.44 x 4.6) = 27.4 Body Mass Index (BMI) 33 rd percentile Body Mass Index µ is 29.4 σ is µ

Standard Normal Distribution x - µ Z = σ Z values for common percentiles: Percentile Z 1 st th th th th th th th th th th 2.326

SECTION 2.8 Sampling Distributions and Central Limit Theorem

Sampling Distributions  In estimating the mean of a continuous variable in a population, the mean of a representative sample is a good estimate of the unknown population mean, but it is only an estimate.  When making estimates about population parameters based on sample statistics, it is very important to quantify the precision of the parameter estimates (e.g. standard error of the mean). Illustration: Assume a population of 6 measurements of self-reported pain (on a 0 to 100 scale) after total hip replacement with scores as follows: The population mean (μ) is:∑X / N = 71.7 and the standard deviation is:sqrt[∑(X- μ) 2 / N = 25.9

Sampling Distributions The population mean (μ) is:∑X / N = 71.7 and the standard deviation is:sqrt[∑(X- μ) 2 / N = 25.9 Suppose we did not have population data and wanted to estimate the mean from a sample, taking a sample size of 4. There are 15 different possible samples with n=4 when sampling without replacement is used (i.e. each individual can only be sampled once in a given sample). The probability of selecting any one of the 15 possible samples is 1/15 – see next slide.

Sample Observations in SampleSample Mean (X) The table represents the sampling distribution of the sample means So, with the original population sample of N=6, the population mean is: µ = ∑X / N = 71.7 with SD of However, the mean of the sample means, denoted as µ X is 71.7 and a standard deviation of σ X = 8.5. Note that µ = µ X (71.7), yet σ (25.9) is much smaller than σ X = 8.5 This is because the range of the population data (25 to 100) is much larger than the range of the sample means (60 to 88.8). These properties are formally stated in the Central Limit Theorem.

Central Limit Theorem If we take simple random samples of size n from the population with replacement, then for large samples (n > 30), the sample distribution of the sample means is approximately normally distributed with: µ X = µ and σ X = σ / sqrt(n) Because the distribution of the sample means is approximately normal, the normal probability model can be used to make inferences about a population mean. The parameter σ X = σ / sqrt(n), as noted above, is the standard error (meaning the standard deviation of the sample means) For a dichotomous outcome, the theorem holds for samples that: Minimum[np, n(1-p)] > 5, where n is the sample size and p is probability of the outcome in the population.

Central Limit Theorem (Practice) CharacteristicNµσµXµX σXσX Age (in years) Systolic blood pressure Body mass index Resting heart rate Sample PopulationSample Means µ X = µ σ X = σ / sqrt(n)

Central Limit Theorem (Practice) CharacteristicNµσµXµX σXσX Age (in years) Systolic blood pressure Body mass index Resting heart rate Sample PopulationSample Means µ X = µ σ X = σ / sqrt(n) Note that µ X = µ because the sample population mean is an unbiased estimate of the true mean Whereas σ X = σ are different quantities: σ is an estimate of variability in the sample σ X is an estimate of precision of the mean estimate Thus, as n increases, σ X is smaller, but σ may be > or <

Central Limit Theorem -- Application Assume that in adults over age 50 that HDL cholesterol has: µ = 54 and σ = 17. Suppose a physician has 40 patients (older than 50) and wants to determine the probability that their mean HDL is >60 i.e. P(X>60) Intuitively, this should appear very unlikely….. X - µ 60 – 54 6 Z = Z = =----= 2.22 σ / sqrt(n)17 / sqrt(40)2.7 From appendix Table 1, P(z > 2.22) = (1 – ) = (i.e. very unlikely)

Central Limit Theorem – Application (Practice) Assume that in adults over age 50 that HDL cholesterol has: µ = 44 and σ = 16. Suppose a physician has 50 patients (older than 50) and wants to determine the probability that their mean HDL is <40 i.e. P(X<40) X - µ Z = Z = σ / sqrt(n) X = ____µ = ____σ = ____n = ____ From appendix Table 1, P(z < ????) =

Central Limit Theorem – Application (Practice) Assume that in adults over age 50 that HDL cholesterol has: µ = 44 and σ = 16. Suppose a physician has 50 patients (older than 50) and wants to determine the probability that their mean HDL is <40 i.e. P(X<40) X - µ 40 – Z = Z = =-----= σ / sqrt(n)16 / sqrt(50)2.26 X = 40µ = 44σ = 16n = 50 From appendix Table 1, P(z < -1.77) =

SECTION 2.9 SPSS – Calculation of Z-Scores

Example: Age Analyze Descriptive Statistics Descriptives Before you click OK, be sure that the box marked "Save Standardized Values as Variables" is checked. Run a Frequency distribution for the standardized variable

SPSS – Calculation of Z-Scores Example: Age GET FILE='G:\NGR \Datasets\baseline_random.sav'. DATASET NAME DataSet1 WINDOW=FRONT. DESCRIPTIVES VARIABLES=SCR_AGE /SAVE /STATISTICS=MEAN STDDEV MIN MAX. Descriptive Statistics NMinimum MaximumMeanStd. Deviation Age (years) Valid N (listwise)503

SPSS – Calculation of Z-Scores Example: Age FREQUENCIES VARIABLES=ZSCR_AGE /FORMAT=NOTABLE /STATISTICS=STDDEV MEAN SKEWNESS SESKEW /HISTOGRAM /ORDER=ANALYSIS. Statistics Zscore: Age (years) NValid503 Missing0 Mean0E-7 Std. Deviation Skewness.097 Std. Error of Skewness.109