Biostatistics: Methods and Applications

Biostatistics: Methods and Applications
Prof. Weidong Tian Prof. Hong Zhang Prof. Ruoyu Luo Tel: Office: 2320 East Guanghua Building

Review of Last Week’s talk
Hypothesis testing Type I vs. Type II error One sample test

Your decision Reality Defendant is innocent Defendant is guilty
He is innocent (H0) 1-α β He is guilty (Ha) α 1-β type I error: rejecting H0 when H0 is correct. type II error: rejecting Ha when Ha is correct. Power of the test: accepting Ha when Ha is correct. α: significance level.

Minimizing type I error increases type II error

Type II error is also determined by the difference of two population mean

The biomarker is effective
ROC The biomarker is effective The biomarker is very effective

Critical region, critical value, significance level and P-value
Critical region: the null hypothesis is rejected when a calculated value of the test statistic lies within the region Critical value: the value which determines the boundary of the critical region Significance level(α): the size of a critical region, and the probability of the type I error P-value: the probability of obtaining a test statistic at least as extreme as the one that was actually observed, assuming that the null hypothesis is true. When the p-value is less than the significance level α, which is often 0.05 or 0.01, the null hypothesis is rejected, and the result is considered "statistically significant".

Two-tailed vs. one-tailed test

α/2: area of the critical region
critical value Z-test t-test Z=-3

We have talked about hypothesis test of one sample.
In practice, we often need to compare one group to another in order to make inference about the differences between two samples. How can we extend the one sample test to two sample test?

Outline Test of two paired samples (related, e.g., “before treatment” and “after treatment”) Test of two independent samples (independent, e.g., “women” and “men”) Sample size for two samples

Test of two paired samples
Problem: We are interested in testing whether the performance of students on analyzing statistical problems is improved after drinking a cup of coffee. Solution: suppose the distribution of the performance follows a normal distribution with N(μ,σ) we randomly sample a number of students, and give each of them a cup of tea; then, we evaluate their performance, and compute μ’. H0: μ= μ’, Ha: μ’ >= μ.

Questions: How many students should we sample given the confidence level (α) and the power of the test (1-β), suppose the variance are the same.

From our last homework suppose μ’-μ=0.5 We need to sample 1752 students

Instead of randomly sampling a number of students, and evaluating their performance after drinking a cup of coffee, we evaluate the performance difference of a number of students before and after drinking a cup of coffee H0: μ= 0, Ha: μ > 0. It is expected that the variation between students’ performance difference will be significantly lower than that of the performance among students.

before Now under the same condition before Now We need to sample 22 students

Using paired samples can significantly reduce the variance between random samples Using paired samples can significantly reduce the number of samples to be tested Using paired samples can reduce the confounding variables Using paired samples can improve the power of the test Test of two paired samples can be simplified to the test of one sample

The central limit of theorem applies to the two paired samples simulation before after

The central limit of theorem applies to the two paired samples n=100 n=30

Test of two paired samples - example
17 girls being treated for anorexia were weighed before and after treatment. Difference scores were calculated for each participant. Change in weight

95% confidence interval

Standard error

Effect size: When a difference is statistically significant, it does not necessarily mean that it is big, important, or helpful in decision-making. It simply means you can be confident that there is a difference. Sometimes when sample size is large, you can detect the significance of the difference, though the difference may be small. Effect size helps you to determine the meaningfulness of the difference, and can be compared across different studies.

We have talked about the test of paired samples.
In practice, we are often faced with comparing two samples that are not related with each other. Test of two independent samples is the most widely used statistical test of all time. It is simple, straightforward, easy to use, and adaptable to a broad range of situations.

Problems: We are interested in determining whether a newly found plant hormone can enhance plant growth. Strategy: Randomly select 20 Arabidoposis seeds, and divide them into two groups each with 10 seeds. Grow them under conditions that are identical in every respect except one: The seeds from Group A are grown in normal condition (Control), while those from Group B are grown in normal + hormone condition (Case). Measure the height of the plants in two groups at different time. Compare the mean height between two groups. Make conclusion.

Problems: We are interested in determining whether two strains of mice, A and B, differ with respect to their ability to learn to avoid an aversive stimulus Strategy: Randomly sample Na mice from strain A and Nb mice from strain B. Following a standard aversive- conditioning procedure, measuring for each one how well and quickly the avoidance behavior is acquired. Compare the difference of mean response behavior between group A and B. Make conclusion.

t-test for two independent samples
Assumptions: Both populations are normally distributed Samples are randomly and independently drawn If both populations are not normal, need large sample sizes

Goal: Evaluate the mean difference between two populations (or between two treatment conditions). For paired samples, we can compute the difference scores. For independent samples, we cannot compute the difference scores.

paired t-test t-test of independent samples =0 how to estimate?

Let’s first look at the variance of For two independent samples:

are the two population variances

The easiest case: If However, usually σ is unknown How to estimate σ? Sample variance can be used to estimate population variance How can we best uses these two estimates?

Solution: use the average of the two sample variance as the estimate. However, because the two samples may not have the same size, we need to take into account the sample size, and give more weight to the one with larger size.

Pooled average: the weighted average of the two sample variances degree of freedom Bessel’s correction Recall unbiased estimation

Steps for calculating a test statistic calculate calculate pooled variance calculate standard error calculate T and d.f. calculate confidence interval, p-value, etc.

Example 1: Bumpus’s Data on Natural Selection
Two groups of sparrows are researched by the scientists with: One group survived in the storm; The other died in the storm. Questions of interest: Do the lengths of arm bone tend to be different for survivors than for those that died? If so, how large is the difference?

Perished Survived Average 0.7380 SD 0.0235 0.0198 n 24 35 Assumption : two groups follow the normal distribution, with equal variance.

Steps for calculating a test statistic calculate calculate pooled variance calculate standard error

Calculate T and d.f.

Reject the hypothesis that there is any difference

confidence interval p-value

Example 2: Blood pressure of children
To investigate the question of whether the children of city A and city B have the same systolic blood pressure, a random sample of n = 10 children was selected from each city and their blood pressures measured.

difference of sample means, df, and t critical value pooled variance

confidence interval p-value

Example 3: Independent Random Samples from Two Populations of Serum Uric Acid Values

When variances are unknown, test of variance have to be performed before conducting t-test of two independent samples

Assumption: both population follow normal distribution
Recall F-distribution about two variance Assumption: both population follow normal distribution

F-test for two variances
hypothesis Test statistic

Example Purpose: To investigate the variations in tree diameters from the north and south tract in Thomas County, Georgia are similar to each other or not? Solution: Take random samples of 25 trees from each half, and perform the F-test North 27.8, 14.5, 39.1, 3.2, 58.8, 55.5, 25, 5.4, 19, 30.6, 15.1, 3.6, 38.4, 15, 2.2, 14.2, 44.2, 25.7, 11.2, 46.8, 36.9, 54.1, 10.2, 2.5, 13.8 South 44.4, 26.1, 50.4, 23.3, 39.5, 51, 48.1, 47.2, 40.3, 37.4, 36.8, 21.7, 35.7, 32, 40.4, 12.8, 5.6, 44.3, 52.9, 38, 2.6, 44.6, 45.5, 29.1, 18.7

df sample variance F statistic F critical values

Accept the null hypothesis that the two variance are the same

Perform t-test to examine the difference between mean Reject the null hypothesis that the two means are the same

What if F-test finds that the two variance are different?
How can we test the difference between means?

Recall what we have learnt in the previous slides
t-test of independent samples these two variances are unknown, and unequal Estimate from sample variance For t-distribution, we need to know the degree of freedom

For two independent samples with equal variance
The degrees of freedom are not known exactly but can be estimated using the Welch-Satterthwaite approximation For two independent samples with unequal variance

Case study: the effects of lead exposure on the psychological and neurological well-being of children Experiment design: A group of children who lived near a lead smelter in El Paso, Texas, were identified and their blood levels of lead were measured. Children are identified into two groups (exposed: blood-lead levels >= 40 ug/mL, and control). Two important outcome variables were studied the number of finger-wrist taps in the dominant hand (for neurological function). the Wechsler full-scale IQ score.

Ld72: blood lead level measured in 1972
Iqf: IQ score maxfwt: finger-wrist taps Exposed: Ld73 >=40 OR LD72 >= 40 Control: Ld73 <40 AND LD72 < 40

1. read data (lead level, fwt scores, iq scores)
2. identify exposed and control group matrix exposed and control will save the index of children that satisfy the exposed and control lead level, respectively

3. collect fwt and iq scores from exposed and control group
fwt_exposed: matrix that saves the fwt scores of children from the lead exposed group 4. determine sample statistic (mean, variance) 5. determine sample size

6. test of variance Critical value of F statistic df

7. calculate pooled variance

7. calculate t-statistic
8. calculate t-critical value

9. calculate p-value

10. make conclusions (1): lead exposed and control group have difference in fwt scores which reflects neurological functions (2): lead exposed and control group have no significant difference in iq scores

effect size one sample paired sample independent sample

Flowchart of two sample t-test

Treatment of outliers extreme studentized deviate (ESD)
An outlier is determined by ESD, sample size n, and the percentile p

No outliers for fwt data
Treatment of outliers n_exposed = 32 n_control = 60 ESD(32, 0.95) =(2.91, 2.98) ESD(60, 0.95) =(3.20) No outliers for fwt data

Treatment of outliers No outliers for iq data
ESD(32, 0.95) =(2.91, 2.98) ESD(60, 0.95) =(3.20) No outliers for iq data

Estimate sample size Suppose that the true distributions of both finger-wrist tapping scores and full-scale IQ scores are normal. How many children should be identified to have an 80% chance of detecting a significant difference using a two-sided test with α=0.05?(We anticipate that the size of control group should be 2 times the size of exposed size. )

We don’t know the population means and variances
Use sample means and variances as the estimates

Summary t-test Sample data Hypothesized population parameter
Sample variance Estimated standard error t-statistic one sample paired samples independent samples

Biostatistics: Methods and Applications

Similar presentations

Presentation on theme: "Biostatistics: Methods and Applications"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Biostatistics: Methods and Applications

Similar presentations

Presentation on theme: "Biostatistics: Methods and Applications"— Presentation transcript:

Similar presentations

About project

Feedback