Chapter 14 Nonparametric Statistics

Chapter 14 Nonparametric Statistics. Learn about nonparametric statistical methods.

Nonparametric Statistical Methods Nonparametric methods are used when the data are ranks for the subjects rather than quantitative measurements, or when it is inappropriate to assume normality.

Section 14.1 How Can We Compare Two Groups by Ranking?

Example: How to Get A Better Tan Experiment: A student wanted to compare ways of getting a tan without exposure to the sun. She decided to investigate which of two treatments would give a better tan: an “instant bronze sunless tanner” lotion or a tanning studio.

Example: How to Get A Better Tan Subjects: Five female students participated in the experiment. Three of the students were randomly selected to use the tanning lotion. The other two students used the tanning studio.

Example: How to Get A Better Tan Results: The girls’ tans were ranked from 1 to 5, with 1 representing the best tan. Possible Outcomes: Consider all possible rankings of the girls’ tans. A table of possibilities is displayed on the next page.

Example: How to Get A Better Tan

Example: How to Get A Better Tan For each possible outcome, a mean rank is calculated for the ‘lotion’ group and for the ‘studio’ group. The difference in the mean ranks is then calculated for each outcome.

Example: How to Get A Better Tan For this experiment, the samples were independent random samples – the responses for the girls using the tanning lotion were independent of the responses for the girls using the tanning studio.

Example: How to Get A Better Tan Suppose that the two treatments have identical effects. A girl’s tan would be the same regardless of which treatment she uses. Then, each of the ten possible outcomes is equally likely. So, each outcome has probability of 1/10.

Example: How to Get A Better Tan Using the ten possible outcomes, we can construct a sampling distribution for the difference between the sample mean ranks. The distribution is displayed on the next page.
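The construction described above can be sketched in a few lines of Python. This is a minimal illustration, not part of the original slides: it lists every way two of the five ranks can fall to the studio group, computes the lotion-minus-studio difference in mean ranks for each, and tallies the probabilities under the assumption that all ten outcomes are equally likely. The variable names are chosen here for illustration.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

ranks = range(1, 6)  # five tans ranked 1 (best) to 5

# Each outcome is fixed by which two ranks belong to the studio group;
# the remaining three ranks belong to the lotion group.
distribution = Counter()
for studio in combinations(ranks, 2):
    lotion = [r for r in ranks if r not in studio]
    diff = Fraction(sum(lotion), 3) - Fraction(sum(studio), 2)
    distribution[diff] += Fraction(1, 10)  # ten equally likely outcomes under H0

# Print the sampling distribution: difference in mean ranks, probability.
for diff in sorted(distribution):
    print(f"{float(diff):6.2f}  {float(distribution[diff]):.1f}")
```

Exact fractions are used so that equal differences such as 5/3 are grouped together rather than split by floating-point rounding.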

Example: How to Get A Better Tan

Example: How to Get A Better Tan Graph of the Sampling Distribution:

Example: How to Get A Better Tan The student who planned the experiment hypothesized that the tanning studio would give a better tan than the tanning lotion.

Example: How to Get A Better Tan She wanted to test the null hypothesis H0: The treatments are identical in tanning quality, against the alternative hypothesis Ha: Better tanning quality results with the tanning studio.

Example: How to Get A Better Tan This alternative hypothesis is one-sided. If Ha were true, we would expect the ranks to be smaller (better) for the tanning studio. Thus, if Ha were true, we would expect the differences between the sample mean rank for the tanning lotion and the sample mean rank for the tanning studio to be positive.

Wilcoxon Test The test comparing two groups based on the sampling distribution of the difference between the sample mean ranks is called the Wilcoxon test.

Wilcoxon Nonparametric Test for Comparing Two Groups Assumptions: Independent random samples from two groups.

Wilcoxon Nonparametric Test for Comparing Two Groups Hypotheses: H0: Identical population distributions for the two groups (this implies equal expected values for the sample mean ranks). Ha: Higher expected value for the sample mean rank for a specified group (one-sided).

Wilcoxon Nonparametric Test for Comparing Two Groups Test Statistic: Difference between sample mean ranks for the two groups (Equivalently, can use sum of ranks for one sample).

Wilcoxon Nonparametric Test for Comparing Two Groups P-value: One-tail or two-tail probability, depending on Ha, that the difference between the sample mean ranks is as extreme or more extreme than observed. Conclusion: Report the P-value and interpret it. If a decision is needed, reject H0 if the P-value is less than or equal to the significance level, such as 0.05.

Example: Does the UV Tanning Studio Give a Better Tan than the Tanning Lotion? For the actual experiment, the ranks were (2, 4, 5) for the girls using the tanning lotion and (1, 3) for the girls using the tanning studio.

Example: Does the UV Tanning Studio Give a Better Tan than the Tanning Lotion? The mean rank for the tanning lotion is (2+4+5)/3 ≈ 3.7. The mean rank for the tanning studio is (1+3)/2 = 2.

Example: Does the UV Tanning Studio Give a Better Tan than the Tanning Lotion? The test statistic is the difference between the sample mean ranks: 3.7 – 2 = 1.7.

Example: Does the UV Tanning Studio Give a Better Tan than the Tanning Lotion? The one-sided alternative hypothesis states that the tanning studio gives a better tan. This means that the expected mean rank would be larger for the tanning lotion than for the tanning studio, if Ha is true. And, the difference between the mean ranks would be positive.

Example: Does the UV Tanning Studio Give a Better Tan than the Tanning Lotion? The test statistic we obtained from the data was: Difference between the sample mean ranks = 1.7. P-value = P(difference between sample mean ranks at least as large as 1.7)

Example: Does the UV Tanning Studio Give a Better Tan than the Tanning Lotion? The P-value can be obtained from the graph of the sampling distribution (as seen on a previous page and displayed again here):

Example: Does the UV Tanning Studio Give a Better Tan than the Tanning Lotion? P-value = 0.20. This is not a very small P-value. The evidence does not strongly support the claim that the tanning studio gives a better tan.

The Wilcoxon Rank Sum The Wilcoxon test can, equivalently, use as the test statistic the sum of the ranks in just one of the samples. This statistic will have the same probabilities as the differences between the sample mean ranks. Some software reports the sum of ranks as the Wilcoxon rank sum statistic.
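The equivalence stated above can be seen with a short derivation. The notation is introduced here for illustration and does not appear in the slides: n_1 and n_2 are the two sample sizes, N = n_1 + n_2, W_1 and W_2 are the rank sums, and the bars denote sample mean ranks. Because the ranks 1 to N always add up to N(N+1)/2, the difference in mean ranks is a fixed increasing linear function of W_1.

```latex
\begin{align*}
W_1 + W_2 &= \frac{N(N+1)}{2}, \qquad N = n_1 + n_2,\\
\bar{R}_1 - \bar{R}_2 &= \frac{W_1}{n_1} - \frac{W_2}{n_2}
  = W_1\left(\frac{1}{n_1} + \frac{1}{n_2}\right) - \frac{N(N+1)}{2\,n_2}.
\end{align*}
```

For the tanning data, taking the lotion group as group 1 gives n_1 = 3, n_2 = 2, N = 5 and W_1 = 2 + 4 + 5 = 11, so the difference is 11(5/6) - 30/4 = 10/6 ≈ 1.7, which agrees with the value computed from the mean ranks. Since one statistic determines the other, both lead to the same P-value.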

Example: Is there a treatment difference between the UV Tanning Studio and the Tanning Lotion? Suppose the experiment was designed with a two-sided alternative hypothesis: H0: The treatments are identical in tanning quality. Ha: The treatments are different in tanning quality.

Example: Is there a treatment difference between the UV Tanning Studio and the Tanning Lotion?
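Both the one-sided P-value of 0.20 reported earlier and its two-sided counterpart can be checked by enumeration. The sketch below is an illustration rather than the slides' own procedure; the function names are hypothetical. For the two-sided case it counts outcomes whose difference in mean ranks is at least as large in absolute value as the observed one, which is one common convention.

```python
from fractions import Fraction
from itertools import combinations

def mean_rank_diff(first, second):
    """Mean rank of the first group minus mean rank of the second group."""
    return Fraction(sum(first), len(first)) - Fraction(sum(second), len(second))

def exact_p_values(lotion, studio):
    """Return (one_sided, two_sided) exact P-values for lotion minus studio."""
    all_ranks = sorted(lotion + studio)
    observed = mean_rank_diff(lotion, studio)
    diffs = []
    # Under H0 every choice of ranks for the studio group is equally likely.
    for studio_ranks in combinations(all_ranks, len(studio)):
        lotion_ranks = [r for r in all_ranks if r not in studio_ranks]
        diffs.append(mean_rank_diff(lotion_ranks, studio_ranks))
    one_sided = Fraction(sum(d >= observed for d in diffs), len(diffs))
    two_sided = Fraction(sum(abs(d) >= abs(observed) for d in diffs), len(diffs))
    return one_sided, two_sided

one_sided, two_sided = exact_p_values([2, 4, 5], [1, 3])
print(float(one_sided), float(two_sided))
```

With the observed ranks, two of the ten outcomes are at least as extreme in the direction of Ha and four are at least as extreme in either direction, so the two-sided P-value is twice the one-sided value here.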

Using the Wilcoxon Test with a Quantitative Response When the response variable is quantitative, the Wilcoxon test is applied by converting the observations to ranks. For the combined sample, the observations are ordered from smallest to largest. The test compares the mean ranks for the two samples.
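The conversion to ranks can be sketched as follows. The data are hypothetical, and the handling of ties (each tied value receives the average of the ranks it spans) is an assumption made here, since the slides do not say how ties are treated.

```python
def midranks(values):
    """Rank values from smallest (1) to largest, averaging ranks over ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        average = (start + end) / 2 + 1  # positions are 0-based, ranks are 1-based
        for position in range(start, end + 1):
            ranks[order[position]] = average
        start = end + 1
    return ranks

group_a = [12.1, 15.3, 9.8]          # hypothetical measurements
group_b = [15.3, 20.0, 11.4, 18.2]   # hypothetical measurements

combined_ranks = midranks(group_a + group_b)  # rank the combined sample
ranks_a = combined_ranks[:len(group_a)]
ranks_b = combined_ranks[len(group_a):]
mean_rank_a = sum(ranks_a) / len(ranks_a)
mean_rank_b = sum(ranks_b) / len(ranks_b)
print(ranks_a, ranks_b, mean_rank_a, mean_rank_b)
```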

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? Experiment: A sample of 64 college students were randomly assigned to a cell phone group or a control group, 32 to each. On a machine that simulated driving situations, participants were instructed to press a “brake button” when they detected a red light.

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? Experiment: The control group listened to the radio while they performed the simulated driving. The cell phone group carried out a conversation on a cell phone. Each subject’s response time to the red lights is recorded and averaged over all of his/her trials.

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? Boxplots of the data:

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? The box plots do not show any substantial skew, but there is an extreme outlier for the cell phone group. The t inferences that we have used previously assume normal population distributions. The Wilcoxon Test does not assume normality. This test can be used in place of the t test if the normality assumption is questioned.

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? To use the Wilcoxon test, we need to rank the data (response times) from 1 (smallest reaction time) to 64 (largest reaction time). The test statistic is then calculated from the ranks.

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? The next page shows the output for the hypothesis test: H0: The distribution of reaction times is identical for the two groups. Ha: The distribution of reaction times differs for the two groups.

Example: Do Drivers Using Cell Phones Have Slower Reaction Times?

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? The small P-value (0.019) provides strong evidence against the null hypothesis. The sample mean ranks suggest that reaction times tend to be slower for those using cell phones.

Example: Do Drivers Using Cell Phones Have Slower Reaction Times? Insight: The Wilcoxon test is not affected by outliers. No matter how far the largest observation falls from the next largest, it still gets the same rank.
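A small demonstration of this insight, using hypothetical reaction times in milliseconds (not the study's data): making the largest observation far more extreme changes the sample mean considerably but leaves every rank unchanged.

```python
def simple_ranks(values):
    """Rank distinct values from smallest (1) to largest."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        ranks[index] = rank
    return ranks

times = [520, 545, 610, 580, 960]           # hypothetical, with one large value
more_extreme = [520, 545, 610, 580, 5000]   # same data, outlier pushed further out

ranks_before = simple_ranks(times)
ranks_after = simple_ranks(more_extreme)
mean_before = sum(times) / len(times)
mean_after = sum(more_extreme) / len(more_extreme)

print(ranks_before, ranks_after)   # identical rank vectors
print(mean_before, mean_after)     # the means differ
```

Any statistic built only from the ranks, such as the difference in mean ranks, is therefore the same for both versions of the data.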

Nonparametric Estimation Comparing Groups When the response variable is quantitative, we can compare a measure of center for the two groups. One way to do this is by comparing means. This method requires the assumption of normal population distributions.

Nonparametric Estimation Comparing Groups When the response distribution is highly skewed, nonparametric methods are preferred. For highly skewed distributions, a better measure of the center is the median. We can then estimate the difference between the population medians for the two groups.

Nonparametric Estimation Comparing Groups Most software for the Wilcoxon test reports point and interval estimates comparing medians. Some software refers to the equivalent Mann-Whitney test.
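As an illustration of what such a point estimate can look like, the sketch below computes the Hodges-Lehmann estimate, the median of all pairwise differences between the two samples. That this is the estimate a given package reports is an assumption here, not something the slides state, and the data and function name are hypothetical.

```python
from statistics import median

def hodges_lehmann_shift(sample_a, sample_b):
    """Median of all pairwise differences a - b (a point estimate of the shift)."""
    return median(a - b for a in sample_a for b in sample_b)

sample_a = [5, 7, 9]   # hypothetical
sample_b = [1, 2, 3]   # hypothetical

estimate = hodges_lehmann_shift(sample_a, sample_b)
difference_of_medians = median(sample_a) - median(sample_b)
print(estimate, difference_of_medians)
```

The two numbers agree for this small example, but in general the median of the pairwise differences need not equal the difference of the sample medians.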

Nonparametric Estimation Comparing Groups The Wilcoxon test (and the equivalent Mann-Whitney test) does not require a normal population assumption. Estimating the difference between the population medians does require an extra assumption: the population distributions for the two groups are symmetric and have the same shape.

For a study on the effects of hypnosis, subjects were divided into a control group and treatment group and a measure of respiratory ventilations was taken on each subject. Controls: 3.99 4.19 4.21 4.54 4.64 4.69 4.84 5.48 Treated: 4.36 4.67 4.78 5.08 5.16 5.20 5.52 5.74 What is the mean rank for the Control Group? Answer choices: 8; 49; 6.125; 7.5.

For a study on the effects of hypnosis, subjects were divided into a control group and treatment group and a measure of respiratory ventilations was taken on each subject. Controls: 3.99 4.19 4.21 4.54 4.64 4.69 4.84 5.48 Treated: 4.36 4.67 4.78 5.08 5.16 5.20 5.52 5.74 What is the mean rank for the Treatment Group? Answer choices: 8; 10.875; 13.25; 16.

For a study on the effects of hypnosis, subjects were divided into a control group and treatment group and a measure of respiratory ventilations was taken on each subject. Controls: 3.99 4.19 4.21 4.54 4.64 4.69 4.84 5.48 Treated: 4.36 4.67 4.78 5.08 5.16 5.20 5.52 5.74 A test of the hypothesis that subjects in the treatment group tended to ventilate more resulted in a P-value of 0.025. Does this P-value support the claim that subjects in the treatment group ventilated more? Answer choices: yes; no.
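The quantities in these questions can be checked directly from the listed data. The sketch below ranks the combined sample, computes the two mean ranks, and then obtains an exact one-sided P-value by enumerating every way of assigning 8 of the 16 ranks to the control group. Using exact enumeration is a choice made here for illustration; the slides do not say how the quoted P-value was computed.

```python
from itertools import combinations

controls = [3.99, 4.19, 4.21, 4.54, 4.64, 4.69, 4.84, 5.48]
treated = [4.36, 4.67, 4.78, 5.08, 5.16, 5.20, 5.52, 5.74]

# Rank the combined sample from 1 (smallest) to 16; these data have no ties.
combined = sorted(controls + treated)
rank_of = {value: rank for rank, value in enumerate(combined, start=1)}

control_rank_sum = sum(rank_of[v] for v in controls)
treated_rank_sum = sum(rank_of[v] for v in treated)
mean_rank_control = control_rank_sum / len(controls)
mean_rank_treated = treated_rank_sum / len(treated)

# Under H0 each set of 8 ranks is equally likely to be the control group's.
# If the treated group ventilates more, the control rank sum tends to be small,
# so the one-sided P-value is the share of assignments with a sum this small.
all_sums = [sum(c) for c in combinations(range(1, len(combined) + 1), len(controls))]
p_one_sided = sum(s <= control_rank_sum for s in all_sums) / len(all_sums)

print(mean_rank_control, mean_rank_treated, round(p_one_sided, 3))
```

The enumeration covers 12,870 assignments, which is small enough to run instantly; for larger samples software typically switches to a normal approximation.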

Section 14.2 Nonparametric Methods for Several Groups and for Matched Pairs

Comparing Mean Ranks of Several Groups The Wilcoxon test for comparing mean ranks of two groups extends to a comparison of mean ranks for several groups. This test is called the Kruskal-Wallis test.

ANOVA test vs. Kruskal-Wallis test Both tests are used to compare many groups. The ANOVA F test assumes normal population distributions. The Kruskal-Wallis test does not make this assumption. The Kruskal-Wallis test is a “safer” method to use with small samples when not much information is available about the shape of the distributions.

ANOVA test vs. Kruskal-Wallis test The Kruskal-Wallis test is also useful when the data are merely ranks and we don’t have a quantitative measurement of the response variable.

Summary: Kruskal-Wallis Test Assumptions: Independent random samples from several (g) groups. Hypotheses: H0: Identical population distributions for the g groups Ha: Population distributions not all identical.

Summary: Kruskal-Wallis Test Test statistic: Uses between-groups variability of sample mean ranks. Software easily calculates this. P-value: Right-tail probability above observed test statistic value from chi-squared distribution. Conclusion: Report the P-value and interpret in context.
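The summary above does not give a formula, so the following is a sketch of the standard Kruskal-Wallis statistic, H = 12 / (N(N+1)) × Σ n_i (mean rank_i - (N+1)/2)², under the assumption that there are no tied observations (no tie correction is applied). The data and function name are hypothetical. The P-value line uses the fact that the chi-squared right-tail probability with 2 degrees of freedom is exp(-H/2), so it applies only to three groups; other group counts would need a statistics library.

```python
import math

def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic; assumes all observations are distinct."""
    combined = sorted(x for group in groups for x in group)
    rank_of = {value: rank for rank, value in enumerate(combined, start=1)}
    n_total = len(combined)
    overall_mean_rank = (n_total + 1) / 2
    # Between-groups variability of the sample mean ranks.
    between = sum(
        len(group) * (sum(rank_of[x] for x in group) / len(group) - overall_mean_rank) ** 2
        for group in groups
    )
    return 12 / (n_total * (n_total + 1)) * between

# Hypothetical data for three groups.
groups = [[2.1, 3.4, 2.9], [3.8, 4.0, 3.6], [4.4, 5.1, 4.7]]
h = kruskal_wallis_h(groups)
p_value = math.exp(-h / 2)  # chi-squared right tail, valid for df = 3 - 1 = 2 only
print(round(h, 2), round(p_value, 4))
```

In this made-up example the groups do not overlap at all, so the mean ranks (2, 5 and 8) are as spread out as possible and H is large.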

Example: Does Heavy Dating Affect College GPA? Experiment: A student in a statistics class (Tim) decided to study whether dating was associated with college GPA. He wondered whether students who date a lot tend to have poorer GPAs.

Example: Does Heavy Dating Affect College GPA? Experiment: He asked 17 students in the class to anonymously fill out a short questionnaire in which they were asked to give their college GPA and to indicate whether, during their college careers, they had dated regularly, occasionally, or rarely.

Example: Does Heavy Dating Affect College GPA? Dot plots of the GPA data for the 3 dating groups:

Example: Does Heavy Dating Affect College GPA? Since the dot plots showed evidence of severe skew to the left and since the sample size was small in each group, Tim felt safer analyzing the data with the Kruskal-Wallis test than with the ordinary ANOVA F test.

Example: Does Heavy Dating Affect College GPA? The hypotheses for the Kruskal-Wallis test: H0: Identical population distributions for the three dating groups Ha: Population distributions for the three dating groups are not all identical.

Example: Does Heavy Dating Affect College GPA? This table shows the data with the GPA values ordered from smallest to largest for each dating group.

Example: Does Heavy Dating Affect College GPA? MINITAB output for the Kruskal-Wallis test:

Example: Does Heavy Dating Affect College GPA? The test statistic reported in the output is H = 0.72. The corresponding P-value reported in the output is 0.696. This large P-value does not give any evidence against H0. It is plausible that GPA is independent of dating group.
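The reported P-value can be roughly reproduced from the reported statistic. With three dating groups the chi-squared reference distribution has 3 - 1 = 2 degrees of freedom, and for 2 degrees of freedom the right-tail probability above H is exp(-H/2). The sketch below is only a consistency check on the two quoted numbers; the small gap between them is presumably due to H being rounded to two decimals in the output, which is an assumption here.

```python
import math

h_reported = 0.72    # test statistic quoted from the MINITAB output
p_reported = 0.696   # P-value quoted from the MINITAB output

# Right-tail chi-squared probability for df = 2.
p_from_h = math.exp(-h_reported / 2)

# Value of H that would give exactly the reported P-value.
h_implied = -2 * math.log(p_reported)

print(round(p_from_h, 3), round(h_implied, 3))
```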