Week 6 Dr. Jenne Meyer.  Article review  Rules of variance  Keep unaccounted variance small (you want to be able to explain why the variance occurs)

Slides:



Advertisements
Similar presentations
Hypothesis Testing Steps in Hypothesis Testing:
Advertisements

Inference about the Difference Between the
1 Chapter 10 Comparisons Involving Means  1 =  2 ? ANOVA Estimation of the Difference between the Means of Two Populations: Independent Samples Hypothesis.
Chapter 10 Comparisons Involving Means
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
QUANTITATIVE DATA ANALYSIS
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
Statistics for Managers Using Microsoft® Excel 5th Edition
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
The Kruskal-Wallis Test The Kruskal-Wallis test is a nonparametric test that can be used to determine whether three or more independent samples were.
Chi-Square Tests and the F-Distribution
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 15 The.
Inferential Statistics
Chapter 12: Analysis of Variance
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
F-Test ( ANOVA ) & Two-Way ANOVA
1 Chapter 11 – Test for the Equality of k Population Means nRejection Rule where the value of F  is based on an F distribution with k - 1 numerator d.f.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 30 日 第八週:變異數分析與實驗設計.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Introduction to Linear Regression and Correlation Analysis
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide Analysis of Variance Chapter 13 BA 303.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
QMS 6351 Statistics and Research Methods Regression Analysis: Testing for Significance Chapter 14 ( ) Chapter 15 (15.5) Prof. Vera Adamchik.
1 1 Slide Simple Linear Regression Coefficient of Determination Chapter 14 BA 303 – Spring 2011.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Chapter 16 The Chi-Square Statistic
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
ANALYSIS OF VARIANCE (ANOVA) BCT 2053 CHAPTER 5. CONTENT 5.1 Introduction to ANOVA 5.2 One-Way ANOVA 5.3 Two-Way ANOVA.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 11-1 Chapter 11 Chi-Square Tests and Nonparametric Tests Statistics for.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter Eight: Using Statistics to Answer Questions.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
© 2006 by Thomson Learning, a division of Thomson Asia Pte Ltd.. 1 Slide Slide Slides Prepared by Juei-Chao Chen Fu Jen Catholic University Slides Prepared.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
ANalysis Of VAriance can be used to test for the equality of three or more population means. H 0 :  1  =  2  =  3  = ... =  k H a : Not all population.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
1 Pertemuan 19 Analisis Varians Klasifikasi Satu Arah Matakuliah: I Statistika Tahun: 2008 Versi: Revisi.
Rancangan Acak Lengkap ( Analisis Varians Klasifikasi Satu Arah) Pertemuan 16 Matakuliah: I0184 – Teori Statistika II Tahun: 2009.
Chapter 11 – Test for the Equality of k
Chapter 13 f distribution and 0ne-way anova
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Statistics Analysis of Variance.
Chapter 12 Tests with Qualitative Data
Statistics for Business and Economics (13e)
Econ 3790: Business and Economic Statistics
Statistical Inference about Regression
Chapter 18: The Chi-Square Statistic
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Week 6 Dr. Jenne Meyer

 Article review

 Rules of variance  Keep unaccounted variance small (you want to be able to explain why the variance occurs)  Determining the components of variance enables the research to estimate what variance is accounted for.  ** variance enables the researcher to interpret statistically significant differences and relationships  Measures of variance  ANOVA (linear)  Regression  Analysis of covariance

 ANalysis Of VAriance  can be used to test for the equality of three or more population means using data obtained from observational or experimental studies

 In Analysis of Variance, we analyze the variance -- not the means  “Good” variance is variance between groups; we want this to be different  “Bad” variance is variance within groups because too much variance obscures any effects  In ANOVA, we will partition the variance to try to:  Show a big variance between groups  Show a small variance within groups

ANOVA test for equality of c population means: Hypothesis: H0: m1 = m2 = m3 =…... = mc HA: Not all population means are equal Test statistic F* = MSA / MSE = numerator / denominator Rejection Rule Reject H0 if F *> Fa where the value of Fa  is based on an F distribution with c - 1 numerator degrees of freedom and n - c denominator degrees of freedom.

As with other hypothesis tests, the rejection region for a level of significance equal to  is F  is the “critical value” FF F-distribution Do Not Reject H 0 Reject H 0 Critical Value MSA / MSE

 A factor is the “general name” for the sample data that is being studied.  A level refers to the number of sample variables that are associated with the factor. The levels of a factor are also know as “treatments”  The response variable is the quantitative sample data

FactorLevels or number of treatments Quantitative Response Variable Absorption material in diaper No. of materialsAmt. of fluid absorbed (oz) College majorNo. of majorsStarting salary dollar amounts ($) Soft drink versionNo. of versionsTaste test (numerical scores) Drug typeNo. of drugsTime it takes drug to work (minutes) Plant locationNo. of facilitiesNumber of employees

 For example, let’s say you were comparing the “mean time to failure” for computer disks from 3 different suppliers….the “One Factor” that is being considered here is the “suppliers”  In this case however, the “one factor” has “three levels”  The “response variable” is the quantitative sample data, which is the “mean time to failure” numbers

 A farmer grows tomatoes. He wants to test three different fertilizers (Gro-Big, Chemical X, and Tomato King) to determine their effect on producing tomatoes. The following table (next slide) shows that the farmer planted 12 plots of equal acreage with tomatoes. He used each of the three fertilizers on four different plots, then compared the number of bushels of tomatoes produced per acre in each plot.  (The fertilizers are the "treatments" since they are the source of the variation in the output of tomatoes. The ANOVA analysis determines whether there is a "significant" difference in tomato output by type of fertilizer.)

Gro-Big fertilizer Chemical X fertilizer Tomato King fertilizer 55 Sample Plot 1 66 Sample Plot 1 47 Sample Plot 1 54 Sample Plot 2 76 Sample Plot 2 51 Sample Plot 2 59 Sample Plot 3 67 Sample Plot 3 46 Sample Plot 3 56 Sample Plot 4 71 Sample Plot 4 48 Sample Plot 4 Tomato Production (bushels per acre)

F   4.26 Do Not Reject H 0 Reject H 0 MSA / MSE H 0 :  1 =  2 =  3 H A : At least one mean is different F = samples -1 = 2 12 observations – 3samples = 9

Excel Output with calculation components outlined

At  =.05, Since F > Fcrit, we Reject H 0 and conclude that at least one mean is different (at least one fertilizer yields different output); also, p-value <.05, Reject H 0

 Step 1:H0:  1 =  2 =  3 HA: At least one mean is different  Step 2:  = 0.05 with F2,9 “Reject H0 if F >4.256”  Step 3: F = 49.6  Step 4: H0 is rejected since test F is greater than critical F  Step 5: At least one of the means (mean tomato production) is significantly different than the others

 How to: Conduct One-Way ANOVA in Excel  The data must be setup such that each population is in its own column  Click Tools > Data Analysis…  Then select ANOVA: Single Factor  Enter the cell range (or sweep) the data,  Enter level of significance alpha, and indicate how you want the output directed

 J. R. Reed would like to know if the mean number of hours worked per week is the same for the department managers at her three manufacturing plants (Buffalo, Pittsburgh, and Detroit).  A simple random sample of 5 managers from each of the three plants was taken and the number of hours worked by each manager for the previous week is shown below.

F   3.89 Do Not Reject H 0 Reject H 0 MSA / MSE H 0 :  1 =  2 =  3 H A : At least one mean is different

At  =.05, Since F > Fcrit, we Reject H 0 and conclude that at least one mean is different (at least one plant’s mean number of hours worked is different); also, p-value <.05, Reject H 0

 Step 1:H0:  1 =  2 =  3 HA: At least one mean is different  Step 2:  = 0.05 with F2,12 “Reject H0 if F >3.89”  Step 3: F = 9.54  Step 4: H0 is rejected since test F is greater than critical F  Step 5: At least one of the means is significantly different (at least one plant’s mean number of hours worked is different) than the others

 Up to now, we have mostly been working data based on “normal” distributions (parametric data)  The data was typically interval or ratio level (height, weight, distances, etc)  But, what if the data exists merely as COUNTS (frequency) of a particular category, and is nominal type data (nonparametric data)?  Qualitative data are typically non-numerical in nature  Nominal data categorizes data – color, brand, etc.  Ordinal data orders data  The frequencies or counts for each category are the only numerical data we have to work with

 Nonparametric Tests  Chi-square ▪ Goodness of fit test ▪ Testing proportions for more than two populations  Wilcoxon rank sum test  Kruskal-Wallis test

 Chi-Square Goodness of Fit Test  Checks to see how well a set of data fit the model for a particular “expected” probability distribution  Compares the observed frequency distribution to what would be expected if the H0 were true. ▪ H0: The data come from a population with a specific probability distribution (normal, binomial, etc.) ▪ HA: The data do not come from a population with the specified distribution

 Chi-Square Goodness of Fit Test  The observed frequencies (fo ) (or simply “O”) are the actual number of observations that fall into each class in a frequency distribution or histogram.  The expected frequencies (fe) (or simply “E”) are the number of observations that are expected in each category….the expected distribution is also known for as the “hypothesized probability distribution”

 Chi-Square Goodness of Fit Test  H0: fo = fe (the data comes from a population with a specific distribution) HA: fo  fe (not H0)  The test statistic is:  The critical value is a chi-square value with (c – 1) degrees of freedom, where c is the number of categories.

 Example  A University expects the distribution of their students to follow the chart below.

 Example A University expects the distribution of their students to follow the chart at the top right. The University took a survey, and observed the data distribution indicated by the table at the bottom right.

 To calculate the chi-square, one first multiplies the expected percentages for each category times the observed data total to find the “expected” percentages for each category.  Then, calculate (O-E), (O-E)2, (O-E)2/E for each category row; sum the (O-E)2/E column to find the chi-square value,

 Example  Find the Chi-square critical cutoff value for df = 4-1 = 3 and a =.05  The critical cutoff value is 7.815

 Outcome:  H0: There is no difference between the observed and the expected frequencies of absences.  HA: There is a difference between the observed and the expected  Decision Rule: Reject H0 if c2 >  Test statistic: c2 =  Conclusion: Reject H0 and conclude that there is a difference between the observed and expected frequencies of student ages.

 Review Coke example in class

 Chapter 12 problems 4, 7  Chapter 13 problems 1 (f only), 9, 12