Some Alternative Approaches Two Samples. Outline Scales of measurement may narrow down our options, but the choice of final analysis is up to the researcher.

Slides:



Advertisements
Similar presentations
Equivalence Testing Dig it!.
Advertisements

Comparing Two Proportions (p1 vs. p2)
Issues in factorial design
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Statistical Decision Making
ANOVA: Analysis of Variation
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 29 Multiple Regression.
Chapter Seventeen HYPOTHESIS TESTING
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
Statistics: Data Analysis and Presentation Fr Clinic II.
An Inference Procedure
Don’t spam class lists!!!. Farshad has prepared a suggested format for you final project. It will be on the web
Statistics: Data Presentation & Analysis Fr Clinic I.
Lecture 9: One Way ANOVA Between Subjects
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Role and Place of Statistical Data Analysis and very simple applications Simplified diagram of a scientific research When you know the system: Estimation.
Today Concepts underlying inferential statistics
5-3 Inference on the Means of Two Populations, Variances Unknown
Chapter 14 Inferential Data Analysis
Bootstrapping applied to t-tests
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Standard Error of the Mean
EVALUATION David Kauchak CS 451 – Fall Admin Assignment 3 - change constructor to take zero parameters - instead, in the train method, call getFeatureIndices()
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Hypothesis Testing:.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Statistical Analysis Statistical Analysis
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
More About Significance Tests
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Stats Lunch: Day 7 One-Way ANOVA. Basic Steps of Calculating an ANOVA M = 3 M = 6 M = 10 Remember, there are 2 ways to estimate pop. variance in ANOVA:
MANAGEMENT AND ANALYSIS OF WILDLIFE BIOLOGY DATA Bret A. Collier 1 and T. Wayne Schwertner 2 1 Institute of Renewable Natural Resources, Texas A&M University,
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Bootstrapping (And other statistical trickery). Reminder Of What We Do In Statistics Null Hypothesis Statistical Test Logic – Assume that the “no effect”
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Types of Data in FCS Survey Nominal Scale – Labels and categories (branch, farming operation) Ordinal Scale – Order and rank (expectations, future plans,
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Confidence intervals and hypothesis testing Petter Mostad
PSY2004 Research Methods PSY2005 Applied Research Methods Week Five.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
Simple examples of the Bayesian approach For proportions and means.
Data Analysis, Presentation, and Statistics
Tests after a significant F
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
T tests comparing two means t tests comparing two means.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Statistical Inferences for Variance Objectives: Learn to compare variance of a sample with variance of a population Learn to compare variance of a sample.
Micro array Data Analysis. Differential Gene Expression Analysis The Experiment Micro-array experiment measures gene expression in Rats (>5000 genes).
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Dependent-Samples t-Test
Comparing Two Means t-tests: Rationale for the tests t-tests as a GLM
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Introduction to SAS Essentials Mastering SAS for Data Analytics
Presentation transcript:

Some Alternative Approaches Two Samples

Outline Scales of measurement may narrow down our options, but the choice of final analysis is up to the researcher The following are examples of how one might approach a simple situation involving two samples of data They are not necessarily in lieu of the traditional approach or NHST, but are considered better in that framework (e.g. due to focus on effect size), but some actually do take a different approach Some may be appropriate to both independent samples and paired samples

Raw Scale Sometimes the difference in means is large enough and the scale well known enough that it would be an obviously meaningful difference to anyone with knowledge of the research situation Example: random sample of high school and middle school students and height

Interval for the Mean difference This is actually nothing different than the standard t-test and gives you the exact same information if using the same alpha Does the interval for the difference between means contain the null hypothesized value? –If not reject This should be part of your default reporting anyway.

Equivalence Testing Recall that non-rejection of a nil null does not mean we can accept it Absence of Evidence ≠ Evidence of Absence We’ll talk more in detail regarding equivalence testing as it shifts focus to effect size and multiple outcomes, but the gist is that instead of establishing a difference between groups, the goal is to establish similarity The methodological approach is that we are trying to show that the similarity seen is not due chance variation, just as before we tried to show that the difference is not due to chance variation

‘Inferential’ Confidence Intervals Can be used as part of the equivalence testing approach Specifically they allow for tabled and graphical display that easily communicates statistical difference Regular group mean CIs i.e. CIs for their own mean, in plots are misleading, they can overlap but the test still be statistically significant With the ICI approach, the two CIs for the groups’ respective means are adjusted 1 so that if the CIs for the mean of each group do not overlap, one knows they are statistically different at e.g. the.05 level

Robust t-test A simple robust t-test would involve trimmed means and a pooled winsorized variance As such it can easily be converted to a robust measure of Cohen’s d type effect size Using Wilcox’s libraries (see class webpage) yuenbt(group1, group2,tr=.2, alpha=.05,plotit=T)

Non-Parametric approaches Bootstrap t-test would allow us better performance in instances of non-normality Old methods rely on ranking the data, and as such end up in a different metric that may not be as easily interpretable Bootstrapping can also be applied to the other approaches discussed here, e.g. a bootstrapped CI for a robust effect size.

Bayesian Approach See additional specific slides (coming soon) Gist: multiple hypotheses, prior information, posterior probabilities of the hypotheses, ‘updating’

Effect size Effect size and an interval estimate of it provides an NHST approach but with entire focus shifted to effect size Example vs. the nil null: does the interval contain zero? –If not reject Also measures of overlap, to be talked about in more detail elsewhere, frame the effect size in terms of group overlap

Graphical Approach Often group comparisons are made with a bar graph and error bar, which is not only very dull, but contains no information about the distribution of scores, and is often using the ‘wrong’ CIs to convey group differences Better approaches require a program that’s capable of decent graphics

Regression Using the group membership as the predictor As we have noted, proper coding insures the exact same information will result as the t-test itself

Shifting Prediction to Group Membership Instead of evaluating mean differences, predict group membership (typically with several predictors) Candidate analyses –Logistic Regression –Discriminant Function Analysis

Vector Difference Norm Paired or equal N independent samples Example data: Total taxation from labor and consumption as %GDP Basic Question: How much have taxation rates changed over this time period? –(1981 = 38, 1995 = 40) –Mean difference = 2.0 t(17) = -2.55, p =.021 Problem: Some go up, don’t change, or even go down. And the average doesn’t give us a sense of ‘net change’ –One goes up 2 another down 2, and so you’re going to say the average difference is zero? How might we get such an estimate? Employ the difference norm 1

Vector Difference Norm Multiplying a vector requires that we multiply the column by a row of the same number of values 1 –The “ ’ ” indicates the vector has been transposes (i.e. flopped onto its side) The first value of 260 in and of itself isn’t interpretable However, if we took the 1981 values, raised those by 10% and performed the same operation we did using the 1995 values, we’d come to nearly the same result (265.8) Interpretation: The difference from 1981 to 1995 is equivalent to a 10% increase to all their values

Gist Theory, the goals of research and the communication of ideas drive the analysis more than data type As has been mentioned before, learning statistics is like learning a new language, and you’ll want to increase your vocabulary There are always several options within any research setting for viable and appropriate analysis Using multiple, unique, and/or uncommon methods may be necessary to communicate the specific understanding you have of the information you’ve collected.