Introduction to Statistical Methods By Tom Methven Digital slides and tools available at: www.macs.hw.ac.uk/~mjc/teaching/ResearchMethods.

Slides:



Advertisements
Similar presentations
Parametric/Nonparametric Tests. Chi-Square Test It is a technique through the use of which it is possible for all researchers to:  test the goodness.
Advertisements

© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Hypothesis testing & Inferential Statistics
BHS Methods in Behavioral Sciences I
Methods and Measurement in Psychology. Statistics THE DESCRIPTION, ORGANIZATION AND INTERPRATATION OF DATA.
Lecture 9: One Way ANOVA Between Subjects
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Today Concepts underlying inferential statistics
Richard M. Jacobs, OSA, Ph.D.
Non-parametric statistics
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Inferential Statistics
(a brief over view) Inferential Statistics.
Choosing Statistical Procedures
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
AM Recitation 2/10/11.
Statistical Analysis I have all this data. Now what does it mean?
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Statistical inference: confidence intervals and hypothesis testing.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Statistical Analysis Statistical Analysis
Psychometrics.
Single-Sample T-Test Quantitative Methods in HPELS 440:210.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Hypothesis Testing CSCE 587.
Statistical Analysis I have all this data. Now what does it mean?
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
CS130 – Software Tools Fall 2010 Statistics and PASW Wrap-up 1.
User Study Evaluation Human-Computer Interaction.
TAUCHI – Tampere Unit for Computer-Human Interaction ERIT 2015: Data analysis and interpretation (1 & 2) Hanna Venesvirta Tampere Unit for Computer-Human.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Psychology 101. Statistics THE DESCRIPTION, ORGANIZATION AND INTERPRATATION OF DATA.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.
Histograms and Distributions Experiment: Do athletes have faster reflexes than non-athletes? Questions: - You go out and 1st collect the reaction time.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Review Hints for Final. Descriptive Statistics: Describing a data set.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Experimental Design and Statistics. Scientific Method
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
SOCW 671 #2 Overview of SPSS Steps in Designing Research Hypotheses Research Questions.
Chapter 10 The t Test for Two Independent Samples
Chapter Eight: Using Statistics to Answer Questions.
Data Analysis.
PCB 3043L - General Ecology Data Analysis.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
STATISTICS FOR SCIENCE RESEARCH (The Basics). Why Stats? Scientists analyze data collected in an experiment to look for patterns or relationships among.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
The Analysis of Variance ANOVA
Statistics for Political Science Levin and Fox Chapter Seven
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
Introduction to Statistical Methods By Tom Methven Digital slides and tools available at:
T tests comparing two means t tests comparing two means.
ENGR 610 Applied Statistics Fall Week 7 Marshall University CITE Jack Smith.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 9 Introduction to the t Statistic
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Research Methodology Lecture No :25 (Hypothesis Testing – Difference in Groups)
Inferential Statistics
Quantitative Methods in HPELS HPELS 6210
What are their purposes? What kinds?
Presentation transcript:

Introduction to Statistical Methods By Tom Methven Digital slides and tools available at:

Moving Bell-curves

Designing the Experiment 1. Define exactly what you want to measure 2. Pick which statistical test to use, first 3. Decide on your experimental design

Worked Example Two different piano designs, iPiano and Wii Piano! This example will be built up through the notes to show how to go about designing/analysing a simple experiment.

Level Of Measurement (Non- Parametric) Nominal – Labelled results or characteristics – Can be used for equivalence and set membership – Can show commonality but leaves it undefined E.g. List of names, football players’ numbers Ordinal – Results in some sort of rank order – Results do not show differences between places! – No arithmetic is valid, you cannot say 3 rd place is the same as 1 st and 2 nd combined, for example E.g. Race results, top 10 films

Level Of Measurement (Parametric) Interval – Results placed along a regular scale, with an arbitrary zero-point – Multiplication/Division is meaningless but the ratio of differences is valid as differences are consistent E.g. Degrees Fahrenheit Ratio – Results placed along a regular scale with an absolute zero-point – Allows for statements such as 6 units is twice as much as 3 units E.g. Kelvin and most SI units

Statistic Basics ModeThe most common score MedianThe central score MeanThe ‘average’ of all the scores E.g. For the results 9,2,5,3,6,9,5,6,4,2,6 Mode: 6 Median: 5 Mean: 5.18

Worked Example Results Using time to successfully play a simple tune as the measure. Therefore, we’re using a ratio scaling, so we can use parametric tests. Results: Interface 1Interface 2 Person Person Person Person Person Person Person Person Person Person Mean:

Randomisation and Ordering Effects Performing an experiment gives experience which could improve subjects’ ‘scores’ For a two-part test, an equal number of people should try each condition first When you have many conditions or trials, it is easiest to show then in a random order

Latin Squares A way of counter-balancing condition order for small amounts of conditions E.g. For three possible conditions: You can also get ‘balanced’ Latin squares for experiments with even number of conditions. Order of conditions or trials Group 1ABC Group 2BCA Group 3CAB

Accuracy of the Mean Variance: Standard Deviation: Standard Error:

Degrees of Freedom For sample populations, often ‘N – 1’ is used rather than N. This is the simplest calculation of D.O.F., but it can get very complex. We assume that the sample mean is the same as the population mean. Therefore, it is related to how many values are free to vary without altering the required mean value. (Rugby example)

Student’s T-Test Used for comparing the means of two sample groups using the variance Several types, dependant on the samples: – ‘Dependant/Paired’ – Using this one! – ‘Independent Equal Variance’ – ‘Independent Unequal Variance’ Assumes populations are normally distributed

Student’s T-Test Create a ‘null hypothesis’: – E.g. There is no difference in learning between iPiano and Wii Piano Create an ‘alternative hypothesis’: – One-tailed: Wii Piano is easier to learn – Two-tailed: Wii Piano is easier or harder to learn

Dependent T-Test Used to compare the results of two groups = Average difference = Expected difference (0 for null hypothesis) = Standard deviation of differences = Sample Size

Worked Example T Result = = = 10 t-value = 2.26

Interpreting T-Value The T-Test gives a t-value, which can be converted to a p-value If the p-value < 0.05 there is less than a 5% chance the null hypothesis is true Therefore we conclude that the change of condition caused the observed effect Our p-value was 0.025, so we can accept our alternate hypothesis: ‘Wii Piano is easier to learn’

Effect Size Tells us how important the result is in practical terms. Gives an ‘r-value’. – r = 0.10 (small effect) – 1% of total variance – r = 0.30 (medium effect) – 9% of total variance – r = 0.50 (large effect) – 25% of the variance – r DOES NOT work on a linear scale, so these can only be used as good guidelines. It is becoming more common to include the r-value as well as the p-value as it is a well understood and widely used check.

[letter]-values Recap: t-value: Result of the t-test – Normally converted into a ‘p-value’ for easy comparison between statistical tests. Used to calculate ‘r-value’ p-value: Is it statistically significantly? – If ‘p-value’ is less than 0.05, then you can discard the null hypothesis and accept your alternate hypothesis. r-value: Is the effect substantial in reality? – Does the result mean anything in real terms? HOW MUCH does our experiment explain the change in means?

Final Results p-value = r-value = 0.60 Degrees of freedom = 9 “The results show that Wii Piano allows users to play a set tune successfully significantly faster than iPiano (p = 0.025). In addition, the effect size was large (r = 0.6), showing the result was substantial in real terms.”

Error Bars Error bars: Plot standard error

Excel Example TTEST in Excel will give a ‘p-value’ directly

Summing Up Dependant t-test when using a single group – For example, when using 10 friends to test two interfaces Avoid ordering effects – Randomise order and get an even number to do each test first Use ‘TTEST’ in Excel to get p-value easily – Easier than calculating it yourself, but harder to get r-value! Check p < 0.05 and quote the value and result – Always make sure to quote the actual p-value in your results as well as saying whether the result was statistically significant

Recommended Reading