An Introduction to Educational Research Statistics Graham McMahon MD MMSc 1.

Slides:



Advertisements
Similar presentations
Chapter 16 Introduction to Nonparametric Statistics
Advertisements

Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
Departments of Medicine and Biostatistics
PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
Statistical Tests Karen H. Hagglund, M.S.
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Data measurement, probability and statistical tests
Parametric & Nonparametric Models for Within-Groups Comparisons overview X 2 tests parametric & nonparametric stats Mann-Whitney U-test Kruskal-Wallis.
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 14 Inferential Data Analysis
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Statistical Analysis KSE966/986 Seminar Uichin Lee Oct. 19, 2012.
Nonparametrics and goodness of fit Petter Mostad
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
AM Recitation 2/10/11.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
ANALYSIS OF VARIANCE. Analysis of variance ◦ A One-way Analysis Of Variance Is A Way To Test The Equality Of Three Or More Means At One Time By Using.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Chapter 15 Data Analysis: Testing for Significant Differences.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Where are we?. What we have covered: - How to write a primary research paper.
A Repertoire of Hypothesis Tests  z-test – for use with normal distributions and large samples.  t-test – for use with small samples and when the pop.
EDLD 6392 Advanced Topics in Statistical Reasoning Texas A&M University-Kingsville Research Designs and Statistical Procedures.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Ordinally Scale Variables
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Experimental Design and Statistics. Scientific Method
Medical Statistics as a science
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 11: Bivariate Relationships: t-test for Comparing the Means of Two Groups.
Sample size and common statistical tests There are three kinds of lies- lies, dammed lies and statistics…… Benjamin Disraeli.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Statistics in IB Biology Error bars, standard deviation, t-test and more.
Chapter 6: Analyzing and Interpreting Quantitative Data
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Tuesday PM  Presentation of AM results  What are nonparametric tests?  Nonparametric tests for central tendency Mann-Whitney U test (aka Wilcoxon rank-sum.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Remember You just invented a “magic math pill” that will increase test scores. On the day of the first test you give the pill to 4 subjects. When these.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
Data Analysis: Statistics for Item Interactions. Purpose To provide a broad overview of statistical analyses appropriate for exploring interactions and.
Quantitative methods and R – (2) LING115 December 2, 2009.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Chapter 15 Analyzing Quantitative Data. Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Presentation transcript:

An Introduction to Educational Research Statistics Graham McMahon MD MMSc 1

Course Overview  Last week: Stages of a trial from design to completion Stages of a trial from design to completion Generating hypotheses Generating hypotheses Working with the IRB Working with the IRB Considering the funding required Considering the funding required Trial Designs Trial Designs  Today: Choosing an outcome variable Choosing an outcome variable Powering your study Powering your study Establishing inter-rater reliability Establishing inter-rater reliability Determining if there is a difference between two groups Determining if there is a difference between two groups Test development Test development Qualitative approaches Qualitative approaches 2

Stages of an Educational Interventional Trial StageActivities 1 Initial Design Hypothesis, Size 2 Protocol Design Define methods, collaborations, IRB 3Recruitment Subject Acquisition, Monitoring 4Followup Collect outcome data 5Analysis Prepare “Clean + Locked” Database Perform analysis 6Reporting Write and submit manuscript 7 Additional analyses Further explorations of trial data 3

Population & Sampling  Must balance Variability [the smaller or more diverse the population, the more variable; variability creates error] Variability [the smaller or more diverse the population, the more variable; variability creates error] Generalizability [population can’t be too specific] Generalizability [population can’t be too specific] Access [you can only study those you have access to] Access [you can only study those you have access to] Cost [larger studies are much more expensive] Cost [larger studies are much more expensive]  Consider Participation rate Participation rate Multiple sites Multiple sites Online projects Online projects Lower reimbursement Lower reimbursement 4

Outcome  What is really important?  What would colleagues care about?  ‘Hard’ outcomes Death, attendence, Death, attendence,  ‘Soft’ outcomes Satisfaction, self-confidence Satisfaction, self-confidence 5

Outcomes / Endpoints  Primary Outcome What you power your study on What you power your study on  Secondary Outcome Other related outcomes that may be interesting to test Other related outcomes that may be interesting to test  Exploratory Outcomes Association studies, subgroups that may be interesting, but likely to be underpowered Association studies, subgroups that may be interesting, but likely to be underpowered May serve as pilot data for future studies May serve as pilot data for future studies  Surrogate Endpoint In the causal pathway and affected by the intervention In the causal pathway and affected by the intervention 6

Group Activity  Medical errors and patient safety continue to be an important concern for patients and physicians. Numerous reports have suggested that fatigue and sleepiness contribute to medical errors. You are the program director in an internal medicine residency that has 40 residents and want to make a contribution in this area.  List an hypothesis that could be generated based on this reflection.  How would you measure sleepiness?

You review the available sleepiness scales and must choose one. Which one is best? A Awake index B Sleepy score C Doze Index D Snory scale E Yawn score Scale Size Mean Rating for Residents Standard Deviation for Residents DistributionMean>MedianMean=MedianMean=MedianMean<MedianMean=Median Expected Score Difference

Power and Error  α is the probability of making a Type I error  Power is the likelihood of avoiding a Type II error  Use trial type, α and power to calculate sample size 9

Sample Size Calculations 10

Calculating Sample Size Effect Size 1 SD diff between groups with power of 0.8 requires subjects 0.3 SD diff between groups with power of 0.8 requires subjects 11

Simple Calculation  N (per group) = 15.8 / (effect size) 2 for power of 80% and α=0.05  Remember to increase enrollment so that number completing ≥ expected sample size 12

You review the available sleepiness scales and must choose one. Which one is best? A Awake index B Sleepy score C Doze Index D Snory scale E Yawn score Scale Size Mean Rating for Residents Standard Deviation for Residents DistributionMean>MedianMean=MedianMean=MedianMean<MedianMean=Median Expected Score Difference Effect size = score difference / standard deviation

Power and Samples Sizes A Awake index B Sleepy score C Doze Index D Snory scale E Yawn score Scale Size Mean Rating for Residents Standard Deviation for Residents DistributionMean>MedianMean=MedianMean=MedianMean<MedianMean=Median Expected Score Difference N per group Power (N=15 per grp)

Calculating Sample Size using Software Difference between groups Standard Deviation Choose Test biostat.mc.vanderbilt.edu/twiki /bin/view/Main/PowerSampleSize 15

Two faculty offer to measure the sleepiness of residents using your scale. How can you find out if they are good raters?

Interrater Reliability   Interrater reliability is the extent to which two or more individuals (coders or raters) agree.   Training, education and monitoring skills can enhance interrater reliability.   Goal is generally reliability > 0.8 Categorial: measure % Ordinal: spearman rho Continuous: pearson r Rater 1Rater 2Rater 1Rater Pearson 0.81 Pearson 0.56

Analyzing your Data  Plan your analysis  Consider consulting a specialist  Test for normality  Choose the right test  Avoid statistical explorations with the data 18

  You start your study and find that among the interns the M:F ratio was 12:5 and 8:9 and wonder if they are statistically unbalanced.

Categorical Counts  Chi-square statistic: no cell in the table should have an expected frequency of <1, and no more than 20% of the cells should have an expected frequency of <5.  Use Fisher’s exact test when numbers are small Group 1Group 2 Men128 Women59 Chi-square = 1.1 Fisher exact, p=

 You collect your baseline observations and find the following sleepiness in each group. Are they different? Grp 1 – 8, 6, 5, 2, 3, 9, 11, 6, 11 Grp 1 – 8, 6, 5, 2, 3, 9, 11, 6, 11 Grp 2 – 3, 5, 5, 2, 7, 4, 8, 10, 2 Grp 2 – 3, 5, 5, 2, 7, 4, 8, 10, 2

Summary of Tests Type of DataTwo Paired Groups Two Independent Groups Many Independent Groups Correlation CategoriesMcNemarChi-square ContinuousPaired t-testt-testANOVAPearson r RankWilcoxonKruskal-WallisSpearman r 22 Test for Normality!

t-test  Comparing two means  Check if paired or unpaired  The more SE’s you are away from zero, the less likely that the difference occurred by chance Had Elective No Elective Number of students Mean Score76%64% SD

Testing difference between two groups over time  t- test on between group difference at end  t-test on change over time Time 1Time 2 24

ANOVA (analysis of variance)  Extension of t-tests across more than two groups.  Gives a single overall test of whether there are differences between groups.  Avoids multiple comparisons  Compares variance within a group to variance between groups.  Generates an F-statistic and P-value Group 1 Group 2 Group 3 Group 4 25

Statistical Tests for Skewed or Rank Data  These data don’t follow normal rules  Non-parametric tests are less powerful  Two groups Wilcoxon rank sum (=Mann-Whitney-U) Wilcoxon rank sum (=Mann-Whitney-U)  Three or more groups Kruskal-Wallis Kruskal-Wallis 26

Wilcoxon Rank Sum  Rank all observations in increasing order of magnitude, ignoring which group they come from.  Add up the ranks in the smaller of the two groups.  Look up the critical value of the sum of ranks for that size group. 27

Summary of Tests Type of DataTwo Paired Groups Two Independent Groups Many Independent Groups Correlation CategoriesMcNemarChi-square ContinuousPaired t-testt-testANOVAPearson r RankWilcoxonKruskal-WallisSpearman r 28

Summary  Careful choice of your population will improve your chances of finding an effect  Choose your outcome measure thoughtfully  Estimate your power and sample size in advance  Ensure internal consistency is good  Determine normality and analyze your dataset accordingly

Graham McMahon 30