We think you have liked this presentation. If you wish to download it, please recommend it to your friends in any social system. Share buttons are a little bit lower. Thank you!
Presentation is loading. Please wait.
Published byReynold Craig
Modified about 1 year ago
CSCI 6960- Research Methods - 1 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Inferential Statistics: Descriptive statistics deals with the analysis of data collected on the sample (subjects). Inferential statistics is – in contrast – concerned with what the subjects can tell us about the larger population they represent. It is usually impossible to observe the entire population of our interest. We resort to observing and measuring the characteristics of a sample of that population instead. Example: To study the software defect discovery capability of inspectors, we cannot observe everyone who has and will do inspections. Instead we look at a sample population.
CSCI 6960- Research Methods - 2 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) The sample is selected and used as if it represents the general population. Therefore, it better do so! Strict sampling procedures must be followed. At any rate, due to natural (chance) variability between any sample and its general population, we CANNOT say, with certainty that any observation made with respect to the sample would extend to the general population. We can only talk about the probability that it would. Example: In a test designed to find the extent to which programmers with no training in testing would find program defects compared to those
CSCI 6960- Research Methods - 3 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) that have had such training, we find that the sample of programmers under our study found on average 5.3 defects in an hour of testing. The control group (the ones who were trained in testing) found on average 5.8 defects. What does this finding mean? A.Given the difference between the sample groups, does this mean that there is a difference between the trained population and non-trained population? B.Is this difference sufficiently significant to be 1) not just by chance, 2) or small enough to not signify a real difference? (so for example to recommend discontinuing tester training.)
CSCI 6960- Research Methods - 4 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Here, we are testing the null hypothesis that: There is no real difference between the two population means. We know about the null hypothesis and the type I and type II errors. To go further, however, we need to distinguish between a population parameter and a sample statistic. A sample statistic is a statistic describing the sample drawn. For example, taking the mean of the data-points from the sample. A population parameter is a characteristic of the entire population. For example if we take the mean of ALL data-points in the population.
CSCI 6960- Research Methods - 5 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Our task is usually to estimate population parameters and to draw conclusions about population parameters based on sample statistics. Inferential statistics are used to compute the probability of obtaining the observed data if the null-hypothesis were true. If the probability is small, then it is unlikely that the null- hypothesis is true. The somewhat arbitrary cut-off points (called alpha levels) were introduced to cater for such measure.
CSCI 6960- Research Methods - 6 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Testing for mean differences: Inferential statistics are used most frequently to evaluate mean differences between groups. We can use such techniques to specify the research hypothesis in terms of mean differences. There are a number of tests for evaluating mean differences in two or more groups. These include the: 1.Simple t-test 2.Correlated t-test 3.Analysis of Variance (ANOVA)
CSCI 6960- Research Methods - 7 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Simple t-test: This test is typically used with score data from two independent samples of subjects (e.g. trained and non-trained). The null- hypothesis is that there is NO difference in the two population means. In other words, the observed difference between the sample means is due only to chance. The test statistic is called the t statistic. We compute the t statistic and the probability (p value) of obtaining this t value if the null hypothesis is true. If p is less than our alpha level, we reject the null-hypothesis and conclude that the population means are different.
CSCI 6960- Research Methods - 8 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) We compute the t statistic using the following formula: and are the means of the two samples and and are the sum of squares of each sample.
CSCI 6960- Research Methods - 9 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Correlated t-test: When we do not have independent samples, we use the correlated t-test. Examples of such design might be within-subject design, where the same subjects appear in each group or matched-subjects design, where all subjects are paired and then randomly assigned so that one member of the pair goes into one group and the other into another group.
CSCI 6960- Research Methods - 10 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) An example of within-subject design might be when a group of subjects are given an error free, a somewhat buggy and a bug- riddled program to test. The testing time of each person testing each program is recorded. As each subject is exposed to all levels of the independent variable, the scores in each condition are correlated with the scores in the other condition (i.e. performance in one is correlated with performance in the other condition.) The critical comparison is the difference between correlated groups on the dependent variable (rate of defect identification).
CSCI 6960- Research Methods - 11 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) An example of matched subject design would be when comparing the effectiveness of training in two different methods of testing software. If tester 1 is trained in method A he or she cannot be trained in method B. We need a new subject, say subject 2 who is the paired member of subject 1. Now we can compare the two groups using a correlated t-test.
CSCI 6960- Research Methods - 12 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Analysis of Variance (ANOVA): When we have more than two groups and want to test for mutual mean difference between all the groups we usually use a form of the ANOVA method. The term is actually confusing as the test compares the means of the various groups but it does so by computing and comparing the different population variance estimates. One advantage of ANOVA is that it allows analysis of one,two or several independent variables at the same time. Each independent variable is called a factor and the research design with more than one factor is called factorial research design.
CSCI 6960- Research Methods - 13 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) We refer to a study with only one independent variable as a one- way ANOVA, with two factors, it is called two-way ANOVA and so forth. Example: In a study of tester behavior it was found that testers do generally better when there is a moderate level of other activity in the work-place. Testing in total silence usually ends in poorer result. Further investigation found that it is not the absence of ambient noise that results in poor performance alone but that conditions such as the mood of the tester, the load under which they work, personal problems, etc. are also contributing factors. It seems therefore that silence is a necessary but insufficient condition for degradation of performance as are moods, problems and loads. We can now set up a factorial design to test two or
CSCI 6960- Research Methods - 14 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) several independent variables may be considered. Let us say we concentrate on two level of ambient noise (binary; relatively noisy or silent) and mood (negative or positive). So we have two factors and two levels for each. This gives four treatment combinations leading to a 2 by 2 factorial design.
CSCI 6960- Research Methods - 15 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Multivariate Analysis of Variance (MANOVA): The difference between MANOVA and ANOVA is in the dependent variable. In ANOVA we can have multiple independent variables but one dependent variable. In MANOVA we can have multiple dependent variables also.
CSCI 6960- Research Methods - 16 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Mann-Whitney U test: This is the comparison inferential test to be used if we have two ordinal groups of data and the groups are independent. Wilcoxon signed-rank test: This is the comparison inferential test to be used if we have two ordinal groups of data and the groups are not independent.
CSCI 6960- Research Methods - 17 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Kruskal-Wallis one-way ANOVA: This is the comparison inferential test to be used if we have more than two ordinal groups of data and the groups are independent. Friedman two-way ANOVA: This is the comparison inferential test to be used if we have more than two ordinal groups of data and the groups are not independent.
CSCI 6960- Research Methods - 18 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Chi-square goodness of fit: This is when we have one group being compare against a hypothetical situation or a theory. Chi-square test for independence: This is when we have more than one group being compared against each other.
CSCI 6960- Research Methods - 19 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Start Go to Flowchart 1 Start with 1 st hypothesis What is the dependent variable? Type of data for that dependent variable? Nominal Ordinal Score Go to flowchart 2 Go to flowchart 3 Go to flowchart 4 Are there more research hypotheses to evaluate? End yes No Selecting appropriate Statistical Analysis procedures. Flowchart 0
CSCI 6960- Research Methods - 20 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Identify the following: 1.Level of constraint (Naturalistic, case-study, correlational, …, experimental) 2.Independent variables 3.Levels and scale of independent variables 4.Each independent variable manipulated or not. 5.Type of design (independent groups, correlated groups, mixed..) 6.Dependent variables 7.Levels and scale of dependent variables 8.Operational procedures 9.Research hypothesis 10.Type of test needed 9relationship or difference) Start End Flowchart 1: Start-up
CSCI 6960- Research Methods - 21 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Start Inferential or Descriptive Desc. Frequency count Back to flowchart 0. More hypotheses? Chi-square goodness-of-fit Comparing one group against hypothetical situation or two or more groups Infer. one Two+ Chi-square test of independence Flowchart 2: Nominal Data
CSCI 6960- Research Methods - 22 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Start Inferential or Descriptive Infer. Type of description Desc. Median central Range Variab. relation Spearman rank correlation Number of groups Groups independent or correlated Two Indep. Corr. Kruskal-Wallis one way ANOVA Three+ Mann-Wittney U-test Wilcoxon signed-rank test Back to flowchart 0. More hypotheses? Groups independent or correlated Indep. Freiedman two way ANOVA Corr.. Flowchart 3: Ordinal Data
CSCI 6960- Research Methods - 23 - HO 8 © Houman Younessi 2013 Lecture 8 Quantitative Procedures (2) Start Inferential or Descriptive Infer. Type of description Desc. Mean, median,mode,Z central Variance, std Variab. relation Pearson product moment Number of factors Number of groups Groups independent or correlated one two Indep. Corr. ANOVA Two+ Single group t test t test for independent groups t test for correlated groups Back to flowchart 0. More hypotheses? Flowchart 4: Score Data
Today Concepts underlying inferential statistics Types of inferential statistics Parametric T-tests ANOVA Multiple regression ANCOVA Non-parametric Chi-Square.
Types of Inferential Statistics Parametric Statistics : estimate the value of a population parameter from the characteristics of a sample –Assumes the.
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Chapter 14 Inferential Data Analysis. Inferential Statistics Techniques that allow us to study samples and then make generalizations about the population.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
QUANTITATIVE DATA ANALYSIS Chapter 11. LEVELS OF MEASUREMENT Variable attributes: the characteristics or qualities that describe a variable Variable attributes.
Chapter 15 Data Analysis: Testing for Significant Differences.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Chapter 14 Nonparametric Statistics. 2 Introduction: Distribution-Free Tests Distribution-free tests – statistical tests that don’t rely on assumptions.
Inferential Statistics Psych 231: Research Methods in Psychology.
Inferential Statistics Statistical Analysis of Research Data.
Educational Research Chapter 12 Inferential Statistics Gay, Mills, and Airasian.
Experimental Psychology PSY 433 Appendix B Statistics.
Statistics Definition Methods of organizing and analyzing quantitative data Types Descriptive statistics –Central tendency, variability, etc. Inferential.
Stats/Methods II JEOPARDY. Jeopardy Chi-Square Single-Factor Designs Factorial Designs Ordinal Data Surprise $100 $200$200 $300 $500 $400 $300 $400 $300.
Chapter 13 Understanding research results: statistical inference.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
PSYC512: Research Methods PSYC512: Research Methods Lecture 9 Brian P. Dyre University of Idaho.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Nonparametric or Distribution- free Tests. Parametric tests The test technique makes use of one or more values obtained from sample data [often called.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
2/10/11. Infer properties of the population from what is observed in the sample An inference is a generalization, as inferences go beyond the data.
t(ea) for Two: Test between the Means of Different Groups When you want to know if there is a ‘difference’ between the two groups in the mean Use “t-test”.
CSCI Research Methods HO 6 © Houman Younessi 2012 Lecture 6 Quantitative Procedures(1) Research variables: Every research project – irrespective.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
CHAPTER 23: Two Categorical Variables: The Chi-Square Test Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
Educational Research: Data analysis and interpretation – 2 Inferential statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Outline of Today’s Discussion 1.The Chi-Square Test of Independence 2.The Chi-Square Test of Goodness of Fit.
Statistics and Research methods Wiskunde voor HMI Bijeenkomst 3 Relating statistics and experimental design.
1 Nonparametric Statistical Techniques Chapter 17.
Copyright © Cengage Learning. All rights reserved. 14 Elements of Nonparametric Statistics.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
CHAPTER 15 NONPARAMETRIC STATISTICS. Learning Objectives Determine situations where nonparametric procedures are better alternatives to the parametric.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
© 2017 SlidePlayer.com Inc. All rights reserved.