Analysing and Presenting Quantitative Data:

Slides:



Advertisements
Similar presentations
SI0030 Social Research Methods Week 5 Luke Sloan
Advertisements

Contingency Table Analysis Mary Whiteside, Ph.D..
Richard M. Jacobs, OSA, Ph.D.
Psychology Practical (Year 2) PS2001 Correlation and other topics.
Statistics. Hypothesis Testing Hypothesis is a ‘testable statement’ Types = alternate, research, experimental (H1), null (H0) They are 1 or 2 tailed (directional.
STATISTICAL ANALYSIS. Your introduction to statistics should not be like drinking water from a fire hose!!
Inferential Statistics
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Chapter 18: The Chi-Square Statistic
Hypothesis Testing Chapter 13. Hypothesis Testing Decision-making process Statistics used as a tool to assist with decision-making Scientific hypothesis.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Session 7.1 Bivariate Data Analysis
Social Research Methods
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Mann-Whitney and Wilcoxon Tests.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Choosing Statistical Procedures
Non-parametric Dr Azmi Mohd Tamil.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Descriptive Research: Quantitative Method Descriptive Analysis –Limits generalization to the particular group of individuals observed. –No conclusions.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Angela Hebel Department of Natural Sciences
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Tuesday, April 8 n Inferential statistics – Part 2 n Hypothesis testing n Statistical significance n continued….
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Research Tools and Techniques The Research Process: Step 7 (Data Analysis Part B) Lecture 29.
Chapter 13 Understanding research results: statistical inference.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Logic of Hypothesis Testing
Data measurement, probability and Spearman’s Rho
Hypothesis Testing.
Lecture Nine - Twelve Tests of Significance.
Data analysis Research methods.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis mutually exclusive exhaustive.
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/6.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
CHOOSING A STATISTICAL TEST
Non-Parametric Tests.
Inferential Statistics
Social Research Methods
Inferential statistics,
Ass. Prof. Dr. Mogeeb Mosleh
Non – Parametric Test Dr. Anshul Singh Thapa.
Unit XI: Data Analysis in nursing research
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Parametric versus Nonparametric (Chi-square)
Understanding Statistical Inferences
Research Methods: Data analysis and reporting investigations.
RES 500 Academic Writing and Research Skills
Analyzing and Interpreting Quantitative Data
Chapter 13: Using Statistics
Presentation transcript:

Analysing and Presenting Quantitative Data: Inferential Statistics

Objectives After this session you will be able to: Choose and apply the most appropriate statistical techniques for exploring relationships and trends in data (correlation and inferential statistics).

Stages in hypothesis testing Hypothesis formulation. Specification of significance level (to see how safe it is to accept or reject the hypothesis). Identification of the probability distribution and definition of the region of rejection. Selection of appropriate statistical tests. Calculation of the test statistic and acceptance or rejection of the hypothesis.

Hypothesis formulation Hypotheses come in essentially three forms.Those that: Examine the characteristics of a single population (and may involve calculating the mean, median and standard deviation and the shape of the distribution). Explore contrasts and comparisons between groups. Examine associations and relationships between groups.

Specification of significance level – potential errors Significance level is not about importance – it is how likely a result is to be probably true (not by chance alone). Typical significance levels: p = 0.05 (findings have a 5% chance of being untrue) p = 0.01 (findings have a 1% chance of being untrue) [

Identification of the probability distribution

Selection of statistical tests –examples Research question Independent variable Dependent variable Statistical test Is stress counselling effective in reducing stress levels? Nominal groups (experimental and control) Attitude scores (stress levels) Paired t-test Do women prefer skin care products more than men? Nominal (gender) Attitude scores (product preference levels) Mann Whitney U (data not normally distributed) Does gender influence choice of coach? Nominal (choice of coach) Chi-square Do two interviewers judge candidates the same? Nominal Rank order scores Spearman’s rho (data not normally distributed) Is there an association between rainfall and sales of face creams? Rainfall (ratio data) Ratio data (sales) Pearson Product Moment (data normally distributed)

Nominal groups and quantifiable data (normally distributed) To compare the performance/attitudes of two groups, or to compare the performance/attitudes of one group over a period of time using quantifiable variables such as scores. Use paired t-test which compares the means of the two groups to see if any differences between them are significant. Assumption: data are normally distributed.

Paired t-test data set

Data outputs: test for normality Case Processing Summary Cases Valid Missing Total N Percent StressTime1 92 98.9% 1 1.1% 93 100.0% StressTime2 Tests of Normality Kolmogorov-Smirnov(a) Shapiro-Wilk Statistic df Sig. StressTime1 .095 92 .041 .983 .289 StressTime2 .096 .034 .985 .363 a Lilliefors Significance Correction

Data outputs: visual test for normality

95% Confidence Interval of the Difference Statistical output Paired Samples Statistics Mean N Std. Deviation Std. Error Mean Pair 1 StressTime1 10.3587 92 3.48807 .36366 StressTime2 8.7500 3.19555 .33316 Paired Samples Test Paired Differences df Sig. (2-tailed) Mean Std. Deviation Std. Error Mean 95% Confidence Interval of the Difference t Lower Upper Pair 1 Stress Time 1 Stress Time 2 1.60870 2.12239 .22127 1.16916 2.04823 7.270 91 .000

Nominal groups and quantifiable data (normally distributed) To compare the performance/attitudes of two groups, or to compare the performance/attitudes of one group over a period of time using quantifiable variables such as scores. Use Mann-Whitney U. Assumption: data are not normally distributed.

Example of data gathering instrument

Mann-Whitney U data set

Kolmogorov-Smirnov(a) Statistical output Tests of Normality Sex Kolmogorov-Smirnov(a) Shapiro-Wilk Statistic df Sig. Attitude 1 .298 32 .000 .815 2 .167 68 .909 Ranks a Lilliefors Significance Correction Test Statistics(a) Attitude Mann-Whitney U 492.500 Wilcoxon W 1020.500 Z -4.419 Asymp. Sig. (2-tailed) .000 a Grouping Variable: Sex Ranks Ranks Sex N Mean Rank Sum of Ranks Attitude 1 32 31.89 1020.50 2 68 59.26 4029.50 Total 100

Association between two nominal variables We may want to investigate relationships between two nominal variables – for example: Educational attainment and choice of career. Type of recruit (graduate/non-graduate) and level of responsibility in an organization. Use chi-square when you have two or more variables each of which contains at least two or more categories.

Chi-square data set

Statistical output Chi-Square Tests Value df Asymp. Sig. (2-sided) Exact Sig. (2-sided) Exact Sig. (1-sided) Pearson Chi-Square .382(b) 1 .536 Continuity Correction(a) .221 .638 Likelihood Ratio .383 Fisher's Exact Test .556 .320 Linear-by-Linear Association .380 .537 N of Valid Cases 201 a Computed only for a 2x2 table b 0 cells (.0%) have expected count less than 5. The minimum expected count is 33.08. Symmetric Measures Value Approx. Sig. Nominal by Nominal Phi .044 .536 Cramer's V N of Valid Cases 201 a Not assuming the null hypothesis. b Using the asymptotic standard error assuming the null hypothesis.

Correlation analysis Correlation analysis is concerned with associations between variables, for example: Does the introduction of performance management techniques to specific groups of workers improve morale compared to other groups? (Relationship: performance management/morale.) Is there a relationship between size of company (measured by size of workforce) and efficiency (measured by output per worker)? (Relationship: company size/efficiency.) Do measures to improve health and safety inevitably reduce output? (Relationship: health and safety procedures/output.)

Perfect positive and perfect negative correlations

Highly positive correlation

Strength of association based upon the value of a coefficient Correlation figure Description 0.00 0.01-0.09 0.10-0.29 0.30-0.59 0.60-0.74 0.75-0.99 1.00 None Negligible Weak Moderate Strong Very strong Perfect

Calculating a correlation for a set of data We may wish to explore a relationship when: The subjects are independent and not chosen from the same group. The values for X and Y are measured independently. X and Y values are sampled from populations that are normally distributed. Neither of the values for X or Y is controlled (in which case, linear regression, not correlation, should be calculated).

Associations between two ordinal variables For data that is ranked, or in circumstances where relationships are non-linear, Spearman’s rank-order correlation (Spearman’s rho), can be used.

Spearman’s rho data set

Statistical output Correlations MrJones MrsSmith Spearman's rho Correlation Coefficient 1.000 .779(**) Sig. (2-tailed) . .000 N 30 ** Correlation is significant at the 0.01 level (2-tailed).

Association between numerical variables We may wish to explore a relationship when there are potential associations between, for example: Income and age. Spending patterns and happiness. Motivation and job performance. Use Pearson Product-Moment (if the relationships between variables are linear). If the relationship is  or -shaped, use Spearman’s rho.

Pearson Product-Moment data set

Relationship between variables

Statistical output Descriptive Statistics Mean Std. Deviation N Rainfall 48.17 11.228 30 Sales 132.47 28.311 Correlations Rainfall Sales Pearson Correlation 1 -.813(**) Sig. (2-tailed) .000 N 30 ** Correlation is significant at the 0.01 level (2-tailed).

Summary Inferential statistics are used to draw conclusions from the data and involve the specification of a hypothesis and the selection of appropriate statistical tests. Some of the inherent danger in hypothesis testing is in making Type I errors (rejecting a hypothesis when it is, in fact, true) and Type II errors (accepting a hypothesis when it is false). For categorical data, non-parametric statistical tests can be used, but for quantifiable data, more powerful parametric tests need to be applied. Parametric tests usually require that the data are normally distributed.