Analysing and Interpreting Data Joel Faronbi. 2 ‘All meanings, we know, depend on the key of interpretation.’ -George Eliot.

Slides:



Advertisements
Similar presentations
Chapter 14 Analysing Qualitative Data Zina OLeary.
Advertisements

Chapter 13 Analysing Quantitative Data
Data Analysis Basics: Variables and Distribution.
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Processing and Fundamental Data Analysis CHAPTER fourteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences CHAPTER.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Introduction to statistics in medicine – Part 1 Arier Lee.
MSS 905 Methods of Missiological Research
Statistical Tests Karen H. Hagglund, M.S.
Analysing and Interpreting Data Chapter 11. O'Leary, Z. (2005) RESEARCHING REAL-WORLD PROBLEMS: A Guide to Methods of Inquiry. London: Sage. Chapter 11.2.
Data Management and Analysis
Data analysis Incorporating slides from IS208 (© Yale Braunstein) to show you how 208 and 214 are telling you many of the the same things; and how to use.
INTERPRET MARKETING INFORMATION TO TEST HYPOTHESES AND/OR TO RESOLVE ISSUES. INDICATOR 3.05.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Introduction to Educational Statistics
Introduction to research Analyzing and interpreting research data
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Data Management: Quantifying Data & Planning Your Analysis
CHAPTER 14, QUANTITATIVE DATA ANALYSIS. Chapter Outline  Quantification of Data  Univariate Analysis  Subgroup Comparisons  Bivariate Analysis  Introduction.
Quantifying Data.
BASIC STATISTICS WE MOST OFTEN USE Student Affairs Assessment Council Portland State University June 2012.
Measures of Central Tendency
Understanding Research Results
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
How do you know?: Interpreting and Analyzing Data NCLC 203 New Century College, George Mason University April 6, 2010.
Describing distributions with numbers
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Collecting, Presenting, and Analyzing Research Data By: Zainal A. Hasibuan Research methodology and Scientific Writing W# 9 Faculty.
Statistics and Research methods Wiskunde voor HMI Betsy van Dijk.
Research Tools and Techniques The Research Process: Step 7 (Data Analysis Part A) Lecture 28.
Foundations of Educational Measurement
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
Chapter 11 Descriptive Statistics Gay, Mills, and Airasian
Descriptive Statistics
Chapter Thirteen Validation & Editing Coding Machine Cleaning of Data Tabulation & Statistical Analysis Data Entry Overview of the Data Analysis.
Analyzing and Interpreting Quantitative Data
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
The Practice of Social Research Chapter 14 – Quantitative Data Analysis.
Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Chapter Twelve Copyright © 2006 John Wiley & Sons, Inc. Data Processing, Fundamental Data Analysis, and Statistical Testing of Differences.
Semester 2: Lecture 3 Quantitative Data Analysis: Univariate Analysis II Prepared by: Dr. Lloyd Waller ©
I. Introduction to Data and Statistics A. Basic terms and concepts Data set - variable - observation - data value.
Describing Distributions with Numbers Chapter 2. What we will do We are continuing our exploration of data. In the last chapter we graphically depicted.
Chapter 6: Analyzing and Interpreting Quantitative Data
Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.
1 UNIT 13: DATA ANALYSIS. 2 A. Editing, Coding and Computer Entry Editing in field i.e after completion of each interview/questionnaire. Editing again.
LIS 570 Summarising and presenting data - Univariate analysis.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
1 Take a challenge with time; never let time idles away aimlessly.
Exploratory data analysis, descriptive measures and sampling or, “How to explore numbers in tables and charts”
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Data Entry, Coding & Cleaning SPSS Training Thomas Joshua, MS July, 2008.
Measurements Statistics
Chapter 6 Introductory Statistics and Data
FINA262 Financial Data Analysis
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Analyzing and Interpreting Quantitative Data
Finding Answers through Data Collection
Basic Statistical Terms
Unit XI: Data Analysis in nursing research
Welcome!.
Ass. Prof. Dr. Mogeeb Mosleh
Chapter 6 Introductory Statistics and Data
Presentation transcript:

Analysing and Interpreting Data Joel Faronbi

2 ‘All meanings, we know, depend on the key of interpretation.’ -George Eliot

Introduction Think about analysis EARLY Start with a plan Code, enter, clean Analyze Interpret Reflect  What did we learn?  What conclusions can we draw?  What are our recommendations?  What are the limitations of our analysis?

Why do I need an analysis plan? To make sure the questions and your data collection instrument will get the information you want. To align your desired “report” with the results of analysis and interpretation. To improve reliability--consistent measures over time.

5 Effective Data Analysis Effective data analysis involves  keeping your eye on the main game  managing your data  engaging in the actual process of quantitative and / or qualitative analysis  presenting your data  drawing meaningful and logical conclusions

Key components of a data analysis plan Purpose of the research Questions What you hope to learn from the question managing your data Analysis technique- engaging in the actual process of quantitative and / or qualitative analysis Data presentation drawing meaningful and logical conclusions

7 The Big Picture Analysis should be approached as a critical, reflective, and iterative process that cycles between data and an overarching research framework that keeps the big picture in mind

8 Managing Data Regardless of data type, managing your data involves  familiarizing yourself with appropriate software  developing a data management system  systematically organizing and screening your data  entering the data into a program  and finally ‘cleaning’ your data

9 Statistics Being able to do statistics no longer means being able to work with formula It’s much more important for researchers to be familiar with the language and logic of statistics, and be competent in the use of statistical software

10 Data Types Different data types demand discrete treatment, so it’s important to be able to distinguish variables by  cause and effect (dependent or independent)  measurement scales (nominal, ordinal, interval, and ratio)

11 Descriptive Statistics Descriptive statistics are used to summarize the basic feature of a data set through  measures of central tendency (mean, mode, and median)  dispersion (range, quartiles, variance, and standard deviation)  distribution (skewness and kurtosis)

12 Inferential Statistics Inferential statistics allow researchers to assess their ability to draw conclusions that extent beyond the immediate data, e.g.  if a sample represents the population  if there are differences between two or more groups  if there are changes over time  if there is a relationship between two or more variables

13 Selecting Statistical Tests Selecting the right statistical test relies on  knowing the nature of your variables  their scale of measurement  their distribution shape  types of question you want to ask

14

Types of Variables Continuous variables:  Always numeric  Can be any number, positive or negative  Examples: age in years, weight, blood pressure readings, temperature, concentrations of pollutants and other measurements Categorical variables:  Information that can be sorted into categories  Types of categorical variables – ordinal, nominal and dichotomous (binary)

Categorical Variables: Nominal Variables Nominal variable – a categorical variable without an intrinsic order Examples of nominal variables:  Where a person lives in the U.S. (Northeast, South, Midwest, etc.)  Sex (male, female)  Nationality (Nigerian, American, Mexican, French)  Race/ethnicity (African American, Hispanic, White, Asian American)  Favorite pet (dog, cat, fish, snake)

Categorical Variables: Ordinal Variables Ordinal variable—a categorical variable with some intrinsic order or numeric value Examples of ordinal variables:  Education (no high school degree, HS degree, some college, college degree)  Agreement (strongly disagree, disagree, neutral, agree, strongly agree)  Rating (excellent, good, fair, poor)  Frequency (always, often, sometimes, never)  Any other scale (“On a scale of 1 to 5...”)

Categorical Variables: Dichotomous Variables Dichotomous (or binary) variables – a categorical variable with only 2 levels of categories  Often represents the answer to a yes or no question For example:  “Did you attend the church picnic on May 24?”  “Did you eat potato salad at the picnic?”  Anything with only 2 categories

Coding Coding – process of translating information gathered from questionnaires or other sources into something that can be analyzed Involves assigning a value to the information given—often value is given a label Coding can make data more consistent:  Example: Question = Sex  Answers = Male, Female, M, or F  Coding will avoid such inconsistencies

Coding Systems Common coding systems (code and label) for dichotomous variables:  0=No1=Yes (1 = value assigned, Yes= label of value)  OR:1=No2=Yes When you assign a value you must also make it clear what that value means  In first example above, 1=Yes but in second example 1=No  As long as it is clear how the data are coded, either is fine You can make it clear by creating a data dictionary to accompany the dataset

Coding: Dummy Variables A “dummy” variable is any variable that is coded to have 2 levels (yes/no, male/female, etc.) Dummy variables may be used to represent more complicated variables  Example: # of cigarettes smoked per week--answers total 75 different responses ranging from 0 cigarettes to 3 packs per week  Can be recoded as a dummy variable: 1=smokes (at all)0=non-smoker This type of coding is useful in later stages of analysis

Coding: Attaching Labels to Values Many analysis software packages allow you to attach a label to the variable values Example: Label 0’s as male and 1’s as female Makes reading data output easier: Without label:Variable SEXFrequencyPercent 02160% 11440% With label:Variable SEXFrequencyPercent Male2160% Female1440%

Coding- Ordinal Variables Coding process is similar with other categorical variables Example: variable EDUCATION, possible coding: 0 = Did not graduate from high school 1 = High school graduate 2 = Some college or post-high school education 3 = College graduate Could be coded in reverse order (0=college graduate, 3=did not graduate high school) For this ordinal categorical variable we want to be consistent with numbering because the value of the code assigned has significance

Coding – Ordinal Variables (cont.) Example of bad coding: 0 = Some college or post-high school education 1 = High school graduate 2 = College graduate 3 = Did not graduate from high school Data has an inherent order but coding does not follow that order—NOT appropriate coding for an ordinal categorical variable

Coding: Nominal Variables For coding nominal variables, order makes no difference Example: variable RESIDE 1 = Northeast 2 = South 3 = Northwest 4 = Midwest 5 = Southwest Order does not matter, no ordered value associated with each response

Coding: Continuous Variables Creating categories from a continuous variable (ex. age) is common May break down a continuous variable into chosen categories by creating an ordinal categorical variable Example: variable = AGECAT 1 = 0–9 years old 2 = 10–19 years old 3 = 20–39 years old 4 = 40–59 years old 5 = 60 years or older

Coding: Continuous Variables (cont.) May need to code responses from fill-in-the-blank and open-ended questions  Example: “Why did you choose not to see a doctor about this illness?” One approach is to group together responses with similar themes  Example: “didn’t feel sick enough to see a doctor”, “symptoms stopped,” and “illness didn’t last very long”  Could all be grouped together as “illness was not severe” Also need to code for “don’t know” responses”  Typically, “don’t know” is coded as 9

Coding Tip Though you do not code until the data is gathered, you should think about how you are going to code while designing your questionnaire, before you gather any data. This will help you to collect the data in a format you can use.

Data Cleaning One of the first steps in analyzing data is to “clean” it of any obvious data entry errors:  Outliers? (really high or low numbers) Example: Age = 110 (really 10 or 11?)  Value entered that doesn’t exist for variable? Example: 2 entered where 1=male, 0=female  Missing values? Did the person not give an answer? Was answer accidentally not entered into the database?

Data Cleaning (cont.) May be able to set defined limits when entering data  Prevents entering a 2 when only 1, 0, or missing are acceptable values Limits can be set for continuous and nominal variables  Examples: Only allowing 3 digits for age, limiting words that can be entered, assigning field types (e.g. formatting dates as mm/dd/yyyy or specifying numeric values or text) Many data entry systems allow “double-entry” – ie., entering the data twice and then comparing both entries for discrepancies Univariate data analysis is a useful way to check the quality of the data

31 Presenting Quantitative Data Presenting quantitative data often involves the production of graphs and tables These need to be 1. selectively generated so that they make relevant arguments 2. informative yet simple, so that they aid reader’s understanding

Univariate Data Analysis Univariate data analysis-explores each variable in a data set separately  Serves as a good method to check the quality of the data  Inconsistencies or unexpected results should be investigated using the original data as the reference point Frequencies can tell you if many study participants share a characteristic of interest (age, gender, etc.)  Graphs and tables can be helpful

Frequency table Student should draw table 33

Students to convert the table to graphs 34

Univariate Data Analysis (cont.) Examining continuous variables can give you important information:  Do all subjects have data, or are values missing?  Are most values clumped together, or is there a lot of variation?  Are there outliers?  Do the minimum and maximum values make sense, or could there be mistakes in the coding?

Univariate Data Analysis (cont.) Commonly used statistics with univariate analysis of continuous variables:  Mean – average of all values of this variable in the dataset  Median – the middle of the distribution, the number where half of the values are above and half are below  Mode – the value that occurs the most times  Range of values – from minimum value to maximum value

Statistics describing a continuous variable distribution 84 = Maximum (an outlier) 2 = Minimum 28 = Mode (Occurs twice) 33 = Mean 36 = Median (50 th Percentile)

Standard Deviation Figure left: narrowly distributed age values (SD = 7.6) Figure right: widely distributed age values (SD = 20.4)

Distribution and Percentiles  Distribution – whether most values occur low in the range, high in the range, or grouped in the middle  Percentiles – the percent of the distribution that is equal to or below a certain value Distribution curves for variable AGE 25 th Percentile (4 years) 25 th Percentile (6 years)

Analysis of Categorical Data Distribution of categorical variables should be examined before more in- depth analyses  Example: variable RESIDE Number of people answering example questionnaire who reside in 5 regions of the United States

Graph Distribution curves for variable AGE 25 th Percentile (4 years) 25 th Percentile (6 years)

Analysis of Categorical Data (cont.) Another way to look at the data is to list the data categories in tables Table shown gives same information as in previous figure but in a different format Table: Number of people answering sample questionnaire who reside in 5 regions of the United States FrequencyPercent Midwest 16 20% Northeast 13 16% Northwest 19 24% South 24 30% Southwest 8 10% Total %

43 Qualitative Data Analysis (QDA) In qualitative data analysis there is a common reliance on words and images to draw out rich meaning But there is an amazing array of perspectives and techniques for conducting an investigation

44 The QDA Process Qualitative data analysis creates new understandings by exploring and interpreting complex data from sources without the aid of quantification Data source include  interviews  group discussions  observation  journals  archival documents, etc

45 Uncovering and Discovering Themes The methods and logic of qualitative data analysis involve uncovering and discovering themes that run through raw data, and interpreting the implication of those themes for research questions

46 More on the QDA Process Qualitative data analysis generally involves  moving through cycles of inductive and deductive reasoning  thematic exploration (based on words, concepts, literary devises, and nonverbal cues)  exploration of the interconnections among themes Qualitative data analysis software can help with these tasks

47 Specialist QDA Strategies There are a number of paradigm and discipline based strategies for qualitative data analysis including  content analysis  discourse analysis  narrative analysis  conversation analysis  semiotics  hermeneutics  grounded theory

48 Presenting Qualitative Data Effective presentation of qualitative data can be a real challenge You’ll need to have a clear storyline, and selectively use your words and/or images to give weight to your story

49 Drawing Conclusions Your findings and conclusions need to flow from analysis and show clear relevance to your overall project Findings should be considered in light of  significance  current research literature  limitations of the study  your questions, aims, objectives, and theory

50