Math Candel Maastricht University. 1.Internal validity Do your conclusions reflect the “true state of nature” ? 2.External validity or generalizability.

Slides:



Advertisements
Similar presentations
Andrea M. Landis, PhD, RN UW LEAH
Advertisements

Do you think Research in psychology is Important? Why or why not?
Validity (cont.)/Control RMS – October 7. Validity Experimental validity – the soundness of the experimental design – Not the same as measurement validity.
Variance in Research Design Sources, threats to internal validity, and “Noise”
Defining Characteristics
Validity of Quantitative Research Conclusions. Internal Validity External Validity Issues of Cause and Effect Issues of Generalizability Validity of Quantitative.
Experimental Research Designs
Validity In our last class, we began to discuss some of the ways in which we can assess the quality of our measurements. We discussed the concept of reliability.
Decision Errors and Power
Lab exam when: Nov 27 - Dec 1 length = 1 hour –each lab section divided in two register for the exam in your section so there is a computer reserved for.
1 Internal and External Validity and Method of Control Ron D. Hays, Ph.D. - UCLA Department of Medicine: Division of General.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 4: An Overview of Empirical Methods 1.
Experiment Basics: Variables Psych 231: Research Methods in Psychology.
EXPERIMENTAL DESIGNS Criteria for Experiments
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Validity Validity of measurement Validity of research
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Reliability and Validity in Experimental Research ♣
MSc Applied Psychology PYM403 Research Methods Validity and Reliability in Research.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Statistics Micro Mini Threats to Your Experiment!
Non-Experimental designs: Developmental designs & Small-N designs
Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Sampling and Experimental Control Goals of clinical research is to make generalizations beyond the individual studied to others with similar conditions.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 5 Making Systematic Observations.
Psych 231: Research Methods in Psychology
Variables cont. Psych 231: Research Methods in Psychology.
Psychology 242 Research Methods II Dr. David Allbritton
Validity, Reliability, & Sampling
Information from Samples Alliance Class January 17, 2012 Math Alliance Project.
Validity Lecture Overview Overview of the concept Different types of validity Threats to validity and strategies for handling them Examples of validity.
Chapter 8 Experimental Research
Experimental Design The Gold Standard?.
Applying Science Towards Understanding Behavior in Organizations Chapters 2 & 3.
Experimental Research
Chapter 1: Introduction to Statistics
V ALIDITY IN Q UALITATIVE R ESEARCH. V ALIDITY How accurate are the conclusions you make based on your data analysis? A matter of degree Non-reification.
Introduction To Research 589(A)
INTRO TO EXPERIMENTAL RESEARCH, continued Lawrence R. Gordon Psychology Research Methods I.
1 Chapter 3: Experimental Design. 2 Effect of Wine Consumption on Heart Disease Death Rate **Each data point represents a different country.
Understanding Research Design Can have confusing terms Research Methodology The entire process from question to analysis Research Design Clearly defined.
Independent vs Dependent Variables PRESUMED CAUSE REFERRED TO AS INDEPENDENT VARIABLE (SMOKING). PRESUMED EFFECT IS DEPENDENT VARIABLE (LUNG CANCER). SEEK.
Producing Data 1.
Methodology Matters: Doing Research in the Behavioral and Social Sciences ICS 205 Ha Nguyen Chad Ata.
Experiments and Causal Inference ● We had brief discussion of the role of randomized experiments in estimating causal effects earlier on. Today we take.
Research Methods ContentArea Researchable Questions ResearchDesign MeasurementMethods Sampling DataCollection StatisticalAnalysisReportWriting ?
Experiment Basics: Variables Psych 231: Research Methods in Psychology.
Objectives To understand the difference between parametric and nonparametric Know the difference between medically and statistically significant Understand.
Experimental Research
Research Strategies. Why is Research Important? Answer in complete sentences in your bell work spiral. Discuss the consequences of good or poor research.
Chapter 6 Research Validity. Research Validity: Truthfulness of inferences made from a research study.
Research Design ED 592A Fall Research Concepts 1. Quantitative vs. Qualitative & Mixed Methods 2. Sampling 3. Instrumentation 4. Validity and Reliability.
Experimental Research Methods in Language Learning Chapter 5 Validity in Experimental Research.
Chapter 8 – Lecture 6. Hypothesis Question Initial Idea (0ften Vague) Initial ObservationsSearch Existing Lit. Statement of the problem Operational definition.
1 Module 3 Designs. 2 Family Health Project: Exercise Review Discuss the Family Health Case and these questions. Consider how gender issues influence.
Introduction to Validity True Experiment – searching for causality What effect does the I.V. have on the D.V. Correlation Design – searching for an association.
Experimental Research Design Causality & Validity Threats to Validity –Construct (particular to experiments) –Internal –External – already discussed.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Can you hear me now? Keeping threats to validity from muffling assessment messages Maureen Donohue-Smith, Ph.D., RN Elmira College.
CRITICALLY APPRAISING EVIDENCE Lisa Broughton, PhD, RN, CCRN.
Experiments Why would a double-blind experiment be used?
Understanding Results
NRSG 790: Methods for Research and Evidence Based Practice
Chapter Eight: Quantitative Methods
Information from Samples
Chapter 6 Research Validity.
Experimental Clinical Psychology Session I
Experiment Basics: Variables
Experiments: Part 2.
Group Experimental Design
Presentation transcript:

Math Candel Maastricht University

1.Internal validity Do your conclusions reflect the “true state of nature” ? 2.External validity or generalizability Do your conclusions hold for other persons, situations, and times ?

1.Bad operationalisation of the constructs you want to measure 2.Incorrect design of the study 3.Errors in the statistical analysis 4.Wrong choice of the sample 5.Sample too small 6.Non-response, dropout and non-compliance

Does active coping lead to an increase in the quality of life as experienced by rheumatic patients ? If we carry out an empirical study, and the answer is yes, how sure can we be that this really is the case ?

How did we measure quality of life ? How good is this measure ? E.g. Sickness Impact Profile (SIP; 1989, 1994): –Emotional stability –Social behavior –Somatic Autonomy –Mobility control –Mobility range –Psychological autonomy

Social behavior (e.g.): I am cutting down the length of visits with friends I am doing fewer community activities I stay away from home only for brief periods of time Emotional stability (e.g.): I often act irritable toward those around me, for example, snap at people, give sharp answers, criticize easily I act disagreeable to family members, for example, I act spiteful, I am stubborn

Suppose we administer this scale on a different day, does it lead to the same scores on the different subscales ? (1) How reliable is the scale ? One could calculate a correlation between scores obtained at two different occasions: test-retest reliability Remedy for unreliability: Repeated measurements Motivation for multi-item measures !

(2) Are we sure that the scale is measuring what we would like to measure ? Does the scale have validity ? If the scale is reliable, this is still not guaranteed. The scale may still be invalid !

Content validity Is the scale representative of the domain to be measured ? Is there something missing ? Nomological or construct validity Does the scale behave as one would expect the scale to behave (e.g. on theoretical grounds) ? E.g. the score on SIP should correlate with the score on another quality of life measure

1.Respondent’s tendency to agree (or disagree) with each statement 2.Respondent’s tendency to give answers that are socially accepted 3.Respondent’s tendency to give extreme answers 4.Respondent’s tendency to bring some variation into his or her answers 5.Desire of the respondent to please the interviewers/researchers ……..

How good was the design of the study ? Are alternative explanations possible for the relations found ? Suppose the design of the study was: BaselineInterventionPostmeasurement What problems may be involved ?

Suppose the weather was good during the period between pre- and post-measurement, so there were less complaints related to the disease  History effects Rheuma is a progressive disease. One may expect a decrease in quality of life over time.  “Maturation”

Remedy: install a control group BaselineInterventionPostmeasurement BaselinePost measurement

Other effects that also may be under control: Testing effects  E.g. subjects are made aware of their functional health status and increased complaints may be due to pre-measurement Selection effects  The subjects are not assigned in a random fashion to the treatment and control group

Other possible threats: Diffusion of treatments  E.g. when patients are retrieved from the same hospital (rheumatology clinic) they may learn about each others therapy Remedy: Randomize at the level of hospitals

Observational studies: confounders ! E.g. the relation between smoking and blood pressure may also be explained by differences in physical exercise and differences in nutrition of smokers and non-smokers Remedies: Stratify on confounders, and perform analysis for each stratum Look for persons in both groups that match each other on confounders Include confounders as covariates in the analysis

Was the selection based on the variable that we would like to find an effect upon ? Watch out for regression towards the mean !

Quality of life before Quality of life after

Quality of life before Quality of life after

Quality of life before Quality of life after

Quality of life before Quality of life after Mean

Quality of life before Quality of life after Mean

Quality of life before Quality of life after Mean

Quality of life before Quality of life after Mean

Quality of life before Quality of life after Mean

Quality of life before Quality of life after Mean

Quality of life before Quality of life after Mean

In the treatment group not all people are strongly motivated to follow the program (and possibly to do some exercises at home) So a therapy may be effective but this may not be found in the study. Should we leave out the non-compliers ? Non-compliance: not all people are willing to follow the procedures as prescribed by a doctor or a program

The more often you do a statistical test, the larger the probability that you may find an effect, even if there is none The chance of a type I error may increase

Increase of type I error due to multiple testing

Select a sample that is large enough to be able to detect an effect (in an experiment) or a relation (in an observational study) If there is an effect or relation, but we are not able to detect it, we make a type II error

Relation between type II error and sample size

How can one maximize the external validity or the generalizability ? –Take a random sample from the population: best alternative, but not always feasible –Take a heterogeneous sample: stratified sampling: in case of interaction no average effects can be reported –Take a sample with subjects similar to modal persons for the strata: even per stratum there is some risk in interpreting relations

The target population is not reached: Are the participants different from the non- participants ? Are the people that dropout from the study different from the others ? Remedies: Minimize nonparticipation and dropout Obtain information on nonparticipation and dropout The target population is reached, but is one able to generalize to other target populations ?

Interaction between selected sample and treatment Interaction between time and treatment Interaction between setting and treatment Strategies to get insight into these interactions: -Replicate study with other groups, at other times and within other settings -Search literature for prior evidence