RELIABILITY Prepared by Marina Gvozdeva, Elena Onoprienko, Yulia Polshina, Nadezhda Shablikova.

Outline: 1. Defining reliability 2. How to measure reliability 3. Reliability coefficient 4. Observed score and true score 5. SEM 6. Item analyses

Tests as measuring tools: ‘A test is something (as a series of questions or exercises) for measuring the skill, knowledge, intelligence, capacities, or aptitudes of an individual or group’ (Merriam-Webster Dictionary Online, 2013)

Tests as measuring tools: ‘…a language test is a procedure for gathering evidence of general or specific language abilities from performance on tasks designed to provide a basis for predictions about an individual’s use of those abilities in real world contexts.’ (McNamara, 2000:11)

A reliable test: A perfectly reliable test is ‘one which would give precisely the same results for a particular set of candidates regardless of when it happened to be administered.’ (Hughes, 1989:31)

An unreliable test: A completely unreliable test is one ‘which would give sets of results unconnected with each other.’ (Hughes, 1989:32)

Strategies to estimate reliability: We can use statistics to estimate how reliable a test is: test-retest reliability; equivalent (parallel) forms reliability; internal consistency reliability.

Test-retest reliability: ‘calculating a reliability estimate by administering a test on two occasions and calculating the correlation between the two sets of scores’ (Brown, 2002)

Equivalent (parallel/alternative) forms reliability: ‘calculating a reliability estimate by administering two forms of a test and calculating the correlation between the two sets of scores’ (Brown, 2002)
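
Both the test-retest and the equivalent-forms estimates come down to correlating two sets of scores from the same candidates. A minimal Python sketch of that step, assuming the Pearson product-moment correlation is the coefficient meant (the slides do not name one). The function name `pearson_r` and the scores are hypothetical, made up for illustration:

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least two scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares of deviations from the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


# Hypothetical scores for five candidates on two occasions (or on two forms).
first = [42, 55, 61, 48, 70]
second = [45, 53, 64, 47, 68]

reliability = pearson_r(first, second)
print(f"estimated reliability: {reliability:.2f}")
```

The closer the value is to 1.0, the more similarly the two administrations rank the candidates.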

Internal consistency reliability: ‘calculating a reliability estimate based on a single form of a test administered on a single occasion using internal consistency equations’ (Brown, 2002)

Internal consistency reliability: reliability is calculated from a single administration of the test; some commonly reported figures (reliability coefficients) are: split-half; Cronbach’s alpha. These are calculated automatically by many statistical software packages.
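
As a rough illustration of what such packages compute, here is a sketch of Cronbach's alpha using its usual formula: alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), where k is the number of items. The formula is not given on the slides. The function name `cronbach_alpha` and the small score matrix are hypothetical.

```python
from statistics import pvariance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a matrix with one row per test taker, one column per item."""
    k = len(item_scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    columns = list(zip(*item_scores))  # one tuple of scores per item
    item_variance_sum = sum(pvariance(col) for col in columns)
    total_variance = pvariance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical data: five test takers, four items scored 1 (right) or 0 (wrong).
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

alpha = cronbach_alpha(scores)
print(f"Cronbach's alpha: {alpha:.2f}")
```

Population variances are used throughout. Sample variances would give the same alpha, as long as the same kind is used for the items and for the totals.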

Split-half reliability: the test is split in half (e.g. odd / even) creating “equivalent forms”; the two “forms” are correlated with each other; the correlation coefficient is adjusted to reflect the entire test length.
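
The three steps above can be sketched in Python. This assumes the length adjustment is the Spearman-Brown formula, r_full = 2r / (1 + r), which is the usual choice but is not named on the slide. All function names and the item scores are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


def spearman_brown(r_half):
    """Adjust a half-test correlation to estimate full-length reliability."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(item_scores):
    """Odd/even split-half estimate; one row per test taker, one column per item."""
    odd_totals = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    return spearman_brown(pearson_r(odd_totals, even_totals))


# Hypothetical data: six test takers, six items scored 1 (right) or 0 (wrong).
scores = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

estimate = split_half_reliability(scores)
print(f"split-half reliability: {estimate:.2f}")
```

The adjustment step matters because each half is only half as long as the real test, and shorter tests tend to be less reliable, so the raw half-test correlation underestimates the reliability of the whole test.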

Reliability coefficient: range: -1.0 (inverse relationship) through 0.0 (totally unreliable test) to 1.0 (perfectly reliable test); reliability coefficients are estimates of the proportion of systematic variance in the test scores; a lower reliability coefficient means greater measurement error in the test scores.
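
The link between the reliability coefficient and measurement error can be written out under the standard classical test theory assumptions: an observed score X is a true score T plus an error E, and the error is uncorrelated with the true score. The symbols T, E, S_T and S_E are introduced here for illustration. S_x and r_xx' follow the notation of the SEM slides.

```latex
\begin{align}
X &= T + E \\
S_x^2 &= S_T^2 + S_E^2 \\
r_{xx'} &= \frac{S_T^2}{S_x^2} = 1 - \frac{S_E^2}{S_x^2}
\end{align}
```

Read this way, a coefficient of 0.80 would mean that about 80% of the variance in observed scores is systematic and about 20% is error.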

How high should reliability be? (Pope n.d.)

Standard error of measurement (SEM): This allows us to take the score that the test taker got on the test (the observed score) and estimate what their true level of ability (the true score) might be. We cannot know the true score exactly, so our estimate of it must be a range of scores rather than a single number.

Standard error of measurement (SEM): Maria’s scores: 50, 50, 49, 52, 50, 51, 49, 48, 50. True score = observed score ± error.

Standard error of measurement (SEM): We would expect the student to score near the centre of the distribution most of the time.

Standard error of measurement (SEM): The standard error of measurement (SEM) is the standard deviation of all those scores averaged across persons and test administrations (Brown, 2002).
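
To make the idea concrete for a single test taker, the spread of Maria's repeated scores can be computed directly. This sketch assumes the run of digits on the earlier slide is to be read as nine two-digit scores. It covers only one person, so it illustrates the idea rather than Brown's full definition, which averages across persons and administrations.

```python
from statistics import mean, pstdev

# Assumption: the slide's digits are nine two-digit scores from repeated sittings.
maria_scores = [50, 50, 49, 52, 50, 51, 49, 48, 50]

centre = mean(maria_scores)    # where her scores cluster
spread = pstdev(maria_scores)  # how far they typically fall from that centre

print(f"mean = {centre:.2f}, standard deviation = {spread:.2f}")
```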

Standard error of measurement (SEM): $SEM = S_x \sqrt{1 - r_{xx'}}$, where $S_x$ is the standard deviation of raw scores and $r_{xx'}$ is the reliability coefficient.
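
A short Python sketch of this formula. The function name is hypothetical, and the standard deviation (10) and reliability (0.91) are invented values chosen so that the result comes out at about 3.

```python
from math import sqrt


def standard_error_of_measurement(sd, reliability):
    """SEM = S_x * sqrt(1 - r_xx') for a raw-score SD and a reliability coefficient."""
    if sd < 0:
        raise ValueError("standard deviation cannot be negative")
    if reliability > 1:
        raise ValueError("reliability coefficient cannot exceed 1.0")
    return sd * sqrt(1 - reliability)


# Hypothetical test: raw-score standard deviation 10, reliability 0.91.
sem = standard_error_of_measurement(sd=10.0, reliability=0.91)
print(f"SEM = {sem:.2f}")
```

With a perfectly reliable test (coefficient 1.0) the SEM is 0. As reliability falls, the SEM grows toward the full standard deviation of the scores.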

Standard error of measurement (SEM): ±1 SEM = 68% confidence; ±2 SEM = 95% confidence; ±3 SEM = 99.7% confidence.

Standard error of measurement (SEM): Observed score = 50, SEM = 3. 68%: from 47 to 53; 95%: from 44 to 56.
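
The bands in this example are simply the observed score plus or minus one, two or three SEMs. A small sketch that reproduces them (the function name `score_band` is hypothetical):

```python
def score_band(observed, sem, n_sem):
    """Range of likely true scores: observed score plus or minus n_sem SEMs."""
    margin = n_sem * sem
    return observed - margin, observed + margin


observed_score = 50
sem = 3

# Confidence levels follow the 68 / 95 / 99.7 rule for a normal distribution.
for n_sem, confidence in [(1, "68%"), (2, "95%"), (3, "99.7%")]:
    low, high = score_band(observed_score, sem, n_sem)
    print(f"{confidence}: from {low} to {high}")
```

The same arithmetic gives the 99.7% band, which the example does not list: from 41 to 59.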
