Evaluating a Test: Test Usefulness (Bachman and Palmer, 1996)

1 Evaluating a Test: Test Usefulness (Bachman and Palmer, 1996)
Anne Mullen, Université Laval, October 2014

2 Test Validity
The Progressive Matrix of Validity (Messick, 1989) was conceived:
to control the quality of the evaluation
to guarantee that the results of the evaluation are precise
to ensure that the interpretations of the results are fair

3 Plan
1. Qualities of test usefulness: definitions and questions
2. Creating a valid test
3. Discussion and follow-up questions

4 Six Qualities of Test Usefulness
Reliability Construct Validity Authenticity Interactiveness Impact Practicality

5 Six Qualities Reliability Construct Validity Authenticity
Interactiveness Impact Practicality

6 Reliability
seeks to ascertain that the results of an evaluation are similar
measures the consistency of results from one evaluation to another
verifies the variation between results in different evaluations
a minimal level of reliability is determined by the context
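
The slides do not name a statistic for reliability. As one possible illustration only, the consistency of results between two administrations of the same evaluation is often summarized with a Pearson correlation (test-retest reliability). The sketch below assumes that approach; the function name `pearson_r` and the scores are hypothetical and do not come from the presentation.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equally long lists of scores."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two score lists of the same length (>= 2)")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary for a correlation to be defined")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores (out of 10) for the same five test-takers
# in two sessions of the same evaluation.
session_1 = [7, 5, 9, 6, 8]
session_2 = [8, 5, 9, 5, 7]

r = pearson_r(session_1, session_2)
print(f"test-retest correlation: {r:.2f}")
```

A value close to 1 suggests that test-takers are ranked similarly in both sessions. What counts as an acceptable minimum depends on the context, as the slide notes.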

7 Is this evaluation reliable?
does the evaluation allow for comparison between test-takers?
does the evaluation allow for comparison with other groups of test-takers in the same session or in different sessions?

8 Six Qualities Reliability Construct Validity Authenticity
Interactiveness Impact Practicality

9 Construct Validity
the extent to which the results of an evaluation can be interpreted as an indicator of the ability that the evaluation is measuring
is said to exist if the results of the evaluation are valid in a specific context and can be generalized (valid in a similar but different context)

10 Does this evaluation measure the correct construct?
does the evaluation actually evaluate the desired ability?
what other abilities are measured?

11 Six Qualities Reliability Construct Validity Authenticity
Interactiveness Impact Practicality

12 Authenticity
the correspondence between the characteristics of the tasks in the target context and those of the tasks in the evaluation
helps in generalizing the results

13 Is the evaluation authentic?
will the test-takers need to do similar activities in their present or future, academic or work lives?

14 Six Qualities Reliability Construct Validity Authenticity
Interactiveness Impact Practicality

15 Interactiveness
the extent to which, and the way in which, the test-taker's individual characteristics are used when completing the tasks of the evaluation
includes
a) the goal
b) the specific group being evaluated
c) the specific context of the evaluation

16 Is the evaluation interactive?
does the evaluation reflect the classroom activities?
does the evaluation lead the test-taker to use what has been taught and learned?

17 Six Qualities Reliability Construct Validity Authenticity
Interactiveness Impact Practicality

18 Impact
the effects of the evaluation on
a) society (employers),
b) educational systems (administrators, teachers) and
c) other stakeholders (parents and test-takers)
the consequences of the evaluation must be evaluated for each stakeholder

19 What is the impact of the evaluation?
how are the results of the test used?
is anyone affected negatively by the evaluation?
who benefits from the evaluation?

20 Six Qualities Reliability Construct Validity Authenticity
Interactiveness Impact Practicality

21 Practicality
the measurement and evaluation of the resources required:
a) human (test correctors, evaluators of the evaluation)
b) material (space and equipment)
c) time (test creation, correction, analysis)

22 Is the evaluation practical?
can it be completed in the allotted time?
can it be corrected easily and fairly for all test-takers?
what resources are needed and are they readily available?

23 Determining Test Usefulness
Three principles to follow:
find a middle ground between the six qualities
have the six qualities combined and balanced
evaluate for the context

24 Six Qualities Reliability Construct Validity Authenticity
Interactiveness Impact Practicality

25 Creation of an evaluation
You need to design an evaluation for the following list of words: to devour, to dirty, to imbibe, to purchase, to relish, to swallow, to savour, to scorch, to slip, to taste.

26 Context
The class is an intermediate 4-skills ESL class with 23 students. While listening to a text that included these ten words, test-takers were asked to answer comprehension questions. The 10 words were listed and defined because of their presumed level of difficulty. The teacher also explained the meaning of these words orally and answered any questions.

27 Is the test useful?
does the evaluation allow for comparison between test-takers and groups over time? (Reliability)
does the evaluation actually evaluate the desired ability? do other abilities intervene? (Construct validity)

28 Is the test useful?
does the evaluation reflect the test-taker's present-day or future reality? (Authenticity)
does the evaluation lead the test-taker to use what has been taught and learned? (Interactiveness)

29 Is the test useful?
what is the effect of the evaluation? (Impact)
is the evaluation easy to administer? (Practicality)

30 Thank you

