2What is testing?Why do we have so many tests?What are the pros and cons to testing?How can we use testing to improve . . .?What types of tests do we take?
3THE CONSEQUENCE OF TESTING The results of psychological tests can change individuals’ lives in profound ways.
4DEFINITION OF A TESTA test is a standardized procedure for sampling behavior and describing it with categories or scores.
5DEFINITION OF A TEST (Cont.) Although different tests have different formats and applications, most tests have defining features/characteristics:Standardized procedureBehavior sampleScores or categoriesNorms or standardsPrediction of nontest behavior
6Test vs. Assessment A test is also different from an assessment Assessment refers to the entire process of collating information about individuals and subsequently using it to make predictionsTests represent only one source of information within the assessment processe.g. spelling is one aspect of writing, and so to assess it we would use a spelling test. Whereas to gauge up someone’s general writing ability we would have to assess the entire process (spelling, style, grammar, punctuation etc.)
7Psychological Testing A psychological test is a standardized measure of a sample of a person’s behavior that is used to measure the individual differences that exist among people.
8Psychological Testing All psychological tests have one characteristic that makes them both fascinating and remarkably practical – they try to make it possible to find out a great deal about a person in a very short period of time.
9Psychological Testing Tests can be useful in predicting how well a person might do in a particular career; in assessing one’s individual’s desires, interests, and attitudes; and in revealing psychological problems.These tests can also show how an individual compares to others.In some cases, tests can even be used to predict future behavior (though these are very controversial.)
10Psychological Testing There are also some dangers to testing.One of these dangers is that we tend to forget that tests are merely tools for measuring and predicting human behavior.In some cases, society starts to think of these tests (such as IQ tests), are the FINAL fact, but nothing could be further from the truth.
11Types of Psychometric Tests Two types of psychological tests are used by personnel selection practitioners:Tests of cognitive abilityCognitive assessment tests attempt to measure an individual’s ability to process information from their environmentTests of personality measuresPersonality measures are more concerned with people's dispositions to behave in certain ways in certain situations
12Different Categories of Psychometric Tests There are three categories of psychometric tests in use by psychologists:Normative tests – most psychometric tests where data exists which tell us the range of scores expected from the population under consideration e.g. IQ scoresCriterion referenced tests – tests commonly used in education where a candidate has to meet some pre- arranged standard.Idiographic tests – tests used in therapy to observe an individual’s progress over time
13Applications of Personality Tests Criminal psychologists might employ questionnaires to measure impulsivity and its relation to crimeHealth psychologists might measure people’s optimism in relation to their response to cancer diagnosisOccupational psychologists often employ personality tests to predict job performance and job suitability e.g. Furnham (1992) reported that workers with high ‘negative affect’ tend to be less productive and have less job satisfaction etc.
14Standardization Tests must be standardized. Standardization refers to two things.First, standardized tests must be administered and scored the same way every time.
15Establishing NormsOnce a test result is obtained, the examiner must translate the score into something useful.Suppose you score a 32 out of 50 on a test…What does that score mean?Well, if it is a reliable test, it means that you should be able to understand a certain amount of information on every test, regardless of the way in which it is written.In other words, this score should predict how you will perform at a given level.
16Establishing NormsWhen psychologists design a test to be used in a variety of settings, they usually set up a scale for comparison by establishing norms.This is usually done by transforming raw test scores into a percentile system.What is a percentile system?Percentile System: Ranking of test scores that indicates the ratio of scores lower and higher than a given score.In other words, you are all ranked from highest to lowest…Congratulations.
17Establishing NormsIn order to make comparisons, tests are first given to a large number of people in a sample group – let’s say 6th graders or army privates.Percentiles are then established on the basis of the scores that these people receive. (so, in other words, tests get harder as more people take them.)These percentiles are called “norms.”Norms: Standard of comparison for test results developed by giving the test to large, well-defined, groups of people.
18Establishing NormsYour percentile on an aptitude test, such as Scholastic Assessment Tests (SAT), reflects your standing among people of your age and grade who have taken these exams.Though, you should remember that norms are not standards, they merely represent what the majority of a given group will do.Word…
19Test ReliabilityThe ability of a test to give the same results under similar conditions.There are 3 basic ways of determining a tests reliability:1. Test-Retest Reliability: If a person retakes the test or takes a similar test within a short time after the first testing, does he or she receive approximately the same score?2. Interscorer Reliability: Does the test yield the same results when scored at different times by different people?3. Split-Half Reliability: Randomly dividing the test items in half and scoring each half separately. The two scores should be approximately the same.
20Test Validity Any test may be reliable but still not valid. Validity: The ability of a test to measure what it is intended to measure.For example; a test that consists primarily of Spanish vocabulary lists will not measure ability for engineering. A history test will not measure general learning ability. A test you take in physical education will not measure your ability to do math, etc…
21Test ValidityDetermining the validity of a test is more complex than assessing its reliability.One of the main methods for measuring validity is to find out how well a test predicts performance – Predictive validity.For example; a group of psychologists design a test to measure management ability. They ask questions about management systems, attitudes towards employees, and other relevant information.Will the people who score high on the test really make good managers?
22Controlling the use of Psychological Tests Ensure that the test is given by a qualified examiner and that the scores are properly usedTo prevent general familiarity with the test content which would invalidate the test
23Controlling the use of Psychological Tests cont.. Qualified ExaminerSelection of testAdministration and scoringInterpretation of scoresRole of Test UserTest scores are used as source of information in reaching practical decisionsUsers are teachers, counselors, adminisSecurity of Test Content and Communication of Test Information
24View from test taker’s perspective Test Anxiety – response was stimulated by ready visibility and its clearly apparent detrimental effects on performanceSchool achievements and intelligent test scores yielded significant negative correlationsIt appears likely that the relation between the anxiety and test performance is non linear, a large amt of anxiety is beneficial while large amount is detrimental
25Problems with Psychometric Tests Social Desirability – when faced with a psychometric test many people feel they are being judged and so alter their answers accordinglyPeople might engage in social desirability for two reasons:Self-deception – individuals are overly optimistic in their perceptions of their own positive personality features and play down their perceived negative aspectsImpression management – individuals try to appear ‘nice’ because they fear social disapproval
26Mood and Environmental Influence Mood does seem to play a part in how people go about performing in tests, especially those concerning personalitypeople in a good mood might answer the questionnaire completely differently than if they were in a bad moodFeatures of the environment (noise, heat & light) might also have an impact on our moods and our cognitive abilitiesHancock (1986) has shown that high temperature has a significant negative effect on vigilance, attention, memory and reaction time
27Ecological ValidityResearch that lacks ecological validity focuses on what an individual can do in a research environment instead of what they are usually doing in their everyday livesIf a test is not relevant to an individual’s lifestyle an individual probably will not perform well at itThis might be due to a lack of motivation or lack of relevant experience with the type of problem set than any lack of intellectual capability
28Cultural BiasA contentious issue in the field of psychometric testing is the possibility of bias in such tests against members of ethnic subgroups of the populatione.g. newly arrived immigrants will have difficulty with an intelligence test which asks them to name past leaders of the country to which they have recently immigratedAt the present time most standardised psychometric tests are based on western definitions and western cultural practices
29Are there Culture-Free Tests ? Attempts have been made to develop culture-free tests of intelligence, but on the whole these attempts have not been successful. This is due to several factors:Conceptions of intelligence vary widely from culture to cultureeven if the content of a test can be made culture- free, culture itself will still affect the results through directing attitudes towards tests, test-taking, competition, and so on
30Examples of RelativelyCulture-Free Psychometric Tests The Leiter International Performance Scale – Revised (Roid & Miller, 1997)covers four domains of functioning: reasoning, visualisation, attention and memoryThe Ravens Progressive Matrices (Court & Ravens, 1995)covers general cognitive abilityBoth the above tests are untimed and can be administered using virtually no language
31TEST WHY USE TESTS MAIN PARTICIPANTS TYPES OF TESTS Test userTest takerTest administratorTest publisherTest authorTest reviewerTest sponsorTYPES OF TESTSAPPLICATION OF TESTS
32PROBLEMCompute the corresponding transformed scores for the given Z scores.ZTPercentileStanine1.5002.0000.7500.555
33SOLUTIONZTPercentileStanine42.522.6641.5006593.328.52.0007097.729.50.75057.577.3470.55555.55716.61For 1st value, For 2nd value, For 3rd value,. ˙. T = 10(- 7.50) ˙. T = 10(1.500) ˙. T = 10(2) + 50. ˙. T = ˙. T = ˙. T = 70T = 10(Z) T = 10(Z) T = 10(Z) + 50S = 2(Z) S = 2(Z) S = 2(Z) + 5.5. ˙. S = 2( ) ˙. S = 2(1.5) ˙. S = 2(2) + 5.5. ˙. S = ˙. S = ˙. S =. ˙. S = 9.5
34Percentile, Percentile, Percentile, 0.5 0.5 0.5 0.5 0.2734 0.4332 Z = Z = 0P = P = 50(0.5 – )Z = Z = 1.500P = P = 93.32Percentile,0.50.50.4772Z = Z = 2P = P = 97.72
35For 4th value, For 5th value, T = 57.5 , S = 7 T = 55.55 , S = 6.61 0.50.50.50.50.210.2734Z = Z = 0.750P = 77.34Z = Z = 0.555P = 71
36PROBLEMThe following are the percentile scores from a Normal Distribution. Find out theCorresponding Z, T and Stanine scores.Sr.NoPercentileZTStanine116225350475598