Importance of Testing in Educational Situations

Presentation transcript:

Importance of Testing in Educational Situations
- To determine the progress of students
- To ascertain achievement of educational objectives
- To make sound decisions based on evaluation
- To know how much learning has taken place

Teaching & Testing Relationship
- Testing at the service of teaching
- Washback / backwash effect: positive / negative

Concepts & Terms
- Test: A procedure designed to elicit a certain behavior from which one can make inferences about certain characteristics of an individual.
- Assessment: An ongoing process and a kind of measurement that encompasses a wider domain than a test and is carried out in direct and indirect ways.

Concepts & Terms
- Measurement: The process of quantifying individuals' characteristics according to specific rules and procedures.
- Evaluation: The systematic gathering of information for the purpose of making decisions.
- Qualitative vs. quantitative evaluations

Teaching–Assessment Relation
[Diagram labels: TEACHING, ASSESSMENT, TESTS]

[Diagram: areas labeled Test, Measurement, and Evaluation, with regions numbered 1 to 5]

1. Non-test, non-measure evaluation: qualitative description of students' performance
2. Non-test measure for evaluation: teacher's ranking for assigning grades
3. Test for evaluative purpose: achievement testing
4. Test for non-evaluative purpose: proficiency test for research
5. Non-test measure for non-evaluative purpose: assigning code numbers to subjects for research
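The five cases can be read as combinations of three yes/no properties of an activity. The following is a minimal Python sketch of that reading, assuming (as the five cases imply, though the slides do not state it outright) that every test is a kind of measurement. The function name `region` is hypothetical and introduced only for illustration.

```python
def region(is_test: bool, is_measurement: bool, is_evaluation: bool) -> int:
    """Return the region number (1-5) matching the five cases listed above."""
    if is_test and not is_measurement:
        # Assumption: a test is always a kind of measurement.
        raise ValueError("a test is assumed to be a kind of measurement")
    if is_test:
        # Regions 3 and 4: tests used with or without an evaluative purpose.
        return 3 if is_evaluation else 4
    if is_measurement:
        # Regions 2 and 5: non-test measures.
        return 2 if is_evaluation else 5
    if is_evaluation:
        # Region 1: evaluation with neither a test nor a measure.
        return 1
    raise ValueError("activity falls outside all three categories")


# Achievement testing: a test, hence a measure, used for evaluation.
print(region(is_test=True, is_measurement=True, is_evaluation=True))
```

For instance, assigning code numbers to research subjects is a measure that is neither a test nor an evaluation, so it maps to region 5.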

Nominal Scale
- Not really a 'scale' because it does not scale objects along any dimension. It simply labels objects and gives the researcher the least amount of information about participants.
- Gender: Male = 1, Female = 2
- Religious affiliation: Catholic = 1, Protestant = 2, Jewish = 3, Muslim = 4, Other = 5
- Yes/no responses
- Categorizing subjects by hair colour
- Marital status
- Race
- Political party affiliation
- College major
- Birthplace

Nominal data are often generated in studies using a questionnaire design from closed, forced-choice questions, e.g. type of pet (cat, dog, rat, etc.).

Ordinal Scale
- Numbers are used to place objects in order, but there is no information regarding the differences (intervals) between points on the scale.
- Symptoms of depression from a psychiatric assessment: None = 0, Mild = 1, Moderate = 2, Severe = 3
- Ranking students according to frequency of spelling errors
- Scores on a Likert questionnaire: Strongly agree = 5, Agree = 4, No opinion = 3, Disagree = 2, Strongly disagree = 1

Interval Scale
- An interval scale is a scale on which equal intervals between objects represent equal meaningful differences.
- Example: determining scores on a grammar test
- A 10-degree difference has the same meaning anywhere along the scale.

Ratio Scale
- Ratio scales have a true zero point, so ratios between values are meaningful.
- Examples: physical scales of time, length, weight, speed, and absolute temperature (Kelvin scale)
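To see why a true zero point matters, temperature can be compared on an interval scale and on a ratio scale. The sketch below is illustrative only: the two Celsius readings are made-up values, and `celsius_to_kelvin` is a helper introduced here, not something taken from the slides.

```python
def celsius_to_kelvin(celsius: float) -> float:
    """Convert a Celsius reading (interval scale) to Kelvin (ratio scale)."""
    return celsius + 273.15


cool_c, warm_c = 10.0, 20.0  # hypothetical readings

# Differences are meaningful on both scales and have the same size.
diff_c = warm_c - cool_c
diff_k = celsius_to_kelvin(warm_c) - celsius_to_kelvin(cool_c)

# Ratios are only meaningful on the scale with a true zero (Kelvin).
naive_ratio = warm_c / cool_c
kelvin_ratio = celsius_to_kelvin(warm_c) / celsius_to_kelvin(cool_c)

print(f"difference: {diff_c:.1f} C, {diff_k:.1f} K")
print(f"Celsius ratio: {naive_ratio:.2f}, Kelvin ratio: {kelvin_ratio:.3f}")
```

The Celsius ratio suggests the warmer reading is "twice as hot", but that figure depends on where the arbitrary zero sits. Only the Kelvin ratio describes the physical quantity.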

The categories of the variable:

Property          Nominal   Ordinal   Interval   Ratio
Distinctiveness   +         +         +          +
Ordering          -         +         +          +
Equal interval    -         -         +          +
Absolute zero     -         -         -          +
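The cumulative pattern of properties across the four scales can be written down as a small lookup. This is a rough Python sketch rather than anything given in the slides. The scale-to-property mapping follows the table, while the mapping from statistics to the property each one relies on (`STAT_REQUIRES`) is an added assumption based on the usual textbook convention. All names are hypothetical.

```python
# Properties of each measurement scale, as listed in the table.
SCALE_PROPERTIES = {
    "nominal": {"distinctiveness"},
    "ordinal": {"distinctiveness", "ordering"},
    "interval": {"distinctiveness", "ordering", "equal_interval"},
    "ratio": {"distinctiveness", "ordering", "equal_interval", "absolute_zero"},
}

# Assumption (not from the slides): the property each summary relies on.
STAT_REQUIRES = {
    "mode": "distinctiveness",
    "median": "ordering",
    "mean": "equal_interval",
    "ratio_comparison": "absolute_zero",
}


def permissible_stats(scale: str) -> list[str]:
    """List the summaries whose required property the given scale has."""
    props = SCALE_PROPERTIES[scale]
    return [stat for stat, need in STAT_REQUIRES.items() if need in props]


for scale in SCALE_PROPERTIES:
    print(scale, permissible_stats(scale))
```

Under these assumptions, Likert responses treated as ordinal data support a mode and a median but not a mean, which is one reason the choice of scale matters before any analysis.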

Test Genres
- Test battery: A group of tests standardized on the same population to yield comparable results and to produce a single score.
- Traditional vs. computer-adaptive
- Discrete-point vs. global/integrative
- Pragmatic vs. functional/communicative
- Norm-referenced vs. criterion-referenced
- Direct vs. indirect
- Subjective vs. objective
- Summative vs. formative
- Power vs. speed

NRT vs. CRT

Purpose
- CRT: To determine whether each student has achieved specific skills or concepts. To find out how much students know before and after instruction.
- NRT: To rank each student with respect to the achievement of others in broad areas of knowledge. To discriminate between high and low achievers.

Content
- CRT: Measures specific skills making up a designated curriculum and identified by teachers and curriculum experts. Each skill is expressed as an instructional objective.
- NRT: Measures broad skill areas sampled from a variety of textbooks, syllabi, and the judgments of curriculum experts.

Item Characteristics
- CRT: Each skill is tested by at least four items to obtain an adequate sample of performance and to minimize the guessing effect. The items that test any given skill are parallel in difficulty.
- NRT: Each skill is usually tested by fewer than four items. Items vary in difficulty. Selected items show high discrimination indexes.

Score Interpretation
- CRT: Test takers are compared with a preset standard for acceptable achievement. The performance of other examinees is irrelevant. A student's score is usually expressed as a percentage. Student achievement is reported for individual skills.
- NRT: Test takers are compared with other examinees and assigned a score, usually expressed as a percentile, a grade-equivalent score, or a stanine. Student achievement is reported for broad skill areas, although some norm-referenced tests do report student achievement for individual skills.
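The difference in score interpretation can be shown with one raw score read two ways. This is a minimal Python sketch under stated assumptions: the 80% cutoff, the sample scores, and the function names are all hypothetical, and the percentile rank uses the simple "percentage of the group scoring strictly below" definition, which is only one of several in use.

```python
def percent_correct(raw_score: int, total_items: int) -> float:
    """Criterion-referenced reading: score as a percentage of items."""
    return 100.0 * raw_score / total_items


def meets_criterion(raw_score: int, total_items: int,
                    cutoff_percent: float = 80.0) -> bool:
    """Compare against a preset standard; other examinees are irrelevant."""
    return percent_correct(raw_score, total_items) >= cutoff_percent


def percentile_rank(raw_score: int, group_scores: list[int]) -> float:
    """Norm-referenced reading: percentage of the group scoring below."""
    below = sum(1 for score in group_scores if score < raw_score)
    return 100.0 * below / len(group_scores)


norm_group = [10, 12, 14, 16, 18]  # hypothetical scores on a 20-item test
print(percent_correct(16, 20), meets_criterion(16, 20))
print(percentile_rank(16, norm_group))
```

The same raw score of 16 passes or fails depending only on the cutoff in the first reading, and moves up or down with the composition of the comparison group in the second.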

Test Items
- Alternate-response items: True / False, Yes / No, Agree / Disagree, Right / Wrong
- Fixed / closed-ended response items: Multiple-choice, Matching
- Free / open-ended response items: Short answer, Gap-fill, Essay

Teacher-made vs. Standardized Tests
- Teacher-made/classroom tests: Small-scale classroom tests generally prepared, administered, and scored by one teacher.
- Standardized tests: Tests with fixed content, constant administration and scoring procedures, and statistically acceptable characteristics.
- Differences between teacher-made and standardized tests:
  - Administration & scoring
  - Content sampling
  - Test construction
  - Norms & standards
  - Purpose and use

Self-assessment
1. A test refers to a standard set of items to be answered.
2. Evaluation uses both tests and informal pieces of evidence for making a value judgment and decision.
4. Measurement refers to any device for obtaining information in a quantitative manner.
5. If a person knows how to teach, he may not necessarily be able to judge the ability of his pupils.
Answers (in order): T, F, T, T
Mohd. Pazhouhesh

6. Educational decisions can be made without measurement or evaluation.
7. Summative evaluation involves the use of tests and quizzes for the purpose of determining the effectiveness of instructional programs.
Answers (in order): F, T

TThe process of gathering information to make proper decisions is called a. measurement b. testing c. evaluation d. examination TThe subjective judgment of a teacher about a student’s performance is a kind of evaluation. a. quantitative b. standard c. qualitative d. comprehensive Mohd. Pazhouhesh