Please check, just in case…. Announcements 1.Terminology Treasure Hunt due in two weeks (Oct 29). Please check the resources provided in the folder on.

Slides:



Advertisements
Similar presentations
Scoring Terminology Used in Assessment in Special Education
Advertisements

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Measures of Central Tendency. Central Tendency “Values that describe the middle, or central, characteristics of a set of data” Terms used to describe.
Statistics for the Social Sciences
Appraisal in Counseling Session 2. Schedule Finish History Finish History Statistical Concepts Statistical Concepts Scales of measurement Scales of measurement.
Measures of Central Tendency and Variability Chapter 5: Using Normal Curves For Evaluation.
Introduction to Educational Statistics
z-Scores What is a z-Score? How Are z-Scores Useful? Distributions of z-Scores Standard Normal Curve.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Classroom Assessment A Practical Guide for Educators by Craig A
Measurement Concepts & Interpretation. Scores on tests can be interpreted: By comparing a client to a peer in the norm group to determine how different.
BASIC STATISTICS WE MOST OFTEN USE Student Affairs Assessment Council Portland State University June 2012.
Measures of Central Tendency
Please check, just in case…. Announcements 1.Independent work session on the standardized test critiques next week. Meet Brigid at the Educational Diagnostician.
Today: Central Tendency & Dispersion
Measures of Central Tendency
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
Standardized Testing (1) EDU 330: Educational Psychology Daniel Moos.
Statistics Used In Special Education
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
LECTURE 06B BEGINS HERE THIS IS WHERE MATERIAL FOR EXAM 3 BEGINS.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Chapter 3 Understanding Test Scores Robert J. Drummond and Karyn Dayle Jones Assessment Procedures for Counselors and Helping Professionals, 6 th edition.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Measures of Spread Chapter 3.3 – Tools for Analyzing Data I can: calculate and interpret measures of spread MSIP/Home Learning: p. 168 #2b, 3b, 4, 6, 7,
Chapter 11 Descriptive Statistics Gay, Mills, and Airasian
Smith/Davis (c) 2005 Prentice Hall Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
Descriptive Statistics
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Scores & Norms Derived Scores, scales, variability, correlation, & percentiles.
Measures of Dispersion & The Standard Normal Distribution 2/5/07.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Listening to Bob Marley makes me think of statistics and standardized testing….. Said no one ever But… it is still your responsibility.
Interpreting Performance Data
Descriptive Statistics
Counseling Research: Quantitative, Qualitative, and Mixed Methods, 1e © 2010 Pearson Education, Inc. All rights reserved. Basic Statistical Concepts Sang.
Standardized Testing (1) EDU 330: Educational Psychology Daniel Moos.
Page 1 Chapter 3 Variability. Page 2 Central tendency tells us about the similarity between scores Variability tells us about the differences between.
Measures of Dispersion & The Standard Normal Distribution 9/12/06.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.
Z-Scores Standardized Scores. Standardizing scores With non-equivalent assessments it is not possible to develop additive summary statistics. –e.g., averaging.
Chapter 4 & 5 The Normal Curve & z Scores.
TYPES OF SCORES (PP IN HUTCHINSON; MERRELL & PLANTE; SECOND PART OF HAYNES AND PINDZOLA )
RESEARCH & DATA ANALYSIS
Introduction to statistics I Sophia King Rm. P24 HWB
Today: Standard Deviations & Z-Scores Any questions from last time?
Standardized Testing. Basic Terminology Evaluation: a judgment Measurement: a number Assessment: procedure to gather information.
Describing Distributions Statistics for the Social Sciences Psychology 340 Spring 2010.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
The Normal Distribution and Norm-Referenced Testing Norm-referenced tests compare students with their age or grade peers. Scores on these tests are compared.
Summation Notation, Percentiles and Measures of Central Tendency Overheads 3.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Module 5: Basic Concepts of Measurement. Module 5 focuses on concepts and terminology that will be helpful as you administer and interpret tests and other.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Interpreting Test Results using the Normal Distribution Dr. Amanda Hilsmier.
Describing Distributions Means Standard deviation Z scores Normal distribution Norms Tracking.
Unit 3: Science of Psychology
Basic Statistics Module 6 Activity 4.
Basic Statistics Module 6 Activity 4.
Science of Psychology AP Psychology
Please check, just in case…
Range.
But… it is still your responsibility
Presentation transcript:

Please check, just in case…

Announcements 1.Terminology Treasure Hunt due in two weeks (Oct 29). Please check the resources provided in the folder on e-reserves and bring them with you to class NEXT WEEK. 2.I need two volunteers to meet me at my office next week before class to help with materials. Volunteers needed after class too. 3.Make an appointment to meet with me with any questions you have about upcoming assignments or class topics/concepts.

Quick questions, quandaries, comments, or concerns?

APA Tip of the Day: Ampersand When there are two authors for a reference you cite, you need to cite both of them every time. When you cite them in a sentence, but not within parentheses, use “and.” When you put the citation within a parenthesis, use an ampersand (“&”).

Examples of ampersand and and: According to Gomez and Garcia (2012), “this is very interesting” (p. 107). “This is very interesting” (Gomez & Garcia, 2012, p. 107).

Topic: Psychometric terminology October 8, 2013

Tonight’s Terminology Reliability (review) Validity (review) Chronological age Raw score Norm-referenced (quick review) Normal curve

Definitions Reliability: 1) “Reliability refers to the results obtained with an assessment instrument and not to the instrument itself.” 2) “An estimate of reliability always refers to a particular type of consistency” (i.e. over time, inter- rater reliability, with different tasks).

Definitions, cont. Reliability: 3) “Reliability is a necessary but not sufficient condition for validity.” 4) “Reliability is primarily statistical.” (Linn & Gronlund, 2000, pp )

Definitions, cont. Validity: 1.) “Validity refers to the appropriateness of the interpretation of the results of an assessment procedure for a given group of individuals, not to the procedure itself.” 2.) “Validity is a matter of degree; it does not exist on an all-or-none basis.”

Validity: 3.) “Validity is always specific to some particular use or interpretation. No assessment is valid for all purposes.” 4.) “Validity is viewed as a unitary concept based on various kinds of evidence.” Definitions, cont.

Validity: 5.) “Validity involves an overall evaluative judgment. It requires an evaluation of the degree to which interpretations and use of assessment results are justified by supporting evidence and in terms of the consequences of those interpretations and uses.” (Linn & Gronlund, 2000, pp ) Definitions, cont.

“ Validity is an evaluation of the adequacy and appropriateness of the interpretations and use of assessment results.” (Linn & Gronlund, 2000, p. 73) Definitions, cont.

Chronological Age????? Chronos = sequential time

Raw score The number of items correct on a test. Without other information on the test, the raw score is meaningless – it must be interpreted for each child using the information from the test manual.

Norm-referenced Tests Describe “performance in terms of the relative position held in some known group (e.g., typed better than 90 percent of the class members).” (Linn & Gronlund, 2000, p. 42) NR assessments compare individual performance against others’ performance.

What is a Normal Distribution? It is the idea that for a number of human characteristics, such as height or weight, most of the “cases” will cluster around the middle, with fewer at the high and low ends. So, if you plot the number of cases at each possible data point, you end up with the “bell curve.”

Ways to talk about the “middle” (measures of central tendency) Mean: the statistical average Median: the score where half fall above and half fall below. Mode: the most frequent score

In a normal distribution, the mean, median, and mode are exactly the same.

Why is a “normal curve” important? Norm-referenced assessments are based on the assumption that human abilities, like height and shoe size, are “normally distributed.” A normal distribution allows us to calculate a number of important statistics, which allow us to compare an individual’s score with that of their norm group.

Testing terminology related to the “normal curve”: Measures of Variability: Range Standard deviation Standard score Z-score Deviation IQ Stanine Percentile rank Age equivalency Grade equivalency

Measures of Variability These tell us how spread out (or dispersed) the scores are.

#1 - Range A very rough measure of variability. It tells us the distance between the highest and the lowest score. If there are unusually high and/or low scores, this can be misleading. What is the range of height in our class?

#2 - Standard Deviation “A measure of the variability, or dispersion, of a distribution of scores” (Harcourt, 2000, p. 8).

Standard Deviation : A standard deviation corresponds to a particular percentage of the scores, both above and below the mean. Two thirds (68%) of scores will fall within 1 standard deviation of the mean. More than 95% of all scores fall between 2 standard deviations above and below average. Less than 2.5% of scores would be below 2 standard deviations below the mean.

This tells us how closely grouped together or how far apart the raw scores on a particular test are. A standard deviation corresponds to a particular percentage of the scores, both above and below the average score. For example, over 2/3 (68%) of scores will fall within 1 standard deviation of the average. More than 95% of all scores fall between 2 standard deviations above and below average. Standard Deviation

So, if the scores are very closely grouped, there will not be a lot of distance between the high and low raw scores for the majority of the students. If there is a large amount of difference between most students’ scores (variance), one standard deviation will include a wider range of raw scores. Standard Deviation, cont.

“Standard deviation is such an accurate measure of variability that if a distribution is reasonably normal, then by knowing only two numbers, the mean and the standard deviation, it is possible to reconstruct and redraw the distribution. Whenever the shape of a distribution of scores approaches normal, the standard deviation can be used as a measuring rod to lay off distances from the mean.”

“Normal” has a statistical definition: scores within average range – not really, really high and not really, really low. Really high and really low are usually set in terms of standard deviations. More than two standard deviation above or below the mean are scores which typically qualify someone for special education (either as gifted or with a disability) because more than 95% of scores would fall within 2 standard deviations of the mean.

So… Standard deviations are a way of considering a particular score with reference to distance from the mean and percentage of scores which fall within that distance. It helps us figure out how high or low a particular score is, in comparison to other scores.

Also… Standard deviation is an important statistic, because it lets us calculate “standard scores.”

Standard Score : This is a way of representing raw scores that tells us how far above or below average that score is, in a way that is comparable across tests and across time. Standard scores “translate” raw scores into a common way of representing scores.

Different Types of Standard Scores : Z-Score Deviation IQ Stanine T-score Percentile Age Equivalency Grade Equivalency

Z-Scores This simply tells you how many standard deviations a student’s score above or below the mean his/her score fell. If a student’s raw score is 1 ½ standard deviations below the mean, what is his/her z-score?

Standard Deviation & z-scores

People popularly refer to this as someone’s “IQ.” This is where 100 equals the average score, and each standard deviation equals 15 points. Deviation IQ

This is where 100 equals the average score, and each standard deviation equals 15 points. So, hypothetically 68% (2/3) of the population would receive a deviation IQ score of between 85 and 115. People popularly refer to this number as someone’s “IQ”. It is really just one of a number of ways of changing a raw score into a different kind of score that can be compared across tests or people. Deviation IQ

Stanine This is another kind of standard score, where the entire range of possible raw scores is divided up into nine groups. The lowest group of scores fall in Stanine 1. This highest group of scores fall in Stanine 9. Stanine 5 includes scores right in the middle (average) -- scores that fall in between the 40th and 60th percentile would be in Stanine 5.

Stanines Good for BROAD measurement of performance. Pros Can compare scores across tests Intervals are equal Cons Rough estimate of performance (not as sensitive as other standard scores)

Percentile Rank This tells us what percentage of children scored at or below your child on a particular measure (i.e. height, weight, test score).

This tells us what percentage of children scored below your child on a particular measure (i.e. height, weight, test score). For example, if your infant daughter is in the 35 th percentile for weight, that means that 35% of girls at the same age as your daughter weigh less than her. 65% of girls at the same age weigh the a same or more. Percentile Rank

Standard Deviation & Percentile Rank

Percentile Ranks Pros Generally easy to understand. Cons Need to be careful, because the intervals between ranks are not equal. For example, the difference between 50 th & 60 th percentile is much smaller than the difference between 70 th & 80 th percentiles.

Age Equivalency This tells us that a student’s score is similar to the average score of children at X age. Ex. If your son’s language development (as measured by a particular test) is at the age equivalent of 4-3, it means that he scored on that particular test at the same level as the average child who was 4 years and 3 months old.

Age Equivalents Pros Easy for parents to understand. Cons Not equal units of measurement. Basically just a “ballpark” figure. Gives the erroneous idea that individuals with developmental delays are just like child, but in an older body.

Grade Level Equivalency Very misunderstood concept. This compares a child’s score to the average performance of students at different grade levels, if they were to take the same test as your student.

Grade Level Equivalency Example If your second grader has a grade level equivalency of 3.1 in language arts, it means that your child scores the same as the average third grader, in the first month of the year, would have scored on the second grade test. This does not mean that your child is doing third grade work or that he should be moved up into the third grade. It does mean that he is performing, on second grade work, more like a third grader would.

Please take a minute for the minute paper.