Presentation on theme: "Accurate Assessment in the Common Core Era"— Presentation transcript:
1Accurate Assessment in the Common Core Era Eric Bright8th Grade MathCharleston Middle SchoolAccurate Assessment in the Common Core EraAssessing student content mastery
2Your Assessment Experience Have you ever gotten a grade you didn’t deserve? Why?What final grade should the student with the following grades get?F, D, C, F, F, B, D, F, C, F, F, B, F, A, F, AOpen discussion of what the purpose actually is.
3Your Assessment Experience What about now?Homework: FHomework: DQuiz: CTest: BProject: AFinal Exam: AOpen discussion of what the purpose actually is.What does a grade represent?
4Purpose of Assessment Bottom line: Does the student get it? Purpose: To determine to what degree a student has mastered content standards with a high degree of validity and reliability.Validity – the assessment measures what it is supposed to measureReliability – the assessment produces consistent results across evaluatorsOpen discussion of what the purpose actually is.
5Formative vs. Summative Formative – Assessment FOR learning; assessment that informs teaching and learning strategies for the teacher and/or studentMay be formal or informal (observations, effort, participation, exit slips, etc.)May be more qualitativeIncludes meaningful feedback to studentsPurpose: To improve student learning.Formative grades should occur all the time, but we cannot include it as part of a student’s grade because it doesn’t reflect mastery.
6Formative vs. Summative Formative Checklist – The assessment should…Tie directly to standardsFocus on student learning needsIdentify students’ current learning progressGive results that you can act onBe a regular part of instructionQuick and easy to give and gradeIf you don’t use the data, stop gathering it!
7Formative vs. Summative Summative – An assessment that summarizes the student’s mastery of a standardUsually formal (test, quiz, multiple choice, short response, word problems, projects, performance, portfolios, etc.)May be more quantitativePurpose: To give a picture of how well a student has mastered a standard at a specific time.Summative grades should not occur until we can reasonably expect a student to master content.
8Formative vs. Summative Summative Checklist – The assessment should…Tie directly to standardsInclude multiple levels of learning (Bloom’s: remember, understand, apply, analyze, evaluate, create)Summarize students’ overall learning progressGive results that all stakeholders can understandMake sure you have offered students examples of what meeting the standards looks like prior to a summative assessment.
9SPECIFIC ASSESSMENT TYPES Extra CreditClass ParticipationHomework CompletionHomework AccuracyProblem SolvingPop QuizQuizPre-TestPost-TestPerformance Assessment with Rubric
10Extra Credit Does the activity address a standard? Yes – Should be either formative or summative. This is regular credit.No – Should not be summative since it is not the grade level standard. This is formative at best.Does the activity go deeper into or below the grade level standard?Deeper – Should not be summative since it is not the grade level standard. It is enrichment and formative.Below – Should not be summative since it is not the grade level standard. It is remediation and formative.Extra credit is grade inflation, often from teachers who don’t want to deal with the consequences of having many F’s. That is an issue for another talk, but the key question there is why do so many students have F’s?
11Class ParticipationDefinition: A grade based solely on the frequency of participation in class.What does a participation grade measure?Content mastery? No, those with high content mastery may not participate and those without content mastery may participate frequently. This is not summative.Willingness to participate? Yes, which is critical to success in the real world, but does not reflect content mastery. Therefore, participation grades should be formative.Innovate: Keep track of participation with tally marks on a seating chart. Log it weekly in the grade book, but count it as worth 0%.
12Homework CompletionDefinition: A grade based solely on the amount of work completed and not the accuracy of that work.What does a completion grade measure?Content mastery? No, due to the lack of accuracy assessment. Therefore it is not summative.Effort? Yes, which is critical to a student’s academic success, but is still not summative. This is formative.Innovate: Keep completion grades in the grade book, but count them as worth 0%.
13The “No Homework” Policy Examine the research on homework.HW has no little to no effect on elementary students and begins having positive effects at the middle school levelPositive correlation between HW frequency and student achievementPositive correlation between HW completion and student achievementPositive correlation between HW that promotes self-regulation and student achievementNegative correlation between the relative amount of time spent on math HW versus other subjectsNegative correlation between drill/practice HW and student achievementConclusion: HW is important, but we need to rethink how we use it.
14Homework Accuracy What is the purpose of homework? To practice skills? Then it is formative, not summative.Consequence of not doing: Nothing. Natural consequences show up on summative assessment.Innovate: Write the homework completion rate on the top of each major summative assessment so students see the relationship between their practice and achievement.Innovate: Give a homework quiz at the end of every class period (or every other day) with two to four questions from the homework. Use this quiz as a formative grade.
15Homework Accuracy What is the purpose of homework? To get teacher feedback? Then it is formative, not summative.Consequence of not doing: Redo the homework so feedback can be given.Innovate: Don’t write a grade on this or else the students just toss it. Give feedback qualitatively instead of quantitatively.
16Homework Accuracy What is the purpose of homework? To learn content through discovery? Then it is formative, not summative.Consequence of not doing: Redo the homework.Innovate: Give less homework problems but have what is assigned take more thought with higher levels of Bloom’s taxonomy. Be less helpful which forces the students to think for themselves.
17Homework Accuracy What is the purpose of homework? To learn time management and organization? Then it is formative, not summative.Consequence of not doing: Create a homework completion plan with parents to train students in self-regulation skills.Innovate: Focus more on the time spent on task rather than the amount of homework completed. Ask parents to track or report that data to show growth.
18Homework Accuracy Is it possible for homework to be summative? Yes, but for it to be summative, students must have had the chance to master the skills. They need time to correct/revise homework before it is graded. Unfortunately, the homework loses its value as a formative assessment with this method.Show homework check paper.
19Problem Solving Activity Definition: An extended response situation requiring multiple steps to solve, use of multiple skills, and justification of reasoning and process.What is the purpose of the problem solving activity?To be exposed to new applications of content? This is formative.To demonstrate mastery of content through application? This is summative.
20Pop Quiz Summative assessment with no advanced notice. Are there “pop” football games? We’re not out to trap the students. Give students fair notice so that they can not only learn the material, but also develop and apply good study skills. (This assumes we teach self-regulation skills.)Innovate: Instead of pop quizzes, recent research is showing the benefits of practice quizzes. These formative assessments function like a pop quiz, but are not for a grade. They merely provide feedback on the learning process prior to the summative assessment.Give a quick four question quiz at the beginning of class as a warm-up activity. “Grade” it and go over it together instead of going over homework.
21QuizDefinition: A shorter assessment designed to assess mastery of a small set of skills.What is the purpose of the quiz?Establish reliability of mastery through multiple data entries? This is summative.Provide feedback to students about particular deficit skills before a culminating summative assessment? This is formative.Could it be a mixture of both?
22Pre-TestsDefinition: A test given before a unit of study to ascertain content already mastered.Why give a pre-test? All reasons are formative.Differentiation – Students who have already mastered content can move deeper into that content.Identify student leaders – Students with content mastery can be used to promote mathematical discourse.Show growth – Establishes a base line of where students are to compare with the post-test at the end of the unit.
23Post-TestDefinition: A test designed to show mastery over a whole unit of study.How does each type of test show mastery?Multiple choiceShort response (written or symbolic)Extended responsePBA or Project
24Performance Assessment with Rubric Projects or Extended Response ItemsThe rubric must address mastery of standards to be summative.Sample bad rubric: 3D Shape Children’s Story Book5 pts4 pts3 pts2 ptsPRESENTATION 1 X Posture, eye contact, grammar, pacing, clearness of speechExcellentGoodFairPoorREQUIREMENTS 2 X Has all shapes with theme and story that flows.NEATNESS AND CONSTRUCTION 3 X Book well built and illustrations neat and colored.CREATIVITY 3 X Well thought out and original themePROJECT 1 X Overall looks great with well thought out theme.Rubric does not match the standards and in fact has very little to do with math at all. Also note that the minimum grade a student could get is 20/50.
25Performance Assessment with Rubric A better rubric part 1: 3D Shape Children’s Story Book2 pts1 pts0 ptsDEFINITION OF CYLINDER Student accurately defines in his own wordsMastery level understandingGood understanding but copied some of the definitionDoes not show understandingDEFINITION OF CONE Student accurately defines in her own wordsDEFINITION OF SPHEREStudent accurately defines in his own wordsVOLUME OF CYLINDER Student accurately gives the formulaYesNoVOLUME OF CONE Student accurately gives the formulaVOLUME OF SPHERE Student accurately gives the formula
26Performance Assessment with Rubric A better rubric part 2: 3D Shape Children’s Story Book2 pts1 pts0 ptsFINDING VOLUME OF CYLINDER Student accurately finds volume and justifies solution with work X 2Mastery level understandingGood understanding but computation errorsDoes not show understandingFinding the volume makes sense in the context of story/problemYesPartiallyNoFINDING VOLUME OF CONE Student accurately finds volume and justifies solution with work X 2FINDING VOLUME OF SPHEREStudent accurately finds volume and justifies solution with work X 2MATHEMATICAL PRECISIONStudent maintains precision by using π≈ 3.14 and rounding final solutions to two decimal place x 26 or 7 problems solved with precision4 or 5 problems solved with precision< 4 problems solved with precision
27Performance Assessment with Rubric A better rubric part 3: 3D Shape Children’s Story Book2 pts1 pts0 ptsFINDING RADIUS OF CYLINDER OR CONE Student accurately finds radius and justifies solution with workMastery level understandingGood understanding but computation errorsDoes not show understandingFinding the radius makes sense in the context of the storyYesPartiallyNoFINDING HEIGHT OF CYLINDER OR CONE Student accurately finds height and justifies solution with workFinding the height makes sense in the context of the storyFINDING RADIUS OF SPHERE Student accurately finds radius and justifies solution with workFINDING VOLUME OF COMBINATION Student accurately finds volume and justifies solution with workFinding the volume makes sense in the context of the storyFINAL GRADE/50 ptsX 2 =__________%
28ASSESSMENT POLICIES Late Work Policy Cheating Giving Zeroes Group GradesPartial CreditTest RetakesCommon Assessments
29Late Work Innovate: Avoid late work in the first place: Give students advanced notice of any out-of-class summative assessments.Have benchmark due dates for those assessments.Reduce the amount of out-of-class summative assessments.If an assessment is turned in late:Be flexible depending on the circumstances, but have a written policy in place such as:Minus 10% to grade, but only accepted up to a week late.Note late work as a formative assessment and track it with individual students and parents. Grade assessment normally, but only accept work up to a week late.
30Cheating What constitutes cheating? Formative: Explaining what to do or just giving an answer without explaining why we do it.Summative: Explaining what to do or just giving an answer.If students cheat, what should be the consequences?Formative: Perhaps nothing except notifying parents. Natural consequences will occur on summative assessments.Summative: Take a different version of the assessment.
31The “No Zero” PolicyZero is an outlier and therefore we should only give 50% instead.False. If the student really knows 0% of the content, the zero is the best reflection of their content mastery. It is an outlier because the outcomes (grades) are not equally likely. 60% passing is a low standard.Giving a lower bound of 50% skews grades much higher. Think of a student with scores of 60, 10, 80, 10.
32The “No Zero” PolicyAny zero should be redone until the student passes.False and impossible. There is a time component to mastery. We expect mastery by a certain time. If students had the whole year to master content, they could save all assessments for the last day.Giving one chance for a retake assessment is reasonable since students learn at different rates, but beyond that either means bad teaching in the first place or that a student truly has not mastered the content.
33The “No Zero” PolicyThe “No Zero” policy came about because teachers gave zeros for formative assessments and then counted it toward a student’s grade (summative).This is not an inappropriate use of the zero. It is an inappropriate use of formative assessment.
34Zeros Innovate: Use the zero, but use it correctly! Why do we give a zero for summative grades?Incomplete? Then we don’t know how well a student has mastered that standard, so force the student to complete the assignment. If they refuse, our best guess is that they do not understand the topic, and the zero stands.Total lack of mastery? Then the zero is the most accurate representation of student mastery.
35Zeros Innovate: Use the zero, but use it correctly! Why do we give a zero for summative grades?Cheating? This does not accurately assess what a student has learned. A better consequence is to retake a similar assessment.Late? (When does late become incomplete? One day?) This does not accurately assess what a student has learned.Why is it late? If cheating, see that consequence. If effort, note that as a formative assessment.
36Group GradesDefinition: Giving the same grade (or slightly modified grades) to each student in a group.What does a group grade measure?Content mastery? It can, but one student may have achieved mastery while getting a poor grade due to someone else’s lack of mastery. This is not summative for each student.Innovate: Have students discuss ideas in a group, but…Don’t let them write anything down until they are on their own.Have them throw away their group work before filling out the summative assessment.Use group work only as a formative assessment or discovery task.
37Partial Credit Consider the following work on an algebra assessment: Was the mistake an algebraic mistake? This was a computation error, not an algebra error.Innovate: If we are assessing the algebra standard, the student appears to understand inverse operations. Perhaps 3/5 points.
38Test RetakesShould students be able to retake tests? What is the purpose of the retake?Purpose: Assess mastery. Yes, a retake might show new mastery of content.If a student retakes a test, should the new grade be averaged with the previous grade or should the new grade replace the previous one?Averaging acknowledges the struggle, but does not necessarily show the student’s current level of mastery.How many times can a student retake a test?It is impractical to allow multiple retakes. Since a summative assessment is tied to a time frame, giving one chance to retake an assessment reinforces that timeliness and also student responsibility.
39Common AssessmentsDefinition: Identical assessments that are given by different teachers who teach the same course.Purpose of Common Assessments:Establish inter-grader reliability for assessments.Count as a Type II assessment for teacher evaluations.Type I – MAP, PARCC, Universal ScreenerType II – District, grade level, or course-wide assessment adopted and approved by the school districtType III – Teacher createdGive a springboard for discussing student mastery for the purposes of lesson revision.Possible common assessments include: weekly or mid-chapter quizzes, unit or chapter tests, quarter or semester examsWe want to make sure that an A in one class is the same as an A in another class. This also assumes a common grading scale. CHS allows teachers to choose a grading scale. They have a common multiple choice final exam in Algebra of 50 questions but one teacher has a grading scale of is an A while another has is an A. This doesn’t work.
40GRADING SYSTEMS Grading on the Curve Total Points vs. Weighted Categories“Standards-Based” Grading
41Grading on the CurveChanging student grades methodically for a better grade distribution.This does not accurately assess student mastery of content if we have a clear picture of what mastery is. If we don’t know what mastery looks like, that must be satisfied before we can assess.Rather than making your grade distribution match the normal curve, ask yourself why the grades are distributed they way they are. This is a formative exercise.
42Grading on the Curve Are there too many low grades? Was it too soon to expect mastery? Eliminate the summative grade and give the assessment later. Use this as a formative assessment.Was the material poorly taught? Eliminate the summative grade and re-teach.Was the assessment too difficult? Eliminate the summative grade and give a better written assessment.Was the assessment accurate? Give students options for remediation, but move on in the curriculum.Are there too many high grades?Was it the assessment too easy? Eliminate the summative grade and give a better written assessment.Was the assessment accurate? Celebrate your students!
43Points vs. PercentsWhen talking about how to record grades of the same weight, this is irrelevant except for rounding differences.For example, these are the same grade:60%, 80%, 100%, 80% yields average of 80%3/5, 4/5, 5/5, 4/5 yields 16/20 = 80%These have a slight rounding error due to “not nice” denominators:65%, 71%, 59%, 71% yields 67%11/17, 12/17, 10/17, 12/17 yields 66%Moral: Choose the denominator wisely.
44Points vs. PercentsWhen in reference to weighted categories versus straight points, the differences are aesthetic because every grading system is weighted.Consider a typical “points” system:Homework worth 5 points eachQuizzes worth 50 points eachTests worth 100 points eachIs a test worth 20 times as much as homework?
45Points vs. PercentsIf there are 2 tests and 2 quizzes per quarter, but homework every day that gives us:225 points of homework (43%)100 points from quizzes (19%)200 points from tests (38%)
46Points vs. PercentsThink long-term. During the whole quarter say you typically have:8 Homework summative assessments (This is purely for demonstration purposes! HW should be a formative assessment!)4 Problem solving summative assessments4 Quizzes2 Unit tests1 Project1 Quarter Exam
47Points vs. Percents Now consider this weighted system: Homework worth 10% (This is purely for demonstration purposes! HW should be worth 0%!)Problem Solving worth 10%Quizzes worth 20%Unit Tests worth 30%Project worth 10%Quarter Exam worth 20%
48Points vs. Percents It is the same as this points system: Homework worth 100 points eachProblem Solving worth 200 points eachQuizzes worth 400 points eachUnit Tests worth 1200 points eachProject worth 800 pointsQuarter Exam worth 1600 pointsHint: Making everything out of 100 makes it easier for the students. So instead of saying a quiz is out of 400 points, tell students their grade counts four times.
49Points vs. Percents The difference is how you get to the end grade: In this case, both grades end with a final grade of 83% because we made sure the weight of the points matched the weight of the categories.HW1PS1HW2PS2QZ1HW3QZ2HW4TST1HW5PS3HW6PS4QZ3HW7QZ4HW8TST2ProjQ ExPoints for assignment10020040012008001600Joe Bob (%)80%75%70%85%60%100%90%93%50%65%88%96%Joe Bob (pts)8015070170240340901116601405026010566801536Weighted Grade78%69%71%77%84%83%82%81%Points Grade74%79%
50Points vs. PercentsThe difference is how you get to the end grade:
51“Standards-Based” Grading Standards-based grading usually doesn’t actually mean standards-based grading. It usually means grading with a rubric or something similar with:4 – Exceeds standard3 – Meets standard2 – Meets standard with assistance1 – Does not meet standardExample Report Card:8th Grade Pre-Algebra: 3Number System: 4Expression and Equations: 2Functions: 3Geometry: 2Statistics and Probability: 4Note: You can have this same break down of grades with a regular percent grading system by simply making your category of grades follow the Domain name and weighting summative assessments appropriately via points.
52“Standards-Based” Grading Fact: are not equally likely.4 may represent 90% – 100% accuracy3 may represent 80% – 90% accuracy2 may represent 60% – 80% accuracy1 may represent 0% – 60% accuracyEven using objective benchmarks, they are still not equally likely.Problem: You can’t average the scores, but we need to.Geometry scores of 0%, 90%, 90%, 90% average to 67.5%SB scores of 1, 4, 4, 4 average to 3.25 (meets)Which one signals to the parent there is a problem?To get an overall Geometry score we need to average to account for subcategories within Geometry (area, volume, Pythagorean Theorem, etc.)4 should not mean “above grade level” because that is formative, not summative for the grade level standards.
53“Standards-Based” Grading Potential Solution: Power Law AverageThe Power Law is basically a predictor of how the student would score on the next assessment based on previous performance. So scores of 1,2,3,4 might yield a 4 while scores of 4,3,2,1 might yield a 1.Problem: Power Law only works for individual skillsMost assessments cover a multitude of skills.Getting a Geometry score of 4 on the first assessment does not mean anything about the Geometry score on the next assessment if the first assessment covered 2D geometry while the second assessment covered 3D.
54“Standards-Based” Grading Problem: How do you deal with assessments that incorporate multiple standards or skills? You would need multiple grades for the same assessment(s).Problem: There is more information reported (usually) with SB grading, but it is still not useful. Does a 2 in Geometry mean I need help with transformations, volume, or the Pythagorean theorem?
55“Standards-Based” Grading Bottom Line: is just as flawed as a traditional grading system.Solution: Use good assessment practices in whatever grading system you use and many of the problems that the SB Grading movement is trying to tackle will be resolved.
56My Grading SystemWeighted Categories that are assessed based on standards masteryHomework Completion (0%) – DailyHomework Accuracy (0%) – Each specific skill or set of skillsMastery Task (0%) – Each specific skill or set of skillsProblem Solving (10%) – 4 to 8 per quarterWeekly Quiz (30%) – 4 to 6 per quarterUnit Pre-Test (0%) – 3 per quarterUnit Post-Test (40%) – 3 per quarterQuarter Project (10%) – 1 per quarterQuarter Exam (10%) – 1 per quarterEnrichment (0%) – As needed based on Pre-TestsRemediation (0%) – As needed for progress monitoringBut this is not perfect! We’re working to change it!Explain each one, summative vs. formative, valid since everything ties directly to standards, reliable since even partial credit is defined on common assessments.
57Getting Accurate Grades Keep in mind the purpose of assessments (formative and summative).We can’t grade the way we want or the way we’ve always done it.
58Accurate Assessment in the Common Core Era Eric Bright8th Grade MathCharleston Middle SchoolAccurate Assessment in the Common Core EraAssessing student content mastery