Applying SGP to the STAR Assessments Daniel Bolt Dept of Educational Psychology University of Wisconsin, Madison.

Slides:



Advertisements
Similar presentations
Continued Psy 524 Ainsworth
Advertisements

A Model Description By Benjamin Ditkowsky, Ph.D. Student Growth Models for Principal and Student Evaluation.
Forecasting Using the Simple Linear Regression Model and Correlation
BA 275 Quantitative Business Methods
STAR Assessments: Using data to drive your instruction 2012.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
Statistics for the Social Sciences Psychology 340 Spring 2005 Prediction cont.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
CS 8751 ML & KDDEvaluating Hypotheses1 Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal.
Experimental Design, Statistical Analysis CSCI 4800/6800 University of Georgia Spring 2007 Eileen Kraemer.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
David Kaplan & Heidi Sweetman University of Delaware Two Methodological Perspectives on the Development of Mathematical Competencies in Young Children:
Predictive Analysis in Marketing Research
Today Concepts underlying inferential statistics
Joint Predictive Probabilities of Oral Reading Fluency for Reading Comprehension Young-Suk Kim & Yaacov Petscher Florida State University & Florida Center.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Objectives of Multiple Regression
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
APPENDIX B Data Preparation and Univariate Statistics How are computer used in data collection and analysis? How are collected data prepared for statistical.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Analyzing Institutional Assessment Results Longitudinally (with nonparametric effect sizes) Dr. Bradley Thiessen, Director of Institutional Research Problem:
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Fundamentals of Data Analysis Lecture 10 Management of data sets and improving the precision of measurement pt. 2.
IRT Model Misspecification and Metric Consequences Sora Lee Sien Deng Daniel Bolt Dept of Educational Psychology University of Wisconsin, Madison.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Why Model? Make predictions or forecasts where we don’t have data.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
CHEMISTRY ANALYTICAL CHEMISTRY Fall Lecture 6.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
1 Children Left Behind in AYP and Non-AYP Schools: Using Student Progress and the Distribution of Student Gains to Validate AYP Kilchan Choi Michael Seltzer.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Nonparametric Statistics
11.1 Heteroskedasticity: Nature and Detection Aims and Learning Objectives By the end of this session students should be able to: Explain the nature.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
STAR Reading. Purpose Periodic progress monitoring assessment Quick and accurate estimates of reading comprehension Assessment of reading relative to.
Methods of Presenting and Interpreting Information Class 9.
Stats Methods at IC Lecture 3: Regression.
Nonparametric Statistics
Why Model? Make predictions or forecasts where we don’t have data.
Correlation, Bivariate Regression, and Multiple Regression
Statistics for the Social Sciences
Model validation and prediction
Virtual COMSATS Inferential Statistics Lecture-26
Reliability & Validity
BA 275 Quantitative Business Methods
Nonparametric Statistics
Simple Linear Regression
Formative Assessments Director, Assessment and Accountability
Basic Practice of Statistics - 3rd Edition Inference for Regression
Statistics II: An Overview of Statistics
Essentials of Statistics for Business and Economics (8e)
Longitudinal Data & Mixed Effects Models
Logistic Regression.
Presentation transcript:

Applying SGP to the STAR Assessments Daniel Bolt Dept of Educational Psychology University of Wisconsin, Madison

Some Unique Features of STAR Assessments Same CAT assessment is administered within and across years (possible to generate look-up tables for the calculation of SGP) Multiple STAR administrations to a student are possible throughout the year, and can occur at different times and/or frequencies for different students

Why SGP may be Useful with STAR Assessments Different STAR scores may be associated with different amounts of measurement error (e.g., extremely low or high STAR scores are sometimes of questionable validity) There often exists more/less variability in growth observed across students at different initial STAR score levels

Some Practical Issues Related to Administration of STAR Assessments How frequently and at what intervals should STAR be administered to get reliable estimates of end-of-year scores? Does the answer depend on the initial (fall) score of a student? Is there practical value in the use of SGP for answering this question (in contrast to alternative approaches, such as ordinary least squares---OLS--- regression methods)?

Goals of the Present Study Examine SGP as a methodology for quantifying growth and for studying the precision of end-of-year growth predictions using STAR Assessments Compare SGP against competing methodologies (OLS regression) in terms of their reported precision of end-of- year predictions

GradeMathReading Early Literacy Sample Sizes (Students with Fall, Winter & Spring Assessments)

Evaluating the Precision of Spring Score Predictions How well do winter assessments improve our predictions of end-of-year outcomes? Are the winter assessments more/less useful depending on the fall score obtained by the student? In answering these questions, we find it useful to examine changes in the confidence intervals for spring scores defined by the SGP percentile cuts

Example of SGP Percentile Cuts, One Covariate

Example of SGP Percentile Cuts, Two Covariates

Comparison of SGP & OLS Intervals, Math Grade 1 According to OLS Standard Error of Prediction According to SGP %ile cuts

STAR Math, Grades 1-6 OLS SGP

STAR Math, Grades 7-12 OLS SGP

STAR Reading, Grades 1-6 OLS SGP

STAR Reading, Grades 7-12 OLS SGP

STAR Early Literacy, Grades K-4 OLS SGP

Adding Winter Scores as Covariates with SGP Using 80% interval width curves as a baseline, we can further examine how much the intervals are reduced when adding a winter assessment The decline in the 80% interval can be used as an indicator of the added precision provided by the winter assessment

STAR Math Example

STAR Reading Example

STAR Early Literacy Example

Some Examples of STAR Score Patterns SubjectFallWinterSpring Change in 80% Interval MATH (+432) (+321) (-208) (-209) READING (+41) (+40) (-148) (-256) LITERACY (+297) (+282) (-242) (-256)

STAR Math Example OLS SGP

STAR Math, Grade 1-6 OLS SGP

STAR Math, Grades 7-12 OLS SGP

STAR Reading Example OLS SGP

STAR Reading, Grades 1-6 OLS SGP

STAR Reading, Grades 7-12 OLS SGP

STAR Early Literacy Example OLS SGP

STAR Early Literacy, Grades K-4 OLSSGP

Evaluating Predicted Spring Scores in Terms of State Proficiency Thresholds The accuracy of SGP and OLS predictions can also be compared against the thresholds associated with state-specific proficiency categories By assuming normally distributed residuals (with constant variance) for OLS, SGP and OLS can each be used to define a probability that the spring score will exceed a predefined threshold

Proficiency Threshold Example of SGP Percentile Cuts against State- Defined Proficiency Category

MATHREADING Grade Proficiency Cut a PercentileOLS R 2,b SGP R 2,b Proficiency Cut a PercentileOLS R 2,b SGP R 2,b a Estimated STAR cutscores for the Tennessee Comprehensive Assessment Program (TCAP) b Efron’s Pseudo R 2 Comparing SGP and OLS on Accuracy of Proficiency Predictions

Extending SGP to Accommodate Multiple Intermediate Assessments How can additional intermediate assessments be used in SGP to further improve predicted spring scores? Challenge: Handling varying-time point assessment schedules One possible solution: Linear interpolation to fixed node locations

Conclusions and Future Directions Our SGP analyses suggest substantial variability in the precision of spring score predictions for STAR Math, Reading and Early Literacy depending on fall scores There is clear value in incorporating winter assessments into SGP---the largest value occurs for students with extreme fall scores in STAR Math, intermediate fall scores in STAR Reading More experimentation needed to determine how best to make use of multiple intermediate assessments within SGP