Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 25, 2012.

Slides:

Advertisements

Similar presentations

Bayesian Knowledge Tracing and Discovery with Models

Advertisements

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 23, 2012.

Bayesian Knowledge Tracing Prediction Models

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 30, 2012.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 March 12, 2012.

Item Response Theory in Health Measurement

Next Semester CSCI 5622 – Machine learning (Matt Wilder)  great text by Hastie, Tibshirani, & Friedman great text ECEN 5018 – Game Theory ECEN 5322 –

Navigating the parameter space of Bayesian Knowledge Tracing models Visualizations of the convergence of the Expectation Maximization algorithm Zachary.

Knowledge Inference: Advanced BKT Week 4 Video 5.

Knowledge Engineering Week 3 Video 5. Knowledge Engineering  Where your model is created by a smart human being, rather than an exhaustive computer.

Bayesian Knowledge Tracing and Other Predictive Models in Educational Data Mining Zachary A. Pardos PSLC Summer School 2011 Bayesian Knowledge Tracing.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 27, 2012.

Modeling Student Knowledge Using Bayesian Networks to Predict Student Performance By Zach Pardos, Neil Heffernan, Brigham Anderson and Cristina Heffernan.

Meta-Cognition, Motivation, and Affect PSY504 Spring term, 2011 January 26, 2010.

Ryan S.J.d. Baker Adam B. Goldstein Neil T. Heffernan Detecting the Moment of Learning.

Week 8 Video 4 Hidden Markov Models.

Effective Skill Assessment Using Expectation Maximization in a Multi Network Temporal Bayesian Network By Zach Pardos, Advisors: Neil Heffernan, Carolina.

+ Doing More with Less : Student Modeling and Performance Prediction with Reduced Content Models Yun Huang, University of Pittsburgh Yanbo Xu, Carnegie.

Lecture 5: Simple Linear Regression

Research Methods for the Learning Sciences Ken Koedinger Phil Pavlik TA: Ben Shih Lecture 3 Experimental Design.

Educational Data Mining Ryan S.J.d. Baker PSLC/HCII Carnegie Mellon University Richard Scheines Professor of Statistics, Machine Learning, and Human-Computer.

Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.

Determining the Significance of Item Order In Randomized Problem Sets Zachary A. Pardos, Neil T. Heffernan Worcester Polytechnic Institute Department of.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 6, 2012.

Case Study – San Pedro Week 1, Video 6. Case Study of Classification  San Pedro, M.O.Z., Baker, R.S.J.d., Bowers, A.J., Heffernan, N.T. (2013) Predicting.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 13, 2012.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 2, 2012.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 26, 2012.

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 January 28, 2013.

Assessing Student Learning to Improve Teaching Jeff Bell and Jim Bidlack.

Advanced BKT February 11, Classic BKT Not learned Two Learning Parameters p(L 0 )Probability the skill is already known before the first opportunity.

Regression Lines. Today’s Aim: To learn the method for calculating the most accurate Line of Best Fit for a set of data.

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 February 4, 2013.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 22, 2012.

Assessment embedded in step- based tutors (SBTs) CPI 494 Feb 12, 2009 Kurt VanLehn ASU.

Core Methods in Educational Data Mining HUDK4050 Fall 2014.

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 January 23, 2013.

Core Methods in Educational Data Mining HUDK4050 Fall 2015.

Item Response Theory in Health Measurement

Core Methods in Educational Data Mining HUDK4050 Fall 2015.

Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Heriot Watt University 12th February 2003.

Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 6, 2013.

Core Methods in Educational Data Mining HUDK4050 Fall 2015.

MACHINE LEARNING 3. Supervised Learning. Learning a Class from Examples Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1)

Core Methods in Educational Data Mining HUDK4050 Fall 2014.

Core Methods in Educational Data Mining HUDK4050 Fall 2014.

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 6, 2012.

Core Methods in Educational Data Mining HUDK4050 Fall 2014.

CSC321: Lecture 8: The Bayesian way to fit models Geoffrey Hinton.

Core Methods in Educational Data Mining

Core Methods in Educational Data Mining

as presented on that date, with special formatting removed

Michael V. Yudelson Carnegie Mellon University

How to interact with the system?

Core Methods in Educational Data Mining

Item Analysis: Classical and Beyond

Special Topics in Educational Data Mining

Towards building a better cognitive model

Core Methods in Educational Data Mining

Detecting the Learning Value of Items In a Randomized Problem Set

Big Data, Education, and Society

Addressing the Assessing Challenge with the ASSISTment System

Knowledge Tracing Parameters can be learned with the EM algorithm!

Standards Based Grading

Neil T. Heffernan, Joseph E. Beck & Kenneth R. Koedinger

How to interact with the system?

Core Methods in Educational Data Mining

Item Analysis: Classical and Beyond

Core Methods in Educational Data Mining

Core Methods in Educational Data Mining

Presentation transcript:

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 25, 2012

Today’s Class Performance Factors Analysis

What is the key goal of PFA?

Measuring how much latent skill a student has, while they are learning – Expressed in terms of probability of correctness, the next time the skill is encountered – No direct expression of the amount of latent skill, except this probability of correctness How likely is Bob to be able to perform correctly, the next time he sees skill 7?

Use: PFA and IRT

What is the typical use of IRT? Assess a student’s knowledge of topic X Based on a sequence of items that are dichotomously scored – E.g. the student can get a score of 0 or 1 on each item

What is the typical use of PFA? Assess a student’s knowledge of topic X Based on a sequence of items that are dichotomously scored – E.g. the student can get a score of 0 or 1 on each item Where the student can learn on each item, due to help, feedback, scaffolding, etc.

Key assumptions Each item may involve multiple latent traits or skills – Different from IRT Each skill has success learning rate  and failure learning rate  There is also a difficulty parameter, but its semantics can vary – more on this later From these parameters, and the number of successes and failures the student has had on each relevant skill so far, we can compute the probability P(m) that the learner will get the item correct

Note The assumption that items may involve multiple skills is not seen in IRT or BKT (which we’ll talk about later) Why might this be a good assumption? Why might this be a bad assumption?

Note The assumption that learning rates are different between cases involving success and failure is not seen in BKT (which we’ll talk about later) Why might this be a good assumption? Why might this be a bad assumption?

Note PFA has no direct expression of the amount of latent skill, except a probability of correctness Why might this be beneficial? Why might this be non-optimal?

Simple PFA Not actually a variant that Pavlik uses, but will serve as a base to understand the more complex variants that use

Simple PFA

Let’s enter this into Excel Using pfa-modelfit-set1-v2.xlsx

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -2 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = 2 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = 0 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -50 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0 gamma(2,3) = 0 rho(1)=0 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0.1 gamma(2,3) = 0 rho(1)=0 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0 gamma(2,3) = 0 rho(1)= -0.1 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA Let’s try the following values: beta = -0.5 gamma(1)=0.1 gamma(2,3) = 0.1 rho(1)=0 rho(2,3) = 0 What are the effects of getting everything right (1 skill), getting everything wrong (1 skill), getting everything right (2 skills)

Simple PFA What parts of the parameter space are definitely degenerate? What parts of the parameter space are plausible? Are there any gray areas?

Let’s fit a Simple PFA model to this data pfa-modelfit-set1-v2.xlsx Students Let’s start with the simple baseline model We’ll use SSR (sum of squared residuals) as our goodness criterion Note that there are better ways to do this than in Excel, such as Expectation Maximization – More on this in the next lecture

 Parameters Pavlik uses three different  Parameters – Item – Item-Type – Skill

Let’s fit the other 3 PFA variants to this data How does goodness of fit change? How does the number of parameters change? – Also, data points/parameters ratio

Let’s fit the other 3 PFA variants to this data How does goodness of fit change? How does the number of parameters change? – Also, data points/parameters ratio Is there a concern about over-fitting? – We’ll talk about cross-validation later in the semester

Notes Jury is still out on whether PFA is better than other approaches

PFA.vs. BKT PFA beats BKT – Pavlik, Cen, & Koedinger (2009)  A’ = 0.01, 0.01, 0.02, 0.02 – Gong, Beck, & Heffernan (2010)  A’ = 0.01 – Pardos, Baker, Gowda, & Heffernan (in press)  A’ = 0.02 BKT beats PFA – Baker, Pardos, Gowda, Nooraei, & Heffernan (2011)  A’ = 0.03 – Pardos, Gowda, Baker, & Heffernan (2011)  Predicting post-test)  r = 0.24,  RMSE = 0.01

Final Thoughts PFA is a competitor for measuring student skill, which predicts the probability of correctness rather than latent knowledge Not yet used within Intelligent Tutoring Systems – Should it be? – We’ll discuss this again next class

PFA Questions? Comments?

Next Class Monday, January 30 3pm-5pm AK232 Bayesian Knowledge Tracing Corbett, A.T., Anderson, J.R. (1995) Knowledge Tracing: Modeling the Acquisition of Procedural Knowledge. User Modeling and User- Adapted Interaction, 4, Baker, R.S.J.d., Corbett, A.T., Aleven, V. (2008) More Accurate Student Modeling Through Contextual Estimation of Slip and Guess Probabilities in Bayesian Knowledge Tracing. Proceedings of the 9th International Conference on Intelligent Tutoring Systems, ASSIGNMENT ONE DUE

The End