Educational data mining overview & Introduction to Exploratory Data Analysis with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer.

Slides:



Advertisements
Similar presentations
Design Validation CSCI 5801: Software Engineering.
Advertisements

Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 January 23, 2012.
A Toolbox for Blackboard Tim Roberts
Causal Data Mining Richard Scheines Dept. of Philosophy, Machine Learning, & Human-Computer Interaction Carnegie Mellon.
Educational Data Mining Overview Ryan S.J.d. Baker PSLC Summer School 2010.
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Educational data mining overview & Introduction to Exploratory Data Analysis Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction.
Improving learning by improving the cognitive model: A data- driven approach Cen, H., Koedinger, K., Junker, B. Learning Factors Analysis - A General Method.
Knowledge Engineering Week 3 Video 5. Knowledge Engineering  Where your model is created by a smart human being, rather than an exhaustive computer.
Jewelry Inventory Management Software Your Logo Here Welcome to a demonstration of Del Mar Data Systems Jewelry Inventory Management.
An Individualized Web-Based Algebra Tutor D.Sklavakis & I. Refanidis 1 An Individualized Web-Based Algebra Tutor Based on Dynamic Deep Model Tracing Dimitrios.
Supporting (aspects of) self- directed learning with Cognitive Tutors Ken Koedinger CMU Director of Pittsburgh Science of Learning Center Human-Computer.
Learning from Learning Curves: Item Response Theory & Learning Factors Analysis Ken Koedinger Human-Computer Interaction Institute Carnegie Mellon University.
Data mining with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction & Psychology Carnegie Mellon University Ryan S.J.d.
Conclusion Our prediction model did a good job at predict 8 th grade math proficiency. It can be used to estimate 10 th grade score fairly well, too. But.
Searching for Patterns: Sean Early PSLC Summer School 2007 Question: Which is a better predictor of performance in a cognitive tutor, error rate or assistance.
eGovernance Under guidance of Dr. P.V. Kamesam IBM Research Lab New Delhi Ashish Gupta 3 rd Year B.Tech, Computer Science and Engg. IIT Delhi.
+ Doing More with Less : Student Modeling and Performance Prediction with Reduced Content Models Yun Huang, University of Pittsburgh Yanbo Xu, Carnegie.
CHAPTER 3 Describing Relationships
Educational Data Mining Overview John Stamper PSLC Summer School /25/2011 1PSLC Summer School 2011.
1 Learning from Learning Curves: Item Response Theory & Learning Factors Analysis Ken Koedinger Human-Computer Interaction Institute Carnegie Mellon University.
The Importance Of Transactions In The World Of Analytics Doug Aoyama Director, Product Marketing.
Classification and Prediction: Regression Analysis
DataShop: An Educational Data Mining Platform for the Learning Science Community John Stamper Pittsburgh Science of Learning Center Human-Computer Interaction.
John Stamper Human-Computer Interaction Institute Carnegie Mellon University Technical Director Pittsburgh Science of Learning Center DataShop.
PSLC DataShop Introduction Slides current to DataShop version John Stamper DataShop Technical Director.
Chapter 5 Models and theories 1. Cognitive modeling If we can build a model of how a user works, then we can predict how s/he will interact with the interface.
Chapter 8: Problem Solving
Case Study – San Pedro Week 1, Video 6. Case Study of Classification  San Pedro, M.O.Z., Baker, R.S.J.d., Bowers, A.J., Heffernan, N.T. (2013) Predicting.
Classroom Assessment A Practical Guide for Educators by Craig A
PSLC DataShop Introduction Slides current to DataShop version John Stamper DataShop Technical Director.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
MATH 90 CURRICULUM RENEWAL & MATH MAKES SENSE 9 WORKSHOP June 25th, 2010.
Wiley eGrade. What is eGrade? Web-based software that enables instructors to automate the process of assigning and grading homework and quiz assignments.
DataShop v7.1 Release Event Friday, November 1, 2013 LearnLabdatashop.org LearnLab
PSLC DataShop Introduction Slides current to DataShop version John Stamper DataShop Technical Director.
Using SAS® Information Map Studio
GRAPHING IN CHEMISTRY YEAR 11 DP CHEMISTRYR. SLIDER.
Noboru Matsuda Human-Computer Interaction Institute
Slide 1 Kirsten Butcher Elaborated Explanations for Visual/Verbal Problem Solving: Interactive Communication Cluster July 24, 2006.
Educational Data Mining: Discovery with Models Ryan S.J.d. Baker PSLC/HCII Carnegie Mellon University Ken Koedinger CMU Director of PSLC Professor of Human-Computer.
DAY 15: ACCESS CHAPTER 1 Rahul Kavi October 6,
Reserve Variability – Session II: Who Is Doing What? Mark R. Shapland, FCAS, ASA, MAAA Casualty Actuarial Society Spring Meeting San Juan, Puerto Rico.
DataShop Import Workshop Tuesday, June 14, 2011 pslcdatashop.org PSLC
Applying the Redundancy Principle ( Chapter 7) And using e-learning data for CTA Ken Koedinger 1.
Data mining with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction & Psychology Carnegie Mellon University.
Core Methods in Educational Data Mining HUDK4050 Fall 2015.
Using DataShop Tools to Model Students Learning Statistics Marsha C. Lovett Eberly Center & Psychology Acknowledgements to: Judy Brooks, Ken Koedinger,
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 February 6, 2012.
Data Mining Lab Student performance evaluation. Rate of learning varies from student to student May depend on similarity of the problem Is it possible.
Text2PTO: Modernizing Patent Application Filing A Proposal for Submitting Text Applications to the USPTO.
Data-Driven Education
Classroom Assessment A Practical Guide for Educators by Craig A
How to interact with the system?
Special Topics in Educational Data Mining
Using Bayesian Networks to Predict Test Scores
Mingyu Feng Neil Heffernan Joseph Beck
Big Data, Education, and Society
CSc4730/6730 Scientific Visualization
Microsoft Word - Formatting Pages
Introduction to PSLC DataShop
Addressing the Assessing Challenge with the ASSISTment System
Neil T. Heffernan, Joseph E. Beck & Kenneth R. Koedinger
How to interact with the system?
Intro to Machine Learning
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Core Methods in Educational Data Mining
Presentation transcript:

Educational data mining overview & Introduction to Exploratory Data Analysis with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction & Psychology Carnegie Mellon University

Overview DataShop Overview  Logging model  DataShop Features Quantitative models of learning curves  Power law, logistic regression  Contrasting KC models Exploratory Data Analysis Exercise (start) Knowledge Component Model Editing

Logging & Storage Models Education technologies are “instrumented” to produce log data We encourage a standard log format  XML format generalized from Ritter & Koedinger (1995)  Also convert log data from other formats

Relational Database -- complex!

Example activity generating “click stream” data Geometry Cognitive Tutor: “Making Cans” problem  Find the area of scrap metal left over after removing a circular area (the end of a can) from a metal square.  Student enters values in worksheet Tutor provides feedback & instruction  Records student’s actions & tutor responses Logs stored in files on school server or database at Carnegie Learning  Later imported into DataShop

DataShop logging model Main constructs:  Context message: the student, problem, and session with the tutor  Tool message: represents an action in the tool performed by a student or tutor  Tutor message: represents a tutor’s response to a student action

DataShop XML format: Context message Geometry Hampton PACT-AREA PACT-AREA-6 MAKING-CANS Dataset name Course unit Course section Problem

DataShop XML format: Tool & Tutor Messages (POG-AREA QUESTION2) INPUT-CELL-VALUE … [as above] … CORRECT

Example Stored Transactions Student interactions (or transactions) are stored in a relational database, can be exported as table  Example: Student S01 on Making-Cans problem

Transactions Info for each transaction  student(s), session, time, problem, problem step, attempt number, student action  tutor response, number of hints, knowledge component code Logging of on-line tools (e.g., a virtual lab) does not include tutor response

Step & Transaction Definitions A problem-solving activity typically involves many tool & tutor messages. “Steps” represent completion of possible subgoals or pieces of a problem solution “Transactions” are attempts at a step or requests for instructional help

Example: data aggregated by student-step

Overview DataShop Overview  Logging model  DataShop Features Quantitative models of learning curves  Power law, logistic regression  Contrasting KC models Exploratory Data Analysis Exercise (start) Knowledge Component Model Editing

DataShop Analysis Tools Dataset Info Performance Profiler Learning Curve Error Report Export Sample Selector

Meta data for given dataset PI’s get ‘edit’ privileges, others must request it Meta data for given dataset PI’s get ‘edit’ privileges, others must request it 15 Papers and Files storage Dataset Metrics Problem Breakdown table Dataset Info

Performance Profiler Aggregate by Step Problem KC Dataset Level Aggregate by Step Problem KC Dataset Level View measures of Error Rate Assistance Score Avg # Hints Avg # Incorrect Residual Error Rate View measures of Error Rate Assistance Score Avg # Hints Avg # Incorrect Residual Error Rate Multipurpose tool to help identify areas that are too hard or easy

View by KC or Student, Assistance Score or Error Rate Time is represented on the x- axis as ‘opportunity’, or the # of times a student (or students) had an opportunity to demonstrate a KC Visualizes changes in student performance over time Learning Curve

Provides a breakdown of problem information (by step) for fine- grained analysis of problem-solving behavior Attempts are categorized by student Provides a breakdown of problem information (by step) for fine- grained analysis of problem-solving behavior Attempts are categorized by student View by Problem or KC Error Report

Sample Selector Filter by Condition Dataset Level Problem School Student Tutor Transaction Filter by Condition Dataset Level Problem School Student Tutor Transaction Easily create a sample/filter to view a smaller subset of data Shared (only owner can edit) and private samples

Export Two types of export available By Transaction By Step Anonymous, tab-delimited file Easy to import into Excel! You can also export the Problem Breakdown table and LFA values!

Help/Documentation Extensive documentation with examples Contextual by tool/report Extensive documentation with examples Contextual by tool/report Glossary of common terms, tied in with PSLC Theory wiki

New Features Manage Knowledge Component models  Create, Modify & Delete KC models within DataShop Addition of Latency Curves to Learning Curve Reporting  Time to Correct  Assistance Time Problem Rollup & Export Enhanced Contextual Help

Overview DataShop Overview  Logging model  DataShop Features Quantitative models of learning curves  Power law, logistic regression  Contrasting KC models Exploratory Data Analysis Exercise (start) Knowledge Component Model Editing

Recall learning curve story Without decomposition, using just a single “Geometry” KC, no smooth learning curve. But with decomposition, 12 KCs for area concepts, a smooth learning curve. Upshot: A decomposed KC model fits learning & transfer data better than a “faculty theory” of mind

Learning curve analysis The Power Law of Learning (Newell & Rosenbloom, 1993) Y = a X b Y – error rate X – opportunities to practice a skill a – error rate on 1st opportunity b – learning rate After the log transformation “a” is the “intercept” or starting point of the learning curve “b” is the “slope” or steepness of the learning curve

More sophisticated learning curve model Generalized Power Law to fit learning curves  Logistic regression (Draney, Wilson, Pirolli, 1995) Assumptions  Different students may initially know more or less => use an intercept parameter for each student  Students learn at the same rate => no slope parameters for each student  Some productions may be more known than others => use an intercept parameter for each production  Some productions are easier to learn than others => use a slope parameter for each production These assumptions are reflected in detailed math model …

More sophisticated learning curve model Probability of getting a step correct (p) is proportional to: - if student i performed this step = X i, add overall “smarts” of that student =  i - if skill j is needed for this step = Y j, add easiness of that skill =  j add product of number of opportunities to learn = T j & amount gained for each opportunity =  j p  Use logistic regression because response is discrete (correct or not) Probability (p) is transformed by “log odds” “stretched out” with “s curve” to not bump up against 0 or 1 (Related to “Item Response Theory”, behind standardized tests …)

Different representation, same model Predicts whether student is correct depending on knowledge & practice Additive Factor Model (Draney, et al. 1995, Cen, Koedinger, Junker, 2006)

The Q Matrix How to represent relationship between knowledge components and student tasks?  Tasks also called items, questions, problems, or steps (in problems) Q-Matrix (Tatsuoka. 1983)  2* 8 is a single-KC item  2*8 – 3 is a conjunctive-KC item, involves two KCs 29 Item | KCAddSubMulDiv 2* *

30 Model Evaluation How to compare cognitive models? A good model minimizes prediction risk by balancing fit with data & complexity (Wasserman 2005) Compare BIC for the cognitive models BIC is “Bayesian Information Criteria” BIC = -2*log-likelihood + numPar * log(numOb) Better (lower) BIC == better predict data that haven’t seen Mimics cross validation, but is faster to compute

31 Model TitleLLBICnumPar G -2,1754,56626 Original-1,9114,27154 Item-1,7205, Data: the Geometry Area Unit 24 students, 230 items, 15 KCs

Learning curve constrast in Physics dataset …

Not a smooth learning curve -> this knowledge component model is wrong. Does not capture genuine student difficulties.

More detailed cognitive model yields smoother learning curve. Better tracks nature of student difficulties & transfer (Few observations after 10 opportunities yields noisy data)

Best BIC (parsimonious fit) for Default (original) KC model Better than simpler Single-KC model And better than more complex Unique-step (IRT) model

Overview DataShop Overview  Logging model  DataShop Features Quantitative models of learning curves  Power law, logistic regression  Contrasting KC models Exploratory Data Analysis Exercise (start) Knowledge Component Model Editing

Exploratory Data Analysis Exercise Goals: 1) Get familiar with data 2) Learn/practice Excel skills Tasks: 1) create a “step table” 2) graph learning curves

TWO_CIRCLES_IN_SQUARE problem: Initial screen

TWO_CIRCLES_IN_SQUARE problem: An error a few steps later

TWO_CIRCLES_IN_SQUARE problem: Student follows hint & completes prob

Exported File Loaded into Excel

See handout of exercise … Do some of in next session

Overview DataShop Overview  Logging model  DataShop Features Quantitative models of learning curves  Power law, logistic regression  Contrasting KC models Exploratory Data Analysis Exercise (start) Knowledge Component Model Editing

DataShop Demo Examples of exercise KC model editing

END