Using DataShop Tools to Model Students Learning Statistics Marsha C. Lovett Eberly Center & Psychology Acknowledgements to: Judy Brooks, Ken Koedinger,

Slides:



Advertisements
Similar presentations
AP Statistics Course Review.
Advertisements

Robert delMas, Joan Garfield, and Ann Ooms University of Minnesota
Educational data mining overview & Introduction to Exploratory Data Analysis Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction.
Statistics Measures of Regression and Prediction Intervals.
Describing Relationships Using Correlation and Regression
Model Assessment, Selection and Averaging
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Addressing the Testing Challenge with a Web-Based E - Assessment System that Tutors as it Assesses Nidhi Goel Course: CS 590 Instructor: Prof. Abbott.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
Correlation and Regression Analysis
Educational data mining overview & Introduction to Exploratory Data Analysis with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer.
Robert delMas (Univ. of Minnesota, USA) Ann Ooms (Kingston College, UK) Joan Garfield (Univ. of Minnesota, USA) Beth Chance (Cal Poly State Univ., USA)
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
PSLC DataShop Introduction Slides current to DataShop version John Stamper DataShop Technical Director.
Cognitively Informed Analytics to Improve Teaching and Learning Marsha C. Lovett, Ph.D. Director, Eberly Center for Teaching Excellence Teaching Professor,
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Quantitative Skills 1: Graphing
+ Chapter 2: Modeling Distributions of Data Section 2.2 Normal Distributions The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
CHAPTER 7: Exploring Data: Part I Review
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Statistical Analysis Topic – Math skills requirements.
Rational/Theoretical Cognitive Task Analysis Ken Koedinger Key reading: Zhu, X., & Simon, H. A. (1987). Learning mathematics from examples and by doing.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Educational Data Mining: Discovery with Models Ryan S.J.d. Baker PSLC/HCII Carnegie Mellon University Ken Koedinger CMU Director of PSLC Professor of Human-Computer.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Statistical Analysis Topic – Math skills requirements.
AP Statistics Semester One Review Part 1 Chapters 1-3 Semester One Review Part 1 Chapters 1-3.
Applying the Redundancy Principle ( Chapter 7) And using e-learning data for CTA Ken Koedinger 1.
Ch 5 Relationships Between Quantitative Variables (pg 150) –Will use 3 tools to describe, picture, and quantify 1) scatterplot 2) correlation 3) regression.
Data mining with DataShop Ken Koedinger CMU Director of PSLC Professor of Human-Computer Interaction & Psychology Carnegie Mellon University.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Midterm Review IN CLASS. Chapter 1: The Art and Science of Data 1.Recognize individuals and variables in a statistical study. 2.Distinguish between categorical.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
Howard Community College
Introductory Psychology: Statistical Analysis
Chapter 2: Modeling Distributions of Data
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Sampling Distributions
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Unit 2 Quantitative Interpretation of Correlation
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Mean and Median.
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Presentation transcript:

Using DataShop Tools to Model Students Learning Statistics Marsha C. Lovett Eberly Center & Psychology Acknowledgements to: Judy Brooks, Ken Koedinger, Oded Meyer, John Rinderle, Ross Strader, Candace Thille, and many others at the PSLC

Goal: specify knowledge components that best capture how students learn Introductory Statistics Possible Knowledge Components (KCs) What counts as a quantitative variable What graphic to use to display 2 quantitative variables How & when to compute correlation Why build, test, & refine KC models? Understand the nature of the domain Develop better instruction, e.g., cognitive tutors Others? Case Study: Introductory Statistics

Basic Cognitive Task Analysis (CTA) My KC modeling using DataShop Tools My applications of the KC modeling results DataShop Demo Plan for this session

Identify the set of knowledge components (KCs) required for good performance in a task domain Knowledge components can be skills, facts, even metacognitive knowledge (cf. facets, production rules) Examples: Area of a circle = π r 2 To find area of a ring, subtract smaller circle’s area from larger circle’s area To find area of a frame, subtract smaller rectangle’s area from larger rectangle’s area Cognitive Task Analysis Goal: specify knowledge components that capture how students learn in the domain

Cognitive Task Analysis (cont’d) But many knowledge decompositions are possible…

As students practice a given skill, performance improves with marginally decreasing returns Students’ Learning Curves Errors Practice

As students practice a given skill, performance improves with marginally decreasing returns Students’ Learning Curves Errors Practice A KC-model specifies the elements being practiced and maps them to data

KC: how to describe a quantitative variable’s distribution Sample from Instructor Model This KC is practiced in six activities, so we organize error data for those activities

(cf., Lovett & Chang, 2007) Sample from Instructor Model Error data for activities corresponding to this KC do not show the predicted learning curve KC: how to describe a quantitative variable’s distribution

(cf., Lovett & Chang, 2007) Same Sample from Cognitive Model Interpret histogram Interpret boxplot Interpret table

Organizing the data according to these KCs, we see the learning Same Sample from Cognitive Model KC’s:

Averaged learning curve across three KCs matches the predicted learning curve Same Sample from Cognitive Model

Compare two different KC models:  Based on instructor’s course design  Based on cognitive task analysis Compare models based on their fit to the data, using Bayesian Information Criterion (BIC) BIC = - maximized log likelihood of model under data, so lower is better (like χ 2 ) BIC penalizes for model complexity (like adjusted- R 2 ) Result preview: CTA model has better fit, even accounting for somewhat greater complexity My CTA Research with DataShop

Some KC’s learning curves smooth, but many are not Instructor’s KC model Apply linear regression to make predictions Identify type/features of study design Describe the distribution of a single variable Identify sampling method used in a study

Still room for refinement, but smoother learning curves Cognitive KC Model Results Average over all KCs

Cognitive KC model includes splits Describe relationship btn 2 quant variables Estimate r Interpret scatterplot Instructor’s model Cognitive model

Cognitive KC model includes merges Check axes on graph Check axes (qual vars) Check axes (quant vars) Instructor’s model Cognitive model

Cognitive KC model revealed insufficient practice given to several KCs Identifying lurking variables Recognizing Simpson’s paradox Applying the standard deviation rule Identifying outliers Computing conditional proportions … Student performance on those KCs was poor Instructor’s KC model did not reveal these deficiencies nor were course designers/instructors aware KC model results

Before KC modeling, 379 activities in the course 65 new activities added to target “challenging” KC’s Learning objectives for the course revised as well: 12 learning objectives added 3 learning objectives deleted KC model results offer feedback to course designers KC modeling guides instructional design

Typically, instructors track students’ learning via graded activities, e.g., distribution of exam scores KC Model Offers Feedback to Instructors Implications are rather coarse Results often too late, after unit is completed KC model enables KC-specific predictions of students’ learning during a unit, in time for instructors to adjust

Instructor’s Digital Dashboard for Learning v 2.0

DataShop is at pslcdatashop.org