Educational Data Mining and DataShop John Stamper Carnegie Mellon University 1 9/12/2012 PSLC Corporate Partner Meeting 2012.

Presentation transcript:


2 The Classroom of the Future
Which picture represents the “Classroom of the Future”?

3 The Classroom of the Future
The answer is both!
Depends on how much money you have...
… but maybe not what you think…

4 The Classroom of the Future
Rich vs. Poor
– Poor kids will be forced to rely on “cheap” technology
– Rich kids will have access to “expensive” teachers
We are seeing this today!
– Waldorf school in Silicon Valley: no technology
– NGLC Wave III Grants
– MOOCs (AI Course at Stanford)
– Growth of adaptive technology companies
– Online instruction
– … and more…

5 What does this mean?
My view is that we cannot stop this; I believe we must accept that economics will force this route.
We should focus on improving learning technology
New ways to improve teacher-student access
Add more adaptive features to learning software
Intelligent Tutors, at scale, using data!

Educational Data Mining
“Educational Data Mining is an emerging discipline, concerned with developing methods for exploring the unique types of data that come from educational settings, and using those methods to better understand students, and the settings which they learn in.” –

Classes of EDM Methods (Baker & Yacef, 2009)
Prediction
Clustering
Relationship Mining
Discovery with Models
Distillation of Data for Human Judgment

Prediction
Develop a model which can infer a single aspect of the data (predicted variable) from some combination of other aspects of the data (predictor variables)
Does a student know a skill?
Which students are off-task?
Which students will fail the class?
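Note on "Does a student know a skill?": one widely used way to answer this is Bayesian Knowledge Tracing (BKT). The slide does not name a method, so BKT is brought in here only as an illustration. The sketch below updates the estimated probability that a student knows a skill after each observed response. The function names and the guess, slip, learn, and initial-knowledge values are assumptions chosen for the example, not fitted parameters.

```python
def bkt_update(p_know, correct, guess=0.2, slip=0.1, learn=0.15):
    """Return P(skill known) after one observed response.

    guess, slip, and learn are hypothetical parameter values.
    """
    if correct:
        # Correct answer: either the skill is known and no slip occurred,
        # or the skill is unknown and the student guessed.
        num = p_know * (1 - slip)
        den = num + (1 - p_know) * guess
    else:
        # Wrong answer: either a slip, or the skill is unknown and no lucky guess.
        num = p_know * slip
        den = num + (1 - p_know) * (1 - guess)
    posterior = num / den
    # The student may also learn the skill at this opportunity.
    return posterior + (1 - posterior) * learn


def trace(responses, p_init=0.3, **params):
    """Run the update over a sequence of True/False responses."""
    p = p_init
    history = []
    for r in responses:
        p = bkt_update(p, r, **params)
        history.append(p)
    return history


# Hypothetical response sequence for one student on one skill.
estimates = trace([False, True, True, True])
```

The predicted variable here is the hidden "knows the skill" state; the predictor variables are the student's earlier responses on that skill.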

Clustering
Find points that naturally group together, splitting full data set into set of clusters
Usually used when nothing is known about the structure of the data
– What behaviors are prominent in domain?
– What are the main groups of students?
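Note on finding "the main groups of students": a minimal k-means sketch, offered as one common clustering algorithm rather than the one used in any particular study. The student features (error rate, hints requested) and their values are made up for illustration, and the initialization from the first k points is a deliberate simplification.

```python
import math


def kmeans(points, k, iterations=20):
    """Very small k-means: returns (assignment, centroids)."""
    # Naive deterministic start: use the first k points as centroids.
    centroids = [points[i] for i in range(k)]
    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assign each point to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return assignment, centroids


# Hypothetical per-student features: (error rate, hints requested).
students = [(0.10, 1), (0.90, 8), (0.15, 2), (0.85, 9)]
groups, centers = kmeans(students, k=2)
```

In practice the features would usually be rescaled first so that no single feature dominates the distance.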

Relationship Mining
Discover relationships between variables in a data set with many variables
– Association rule mining
– Correlation mining
– Sequential pattern mining
– Causal data mining
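Note on association rule mining: rules of the form "if A then B" are typically scored by support and confidence. The sketch below computes both for a single rule. The session data and behavior labels are invented for the example.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate P(consequent | antecedent) from the transactions."""
    ante = support(transactions, antecedent)
    both = support(transactions, set(antecedent) | set(consequent))
    return both / ante if ante else 0.0


# Hypothetical sessions, each a set of observed behaviors.
sessions = [
    {"hint_abuse", "error"},
    {"hint_abuse", "error"},
    {"hint_abuse"},
    {"error"},
]
rule_conf = confidence(sessions, {"hint_abuse"}, {"error"})
```

A full association rule miner would enumerate candidate item sets and keep only rules above chosen support and confidence thresholds; this sketch scores one rule only.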

Discovery with Models
Pre-existing model (developed with EDM prediction methods… or clustering… or knowledge engineering)
Applied to data and used as a component in another analysis

Distillation of Data for Human Judgment
Making complex data understandable by humans to leverage their judgment
Text replays are a simple example of this

Knowledge Engineering
Creating a model by hand rather than automatically fitting model
In one comparison, leads to worse fit to gold-standard labels of construct of interest than data mining (Roll et al., 2005), but similar qualitative performance

LearnLab
The LearnLab has played a pivotal role in the creation of the EDM community
The CMDM thrust of the center focuses on Educational Data Mining
DataShop is also a key tool for the EDM community

DataShop
Open repository for educational data
Many large-scale datasets both public and private
Tools for
– exploratory data analysis
– learning curves
– domain model testing
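Note on learning curves: a learning curve plots error rate against the number of opportunities a student has had to practice a skill. The sketch below computes those points from an ordered log of steps. It is an illustration under an assumed record layout (student, skill, correct), not DataShop's implementation or export format.

```python
from collections import defaultdict


def learning_curve(steps):
    """Map opportunity number -> error rate across students and skills.

    `steps` is an ordered list of (student, skill, correct) tuples.
    """
    seen = defaultdict(int)                 # (student, skill) -> opportunities so far
    totals = defaultdict(lambda: [0, 0])    # opportunity -> [errors, attempts]
    for student, skill, correct in steps:
        seen[(student, skill)] += 1
        opp = seen[(student, skill)]
        totals[opp][1] += 1
        if not correct:
            totals[opp][0] += 1
    return {opp: errs / n for opp, (errs, n) in sorted(totals.items())}


# Hypothetical log: two students practicing one skill three times each.
log = [
    ("s1", "add", False), ("s2", "add", False),
    ("s1", "add", True),  ("s2", "add", False),
    ("s1", "add", True),  ("s2", "add", True),
]
curve = learning_curve(log)
```

A curve that falls as opportunities increase suggests students are learning the skill as modeled; a flat or jagged curve is one signal that the domain model may need revising.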

DataShop
Import/Export of data
Custom fields
Easy Knowledge Model creation and validation
Web services for tools integration

Demo

Engaging the KDD/ICDM Community
Some hesitation from these groups
– Educational data not interesting
– Too applied
– Not “big” enough for eScience
This was one motivation for the 2010 KDD Cup

KDD Cup Competition
Knowledge Discovery and Data Mining (KDD) is the most prestigious conference in the data mining and machine learning fields
KDD Cup is the premier data mining challenge
2010 KDD Cup called “Educational Data Mining Challenge”
Ran from April 2010 through June 2010

KDD Cup Competition
Competition goal is to predict student responses given tutor data provided by Carnegie Learning

Dataset            Students  Steps       File size
Algebra I          …,310     9,426,966   3 GB
Bridge to Algebra  …,043     20,768,…    … GB
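Note on scoring predictions of student responses: the slide does not state how submissions were scored. A common choice for this kind of task, assumed here for illustration, is root mean squared error (RMSE) between the predicted probability of a correct response and the actual 0/1 outcome. The function name and sample values are hypothetical.

```python
import math


def rmse(predicted, actual):
    """Root mean squared error between two equal-length sequences."""
    if not predicted or len(predicted) != len(actual):
        raise ValueError("need two non-empty sequences of equal length")
    squared = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return math.sqrt(squared / len(actual))


# Hypothetical predictions (probability correct) and outcomes (1 = correct).
score = rmse([0.5, 0.5], [1, 0])
```

Lower is better: a perfect predictor scores 0, and always predicting 0.5 on 0/1 outcomes scores 0.5.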

9/12/2012 PSLC Corporate Partner Meeting KDD Cup Competition  655 registered participants  130 participants who submitted predictions  3,400 submissions

9/12/2012 PSLC Corporate Partner Meeting KDD Cup Competition  Advances in prediction and cognitive modeling  Excitement in the KDD Community  The datasets are now in the “wild” and showing up in non KDD conferences  New competitions have been done and are in the works

Opportunities
Huge potential for EDM and DataShop to improve educational systems
DataShop is open and staff is available to help get users started
Great option for creating capstone projects

EDM Community is Online!
EDM 2013 in Memphis, TN, in July
Questions: