Predicting Students Drop Out: A Case Study
Gerben Dekker, Mykola Pechenizkiy and Jan Vleeshouwers


The Case Study
- Educational Data Mining in a practical setting
- Directed to a student advice procedure
- Eindhoven University of Technology, Electrical Engineering department

The Case Study: advice procedure (July 2009)
[Figure: timeline of the advice procedure from September to January. Labels: pre-university student information; exam results; EXAMS; HOLIDAY; EXAMS; exam results; talks with students etc.; 30%; 70%; ADVICE STUDENTS; DEADLINE.]

Outline
- CRISP-DM Framework
- Understanding of context
- Data understanding
- Data preparation
- Modeling
- Evaluation
- Deployment
- Conclusions and further work

CRISP-DM Framework
- Understanding of context
- Data understanding
- Data preparation
- Modeling
- Evaluation
- Deployment

Understanding of context
Situation at Electrical Engineering, Eindhoven University of Technology:
- 40% dropout rate, small inflow
- Decision to drop out preferably made before the end of January
- Study advice given by the student counselor
Objective for the department:
- More robust and objective advice

Understanding of context
In data mining terms:
- Build a model for the academic success of a student
- Based on the currently available information: only information until December of the year of enrollment
Objective for research: try out the applicability of EDM in this context:
- Enough data (amount)?
- Enough data (type)?

Data understanding
Data source: the institution's database
- Pre-university data
- University data
Resulting data:
- Data from 648 students, from

Data preparation (pre-university data)
Standard preparatory education:
- Number of courses
- Type of courses taken
- Average grades for total, science, and math
Non-standard previous education:
- Type
- Grade

Data preparation (university data)
- Courses, grades, number of attempts
- Many transformations needed: reorganizations, partial exams
- Example: Calculus : 1 examination : 2 partial examinations : 5 partial examinations, or 1 examination.

Modeling (general)
- Classification task: two-class classification
- Criterion: finish all first-year courses within three years
- Several mining techniques applied: decision trees (+ ensembles), Bayesian classifiers, association rules
- University and pre-university data modeled separately first
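The two-class criterion on this slide can be sketched as a labeling function. This is a minimal sketch, not the authors' code: the function name `is_successful`, the input layout (a mapping from course name to the study year in which it was passed) and the course names are assumptions made for illustration.

```python
from typing import Dict, List, Optional


def is_successful(
    first_year_courses: List[str],
    passed_in_year: Dict[str, Optional[int]],
    max_years: int = 3,
) -> int:
    """Return 1 if every first-year course was passed within max_years, else 0.

    passed_in_year maps a course name to the study year (1, 2, 3, ...) in
    which the student passed it, or None if it was never passed.
    The data layout is a hypothetical one chosen for this sketch.
    """
    for course in first_year_courses:
        year = passed_in_year.get(course)
        # A missing or never-passed course, or one passed too late, fails the criterion.
        if year is None or year > max_years:
            return 0
    return 1


# Hypothetical course list and student record.
courses = ["Calculus", "Linear Algebra"]
label = is_successful(courses, {"Calculus": 1, "Linear Algebra": 3})
print(label)
```

A student who passes the last first-year course in year four, or never passes it, gets label 0 under this reading of the criterion.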

Modeling (pre-university data)
- Baseline model: one-rule classifier, 68% accuracy using Science_mean
- No significant improvement using other classification techniques
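For readers unfamiliar with the baseline, a one-rule (OneR) classifier picks the single attribute whose value-to-majority-class rule makes the fewest training errors. The sketch below assumes nominal attribute values; the attribute names and the toy records are invented for illustration and are not the study's data.

```python
from collections import Counter, defaultdict
from typing import Dict, Hashable, List, Tuple


def one_r(
    rows: List[Dict[str, Hashable]], labels: List[int]
) -> Tuple[str, Dict[Hashable, int], float]:
    """Return (best_attribute, rule, training_accuracy) for nominal attributes.

    For each attribute, every observed value is mapped to the majority class
    among the records with that value; the attribute with the highest
    training accuracy wins.
    """
    best = None
    for attr in rows[0]:
        # Count class labels per attribute value.
        counts = defaultdict(Counter)
        for row, y in zip(rows, labels):
            counts[row[attr]][y] += 1
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        correct = sum(c[rule[value]] for value, c in counts.items())
        accuracy = correct / len(labels)
        if best is None or accuracy > best[2]:
            best = (attr, rule, accuracy)
    return best


# Toy records with hypothetical attributes (not the study's data).
rows = [
    {"science_mean": "good", "math_level": "B"},
    {"science_mean": "good", "math_level": "A"},
    {"science_mean": "poor", "math_level": "B"},
    {"science_mean": "poor", "math_level": "A"},
    {"science_mean": "avg", "math_level": "B"},
    {"science_mean": "avg", "math_level": "A"},
]
labels = [1, 1, 0, 0, 0, 1]
attribute, rule, accuracy = one_r(rows, labels)
print(attribute, rule, round(accuracy, 2))
```

Because the rule uses one attribute only, it makes a natural floor against which decision trees and other models can be compared.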

Modeling (university data)
- Baseline model: one-rule classifier, 75% accuracy using Linear algebra AB
- Significant improvements using other models (80%)
- Decision trees slightly better than other models

Modeling (total set)
- Accuracy of 80%, using attributes from both subsets
- Improvements using cost matrices: shape the misclassification
- Small trade-offs between accuracy and misclassification:
  - Accuracy 79%, 52% of errors FP
  - Accuracy 76%, 41% of errors FP
- Similarities between models:
  - Linear Algebra AB always the root node
  - Science Mean always high in the tree
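The two quantities reported on this slide, accuracy and the share of errors that are false positives, follow directly from a confusion matrix, and a cost matrix simply weights the two error types differently. The sketch below shows the arithmetic; the function name, the counts and the cost values are hypothetical and are not taken from the study.

```python
from typing import Tuple


def summarize(
    tp: int, fp: int, tn: int, fn: int,
    cost_fp: float = 1.0, cost_fn: float = 1.0,
) -> Tuple[float, float, float]:
    """Return (accuracy, share of errors that are FP, total misclassification cost)."""
    total = tp + fp + tn + fn
    errors = fp + fn
    accuracy = (tp + tn) / total
    # Guard against a perfect classifier with no errors at all.
    fp_share = fp / errors if errors else 0.0
    total_cost = fp * cost_fp + fn * cost_fn
    return accuracy, fp_share, total_cost


# Hypothetical counts: 100 students, 20 misclassified, 12 of them false positives.
accuracy, fp_share, cost = summarize(tp=45, fp=12, tn=35, fn=8, cost_fp=2.0, cost_fn=1.0)
print(accuracy, fp_share, cost)
```

Raising `cost_fp` relative to `cost_fn` during training pushes a cost-sensitive learner toward fewer false positives, usually at a small loss in accuracy, which is the kind of trade-off the slide reports.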

Modeling (decision tree)
[Figure: decision tree with root node LinAlgAB (split at 5.5), further nodes CalcA (split at 5.15) and VWO_Sc_mean (branches {good, excellent} and {n/a, poor, avg, above avg}), and leaves 1 and 0. 79% accuracy.]

Evaluation
Detailed manual analysis by the student counselor:
- Review the classification measure:
  - 25% of false negatives should be true negatives
  - How to classify skilled people who leave?
- Improve data transformations

Deployment
Objectives:
- More robust and objective advice: 80% accuracy is possible, clear directions for improvements.
- Try out the applicability of EDM in this context:
  - Enough data (amount)? Yes, and more is not easily obtainable.
  - Enough data (type)? Additional types would probably be very useful, but costly.
Deployment possible after improvements

Conclusions and further work
- EDM can help in a study advice process: 80% accuracy is possible, clear directions for improvements.
- EDM can work using small datasets and a limited number of data categories
Further work:
- Improve data transformations
- Improve classification measure: better two-class, move to three-class
- Review use of additional data

Questions?