A “Holy Grail” of Machine Learning

Presentation transcript:

A “Holy Grail” of Machine Learning
Input: just a data set (input features and outputs), or even just an explanation of the problem → Automated Learner → Output: a hypothesis

Ensembles
- Multiple diverse models (inductive biases) are trained on the same problem, and their outputs M1, M2, M3, ..., Mn are combined by a combining technique.
- The specific overfit of each learning model is averaged out.
- If the models are diverse (uncorrelated errors), then even if the individual models are weak generalizers, the ensemble can be very accurate.
- There are many different ensemble approaches: stacking, gating/mixture of experts, bagging, boosting, wagging, mimicking, and combinations of these.
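
As a rough illustration of the uncorrelated-errors claim, here is a minimal simulation (plain NumPy; the accuracy and ensemble size are made-up numbers): each model is a weak generalizer that is right only 65% of the time, but when mistakes are independent, a simple majority vote is right far more often.

```python
import numpy as np

rng = np.random.default_rng(0)
n_models, n_examples, p_correct = 25, 100_000, 0.65  # assumed toy values

# 1 = model answered correctly, 0 = model made an error (errors independent across models)
correct = rng.random((n_models, n_examples)) < p_correct

# Majority vote is correct whenever more than half of the models are correct
vote_correct = correct.sum(axis=0) > n_models / 2

print(f"single model accuracy:  {correct[0].mean():.3f}")   # ~0.65
print(f"majority-vote accuracy: {vote_correct.mean():.3f}")  # ~0.94 for these numbers
```

Real ensemble members never have fully independent errors, so the gain is smaller in practice, but the direction of the effect is the same.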

Bias vs. Variance
- Combining multiple trained models can average out the variance, leaving just the bias.
- Weak learners?
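
A small NumPy sketch of the variance-averaging claim (an idealized setup I'm assuming here: each fit gets its own freshly sampled training set, which bagging only approximates with bootstrap samples). An over-flexible polynomial has high variance at a test point; the average of several such fits has much lower variance, while the bias is essentially unchanged.

```python
import numpy as np

rng = np.random.default_rng(0)

def true_f(x):
    return np.sin(2 * np.pi * x)

x_train = np.linspace(0, 1, 30)
x_test = 0.5                       # evaluate bias/variance at one test point
degree, n_trials, k = 7, 500, 10   # high-variance model; ensembles average k fits

def fit_once():
    """Fit one degree-7 polynomial to a freshly sampled noisy training set."""
    y = true_f(x_train) + rng.normal(0, 0.3, size=x_train.shape)
    return np.polyval(np.polyfit(x_train, y, degree), x_test)

single = np.array([fit_once() for _ in range(n_trials)])
averaged = np.array([np.mean([fit_once() for _ in range(k)]) for _ in range(n_trials)])

print(f"single model:   bias={single.mean() - true_f(x_test):+.3f}  var={single.var():.4f}")
print(f"average of {k}: bias={averaged.mean() - true_f(x_test):+.3f}  var={averaged.var():.4f}")
```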

Bagging
- Bootstrap aggregating (bagging): each training set is chosen uniformly at random with replacement from the original data set, and all hypotheses get an equal vote.
- Bagging mostly focuses on reducing variance.
- Gives consistent, strong empirical improvement.
- Does not overfit (whereas boosting may), but may be more conservative overall in its accuracy improvements.
- Often used with the same learning algorithm for every member, and thus works best with algorithms that tend to give more diverse hypotheses based on initial random conditions.
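
A minimal bagging sketch (assuming scikit-learn is available; the synthetic data set and ensemble size are just examples): each tree is trained on a bootstrap sample drawn uniformly at random with replacement, and all trees get an equal vote.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.5, random_state=0)

n_models = 50
trees = []
for _ in range(n_models):
    # Bootstrap sample: same size as the original training set, drawn with replacement
    idx = rng.integers(0, len(X_tr), size=len(X_tr))
    trees.append(DecisionTreeClassifier().fit(X_tr[idx], y_tr[idx]))

# Equal (unweighted) vote over the ensemble
votes = np.mean([t.predict(X_te) for t in trees], axis=0)
y_pred = (votes > 0.5).astype(int)

print("single tree accuracy:", trees[0].score(X_te, y_te))
print("bagged accuracy:     ", np.mean(y_pred == y_te))
```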

Boosting
- Many variations exist.
- Boosting is more aggressive on accuracy, but in some cases it can overfit and do worse; it can theoretically converge on the training set (fit it exactly), similar to a constructive neural network (DMP).
- Boosting by resampling: each training set is chosen randomly with replacement from the original data set according to the distribution D_i, and is typically the same size as the original data set.
- Some learning algorithms can handle the weighted samples directly.
- Potential to overfit, but empirically quite good.
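
A compact AdaBoost-style sketch (one of the many boosting variants; toy data, scikit-learn assumed). Here the base learner, a depth-1 decision tree, handles the weighted samples directly via `sample_weight` rather than by resampling.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=1)
y = 2 * y - 1                                   # AdaBoost convention: labels in {-1, +1}
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.5, random_state=1)

T = 100
w = np.full(len(X_tr), 1 / len(X_tr))           # distribution D_1: uniform over examples
stumps, alphas = [], []
for _ in range(T):
    stump = DecisionTreeClassifier(max_depth=1).fit(X_tr, y_tr, sample_weight=w)
    pred = stump.predict(X_tr)
    err = np.sum(w * (pred != y_tr)) / np.sum(w)      # weighted training error
    alpha = 0.5 * np.log((1 - err) / max(err, 1e-10))
    w = w * np.exp(-alpha * y_tr * pred)              # up-weight misclassified examples
    w = w / np.sum(w)                                 # renormalize to get D_{t+1}
    stumps.append(stump)
    alphas.append(alpha)

# Final hypothesis: sign of the alpha-weighted vote of all stumps
score = sum(a * s.predict(X_te) for a, s in zip(alphas, stumps))
print("boosted accuracy:", np.mean(np.sign(score) == y_te))
```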

Ensemble Creation Approaches
- Goal: obtain less correlated errors.
- Injecting randomness: different initial weights, different learning parameters, etc.
- Different training sets: bagging, boosting, different feature subsets, etc.
- Forcing differences: different objective functions, auxiliary tasks.
- Different machine learning models.
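
One way to check whether a creation strategy is actually producing less correlated errors is to measure how often pairs of members fail on the same held-out examples. A small sketch using deliberately different model families (scikit-learn assumed; the models and data are just examples):

```python
import numpy as np
from itertools import combinations
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB

X, y = make_classification(n_samples=3000, n_features=20, random_state=2)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.5, random_state=2)

# Diversity from different machine learning models (different inductive biases)
models = {"tree": DecisionTreeClassifier(random_state=0),
          "knn": KNeighborsClassifier(),
          "logreg": LogisticRegression(max_iter=1000),
          "nb": GaussianNB()}

# Boolean error vectors: True where the model misclassifies a test example
errors = {name: (m.fit(X_tr, y_tr).predict(X_te) != y_te) for name, m in models.items()}
for a, b in combinations(errors, 2):
    both = np.mean(errors[a] & errors[b])     # how often the two members fail together
    print(f"{a:7s} vs {b:7s}: joint error rate {both:.3f} "
          f"(individual: {errors[a].mean():.3f}, {errors[b].mean():.3f})")
```

The lower the joint error rate relative to the individual error rates, the more the ensemble has to gain from combining these members.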

Ensemble Combining Approaches
- Stacking.
- Unweighted voting.
- Weighted voting: the weights can be learned (a single layer) or based on accuracy on a training set, etc.
- Gating function: uses the input features to decide which combination (weights) of expert votes to use.
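
A minimal stacking sketch (assumed setup, scikit-learn): base models are trained on one half of the training data, their predicted probabilities on the other half become the features for a single-layer learned combiner (logistic regression), which effectively learns the voting weights.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=4000, n_features=20, random_state=3)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=3)
# Hold out part of the training data so the combiner is not fit on the base models' own training set
X_base, X_meta, y_base, y_meta = train_test_split(X_tr, y_tr, test_size=0.5, random_state=3)

base_models = [DecisionTreeClassifier(random_state=0),
               KNeighborsClassifier(),
               GaussianNB()]
for m in base_models:
    m.fit(X_base, y_base)

def stack_features(X_in):
    """Each base model's predicted probability of class 1 becomes one meta-feature."""
    return np.column_stack([m.predict_proba(X_in)[:, 1] for m in base_models])

# Learned, single-layer combiner over the base models' outputs
combiner = LogisticRegression().fit(stack_features(X_meta), y_meta)

print("base model accuracies:", [round(m.score(X_te, y_te), 3) for m in base_models])
print("stacked accuracy:     ", round(combiner.score(stack_features(X_te), y_te), 3))
```

Replacing the logistic regression with a model that also takes the raw input features as meta-features moves this toward a gating/mixture-of-experts combiner.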