Download presentation
Presentation is loading. Please wait.
Published byDamon Clifford Stewart Modified over 9 years ago
1
COMP24111: Machine Learning Ensemble Models Gavin Brown www.cs.man.ac.uk/~gbrown
3
The Usual Supervised Learning Approach data + labels Learning Algorithm Model Testing data Predicted label
4
Ensemble Methods data + labels Learning Algorithm “Committee” of Models Testing data Predicted label (voting or averaging) Model 1 Model 2 Model m
5
Ensemble Methods Like a “committee”…. -don’t want all models to be identical. -don’t want them to be different for the sake of it – sacrificing individual performance This is the “Diversity” trade-off. Equivalent to underfitting/overfitting – need the right level, just enough. “Committee” of Models Model 1 Model 2 Model m
6
Parallel Ensemble Methods data + labels Vote Model 1 Model 2 Model m Dataset 1 Dataset 2 Dataset m Ensemble
7
Generating a new dataset by “Bootstrapping” - sample N items with replacement from the original N
8
“Bagging” : Bootstrap AGGregatING data + labels Model 1 Dataset 1 Bootstrap: Sample N examples, with replacement, from the original N.
9
“Bagging” : Bootstrap AGGregatING data + labels Vote Model 1 Model 2 Model m Bootstrap 1 Bootstrap 2 Bootstrap m Bagged Ensemble
10
The “Bagging” algorithm
11
Model Stability Some models are almost completely unaffected by bootstrapping. These are stable models. Not so suitable for ensemble methods. Highly Stable Highly Unstable KNN Naïve Bayes Logistic Regression / Perceptrons Neural Networks Decision Trees
13
Use bagging on a set of decision trees. How can we make them even more diverse?
14
Random Forests
15
Real-time human pose recognition in parts from single depth images Computer Vision and Pattern Recognition 2011 Shotton et al, Microsoft Research Basis of Kinect controller Features are simple image properties Test phase: 200 frames per sec on GPU Train phase more complex but still parallel
16
Sequential Ensemble Methods data + labels Model 1 Model 2 Dataset 2 Model 3 Dataset 3 Model 4 Dataset 4 Each model corrects the mistakes of its predecessor.
17
Boosting – an informal description
18
Boosting
19
Ensemble Methods: Summary Treats models as a committee Two main types – dependent and independent §(boosting and bagging are two examples) Responsible for major advances in industrial ML Random Forests = Kinect Boosting = Face Recognition in Phone Cameras Tend to work extremely well “off-the-shelf” – no parameter tuning.
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.