Download presentation
Presentation is loading. Please wait.
Published byDwight Snow Modified over 8 years ago
1
1 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Can Machines Transfer Knowledge from Task to Task? Isabelle Guyon Clopinet, California http://clopinet.com/ul
2
2 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Web platform: Server made available by Prof. Joachim Buhmann, ETH Zurich, Switzerland. Computer admin.: Thomas Fuchs, ETH Zurich. Webmaster: Olivier Guyon, MisterP.net, France. Platform: Causality Wokbench. Co-orgnizers: David W. Aha, Naval Research Laboratory, USA. Gideon Dror, Academic College of Tel-Aviv Yaffo, Israel. Vincent Lemaire, Orange Research Labs, France. Graham Taylor, NYU, New-York. USA. Gavin Cawley, University of east Anglia, UK. Danny Silver, Acadiau University, Canada. Vassilis Athitsos, UT Arlington, Texas., USA. Protocol review and advising: Olivier Chapelle, Yahoo!, California, USA. Gerard Rinkus, Brandeis University, USA. Urs Mueller, Net-Scale Technilogies, USA. Yoshua Bengio, Universite de Montreal, Canada. David Grangier, NEC Labs, USA. Andrew Ng, Stanford Univ., Palo Alto, California, USA. Yann LeCun, NYU. New-York, USA. Richard Bowden, University of Surrey, UK. Philippe Dreuw, Aachen University, Germany. Ivan Laptev, INRIA, France. Jitendra Malik, UC Berkeley, USA. Greg Mori, Simon Fraser University, Canada. Christian Vogler, ILSP, Athens, Greece Data donors: Handwriting recognition (AVICENNA) -- Reza Farrahi Moghaddam, Mathias Adankon, Kostyantyn Filonenko, Robert Wisnovsky, and Mohamed Chériet (Ecole de technologie supérieure de Montréal, Quebec) contributed the dataset of Arabic manuscripts. The toy example (ULE) is the MNIST handwritten digit database made available by Yann LeCun and Corinna Costes. Object recognition (RITA) -- Antonio Torralba, Rob Fergus, and William T. Freeman, collected and made available publicly the 80 million tiny image dataset. Vinod Nair and Geoffrey Hinton collected and made available publicly the CIFAR datasets. See the techreport Learning Multiple Layers of Features from Tiny Images, by Alex Krizhevsky, 2009, for details. Human action recognition (HARRY) -- Ivan Laptev and Barbara Caputo collected and made publicly available the KTH human action recognition datasets. Marcin Marszałek, Ivan Laptev and Cordelia Schmid collected and made publicly available the Hollywood 2 dataset of human actions and scenes. Text processing (TERRY) -- David Lewis formatted and made publicly available the RCV1-v2 Text Categorization Test Collection. Ecology (SYLVESTER) -- Jock A. Blackard, Denis J. Dean, and Charles W. Anderson of the US Forest Service, USA, collected and made available the (Forest cover type) dataset. CREDITS
3
3 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul What is the problem?
4
4 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Can learning about...
5
5 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul help us learn about…
6
6 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Can learning about… publicly available data
7
7 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul help us learn about… Philip and Thomas Philip Anna Solene Anna, Thomas and GM Omar, Thomas Philip Martin Bernhard Philip Thomas personal data
8
8 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Transfer learning Philip and Thomas Philip Anna Solene Anna, Thomas and GM Omar, Thomas Philip Martin Bernhard Philip Thomas Common data representation
9
9 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul How?
10
10 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Vocabulary Target task labels Source task labels
11
11 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Vocabulary Target task labels Source task labels
12
12 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Vocabulary Target task labels Source task labels Domains the same? Labels available? Tasks the same?
13
13 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Taxonomy of transfer learning Adapted from: A survey on transfer learning, Pan-Yang, 2010. Transfer Learning Unsupervised TL Semi-supervised TL Inductive TL No labels in both source and target domains Labels avail. ONLY in source domain Labels available in target domain No labels in source domain Labels available in source domain Transductive TL Cross-task TL Same source and target task Different source and target tasks Self-taught TL Multi-task TL
14
14 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Taxonomy of transfer learning Adapted from: A survey on transfer learning, Pan-Yang, 2010. Transfer Learning Unsupervised TL Semi-supervised TL Inductive TL No labels in both source and target domains Labels avail. ONLY in source domain Labels available in target domain No labels in source domain Labels available in source domain Transductive TL Cross-task TL Same source and target task Different source and target tasks Self-taught TL Multi-task TL
15
15 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Unsupervised transfer learning
16
16 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul What can you do with NO labels? No learning at all: –Normalization of examples or features –Construction of features (e.g. products) –Generic data transformations (e.g. taking the log, Fourier transform, smoothing, etc.) Unsupervised learning: –Manifold learning to reduce dimension (and/or orthogonalize features) –Sparse coding to expand dimension –Clustering to construct features –Generative models and latent variable models
17
17 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Unsupervised transfer learning P R Source domain 1)
18
18 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Unsupervised transfer learning P 1)
19
19 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Unsupervised transfer learning P 1) P Target domain 2) Task labels C John
20
20 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Unsupervised transfer learning P Target domain C Emily
21
21 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Manifold learning PCA ICA Kernel PCA Kohonen maps Auto-encoders MDS, Isomap, LLE, Laplacian Eigenmaps Regularized principal manifolds
22
22 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Deep Learning Deep Belief Networks (stacks of Restricted Boltzmann machines) Stacks of auto-encoders Greedy layer-wise unsupervised pre-training of multi-layer neural networks and Bayesian networks, including: preprocessor reconstructor
23
23 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Clustering K-means and variants w. cluster overlap (Gaussian mixtures, fuzzy C-means) Hierarchical clustering Graph partitioning Spectral clustering
24
24 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Example: K-means Clusters of ULE valid after 5 it. Start with random cluster centers. Iterate: o Assign the examples to their closest center to form clusters. o Re-compute the centers by averaging the cluster members. Create features, e.g. f k = exp – ||x-x k ||
25
25 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Results on ULE: do better! Raw data: 784 features K-means: 20 features Current best: AUC=1, ALC=0.96 ALC=0.79ALC=0.84 AUC log2(num. tr. ex.) AUC log2(num. tr. ex.)
26
26 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Unsupervised learning (resources) Unsupervised Learning. Z. Ghahramani. http://www.gatsby.ucl.ac.uk/~zoubin/course04/ul.pdf http://www.gatsby.ucl.ac.uk/~zoubin/course04/ul.pdf Nonlinear dimensionality reduction. http://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction http://en.wikipedia.org/wiki/Nonlinear_dimensionality_reduction Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering. Y. Bengio et al. http://books.nips.cc/papers/files/nips16/NIPS2003_AA23.pdf http://books.nips.cc/papers/files/nips16/NIPS2003_AA23.pdf Data Clustering: A Review. Jain et al. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.18.2720 Why Does Unsupervised Pre-training Help DL? D. Erhan et al. http://jmlr.csail.mit.edu/papers/volume11/erhan10a/erhan10a.pdf http://jmlr.csail.mit.edu/papers/volume11/erhan10a/erhan10a.pdf Efficient sparse coding algorithms. H. Lee et al. http://www.eecs.umich.edu/~honglak/nips06- sparsecoding.pdf http://www.eecs.umich.edu/~honglak/nips06- sparsecoding.pdf
27
27 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Taxonomy of transfer learning Adapted from: A survey on transfer learning, Pan-Yang, 2010. Transfer Learning Unsupervised TL Semi-supervised TL Inductive TL No labels in both source and target domains Labels avail. ONLY in source domain Labels available in target domain No labels in source domain Labels available in source domain Transductive TL Cross-task TL Same source and target task Different source and target tasks Self-taught TL Multi-task TL
28
28 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Cross-task transfer learning
29
29 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul How can you do it? Data representation learning: –Deep neural networks –Deep belief networks (re-use the internal representation created by the hidden units and/or output units) Similarity or kernel learning: –Siamese neural networks –Graph-theoretic methods
30
30 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Data representation learning Source task labels P C Source domain Sea 1)
31
31 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Data representation learning P 1)
32
32 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Data representation learning P 1) Target task labels P C Target domain John 2)
33
33 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul P C Target domain Emily Data representation learning
34
34 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Kernel learning P S Source domain P Source task labels same or different 1)
35
35 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Kernel learning P 1)
36
36 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Kernel learning P 1) Target task labels P C Target domain John 2)
37
37 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul P C Target domain Emily Kernel learning
38
38 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Cool results in cross-task transfer learning NLP (almost) from scratch. Collobert et al. 2011, submitted to JMLR Source task Target tasks pos=Part-Of-Speech tagging chunk=Chunking ner=Named Entity Recognition srl=Semantic Role Labeling Genuine or not
39
39 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Cross-task transfer (resources) A Survey on Transfer Learning. Pan and Yang. http://www1.i2r.a- star.edu.sg/~jspan/publications/TLsurvey_0822.pdfhttp://www1.i2r.a- star.edu.sg/~jspan/publications/TLsurvey_0822.pdf Distance metric learning: A comprehensive survey. Yang-Jin. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.91.47 32 http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.91.47 32 Signature Verification using a "Siamese" Time Delay Neural Network. Bromley et al. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.28.4792 http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.28.4792 Learning the kernel matrix with semi-definite programming, Lanckriet et al. http://jmlr.csail.mit.edu/papers/volume5/lanckriet04a/lanckriet04 a.pdf http://jmlr.csail.mit.edu/papers/volume5/lanckriet04a/lanckriet04 a.pdf NLP (almost) from scratch. Collobert et al. 2011, http://leon.bottou.org/morefiles/nlp.pdf. http://leon.bottou.org/morefiles/nlp.pdf
40
40 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Taxonomy of transfer learning Adapted from: A survey on transfer learning, Pan-Yang, 2010. Transfer Learning Unsupervised TL Semi-supervised TL Inductive TL No labels in both source and target domains Labels avail. ONLY in source domain Labels available in target domain No labels in source domain Labels available in source domain Transductive TL Cross-task TL Same source and target task Different source and target tasks Self-taught TL Multi-task TL
41
41 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Multi-task learning
42
42 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Multi-task learning Source task labels P C Source domain Sea Target task labels Target domain John
43
43 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Multi-task learning P C Target domain Emily
44
44 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Cool results in multi-task learning One-Shot Learning with a Hierarchical Nonparametric Bayesian Model, Salakhutdinov-Tenenbaum-Torralba, 2010
45
45 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Taxonomy of transfer learning Adapted from: A survey on transfer learning, Pan-Yang, 2010. Transfer Learning Unsupervised TL Semi-supervised TL Inductive TL No labels in both source and target domains Labels avail. ONLY in source domain Labels available in target domain No labels in source domain Labels available in source domain Transductive TL Cross-task TL Same source and target task Different source and target tasks Self-taught TL Multi-task TL
46
46 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Self-taught learning
47
47 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Self-taught learning P C Source domain Target task labels Target domain John
48
48 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Self-taught learning P C Target domain Emily
49
49 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Cool results in self-taught learning Source task Target task Unsupervised Semi-supervised Multi-task Self-taught Self-taught learning. R. Raina et al. 2007
50
50 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Inductive transfer learning (resources) Multitask learning. R. Caruana. http://www.cs.cornell.edu/~caruana/mlj97.pdf http://www.cs.cornell.edu/~caruana/mlj97.pdf Learning deep architectures for AI. Y. Bengio. http://www.iro.umontreal.ca/~lisa/pointeurs/TR1312.pdf http://www.iro.umontreal.ca/~lisa/pointeurs/TR1312.pdf Transfer Learning Techniques for Deep Neural Nets. S. M. Gutstein thesis. http://robust.cs.utep.edu/~gutstein/sg_home_files/thesis.pdf http://robust.cs.utep.edu/~gutstein/sg_home_files/thesis.pdf One-Shot Learning with a Hierarchical Nonparametric Bayesian Model. R. Salakhutdinov et al. http://dspace.mit.edu/bitstream/handle/1721.1/60025/ MIT-CSAIL-TR-2010-052.pdf?sequence=1 http://dspace.mit.edu/bitstream/handle/1721.1/60025/ MIT-CSAIL-TR-2010-052.pdf?sequence=1 Self-taught learning. R. Raina et al. http://www.stanford.edu/~rajatr/papers/icml07_SelfTau ghtLearning.pdf http://www.stanford.edu/~rajatr/papers/icml07_SelfTau ghtLearning.pdf
51
51 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Dec 2010-April 2011 http://clopinet.com/ul Goal: Learning data representations or kernels. Phase 1: Unsupervised learning (until Feb. 28) Phase 2: Cross-task transfer learning (from Mar. 1) Prizes: $6000 + free registrations + travel awards Dissemination: Workshops at ICML and IJCNN; proc. in JMLR W&CP. Evaluators Challenge target task labels Challenge data Validation data Development data Validation target task labels Source task labels Competitors Data represen- tations
52
52 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul July 2011, ICML - Dec 2011, NIPS http://clopinet.com/tl Multi-task learning setting: - Synthetic, Real-world - Supervised learning - Binary classification problems. - 5-10 secondary tasks, 1 primary - Impoverished primary task data in development set - Diversity of tasks with varying degree of relatedness to primary task Target task challenge labels Challenge data (target only) Validation data (target only) Development Data (source + target data) Target task validation labels All task labels Competitors Predic- tions
53
53 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul STEP 1: Develop a “generic” sign language recognition system that can learn new signs with a few examples. STEP 2: At conference: teach the system new signs. STEP 3: Live evaluation in front of audience. June 2011-June. 2012 http://clopinet.com/gshttp://clopinet.com/gs (in preparation) Challenge
54
54 Unsupervised and Transfer Learning Challenge http://clopinet.com/ul Conclusion Transfer learning algorithms offer solutions to problems in which – a lot of training samples are available for a source task, – fewer training samples are available for a similar but different target task. We stated a program of challenges featuring problems in which transfer learning is applicable.
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.