Competitions in machine learning: the fun, the art, and the science Isabelle Guyon Clopinet, Berkeley, California

Slides:



Advertisements
Similar presentations
Active Learning Challenge Active Learning Challenge Isabelle Guyon (Clopinet, California) Gavin Cawley (University of East Anglia,
Advertisements

Alexander Statnikov1, Douglas Hardin1,2, Constantin Aliferis1,3
Multi-Label Prediction via Compressed Sensing By Daniel Hsu, Sham M. Kakade, John Langford, Tong Zhang (NIPS 2009) Presented by: Lingbo Li ECE, Duke University.
SocialSync: Synchronization for a Smartphone Camera Network Richard Latimer Rice University September 7, 2014.
UCB Computer Vision Animals on the Web Tamara L. Berg CSE 595 Words & Pictures.
Can causal models be evaluated? Isabelle Guyon ClopiNet / ChaLearn
Machine Learning and Data Mining Course Summary. 2 Outline  Data Mining and Society  Discrimination, Privacy, and Security  Hype Curve  Future Directions.
Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,
Lecture 5: Causality and Feature Selection Isabelle Guyon
Data Visualization STAT 890, STAT 442, CM 462
Detecting Pedestrians by Learning Shapelet Features
Causality Workbenchclopinet.com/causality Results of the Causality Challenge Isabelle Guyon, Clopinet Constantin Aliferis and Alexander Statnikov, Vanderbilt.
Lecture 3: Introduction to Feature Selection Isabelle Guyon
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Graz University of Technology, AUSTRIA Institute for Computer Graphics and Vision Fast Visual Object Identification and Categorization Michael Grabner,
Real-time Computer Vision with Scanning N-Tuple Grids Simon Lucas Computer Science Dept.
| Alper Ortac | Computer Science Department | Ubiquitous Knowledge Processing Lab | © Prof. Dr. Iryna Gurevych | 1 Knowledge Management in Web.
A Database of Human Segmented Natural Images and Two Applications David Martin, Charless Fowlkes, Doron Tal, Jitendra Malik UC Berkeley
Feature selection methods from correlation to causality Isabelle Guyon NIPS 2008 workshop on kernel learning.
Lecture 2: Introduction to Feature Selection Isabelle Guyon
Cross Validation Framework to Choose Amongst Models and Datasets for Transfer Learning Erheng Zhong ¶, Wei Fan ‡, Qiang Yang ¶, Olivier Verscheure ‡, Jiangtao.
Feature Selection and Its Application in Genomic Data Analysis March 9, 2004 Lei Yu Arizona State University.
Lecture 6: Causal Discovery Isabelle Guyon
Computer Vision Group University of California Berkeley Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA Greg Mori and Jitendra Malik.
Authentication for Humans Rachna Dhamija SIMS, UC Berkeley DIMACS Workshop on Usable Privacy and Security Software July 7, 2004.
Statistical Learning: Pattern Classification, Prediction, and Control Peter Bartlett August 2002, UC Berkeley CIS.
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Feature selection methods Isabelle Guyon IPAM summer school on Mathematics in Brain Imaging. July 2008.
Guidelines on Statistical Analysis and Reporting of DNA Microarray Studies of Clinical Outcome Richard Simon, D.Sc. Chief, Biometric Research Branch National.
AdaBoost Robert E. Schapire (Princeton University) Yoav Freund (University of California at San Diego) Presented by Zhi-Hua Zhou (Nanjing University)
RESULTS OF THE WCCI 2006 PERFORMANCE PREDICTION CHALLENGE Isabelle Guyon Amir Reza Saffari Azar Alamdari Gideon Dror.
Causality Workbenchclopinet.com/causality Cause-Effect Pair Challenge Isabelle Guyon, ChaLearn IJCNN 2013 IEEE/INNS.
Acquisitions, a Publisher’s Perspective Andrew Wafer Lead Games Designer External Development Studio Getting past the initial selection.
Baseline Methods for the Feature Extraction Class Isabelle Guyon Best BER=1.26  0.14% - n0=1000 (20%) – BER0=1.80% GISETTE Best BER=1.26  0.14% - n0=1000.
Feature Selection and Causal discovery Isabelle Guyon, Clopinet André Elisseeff, IBM Zürich Constantin Aliferis, Vanderbilt University.
Towards Improving Classification of Real World Biomedical Articles Kostas Fragos TEI of Athens Christos Skourlas TEI of Athens
Abstract Developing sign language applications for deaf people is extremely important, since it is difficult to communicate with people that are unfamiliar.
Participation in the NIPS 2003 Challenge Theodor Mader ETH Zurich, Five Datasets were provided for experiments: ARCENE: cancer diagnosis.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
1 Unsupervised and Transfer Learning Challenge Can Machines Transfer Knowledge from Task to Task? Isabelle Guyon Clopinet, California.
Xiaoxiao Shi, Qi Liu, Wei Fan, Philip S. Yu, and Ruixin Zhu
Category Discovery from the Web slide credit Fei-Fei et. al.
Multiple Instance Real Boosting with Aggregation Functions Hossein Hajimirsadeghi and Greg Mori School of Computing Science Simon Fraser University International.
Filter + Support Vector Machine for NIPS 2003 Challenge Jiwen Li University of Zurich Department of Informatics The NIPS 2003 challenge was organized to.
Skewing: An Efficient Alternative to Lookahead for Decision Tree Induction David PageSoumya Ray Department of Biostatistics and Medical Informatics Department.
Recognizing Action at a Distance Alexei A. Efros, Alexander C. Berg, Greg Mori, Jitendra Malik Computer Science Division, UC Berkeley Presented by Pundik.
Xiangnan Kong,Philip S. Yu Multi-Label Feature Selection for Graph Classification Department of Computer Science University of Illinois at Chicago.
Manu Chandran. Outline Background and motivation Over view of techniques Cross validation Bootstrap method Setting up the problem Comparing AIC,BIC,Crossvalidation,Bootstrap.
Lecture 5: Causality and Feature Selection Isabelle Guyon
1 Unsupervised and Transfer Learning Challenge Unsupervised and Transfer Learning Challenge Isabelle Guyon Clopinet, California.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Consensus Group Stable Feature Selection
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Challenges in causality: Results of the WCCI 2008 challenge Isabelle Guyon, Clopinet Constantin Aliferis and Alexander Statnikov, Vanderbilt Univ. André.
ChaLearn HEP competitions Isabelle Guyon, ChaLearn and LRI, Université Paris Sud, Orsay for.
A Framework for Detection and Measurement of Phishing Attacks Reporter: Li, Fong Ruei National Taiwan University of Science and Technology 2/25/2016 Slide.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Unsupervised Learning of Video Representations using LSTMs
Introductory Seminar on Research: Fall 2017
What is Pattern Recognition?
Object Recognition & Detection
Bilinear Classifiers for Visual Recognition
Gesture Recognition Challenge
CLIENT ZipDial.
MILESTONE RESULTS Mar. 1st, 2007
Volodymyr Bobyr Supervised by Aayushjungbahadur Rana
Sign Language Recognition With Unsupervised Feature Learning
Presentation transcript:

Competitions in machine learning: the fun, the art, and the science Isabelle Guyon Clopinet, Berkeley, California

My Itinerary

My Kids

My Company

Recent Projects Melanoma App Drug toxicity via flow cytometry

How to keep up with the s.o.a?

Organize Challenges!

This Year: Unsupervised and Transfer Learning Challenges

Gesture and action recognition Image or video indexing/retrieval Recognition of sign languages Handwriting recognition Text processing Ecology Applications

Free registrations, cash prizes, 2 workshops (IJCNN 2011, ICML2011), proceedings in JMLR W&CP, much fun! Just starting … First ULT challenge

Large database of American Sign Language developed at Boston University ( including 3,800 signs with 3,000 unique class labels produced by native signers and 15 short narratives for a total of over 11,000 sign tokens altogether. Gesture Recognition Challenge

STEP 1: Develop a system that can learn new signs with a few examples. First screening of the competitors based on their performance of a validation dataset. STEP 2: On the site of the life competition, train the system with given “flash cards” to recognize new signs. Second screening of the competitors based on the learning curve of their system. STEP 3: Perform short sequences of signs (charades) in front of an audience. Win if your system gets the best recognition rate. Live Competition

DECEMBER 2011 NIPS 2001: Launching of UTL challenge. JUNE 2011 Workshop at CVPR 2011 (accepted) Launching Gesture Recognition challenge. JULY 2011 Workshop at ICML 2011 (planned). AUGUST 2011 Workshop at IJCNN 2011 (accepted). Results of UTL challenge. NOVEMBER 2011 Live Gesture Recognition Competition: ICCV 2011 (planned). When and Where?

Lessons learned!

Thousands to millions of low level features: select the most relevant one to build better, faster, and easier to understand learning machines. X n m n’ NIPS 2003 Feature Selection Challenge

Bioinformatics Quality control Machine vision Customer knowledge variables/features examples OCR HWR Market Analysis Text Categorization System diagnosis Applications of Feature Selection

All features Filter Feature subset Predictor All features Wrapper Multiple Feature subsets Predictor All features Embedded method Feature subset Predictor Filters, Wrappers, and Embedded Methods

1) For each feature subset, train predictor on training data. 2) Select the feature subset, which performs best on validation data. –Repeat and average if you want to reduce variance (cross-validation). 3) Test on test data. N variables/features M samples m1m1 m2m2 m3m3 Split data into 3 sets: training, validation, and test set. Bilevel Optimization

Generalization_error  Validation_error +  (C/m 2 ) m 2 : number of validation examples, N: total number of features, n: feature subset size. With high probability: n Error Try to keep C of the order of m 2. Complexity of Feature Selection

Lung Cancer SmokingGenetics Coughing Attention Disorder Allergy AnxietyPeer Pressure Yellow Fingers Car Accident Born an Even Day Fatigue WCCI 2008: Causation and Prediction Challenge

Simple univariate predictive model, binary target and features, all relevant features correlate perfectly with the target, all irrelevant features randomly drawn. With 98% confidence, abs(feat_weight) < w and  i w i x i < v. n g number of “ good ” (relevant) features n b number of “ bad ” (irrelevant) features m number of training examples. Insensitivity to Irrelevant Features

Active Learning Challenge AISTATS & WCCI 2010

Web platform: Server made available by Prof. Joachim Buhmann, ETH Zurich, Switzerland. Computer admin.: Thomas Fuchs, ETH Zurich. Webmaster: Olivier Guyon, MisterP.net, France. Protocol review and advising: David W. Aha, Naval Research Laboratory, USA. Abe Schneider, Knexus Research, USA. Graham Taylor, NYU, New-York. USA. Andrew Ng, Stanford Univ., Palo Alto, California, USA Vassilis Athitsos, University of Texas at Arlington, Texas, Usa. Ivan Laptev, INRIA, France. Jitendra Malik, UC Berkeley, California, USA Christian Vogler, ILSP Athens, Greece. Sudeep Sarkar, University of South Florida, USA. Philippe Dreuw, RWTH Aachen University, Germany. Richard Bowden, Univ. Surrey, UK. Greg Mori, Simon Fraser University, Canada. Data collection and preparation: Vassilis Athitsos, University of Texas at Arlington, Texas, USA Isabelle Guyon, Clopinet, California, USA. Graham Taylor, NYU, New-York. USA. Ivan Laptev, INRIA, France. Jitendra Malik, UC Berkeley, California, USA. Baseline methods and beta testing: The following researchers experienced in the domain will be providing baseline results: Vassilis Athitsos, University of Texas at Arlington, Texas, USA. Graham Taylor, NYU, New-York. USA. Andrew Ng, Stanford Univ., Palo Alto, California, USA. Yann LeCun, NYU. New-York, USA. Ivan Laptev, INRIA, France. The following researchers were top ranking participants in past challenges but are not experienced in the domain will also give it a try: Alexander Borisov (Intel, USA) Hugo-Jair Escalante (INAOE, México) Amir Saffari (Graz Univ., Austria) Alexander Statnikov (NYU, USA) Credits

1)Feature Extraction, Foundations and Applications I. Guyon, S. Gunn, et al. Springer, ) Challenges in Machine Learning Collection published by Microtome. Papers on the challenges reprinted from JMLR and JMLR W&CP Resources

Join the Hall of Frame!