Strong Supervision from Weak Annotation: Interactive Training of Deformable Part Models S. Branson, P. Perona, S. Belongie.

Slides:

Advertisements

Similar presentations

Université du Québec École de technologie supérieure Face Recognition in Video Using What- and-Where Fusion Neural Network Mamoudou Barry and Eric Granger.

Advertisements

Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.

Efficient Large-Scale Structured Learning

Face Alignment at 3000 FPS via Regressing Local Binary Features

Large-Scale Object Recognition with Weak Supervision

Fast intersection kernel SVMs for Realtime Object Detection

Beyond Actions: Discriminative Models for Contextual Group Activities Tian Lan School of Computing Science Simon Fraser University August 12, 2010 M.Sc.

1 PEGASOS Primal Efficient sub-GrAdient SOlver for SVM Shai Shalev-Shwartz Yoram Singer Nati Srebro The Hebrew University Jerusalem, Israel YASSO = Yet.

1 Transfer Learning Algorithms for Image Classification Ariadna Quattoni MIT, CSAIL Advisors: Michael Collins Trevor Darrell.

Efficient and Numerically Stable Sparse Learning Sihong Xie 1, Wei Fan 2, Olivier Verscheure 2, and Jiangtao Ren 3 1 University of Illinois at Chicago,

CSC321: Introduction to Neural Networks and Machine Learning Lecture 20 Learning features one layer at a time Geoffrey Hinton.

On the Object Proposal Presented by Yao Lu

Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.

© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,

Machine learning Image source:

Machine Learning Theory Maria-Florina Balcan Lecture 1, Jan. 12 th 2010.

Online Learning Algorithms

Generic object detection with deformable part-based models

Latent Boosting for Action Recognition Zhi Feng Huang et al. BMVC Jeany Son.

Machine learning Image source:

Machine Learning Theory Maria-Florina (Nina) Balcan Lecture 1, August 23 rd 2011.

Unsupervised Learning. CS583, Bing Liu, UIC 2 Supervised learning vs. unsupervised learning Supervised learning: discover patterns in the data that relate.

Self-paced Learning for Latent Variable Models

Dual Coordinate Descent Algorithms for Efficient Large Margin Structured Prediction Ming-Wei Chang and Scott Wen-tau Yih Microsoft Research 1.

Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.

Online Learning for Collaborative Filtering

Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.

Object Detection with Discriminatively Trained Part Based Models

Boris Babenko 1, Ming-Hsuan Yang 2, Serge Belongie 1 1. University of California, San Diego 2. University of California, Merced OLCV, Kyoto, Japan.

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Tony Jebara, Columbia University Advanced Machine Learning & Perception Instructor: Tony Jebara.

Face Detection Using Large Margin Classifiers Ming-Hsuan Yang Dan Roth Narendra Ahuja Presented by Kiang “Sean” Zhou Beckman Institute University of Illinois.

Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.

Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

A Discriminatively Trained, Multiscale, Deformable Part Model Yeong-Jun Cho Computer Vision and Pattern Recognition,2008.

Strong Supervision From Weak Annotation Interactive Training of Deformable Part Models ICCV /05/23.

Learning by Loss Minimization. Machine learning: Learn a Function from Examples Function: Examples: – Supervised: – Unsupervised: – Semisuprvised:

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

Discriminative Machine Learning Topic 4: Weak Supervision M. Pawan Kumar Slides available online

Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.

Experience Report: System Log Analysis for Anomaly Detection

When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.

Recent developments in object detection

Support vector machines

The Relationship between Deep Learning and Brain Function

Object detection with deformable part-based models

CSC321: Neural Networks Lecture 22 Learning features one layer at a time Geoffrey Hinton.

Data Driven Attributes for Action Detection

Table 1. Advantages and Disadvantages of Traditional DM/ML Methods

10701 / Machine Learning.

Supervised Learning Seminar Social Media Mining University UC3M

Machine Learning Basics

CS 188: Artificial Intelligence

Object detection as supervised classification

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Group Norm for Learning Latent Structural SVMs

Students: Meiling He Advisor: Prof. Brain Armstrong

Janardhan Rao (Jana) Doppa, Alan Fern, and Prasad Tadepalli

Random Sampling over Joins Revisited

CSCI B609: “Foundations of Data Science”

On-going research on Object Detection *Some modification after seminar

Overview of Machine Learning

Approaching an ML Problem

المشرف د.يــــاســـــــــر فـــــــؤاد By: ahmed badrealldeen

Lecture 15: Data Cleaning for ML

Support vector machines

Introduction to Neural Networks

Motivation State-of-the-art two-stage instance segmentation methods depend heavily on feature localization to produce masks.

Logistic Regression Geoff Hulten.

Presentation transcript:

Strong Supervision from Weak Annotation: Interactive Training of Deformable Part Models S. Branson, P. Perona, S. Belongie

Methods by level of supervision simple models, weak annotations least effort, potentially fast learning, not best results complicated models, weak annotation state-of-the-art performance (multiple instance learning, latent parts, latent structural SVM) non-convex optimization slow training Hard to pinpoint error source (optimization error, inappropriate model or feature space, insufficient training data)

Methods by level of supervision Strong annotation Very time consuming Easy learning task (possible convex optimization) Generalization guarrantee Sensitive to quality and style of annotation Qusetion: Is it possible to have strong supervision properties with weak annotation computational efficiency?

Yes! Interactive Labelling and Online Learning 1- Model part structures with structured models 2- Bring up a new image, predict the part locations with current model Correct the wrong locations 3-Update the learned model and go to 2

Interactive Labelling + Online Learning

Interactive Labelling + Online Learning Real time detection Easy update Tree-structured deformable parts model with dynamic programming is a good choice! Online Learning Fast model updating Convex optimization Stochastic gradient descent

Related works Interactive labelling Grab cut (Segmentation) Label me video Visipedia (attributes)

Related works Active learning Intelligent computer decide which image to annotate More savings than interactive labeling In comparison to strong supervision Higher computational complexity Fewer theoretical guarantees

Model and Interactive UI

Learning framework Strong Convex formualtion Gradient computable by one inference Stochastic gradient descent Process an image at each step Pegasus!

Theoretical properties Pegasos Faster convergence rate than linear SVM Performance guarantee with the number of iterations Training time does not increase with increasing number of images (for a specific performance) Slower steps than linear svm Inference at each step Interactive labelling Loss function is defined as the number of misplaced part Number of annotation is bounded!

Some results...

Tables 50 images: 4000 images: 6.6 / 13 correction. 19.7 seconds

Conclusion Framework for large scale annotation Simulataneous learning of structured models Nice theoretical properties, seen in practice

Cross-category Object Recognition (CORE) University of Illinois at Urbana-Champaign more detailed models and for exploring cross-category generalization in object recognition

CORE – Data overview Images from ImageNet, thus coming with object hierarchy Binary attributes Pose Sorrounding context Viewpoint Etc.

CORE – data overview Ploygon labels Objects Pre-defined parts of a category

CORE – data overview Segmentation mask Materials

Quality Measures Way of collecting images Which attributes Easily annotatable Unsure buttun Quality assurance methods...