HCP: A Flexible CNN Framework for Multi-Label Image Classification

Slides:



Advertisements
Similar presentations
Request Dispatching for Cheap Energy Prices in Cloud Data Centers
Advertisements

SpringerLink Training Kit
Luminosity measurements at Hadron Colliders
From Word Embeddings To Document Distances
Choosing a Dental Plan Student Name
Virtual Environments and Computer Graphics
Chương 1: CÁC PHƯƠNG THỨC GIAO DỊCH TRÊN THỊ TRƯỜNG THẾ GIỚI
THỰC TIỄN KINH DOANH TRONG CỘNG ĐỒNG KINH TẾ ASEAN –
D. Phát triển thương hiệu
NHỮNG VẤN ĐỀ NỔI BẬT CỦA NỀN KINH TẾ VIỆT NAM GIAI ĐOẠN
Điều trị chống huyết khối trong tai biến mạch máu não
BÖnh Parkinson PGS.TS.BS NGUYỄN TRỌNG HƯNG BỆNH VIỆN LÃO KHOA TRUNG ƯƠNG TRƯỜNG ĐẠI HỌC Y HÀ NỘI Bác Ninh 2013.
Nasal Cannula X particulate mask
Evolving Architecture for Beyond the Standard Model
HF NOISE FILTERS PERFORMANCE
Electronics for Pedestrians – Passive Components –
Parameterization of Tabulated BRDFs Ian Mallett (me), Cem Yuksel
L-Systems and Affine Transformations
CMSC423: Bioinformatic Algorithms, Databases and Tools
Some aspect concerning the LMDZ dynamical core and its use
Bayesian Confidence Limits and Intervals
实习总结 (Internship Summary)
Current State of Japanese Economy under Negative Interest Rate and Proposed Remedies Naoyuki Yoshino Dean Asian Development Bank Institute Professor Emeritus,
Front End Electronics for SOI Monolithic Pixel Sensor
Face Recognition Monday, February 1, 2016.
Solving Rubik's Cube By: Etai Nativ.
CS284 Paper Presentation Arpad Kovacs
انتقال حرارت 2 خانم خسرویار.
Summer Student Program First results
Theoretical Results on Neutrinos
HERMESでのHard Exclusive生成過程による 核子内クォーク全角運動量についての研究
Wavelet Coherence & Cross-Wavelet Transform
yaSpMV: Yet Another SpMV Framework on GPUs
Creating Synthetic Microdata for Higher Educational Use in Japan: Reproduction of Distribution Type based on the Descriptive Statistics Kiyomi Shirakawa.
MOCLA02 Design of a Compact L-­band Transverse Deflecting Cavity with Arbitrary Polarizations for the SACLA Injector Sep. 14th, 2015 H. Maesaka, T. Asaka,
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Fuel cell development program for electric vehicle
Overview of TST-2 Experiment
Optomechanics with atoms
داده کاوی سئوالات نمونه
Inter-system biases estimation in multi-GNSS relative positioning with GPS and Galileo Cecile Deprez and Rene Warnant University of Liege, Belgium  
ლექცია 4 - ფული და ინფლაცია
10. predavanje Novac i financijski sustav
Wissenschaftliche Aussprache zur Dissertation
FLUORECENCE MICROSCOPY SUPERRESOLUTION BLINK MICROSCOPY ON THE BASIS OF ENGINEERED DARK STATES* *Christian Steinhauer, Carsten Forthmann, Jan Vogelsang,
Particle acceleration during the gamma-ray flares of the Crab Nebular
Interpretations of the Derivative Gottfried Wilhelm Leibniz
Advisor: Chiuyuan Chen Student: Shao-Chun Lin
Widow Rockfish Assessment
SiW-ECAL Beam Test 2015 Kick-Off meeting
On Robust Neighbor Discovery in Mobile Wireless Networks
Chapter 6 并发:死锁和饥饿 Operating Systems: Internals and Design Principles
You NEED your book!!! Frequency Distribution
Y V =0 a V =V0 x b b V =0 z
Fairness-oriented Scheduling Support for Multicore Systems
Climate-Energy-Policy Interaction
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Ch48 Statistics by Chtan FYHSKulai
The ABCD matrix for parabolic reflectors and its application to astigmatism free four-mirror cavities.
Measure Twice and Cut Once: Robust Dynamic Voltage Scaling for FPGAs
Online Learning: An Introduction
Factor Based Index of Systemic Stress (FISS)
What is Chemistry? Chemistry is: the study of matter & the changes it undergoes Composition Structure Properties Energy changes.
THE BERRY PHASE OF A BOGOLIUBOV QUASIPARTICLE IN AN ABRIKOSOV VORTEX*
Quantum-classical transition in optical twin beams and experimental applications to quantum metrology Ivano Ruo-Berchera Frascati.
The Toroidal Sporadic Source: Understanding Temporal Variations
FW 3.4: More Circle Practice
ارائه یک روش حل مبتنی بر استراتژی های تکاملی گروه بندی برای حل مسئله بسته بندی اقلام در ظروف
Decision Procedures Christoph M. Wintersteiger 9/11/2017 3:14 PM
Limits on Anomalous WWγ and WWZ Couplings from DØ
Presentation transcript:

HCP: A Flexible CNN Framework for Multi-Label Image Classification Source :  IEEE Transactions on Pattern Analysis & Machine Intelligence, 2016, 38(9):1901-1907. Authors : Wei Y, Wei X, Lin M, et al. Speaker : Jiefan Tan Date : 2017/6/29 國立台灣大學資訊工程學系

Outline Introduction Proposed method Experiment Conclusions Hypotheses Extraction Training HCP Experiment Conclusions

Introduction- single-label image classification Hand-crafted features classifier Image classification +10% CNN

Introduction- single-label VS. Multi-label images Roughly aligned mis-aligned occluded

Proposed method- infrastructure of the proposed HCP

Proposed method- Hypotheses Extraction Source image Bounding boxes-BING Hypotheses-Bb Hypotheses-HS(resize) Filter out hypotheses: 1.Small area(<900 pixel) 2.High height/width (width/height)ratios(>4) High object detection recall rate Small number of hypotheses High computational efficiency

Proposed method- Training HCP pre-train (parameter initialization) single-label image image-fine-tuning(initialize final fully-connected) multi-label image hypotheses-fine-tuning hypotheses cross-hypothesis max-pooling 𝑣 (𝑗) =max⁡( 𝑣 1 𝑗 , 𝑣 2 𝑗 ,…, 𝑣 𝑚 𝑗 ) 𝑣 𝑖 𝑖=1,…,𝑚 is the vector of output j=1,…c is the 𝑗 𝑡ℎ component of 𝑣 𝑖 m is the number of images c is the number of categories

Proposed method- Training HCP cross-hypothesis max-pooling squared loss 𝐽= 1 𝑁 𝑖=1 𝑁 𝑘=1 𝑐 ( 𝑝 𝑖𝑘 − 𝑝 𝑖𝑘 ) 2 𝑝 𝑖 is the ground-truth probability vector of the 𝑖 𝑡ℎ image 𝑝 𝑖 is the predictive probability vector N is the number of images c is the number of categories

Proposed method- HCP

Experiment Datasets Shared CNN VOC 2007 (trainval/test = 5011/4952) Alex Net VGG Net

Experiment Results 1.mAP-mean of Average Precision 2.Number-hypotheses

Experiment Results

Conclusions No ground-truth bounding box information is required. Robust to noisy and/or redundant hypotheses. Can be well pre-trained by a large single-label image dataset. The HCP outputs are intrinsically multi-label prediction results.

Thank you!