A New Method for Automatic Clothing Tagging Utilizing Image-Click-Ads Introduction Conclusion Can We Do Better to Reduce Workload?

Slides:



Advertisements
Similar presentations
POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.
Advertisements

O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Indoor Segmentation and Support Inference from RGBD Images Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus.
3rd Workshop On Semantic Perception, Mapping and Exploration (SPME) Karlsruhe, Germany,2013 Semantic Parsing for Priming Object Detection in RGB-D Scenes.
A Unified Framework for Context Assisted Face Clustering
Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance Dhruv Batra, Carnegie Mellon University Adarsh Kowdle, Cornell.
From Interactive to Semantic Image Segmentation Varun Gulshan Supervisors: Prof. Andrew Blake Prof. Andrew Zisserman 20 Jan 2012.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Hongliang Li, Senior Member, IEEE, Linfeng Xu, Member, IEEE, and Guanghui Liu Face Hallucination via Similarity Constraints.
LARGE-SCALE IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill road building car sky.
Structured Hough Voting for Vision-based Highway Border Detection
Patch to the Future: Unsupervised Visual Prediction
Parsing Clothing in Fashion Photographs
INTRODUCTION Heesoo Myeong, Ju Yong Chang, and Kyoung Mu Lee Department of EECS, ASRI, Seoul National University, Seoul, Korea Learning.
Proportion Priors for Image Sequence Segmentation Claudia Nieuwenhuis, etc. ICCV 2013 Oral.
Unsupervised Transfer Classification Application to Text Categorization Tianbao Yang, Rong Jin, Anil Jain, Yang Zhou, Wei Tong Michigan State University.
A Graphical Model For Simultaneous Partitioning And Labeling Philip Cowans & Martin Szummer AISTATS, Jan 2005 Cambridge.
1. Introduction Humanising GrabCut: Learning to segment humans using the Kinect Varun Gulshan, Victor Lempitksy and Andrew Zisserman Dept. of Engineering.
Assuming normally distributed data! Naïve Bayes Classifier.
Li-Jia Li Yongwhan Lim Li Fei-Fei Chong Wang David M. Blei B UILDING AND U SING A S EMANTIVISUAL I MAGE H IERARCHY CVPR, 2010.
Robust Higher Order Potentials For Enforcing Label Consistency
LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.
Rodent Behavior Analysis Tom Henderson Vision Based Behavior Analysis Universitaet Karlsruhe (TH) 12 November /9.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
Presented by Zeehasham Rasheed
Scalable Text Mining with Sparse Generative Models
Entropy and some applications in image processing Neucimar J. Leite Institute of Computing
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Thien Anh Dinh1, Tomi Silander1, Bolan Su1, Tianxia Gong
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Experiments Test different parking lot images captured in different luminance conditions The test samples include 1300 available parking spaces and 1500.
INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,
Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.
Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Saliency Aggregation: A Data- driven Approach Long Mai Yuzhen Niu Feng Liu Department of Computer Science, Portland State University Portland, OR,
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
A Free Software tool for Automatic Tuning of Segmentation Parameters SPT 3.0 Pedro Achanccaray, Victor Ayma, Luis Jimenez, Sergio Garcia, Patrick Happ,
DISCRIMINATIVELY TRAINED DENSE SURFACE NORMAL ESTIMATION ANDREW SHARP.
Edge Preserving Spatially Varying Mixtures for Image Segmentation Giorgos Sfikas, Christophoros Nikou, Nikolaos Galatsanos (CVPR 2008) Presented by Lihan.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Automated GrowCut for Segmentation of Endoscopic Images Farah Deeba, Francis M. Bui, Khan A. Wahid Department Of Electrical And Computer Engineering University.
Experience Report: System Log Analysis for Anomaly Detection
Hybrid Deep Learning for Reflectance Confocal Microscopy Skin Images
A. M. R. R. Bandara & L. Ranathunga
Finding Things: Image Parsing with Regions and Per-Exemplar Detectors
Liang Lin, Shuicheng Yan
Wei Wei, PhD, Zhanglong Ji, PhD, Lucila Ohno-Machado, MD, PhD
Nonparametric Semantic Segmentation
Bidirectional CRF for NER
Fast Preprocessing for Robust Face Sketch Synthesis
Saliency detection Donghun Yeo CV Lab..
Machine Learning Basics
Adversarially Tuned Scene Generation
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Learning to Combine Bottom-Up and Top-Down Segmentation
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
8-3 RRAM Based Convolutional Neural Networks for High Accuracy Pattern Recognition and Online Learning Tasks Z. Dong, Z. Zhou, Z.F. Li, C. Liu, Y.N. Jiang,
Learning Object Context for Dense Captioning
Hierarchical Relational Models for Document Networks
“Traditional” image segmentation
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Presentation transcript:

A New Method for Automatic Clothing Tagging Utilizing Image-Click-Ads Introduction Conclusion Can We Do Better to Reduce Workload?

A Tradeoff between Workload and Accuracy Low workload Low accuracy High workload High accuracy Low Workload, High Accuracy? Utilizing Image-Click-Ads Introduction Conclusion Manual Tagging Tool Assisted Tagging Tool

The Objective Automatically segment object regions in an image With intuitive tagging interface For the purpose of achieving low workload, high accuracy for the tagging task Utilizing Image-Click-Ads Introduction Conclusion

The Research Problem To design a new automatic segmentation method for: Reducing workload Producing high accuracy tagged images In the domain of clothing images Utilizing Image-Click-Ads Introduction Conclusion

Background: Image Parsing Models Utilizing Image-Click-Ads Introduction Conclusion Image Parsing (Semantic Segmentation) Unary TermPairwise Term MRF Probability of each node for each class Local neighborhood knowledge CRF Probability of each node for each class More complex prior knowledge CRF is better CRF example

Color-category labels [Liu et al 2014]: blue shirt Data driven method [Yamaguchi et al 2013]: retrieve similar images from dataset CRF based method [Yamaguchi et al 2012] Related Work: Clothing Parsing Utilizing Image-Click-Ads Introduction Conclusion fashion images == common images? Unary term Prior distribution: pairwise co-occurrence of clothing labels Prior distribution: Probability of neighboring pairs having the same label

Drawbacks (1/5) Utilizing Image-Click-Ads Introduction Conclusion Problem 1: Fails to Preserve Local Contrast Cause: Large variations in configuration and appearance, resulting in unreliable prior distribution knowledge, a common problem of CRF.

Drawbacks (2/5) Utilizing Image-Click-Ads Introduction Conclusion Problem 2: Background Spill Cause: feature similarity

Drawbacks (3/5) Utilizing Image-Click-Ads Introduction Conclusion Problem 3: Incomplete Region Prediction Cause: occlusion

Drawbacks (4/5) Utilizing Image-Click-Ads Introduction Conclusion Problem 4: Over Smoothing of Infrequent Label Mentioned in [Yamaguchi et al 2012] Cause: unbalanced labeled data (low frequency, small region) Unbalanced Labeled Data For example Probability = 0.4

Drawbacks (5/5) Utilizing Image-Click-Ads Introduction Conclusion Problem 5: High Computational Cost Cause: hierarchical segmentation method

A deviation of MRF (Re-Weighted MRF) by introducing Background Prior : addressing background spill problem Occlusion Prior: addressing occlusion problem A Re-weighted Pairwise Term: addressing over smoothing of infrequent labels problem Integrated with SLIC superpixel segmentation method Addressing high computational cost Obtains better performance of clothing parsing in Parsing accuracy Processing efficiency The Proposed Method Utilizing Image-Click-Ads Introduction Conclusion

The Proposed Method: Workflow pose estimation superpixel background prior occlusion prior Re-Weighted MRF Utilizing Image-Click-Ads Introduction Conclusion

Two New Priors Utilizing Image-Click-Ads Introduction Conclusion Background Prior Occlusion Prior NON: neighbors of neighborhood For VFor V*

Re-weighted Pairwise Term Utilizing Image-Click-Ads Introduction Conclusion Probability prediction Original Pairwise Term

Online Image-Click-Ads Tagging System: EyeDentifyIt 3.0

Evaluation: Training and Inference Training data set  Fashionista [Yamaguchi et al. 2012] in EyeDentifyIt Feature vector  RGB color [m X n X 3]  CIELAB color [m X n X 3]  Gabor feature [m X n X 4]  Absolute 2D coordinates [m X n X 2]  Relative 2D coordinates [m X n X 28] Training and inference  Logistic regress (liblinear library)  Max-flow min-cut (gco-v3.0) Utilizing Image-Click-Ads Introduction Conclusion 37

Quantitative Evaluation (1/2) Utilizing Image-Click-Ads Introduction Conclusion Compare Pixel ACC and MAGR among MRF Reweighted Pairwise Term (RW) Reweighted Pairwise Term + Background Prior (RW+BP) Reweighted Pairwise Term + Background Prior + occlusion prior (RW+BP+OP) Result 1: steady improvements with more priors Result 2: RW reaches the best MAGR 38

method Pixel AccMAGR Training Time Processing Time Re-Weighted MRF 90.5%63.0%631.8 sec5.2 sec [Yamaguchi et al 2012] 85.1%57.2% sec81.5 sec Baseline 77.6%12.8%N/A Quantitative Evaluation (2/2) Utilizing Image-Click-Ads Introduction Conclusion Compare between Re-Weighted MRF CRF [Yamaguchi et al 2012] Baseline: naively predict to be all background Result 3: 5.4% gain on pixel ACC, 5.8% gain on MAGR, 86.1% gain on training time, 93.6% on processing time 39

Qualitative Evaluation (1/3) CRF [Yamaguchi et al 2012] Re-Weighted MRFMRF Resolved P1: Fails to preserve local contrast P4: Over smoothing of infrequent label 40

Qualitative Evaluation (2/3) CRF [Yamaguchi et al 2012] RW+BPRW+BP+OP Resolved P2: Background spill P3: Incomplete region prediction 41

Contributions Developed a new automated clothing parsing method Proposed background prior, occlusion prior to resolve background spill and occlusion problem in clothing parsing Proposed re-weighted pairwise term for MRF model to justify infrequent small label prediction Demonstrated MRF performs better than CRF in conditions that the local knowledge is more trust worthy than the statistical model learned from training dataset Integrated in EyeDentifyIt 3.0, driven by Image-Click- Ads framework Utilizing Image-Click-Ads Introduction Conclusion 42

Thank You ! 谢 谢! 46