BrainStorming 樊艳波 2015-7-20. Outline Several papers on icml15 & cvpr15 PALM Information Theory Learning.

Slides:



Advertisements
Similar presentations
Nonnegative Matrix Factorization with Sparseness Constraints S. Race MA591R.
Advertisements

Accelerated, Parallel and PROXimal coordinate descent IPAM February 2014 APPROX Peter Richtárik (Joint work with Olivier Fercoq - arXiv: )
Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.
Unsupervised Learning Clustering K-Means. Recall: Key Components of Intelligent Agents Representation Language: Graph, Bayes Nets, Linear functions Inference.
CS Statistical Machine learning Lecture 13 Yuan (Alan) Qi Purdue CS Oct
Image Congealing (batch/multiple) image (alignment/registration) Advanced Topics in Computer Vision (048921) Boris Kimelman.
Multi-Task Compressive Sensing with Dirichlet Process Priors Yuting Qi 1, Dehong Liu 1, David Dunson 2, and Lawrence Carin 1 1 Department of Electrical.
Patch to the Future: Unsupervised Visual Prediction
Graph Laplacian Regularization for Large-Scale Semidefinite Programming Kilian Weinberger et al. NIPS 2006 presented by Aggeliki Tsoli.
Ilias Theodorakopoulos PhD Candidate
Nonlinear Unsupervised Feature Learning How Local Similarities Lead to Global Coding Amirreza Shaban.
Discriminative Segment Annotation in Weakly Labeled Video Kevin Tang, Rahul Sukthankar Appeared in CVPR 2013 (Oral)
1 Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University joint work with Tzu-Kuo Huang, Le.
Jun Zhu Dept. of Comp. Sci. & Tech., Tsinghua University This work was done when I was a visiting researcher at CMU. Joint.
Unsupervised Feature Selection for Multi-Cluster Data Deng Cai et al, KDD 2010 Presenter: Yunchao Gong Dept. Computer Science, UNC Chapel Hill.
Graph Based Semi- Supervised Learning Fei Wang Department of Statistical Science Cornell University.
Supervised and Unsupervised learning and application to Neuroscience Cours CA6b-4.
1 Transfer Learning Algorithms for Image Classification Ariadna Quattoni MIT, CSAIL Advisors: Michael Collins Trevor Darrell.
Dimensional reduction, PCA
2010/5/171 Overview of graph cuts. 2010/5/172 Outline Introduction S-t Graph cuts Extension to multi-label problems Compare simulated annealing and alpha-
Independent Component Analysis (ICA) and Factor Analysis (FA)
Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.
Clustering with Bregman Divergences Arindam Banerjee, Srujana Merugu, Inderjit S. Dhillon, Joydeep Ghosh Presented by Rohit Gupta CSci 8980: Machine Learning.
Peter Richtárik (joint work with Martin Takáč) Distributed Coordinate Descent Method AmpLab All Hands Meeting - Berkeley - October 29, 2013.
Radial Basis Function Networks
Mathematical Programming in Support Vector Machines
Incomplete Graphical Models Nan Hu. Outline Motivation K-means clustering Coordinate Descending algorithm Density estimation EM on unconditional mixture.
Overview of Kernel Methods Prof. Bennett Math Model of Learning and Discovery 2/27/05 Based on Chapter 2 of Shawe-Taylor and Cristianini.
Cao et al. ICML 2010 Presented by Danushka Bollegala.
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
Non Negative Matrix Factorization
Semisupervised Learning A brief introduction. Semisupervised Learning Introduction Types of semisupervised learning Paper for review References.
Machine Learning Seminar: Support Vector Regression Presented by: Heng Ji 10/08/03.
Fast and incoherent dictionary learning algorithms with application to fMRI Authors: Vahid Abolghasemi Saideh Ferdowsi Saeid Sanei. Journal of Signal Processing.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
CS Statistical Machine learning Lecture 10 Yuan (Alan) Qi Purdue CS Sept
Virtual Vector Machine for Bayesian Online Classification Yuan (Alan) Qi CS & Statistics Purdue June, 2009 Joint work with T.P. Minka and R. Xiang.
Direct Robust Matrix Factorization Liang Xiong, Xi Chen, Jeff Schneider Presented by xxx School of Computer Science Carnegie Mellon University.
-Arnaud Doucet, Nando de Freitas et al, UAI
A Clustering Method Based on Nonnegative Matrix Factorization for Text Mining Farial Shahnaz.
EE4-62 MLCV Lecture Face Recognition – Subspace/Manifold Learning Tae-Kyun Kim 1 EE4-62 MLCV.
Biointelligence Laboratory, Seoul National University
Linear Models for Classification
High-dimensional Error Analysis of Regularized M-Estimators Ehsan AbbasiChristos ThrampoulidisBabak Hassibi Allerton Conference Wednesday September 30,
Network Lasso: Clustering and Optimization in Large Graphs
Lecture 2: Statistical learning primer for biologists
Matrix Factorization and its applications By Zachary 16 th Nov, 2010.
CS Statistical Machine learning Lecture 12 Yuan (Alan) Qi Purdue CS Oct
Regression Analysis1. 2 INTRODUCTION TO EMPIRICAL MODELS LEAST SQUARES ESTIMATION OF THE PARAMETERS PROPERTIES OF THE LEAST SQUARES ESTIMATORS AND ESTIMATION.
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Optimal Reverse Prediction: Linli Xu, Martha White and Dale Schuurmans ICML 2009, Best Overall Paper Honorable Mention A Unified Perspective on Supervised,
Estimating variable structure and dependence in multi-task learning via gradients By: Justin Guinney, Qiang Wu and Sayan Mukherjee Presented by: John Paisley.
PAC-Bayesian Analysis of Unsupervised Learning Yevgeny Seldin Joint work with Naftali Tishby.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Progress Report #2 Alvaro Velasquez. Project Selection I chose to work with Nasim Souly on the project titled “Subspace Clustering via Graph Regularized.
Jinbo Bi Joint work with Jiangwen Sun, Jin Lu, and Tingyang Xu
Zhu Han University of Houston Thanks for Dr. Mingyi Hong’s slides
Learning Mid-Level Features For Recognition
Multimodal Learning with Deep Boltzmann Machines
Machine Learning Basics
Nonnegative polynomials and applications to learning
CSCI 5922 Neural Networks and Deep Learning: NIPS Highlights
Basic Algorithms Christina Gallner
Probabilistic Models for Linear Regression
Globally Maximizing Locally Minimizing unsupervised discriminant projection with applications to face and palm biometrics PAMI 2007 Bo Yang 2/25/2019.
Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks
Cases. Simple Regression Linear Multiple Regression.
Non-Negative Matrix Factorization
NON-NEGATIVE COMPONENT PARTS OF SOUND FOR CLASSIFICATION Yong-Choon Cho, Seungjin Choi, Sung-Yang Bang Wen-Yi Chu Department of Computer Science &
Presentation transcript:

BrainStorming 樊艳波

Outline Several papers on icml15 & cvpr15 PALM Information Theory Learning.

Multi-view Sparse Co-clustering via Proximal Alternating Linearized Minimization(CVPR15) Single view rank-one approximation.

Multi-view co-clustering Objective Function. Optimization: PALM.

Proximal Alternating Linearized Minimization for Nonconvex and Nonsmooth Problems(Bolte, Mathematical Programming14) General nonconvex nonsmooth function Example:

Coordinate descent methods Gauss-Seidel iteration scheme: –Drawback: strict convexity assumption. Proximal regularization of the Gauss-Seidel scheme: –Existing problems: Hard to optimize each step exactly.

Proposed PALM method Idea: linear expansion

Proposed PALM method Algorithm.

An example: Sparse NMF Generate

Entropic Graph-based Posterior Regularization. MaxwellW Libbrecht(ICML15) Objective Function. R: regularization term over, Existing ways : L0, L1, L2...

Posterior Regularization Posterior regularization: –Nearby variables have simple posterior distributions. –More nature. Formulation.

Optimation & Simulations EM-like algorithm. has closed form.

Unsupervised Simultaneous Orthogonal Basis Clustering Feature Selection.(CVPR15) Regularized regression model: Proposed Model. B: latent cluster center. E: encoding matrix. each row of E has only one non-zero term.

Just for Insights

Optimal Graph Learning with Partial Tags and Multiple Features for Image and Video Annotation.(CVPR15) Extensions: out of sample annotation; noise label information.

Information-Theoretic Dictionary Learning for Image Classification(TPAMI14) Traditional Dictionary learning. ITL based Dictionary learning. –D^{0}: initialization by k-svd or others. –Dictionary compactness: –Dictionary discrimination: –Dictionary representation: Optimization: gauss process, kernel density estimates...