Ensemble Clustering.

Ensemble Clustering
[Diagram: unlabeled data is fed to clustering algorithms 1, …, N, producing partitions 1, …, N, which are combined into a final partition.]
Combine multiple partitions of the given data into a single partition of better quality.

Why Ensemble Clustering?
- Different clustering algorithms may produce different partitions because they impose different structure on the data.
- No single clustering algorithm is optimal.
- Different realizations of the same algorithm may generate different partitions.

Why Ensemble Clustering?
- Goal: exploit the complementary nature of different partitions.
- Each partition can be viewed as taking a different “look” or “cut” through the data (Topchy, Jain, and Punch, PAMI, 2005).

Challenge I: How to generate clustering ensembles?
Produce a clustering ensemble by any of the following:
- Using different clustering algorithms, e.g., K-means, hierarchical clustering, fuzzy C-means, spectral clustering, Gaussian mixture models.
- Running the same algorithm many times with different parameters or initializations, e.g.:
  - run K-means N times with randomly initialized cluster centers
  - use different dissimilarity measures
  - use different numbers of clusters
- Using different samples of the data, e.g., many different bootstrap samples of the given data.
- Random projections (feature extraction), e.g., project the data onto a random subspace.
- Feature selection, e.g., use different subsets of features.
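As a rough illustration of the second strategy only (same algorithm, different initializations and numbers of clusters), the sketch below builds an ensemble from repeated runs of a plain Lloyd-style K-means. The names `kmeans`, `generate_ensemble`, and `k_choices` are hypothetical and not taken from the slides; the other strategies (bootstrap samples, random projections, feature selection) are not shown.

```python
import random

def kmeans(points, k, seed, iters=20):
    """Plain Lloyd-style K-means on a list of equal-length tuples.

    Returns one integer label in range(k) per point.
    """
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # random initialization from the data
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster went empty
                centers[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels

def generate_ensemble(points, n_partitions, k_choices=(2, 3), seed=0):
    """Run K-means n_partitions times, varying both k and the random seed."""
    rng = random.Random(seed)
    ensemble = []
    for _ in range(n_partitions):
        k = rng.choice(k_choices)
        ensemble.append(kmeans(points, k, seed=rng.randrange(10**6)))
    return ensemble

points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (10.0, 0.0), (10.1, 0.0)]
ensemble = generate_ensemble(points, n_partitions=5)
```

Each element of `ensemble` is one partition, stored as a list of labels in the same order as `points`. Label values are arbitrary across runs, which is why the combination step cannot simply compare labels position by position.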

Challenge II: How to combine multiple partitions?
According to Vega-Pons & Ruiz-Shulcloper (2011), ensemble clustering algorithms can be divided into:
- Median partition based approaches
- Object co-occurrence based approaches
  - Relabeling/voting based methods
  - Co-association matrix based methods
  - Graph based methods

Median partition based approaches
- Basic idea: find a partition P that maximizes the similarity between P and all N partitions in the ensemble, P1, P2, …, PN.
- This requires a definition of the similarity between two partitions, e.g.:
  - Normalized mutual information (Strehl & Ghosh, 2002)
  - Utility function (Topchy, Jain, and Punch, 2005)
  - Fowlkes-Mallows index (Fowlkes & Mallows, 1983)
  - Purity and inverse purity (Zhao & Karypis, 2005)
[Diagram: the consensus partition P at the center, linked to each ensemble partition P1, …, PN by its similarity S1, …, SN.]
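A minimal sketch of the median-partition idea, assuming normalized mutual information (with the geometric-mean normalization) as the similarity and a plain sum over the ensemble as the objective. Because exhaustive search over all possible partitions is infeasible for all but tiny data sets, this sketch only picks the best candidate among the ensemble members themselves; that restriction is a simplification for illustration, not the method of any cited paper. The names `nmi` and `best_member` are hypothetical.

```python
from collections import Counter
from math import log, sqrt

def nmi(a, b):
    """Normalized mutual information between two label lists of equal length."""
    n = len(a)
    ca, cb, cab = Counter(a), Counter(b), Counter(zip(a, b))
    h_a = -sum(c / n * log(c / n) for c in ca.values())  # entropy of a
    h_b = -sum(c / n * log(c / n) for c in cb.values())  # entropy of b
    if h_a == 0 or h_b == 0:
        # Degenerate case: at least one partition has a single cluster.
        return 1.0 if h_a == h_b else 0.0
    # Mutual information from the joint and marginal cluster counts.
    mi = sum(c / n * log(c * n / (ca[x] * cb[y])) for (x, y), c in cab.items())
    return mi / sqrt(h_a * h_b)

def best_member(ensemble):
    """Return the ensemble member with the largest summed NMI to all members."""
    return max(ensemble, key=lambda p: sum(nmi(p, q) for q in ensemble))

ensemble = [[0, 0, 1, 1], [1, 1, 0, 0], [0, 1, 0, 1]]
consensus = best_member(ensemble)
```

Note that `nmi` is invariant to label names: the first two partitions above are the same grouping under different labels and therefore have similarity 1.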

Relabeling/voting based methods
- Basic idea: first find the corresponding cluster labels among the multiple partitions, then obtain the consensus partition through a voting process (Ayad & Kamel, 2007; Dimitriadou et al., 2002; Dudoit & Fridlyand, 2003; Fischer & Buhmann, 2003; Tumer & Agogino, 2008; etc.).
[Diagram: a table of the labels assigned to objects v1 to v6 by partitions P1, P2, P3 is re-labeled with the Hungarian algorithm so that corresponding clusters share a label; voting over the aligned labels then yields the consensus partition P* with clusters 1, 2, 3.]
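The two steps, re-labeling and voting, can be sketched as follows. This is an illustration under stated assumptions: every partition uses exactly the labels 0, …, k-1, the first partition serves as the reference, and the label correspondence is found by brute force over all k! permutations rather than by the Hungarian algorithm named on the slide (which solves the same assignment problem efficiently, e.g. via `scipy.optimize.linear_sum_assignment`). The names `relabel` and `vote` are hypothetical.

```python
from collections import Counter
from itertools import permutations

def relabel(reference, partition, k):
    """Rename the labels of `partition` to maximize agreement with `reference`.

    Brute force over all k! label permutations; only practical for small k.
    """
    best = max(
        permutations(range(k)),
        key=lambda perm: sum(perm[lab] == ref
                             for lab, ref in zip(partition, reference)),
    )
    return [best[lab] for lab in partition]

def vote(ensemble, k):
    """Align every partition to the first one, then take a per-object majority vote."""
    reference = ensemble[0]
    aligned = [reference] + [relabel(reference, p, k) for p in ensemble[1:]]
    # zip(*aligned) iterates over objects; each column holds that object's votes.
    return [Counter(column).most_common(1)[0][0] for column in zip(*aligned)]

ensemble = [
    [0, 0, 1, 1, 2, 2],
    [1, 1, 2, 2, 0, 0],  # same grouping as the first, different label names
    [2, 2, 0, 0, 1, 0],  # disagrees on the last object
]
consensus = vote(ensemble, k=3)
```

In this example the third partition is outvoted on the last object, so the consensus equals the first partition.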

Co-association matrix based methods
- Basic idea: first compute a co-association matrix from the multiple data partitions, then apply a similarity-based clustering algorithm (e.g., single link or normalized cut) to the co-association matrix to obtain the final partition of the data (Fred & Jain, 2005; Iam-On et al., 2008; Vega-Pons & Ruiz-Shulcloper, 2009; Wang et al., 2009; Li et al., 2007; etc.).
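A small sketch of this pipeline, assuming the common definition in which entry (i, j) of the co-association matrix is the fraction of partitions that place objects i and j in the same cluster. For the second step it cuts a single-link clustering at a fixed similarity threshold, which amounts to taking connected components of the graph whose edges are pairs with co-association above the threshold. The threshold of 0.5 and the names `co_association` and `single_link_cut` are assumptions made for illustration.

```python
def co_association(ensemble):
    """n x n matrix: fraction of partitions in which objects i and j share a cluster."""
    n = len(ensemble[0])
    m = len(ensemble)
    return [
        [sum(p[i] == p[j] for p in ensemble) / m for j in range(n)]
        for i in range(n)
    ]

def single_link_cut(co, threshold=0.5):
    """Single-link clustering cut at `threshold`.

    Objects end up together if a chain of pairs with co-association
    strictly above the threshold connects them.
    """
    n = len(co)
    labels = [-1] * n  # -1 marks objects not yet assigned
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        # Depth-first search collects one connected component.
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and co[i][j] > threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

ensemble = [
    [0, 0, 1, 1, 2, 2],
    [1, 1, 2, 2, 0, 0],
    [2, 2, 0, 0, 1, 0],
]
co = co_association(ensemble)
consensus = single_link_cut(co)
```

Unlike relabeling/voting, this approach needs no label alignment, because the matrix depends only on whether two objects share a cluster, not on what the cluster is called.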

Graph based methods
- Basic idea: construct a weighted graph to represent the multiple clustering results from the ensemble, then find the optimal partition of the data by minimizing the graph cut (Fern & Brodley, 2004; Strehl & Ghosh, 2002; etc.).
[Diagram: the labels assigned to objects v1 to v6 by partitions P1, P2, P3 are encoded as a graph; graph clustering then yields the consensus partition P* with clusters 1, 2, 3.]

ENSEMBLE CLUSTERING IN IMAGE SEGMENTATION
- Ensemble Clustering using Semidefinite Programming (Singh et al., NIPS 2007)

Other research problems
- Ensemble clustering theory
  - Ensemble clustering converges to the true clustering as the number of partitions in the ensemble increases (Topchy, Law, Jain, and Fred, ICDM, 2004)
  - Bound the error incurred by approximation (Gionis, Mannila, and Tsaparas, TKDD, 2007)
  - Bound the error when some partitions in the ensemble are extremely bad (Yi, Yang, Jin, and Jain, ICDM, 2012)
- Partition selection
  - Adaptive selection (Azimi & Fern, IJCAI, 2009)
  - Diversity analysis (Kuncheva & Whitaker, Machine Learning, 2003)