1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 29 Nov 11, 2005 Nanjing University of Science & Technology.

Slides:



Advertisements
Similar presentations
Clustering Clustering of data is a method by which large sets of data is grouped into clusters of smaller sets of similar data. The example below demonstrates.
Advertisements

K-means Clustering Given a data point v and a set of points X,
Numerical Solution of Linear Equations
Clustering.
Transportation Problem (TP) and Assignment Problem (AP)
1 Transportation problem The transportation problem seeks the determination of a minimum cost transportation plan for a single commodity from a number.
CSCI 5417 Information Retrieval Systems Jim Martin Lecture 16 10/18/2011.
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Introduction to Bioinformatics
Андрей Андреевич Марков. Markov Chains Graduate Seminar in Applied Statistics Presented by Matthias Theubert Never look behind you…
The Simplex Method: Standard Maximization Problems
Clustering… in General In vector space, clusters are vectors found within  of a cluster vector, with different techniques for determining the cluster.
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
MAE 552 – Heuristic Optimization Lecture 26 April 1, 2002 Topic:Branch and Bound.
Theory and Applications
August 12, 2003 IV. FUZZY SET METHODS - CLUSTER ANALYSIS: Math Clinic Fall IV. FUZZY SET METHODS for CLUSTER ANALYSIS and (super brief) NEURAL NETWORKS.
K-means Clustering. What is clustering? Why would we want to cluster? How would you determine clusters? How can you do this efficiently?
Clustering Ram Akella Lecture 6 February 23, & 280I University of California Berkeley Silicon Valley Center/SC.
Ulf Schmitz, Pattern recognition - Clustering1 Bioinformatics Pattern recognition - Clustering Ulf Schmitz
Chapter 10 Real Inner Products and Least-Square (cont.)
Advanced Statistical Methods for Research Math 736/836
CSE554AlignmentSlide 1 CSE 554 Lecture 8: Alignment Fall 2014.
Chapter 3: The Fundamentals: Algorithms, the Integers, and Matrices
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 13 Oct 14, 2005 Nanjing University of Science & Technology.
ENCI 303 Lecture PS-19 Optimization 2
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 23 Nov 2, 2005 Nanjing University of Science & Technology.
Topic III The Simplex Method Setting up the Method Tabular Form Chapter(s): 4.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 21 Oct 28, 2005 Nanjing University of Science & Technology.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 14 Oct 14, 2005 Nanjing University of Science & Technology.
Matrices Matrices For grade 1, undergraduate students For grade 1, undergraduate students Made by Department of Math.,Anqing Teachers college.
So Far……  Clustering basics, necessity for clustering, Usage in various fields : engineering and industrial fields  Properties : hierarchical, flat,
CLUSTERING. Overview Definition of Clustering Existing clustering methods Clustering examples.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 24 Nov 2, 2005 Nanjing University of Science & Technology.
CSE554AlignmentSlide 1 CSE 554 Lecture 8: Alignment Fall 2013.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 25 Nov 4, 2005 Nanjing University of Science & Technology.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 12 Sept 30, 2005 Nanjing University of Science & Technology.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 8 Sept 23, 2005 Nanjing University of Science & Technology.
1 Network Models Transportation Problem (TP) Distributing any commodity from any group of supply centers, called sources, to any group of receiving.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 30 Nov 11, 2005 Nanjing University of Science & Technology.
CS 8751 ML & KDDData Clustering1 Clustering Unsupervised learning Generating “classes” Distance/similarity measures Agglomerative methods Divisive methods.
Copyright © 2016, 2012 Pearson Education, Inc
Prototype Classification Methods Fu Chang Institute of Information Science Academia Sinica ext. 1819
Fuzzy C-Means Clustering
Compiled By: Raj Gaurang Tiwari Assistant Professor SRMGPC, Lucknow Unsupervised Learning.
PART 10 Pattern Recognition 1. Fuzzy clustering 2. Fuzzy pattern recognition 3. Fuzzy image processing FUZZY SETS AND FUZZY LOGIC Theory and Applications.
Vector Quantization Vector quantization is used in many applications such as image and voice compression, voice recognition (in general statistical pattern.
Information-Theoretic Co- Clustering Inderjit S. Dhillon et al. University of Texas, Austin presented by Xuanhui Wang.
1 What happens to the location estimator if we minimize with a power other that 2? Robert J. Blodgett Statistic Seminar - March 13, 2008.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 28 Nov 9, 2005 Nanjing University of Science & Technology.
Application of the GA-PSO with the Fuzzy controller to the robot soccer Department of Electrical Engineering, Southern Taiwan University, Tainan, R.O.C.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Fuzzy C-means Clustering Dr. Bernard Chen University of Central Arkansas.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 2 Nanjing University of Science & Technology.
Computational Intelligence Winter Term 2015/16 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
The Matrix Equation A x = b (9/16/05) Definition. If A is an m  n matrix with columns a 1, a 2,…, a n and x is a vector in R n, then the product of A.
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
K-MEANS CLUSTERING. INTRODUCTION- What is clustering? Clustering is the classification of objects into different groups, or more precisely, the partitioning.
Digital logic COMP214  Lecture 2 Dr. Sarah M.Eljack Chapter 1 1.
Machine Learning Lecture 4: Unsupervised Learning (clustering) 1.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Clustering (2) Center-based algorithms Fuzzy k-means Density-based algorithms ( DBSCAN as an example ) Evaluation of clustering results Figures and equations.
EE611 Deterministic Systems Controllability and Observability of Continuous-time Systems Kevin D. Donohue Electrical and Computer Engineering University.
CSE 554 Lecture 8: Alignment
Linear Algebra Lecture 4.
Clustering (3) Center-based algorithms Fuzzy k-means
Pattern Recognition: Statistical and Neural
SOLUTION OF NONLINEAR EQUATIONS
Simple Kmeans Examples
Linear Algebra Lecture 16.
Pattern Recognition: Statistical and Neural
Presentation transcript:

1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 29 Nov 11, 2005 Nanjing University of Science & Technology

2 Lecture 29 Topics 1. Review Fuzzy Sets and Fuzzy Partitions 2. Fuzzy C- Means Clustering Algorithm Preliminaries 3. Fuzzy C-Means Clustering Algorithm Details 4. Fuzzy C-Means Clustering Algorithm Example 5. Comments about Fuzzy Clustering

3 Example of a Fuzzy Set: Define a Fuzzy set A by the following membership function Or equivalently Example Review

4 S Function defined on S Review

5 Membership Functions for Fuzzy Clusters Domain Pattern Vectors Membership Function for F 1 Membership Function for F 2 EXAMPLE Review

6 A Fuzzy Partition F, of a set S, is defined by its membership functions for the fuzzy sets F k : k =1, 2,..., K ) ] Fuzzy Partition Review

7 where Each value bounded by 0 and 1 Sum of each columns values =1 Sum of each row is less than n Review

8 Conversion of Fuzzy Clustering into Crisp Clustering ) ] C C C Fuzzy Clusters Crisp Clustering All membership values are equal to 1 or 0 Only one 1 in each column where membership value for a Fuzzy cluster is a maximum

9 Fuzzy Clusters Crisp Clusters from Fuzzy Clusters Cl 1 : [ ] Cl 2 : [ ] Cl 3 : [ ] Cl 1 : [ ] Cl 2 : [ ] Cl 3 : [ ] Example: Given Fuzzy clusters below convert to Crisp Clusters

10 Cl 1 : [ ] Cl 2 : [ ] Cl 3 : [ ] Cl 1 : { x 1, x 2 } Cl 2 : { x 4, x 6 } Cl 3 : { x 3, x 5, x 7 } Cl 1 : { x 1 x 2 } Cl 2 : { x 4 x 6 } Cl 3 : { x 3 x 5 x 7 } Answer: Crisp Clusters

11 Fuzzy C-Means Clustering Preliminary Given a set S composed of pattern vectors which we wish to cluster ) ] C C C S = { x 1, x 2,..., x N } Define C Cluster Membership Functions...

12 Define C Cluster Centroids as follows Let V i be the Cluster Centroid for Fuzzy Cluster Cl i, i = 1, 2, …, C Define a Performance Objective J m as where

13 The Fuzzy C-Means Algorithm minimizes J m by selecting V i and i, i =1, 2, …, C by an alternating iterative procedure as described in the algorithm’s details m = Fuzziness Index (m >1 ) Higher numbers being more fuzzy A is a symmetric positive definite matrix N s is total number of pattern vectors Definitions

14 Fuzzy C-Means Clustering Algorithm (a) Flow Diagram No Yes

15 Fuzzy C-Means Clustering Algorithm (b) Details of Steps in the Algorithm

16 Step 1: Initialization Select initial membership functions such that This is equivalent to specifying fuzzy clusters F 1, F 2, …, F C

17 One Method to accomplish this selection is to chose r ki randomly from the open interval (0, 1) and then Normalize

18 Step 2: Computation of Fuzzy Centroids Compute the Fuzzy Centroids as

19 Step 3: Compute New Fuzzy Membership Functions Using the V i, i = 1, 2, …, C from step 2 compute i (k)

20 Alternative method for computing the Membership Functions where

21 If membership functions do not change convergence has occurred If algorithm converges then the i represent the fuzzy clusters and we Stop If Convergence has not occurred and the number of iterations is less than some preassigned maximum value (MAXIT) then return to step 2 If otherwise then Stop with no solution. Step 4: Check for Convergence

22 Usually convergences Answer not unique as depends upon initial conditions. Can converge to a local minimum Can be used to produce hard clusters Properties of Fuzzy C-Means Algorithm

23 Example – Application of Fuzzy Clustering Algorithm Given the following set of data vectors (a) Perform a Fuzzy Clustering of the data using Fuzzy C-Means Algorithm to obtain two fuzzy clusters. Try several initial conditions. Is the result unique? (use MAXIT=1000) (b) Using the results of (a) give a crisp clustering of the data. (c) Repeat (a) and (b) for three Fuzzy Clusters.

24 Plot of Data for Fuzzy clustering example

25 (a) Solution for two clusters Fuzzy cluster membership functions randomly selected Calculation of Fuzzy Centroids

26 Calculation of New Membership Functions Calculation of Performance Not the same as preceding iteration Membership function (No convergence) Number of iterations not greater than 1000 therefore the iterations continue.

27 Results Converge at Iteration 17 Cluster membership Functions Performance Measure Cluster Centroids

28 (a) Final Cluster membership Functions Cl 2 : Cl 1 :

29 (b) Solution Crisp Clustering Cl 1 = { x 1, x 2, x 3 } Cl 2 = { x 4, x 5 } Crisp Membership functions Set Assignment Cl 2 : Cl 1 : Fuzzy Membership functions

30 J m = (c) Solution for three fuzzy clusters Applying the Fuzzy Clustering Algorithm convergence was obtained in ?? iterations as Final Cluster membership functions Cl 1 : Cl 2 : Cl 3 :

31 (c) Solution Crisp Clustering Cl 1 = { x 1, x 2 } Cl 2 = { x 4, x 5 } Cl 3 = { x 3 } Membership functions Set Assignment F 1 : [ 1, 1, 0, 0, 0 ] F 2 : [ 0, 0, 0, 1, 1 ] F 3 : [ 0, 0, 1, 0, 0 ] “Crisp Clusters”

32 Comments: Fuzzy Clustering Can be used to Produce Hard Clustering The larger the value of m the fuzzier the clusters The Fuzzy algorithm is relatively stable and usually converges in a reasonable number of iterations The Fuzzy algorithm is relatively insensitve to initial conditions

33 Of the two different fuzzy clusterings given below, which clustering is the Fuzzier ??? Cl 1 : [ ] Cl 2 : [ ] Cl 1 : [ ] Cl 2 : [ ] # 1 # 2 or

34 Cl 1 : [ ] Cl 2 : [ ] Cl 1 : [ ] Cl 2 : [ ] # 1 # 2 ANSWER: #1 is the fuzzier of the two different clusterings

35 Why is #1 the Fuzzier of the two ???

36 Why is #1 the Fuzzier of the two ??? ANSWER: Because the cluster membership functions contain many entries close to 0.5 ( for the two class case) as opposed to values close to 0 and 1. For the M class case values close to 1/M would indicate most fuzziness.. * *

37 Lecture 29 Topics 1. Reviewed Fuzzy Sets and Fuzzy Partitions 2. Presented Fuzzy C- Means Clustering Algorithm Preliminaries 3. Gave Fuzzy C-Means Clustering Algorithm Details 4. Showed an Example of the Fuzzy C- Means Clustering Algorithm 5. Made a few comments about Fuzzy Clustering in General.

38 End of Lecture 29