Fuzzy rule-based system derived from similarity to prototypes

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

Graphical Technique of Inference
Support Vector Machines
Neural Approach for Personalized Emotional Model in Human-Robot Interaction Mária Virčíková, Martin Pala, Peter Smolar, Peter Sincak Technical University.
Heterogeneous Forests of Decision Trees Krzysztof Grąbczewski & Włodzisław Duch Department of Informatics, Nicholas Copernicus University, Torun, Poland.
PROBABILISTIC DISTANCE MEASURES FOR PROTOTYPE-BASED RULES Włodzisław Duch Department of Informatics, Nicolaus Copernicus University, Poland, School of.
Fuzzy Logic E. Fuzzy Inference Engine. “antecedent” “consequent”
Data Clustering Methods
Fuzzy rule-based system derived from similarity to prototypes Włodzisław Duch Department of Informatics, Nicolaus Copernicus University, Poland School.
Jierui Xie, Boleslaw Szymanski, Mohammed J. Zaki Department of Computer Science Rensselaer Polytechnic Institute Troy, NY 12180, USA {xiej2, szymansk,
Support Vector Neural Training Włodzisław Duch Department of Informatics Nicolaus Copernicus University, Toruń, Poland School of Computer Engineering,
PART 7 Constructing Fuzzy Sets 1. Direct/one-expert 2. Direct/multi-expert 3. Indirect/one-expert 4. Indirect/multi-expert 5. Construction from samples.
Fuzzy Logic E. Fuzzy Inference Engine. “antecedent” “consequent”
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Global Visualization of Neural Dynamics
Chapter 4 (part 2): Non-Parametric Classification
Competent Undemocratic Committees Włodzisław Duch, Łukasz Itert and Karol Grudziński Department of Informatics, Nicholas Copernicus University, Torun,
Support Feature Machine for DNA microarray data Tomasz Maszczyk and Włodzisław Duch Department of Informatics, Nicolaus Copernicus University, Toruń, Poland.
WELCOME TO THE WORLD OF FUZZY SYSTEMS. DEFINITION Fuzzy logic is a superset of conventional (Boolean) logic that has been extended to handle the concept.
CSCI 347 / CS 4206: Data Mining Module 04: Algorithms Topic 06: Regression.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
JM - 1 Introduction to Bioinformatics: Lecture VIII Classification and Supervised Learning Jarek Meller Jarek Meller Division.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
_____KOSYR 2001______ Rules for Melanoma Skin Cancer Diagnosis Włodzisław Duch, K. Grąbczewski, R. Adamczak, K. Grudziński, Department of Computer Methods,
Chapter 8 The k-Means Algorithm and Genetic Algorithm.
Data Mining Practical Machine Learning Tools and Techniques Chapter 4: Algorithms: The Basic Methods Section 4.6: Linear Models Rodney Nielsen Many of.
Fuzzy Inference (Expert) System
Computational Intelligence: Methods and Applications Lecture 12 Bayesian decisions: foundation of learning Włodzisław Duch Dept. of Informatics, UMK Google:
So Far……  Clustering basics, necessity for clustering, Usage in various fields : engineering and industrial fields  Properties : hierarchical, flat,
Computational Intelligence: Methods and Applications Lecture 20 SSV & other trees Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Fuzzy Systems Michael J. Watts
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Computational Intelligence: Methods and Applications Lecture 33 Decision Tables & Information Theory Włodzisław Duch Dept. of Informatics, UMK Google:
Computational Intelligence: Methods and Applications Lecture 29 Approximation theory, RBF and SFN networks Włodzisław Duch Dept. of Informatics, UMK Google:
Computational Intelligence: Methods and Applications Lecture 22 Linear discrimination - variants Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.
Computational Intelligence: Methods and Applications Lecture 26 Density estimation, Expectation Maximization. Włodzisław Duch Dept. of Informatics, UMK.
Introduction of Fuzzy Inference Systems By Kuentai Chen.
Computational Intelligence: Methods and Applications Lecture 14 Bias-variance tradeoff – model selection. Włodzisław Duch Dept. of Informatics, UMK Google:
Fuzzy Systems Simulation Session 5
Clustering (1) Clustering Similarity measure Hierarchical clustering
Fuzzy Logic in Pattern Recognition
Support Feature Machine for DNA microarray data
Semi-Supervised Clustering
Chapter 7. Classification and Prediction
Deep Feedforward Networks
Fuzzy Systems Michael J. Watts
Intro to Machine Learning
Instance Based Learning
Boosting and Additive Trees
Introduction to Fuzzy Logic
Department of Informatics, Nicolaus Copernicus University, Toruń
Fuzzy logic Introduction 3 Fuzzy Inference Aleksandar Rakić
K-means and Hierarchical Clustering
K Nearest Neighbor Classification
Tomasz Maszczyk and Włodzisław Duch Department of Informatics,
Roberto Battiti, Mauro Brunato
Projection of network outputs
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Instance Based Learning
Clustering Techniques
EE513 Audio Signals and Systems
Visualization of the hidden node activities or hidden secrets of neural networks. Włodzisław Duch Department of Informatics Nicolaus Copernicus University,
Nearest Neighbors CSC 576: Data Mining.
Visualization of the hidden node activities or hidden secrets of neural networks. Włodzisław Duch Department of Informatics Nicolaus Copernicus University,
Support Vector Neural Training
Heterogeneous adaptive systems
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Using Clustering to Make Prediction Intervals For Neural Networks
Data Mining CSCI 307, Spring 2019 Lecture 11
Presentation transcript:

Fuzzy rule-based system derived from similarity to prototypes Włodzisław Duch Department of Informatics, Nicolaus Copernicus University, Poland School of Computer Engineering, Nanyang Technological University, Singapore Marcin Blachnik Division of Computer Methods, Department of Elektrotehnology, The Silesian University of Technology,

Plan What is it all about? Fuzzy rule systems and prototype rule based systems. From prototype rules to fuzzy rules and vice versa, with examples. Results of applications on real datasets. Conclusions.

Motivation Understanding data, situations, recognizing objects or making diagnosis people frequently use similarity to known cases, and rarely use logical reasoning, but soft computing experts use logic instead of similarity ... Relations between similarity and logic are not clear. Q1: How to obtain the same decision borders in Fuzzy Logic systems and Prototype Rule Based systems? Q2: What type of similarity measure corresponds to a typical fuzzy functions and vice versa? Q3: How to transform one type of a system into another type preserving their decision borders? Q4: Are there any advantages of such transformations? Q5: Can we understand data better using prototypes instead of logical rules?

Fuzzy Rule Based System Learning process includes: for each feature, select shapes of membership functions and the number of these functions; optimize parameters of the membership functions (such as positions and spreads) using training data; aggregate input information and calculate final rule activations for each category; assign membership degrees to output classes; write the set of F-rules and interpret them.

Prototype Rule Based System Learning process involves: specify the number and positions of prototypes; select similarity or dissimilarity (distance) functions (we use distance functions); calculate distance (similarity) to each prototype; assign P-rule to the output class as a rule; choices are: If P=argminp’(D(X,P’)) Then Class(X)=Class(P) This is a nearest prototype rule, similar to the fuzzy logic rule: If R=maxk MembFk(X) Then Class(X)<=Class(R) Another form of P-rules is based on similarity threshold: If D(X,P)≤dp Then C Taking D(X,P) distance crisp logic rules are obtained

Advantages of prototype based rules Inspired by cognitive psychology: it may be easier to understand prototypes and similarity than fuzzy rules P-rules may be defined for nominal features using probabilistic distance measures (such as VDM), while F-rules require numerical inputs. Many algorithms for prototype selection and optimization exist but they have not been applied to understand data and their relation to fuzzy rules have not been explored; Applications of P-rules to real datasets give excellent results generating small number of prototypes.

Value Difference Matrix (VDM) VDM – probability difference measure for 1 attribute for many attributes VDM measure can be also applied for continuous features, in the simplest way using discretization and interpolation, or other probability estimation techniques (Gaussian smoothing, Parzen windows, etc).

P-rules  F-rules Condition: preserve classification borders Q: how are membership functions and distance functions related? Can one obtain new, interesting membership functions from known distance functions and vice versa? For all additive distance functions exp transformation changes distances D of P-rules into products of MF of F-rules: MF=exp(-D) Example: Euclidean distance is equivalent to Gaussian MFs Algebraic (product) T-norm is obtained with Gaussian MFs

Visualization Euclidean distance function Decision border MF for attrib 1 MF for attrib 2 Euclidean distance function Square of Canberra distance function

VDM distance => membership functions Decision border MF for attrib 1 MF for attrib 2 DVDM distance function IVDM distance function

Inverse transformation For all product T-norm D = -ln(F) Advantages: New type of distance functions are generated. Example: distances generated from triangular functions.

Applications to real data Gene expression data for 2 types of leukaemia (Golub et al, Science 286 (1999) 531-537 Description: 2 classes, 1100 features, 3 most relevant selected. Used methods: 1 prototype/class LVQ, DVDM similarity measure. Results (number of misclassified vectors): Searching for Promoters in DNA strings Description: 2 classes, 57 features, all symbolic features. Used methods: 9 prototypes for promoters, 12 for nonpromoters, generated using C-means + LVQ, with VDM similarity measure. Results: 5 misclassified vectors in leave one out test. Data Set Golub et al P-rules Train 3 Test 5

Conclusions First step in understanding relations between fuzzy and similarity-based systems was made. Prototype rules can be expressed using fuzzy rules and vice versa. New possibilities in both fields: new type of membership functions; new type of distance functions; VDM measure used in P-rules leads to a natural shape of membership functions in fuzzy logic for symbolic data. Expert knowledge can be captured in both types of rules, but sometimes it is easier to express as P-rules and sometimes as F-rules. Many open problems remain.

Thank You for lending your ears ... Speaker: Marcin Blachnik