Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto.

Similar presentations


Presentation on theme: "A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto."— Presentation transcript:

1 A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto Bugarín, and Senén Barro Present by Chia-Hao Lee

2 2 outline Introduction Fuzzy Quantifiers –Probabilistic Quantifier Fuzzification Mechanisms New View in Crisp Representatives –FA Quantifier Fuzzification mechanism –Properties of the Model Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval –Fuzzy Quantifiers and Information Retrieval –Example Information Retrieval Experiments Conclusion

3 3 Introduction The ability of fuzzy quantifiers to model linguistic statements in a natural way has proved useful in diverse areas such as expert systems, data mining, control systems, database systems, etc. In the information retrieval (IR) field, fuzzy quantification has been proposed for handling expressive queries giving rise to flexible query language.

4 4 Fuzzy Quantifiers A fuzzy set X its membership function is denoted as : for example: : the powerset of E : the fuzzy powerset of E stands for the α-cut of level α of X

5 5 Definition 1 (Classic Quantifier) : A classic s-ary quantifier on a base or referential set E is a mapping Q : A typical example of a classic quantifier is the following definition of an all statement which can be used for sentences such as “ ” : Fuzzy Quantifiers

6 6 For example : Let us consider the evaluation of the sentence “80% or more of students are Spanish” where the properties “students” and “Spanish” are, respectively, defined as X 1 ={1,0,1,0,1,0,1,1}, X 2 ={1,0,1,0,1,0,0,0} and “80% or more” is defined as in (1). Then

7 7 Definition 2 (Fuzzy Quantifier) : An s-ary fuzzy quantifier Q on a base set is a mapping An example of a fuzzy quantifier is, which can defined as a fuzzy extension of 1 using a typical definition for the fuzzy inclusion operator: Fuzzy Quantifiers

8 8 For example : Let us consider the evaluation of sentence “all tall people are blond” in the referential set. Let us assume that properties “tall” and “blond” are, respectively, defined as Using expression (2) then: In many cases, it is not easy to achieve consensus on an intuitive and generally applicable expression for implementing a given quantified expression. Fuzzy Quantifiers

9 9 Definition 3 (Semi-fuzzy Quantifier) : An s-ary semi-fuzzy quantifier Q on a base set is a mapping which assigns a gradual result to each choice of crisp.

10 10 Fuzzy Quantifiers Examples of semi-fuzzy quantifier are :

11 11 Fuzzy Quantifiers For example : Let us consider the evaluation of the sentence “about 80% or more of the students are Spanish”. Let us assume that properties “students” and “Spanish” are, respectively, defined as X 1 ={1,0,1,0,1,0,1,1}, X 2 ={1,0,1,0,1,0,0,0} then

12 12 Fuzzy Quantifiers Although semi-fuzzy quantifiers are much more intuitive and easier to define than fuzzy quantifiers, they cannot be directly applied for handling linguistic statements, since semi-fuzzy quantifiers are defined on crisp sets. Such methods are known as quantifier fuzzification mechanisms (QFM) and formally defined as a mapping with domain in the universe of semi-fuzzy quantifiers and range in the universe of fuzzy quantifiers:

13 13 Fuzzy Quantifiers Probabilistic Quantifier Fuzzification Mechanisms : In the universe of discourse E is finite and expressions and unary then both expressions collapse into the same discrete expression The value can be interpreted as the probability that is selected as the crisp representative for the fuzzy set X.

14 14 Fuzzy Quantifiers For example : Let us consider the evaluation of the quantified sentence “almost all students are tall.” Suppose that we model the property tall for a referential set of students through the fuzzy set tall and we support the quantified expression “almost all” by means of the following semi-fuzzy quantifier :

15 15 Fuzzy Quantifiers given the fuzzy set tall, the values are and the fuzzification process runs as follows:

16 16 New View on Crisp Representatives Given a fuzzy set, the process that selects a number of elements in E to be includes in a crisp representative of X can be viewed as a random process in which n mutually independent binary decisions are made. Every individual decision involving an element may be viewed as a Bernoulli trial whose probability of success equals.

17 17 New View on Crisp Representatives Definition 4 ( ) : We define the probability that a crisp set is a crisp representative of X as Definition 5 ( ) : Let be a semi-fuzzy quantifier. For simplicity, fuzzification process :

18 18 New View on Crisp Representatives We will denote by a referential containing m elements. By we will denote a crisp (fuzzy) set on this referential. Let us consider a unary semi-fuzzy quantitative quantifier : a function with the form

19 19 New View on Crisp Representatives For this case, the expression becomes And we instead of

20 20 New View on Crisp Representatives Example of the approach

21 21 New View on Crisp Representatives It can be proved that all the value can be obtained with a complexity

22 22 New View on Crisp Representatives We can advance that the model is well-behaved because it fulfills the properties of correct generalization of crisp expressions, induced operations, external negation, internal negation, duality, internal meets, monotonicity in arguments monotonicity in quantifiers and coherece with logic.

23 23 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval IR is the science concerned with the effective and efficient retrieval of information for the subsequent use by interested parties. IR models differ in the way in which documents and queries are represented and matched. The proposal designs a general framework based on the NVM method in which quantifiers with different degrees of expressiveness can be handled.

24 24 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval Consider a query with the form. Given a document of the document base, every query term produces a score which represents the connection between the document’s semantics and the term. Formally, every document induces a fuzzy set on the set of query terms which is defined applying the popular weighting strategy : the raw frequency of term in the document : the maximum raw frequency computed over all terms mentioned by the document

25 25 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval The fuzzy set models the connection between the document and every query component. Quantification can now be applied on for evaluating the quantified symbol all.

26 26 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval Example : Let us suppose that we apply the following power function for supporting a given query quantification symbol Q : Imagine a query and consider a document whose fuzzy set induced on the query components is Applying now the fuzzification process explained along this paper, the query-document matching is assigned a score n : the number of query terms

27 27 Applying the FA Quantifier Fuzzificaiton Mechanism for Information Retrieval Let us now apply the NVM approach to handle the same example. The score assigned is equal to It follows that the final value yielded by the NVM method is:

28 28 Information Retrieval Experiment We ran experiments against the Wall Street Journal (WSJ) documents, which are about 173,000 news articles (from 1987 to 1992). Natural language documents are preprocessed as follow: –First, common words such as prepositions, articles, etc. are eliminated. –Second, terms are reduced to their syntactical root by applying the popular Porter’s stemmer.

29 29 Information Retrieval Experiment We tried out different semi-fuzzy quantifiers for relaxing the interpretation of the quantified statement all and, for each semi-fuzzy quantifier, both the fuzzification approach and the NVM approach were applied. We experimented with power functions and exponential functions, both of them normalized in the interval as follows :

30 30 Information Retrieval Experiment

31 31 Information Retrieval Experiment

32 32 Information Retrieval Experiment

33 33 Information Retrieval Experiment

34 34 Information Retrieval Experiment

35 35 Information Retrieval Experiment

36 36 Conclusion In the paper, we present a new probabilistic quantifier fuzzification mechanism, its efficient implementation and its application for the basic information retrieval task.


Download ppt "A Probabilistic Quantifier Fuzzification Mechanism: The Model and Its Evaluation for Information Retrieval Felix Díaz-Hemida, David E. Losada, Alberto."

Similar presentations


Ads by Google