Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign RankFP : A Framework for Rank Formulation and Processing Hwanjo Yu, Seung-won.

Similar presentations


Presentation on theme: "The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign RankFP : A Framework for Rank Formulation and Processing Hwanjo Yu, Seung-won."— Presentation transcript:

1 The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign RankFP : A Framework for Rank Formulation and Processing Hwanjo Yu, Seung-won Hwang, Kevin Chen-Chuan Chang

2 AIM 2 The Context: AIMing to the Top Enabling ad-hoc ranking in data retrieval Rank Formulation Rank Processing select * from houses order by [ranking function F] limit 3 ranked results query top-3 houses

3 AIM 3 Problem: Enabling Ad-hoc Ranking To enable ad-hoc ranking, we observe two major barriers: Usability: Ranking should be “user-friendly”, for ordinary users to easily specify their ranking criteria Efficiency: Ranking should be “DB-friendly” to be amenable to efficient processing  We propose a framework combining user-friendly formulation and DB-friendly processing.

4 AIM 4 Our Insight: Combining Usability and Efficiency We combine qualitative model for usability and quantitative model for efficiency Qualitative model  Query condition is represented as a relative ordering of objects  User-friendly by alleviating user from specifying the absolute score on each object  Example > Quantitative model  Query condition is represented as a mapping F of objects into absolute numerical scores  DB-friendly, by attaining the absolute score on each object  Example F( )=0.9 F( )=0.5

5 AIM 5 Our Solution: RankFP (RANK Formulation and Processing) For usability, we propose a qualitative formulation front-end which enables rank formulation by ordering samples For efficiency, we learn a quantitative ranking function F which is readily expressible using order by clause in SQL sample S (unordered) Sample Selection: generate new S Function Learning: learn new F ranking R* over S Over S: R F  R* ? 1 2 3 45 no yes F ranking function Rank Formulation Rank Processing ranked results processing of Q Q: select * from houses order by F limit k

6 AIM 6 Implementation SVM Learner order by F sampled top results interface PostgreSQL Top-k results if R F  R* ?

7 AIM 7 Task 1: Rank Formulation Front-end (Ranking  Classification) Challenge: Unlike a conventional learning problem of classifying objects into groups, we need to learn a function inducing a desired ordering of all objects Solution: Transform ranking into a classification on pairwise differences [Herbrich2000] and adopt learning algorithms (e.g., SVM) to learn pairwise classification function F learning algorithms: a binary classifier 1 - F a - b b - c c - d d - e a - c … … ranking view: c > b > d > e > a c b d e a classification view: 0 0 1 1 0 pairwise diff. classification [Herbrich2000] R. Herbrich, et. al. Large margin rank boundary for ordinal regression. MIT Press, 2000.

8 AIM 8 Task 2: Rank Processing Back-end (Classification  Ranking) Challenge: While the classification is for each pair of objects, we need to efficiently rank the entire database. Solution: We develop duality connecting a pairwise classification function F, also as a global per-object ranking function. Suppose the rank function F is linear Classification View: Ranking View: F(u i -u j )>0  F(u i )- F(u j )>0  F(u i )> F(u j ) b d e a c F(a-b)? F(a-c)? F(a-d)? ….. F Rank with F(. ) e.g., F(c)>F(b)>F(d)>… Further: Optimization of Top-k Order-by [SIGMOD’05]

9 AIM 9 Conclusion: Summary To support ranking for data retrieval, we develop RankFP, an iterative learning and processing framework, combining: Usability: Developing a learning front-end, which enables qualitative rank formulation Efficiency: Transforming the classification to a global rank function for efficient processing

10 AIM 10 Thank You! For more information: The AIM Project: http://aim.cs.uiuc.edu


Download ppt "The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign RankFP : A Framework for Rank Formulation and Processing Hwanjo Yu, Seung-won."

Similar presentations


Ads by Google