Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.

Slides:



Advertisements
Similar presentations
ANHAI DOAN ALON HALEVY ZACHARY IVES CHAPTER 16: KEYWORD SEARCH PRINCIPLES OF DATA INTEGRATION.
Advertisements

Efficient IR-Style Keyword Search over Relational Databases Vagelis Hristidis University of California, San Diego Luis Gravano Columbia University Yannis.
I/O-Algorithms Lars Arge Fall 2014 September 25, 2014.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
DISCOVER: Keyword Search in Relational Databases Vagelis Hristidis University of California, San Diego Yannis Papakonstantinou University of California,
Mining Web’s Link Structure Sushanth Rai University of Texas at Arlington
Effective Keyword Search in Relational Databases Fang Liu (University of Illinois at Chicago) Clement Yu (University of Illinois at Chicago) Weiyi Meng.
Chapter 12: Expert Systems Design Examples
Structural Web Search Using a Graph-Based Discovery System Nitish Manocha, Diane J. Cook, and Lawrence B. Holder University of Texas at Arlington
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Circumventing Data Quality Problems Using Multiple Join Paths Yannis Kotidis, Athens University of Economics and Business Amélie Marian, Rutgers University.
FACT: A Learning Based Web Query Processing System Hongjun Lu, Yanlei Diao Hong Kong U. of Science & Technology Songting Chen, Zengping Tian Fudan University.
Keyword Proximity Search on XML Graphs Vagelis Hristidis Yannis Papakonstatinou Andrey Presenter: Feng Shao.
Chapter 19: Information Retrieval
Quality-driven Integration of Heterogeneous Information System by Felix Naumann, et al. (VLDB1999) 17 Feb 2006 Presented by Heasoo Hwang.
Graph Algebra with Pattern Matching and Aggregation Support 1.
Bidirectional Expansion for Keyword Search on Graph Databases Varun Kacholia Shashank Pandit Soumen Chakrabarti S. Sudarshan.
CHAMELEON : A Hierarchical Clustering Algorithm Using Dynamic Modeling
Authors: Bhavana Bharat Dalvi, Meghana Kshirsagar, S. Sudarshan Presented By: Aruna Keyword Search on External Memory Data Graphs.
NUITS: A Novel User Interface for Efficient Keyword Search over Databases The integration of DB and IR provides users with a wide range of high quality.
Keyword Search in Relational Databases Jaehui Park Intelligent Database Systems Lab. Seoul National University
Search Engines and Information Retrieval Chapter 1.
Keyword Search on External Memory Data Graphs Bhavana Bharat Dalvi, Meghana Kshirsagar, S. Sudarshan PVLDB 2008 Reported by: Yiqi Lu.
1 Overview of Databases. 2 Content Databases Example: Access Structure Query language (SQL)
DBXplorer: A System for Keyword- Based Search over Relational Databases Sanjay Agrawal Surajit Chaudhuri Gautam Das Presented by Bhushan Pachpande.
DBease: Making Databases User-Friendly and Easily Accessible Guoliang Li, Ju Fan, Hao Wu, Jiannan Wang, Jianhua Feng Database Group, Department of Computer.
1 Chapter 19: Information Retrieval Chapter 19: Information Retrieval Relevance Ranking Using Terms Relevance Using Hyperlinks Synonyms., Homonyms,
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
G-SPARQL: A Hybrid Engine for Querying Large Attributed Graphs Sherif SakrSameh ElniketyYuxiong He NICTA & UNSW Sydney, Australia Microsoft Research Redmond,
DBXplorer: A System for Keyword- Based Search over Relational Databases Sanjay Agrawal, Surajit Chaudhuri, Gautam Das Cathy Wang
Querying Structured Text in an XML Database By Xuemei Luo.
Harikrishnan Karunakaran Sulabha Balan CSE  Introduction  Database and Query Model ◦ Informal Model ◦ Formal Model ◦ Query and Answer Model 
GEORGIOS FAKAS Department of Computing and Mathematics, Manchester Metropolitan University Manchester, UK. Automated Generation of Object.
CS 533 Information Retrieval Systems.  Introduction  Connectivity Analysis  Kleinberg’s Algorithm  Problems Encountered  Improved Connectivity Analysis.
Scaling Personalized Web Search Authors: Glen Jeh, Jennfier Widom Stanford University Written in: 2003 Cited by: 923 articles Presented by Sugandha Agrawal.
Presenter: Shanshan Lu 03/04/2010
SPARQL Query Graph Model (How to improve query evaluation?) Ralf Heese and Olaf Hartig Humboldt-Universität zu Berlin.
Workshop on Software Product Archiving and Retrieving System Takeo KASUBUCHI Hiroshi IGAKI Hajimu IIDA Ken’ichi MATUMOTO Nara Institute of Science and.
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Mehdi Kargar Aijun An York University, Toronto, Canada Keyword Search in Graphs: Finding r-cliques.
Complex Queries over Web Repositories Sriram Raghavan and Hector Garcia-Molina Computer Science Department Stanford University Gülfem IŞIKLAR M.Mirac KOCATÜRK.
Templated Search over Relational Databases Date: 2015/01/15 Author: Anastasios Zouzias, Michail Vlachos, Vagelis Hristidis Source: ACM CIKM’14 Advisor:
 Enhancing User Experience  Why it is important?  Discussing user experience one-by-one.
Multi-object Similarity Query Evaluation Michal Batko.
Date: 2012/08/21 Source: Zhong Zeng, Zhifeng Bao, Tok Wang Ling, Mong Li Lee (KEYS’12) Speaker: Er-Gang Liu Advisor: Dr. Jia-ling Koh 1.
Keyword Search on Graph-Structured Data
Date: 2013/4/1 Author: Jaime I. Lopez-Veyna, Victor J. Sosa-Sosa, Ivan Lopez-Arevalo Source: KEYS’12 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang KESOSD.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Basics of Databases and Information Retrieval1 Databases and Information Retrieval Lecture 1 Basics of Databases and Information Retrieval Instructor Mr.
1 Holistic Twig Joins: Optimal XML Pattern Matching Nicolas Bruno, Nick Koudas, Divesh Srivastava ACM SIGMOD 2002 Presented by Jun-Ki Min.
Block-level Link Analysis Presented by Lan Nie 11/08/2005, Lehigh University.
Keyword Searching and Browsing in Databases using BANKS Charuta Nakhe, Arvind Hulgeri, Gaurav Bhalotia, Soumen Chakrabarti, S. Sudarshan Presented by Sushanth.
Learning to Create Data-Integration Queries Partha Pratim Talukdar, Marie Jacob, Muhammad Salman Mehmood, Koby Crammer, Zachary G. Ives, Fernando Pereira,
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Developing GRID Applications GRACE Project
XRANK: RANKED KEYWORD SEARCH OVER XML DOCUMENTS Lin Guo Feng Shao Chavdar Botev Jayavel Shanmugasundaram Abhishek Chennaka, Alekhya Gade Advanced Database.
Information Retrieval
Associative Query Answering via Query Feature Similarity
A Comparative Study of Link Analysis Algorithms
Keyword Searching and Browsing in Databases using BANKS
Information Retrieval
KDD Reviews 周天烁 2018年5月9日.
Keyword Searching and Browsing in Databases using BANKS
Keyword Searching and Browsing in Databases using BANKS
Bidirectional Query Planning Algorithm
Chapter 31: Information Retrieval
Information Retrieval and Web Design
Chapter 19: Information Retrieval
Introduction to XML IR XML Group.
Presentation transcript:

Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Outline  Introduction  Database and Query Model  Searching for the best answers  Browsing features of BANKS  Experiment  Conclusion

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Introduction  Search engines on Web have popularized an unstructured querying and browsing Simple and user-friendly Users just type in keywords and follow hyperlink  Relational databases are commonly searched using structured query language Users need to know the schema  Keyword searching techniques cannot be used on data stored in databases It often splits across the tables/tuples due to normalization

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Introduction(cond..)  BANKS (Browsing And Keyword Searching) a system which enables keyword-based search on relational databases, together with data and schema browsing User BANKS system Database HTTPJDBC

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Introduction(cond..)  BANKS (Browsing And Keyword Searching) a framework for keyword querying of relational database a novel and efficient heuristic algorithm for executing keyword queries key features of BANKS system

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Outline  Introduction  Database and Query Model Informal Model Formal Model Query and Answer Model  Searching for the best answers  Browsing features of BANKS  Experiment  Conclusion

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Informal Model Model Description each tuple in db fk-pk-Link database node in the graph directed edge directed graph   

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model The Schema

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model A Fragment of the Database

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Informal Model(cond.) An answer to a query should be a subgraph connecting nodes matching the keywords. The importance of a link depends upon the type of the link i.e. what relations it connects and on its semantics Ignoring directionality would cause problems because of “hubs” which are connected to a large numbers of nodes.

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Informal Model(cond.) We may restrict the information node to be from a selected set of nodes of the graph We incorporate another interesting feature, namely node weights, inspired by prestige rankings Node weights and tree weights need to be combined to get an overall relevance score

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Formal Database Model Nodes and edges Node Weight : N(u) Depends on the prestige Set the node prestige = the indegree of the node Nodes that have multiple pointers to them get a higher prestige Node score N = root node weight + ∑ leaf node weight

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Formal Database Model (Cond.) Edge Weights Some pupluar tuples can be connected many other tuples  Edge with forward and backward edge weights Weight of a forward link = the strength of the proximity relationship between two tuples (set to 1 by default) Weight of a backward link = indegree of edges pointing to the node Total edge weight = ∑ edge weights Edge score E = 1 / Total edge weight

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Formal Database Model (Cond.) Overall relevance score = Node weights + Edge Weight Normalize in the range [0,1] Combine using weighting factor Additive: (1- ) E + N; multiplicative: E * N

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Query and Answer Model Query A set of keywords e.g.{k 1,k 2,…k n } A set of nodes S i = {S 1,S 2,…S n } Locate nodes matching search terms t 1,t 2,…t n Answer Model A rooted directed tree connecting keyword nodes Relevance score of an answer tree Relevance scores of it nodes and its edge weight

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model  Answer Model A rooted directed tree connecting keyword nodes Multiple answers Ranked by proximity + prestige Proximity  edges weights Prestige  indegree of nodes Relevance score of an answer tree Relevance scores of it nodes and its edge weight

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Database and Query Model Result of query “sudarshan soumen”

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Outline  Introduction  Database and Query Model  Searching for the best answers Backward expanding search algorithm  Browsing features of BANKS  Experiment  Conclusion

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Searching for the best answers  Backward expanding search algorithm Offers a heuristic solution for incrementally computing query results. Assume that the graph fits in memory Start at leaf nodes each containing a query keyword Run concurrent single source shortest path algorithm from each such node Traverses the graph edges backwards Confluence of backward paths identify answer tree roots Output a node whenever it is on the intersection of the sets of nodes reached from each keyword Answer trees may not be generated in relevance order Insert answers to a small buffer (heap) Output highest ranked answer from buffer to user when buffer is full

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Searching for the best answers Model (Query : Charuta Sudarshan Roy ) S. SudarshanPrasan Roy writes author paper Charuta BANKS: Keyword search…

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Outline  Introduction  Database and Query Model  Searching for the best answers  Browsing features of BANKS  Experiment  Conclusion

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Browsing  BANKS system provides A rich interface to browse data stored in a relational database Automatically generates browsable views of database relations and query results Schema browsing and data browsing A hyperlink to the referenced tuple Templates for several predefined ways of displaying data

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Browsing Data browsing

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Browsing Schema browsing

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Outline  Introduction  Database and Query Model  Searching for the best answers  Browsing features of BANKS  Experiment  Conclusion

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Error scores vs parameter choices  The rankings are relatively stable across different choices of parameter values  = 0.2 coupled with log scaling of edges weights does best

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Outline  Introduction  Database and Query Model  Searching for the best answers  Browsing features of BANKS  Experiment  Conclusion

Seoyoung AhnKeyword Searching and Browsing in Databases using BANKS Conclusion  BANKS system provides an integrated browsing and keyword querying system for relational databases allows users with no knowledge of database systems or schema to query and browse relational database with ease