ASSISTED BROWSING THROUGH SEMISTRUCTURED DATA PROBLEM The development of the RDF standard highlights the fact that a great deal of useful information is.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Revised WE :55 EST Created TH Lesson 01c. Learning Preferences / Bringing Learners and Library Skills Together Copyright ©
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
The Experience Factory May 2004 Leonardo Vaccaro.
Search Engines and Information Retrieval
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
ISP 433/633 Week 10 Vocabulary Problem & Latent Semantic Indexing Partly based on G.Furnas SI503 slides.
The Last Lecture Agenda –1:40-2:00pm Integrating XML and Search Engines—Niagara way –2:00-2:10pm My concluding remarks (if any) –2:10-2:45pm Interactive.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Modern Information Retrieval Chapter 5 Query Operations.
Intelligent User Interfaces Research Group Directed by: Frank Shipman.
Query Reformulation: User Relevance Feedback. Introduction Difficulty of formulating user queries –Users have insufficient knowledge of the collection.
Chapter 19: Information Retrieval
Connecting Diverse Web Search Facilities Udi Manber, Peter Bigot Department of Computer Science University of Arizona Aida Gikouria - M471 University of.
Query Operations: Automatic Global Analysis. Motivation Methods of local analysis extract information from local set of documents retrieved to expand.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
An Extension to XML Schema for Structured Data Processing Presented by: Jacky Ma Date: 10 April 2002.
The Exchange of Retrieval Knowledge about Services between Agents Mirjam Minor Mike Wernicke.
Intended for novice users as an introduction to the online catalog’s capabilities. The guide would be available on the New Brighton Public Library’s website.
A Comparative Study of Search Result Diversification Methods Wei Zheng and Hui Fang University of Delaware, Newark DE 19716, USA
What is a sketch? Chapter 1.2 addendum Sketching User Experiences: The Workbook.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Welcome to the Science Direct tutorial By the end of this tutorial you should be able to: Do a basic search to find references Use search techniques to.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Planning a search strategy.  A search strategy may be broadly defined as a conscious approach to decision making to solve a problem or achieve an objective.
COMP 208/214/215/216 – Lecture 8 Demonstrations and Portfolios.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
ESSnet on microdata linking and data warehousing in statistical production: Metadata Quality in the Statistical Data Warehouse.
Keyword Query Routing.
Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.
EEL 5937 Ontologies EEL 5937 Multi Agent Systems Lecture 5, Jan 23 th, 2003 Lotzi Bölöni.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Welcome to the Business Source Premier tutorial By the end of this tutorial you should be able to: Do a basic search to find references Use search techniques.
Evaluating & Maintaining a Site Domain 6. Conduct Technical Tests Dreamweaver provides many tools to assist in finalizing and testing your website for.
Design and Implementation of a Rationale-Based Analysis Tool (RAT) Diploma thesis from Timo Wolf Design and Realization of a Tool for Linking Source Code.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Similarity-Based Object Metadata Browser Progress Report Rod McFarland CPSC 533C.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
Query Models CSCI 572: Information Retrieval and Search Engines Summer 2010.
Lecture 5 Frames. Associative networks, rules or logic do not provide the ability to group facts into associated clusters or to associate relevant procedural.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
METADATA MANAGEMENT AT ISTAT: CONCEPTUAL FOUNDATIONS AND TOOLS Istituto Nazionale di Statistica ITALY.
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Harnessing the Deep Web : Present and Future -Tushar Mhaskar Jayant Madhavan, Loredana Afanasiev, Lyublena Antova, Alon Halevy January 7,
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Information Organization: Overview
David Huynh, Stefano Mazzocchi, David Karger Piggy Bank: Experience the Semantic Web inside your web browser Web Semantics: Science, Services and Agents.
Change Control Module P5 LEARNING OBJECTIVES: LEARNING OUTCOMES
SIS: A system for Personal Information Retrieval and Re-Use
Information Retrieval on the World Wide Web
Chapter 2 Database Environment.
-A File System for Lots of Tiny Files
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
Information Retrieval
CSA3212: User Adaptive Systems
Manuscript Transcription Assistant Initiative
Magnet & /facet Zheng Liang
Information Organization: Overview
Information Retrieval and Web Design
Lab 2: Information Retrieval
Discussion Class 9 Google.
Information Retrieval
Presentation transcript:

ASSISTED BROWSING THROUGH SEMISTRUCTURED DATA PROBLEM The development of the RDF standard highlights the fact that a great deal of useful information is in the form of semistructured data — objects connected by relations fitting no rigorous schema. Information is inherently semi- structured in nature: while it usually has an inherent structure behind it, it always has exceptions. To make use of this infor- mation it is important to be able to search and browse semistructured repositories. Approach When modifications are suggested, users need to choose among the many routes to take next, rather than finding the routes themselves. Since users often follow a common set of strategies for searching, we have built several agents each capable of suggesting routes following one strategy. Given any starting point, including an intermediate search result, together these agents enumerate several routes users can take next. Information searches generally involve dialogues between users and computers. Users respond to intermediate results by modifying previous queries to further narrow the relevant search spaces. We provide agents implementing generic search strategies that are useful at any kind of starting point. These strategies include: Attribute Selection: Pick other items that share certain attributes with the item at the starting point. Follow Links: Pick items explicitly connected to the starting item. Containing Collections: Enumerate collections containing the starting item. Similarly Visited: Recall items that have in the past been viewed at a similar time as the starting item. GENERIC STRATEGIES COLLECTION-SPECIFIC STRATEGIES Collections are a necessary concept for information management and a required mechanism for returning search results. We provide agents implementing strategies specifically tuned for aiding searches that start at collections. These strategies include: Collection Refinement: Narrow the starting collection by grouping terms present in that collection’s member elements. Similar Collections: Find other collections containing elements similar to some elements of the starting collection (using the vector-space model). The screenshot below shows suggestions for refining a collection of documents based on their types and due dates. When new metadata is made available (e.g. classification of these documents into projects), the system will pick them up and make new suggestions appropriately. USER INTERFACE Browsing suggestions are made available at all time in a collapsible docking pane. Convenient suggestions are included in context menus (screenshot above). Suggestions are grouped by strategies. Appropriate widgets are used based on the nature of the suggestion (e.g. a date range selector for refining a collection by date). As in web searches, users can revisit intermediate steps of the search. Vector-space model is used to represent collections so that traditional information retrieval algorithms can be leveraged. Vector-space model consists of vectors representing documents with each dimension representing a term (attribute-values pairs). Similarity in documents becomes the dot-product of these vectors. Vineet Sinha - David Karger - David Huynh -