Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Using CAB Abstracts to Search for Articles. Objectives Learn what CAB Abstracts is Know the main features of CAB Abstracts Learn how to conduct searches.
Basic Searching Engineering Village. Agenda What is Engineering Village? Setting up a personal account Searching Engineering Village How to.
Advanced Searching Engineering Village.
Engineering Village ™ Basic Searching.
Engineering Village ™ ® Basic Searching On Compendex ®
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Web Mining Research: A Survey
Searching The Web Search Engines are computer programs (variously called robots, crawlers, spiders, worms) that automatically visit Web sites and, starting.
What is the Internet? The Internet is a computer network connecting millions of computers all over the world It has no central control - works through.
WebMiningResearchASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007 Revised.
The University of Kansas Vitalseek Dr. Susan Gauch.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
What’s new in search? Internet Librarian Oct 29 th 2007.
Setting up a Profile LRC Information Literacy Series: 7 (Google Scholar Citations) By Shri Ram.
1 Chapter 6 Understanding Computers, 11 th Edition Software Ownership Rights Software license: agreement, either included in a software package or displayed.
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE KIEV, 31 JANUARY.
Internet and Social Networking Research Tools for Academic Writing Copyright © 2014 Todd A. Whittaker
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
HOW SEARCH ENGINE WORKS. Aasim Bashir.. What is a Search Engine? Search engine: It is a website dedicated to search other websites and there contents.
Using a Web Browser What does a Web Browser do? A web browser enables you to surf the World Wide Web. What are the most popular browsers?
SEO AdWords Social Media Website Projects Presents: Portal Polo Industrial.
THOMSON SCIENTIFIC Web of Science Using the specialized search and analyze features Jackie Stapleton, librarian Fall 2006.
Selecting a Topic and Purpose
Master Thesis Defense Jan Fiedler 04/17/98
DAY 2: TIPS ON SEARCHING WISELY Robert Phipps August
SCOPUS AND SCIVAL EVALUATION AND PROMOTION OF UKRAINIAN RESEARCH RESULTS PIOTR GOŁKIEWICZ PRODUCT SALES MANAGER, CENTRAL AND EASTERN EUROPE LVIV, 11 SEPTEMBER.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Do's and don'ts to improve your site's ranking … Presentation by:
©2003 Paula Matuszek CSC 9010: Text Mining Applications Document Summarization Dr. Paula Matuszek (610)
Personalized Search Xiao Liu
Dr Jamal Roudaki Faculty of Commerce Lincoln University New Zealand.
DAY 2: TIPS ON SEARCHING WISELY Tazin Afrin August 22,
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Usability Issues in Metasearch Interface Design: persectives of an information provider LITA Human Machine Interface Interest Group June 25, 2004 Oliver.
PEERSPECTIVE.MPI-SWS.ORG ALAN MISLOVE KRISHNA P. GUMMADI PETER DRUSCHEL BY RAGHURAM KRISHNAMACHARI Exploiting Social Networks for Internet Search.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Data Mining for Web Intelligence Presentation by Julia Erdman.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
LOGO A comparison of two web-based document management systems ShaoxinYu Columbia University March 31, 2009.
Facilitating Document Annotation using Content and Querying Value.
Personalization with user’s local data Personalizing Search via Automated Analysis of Interests and Activities 1 Sungjick Lee Department of Electrical.
© 2010 Deep Web Technologies, Inc. Taking the Library Back from Google Abe Lederman, President and CTO Deep Web Technologies May 12, 2010.
COLLABORATIVE SEARCH TECHNIQUES Submitted By: Shikha Singla MIT-872-2K11 M.Tech(2 nd Sem) Information Technology.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Current Information To help you find current news and information, many search engines and directories include a hyperlink to a "What's new" page. Many.
LIS618 lecture 8 Thomas Krichel Lexis/Nexis Lexis is a specialized legal research service Nexis is primarily a news services adds an important.
A System for Automatic Personalized Tracking of Scientific Literature on the Web Tzachi Perlstein Yael Nir.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
IST 497G Ron Grzywacz November 2002 Personalization of Information Retrieval.
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Facilitating Document Annotation Using Content and Querying Value.
DAY 2: EFFECTIVE INTERNET SEARCHES Sravanthi Lakkimsetty January 13,
Research Methods in Business and Economics4 Jan Brzozowski, PhD.
EndNote Ver.X7: A Reference Management Software
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
Searching the Web for academic information Ruth Stubbings.
Using Google Scholar Ronald Wirtz, Ph.D.Calvin T. Ryan LibraryDec Finding Scholarly Information With A Popular Search Engine Tool.
Google Scholar and ShareLaTeX
CS 430: Information Discovery
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
The Economy of Distributed Metadata Authoring
CS 430: Information Discovery
Data Mining Chapter 6 Search Engines
Introduction to Information Retrieval
Web Mining Research: A Survey
Information Retrieval and Web Design
Information Retrieval and Web Design
Presentation transcript:

Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin September 2000

Web Information Retrieval Prof. Alessandro Agostini 2 Abstract Actual Web search engines treat search requests in isolation: they give identical results to identical queries made by different users in different contexts. This “one size fits all” model limits diversity, competition and functionality on the web. To find an answer to this problem, next-generation Web search engines will make increasing use of context information, either by using explicit or implicit context information from users or by adding functionalities within restricted contexts.

Web Information Retrieval Prof. Alessandro Agostini 3 Introduction The web is more and more representative of our society and millions of different people publish and organize information in it. These people differ for background, knowledge and expectations and this makes the web very diverse in terms of content and structure. Even if databases used in traditional information retrieval systems are strictly structured, current web search engines are similar in operation to these information retrieval systems. Web search engines return ranked lists of relevant documents in response to user queries, but just a few of these results may be valuable to a user.

Web Information Retrieval Prof. Alessandro Agostini 4 In fact the value depends on the context of the query, i.e. the education, interests, experiences of the user and further information about the current request. Even if the major search engines make an incredible amount of information quickly and easily available and they are widely used, they do have significant limitations: Often out-of-date Index only a part of the publicly indexable web Do not index documents requiring authentication Do not index sites equally The need for better web search engines is becoming increasingly important.

Web Information Retrieval Prof. Alessandro Agostini 5 Inserting context in Web search Understand the context of search requests Adding explicit context information Automatically inferring implicit context information in personalized search engines Guessing what the user wants Restrict the context of search engines Specialized search engines Identifying communities on the web Locating specialized search engines

Web Information Retrieval Prof. Alessandro Agostini 6 Adding explicit context information Provided by the user in the form of keywords added to a query (e.g. “home” or “homepage”) The Inquirus 2 project, developed at NEC Research Institute, asks the user making a query also to explicitly choose a category (e.g. “personal home pages”, “research papers”). The system uses this context information: To select the search engines to send queries to To modify queries To select the ordering policy Inquirus 2 has proven to be highly effective at improving search precision within given categories.

Web Information Retrieval Prof. Alessandro Agostini 7 Inferring implicit context information: personalized search The Watson project is an example of client-based personalized search engine that extracts implicit context information (e.g. words, font size, etc). It automatically infers user’s context information basing on the content of the current document being edited in Microsoft Word or viewed in Internet Explorer. Watson uses the inferred information to modify the user’s query and forwards the new query to web search engines. Client-based systems like Watson have limited functionalities.

Web Information Retrieval Prof. Alessandro Agostini 8 Personalized search Search engines that know all of user’s previous requests and interest could use that information to tailor results. A server-based search engine like Google could infer user interests and rank links in agreement with them. Server-based full-scale personalization is currently too expensive for the major web search engines: hopefully it will be more feasible over time. Personalizes search services deal with the problems of consistency and privacy.

Web Information Retrieval Prof. Alessandro Agostini 9 Guessing what user wants It is an increasingly common search engines technique that aims to produce additional results and links to the user’s query, trying to guess what user wants giving also links related to the keywords (e.g. stock symbols -> stock quotes links + company information links). This technique is limited to cases where potential context can be identified based on keyword query and can be improved by a personalized search engines. Clustering search results into categories (e.g. “News”) allows the user to easily narrow results.

Web Information Retrieval Prof. Alessandro Agostini 10 Restricting the context of search engines Another way to add context into web search is to restrict the context of the search engine, i.e. to create specialized search engines for specific domains. Thousands of specialized search engines already exist and provide higher functionality than the regular web search engines, also allowing to retrieve information not accessible using conventional search engines in a specific domain (

Web Information Retrieval Prof. Alessandro Agostini 11 Specialized search engines CiteSeer (ResearchIndex) is the world’s largest specialized free, public search engine for scientific literature, currently indexing over articles and citations. CiteSeer incorporates many features specific to scientific literature: it automates the creation of citation indices, provides easy access to their context and gives functionality for extracting information found in research articles.

Web Information Retrieval Prof. Alessandro Agostini 12 Identifying communities on the web Domain-specific search engines need a method for locating the subset of the web within their domain. A good method has been successfully identified in communities (Flake). A web community is defined as a collection of pages where each member has more links inside the community than outside of the community. Because there is no central authority governing the formation of links on the web, this discovery is important and allows identification of communities independent of the specific words used inside pages.

Web Information Retrieval Prof. Alessandro Agostini 13 Locating specialized search engines Many queries that would be best served by specialized services are likely to be sent to the major web search engines because the overheads in locating a specialized engine are too great. Much research has been done in this area and several methods of selecting search engines based on user queries have been proposed. It would be very useful if the major search engines attempted to direct users to the best specialized search engine, but they have economic reasons not to provide such a service.

Web Information Retrieval Prof. Alessandro Agostini 14 Does one size fit all? All users receive the same responses for given queries and that is the great benefit of the web: to allow equalizing access to information. Whereas, not much appears to be equal on the web: influenced by search engines, the distribution of traffic is disproportionate, the majority of links go to a small number of very popular sites, the results to queries follow unpredictable ranking criteria: “winners take all”. Specialized search engines may provide less biased results (e.g. Yellow pages), but most of the people use the major web search engines.

Web Information Retrieval Prof. Alessandro Agostini 15 Conclusions New search engines should make a greater use of context to mitigate any negative effects of biased access to information on the web and to increase competition, diversity and functionality. Web search is today one of the most challenging problems of the Internet and as web search becomes a more important function within society, the need for even better web search services is becoming increasingly important.

Web Information Retrieval Prof. Alessandro Agostini 16 Thank you for your attention.