Leveraging Publisher’s Search Engines to Deliver Relevant Results to Users, presented by Abe Lederman, President and CTO, Deep Web Technologies, LLC

Leveraging Publisher’s Search Engines to Deliver Relevant Results to Users
Presented by Abe Lederman, President and CTO, Deep Web Technologies, LLC
28th Annual Scholarly Publishing Meeting, Virginia, June 9, 2006

Abe’s Background
- Earned B.S. and M.S. degrees in Computer Science, MIT
- 18 years of experience developing sophisticated information retrieval applications
- Cofounded Verity, 1988
- Consulted to LANL, deployed first “federated search” portal in the Federal government, 1999
- Founded Deep Web Technologies (DWT), 2002
- DWT is a New Mexico-based company focused on providing state-of-the-art software solutions that search, retrieve, aggregate, and analyze content from web-based databases

The Problem: Searching a large number of sources can lead to a flood of results

Relevance ranking begins as soon as the user clicks the Search button

Ranking Recipe
INGREDIENTS
- Source Selection
- Query Language
- Search Conductor
- Ranking Algorithms
MIX WELL AND SERVE UP RELEVANT RESULTS

Source Selection Optimizer
[Diagram labels: Search Conductor, Source Selection Optimizer, Source Descriptions, Previous Results]
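The slide names only the inputs to source selection (source descriptions and previous results), not how they are combined. The following is a minimal sketch under stated assumptions: the `Source` class, the `past_good_ratio` field, the equal 50/50 weighting, and the `min_score` cutoff are all hypothetical and are not taken from the presentation.

```python
from dataclasses import dataclass


@dataclass
class Source:
    name: str
    description: str        # free-text description of what the source covers
    past_good_ratio: float  # hypothetical: share of past results judged good (0.0 to 1.0)


def select_sources(query, sources, min_score=0.5):
    """Return names of sources worth searching, best first.

    Assumed scoring: half from query/description term overlap,
    half from how well the source performed on previous searches.
    """
    terms = set(query.lower().split())
    scored = []
    for src in sources:
        desc_terms = set(src.description.lower().split())
        overlap = len(terms & desc_terms) / len(terms) if terms else 0.0
        score = 0.5 * overlap + 0.5 * src.past_good_ratio
        if score >= min_score:
            scored.append((score, src.name))
    # Highest score first; ties broken alphabetically for a stable order
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored]
```

Skipping sources that score below the cutoff is what would reduce unnecessary searches; a real optimizer would likely use richer descriptions than a bag of words.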

Powerful Query Language
- Takes advantage of search capabilities of each source
- Supports full Boolean operators where possible
- Supports fielded search
- Translates natural language questions into query syntax
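To illustrate translating one query into each source's own syntax, here is a rough sketch. The two dialects (`SOURCE_A`, `SOURCE_B`), their field names, and the `translate` function are invented for illustration; they do not describe any real publisher's search syntax or DWT's query language, and natural-language question handling is not covered.

```python
# Hypothetical source dialects: which fields each source can search,
# how it writes a fielded term, and whether it accepts Boolean operators.
SOURCE_A = {"fields": {"title": "ti", "author": "au"},
            "fielded": "{field}:{term}", "boolean": True}
SOURCE_B = {"fields": {"title": "TITLE"},
            "fielded": "{field}({term})", "boolean": False}


def translate(clauses, operator, dialect):
    """Translate (field, term) clauses into one source's query string.

    clauses: list of (field, term) pairs; operator: "AND" or "OR".
    """
    parts = []
    for field, term in clauses:
        quoted = f'"{term}"' if " " in term else term  # quote phrases
        field_name = dialect["fields"].get(field)
        if field_name is None:
            # Source cannot search this field: fall back to an unfielded term
            parts.append(quoted)
        else:
            parts.append(dialect["fielded"].format(field=field_name, term=quoted))
    # Use the Boolean operator only where the source supports it
    joiner = f" {operator} " if dialect["boolean"] else " "
    return joiner.join(parts)
```

The fallback branch reflects the "where possible" wording on the slide: a source without author search still receives the author's name as a plain keyword.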

Search Conductor
[Flowchart steps: Select sources to search; Perform Search; Enough good results? (YES / NO); Can I get more results from “good” sources? (YES); Get Next Results; Deliver results to user]
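One possible reading of the Search Conductor flowchart is a loop that keeps asking for further result pages only from sources that are returning good results. The sketch below assumes that reading; the in-memory `sources` structure, the score thresholds, and the function name `conduct_search` are hypothetical stand-ins for live searches.

```python
def conduct_search(sources, wanted=5, good_score=0.5, good_ratio=0.5):
    """Gather results until enough good ones are found.

    sources: dict mapping source name -> list of result pages,
             where each page is a list of (title, score) tuples.
    A result is "good" if score >= good_score (assumed definition).
    A source stays "good" if at least good_ratio of its last page was good.
    """
    good = []
    next_page = {name: 0 for name in sources}
    active = list(sources)  # sources still worth asking for more
    while active and len(good) < wanted:
        still_good = []
        for name in active:
            pages = sources[name]
            idx = next_page[name]
            if idx >= len(pages):
                continue  # source has no more results
            page = pages[idx]
            next_page[name] = idx + 1
            hits = [r for r in page if r[1] >= good_score]
            good.extend(hits)
            if page and len(hits) / len(page) >= good_ratio:
                still_good.append(name)
        active = still_good  # only "good" sources get a next request
    good.sort(key=lambda r: r[1], reverse=True)
    return good[:wanted]
```

In this sketch a source whose first page is mostly poor is never asked for a second page, even if later pages would have been useful; that trade-off is an assumption of the example, not a claim about the original system.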

Challenges in Organizing and Ranking Results
- Multi-tier Relevance Ranking
- User-driven Ranking
- Clustering of Results

Multi-tier Relevance Ranking
- QuickRank: Ranks results based on occurrence of search terms in title, author, and snippet
- MetaRank: Ranks results utilizing custom algorithms applied to meta-data
- DeepRank: Downloads and indexes full-text documents
HEAVY LIFTING REQUIRED!
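The first tier, ranking by occurrence of search terms in title, author, and snippet, can be approximated with a simple term count. This is only a sketch in the spirit of that description: the field weights and the `quick_rank` function are assumptions, and the actual QuickRank algorithm is not given in the slides.

```python
def quick_rank(query, results, weights=None):
    """Order results by weighted counts of query terms in each field.

    results: list of dicts with optional "title", "author", "snippet" keys.
    weights: hypothetical per-field weights (title counts most by default).
    """
    weights = weights or {"title": 3.0, "author": 2.0, "snippet": 1.0}
    terms = query.lower().split()

    def score(result):
        total = 0.0
        for field, weight in weights.items():
            words = result.get(field, "").lower().split()
            total += weight * sum(words.count(term) for term in terms)
        return total

    # sorted() is stable, so equally scored results keep their source order
    return sorted(results, key=score, reverse=True)
```

Because it needs only the fields already present in a result list, a ranking like this can run without downloading any documents, which is the contrast the slide draws with the full-text tier.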

User-driven Ranking
- Credibility of source
- Date range
- Document length
- Document type
- Geographic proximity
- Popularity of document
- Reading level
- Relevance
Desired: Blending (weighing) of above criteria
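The desired blending of criteria could take the form of a user-weighted average. The sketch below assumes each result already carries per-criterion scores normalized to the range 0 to 1; the `scores` layout, the criterion names, and the `blend` function are hypothetical.

```python
def blend(results, weights):
    """Order results by a weighted average of per-criterion scores.

    results: list of dicts, each with a "scores" dict of criterion -> 0..1.
    weights: user-chosen importance per criterion (missing scores count as 0).
    """
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("at least one weight must be positive")

    def blended(result):
        weighted = sum(w * result["scores"].get(criterion, 0.0)
                       for criterion, w in weights.items())
        return weighted / total_weight

    return sorted(results, key=blended, reverse=True)
```

With equal weights on relevance and recency, a newer but slightly less relevant document can outrank an older one; setting the recency weight to zero restores a pure relevance ordering.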

Clustering

Attributes of Successful Federated Search
- Powerful query language that takes advantage of publisher search capabilities
- Source selection optimizer that reduces unnecessary searches
- Search conductor that gets more results from sources bringing back good results
- A tool that highlights the best search results
- Caching of search results

Advice for Publishers
- Use good search engines with good relevance ranking
- Return 100 or more results at a time
- Return meta-data (author, journal, snippet) as part of result list
- Provide access to your content through XML Gateway or Web Services
- Speed up search time

Abe Lederman
301 N Guadalupe, Ste 201
Santa Fe, NM
Thank You!