Choosing a Search Engine Taly Sharon Thanks to Ariel Frank, Bar-Ilan University


82% Loyal to SE iProspect

Search Engines Diverging Looking at the organic or natural listings for more than 485,000 first page search results, the study found that: Dogpile

Experienced Searchers use More Search Engines HarvestDigital

General rules for choosing SEs
– Use "major" SEs that are both well-known and well-used (and that hopefully won’t be downgraded or disappear soon).
– Prefer SEs that employ both a huge index and a comprehensive directory (this gives better results, and you can switch between the two).
– Stick to SEs of established companies that treat search as their main business/expertise.

Google Trends

Criteria for Choosing SEs
1. Database (different)
2. Ranking algorithm
3. Query options (site:, intitle:, inurl:, …)
4. Added values/features (clustering, define, NLP, …)
5. User Interface (UI)

Who Powers Whom?
Major distinct databases:
– Google
– Yahoo
– MSN
– Ask
– Wisenut, Exalead, etc.
The rest of the search engines use the same databases as the engines above, with different retrieval algorithms.

SE Database Facts Summary
– Google is fed from DMOZ
– Google feeds Excite, HotBot, iWon, Netscape and AOL search
– Yahoo! is fed from Inktomi and feeds Excite
– Ask is fed from Google and DMOZ
– Directories
  – Yahoo! is not fed from DMOZ
  – But almost everyone else is!

Google

Yahoo!

Ask

Directories

Why use Google? (1)
– Biggest, most comprehensive coverage:
  – ~8 billion Web pages (but ~1 billion of them aren’t full-text searchable!)
  – ~11 billion documents, if you count images and newsgroup postings.
– Fastest around.
– Most relevant results (voted most outstanding SE 3 times by Search Engine Watch readers).
– Provides good directory results (ranks DMOZ Open Directory results with PageRank).

Why use Google? (2)
– Has the thinnest/cleanest interface around.
– But provides a rich set of advanced search features/tools (/hacks).
– Finds similar/related pages.
– Supports Web page translation.
– Cached (HTML) copy of pages (great for a quick view of DOCs/PDFs and for 404s).
– Google Alert – use of push technology.

Share of Searches: July 2006

Why use Yahoo! search? (1)
– Has a brand new Yahoo! search that gives highly relevant Web results (at Google level).
– Still supports an expert, human-compiled directory (dir.yahoo.com).
– Also has a thin interface (search.yahoo.com) while providing a rich set of advanced search features/shortcuts.

Why use Yahoo! search? (2)
– For legacy reasons (oldest of all directories).
– Puts particular emphasis on personalization and customization (my.yahoo.com).
– Had enough of Googlism.
– It devoured, and uses know-how from, Overture (Inktomi, AltaVista, AllTheWeb, etc.).
– Has many specialty SEs – better than Google.

Hidden Gem Yahoo! Search Subscriptions

Google in 1998 – looking up at Yahoo!? Source: Internet archive’s Wayback machine

Search Relevancy

6 Reasons to use Yahoo!
1. Long queries (>32 terms, >256 chars)
   – Especially useful when using OR
2. Search for XML/RSS
3. Better link: search
   – More extensive results
   – More options (linkdomain:, linksite:)
4. Mix syntax
   – link: site:gov
5. Google is the most exposed to spam.
6. Some special services.
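The query-length point can be checked before a query is submitted. The sketch below is only an illustration: it takes the 32-term and 256-character thresholds from the slide, and it assumes that terms are counted by splitting on whitespace (so each OR counts as a term), which may not match how a given engine counts. The function names and the example.org domains are hypothetical.

```python
def query_size(query):
    """Return (term count, character count) for a query string.

    Assumption: terms are whitespace-separated tokens, so operators
    such as OR are counted as terms too.
    """
    return len(query.split()), len(query)


def exceeds_limits(query, max_terms=32, max_chars=256):
    """True if the query is longer than either threshold from the slide."""
    terms, chars = query_size(query)
    return terms > max_terms or chars > max_chars


# Hypothetical long query: one phrase restricted to 20 made-up domains.
domains = [f"example{i}.org" for i in range(20)]
long_query = '"annual report" (' + " OR ".join(f"site:{d}" for d in domains) + ")"

print(query_size("invisible web"), exceeds_limits("invisible web"))
print(query_size(long_query), exceeds_limits(long_query))
```

A query that exceeds the thresholds is a candidate for an engine that accepts long queries, or for splitting into several shorter queries.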

Why use MSN?
– Relatively new (recently re-written).
– One of the 3 major DBs.
– Direct answers -- from the Microsoft Encarta® encyclopedia.
– Direct actions -- to MSN channels.
1. When you need more results
2. When you need some unique query options:
   – prefer:
   – ip:
   – contains: (music contains:wma)
   – feed:, hasfeed:
3. When you need UI options (especially sorting):
   – Date
   – Popularity
   – Exact/approximate match

Why use Ask?
– Small index but interesting results.
– Provides ExpertRank – “subject specific” ranking of pages.
– Provides a Natural Language interface (uses NLP).
– Refine: suggests related searches.
– Comments: the name AskJeeves changed to Ask; Teoma is gone, together with the Resources (results, refine, resources).

Why Use Ask? Query suggestions/fill Q&A engine Smart Answers Query refinements Different results

Ask

Why Use Exalead
– New search engine
– Another stand-alone database
– Advanced search features:
  – Words starting with
  – Words at proximity
  – Search method: exact search/automatic word stemming/phonetic search/approximate spelling
  – Document sorting: relevance/oldest/newest
  – Modification date: simply write the date!
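To make the "approximate spelling" search method concrete, here is a minimal sketch of fuzzy term matching. It uses Python's standard `difflib` module and is not a description of how Exalead or any other engine implements the feature. The vocabulary list and the `suggest_spelling` helper are hypothetical.

```python
import difflib


def suggest_spelling(term, vocabulary, cutoff=0.6):
    """Return up to three vocabulary entries that look similar to `term`.

    Matching is case-insensitive; `cutoff` is the minimum similarity
    ratio (0..1) that difflib requires for a candidate to be returned.
    """
    candidates = [word.lower() for word in vocabulary]
    return difflib.get_close_matches(term.lower(), candidates, n=3, cutoff=cutoff)


# Hypothetical index vocabulary for illustration only.
vocabulary = ["Streep", "Street", "Hepburn"]

print(suggest_spelling("Stripp", vocabulary))
```

An engine with this kind of matching can still return useful results when the searcher is unsure how a name is spelled.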

Why use Exalead
– Preferences – instant page translation
– Filters/refinements:
  – Related terms
  – Related categories (DMOZ)
  – Web site location
  – Document type (PDF/TXT/DOC/PPT)
  – Result presentation (documents/documents+thumbnails/thumbnails)
  – Preview

A9
– Great UI
– Also searches books
– Visual Yellow Pages and street photos
– Leader in innovative services
  – Search history
  – Split view
– Good for obscure topics (because it searches books)

Some Notes
– A9 – customization, special features
– AOL – good for beginners
– LookSmart – FindArticles
– Lycos – what people are talking about (people, forums)
– MSN – generally fewer results, but growing
– Yahoo – MM, local/people searches and more
– Gigablast – site: search over up to 500 sites (but small index)!

GigaBlast

Practical recommendations
Two major SEs (usually use both):
1. Google (GG)
2. Yahoo! search (YH) or MSN
One Meta-SE (as a backup):
3. Dogpile or Clusty
Don’t forget the invisible web!
Note: Choices are not Hebrew oriented.
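Using two engines side by side amounts to merging two ranked result lists and noticing how little they overlap. The sketch below shows one simple way to do that by hand: round-robin interleaving with de-duplication, plus a Jaccard overlap score. It is an illustration only, not how Dogpile, Clusty or any other meta-search engine works. The URLs are made up and the URL normalization is deliberately crude.

```python
from itertools import zip_longest


def normalize(url):
    """Crude URL key: drop scheme, leading 'www.' and trailing slash.

    Assumption: lowercasing the whole URL is acceptable here, although
    real URL paths can be case-sensitive.
    """
    url = url.strip().lower()
    for prefix in ("https://", "http://"):
        if url.startswith(prefix):
            url = url[len(prefix):]
            break
    if url.startswith("www."):
        url = url[4:]
    return url.rstrip("/")


def merge_results(*ranked_lists):
    """Interleave ranked lists rank by rank, keeping the first copy of each URL."""
    seen, merged = set(), []
    for tier in zip_longest(*ranked_lists):
        for url in tier:
            if url is None:
                continue
            key = normalize(url)
            if key not in seen:
                seen.add(key)
                merged.append(url)
    return merged


def overlap(results_a, results_b):
    """Jaccard overlap between two result lists (0.0 = disjoint, 1.0 = identical)."""
    a = {normalize(u) for u in results_a}
    b = {normalize(u) for u in results_b}
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical first-page results from two engines.
engine_a = ["http://www.example.org/", "http://example.com/a", "http://example.net"]
engine_b = ["https://example.com/a/", "http://example.org", "http://example.edu/x"]

print(merge_results(engine_a, engine_b))
print(overlap(engine_a, engine_b))
```

A low overlap score is the practical argument for consulting a second engine: each one surfaces pages the other does not.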

Hebrew Search Engines?
– MSN, Google, Yahoo
– Clusty (MSE)
– Netex (directory), Walla, Nana
– Morfix
– Start, a
– Many more (try a Hebrew query in your favorite SE)

Bibliography/Credits
– searchenginewatch.com
– searchengineshowdown.com/
– ngine.html
– infopeople.org/search

Exercises
1. Find the page this was quoted from: "EcoOcean cooperates with the Heschel Center in educating"
2. Find a page that has a Flash communication demo.
3. Who provides the search feed to Netscape?
4. Search for pages, books, and pictures about the invisible web.
5. Find a picture of ABC Pizza House in Cambridge, MA.
6. Find information about “Meryl Stripp”. You are not sure of the correct spelling (try with the given spelling). Which search engine is useful here?
7. You get a list of 10 websites you want to run a query on. Which search engine can run them together?
   – Example: "taly sharon" (site:acm.org OR site:dblp.com OR site:googleguide.co.il OR site:googleguide.com OR site:sharon-it.com OR site:ifla.org OR site:media.mit.edu OR site:technion.ac.il OR site:netanya.ac.il OR site:biu.ac.il)
8. What if you had 400 websites?
9. What is the west wing? Suggest options to narrow this search.
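For exercises 7 and 8, a multi-site query is easier to build by script than by hand. The sketch below generates the query from a list of domains, so every domain gets its `site:` prefix, and splits a long list into batches. The helper names and the `max_sites_per_query` parameter are hypothetical. The real limit depends on the engine and has to be looked up or found by trial.

```python
def build_site_query(phrase, domains):
    """Build '"phrase" (site:a OR site:b ...)' for the given domains."""
    sites = " OR ".join(f"site:{domain}" for domain in domains)
    return f'"{phrase}" ({sites})'


def build_batched_queries(phrase, domains, max_sites_per_query):
    """Split the domain list into batches and build one query per batch."""
    return [
        build_site_query(phrase, domains[start:start + max_sites_per_query])
        for start in range(0, len(domains), max_sites_per_query)
    ]


# The ten sites from exercise 7.
sites = [
    "acm.org", "dblp.com", "googleguide.co.il", "googleguide.com",
    "sharon-it.com", "ifla.org", "media.mit.edu", "technion.ac.il",
    "netanya.ac.il", "biu.ac.il",
]
print(build_site_query("taly sharon", sites))

# Exercise 8: 400 made-up sites, with an assumed limit of 10 sites per query.
many_sites = [f"site{i}.example.org" for i in range(400)]
queries = build_batched_queries("taly sharon", many_sites, max_sites_per_query=10)
print(len(queries), "queries needed")
```

With an engine that accepts hundreds of `site:` restrictions in one query, a single batch is enough. Otherwise the batches have to be run one by one and their results combined.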