Search Engine Comparisons By: Thomie Ventura. Search Engines Today, much, but not all, of the work we do revolves around the web Today, much, but not.

Slides:



Advertisements
Similar presentations
Getting Your Web Site Found. Meta Tags Description Tag This allows you to influence the description of your page with the web crawlers.
Advertisements

Natural Language Processing WEB SEARCH ENGINES August, 2002.
Internet Research Internet Applications. The Internet is not the Web Because of the great popularity of the World Wide Web, people think the Internet.
The Invisible Web Definition Searching. The Invisible Web Also called: deep content hidden internet dark matter.
Exploring the Deep Web Brunvand, Amy, Kate Holvoet, Peter Kraus, and David Morrison. "Exploring the Deep Web." PPT--Download University of Utah.
Exploring the Academic Invisible Web Das wissenschaftliche Invisible Web erkunden Dr. Dirk Lewandowski Heinrich-Heine-Universität Düsseldorf, Information.
The process of increasing the amount of visitors to a website by ranking high in the search results of a search engine.
1 Pertemuan 19 Searching Mechanisms Matakuliah: M0284/Teknologi & Infrastruktur E-Business Tahun: 2005 Versi: >
Google & Beyond Expert Internet Searching Tools & Strategies.
Searching The Web Search Engines are computer programs (variously called robots, crawlers, spiders, worms) that automatically visit Web sites and, starting.
1 ETT 429 Spring 2007 Microsoft Publisher II. 2 World Wide Web Terminology Internet Web pages Browsers Search Engines.
Searching and Researching the World Wide: Emphasis on Christian Websites Developed from the book: Searching and Researching on the Internet and World Wide.
Introduction Web Development II 5 th February. Introduction to Web Development Search engines Discussion boards, bulletin boards, other online collaboration.
 Popularity of browsers:  Popularity of search.
Internet Research Search Engines & Subject Directories.
 Search engines are programs that search documents for specified keywords and returns a list of the documents where the keywords were found.  A search.
SEARCHING ON THE INTERNET
IDK0040 Võrgurakendused I Building a site: Publicising Deniss Kumlander.
SEARCH ENGINE By Ms. Preeti Patel Lecturer School of Library and Information Science DAVV, Indore E mail:
OPTIMISING AND PROMOTING YOUR WEBSITE Michael Heraghty, Heraghty Internet Consultants
Search Engine Optimization (SEO) Week 07 Dynamic Web TCNJ Jean Chu.
Cutting Through the Clutter Searching the Web. There is a wealth of information waiting for you on the internet, if you know the right tools to use and.
Internet Research, Second Edition- Illustrated 1 Internet Research: Unit A Searching the Internet Effectively.
1 Web Developer Foundations: Using XHTML Chapter 11 Web Page Promotion Concepts.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Chapter 6 The World Wide Web. Web Pages Each page is an interactive multimedia publication It can include: text, graphics, music and videos Pages are.
XHTML Introductory1 Linking and Publishing Basic Web Pages Chapter 3.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
 Popularity of browsers:  Popularity of search.
Searching the Internet CSCI-N 100 Department of Computer and Information Science.
The Internet : Exploration, Evaluation, and Elaboration presented by Kathy Schrock.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Overview What is a Web search engine History Popular Web search engines How Web search engines work Problems.
MG 25th March '02 Searching the web TABLE OF CONTENTS Page(s) 1-4 Description and explanation of search engines on the web. Table 1 Engine Comparison.
SEO  What is it?  Seo is a collection of techniques targeted towards increasing the presence of a website on a search engine.
Validating, Promoting, & Publishing Your Web Site Writing For the Web The Internet Writer’s Handbook 2/e.
Do's and don'ts to improve your site's ranking … Presentation by:
Chapter 9 Publishing and Maintaining Your Site. 2 Principles of Web Design Chapter 9 Objectives Understand the features of Internet Service Providers.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
The Internet Do you really know what is out there?
Internet Research Tips Daniel Fack. Internet Research Tips The internet is a self publishing medium. It must be be analyzed for appropriateness of research.
Search Engines Reyhaneh Salkhi Outline What is a search engine? How do search engines work? Which search engines are most useful and efficient? How can.
Search Engines1 Searching the Web Web is vast. Information is scattered around and changing fast. Anyone can publish on the web. Two issues web users have.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Searching Tutorial By: Lola L. Introduction:  When you are using a topic, you might want to use “keyword topics.” Using this might help you find better.
The Savvy Cyber Teacher ® Using the Internet Effectively in the K-12 Classroom 1 Copyright © 2003 Stevens Institute of Technology, CIESE, All Rights Reserved.
1 Internet Research Third Edition Unit A Searching the Internet Effectively.
Search Tools and Search Engines Searching for Information and common found internet file types.
 Network  A _____ of computers that can _________ w/ each other  Examples of hardware  ______________ & communication lines  Internet  Hardware.
Uncovering the Invisible Web. Back in the day… Students used to research using resources hand-picked by librarians and teachers. These materials were.
Internet Research – Illustrated, Fourth Edition Unit A.
SEO Friendly Website Building a visually stunning website is not enough to ensure any success for your online presence.
SEO for Google in Hello I'm Dave Taylor from Webmedia.
Website Design, Development and Maintenance ONLY TAKE DOWN NOTES ON INDICATED SLIDES.
By R. O. Nanthini and R. Jayakumar.  tools used on the web to find the required information  Akeredolu officially described the Web as “a wide- area.
SEARCH ENGINES The World Wide Web contains a wealth of information, so much so that without search facilities it could be impossible to find what you were.
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
Search Engine Mortality & New Directions Greg R. Notess Internet Librarian International London 28 March 2001.
Learning how to search on the web “If all you ever do is all you’ve ever done, then all you’ll ever get is all you’ve ever got.” (author unknown)
SEO FOR REDESIGN Eric Werner. DON’T WAIT “ We are going to wait until the redesign is complete to work on SEO” No problem unless any of the following.
Frompo is a Next Generation Curated Search Engine. Frompo has a community of users who come together and curate search results to help improve.
Session 5: How Search Engines Work. Focusing Questions How do search engines work? Is one search engine better than another?
SEMINAR ON INTERNET SEARCHING PRESENTED BY:- AVIPSA PUROHIT REGD NO GUIDED BY:- Lect. ANANYA MISHRA.
SEARCH ENGINE by: by: B.Anudeep B.Anudeep Y5CS016 Y5CS016.
Chapter Five Web Search Engines
Search Engines & Subject Directories
Search Engine Mortality & New Directions
Search Engines & Subject Directories
Search Engines & Subject Directories
Presentation transcript:

Search Engine Comparisons By: Thomie Ventura

Search Engines Today, much, but not all, of the work we do revolves around the web Today, much, but not all, of the work we do revolves around the web Internet is accessible to almost anyone Internet is accessible to almost anyone Impact on businesses, schools, professionals, home users Impact on businesses, schools, professionals, home users Web is changing every day, but everything is still not ACCESSIBLE Web is changing every day, but everything is still not ACCESSIBLE

FTP Servers Only way of sharing files up to 1990 Only way of sharing files up to 1990 FTP Servers and FTP Clients FTP Servers and FTP Clients Down Side Down Side Servers were mostly known through word of mouth Servers were mostly known through word of mouth Not everyone was setting up their servers Not everyone was setting up their servers

Grandfather, Grandmother, Mother Archie ( Grandfather) Archie ( Grandfather) Used FTP file Servers Used FTP file Servers Veronica (Grandmother) Veronica (Grandmother) Used Gopher file Servers Used Gopher file Servers World Wide Web Wanderer (Mother) World Wide Web Wanderer (Mother) First Robot First Robot Caused Controversy Caused Controversy Are Robots a good or bad thing for the Internet? Are Robots a good or bad thing for the Internet?

“Web Search” What exactly does it mean? What exactly does it mean? Involve tools ? Involve tools ? Accessing proprietary databases such as or Accessing proprietary databases such as or We’ll focus on “web search” as an open web source, and look at a searchers point of view We’ll focus on “web search” as an open web source, and look at a searchers point of view

Difficulty Coping Volume and Speed of the web and Search Engines Volume and Speed of the web and Search Engines Something new happens each day Something new happens each day So many things to do, so little time to do it So many things to do, so little time to do it Dynamic nature of web searching (indexing new documents) Dynamic nature of web searching (indexing new documents) Staying up-to-date with traditional tools( also undergo changes) Staying up-to-date with traditional tools( also undergo changes) Other random issues that arise everyday Other random issues that arise everyday

Will an “open web” search engine always have my answers? Questions that should arise about searching the web Questions that should arise about searching the web How long did it take to get it? How long did it take to get it? What is the database or search engine? What is the database or search engine? What kinds of questions will it help me answer? What kinds of questions will it help me answer? Open web will not always give me the answer Open web will not always give me the answer What can it be used for? What can it be used for?

Quality of Information Anyone can become a publisher Anyone can become a publisher Evaluating content is crucial Evaluating content is crucial Reputation Reputation Background Background Qualifications Qualifications Where did it come from? Where did it come from? What its purpose? What its purpose? Relevant to my topic? Relevant to my topic?

Limitations of General Web Search Tools Spiders don’t crawl in real-time Spiders don’t crawl in real-time Recency Recency Linked or Submitted Sites Linked or Submitted Sites If a website contains 1000 pages, does not mean Search Engines make all of them accessible If a website contains 1000 pages, does not mean Search Engines make all of them accessible

Invisible or Hidden Web resources Examples: Examples: Interacting resources, return “custom” sites Interacting resources, return “custom” sites Registration Registration Why is it hidden? Why is it hidden? Created on the fly Created on the fly Spiders don’t fill in registration forms Spiders don’t fill in registration forms “No-Robot” Tag “No-Robot” Tag

Hidden is not always bad Research and Effort Research and Effort Without proper tools, we can make large databases even larger Without proper tools, we can make large databases even larger Google Google Altavista Altavista Excite Excite Distributing Information Properly Distributing Information Properly

Specialized Focused and Site Specific Search Tools Necessary and Important Necessary and Important Hidden Web is out of reach of general purpose Search Engines Hidden Web is out of reach of general purpose Search Engines More Precision than Recall More Precision than Recall Examples: Examples: [ ksenglish/query.htm], [ ksenglish/query.htm], [ ksenglish/query.htm] [ ksenglish/query.htm]

Identifying and Collecting Specialized Engines Profusion Profusion [ [ [ Librarians Index Librarians Index Covers large amount of specialized and invisible web databases Covers large amount of specialized and invisible web databases [ [ [

Meta – Search Engines Major Disadvantages Major Disadvantages You get it all!! High Recall Low Precision You get it all!! High Recall Low Precision Basics of Search Engines used Basics of Search Engines used Send queries to “pay for placement” engines Send queries to “pay for placement” engines A good metasearch Engine A good metasearch Engine

Old Pages, GONE! Trying to find old pages? Trying to find old pages? Contact webmaster Contact webmaster Fortunately Fortunately Archiving Old Material Archiving Old Material Example: Example: [ [ [ ALexa Research ALexa Research [ [ [ carries over 18 terabytes of data covering some 5 million Web sites and some 1.9 billion pages carries over 18 terabytes of data covering some 5 million Web sites and some 1.9 billion pages

Search Engine Sizes This is a search engine size analysis as of December 11, 2001 Google Dominates

Sizes Over Time

Closer Look

Dealing with Coping Use the Search Engine Use the Search Engine Conduct research on a topic Conduct research on a topic This will get you familiar with search engine This will get you familiar with search engine You can see how results are displayed You can see how results are displayed Relevancy of returned documents Relevancy of returned documents Let you gather your own bookmarks Let you gather your own bookmarks

Understanding limitations What to do with these limitations? What to do with these limitations? Know limitations Know limitations Use more than one search engine Use more than one search engine Use “specialized” search engines that go deeper into a site to collect more information Use “specialized” search engines that go deeper into a site to collect more information Use “invisible web” resources Use “invisible web” resources Use web directories, and bookmark important sites Use web directories, and bookmark important sites

Ability to Search Multimedia Now Available, but still expanding Now Available, but still expanding Wait weeks now becomes instant Wait weeks now becomes instant search tools that provide access to video and audio material using a non-text mechanism to access the material ex: searching a specific background or type color search tools that provide access to video and audio material using a non-text mechanism to access the material ex: searching a specific background or type color Still image tools Still image tools Google, Altavista, and Fast, use text surrounding image Google, Altavista, and Fast, use text surrounding image

Become Aware of Multimedia Search Video Searches Video Searches Virage Virage TVeyes TVeyes ShadowTv ShadowTv Wordwave Wordwave SpeechBot (keyword search engine demo by Compaq, uses speech technology to create real-time transcripts) SpeechBot (keyword search engine demo by Compaq, uses speech technology to create real-time transcripts) Image Searches Image Searches Webseek (search or browse criteria in image) Webseek (search or browse criteria in image) Visoo( uses software that looks for words embedded in image Visoo( uses software that looks for words embedded in image

Making Old Pages Stay Long Term? Long Term? Offer comments ( suggest how material can be more accessible and searcheable, a great archive of content without the correct means of accessing it will be a hassle and is not great) Offer comments ( suggest how material can be more accessible and searcheable, a great archive of content without the correct means of accessing it will be a hassle and is not great) Short Term? Short Term? Take advanatage of Googles cache feature ( google crawls a site and makes a copy unless unauthorized, and puts it on server, if site is gone, the copy is in googles server, you must go to search results and next to URL go to “cached”, will not always be there, next time spider crawls site and it is missing it will not save onto server Take advanatage of Googles cache feature ( google crawls a site and makes a copy unless unauthorized, and puts it on server, if site is gone, the copy is in googles server, you must go to search results and next to URL go to “cached”, will not always be there, next time spider crawls site and it is missing it will not save onto server (lets you save web pages, and access them) (lets you save web pages, and access them)