How Search Engines Work: General Search Strategies. Dr. Dania Bilal, IS 587, SIS, Fall 2007.


Fun Quiz
- Take the search engine quiz located at /search_engine_quiz/blsearchenginequiz.htm
- Record the number of incorrect answers
- Share the results of the quiz with a classmate

How Search Engines Work
- They collect information from selected web sites
- They employ special software robots, called spiders, to crawl web pages
- Spiders build lists of the words found on web sites
  - When a spider is building its lists, it is web crawling
- Spiders store the lists in the engine's database
- The engine's indexing software builds an index of words
- Information is matched against query input and retrieved (processing algorithm)
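
The steps on this slide (word lists, an index, matching against a query) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the pages in `PAGES` are hypothetical stand-ins for crawled sites, and `build_index` and `search` are made-up helper names, not any engine's actual software.

```python
from collections import defaultdict

# Hypothetical pages standing in for crawled web sites.
PAGES = {
    "http://example.com/a": "search engines crawl the web",
    "http://example.com/b": "spiders build lists of words",
    "http://example.com/c": "engines match words against a query",
}


def build_index(pages):
    """Map each word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index


def search(index, query):
    """Return the URLs that contain every word of the query, sorted."""
    words = query.lower().split()
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)


INDEX = build_index(PAGES)
print(search(INDEX, "engines words"))
```

A real engine would also rank the matches; this sketch only retrieves them.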

How Spiders and Crawlers Work
- They begin with popular and heavily used web servers
- They begin with a popular site, collect the words on its pages, and follow every link found within the site
  - Spiders travel across pages and the most widely used portions of the Web
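
Starting at a popular site and following every link is, in effect, a graph traversal. The sketch below assumes a breadth-first order and uses a hypothetical in-memory link graph (`LINKS`) instead of real HTTP requests; the `crawl` function and the `max_pages` limit are illustrative choices, not something the slides specify.

```python
from collections import deque

# Hypothetical link graph: each page maps to the pages it links to.
LINKS = {
    "popular.example/home": ["popular.example/news", "popular.example/about"],
    "popular.example/news": ["other.example/story", "popular.example/home"],
    "popular.example/about": [],
    "other.example/story": ["popular.example/home"],
}


def crawl(seed, links, max_pages=100):
    """Visit pages breadth-first from the seed, following each link once."""
    visited = []
    seen = {seed}
    queue = deque([seed])
    while queue and len(visited) < max_pages:
        page = queue.popleft()
        visited.append(page)
        for target in links.get(page, []):
            if target not in seen:  # never queue the same page twice
                seen.add(target)
                queue.append(target)
    return visited


for page in crawl("popular.example/home", LINKS):
    print(page)
```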

How Spiders and Crawlers Work
- A dedicated server of URLs is built by a search engine company (e.g., Google) so that spiders collect information quickly
- More than one spider is used to crawl web pages at a time
  - Google uses 3-4 spiders and collects over 100 pages per second

How Spiders and Crawlers Work
- When no dedicated URL server is used, the search engine company relies on an ISP for the domain names (translated into addresses) to use for crawling the web
  - Delay in gathering information
  - Delay in updating information
  - Lack of control over URL addresses

Google Spider and How It Works
- A spider looks at the HTML, XML, or other coding used to build a web page and collects information from the meta tags
- It indexes words within the actual text of a page
- It indicates where the words were found (URL, title, headings, etc.)
- It disregards initial articles
- It disregards pages that should not be crawled or indexed
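
One way to picture a spider recording where each word was found is with Python's standard `html.parser` module. The sketch below is not Google's spider: the `PageIndexer` class, the sample `PAGE`, and the location labels are assumptions made for illustration. As a simplification it drops the articles "a", "an", and "the" wherever they occur, not only in initial position.

```python
from html.parser import HTMLParser

ARTICLES = {"a", "an", "the"}  # assumed stop list, dropped everywhere
HEADINGS = ("h1", "h2", "h3")

# Hypothetical page used only for this example.
PAGE = """<html><head><title>The Spider Guide</title>
<meta name="keywords" content="spider, crawler">
</head><body><h1>How a Spider Works</h1>
<p>The spider reads the page.</p></body></html>"""


class PageIndexer(HTMLParser):
    """Collect (word, location) pairs from the title, headings, meta tags, and body."""

    def __init__(self):
        super().__init__()
        self.location = "body"
        self.postings = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.location = "title"
        elif tag in HEADINGS:
            self.location = "heading"
        elif tag == "meta":
            attributes = dict(attrs)
            if attributes.get("name") in ("keywords", "description"):
                self._add(attributes.get("content") or "", "meta")

    def handle_endtag(self, tag):
        if tag == "title" or tag in HEADINGS:
            self.location = "body"

    def handle_data(self, data):
        self._add(data, self.location)

    def _add(self, text, location):
        for raw in text.lower().split():
            word = raw.strip(".,;:!?\"'()")
            if word and word not in ARTICLES:
                self.postings.append((word, location))


def index_page(html):
    """Parse one page and return its list of (word, location) pairs."""
    parser = PageIndexer()
    parser.feed(html)
    parser.close()
    return parser.postings


for word, location in index_page(PAGE):
    print(word, location)
```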

Google Spider and How It Works
- It uses the Robot Exclusion Protocol in disregarding pages
  - Implemented in the meta-tag section at the beginning of a web page
  - Tells a spider to leave the page alone: neither index the words on the page nor try to follow its links
- Source: Franklin, C. How Internet Search Engines Work. engine.htm
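
A spider that respects the meta-tag form of exclusion has to read the tag before it indexes anything. The sketch below assumes the conventional tag name `robots` and the values `noindex`, `nofollow`, and `none`; the class and function names are hypothetical, and site-wide `robots.txt` files are not handled here.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives of any <meta name="robots"> tag on a page."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        if (attributes.get("name") or "").lower() == "robots":
            content = attributes.get("content") or ""
            for part in content.split(","):
                if part.strip():
                    self.directives.add(part.strip().lower())


def robots_policy(html):
    """Return whether a spider may index the page and follow its links."""
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    found = parser.directives
    blocked = "none" in found  # "none" is shorthand for noindex plus nofollow
    return {
        "index": not (blocked or "noindex" in found),
        "follow": not (blocked or "nofollow" in found),
    }


page = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
print(robots_policy(page))
```

A page without the tag yields `index` and `follow` both true, so the spider proceeds as usual.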

How Search Engines Store Indexed Words
- The process varies among engines
- Words are stored with the number of times they appear on a page (posting)
- Weight is assigned to each word
  - Words appearing near the top of a page may have more weight than those appearing in subheadings, in links, in meta tags, in the title, etc.
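
Since the process varies among engines, the following is only a sketch of the idea: count how often each word occurs and add up a weight that depends on where it occurred. The numbers in `LOCATION_WEIGHTS` and the function name `score_postings` are invented for illustration and are not taken from any real engine.

```python
from collections import Counter

# Assumed weights by location; real engines choose their own values.
LOCATION_WEIGHTS = {"top": 3.0, "title": 2.0, "heading": 1.5, "body": 1.0}


def score_postings(postings):
    """Turn (word, location) pairs into {word: (count, total_weight)}."""
    counts = Counter()
    weights = Counter()
    for word, location in postings:
        counts[word] += 1
        # Unknown locations fall back to the plain body weight.
        weights[word] += LOCATION_WEIGHTS.get(location, 1.0)
    return {word: (counts[word], weights[word]) for word in counts}


sample = [("spider", "top"), ("spider", "body"), ("index", "body")]
for word, (count, weight) in score_postings(sample).items():
    print(word, count, weight)
```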

How Search Engines Store Indexed Words
- Information is encoded to save space
- Information is indexed
  - An index of words is built by the automatic indexer (indexing software)
  - A hash table is created with an assigned weight or value for each word indexed
  - Hashing allows for the even distribution of popular entries (e.g., letter M) with those that are less popular (e.g., letter X) for quick retrieval
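
To see why hashing helps, compare it with filing words by first letter, where the "M" drawer fills up and the "X" drawer stays nearly empty. A hash function spreads words over buckets regardless of their first letter. The small table below is a teaching sketch: the `WordTable` class, the bucket count, and the use of `zlib.crc32` as the hash function are all assumptions, not a description of any engine's storage.

```python
import zlib


class WordTable:
    """A tiny hash table that maps words to weights, using chained buckets."""

    def __init__(self, n_buckets=8):
        self.buckets = [[] for _ in range(n_buckets)]

    def _bucket(self, word):
        # crc32 gives the same value on every run, unlike Python's built-in
        # hash() for strings, which is randomized per process.
        number = zlib.crc32(word.encode("utf-8")) % len(self.buckets)
        return self.buckets[number]

    def put(self, word, weight):
        bucket = self._bucket(word)
        for i, (stored, _) in enumerate(bucket):
            if stored == word:
                bucket[i] = (word, weight)  # overwrite an existing entry
                return
        bucket.append((word, weight))

    def get(self, word, default=None):
        for stored, weight in self._bucket(word):
            if stored == word:
                return weight
        return default


table = WordTable()
for word, weight in [("map", 2.0), ("music", 1.5), ("movie", 1.0), ("xylophone", 3.0)]:
    table.put(word, weight)
print(table.get("music"), table.get("xylophone"), table.get("missing"))
```

Looking up a word only scans one bucket, which is what makes retrieval quick.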

Using General Directories
- Yahoo and its family
- Browsing the directory
  - Directory database
  - Small and human-selected and indexed
- Searching using keywords
  - Search database
  - Larger and non-selective database
  - Spider and machine indexing

Yahoo
- Yahoo.com
  - Works like a search engine rather than a directory
  - Searches the web
  - Exercise: search under my name and see how Yahoo processes the query while you are entering it
- Directory found under More or at

Yahoo Search Engine
- Search
  - Web
  - Images
  - Videos
  - Local information
  - Shopping
  - More…

Yahoo Advanced Search
- Advanced Search feature
  - Shown on screen after you perform a search, or by going directly to TF-8&p=dr+dania+bilal&fr=yfp-t-471
  - Lots of search features to explore

Yahoo Advanced Search Features
- Boolean
- Phrase
- Currency
- Domain
- File format
- Country
- Language
- Other

Yahoo Advanced Search Features: Exercise
- Perform a search on a topic of your choice
- Use Boolean equivalents
  - All the words = AND
  - The exact phrase = phrase; proximity search
  - Any of these words = OR
  - None of these words = NOT
- Choose part of page to search
- Choose a language other than English
- Report results in class
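
The Boolean equivalents in this exercise correspond to ordinary set operations on the documents that contain each word. The sketch below runs them over three hypothetical documents in `DOCS`; the function names are invented for the example and do not describe how Yahoo processes an advanced search.

```python
# Hypothetical documents, keyed by an id.
DOCS = {
    1: "search engines index the web",
    2: "web directories are selected by people",
    3: "search the web with boolean operators",
}


def tokens(text):
    return text.lower().split()


def docs_with(word):
    """Ids of the documents that contain the word."""
    return {doc_id for doc_id, text in DOCS.items() if word.lower() in tokens(text)}


def all_of(words):
    """All the words = AND (intersection)."""
    sets = [docs_with(word) for word in words]
    return set.intersection(*sets) if sets else set()


def any_of(words):
    """Any of these words = OR (union)."""
    return set().union(*(docs_with(word) for word in words))


def none_of(words):
    """None of these words = NOT (difference from the whole collection)."""
    return set(DOCS) - any_of(words)


def exact_phrase(phrase):
    """The exact phrase: the words must appear next to each other, in order."""
    wanted = tokens(phrase)
    if not wanted:
        return set()
    matches = set()
    for doc_id, text in DOCS.items():
        words = tokens(text)
        for start in range(len(words) - len(wanted) + 1):
            if words[start:start + len(wanted)] == wanted:
                matches.add(doc_id)
                break
    return matches


print(sorted(all_of(["search", "web"])))
print(sorted(any_of(["directories", "boolean"])))
print(sorted(none_of(["search"])))
print(sorted(exact_phrase("the web")))
```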

Yahoo Search Services
For searching specific content areas, such as:
- Web Search: find anything from across the Web
- Answers: ask questions and get answers from real people
- Audio Search: find over 50 million audio files from across the Web
- Creative Commons Search: find Creative Commons content that you can share or re-use in your own works
- Directory Search: search or browse Yahoo!'s categorized guide to the Web
- Image Search: find over 1.6 billion photos and illustrations from all over the Web
- Job Search: search for jobs, post your resume, and more on Yahoo! HotJobs
- Local: find everything in your area, from dry cleaners to day spas
- Maps: find maps and driving directions for anywhere you want to go
- Mobile Search: find whatever, wherever you are
- My Web (Beta): the newest way to save, share, and organize any page you want on the Web
- News Search: search for news stories and related photos, videos, and audio clips

Yahoo Next
- Cutting-edge technology at Yahoo
  - Blogs, Web 2.0, use of alltheweb, Yahoo Maps, podcasts, audio, and all other features that are in beta testing

Yahoo Preferences
- Customize Yahoo to fit your needs
- Go to Preferences from the Web search page
- Edit preferences based on your needs
- Edited preferences are saved in the browser on your desktop

General Search Strategies in Search Engines

Strategies
- Boolean
- Boolean equivalents
- Proximity and phrase searching
- Searching within a field
- Search limits

Yahoo Search Strategies
- Explore Yahoo's help page
- Read the Search Tips
- Read the search limit parameters, such as
  - intitle:
  - url:
  - inurl:
- Read how to use Boolean equivalents and other search parameters

General Search Engines Besides Yahoo Search

Engines and Information Need
- Several general search engines are available on the Web
- Select the engine(s) that best fit your need
- Visit the Web Search Guide for the latest information: engines/General_AllPurpose_Search_Engines.htm

Hands-on Activity
- Browse the list of general search engines in the Web Search Guide
- Explore 4 of the engines listed
  - Wisenut, Snap.com, Lycos, Exalead
  - Search under my name in each engine
  - Compare the results by viewing the first two pages retrieved
  - How many overlaps were found among the engines?
  - How many unique results were found in each engine?
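
If you record the result URLs from each engine, the overlap and uniqueness questions reduce to set arithmetic. The sketch below is one possible way to tabulate them; the engine labels and URLs in `RESULTS` are placeholders rather than real search results, and `compare_results` is a hypothetical helper.

```python
# Placeholder data: replace with the URLs you actually recorded.
RESULTS = {
    "engine_a": ["http://example.org/1", "http://example.org/2", "http://example.org/3"],
    "engine_b": ["http://example.org/2", "http://example.org/3", "http://example.org/4"],
    "engine_c": ["http://example.org/3", "http://example.org/5"],
}


def compare_results(results_by_engine):
    """Return (URLs found by every engine, {engine: URLs found only there})."""
    sets = {engine: set(urls) for engine, urls in results_by_engine.items()}
    shared_by_all = set.intersection(*sets.values()) if sets else set()
    unique = {}
    for engine, urls in sets.items():
        others = set().union(*(s for name, s in sets.items() if name != engine))
        unique[engine] = urls - others
    return shared_by_all, unique


shared, unique = compare_results(RESULTS)
print("Found by every engine:", sorted(shared))
for engine, urls in unique.items():
    print(engine, "only:", sorted(urls))
```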

Specialized Search Engines
- Web Search Guide has a listing of specialized search engines
- The web companion to the textbook, chapter 3, describes a variety of specialized engines
- Explore chapter 3 and familiarize yourself with the engines described

Hands-on Activity
- Find the answer or relevant information for these two queries using an appropriate specialized search engine:
  - Do squirrels hibernate?
  - Find me a list of foreign-owned companies based in the U.S., organized by state.