Search Engines Jan Damsgaard Dept. of Informatics Copenhagen Business School


EBUSS Jan Damsgaard, 2004

Introduction
• Finding relevant information on the web is a major problem
• Size, growth, and the lack of a universal semantic organization are the major impediments
• Two major strategies:
  1. Improve users' search capability by using raw computer power: search engines
  2. Help organize user-relevant information into meaningful categories and bundles of services: portals

Definitions
• Search engine
  – Specific information retrieval software that returns URLs and descriptions of web pages as results
• Portal
  – Site that serves as a major starting point for users when they connect to the web; portals combine directories, services, search capabilities, and personalization

Search Engines
• Technical and business solutions that provide these services on a mass scale are important internet phenomena for two reasons:
  – 1) They obtain immense hit rates and are therefore major points of origin for any internet activity
  – 2) They are the most important means of channeling user search and retrieval
  – They are therefore strategically important, as reflected in the market valuations of search engine companies

FDIM (top ten): msn.dk, dr.dk, krak.dk, tv2.dk, eniro.dk, ekstrabladet.dk, ofir.dk, tdconline.dk, bt.dk, sol.dk
netdoktor.dk (26)

Look at the stickiness
Top 10 sites in November 2000 in terms of minutes spent per month

Where Do Search Engines Develop Market Value?
• Market recognition, leading to
  – popular use and adoption
  – selling ad impressions
  – long-term contracts for search engine functionality
• Market assessment of real options associated with the recognition of the tool in the marketplace
  – future value-added alliances and spin-offs

Search engine basics
• Basic information retrieval techniques
• Market trends and capabilities
• Awareness of popular assessment metrics for search engine performance
• Search engine business models

How Search Engines Work
• Three components:
  – spider or link crawler: software agent
  – index or catalog: database of content
  – search engine software or combined meta-search engine
• Require significant hardware horsepower, server connectivity, and database capabilities
• If your pages are not linked from elsewhere, you submit your links to the engine yourself
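
The three components can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the in-memory PAGES dictionary stands in for the web, the example URLs are hypothetical, and the function names crawl, build_index, and search are not taken from the slides or from any real engine.

```python
from collections import defaultdict, deque

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
PAGES = {
    "a.example/home": ("search engines index the web", ["a.example/about", "b.example/"]),
    "a.example/about": ("portals combine directories and search", []),
    "b.example/": ("crawler software follows links", ["a.example/home"]),
    "c.example/": ("unlinked page about search", []),
}

def crawl(seeds):
    """Spider: follow links breadth-first, starting from the seed URLs."""
    seen, queue = set(), deque(seeds)
    while queue:
        url = queue.popleft()
        if url in seen or url not in PAGES:
            continue
        seen.add(url)
        queue.extend(PAGES[url][1])
    return seen

def build_index(urls):
    """Index: map each word to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url in urls:
        for word in PAGES[url][0].lower().split():
            index[word].add(url)
    return index

def search(index, query):
    """Search software: return URLs that contain every query word."""
    sets = [index.get(word, set()) for word in query.lower().split()]
    return sorted(set.intersection(*sets)) if sets else []

crawled = crawl(["a.example/home"])
index = build_index(crawled)
hits = search(index, "search")
```

In this toy data, c.example/ is never reached because no crawled page links to it; passing it as an extra seed to crawl plays the role of submitting a link by hand.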

How do search engines work
• Users enter keywords into text fields
• Critical: the choice of keywords, the ways they can be combined, and how the search engine exploits the results
• Multilingual support
• Another issue is how the engine organizes search results

The most popular search engines

Search Engine   Total from Dec   Total from March 2002   Total from Aug
Google          9,732            8,371                   6,567
AlltheWeb       6,757            4,388                   4,969
AltaVista       5,419            3,432                   3,112
WiseNut         4,664            5,009                   4,587
HotBot          3,680            2,869                   3,277
MSN Search      3,267            2,523                   3,005
Teoma           3,259            1,839                   2,219
NLResearch      2,352            3,610                   3,321
Gigablast       2,352            NA

Popularity over time
• March 2002: Google, WiseNut, AlltheWeb
• August 2001: Google, Fast, WiseNut
• April 2001: Google, Fast, MSN (Inktomi)
• Oct. 2000: Fast, Google, Northern Light
• July 2000: iWon, Google, AltaVista
• April 2000: Fast, AltaVista, Northern Light
• Feb. 2000: Fast, Northern Light, AltaVista
• Jan. 2000: Fast, Northern Light, AltaVista
• Nov. 1999: Northern Light, Fast, AltaVista
• Sept. 1999: Fast, Northern Light, AltaVista
• Aug. 1999: Fast, Northern Light, AltaVista
• May 1999: Northern Light, AltaVista, Anzwers
• March 1999: Northern Light, AltaVista, HotBot
• January 1999: Northern Light, AltaVista, HotBot
• August 1998: AltaVista, Northern Light, HotBot
• May 1998: AltaVista, HotBot, Northern Light
• February 1998: HotBot, AltaVista, Northern Light
• October 1997: AltaVista, HotBot, Northern Light
• September 1997: Northern Light, Excite, HotBot
• June 1997: HotBot, AltaVista, Infoseek
• October 1996: HotBot, Excite, AltaVista

Also specific services
• E.g. Google provides
  – Find PDF files
  – Stock quotes
  – Cached links
  – Similar pages
  – Who links to you
  – Specific site
  – Dictionary definitions
  – Find maps

Major design issues: completeness and relevance
• The set of relevant replies
• The set of obtained results
• The larger the overlap, the better in terms of completeness
• The smaller the set of non-relevant replies, the more relevant the search
• How to organize the results for fast reviewing
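
The two measures can be made concrete with a small Python sketch. It assumes that "completeness" corresponds to what information retrieval usually calls recall and "relevance" to precision; that mapping, the function name, and the page identifiers are illustrative assumptions, not part of the slides.

```python
def completeness_and_relevance(relevant, obtained):
    """Return (recall, precision) for two sets of page identifiers.

    recall    = share of the relevant replies that were actually obtained
    precision = share of the obtained results that are relevant
    """
    overlap = relevant & obtained
    recall = len(overlap) / len(relevant) if relevant else 0.0
    precision = len(overlap) / len(obtained) if obtained else 0.0
    return recall, precision

# Hypothetical example: four relevant pages, three results returned.
relevant = {"p1", "p2", "p3", "p4"}
obtained = {"p3", "p4", "p5"}
recall, precision = completeness_and_relevance(relevant, obtained)
```

Here the overlap is two pages, so half of the relevant pages were found and two of the three returned results are relevant.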

Page Ranking for Relevance
• Biased or unbiased by the search engine?
• The size of the search space (e.g. Google currently addresses 1,346,966,000 pages)
• Use of keywords: in the title, in meta-tag information in the HTML code, or near the top of the page
• Use of other facilities such as semantic nets or reliability indices (e.g. Google uses page ranks and filtering)
• Daily, weekly, or monthly refresh by the web crawler software
• For an analysis see
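
As a rough illustration of link-based page ranks, the sketch below runs the simplified textbook power iteration over a tiny link graph. It is not a description of how Google or any other engine computes its ranking; the damping value of 0.85, the iteration count, the handling of pages without outgoing links, and the four-page graph are all assumptions chosen for the example.

```python
def page_rank(links, damping=0.85, iterations=50):
    """Simplified PageRank: links maps each page to the pages it links to."""
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page receives a small base share regardless of links.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outlinks in links.items():
            # A page without outgoing links spreads its rank over all pages.
            targets = outlinks or pages
            share = damping * rank[page] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank

# Hypothetical link graph: A, B and D all link to C; C links back to A.
LINKS = {"A": ["C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = page_rank(LINKS)
best = max(ranks, key=ranks.get)
```

In this graph, C collects the most incoming links and ends up with the highest score, and A outranks B and D because it is the only page C links to.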

Special features of search engines
• Multilingual searches
• Natural language interfaces
• Image searches
• Agents (specific crawlers and service providers, news agents, shopping and trading agents)

Search Assistance Features
• Phrase searching
  – Finds the terms you enter into the search box as a phrase; the results tell you whether full or partial matches were found
• Stemming
  – Ability of the search engine to search for variations of a word based on its stem
    ▪ Entering "swim" might also find "swims" and maybe "swimming", depending on the search engine; stemming matters more in some other languages
    ▪ Some search engines have stemming switched on by default
• Clustering
  – Allows only one page per site to be represented in the results
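
Stemming and clustering can be sketched in a few lines of Python. The stemmer below is a deliberately naive suffix stripper written for this example, not a real stemming algorithm, and it only covers the "swim" case from the slide; the clustering function assumes that "site" means the host part of the URL. The function names and URLs are hypothetical.

```python
from urllib.parse import urlparse

def naive_stem(word):
    """Very rough stemmer: strips "ing" or a plural "s" from a word."""
    word = word.lower()
    if word.endswith("ing") and len(word) > 5:
        word = word[:-3]
        # Collapse a doubled final letter: "swimm" -> "swim".
        if len(word) >= 2 and word[-1] == word[-2]:
            word = word[:-1]
    elif word.endswith("s") and not word.endswith("ss") and len(word) > 3:
        word = word[:-1]
    return word

def cluster_by_site(urls):
    """Keep only the first result from each host, preserving result order."""
    seen_hosts, kept = set(), []
    for url in urls:
        host = urlparse(url).netloc
        if host not in seen_hosts:
            seen_hosts.add(host)
            kept.append(url)
    return kept

stems = [naive_stem(w) for w in ("swim", "swims", "swimming")]
results = ["http://a.example/1", "http://a.example/2", "http://b.example/x"]
clustered = cluster_by_site(results)
```

All three forms of "swim" reduce to the same stem, so a query for one would match the others, and the second a.example page is dropped from the clustered result list.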

Conclusions
• Search engines are key elements of Internet business
• The next wave will integrate new interfaces and new access channels (digital TV, wireless)
• Mass-scale business whose value lies in the installed base