Information Retrieval and Web Design

Slides:



Advertisements
Similar presentations
INFO624 - Week 2 Models of Information Retrieval Dr. Xia Lin Associate Professor College of Information Science and Technology Drexel University.
Advertisements

Natural Language Processing WEB SEARCH ENGINES August, 2002.
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
Mastering the Internet, XHTML, and JavaScript Chapter 7 Searching the Internet.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Searching The Web Search Engines are computer programs (variously called robots, crawlers, spiders, worms) that automatically visit Web sites and, starting.
1 ETT 429 Spring 2007 Microsoft Publisher II. 2 World Wide Web Terminology Internet Web pages Browsers Search Engines.
Introduction Web Development II 5 th February. Introduction to Web Development Search engines Discussion boards, bulletin boards, other online collaboration.
What is a search engine? A program that indexes documents, then attempts to match documents relevant to a user's search requests. The term search engine.
Internet Research Search Engines & Subject Directories.
 Search engines are programs that search documents for specified keywords and returns a list of the documents where the keywords were found.  A search.
What’s The Difference??  Subject Directory  Search Engine  Deep Web Search.
Library 10 – Information Competency Search Engines.
SEARCH ENGINE By Ms. Preeti Patel Lecturer School of Library and Information Science DAVV, Indore E mail:
1 Web Developer Foundations: Using XHTML Chapter 11 Web Page Promotion Concepts.
1 Web Developer & Design Foundations with XHTML Chapter 13 Key Concepts.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Courtney Forsmann IT Help Desk Manager Lewis-Clark State College October 1, 2014.
Ihr Logo Chapter 7 Web Content Mining DSCI 4520/5240 Dr. Nick Evangelopoulos Xxxxxxxx.
Downloading defined: Downloading is the process of copying a file (such as a game or utility) from one computer to another across the internet. When you.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Search Engine Interfaces search engine modus operandi.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Do's and don'ts to improve your site's ranking … Presentation by:
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
The Internet 8th Edition Tutorial 4 Searching the Web.
McLean HIGHER COMPUTER NETWORKING Lesson 7 Search engines Description of search engine methods.
Searching Tutorial By: Lola L. Introduction:  When you are using a topic, you might want to use “keyword topics.” Using this might help you find better.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Data Mining for Web Intelligence Presentation by Julia Erdman.
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
How Do We Find Information?. Key Questions  What are we looking for?  How do we find it?  Why is it difficult? “A prudent question is one-half of wisdom”
Search Tools and Search Engines Searching for Information and common found internet file types.
CPT 499 Internet Skills for Educators Session Three Class Notes.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Chapter 1 Getting Listed. Objectives Understand how search engines work Use various strategies of getting listed in search engines Register with search.
The World Wide Web: Information Resource. How a Search Engine works… How Search Works - YouTube
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
By R. O. Nanthini and R. Jayakumar.  tools used on the web to find the required information  Akeredolu officially described the Web as “a wide- area.
G053 - Lecture 02 Search Engines Mr C Johnston ICT Teacher
Setting up a search engine KS 2 Search: appreciate how results are selected.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
SEARCH ENGINE by: by: B.Anudeep B.Anudeep Y5CS016 Y5CS016.
Crawling When the Google visit your website for the purpose of tracking, Google does this with help of machine, known as web crawler, spider, Google bot,
Education 499-R01 Search Basics.
Information Organization: Overview
Chapter Five Web Search Engines
SEARCH ENGINES & WEB CRAWLER Akshay Ghadge Roll No: 107.
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
CIW Lesson 6 Web Search Engines.
IST 516 Fall 2011 Dongwon Lee, Ph.D.
Prepared by Rao Umar Anwar For Detail information Visit my blog:
Search Engines & Subject Directories
Information Retrieval
Data Mining Chapter 6 Search Engines
Introduction to Information Retrieval
Search Engines & Subject Directories
Search Engines & Subject Directories
Information Retrieval and Web Design
Information Organization: Overview
Information Retrieval and Web Design
Information Retrieval and Web Design
Information Retrieval and Web Design
Information Retrieval and Web Design
Human and Computer Interaction (H.C.I.) &Communication Skills
Information Retrieval and Web Design
Presentation transcript:

Information Retrieval and Web Design Lecture (5) Prepared by Dr. Dunia Hamid Hameed

Web Search Engines Web search engines explore the existing (semantics-free) structure of the Web and try to find documents that match user search criteria: that is, to bring semantics into the process of web search.

The basic idea is to use a set of words (or terms) that the user specifies and retrieve documents that include (or do not include) those words. This is the keyword search approach, well known from the area of information retrieval (IR).

After retrieving a set of documents ranked by their degree of matching the keyword query, they are further ranked by importance (popularity, authority), usually based on the web link structure.

Web Crawlers Browsing the Web is a very useful way to explore a collection of linked web documents as long as we know good starting points: URLs of pages from the topic or area in which we are interested. However, general search for information about a specific topic or area through browsing alone is impractical.

A better approach is to have web pages organized by topic or to search a collection of pages indexed by keywords. The former is done by topic directories and the latter, by search engines.

Collecting “all” web documents can be done by browsing the Web systematically and exhaustively and storing all visited pages. This is done by crawlers (also called spiders or robots).

Topic Directories Web pages are organized into hierarchical structures that reflect their meaning. These are known as topic directories, or simply directories, and are available from almost all web search portals.

Topic Directories The directory structure is often used in the process of web search to better match user criteria or to specialize a search within a specific set of pages from a given category.

Topic Directories The directories are usually created: Manually with the help of thousands of web page creators and editors. There are also approaches to do this automatically by applying machine learning methods for classification and clustering.

Semantic Web Semantic web is a recent initiative led by the web consortium (w3c.org). Its main objective is to bring formal knowledge representation techniques into the web.

Semantic Web The problem here is that the nice format of web pages is very difficult for computers to understand—something that we expect search engines to do.

Semantic Web – Basic Idea The main idea behind the semantic web is to add formal descriptive material to each web page that although invisible to people would make its content easily understandable by computers.