

WEB SCIENCE: SEARCHING THE WEB

Basic Terms
- Search engine: software that finds information on the Internet or World Wide Web
- Web crawler: an automated program that surfs the web and indexes and/or copies the pages it visits; also known as a bot, web spider, or web robot
- Meta tag: extra information that tags the HTML document
- Hyperlink (or link): a reference to another web page

How do you evaluate a search engine?
- Time taken to return results
- Number of results
- Quality of results

How does a web crawler work?
1. Start at a webpage
2. Download the HTML content
3. Search for the HTML link tags
4. Repeat steps 2-3 for each of the links
5. When a website has been completely indexed, load and crawl other websites
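The crawler steps above can be sketched in Python roughly as follows. This is a minimal illustration, not a production crawler: the `fetch` parameter, the `LinkParser` and `crawl` names, and the three-page `FAKE_WEB` site are all assumptions made for the example, and the "web" is an in-memory dictionary so that the sketch runs without a network connection.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch, max_pages=100):
    """Breadth-first crawl from start_url; returns {url: html} for fetched pages.

    fetch is any function that maps a URL to its HTML, or None if unavailable.
    """
    pages = {}
    seen = {start_url}
    queue = deque([start_url])                 # step 1: start at a webpage
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)                      # step 2: download the HTML
        if html is None:
            continue
        pages[url] = html
        parser = LinkParser()
        parser.feed(html)                      # step 3: find the link tags
        for href in parser.links:
            link = urljoin(url, href)          # resolve relative links
            if link not in seen:               # step 4: repeat for new links
                seen.add(link)
                queue.append(link)
    return pages


# Hypothetical three-page site used instead of real HTTP requests.
FAKE_WEB = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a>',
    "http://example.test/b": '<a href="/">home</a> <a href="/missing">gone</a>',
}

pages = crawl("http://example.test/", FAKE_WEB.get)
print(sorted(pages))
```

The `seen` set is what keeps the crawler from looping forever when pages link back to each other. A real crawler would swap `FAKE_WEB.get` for an HTTP download and would also need politeness rules, which the slides do not cover.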

Parallel Web Crawling
- Speed up your web crawling by running on multiple computers at the same time (i.e. parallel computing)
- How often should you crawl the entire Internet?
- How many copies of the Internet should you keep?
- What are the different ways to index a webpage?
  - Meta keywords
  - Content
  - Page rank (number of links to the page)
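One way to picture parallel crawling is the sketch below, which downloads each batch of newly discovered pages concurrently. It is an illustration under stated assumptions: worker threads on one machine stand in for the multiple computers mentioned above, links are extracted with a simple regular expression rather than a full HTML parser, and `parallel_crawl`, `fetch`, and the `FAKE_WEB` pages are hypothetical names invented for the example.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Simplification: match absolute links written as href="...".
LINK_RE = re.compile(r'href="([^"]+)"')


def parallel_crawl(start_url, fetch, workers=4, max_pages=100):
    """Crawl level by level, fetching every page in a level concurrently."""
    pages = {}
    seen = {start_url}
    frontier = [start_url]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while frontier and len(pages) < max_pages:
            batch = frontier[: max_pages - len(pages)]
            results = list(pool.map(fetch, batch))   # downloads run in parallel
            frontier = []
            for url, html in zip(batch, results):
                if html is None:
                    continue
                pages[url] = html
                for link in LINK_RE.findall(html):
                    if link not in seen:
                        seen.add(link)
                        frontier.append(link)
    return pages


# Hypothetical four-page web with absolute links.
FAKE_WEB = {
    "http://a.test/": '<a href="http://b.test/">b</a><a href="http://c.test/">c</a>',
    "http://b.test/": '<a href="http://d.test/">d</a>',
    "http://c.test/": '<a href="http://d.test/">d</a><a href="http://a.test/">a</a>',
    "http://d.test/": "no links here",
}

pages = parallel_crawl("http://a.test/", FAKE_WEB.get)
print(sorted(pages))
```

The bookkeeping (`seen`, `pages`) stays in one place while only the downloads are spread across workers. Across real machines, that shared bookkeeping is the hard part, which is one reason the questions on this slide have no single answer.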

Basic Search Engine Algorithm
1. Crawl the Internet
2. Save meta keywords for every page
3. Save the content and popular words on the page
4. When somebody needs to find something, search for matching keywords or content words
Problem: Nothing stops you from inserting your own keywords or content that do not relate to the page’s *actual* content
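Steps 2 to 4 above amount to building a lookup table from words to pages. The sketch below is one possible illustration; `tokenize`, `build_index`, `search`, and the three example pages are assumptions made for the example, and for simplicity it indexes every content word rather than only the popular ones.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """pages: {url: {"keywords": [...], "content": "..."}} -> {word: set of urls}."""
    index = defaultdict(set)
    for url, page in pages.items():
        for keyword in page["keywords"]:          # step 2: meta keywords
            for word in tokenize(keyword):
                index[word].add(url)
        for word in tokenize(page["content"]):    # step 3: content words
            index[word].add(url)
    return index


def search(index, query):
    """Step 4: return the URLs that match every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


# Hypothetical pages; the last one stuffs unrelated meta keywords.
pages = {
    "http://cats.test/": {"keywords": ["cats", "pets"],
                          "content": "All about cats and kittens."},
    "http://cars.test/": {"keywords": ["cars"],
                          "content": "Used cars for sale."},
    "http://spam.test/": {"keywords": ["cats", "cars", "pets"],
                          "content": "Buy cheap watches."},
}

index = build_index(pages)
print(search(index, "cats"))       # includes the spam page
print(search(index, "used cars"))
```

The search for "cats" returns the spam page as well as the genuine one, which is exactly the problem stated above: the index trusts whatever keywords the page author supplies.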

PageRank Algorithm
1. Crawl the Internet
2. Save the content and index the contents’ popular words
3. Identify the links on the page
4. Each link to an already indexed page increases the PageRank of that linked page
5. When somebody needs to find something, search for matching keywords or content words, BUT rank the search results according to PageRank
Problem: Someone can create a bunch of websites that all link to a single specific page
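The ranking idea in steps 3 to 5 can be sketched as a count of inbound links. Note that this follows the simplified description on this slide; the published PageRank algorithm goes further and weights each link by the rank of the page it comes from, computed iteratively. The function names and the example link graph below are assumptions made for illustration.

```python
from collections import Counter


def inbound_link_counts(links):
    """links: {url: [urls it links to]} -> Counter of inbound links per url."""
    counts = Counter()
    for source, targets in links.items():
        for target in set(targets):       # count each linking page only once
            if target != source:          # ignore links from a page to itself
                counts[target] += 1
    return counts


def rank_results(matches, counts):
    """Order matching URLs by inbound-link count, highest first."""
    return sorted(matches, key=lambda url: (-counts[url], url))


# Hypothetical link graph for three honest sites.
links = {
    "http://news.test/": ["http://wiki.test/", "http://blog.test/"],
    "http://blog.test/": ["http://wiki.test/"],
    "http://wiki.test/": ["http://news.test/"],
}
counts = inbound_link_counts(links)
print(rank_results(list(links), counts))

# The problem from the slide: five made-up sites all link to one page.
farm = {f"http://farm{i}.test/": ["http://spam.test/"] for i in range(5)}
farm_counts = inbound_link_counts({**links, **farm})
print(rank_results(["http://wiki.test/", "http://spam.test/"], farm_counts))
```

With the five extra sites added, the spam page collects five inbound links and outranks the wiki page, which has two. Weighting links by the rank of the linking page makes this kind of manipulation harder, because freshly created sites have little rank of their own to pass on.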

Shallow Web vs. Deep Web
- Shallow web
  - Websites and content that are easily visible to “dumb search engines”
  - Content publicly links to other content
  - Shallow web content tends to be static content (unchanging)
- Deep web
  - Websites and content that tend to be dynamic and/or unlinked
  - Private web sites
  - Unlinked content
  - Smarter search engines can crawl the deep web

Search Engine Optimization (SEO)
- Meta keywords: words that relate to your content
- Human-readable URLs, i.e. avoid complicated dynamically created URLs
- Links to your page on other websites
- Page visits
- Others?

White hat vs. black hat SEO
- White hats are the good guys. When would they be used?
- Black hats are the bad guys. When would they be used?
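As a small illustration of the human-readable URL point in the SEO list, the sketch below turns a page title into a URL-friendly "slug". The `slugify` name and the example paths are hypothetical, and this is only one common convention, not a rule that any particular search engine is known to require.

```python
import re
import unicodedata


def slugify(title):
    """Convert a page title into a lowercase, hyphen-separated URL segment."""
    # Strip accents so that, for example, "é" becomes "e".
    ascii_title = (
        unicodedata.normalize("NFKD", title)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    words = re.findall(r"[a-z0-9]+", ascii_title.lower())
    return "-".join(words)


title = "How Does a Web Crawler Work?"
print("/page.php?id=4821&cat=7")          # hypothetical dynamically created URL
print("/articles/" + slugify(title))      # human-readable alternative
```

The second path tells both a person and a crawler what the page is about before it is even opened, whereas the first reveals nothing.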

Search Engine Design
- Assumptions are key to design!
- Major problems in older search engines:
  - People gamed the search results
  - Results were not tailored to the user
- What assumptions does a typical search engine make now? (i.e. what factors influence search today?)