Search Engines: Exploring Google By Habib.ur.Rehman Assistant librarian & Coordinator Lincoln Corner Central Library, University of Peshawar.


1 Search Engines: Exploring Google By Habib.ur.Rehman Assistant librarian & Coordinator Lincoln Corner Central Library, University of Peshawar

2 What is a search engine?
- A computer program that retrieves documents, files, or data from a database or from a computer network, especially the Internet (worldweb dictionary).
- A software program that searches a database and gathers and reports information that contains or is related to specified terms.
- A website that searches the Internet for pages and documents relevant to the search terms given. Search engines use robots known as spiders to 'crawl' the web for new content to add to the possibilities for search results.

3 A search engine is software designed specifically to allow you to find anything you want on the Internet. There are many search engines available. Pick the one you would like to use, enter the search string (what you are looking for), and start the search. You will get back a list of entries matching your search string; click on the entries that interest you and you will be taken to that page. Example: http://www.Ask.com

4 Three Types of Search Engines Crawler-based search engines Human-powered directories Hybrid search engines

5 Crawler-based search engines, such as Google (http://www.google.com), create their listings automatically. They "crawl" or "spider" the web, then people search through what they have found. If web pages are changed, crawler-based search engines eventually find these changes, and that can affect how those pages are listed. Page titles, body copy and other elements all play a role. The life span of a typical web query normally lasts less than half a second, yet involves a number of different steps that must be completed before results can be delivered to a person seeking information. The following graphic (Figure 1) illustrates this life span (from http://www.google.com/corporate/tech.html):

6 Crawler-based search engines: the life span of a typical web query
1. The web server sends the query to the index servers. The content inside the index servers is similar to the index in the back of a book: it tells which pages contain the words that match the query.
2. The query travels to the doc servers, which actually retrieve the stored documents. Snippets are generated to describe each search result.
3. The search results are returned to the user in a fraction of a second.
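
The three steps above can be illustrated with a toy example. This is a minimal sketch, not Google's implementation: the three sample documents, the function names, and the snippet width are all assumptions made for illustration. A dictionary plays the role of the index servers (word to pages, like the index in the back of a book), and a second dictionary plays the role of the doc servers.

```python
from collections import defaultdict

# Hypothetical stored documents, standing in for the "doc servers".
DOCS = {
    1: "Search engines crawl the web for new content",
    2: "A directory depends on humans for its listings",
    3: "Spiders crawl web pages and add them to the index",
}

def build_index(docs):
    """Map each lowercase word to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index

def search(query, index, docs, width=30):
    """Return (doc_id, snippet) pairs for documents containing every query word."""
    words = query.lower().split()
    if not words:
        return []
    # Step 1: the index tells which documents contain all the query words.
    matches = set.intersection(*(index.get(w, set()) for w in words))
    results = []
    for doc_id in sorted(matches):
        # Step 2: fetch the stored document and cut a snippet around the first query word.
        text = docs[doc_id]
        pos = text.lower().find(words[0])
        start = max(0, pos - width)
        results.append((doc_id, text[start:pos + len(words[0]) + width]))
    # Step 3: hand the results back to the caller.
    return results

INDEX = build_index(DOCS)
RESULTS = search("crawl web", INDEX, DOCS)
```

A real engine adds ranking, stemming, and distribution across many machines; the sketch only shows why looking words up in an index is faster than scanning every stored page.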

7 Human-powered directories A human-powered directory, such as the Open Directory Project (http://www.dmoz.org/about.html), depends on humans for its listings. (Yahoo!, which used to be a directory, now gets its information from the use of crawlers.) A directory gets its information from submissions, in which a site owner sends the directory a short description of the entire site, or from editors who write a description for the sites they review. A search looks for matches only in these descriptions. Changing web pages, therefore, has no effect on how they are listed. Techniques that are useful for improving a listing with a search engine have nothing to do with improving a listing in a directory. The only exception is that a good site, with good content, might be more likely to get reviewed for free than a poor site.

8 Hybrid search engines Today, it is extremely common for crawler-type and human-powered results to be combined when conducting a search. Usually, a hybrid search engine will favor one type of listing over another. For example, MSN Search (http://www.imagine-msn.com/search/tour/moreprecise.aspx) is more likely to present human-powered listings from LookSmart (http://search.looksmart.com/). However, it also presents crawler-based results, especially for more obscure queries.

9 Top Search Engines for 2010 (By Volume)
Date        Google   Yahoo!   Bing    Ask     Total
2010-03-06  71.07%   14.46%   9.55%   3.0%    98.09%
2010-02-06  71.35%   14.60%   9.56%   2.55%   98.06%
2010-01     71.61%   14.76%   9.13%   2.66%   98.16%
Source: http://www.seoconsultants.com/search-engines/

10 Top Search Engines for 2010 (By Visit) Source: http://www.hitwise.com

11 20 SEARCH The Web's Best Search Engine List! http://www.20search.com/

12 Recommended Search Strategy
1. Analyse your topic to decide where to begin.
2. Select key words and phrases that are relevant to your topic.
3. Pick the right starting place to begin your research (search engine, directory, or invisible web).
4. Use the “advanced search” screen and “search tips” advice available on most good websites.
5. Learn as you go and vary your approach with what you learn. Don’t persist with any strategy that doesn’t work.

13 Why use the ‘Recommended Internet Links’ page?
- All websites have been selected for their quality and authority from reliable sources.
- The directory focuses heavily on websites that are of particular relevance to the Pacific Islands and Vanuatu.
- The directory focuses heavily on websites that are of particular relevance to the USP teaching programme.

14 Google Facts You Don't Know
- Google started in January 1996 as a research project at Stanford University by Ph.D. candidates Larry Page and Sergey Brin, when they were 24 and 23 years old respectively.
- The name “Google” was an accident: a spelling mistake made by the founders, who thought they were going for “Googol”.
- Google is the largest American company (by market capitalization).
- The Google search engine receives about a billion search requests per day.
- Google has the largest network of translators in the world.
- Google consists of over 450,000 servers.
- The Google home page can be set up in 88 languages, including Urdu and Latin.
- The infamous “I'm Feeling Lucky” button is almost never used. However, it costs Google $110 million a year.

15 Recommended general search engines
Popular search engines:
- Google / Google Scholar
- Ask.com
- Yahoo! Search
Google alone is not sufficient. Even though it is probably the biggest search engine, studies show that less than half the searchable web is fully searchable in Google.
Tip 1: Use the ‘advanced search’ option with ALL search engines to refine your search.
Tip 2: Become familiar with your search engine by looking at the ‘Advanced Search Tips’ page.

16 Google (advanced) - Example

17 Googling to the max Google is the biggest search engine database in the world. Google ranks pages on three criteria:
- Popularity: based on the number of links to a page and the importance of the pages that link
- Importance: traffic, quality of links
- Word proximity and occurrence in results
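
The popularity criterion (number of links, weighted by the importance of the linking pages) can be sketched with a small iterative calculation in the spirit of the published PageRank idea. This is only an illustration under stated assumptions: the four-page web, the function name, the damping value of 0.85, and the iteration count are all chosen for the example, and Google's real ranking combines many more signals.

```python
def link_popularity(links, damping=0.85, iterations=50):
    """Score pages so that links from high-scoring pages count for more.

    links maps each page to the list of pages it links to.
    """
    pages = set(links) | {p for targets in links.values() for p in targets}
    n = len(pages)
    score = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with a small baseline share.
        new = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its score on, split evenly among its links.
                share = damping * score[page] / len(targets)
                for target in targets:
                    new[target] += share
            else:
                # A page with no outgoing links spreads its score over all pages.
                for target in pages:
                    new[target] += damping * score[page] / n
        score = new
    return score

# Hypothetical four-page web: three pages link to "home", and "home" links to "about".
WEB = {"home": ["about"], "about": ["home"], "blog": ["home"], "faq": ["home"]}
SCORES = link_popularity(WEB)
```

In this toy web "home" scores highest because it has the most incoming links, and "about" outranks "blog" and "faq" because its single link comes from an important page.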

18 Google tips
- Use the advanced search screen.
- Use quotation marks when searching for a phrase.
- Type your search terms as a statement, not as a question. (Google will try to match the words that you have entered.)
- Google can search more than just documents. Explore the Google site to search for maps, news, images and photographs, books, music, videos, blogs etc.

19 Google “special” searches and shortcuts
- “ ” (quotation marks): always use quotation marks to search for a phrase.
- - (hyphen): always hyphenate a word that is sometimes hyphenated, e.g. same-sex searches same-sex, samesex and same sex.
- ~ (synonyms): let Google “think” of synonyms, e.g. ~youth finds youth, juvenile, adolescent.
- intitle: requires terms to appear in the title of the document, e.g. intitle:"global warming"
- allintitle: requires all terms to appear in the title of the document, e.g. allintitle: traditional knowledge intellectual property pacific

20 More Google shortcuts
- site: used to search within a particular site, e.g. site:un.org "discrimination against women"
- inurl: requires terms to be in the URL, e.g. inurl:usp forsyth will find all references to Forsyth on websites with usp in the URL.
- filetype: only searches particular types of documents, e.g. filetype:ppt "legal research" will locate PowerPoint presentations on legal research.
- movie: only searches movie reviews!

21 And more Google shortcuts!
- Use Google as a calculator, e.g. 6*2
- Use Google to find maps, e.g. map:"port vila"
- Use Google as a dictionary, e.g. define:"mens rea"

22 Searching the “invisible” web
- The “invisible” web is estimated to be two to three times bigger than the “visible” web.
- The invisible web consists of a vast number of documents contained within searchable specialised databases that are not themselves linked web pages.
- You need to identify these databases and search them directly rather than via Google.
- These databases include subscription-only databases (e.g. Lexis) but also freely available databases (e.g. the Emalus Library’s ‘Pacific Law Journal Index’).
- Identify ‘databases’ on the invisible web by using specialist subject directories (such as the Emalus ‘Recommended Internet Links’ page, Sosig, Weblaw etc.), by studying major internet sites in your area of interest, or by including the word ‘database’ or ‘index’ in your search, e.g. “human rights” database

23 Evaluating web pages
- What can the URL tell you? Is this a personal website? What type of domain does it come from? E.g. .com indicates a commercial site whilst .edu indicates an educational site.
- Is there any information on the webpage about who published the materials, or about the authors of the website itself? Can you trust the information on the website?
- Is the page dated? Is it up to date?
- What are the author's credentials on the subject? Are they an expert?
- Are sources documented with footnotes and verifiable references?
- Are there well-annotated links to other sources on the topic?
- Do other reputable sites link to this webpage?
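
The first question, what the URL can tell you, is simple enough to sketch in code. The example below is a rough illustration only: the function name, the example URLs, and the "~ means personal page" rule are assumptions, and the lookup table deliberately contains just the two domain types named on the slide. A URL can never establish trustworthiness by itself; it is one clue among the checks listed above.

```python
from urllib.parse import urlparse

# Only the two domain types named on the slide; other endings are reported as "unknown".
DOMAIN_HINTS = {"com": "commercial site", "edu": "educational site"}

def url_clues(url):
    """Collect simple clues about a page from its URL alone."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    ending = host.rsplit(".", 1)[-1] if "." in host else ""
    return {
        "host": host,
        "domain_type": DOMAIN_HINTS.get(ending, "unknown"),
        # Assumption: a "~" in the path often marks a personal page on a shared server.
        "maybe_personal": "~" in parsed.path,
    }

# Hypothetical URL used only for illustration.
CLUES = url_clues("http://www.example.edu/~jsmith/notes.html")
```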

24 Final comments
- “Garbage in, garbage out”: computers do not think, so how you structure your search will determine the effectiveness of your search results.
- Be critical of what you find on the internet and verify the authority, reliability and currency of all materials.
- Do not rely solely on one search tool such as Google. Make use of two or more search engines and relevant internet subject directories, and explore the invisible web.
- Search the databases in the invisible web that you have access to via your USP library ‘Law Resources’ website (Encyclopedia Britannica, Oxford Reference Library, Westlaw, Lexis, Pacific Law Journal Index etc.).

25 What has Google ever done for us? Google Scholar, Google Finance, Google News, Google Calendar, Google Docs, Google Drive, Google Checkout, Google Mobile, Google Gmail, G-phone, Google knol (wiki)

26
- Don’t ever store user information: 30%
- Give users access to and editing permission over the data they keep: 18%
- Empower users to manage and improve the relevancy of their own search results: 15%
- Be transparent about filtering they use to display results, capture information and disclose biases: 14%
- Give users the opportunity to opt out at will: 10%
- Have regular open conversations with users on the use of user data: 6%
- None of the above: 5%
- Give users the tools to curate and prune search history: 4%

27 Advanced Features of Google
- Fill in the Blanks – “*”
- Diacritics
- Query modifiers
- GAPS (proximity search)

28 Fill in the Blanks – “*”

29 You can replace unknown words with an asterisk - “*”. Google returns results substituting the “*” with words most frequently used in the context of the query.

30 Use “*” for:

31 Possible Uses for “*” Busted! Searching out suspected plagiarism.

32 Possible Uses for “*” All spellings of a word will be found. Including results with common misspellings.

33 Possible Uses for “*” Finding common variations.

34 Fun with “*” Finding parodies.

35 Do not use “*” for:

36 Fill in the Blanks – “*” Replacing a character. If you try to use “*” to fill in a letter, number, or symbol, Google will treat the characters on either side of the “*” like separate keywords.
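
To make the whole-word behaviour concrete, the sketch below imitates a fill-in-the-blank phrase search over local text. It is not how Google evaluates “*”: the function name is hypothetical, and the rule that each “*” stands for exactly one word is a simplifying assumption. It does reflect the point of the slides that “*” stands in for a whole word, never for a single character inside a word.

```python
import re

def wildcard_phrase_regex(phrase):
    """Turn a phrase such as 'four * and seven years ago' into a regular expression.

    Assumption: each standalone "*" stands for exactly one whole word.
    """
    parts = [r"\w+" if token == "*" else re.escape(token) for token in phrase.split()]
    return re.compile(r"\b" + r"\s+".join(parts) + r"\b", re.IGNORECASE)

PATTERN = wildcard_phrase_regex("four * and seven years ago")
HIT = PATTERN.search("Four score and seven years ago our fathers brought forth")
MISS = PATTERN.search("Four and seven years ago")
```

Checking a suspicious sentence against a source text this way mirrors the plagiarism use mentioned earlier: a copied sentence with one word swapped still matches.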

37 Diacritics If you search for a word with a diacritic (distinguishing mark) in it, will Google return results with or without the diacritic?

38 The answer is: both! A search for unité produces results for both unite and unité.

39 Diacritics To limit your search to only unité, add a “-” followed by unite, i.e. search for unité -unite.

40 Diacritics Notice how the number of results has decreased.
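
One plausible way a search tool can treat unité and unite as the same word is to strip diacritics before comparing. The sketch below shows that idea with Python's standard unicodedata module; it is an assumption about the general technique, not a description of what Google does internally, and the function name is made up for the example.

```python
import unicodedata

def strip_diacritics(text):
    """Remove combining marks so that 'unité' and 'unite' compare equal."""
    # NFD splits an accented letter into a base letter plus a combining mark.
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

SAME_WORD = strip_diacritics("unité") == strip_diacritics("unite")
```

With folding like this applied to both the query and the indexed pages, a search for either spelling finds both, which is why an explicit exclusion is needed to get only the accented form.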

41 Query Modifiers Use these commands in the search window.
- intitle: Find sites with one search term in the title.
- allintitle: Find sites with all search terms in the title.
- inurl: Find sites with one search term in the URL.
- allinurl: Find sites with all search terms in the URL.
- site: Limit your search to a specific web site.
- filetype: Specify a type of document to search.
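
Because these modifiers are plain text typed into the search box, a query that uses them can be assembled with ordinary string handling. The sketch below is a hypothetical helper (the function names and parameters are not part of any Google API); it builds the query string and then a standard search URL with the query in the q parameter.

```python
from urllib.parse import urlencode

def build_query(terms, phrase=None, intitle=None, inurl=None, site=None, filetype=None):
    """Assemble a query string: search terms first, then the modifiers."""
    parts = []
    if phrase:
        parts.append(f'"{phrase}"')  # quotation marks keep the words together
    parts.extend(terms)
    if intitle:
        parts.append(f"intitle:{intitle}")
    if inurl:
        parts.append(f"inurl:{inurl}")
    if site:
        parts.append(f"site:{site}")
    if filetype:
        parts.append(f"filetype:{filetype}")
    return " ".join(parts)

def search_url(query):
    """Percent-encode the query and attach it to the search address."""
    return "https://www.google.com/search?" + urlencode({"q": query})

QUERY = build_query([], phrase="legal research", filetype="ppt")
URL = search_url(QUERY)
```

For example, build_query(["ingredient"], intitle="shampoo") produces the query ingredient intitle:shampoo, which can be pasted into the search box as it is.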

42 Query Modifiers – intitle: Find sites with one search term in the title.

43 Query Modifiers – intitle: Find sites with one search term in the title. This search returns sites with the word shampoo in the title and ingredient anywhere in the document.

44 Query Modifiers – allintitle: Find sites with ALL search terms in the title.

45 Query Modifiers – allintitle: Find sites with all search terms in the title. Notice fewer “hits” when shampoo AND ingredient must be found in the title of the page.

46 Query Modifiers – inurl: Find sites with one search term in the URL.

47 Query Modifiers – inurl: Find sites with one search term in the URL. This search returns sites with the word shampoo in the URL and ingredient anywhere in the document.

48 Query Modifiers – allinurl: Find sites with ALL search terms in the URL.

49 Query Modifiers – allinurl: Find sites with all search terms in the URL. Notice fewer “hits” when shampoo AND ingredient must be found in the URL of the page.

50 Query Modifiers – site: Limit your search to a specific web site.

51 Query Modifiers – site: Limit your search to a specific web site. Example 1: Enter search terms, then qualifier. Finds elephant race on the Cal State Fullerton site.

52 Query Modifiers – site: Limit your search to a specific web site. Example 2: Enter search terms, then qualifier. Finds dinosaur on the Smithsonian Institution site.

53 Query Modifiers – site: Limit your search to a specific web site. Example 3: Enter search terms, then qualifier. Limits search of schwarzenegger to official California senate pages.

54 Query Modifiers – filetype: Specify a type of document to search.

55 Query Modifiers – filetype: Specify a type of document to search. pdf – Adobe readable files

56 GAPS by staggernation.com: Google API Proximity Search
- A script that searches Google for two search terms that appear within a certain proximity of each other on a page. Studies show that the closer search terms are in proximity, the better chance that the document is relevant to the searcher’s need.
- For more info go to: http://www.staggernation.com/gaps/readme.html
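
The proximity idea can be tried out on any piece of text you already have. The following sketch is not the GAPS script and does not call Google; it is a local illustration, with a hypothetical function name and a default gap of three words chosen arbitrarily, of what "two terms within a certain proximity" means.

```python
import re

def within_proximity(text, term_a, term_b, max_gap=3):
    """Return True if the two terms occur with at most max_gap words between them."""
    words = re.findall(r"\w+", text.lower())
    positions_a = [i for i, word in enumerate(words) if word == term_a.lower()]
    positions_b = [i for i, word in enumerate(words) if word == term_b.lower()]
    # The gap is the number of words strictly between the two occurrences.
    return any(
        abs(i - j) - 1 <= max_gap
        for i in positions_a
        for j in positions_b
        if i != j
    )

SENTENCE = "Climate change is a global warming issue"
CLOSE_ENOUGH = within_proximity(SENTENCE, "climate", "warming", max_gap=4)
TOO_FAR = within_proximity(SENTENCE, "climate", "warming", max_gap=3)
```

In the sample sentence four words separate the two terms, so the check passes with a gap of four and fails with a gap of three.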

57 Search Engine Showdown For in-depth information on how Google and other web search engines work, go to Greg Notess’ Search Engine Showdown: The User’s Guide to Web Searching at http://www.searchengineshowdown.com/

58 Lesser Used Databases of Google: Images, Video, News, Maps, Books, Groups, Labs, Products, Scholar, Directory

59 One Search – Many Databases For most of these Google databases we will use one search term: “science olympiad”
- The quotation marks will require these words to appear together in this order.

60 About Google Images Google analyzes:
- Text on the web page adjacent to the image
- Captions
- Other factors
It’s not foolproof! Get more info online at http://images.google.com/help/faq_images.html

61 About Google Video Google analyzes:
- Text on the web page adjacent to the video
- Captions
- Other factors
Option to use Google’s SafeSearch Filtering (moderate or strict modes). Get more info online at http://video.google.com/video_about.html

62 About Google News
- Indexed web news in English
- No human intervention
- News alert service available
Get more info at: http://news.google.com/intl/en_us/about_google_news.html

63 About Google Maps
- Mapping technology is sourced largely from NAVTEQ and TeleAtlas.
- Local business info is compiled from web search results, U.S. Yellow Pages, and directly from businesses themselves.
- Satellite map data can be 1-3 years old.
Get more info online at http://maps.google.com/support/

64 About Google Books
- Results can include either excerpts or full text of books.
- Displays links to bookstores and libraries where each book can be found.
- Results come from two sources: the Google Books partner program and the Google Books library project.
Get more info online at http://images.google.com/help/faq_images.html

65 About Google Groups
- Separate from the web
- Searches Usenet and news groups
- This can be useful when searching for a particular error message and hints for solutions.
Get more info online at http://groups.google.com/googlegroups/tour3/index.html

66 About Google Labs Cool new things Google engineers are trying out.
- All are prototypes not ready for primetime or even beta status.
For more info: http://labs.google.com/faq.html

67 About Google Products
- Two views available: Grid View and List View
- Sort by price
- Specify price range
- Group by store or product
- Modify (expand or narrow) your search

68 About Google Products
- Beta: they want feedback!
- Ranks store sites based on relevance
- They do not accept payment for placement within search results.
Get more info at: http://www.google.com/products/intl/en_us/about.html

69 About Google Scholar Results taken from scholarly literature. Google ranks articles by weighing:
- Full text
- Author
- Publication in which article appears
- Number of the article’s citations in other scholarly literature
More info: http://images.google.com/help/faq_images.html

70 About Google Directory (Open Directory)
- Web directory (not a search engine)
- Human-edited
- Lists and categorizes web sites
- No ranking
- Also used by AOL search, Netscape search, HotBot, Lycos, and others
More info: http://www.google.com/dirhelp.html

