Presentation is loading. Please wait.

Presentation is loading. Please wait.

Tamas Doszkocs, Ph.D. Computer Scientist Meta Searching and Clustering.

Similar presentations


Presentation on theme: "Tamas Doszkocs, Ph.D. Computer Scientist Meta Searching and Clustering."— Presentation transcript:

1 Tamas Doszkocs, Ph.D. Computer Scientist doszkocs@nlm.nih.gov Meta Searching and Clustering

2 What has been will be again, what has been done will be done again, there is nothing new under the sun. (Ecclesiastes 1:9-14 NIV)

3 Meta Searching and Clustering A Brief History Clustering MetaSearching Metadata and Semantics Clustering Examples Meta-Search and Clustering Engines A Clustering GYM AllPlus Web X.Y Trends

4 Related Topics :( that we won’t talk about ):

5 Clustering –"Finding a name for something is a way of conjuring its existence, of making it possible for people to see a pattern where they didn't see anything before“ Howard Rheingold –Purpose: order out of chaos –Indexes and Table of Contents are as old as human records –Luhn, H. P. (1959). Keyword-in-Context Index for Technical Literature (KWIC Index). Yorktown Heights, N. Y.: IBM. –Automatic Information Organization and Retrieval. G Salton - 1968 - McGraw Hill –An Associative Interactive Dictionary - Doszkocs - 1978 –Dialog RANK command 1993 –Northern Light clustering, or "embedded folders", 1999

6 Meta-Searching Purpose: distributed and enhanced search to find more relevant items AID, 1978, MEDLINE, TOXLINE, Hepatitis Databank –Doszkocs, Tamas E. “AID, an Associative Interactive Dictionary for Online Searching” On-Line Review, v2 n2 p163-73 Jun 1978 Chemical Substances Information Network, 1978-198 –Information Retrieval in Toxicology, H.M. Kissman, Annual Review of Pharmacology and Toxicology, April 1980, Vol. 20, Pages 285-305 CITE, 1979 –T. E. Doszkocs and B. A. Rapp. Searching MEDLINE in English: A prototype user interface with natural language query, ranked output, and relevance feedback. In Proceedings of the American Society for Information Science, pages 131--139, White Plains, NY, 1979. Knowledge Industry Publications, Inc Dialog OneSearch, 1987 Associative Concept Navigation in MEDLINE and other NLM Databases via a Mosaic - Forms - WWW Interface Combining Natural Language Processing, Expert Systems and (un)Conventional Information Retrieval Techniques. In Second International World Wide Web Conference, Chicago, Illinois, USA, October 1994. http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/Searching/doszkocs/doszkocs.htmlhttp://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/Searching/doszkocs/doszkocs.html The Open Web and the Hidden Web

7 Metadata and Semantics Wilf Lancaster, Vocabulary Control for Information Retrieval, 1972 –Dublin Core http://www.dublincore.org/ –Federated Searching Interface Techniques for Heterogeneous OAI Repositories http://jodi.ecs.soton.ac.uk/Articles /v02/i04/Liu/http://jodi.ecs.soton.ac.uk/Articles /v02/i04/Liu/ –eXchangeable Faceted Metadata Language http://purl.oclc.org/NET/xfml/core /http://purl.oclc.org/NET/xfml/core / –SIMILE (Semantic Interoperability of Metadata and Information in unLike Environments) http://simile.mit.edu/ –Folksonomies http://flickr.com –Semantic Web http://www.few.vu.nl/~frankh /http://www.few.vu.nl/~frankh / https://scholarsbank.uoregon. edu/dspace/bitstream/1794/32 69/1/ccq_sem_web.pdfhttps://scholarsbank.uoregon. edu/dspace/bitstream/1794/32 69/1/ccq_sem_web.pdf –Ontology Lookup Service http://www.ebi.ac.uk/ontolog y-lookup/http://www.ebi.ac.uk/ontolog y-lookup/ –Web Services for Controlled Vocabularies http://www.asis.org/Bulletin/J un-06/vizine- goetz_houghton_childress.ht mlhttp://www.asis.org/Bulletin/J un-06/vizine- goetz_houghton_childress.ht ml

8 Examples of Search Result Clustering Jerry’s Guide to the Web, 1994 Jerry Yang and David Filo’s Yahoo! 1995 –a directory of web sites, organized in a hierarchy of subject descriptors –Librarians at Yahoo Surfing is to Yahoo! what the Dewey Decimal System is to libraries. In other words, Surfing is the categorization of websites. It also happens to be how Yahoo! began. Today our Surfing team continues its passion for finding, evaluating, and organizing information on the Internet. They have a voracious appetite for learning about new topics. They are curious individuals who are skilled at intuitively and efficiently analyzing and classifying diverse, unstructured pieces of information across the Yahoo! network. Surfers are critical to the relevance and intuitive nature of information presented on Yahoo!. http://careers.yahoo.com/job_descriptions.html Google vs. Yahoo automatic vs. controlled indexing

9 The Remains of the Yahoo Directory

10

11

12 Open Directory Project

13 PubMed Related Articles

14 Folksonomy and Tagging in Flickr

15 Query Refinement with Subject Headings

16 Clustering with Multiple Criteria

17 Multi-faceted Clustering in an OPAC

18 Analyzing Search Results

19 Examples of Meta Search Engines The NLM ToxSeek System

20 Clustering of Search Results with Phrases

21 PolyMeta Clustering

22 Visualizing Topical Clusters

23 Multi-faceted Visualization

24 Clustering in A GYM Ask Google Yahoo MSN

25 Yahoo health

26 Google Health Searches

27 Microsoft Search Result Clustering

28 Clustering Sophistication: or the lack of it

29 AllPlus Clustering: the WHO

30 Clustering and Search Refinement with Natural Language and Controlled Vocabularies

31 The NLM AllPlus Search Demo

32 Web 2.0 Content Mashups in AllPlus

33 HyperGraph Cluster Visualization in AllPlus

34 The All in AllPlus Discovery –Meta-Searching –Clustering – Meaning Morphology Syntax Semantics –Metadata –Thesauri + –Visualization –Web X.Y

35 Trends –Web x.0 Content mashups Improved UI Social Search and Knowledge Organization Query Understanding –Meaning –User intent –Multi-faceted clustering –Multi-dimensional Information Spaces Google http://searchmash.com http://searchmash.com –Digital Libraries –Data Mining and Analysis –Information Visualization –Semantic Web

36

37 Tamas Doszkocs, Ph.D. Computer Scientist doszkocs@nlm.nih.gov Meta Searching and Clustering


Download ppt "Tamas Doszkocs, Ph.D. Computer Scientist Meta Searching and Clustering."

Similar presentations


Ads by Google