Tamas Doszkocs, Ph.D. Computer Scientist Meta Searching and Clustering.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Data Science for Business: Semantic Verses Dr. Brand Niemann Director and Senior Data Scientist Semantic Community
“ Leveraging SharePoint 2010 Search Technologies ” With: Ivan Neganov.
Health Information Literacy Manual Presentation Module 2 Searching Tools.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
Best Web Directories and Search Engines Order Out of Chaos on the World Wide Web.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Interfaces for Selecting and Understanding Collections.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Best Web Directories and Search Engines Order Out of Chaos on the World Wide Web.
Introduction Web Development II 5 th February. Introduction to Web Development Search engines Discussion boards, bulletin boards, other online collaboration.
WHAT HAVE WE DONE SO FAR?  Weeks 1 – 8 : various components of an information retrieval system  Now – look at various examples of information retrieval.
Search Engines and Metasearch Engines From Dr. Gene Jonjsma.
Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law: any public.
N-gram Topic Models for Bibliometric Analysis Gideon Mann, David Mimno, and Andrew McCallum Can topic models provide better measurements of the impact.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Aardvark Anatomy of a Large-Scale Social Search Engine.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Selecting a Topic and Purpose
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
WHAT IS A SEARCH ENGINE. Widescreen Presentation Proteus, Keeper of Knowledge. Proteus is synonymous with change and success.
Web Searching Basics Dr. Dania Bilal IS 530 Fall 2009.
Search Engine Interfaces search engine modus operandi.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Keyword vs. Controlled Vocabulary Searching 12 Basic Skills for IQ.
29-30 October, 2006, Estonia 1 IST4Balt Information analysis using social bookmarking and other tools IST4Balt Information analysis using social bookmarking.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
CSM06 Information Retrieval Lecture 1a – Introduction Dr Andrew Salway
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Presented by Dr. S. C. Jindal Librarian Central Science Library University of Delhi Delhi Information Competency.
Search Engine Architecture
Search Engines Reyhaneh Salkhi Outline What is a search engine? How do search engines work? Which search engines are most useful and efficient? How can.
Search Engines.
Indexing Mathematical Abstracts by Metadata and Ontology IMA Workshop, April 26-27, 2004 Su-Shing Chen, University of Florida
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
World Wide Web Library 150 Week 8. The Web The World Wide Web is one part of the Internet. No one controls the web Diverse kinds of services accessed.
Indexes and Abstracts: Dissecting the Resource By M. Leedy.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
1 Automatic indexing Salton: When the assignment of content identifiers is carried out with the aid of modern computing equipment the operation becomes.
Workshop on The Transformation of Science Max Planck Society, Elmau, Germany June 1, 1999 TOWARDS INFORMATIONAL SCIENCE Indexing and Analyzing the Knowledge.
Digital Literacy Concepts and basic vocabulary. Digital Literacy Knowledge, skills, and behaviors used in digital devices (computers, tablets, smartphones)
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Chapter 3 Searching the Literature. Literature Reviews are Important conceptual literature – books and articles written by experts related research –
Chapter 3 Searching the Literature. Reading the Literature You will need to understand what has already been written about a topic before you can ask.
Smart Web Search Agents Data Search Engines >> Information Search Agents - Traditional searching on the Web is done using one of the following three: -
Lecture 4 Access Tools/Searching Tools. Learning Objectives To define access tools To identify various access tools To be able to formulate a search strategy.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Information Literacy University of Namibia Library 2006.
Metasearch: Top-Level Interface, Reference Applications
Information Retrieval and Web Search
Search Engine Architecture
CIW Lesson 6 Web Search Engines.
Information Retrieval and Web Search
Federated & Meta Search
Information Retrieval and Web Search
Taxonomies, Lexicons and Organizing Knowledge
Information Integration for Digital Libraries
American Library Association Online Resource Center
ثانيا :أدوات البحث عبر الانترنت
Web Mining Department of Computer Science and Engg.
Search Engine Architecture
Information Retrieval and Web Search
PHARM Library Orientation
Presentation transcript:

Tamas Doszkocs, Ph.D. Computer Scientist Meta Searching and Clustering

What has been will be again, what has been done will be done again, there is nothing new under the sun. (Ecclesiastes 1:9-14 NIV)

Meta Searching and Clustering A Brief History Clustering MetaSearching Metadata and Semantics Clustering Examples Meta-Search and Clustering Engines A Clustering GYM AllPlus Web X.Y Trends

Related Topics :( that we won’t talk about ):

Clustering –"Finding a name for something is a way of conjuring its existence, of making it possible for people to see a pattern where they didn't see anything before“ Howard Rheingold –Purpose: order out of chaos –Indexes and Table of Contents are as old as human records –Luhn, H. P. (1959). Keyword-in-Context Index for Technical Literature (KWIC Index). Yorktown Heights, N. Y.: IBM. –Automatic Information Organization and Retrieval. G Salton McGraw Hill –An Associative Interactive Dictionary - Doszkocs –Dialog RANK command 1993 –Northern Light clustering, or "embedded folders", 1999

Meta-Searching Purpose: distributed and enhanced search to find more relevant items AID, 1978, MEDLINE, TOXLINE, Hepatitis Databank –Doszkocs, Tamas E. “AID, an Associative Interactive Dictionary for Online Searching” On-Line Review, v2 n2 p Jun 1978 Chemical Substances Information Network, –Information Retrieval in Toxicology, H.M. Kissman, Annual Review of Pharmacology and Toxicology, April 1980, Vol. 20, Pages CITE, 1979 –T. E. Doszkocs and B. A. Rapp. Searching MEDLINE in English: A prototype user interface with natural language query, ranked output, and relevance feedback. In Proceedings of the American Society for Information Science, pages , White Plains, NY, Knowledge Industry Publications, Inc Dialog OneSearch, 1987 Associative Concept Navigation in MEDLINE and other NLM Databases via a Mosaic - Forms - WWW Interface Combining Natural Language Processing, Expert Systems and (un)Conventional Information Retrieval Techniques. In Second International World Wide Web Conference, Chicago, Illinois, USA, October The Open Web and the Hidden Web

Metadata and Semantics Wilf Lancaster, Vocabulary Control for Information Retrieval, 1972 –Dublin Core –Federated Searching Interface Techniques for Heterogeneous OAI Repositories /v02/i04/Liu/ /v02/i04/Liu/ –eXchangeable Faceted Metadata Language / / –SIMILE (Semantic Interoperability of Metadata and Information in unLike Environments) –Folksonomies –Semantic Web / / edu/dspace/bitstream/1794/32 69/1/ccq_sem_web.pdfhttps://scholarsbank.uoregon. edu/dspace/bitstream/1794/32 69/1/ccq_sem_web.pdf –Ontology Lookup Service y-lookup/ y-lookup/ –Web Services for Controlled Vocabularies un-06/vizine- goetz_houghton_childress.ht mlhttp:// un-06/vizine- goetz_houghton_childress.ht ml

Examples of Search Result Clustering Jerry’s Guide to the Web, 1994 Jerry Yang and David Filo’s Yahoo! 1995 –a directory of web sites, organized in a hierarchy of subject descriptors –Librarians at Yahoo Surfing is to Yahoo! what the Dewey Decimal System is to libraries. In other words, Surfing is the categorization of websites. It also happens to be how Yahoo! began. Today our Surfing team continues its passion for finding, evaluating, and organizing information on the Internet. They have a voracious appetite for learning about new topics. They are curious individuals who are skilled at intuitively and efficiently analyzing and classifying diverse, unstructured pieces of information across the Yahoo! network. Surfers are critical to the relevance and intuitive nature of information presented on Yahoo!. Google vs. Yahoo automatic vs. controlled indexing

The Remains of the Yahoo Directory

Open Directory Project

PubMed Related Articles

Folksonomy and Tagging in Flickr

Query Refinement with Subject Headings

Clustering with Multiple Criteria

Multi-faceted Clustering in an OPAC

Analyzing Search Results

Examples of Meta Search Engines The NLM ToxSeek System

Clustering of Search Results with Phrases

PolyMeta Clustering

Visualizing Topical Clusters

Multi-faceted Visualization

Clustering in A GYM Ask Google Yahoo MSN

Yahoo health

Google Health Searches

Microsoft Search Result Clustering

Clustering Sophistication: or the lack of it

AllPlus Clustering: the WHO

Clustering and Search Refinement with Natural Language and Controlled Vocabularies

The NLM AllPlus Search Demo

Web 2.0 Content Mashups in AllPlus

HyperGraph Cluster Visualization in AllPlus

The All in AllPlus Discovery –Meta-Searching –Clustering – Meaning Morphology Syntax Semantics –Metadata –Thesauri + –Visualization –Web X.Y

Trends –Web x.0 Content mashups Improved UI Social Search and Knowledge Organization Query Understanding –Meaning –User intent –Multi-faceted clustering –Multi-dimensional Information Spaces Google –Digital Libraries –Data Mining and Analysis –Information Visualization –Semantic Web

Tamas Doszkocs, Ph.D. Computer Scientist Meta Searching and Clustering