IEEE Knowledge Media Networking KMN’02 Keynote Address, CRL, Kyoto Japan, July 11, 2002 Concept Switching in the Interspace: Networking Infrastructure.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Taxonomies, Lexicons and Organizing Knowledge Wendi Pohs, IBM Software Group.
Image Information Retrieval Shaw-Ming Yang IST 497E 12/05/02.
Bioinformatics Director Lecture University of Michigan Medical School February 7, 2000 Building Analysis Environments Beyond the Genome and the Web Bruce.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
Michigan Life Sciences Corridor Bioinformatics, University of Michigan March 14, 2001 Building Analysis Environments Beyond the Genome and the Web Bruce.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
1 CS 430 / INFO 430 Information Retrieval Lecture 27 Classification 2.
1 CS 502: Computing Methods for Digital Libraries Lecture 20 Multimedia digital libraries.
Interfaces for Selecting and Understanding Collections.
Automating Keyphrase Extraction with Multi-Objective Genetic Algorithms (MOGA) Jia-Long Wu Alice M. Agogino Berkeley Expert System Laboratory U.C. Berkeley.
High-Performance Digital Library Classification Systems: PI: Hsinchun Chen, The University of Arizona From Information Retrieval to Knowledge Management.
GL12 Conf. Dec. 6-7, 2010NTL, Prague, Czech Republic Extending the “Facets” concept by applying NLP tools to catalog records of scientific literature *E.
BeeSpace: An Interactive Environment for Analyzing Nature and Nurture in Societal Roles Bruce Schatz Institute for Genomic Biology University of Illinois.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Bioinformatics Seminar Department of Computer Science, UIUC February 25, 2005 Analysis Environments For Functional Genomics Bruce R. Schatz CANIS Laboratory.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
University of Illinois at Urbana-Champaign INSTITUTE FOR GENOMIC BIOLOGY BeeSpace: An Interactive Environment for Functional Analysis of Social Behavior.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Defining Text Mining Preprocessing Transforming unstructured data stored in document collections into a more explicitly structured intermediate format.
Bio-Medical Information Retrieval from Net By Sukhdev Singh.
Dr. Susan Gauch When is a rock not a rock? Conceptual Approaches to Personalized Search and Recommendations Nov. 8, 2011 TResNet.
International Conference on Digital Libraries November 16, 2000 Kyoto, Japan Digital Libraries of Community Knowledge: The Coming World of the Interspace.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
Producción de Sistemas de Información Agosto-Diciembre 2007 Sesión # 8.
GSLIS Proseminar February 24, 2003 The Evolution of the Net: Predicting Network Infrastructure Bruce R. Schatz Graduate School of Library and Information.
Kohonen Mapping and Text Semantics Xia Lin College of Information Science and Technology Drexel University.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
CNI Spring Meeting April 26, 1999 Washington, DC THE NET OF THE 21st CENTURY: Concepts across the Interspace Bruce Schatz CANIS Laboratory Graduate School.
University of Malta CSA3080: Lecture 4 © Chris Staff 1 of 14 CSA3080: Adaptive Hypertext Systems I Dr. Christopher Staff Department.
Department of Computer Science seminar University of Illinois, February 14, 2005 The Evolution of the Net: Predicting Global Infrastructure Bruce R. Schatz.
1 CS 430: Information Discovery Lecture 25 Cluster Analysis 2 Thesaurus Construction.
Indexing Mathematical Abstracts by Metadata and Ontology IMA Workshop, April 26-27, 2004 Su-Shing Chen, University of Florida
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
1 CS 430: Information Discovery Lecture 23 Cluster Analysis 2 Thesaurus Construction.
Translating Dialects in Search: Mapping between Specialized Languages of Discourse and Documentary Languages Vivien Petras UC Berkeley School of Information.
CODE (Committee on Digital Environment) July 26, 2000 Rice University THE NET OF THE 21st CENTURY: Concepts across the Interspace Bruce Schatz CANIS Laboratory.
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Workshop on The Transformation of Science Max Planck Society, Elmau, Germany June 1, 1999 TOWARDS INFORMATIONAL SCIENCE Indexing and Analyzing the Knowledge.
Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.
Revolutionary System Models, The Net, & The Public Interest The Interspace Prototype ( ) Digital Libraries Initiative ( ) Worm Community.
Revolution & Kids: Building the Future of the Net & Understanding the Structures of the World Bruce R. Schatz CANIS - Community Systems Laboratory University.
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
1 KMeD: A Knowledge-Based Multimedia Medical Database System Wesley W. Chu Computer Science Department University of California, Los Angeles
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Department of Social Informatics Graduate School of Informatics Kyoto University, Japan July 8, 2004 The Social Informatics of Healthcare Infrastructure.
BeeSpace: An Interactive Environment for Functional Analysis of Social Behavior Bruce Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic Interoperability for Geographic Information Systems Tobun Dorbin Ng Artificial Intelligence Lab The University of Arizona.
1 CS 430: Information Discovery Lecture 28 (a) Two Examples of Cluster Analysis (b) Conclusion.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Graduate School of Informatics Kyoto University, November 14, 2001 Functions of the Interspace Infrastructure for Concept Spaces Bruce Schatz CANIS Laboratory.
Visual Information Retrieval
Applications of the Interspace Analysis for Community Repositories
Introduction Multimedia initial focus
Data and Applications Security Developments and Directions
CSE 635 Multimedia Information Retrieval
Introduction to Information Retrieval
Information Retrieval in Digital Libraries: Bringing Search to the Net
Presentation transcript:

IEEE Knowledge Media Networking KMN’02 Keynote Address, CRL, Kyoto Japan, July 11, 2002 Concept Switching in the Interspace: Networking Infrastructure for Community Knowledge Bruce Schatz CANIS Laboratory Graduate School of Library and Information Science University of Illinois at Urbana-Champaign Graduate School of Informatics, Kyoto University

THE THIRD WAVE OF NET EVOLUTION PACKETS OBJECTS CONCEPTS

from Objects to Concepts from Syntax to Semantics Infrastructure is Interaction with Abstraction Internet is packet transmission across computers Interspace is concept navigation across repositories CONCEPT SPACES

Technology Engineering Electrical FORMAL INFORMAL (manual) (automatic) IEEE communities groups individuals LEVELS OF INDEXES

THE DISTRIBUTED WORLD Community Repositories in the Interspace Peer to Peer Networking Infrastructure Every Person performs Every Role USERrequest LIBRARIANreference INDEXERclassify PUBLISHERquality AUTHORgenerate

Meta Data How to Represent the Community Knowledge Automatic and Interactive Representation Techniques for Capturing the Fundamental Structure

Meta Maps How to Locate the Community Knowledge Automatic and Interactive Location Techniques for Capturing the Fundamental Landscape

CONCEPTS ACROSS THE INTERSPACE

SCALABLE SEMANTICS Automatic indexing Domain-Independent indexing Statistical clustering Compute Context of concepts within documents documents within repositories

CROSS-OVERS IN SEMANTIC INDEXING

COMPUTING CONCEPTS ‘92: 4,000 (molecular biology) ‘93: 40,000 (molecular biology) ‘95: 400,000 (electrical engineering) ‘96: 4,000,000 (engineering) ‘98: 40,000,000 (medicine)

SIMULATING A NEW WORLD Obtain discipline-scale collection MEDLINE from NLM, 10M bibliographic abstracts human classification: Medical Subject Headings Partition discipline into Community Repositories 4 core terms per abstract for MeSH classification 32K nodes with core terms (classification tree) Community is all abstracts classified by core term 40M abstracts containing 280M concepts concept spaces took 2 days on NCSA Origin 2000 Simulating World of Medical Communities 10K repositories with > 1K abstracts (1K w/ > 10K)

COMMUNITY PROCESSING

Semantic Indexing Extracting Concepts (AI) Canonical noun phrases Generic statistical parser Computing Context (IR) Co-occurrence frequency, in collection Useful interactively, not strict ordering

System Side Infrastructure Classification Technologies for Multimedia Documents Phrases (multi-word nouns) Concepts (generic phrases) Types (identified concepts) Clusters (grouped types) Structures (semantic universals)

INTERSPACE NAVIGATION Semantic Indexes for Community Repositories Navigating Abstractions within Repository concept space & category map Interactive browsing by Community experts *

Interspace Remote Access Client

Navigation in MEDSPACE For a patient with Rheumatoid Arthritis Find a drug that reduces the pain (analgesic) but does not cause stomach (gastrointestinal) bleeding Choose Domain

Concept Search

Concept Navigation

Retrieve Document

Navigate Document

Retrieve Document

Category Map

Category Navigation

Concept Navigation

User Side Infrastructure Navigation Technologies for Search Interfaces Exact Match (noun phrases) Relationship List (concept suggestions) Cluster Comparison (groups to groups) Spreading Activation (group intersections) Artificial Landscapes (semantic distances)

SWITCHING In the Interspace… each Community maintains its own repository Switching is navigating Across repositories use your vocabulary to search another specialty

Medicine Session

Categories and Concepts

Concept Switching

Document Retrieval

CONCEPT SWITCHING “Concept” versus “Term” set of “semantically” equivalent terms Concept switching region to region (set to set) match term Semantic region Concept Space

ENGINEERING SESSION

Engineering Categories & Concepts

Further Concept Navigation

Searching via Concept Suggestion

Switching Across Repositories

Future Technologies Concept Switching Spreading activation, type tagging Dynamic Indexing On-the-fly collections, during session Path Matching Aggregating indexes, many repositories

Semantic Analysis of Multimedia Collections of Objects containing Units Text: community repository (topic proximity) document abstracts containing noun phrases Image: aerial photograph (spatial proximity) feature regions containing texture tiles Units -- media-dependent (statistical parsers) Indexes -- media-independent (statistical clusters)

Media Interoperability Model text concept space & category map (geoscience) 1M phrases in 500K abstracts from Georef and Petroleum Abstracts image concept & category maps in aerial photos visual thesaurus maps for 200K regions in 800 images (6M tiles) geographic map (where) v. semantic map (what) spatial gazetteer as bridge image text number

Text and Number Interoperability Text and AVHRR Query: Show me information about Santa Barbara area with mild temperature and high vegetation density. Integrated Result: Within the bounding geography location, 2 documents and 88 AVHRR records related to the integrated query are retrieved.

Image Concept Switching Image Query: By browsing a texture (tile) catalog, show me information about residential and farm land areas. Result: A set of related images are retrieved and shown in the Results Frame. The full-size image #368 is displayed with its place names and tile locations.

INFORMATION SPACEFLIGHT Landscape as category map visualization Valleys are semantic clusters Hills are semantic distances Traversal across multiple levels of abstraction

Category Maps

SELF-ORGANIZING MAPS (SOMs)

INFORMATION SPACEFLIGHT

Flying through Cyberspace

THE NET OF THE 21st CENTURY Beyond Objects to Concepts Beyond Search to Analysis Problem Solving via Cross-Correlating Multimedia Information across the Net Every community has its own special library Every community does semantic indexing true The Interspace is true Cyberspace

Subject Assignment Improved Search by Identifying Subjects Human Indexers classify Documents From Subject Thesaurus and Knowledge Interactive Support for Community Curators (Subject Experts but Classification Amateurs) Use Concept Spaces to Suggest Subjects From Related Documents in the Collection See Best Paper Nominee at ACM DL 98

Structure Assignment Improved Search by Identifying Structures Human Indexers classify Clusters From Generic Structures beyond Subjects Universal Structures Cross-Cultural Interactive Support for Community Curators (Subject Experts but Classification Amateurs) Necessary for Peer-Peer Infrastructure When Ordinary Persons form Communities

The Structures of Everyday Life Bodies(individuals) Food and Clothes Buildings(groups) Houses and Cities Transportation(physical interactions) Rails (trains) and Roads (cars) Communication(logical interactions) Phones (talking) and Computers (retrieving)

Navigating Universal Structures A planet for every kid’s local environment Federating the planets into a universe Ordering all planets from kid’s Point Of View Kids Universe Flying through the Kids Universe Finding similar kids from different POVs Connecting historically through museums