Cyborg Categorization Salvation for Search? Tom Reamy Information Architect Charles Schwab © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights.

Slides:



Advertisements
Similar presentations
Taxonomy Development in an Enterprise Context Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Advertisements

Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Top Tips Enterprise Content Management Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Making Search Relevant SchemaLogic Gary Carlson Chief Taxonomist
Metadata Strategies Alternatives for creating value from metadata Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
PolyAnalyst Data and Text Mining tool Your Knowledge Partner TM www
Improving Navigation and Findability Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
Buy, Build, Automate: Why you should Buy Your Taxonomy Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Cyborg Categorization The Basics Tom Reamy Knowledge Architect Intranet Consultant.
Enterprise Information Architecture A Platform for Integrating Your Organization’s Information and Knowledge Activities Tom Reamy Chief Knowledge Architect.
Information and Business Work
Faceted Navigation: Search and Browse Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Innovation in Search? Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Model of Taxonomy Development Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Taxonomy Boot Camp Panel Text Analytics Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement: Relevance Feedback Information Filtering.
IR & Metadata. Metadata Didn’t we already talk about this? We discussed what metadata is and its types –Data about data –Descriptive metadata is external.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Automatic Facets: Faceted Navigation and Entity Extraction Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Beyond Sentiment Mining Social Media A Panel Discussion of Trends and Ideas Marie Wallace, IBM Marcello Pellacani, Expert System Fabio Lazzarini, CRIBIS.
Enterprise Semantic Infrastructure Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Scatter/Gather : A Cluster Based Approach to Large Document Collections Alyssa Katz LIS 551 March 23, 2003.
Expanding Enterprise Roles for Librarians Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Unstructured Content Management Taxonomic Publishing Models Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Selecting Taxonomy Software Who, Why, How Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Implementing Metadata Marjorie M K Hlava, President Access Innovations, Inc. Albuquerque, NM
SEARCHING ON THE INTERNET
Taxonomy and Knowledge Organization Taxonomy in Context Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Knowledge Maps An Intellectual Infrastructure for KM Tom Reamy Knowledge Architect Intranet Consultant.
Building a Foundation for Info Apps Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture Professional.
IBE312: Ch15 Building an IA Team & Ch16 Tools & Software 2013.
Enterprise Search/ Text Analytics Evaluation Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics And Text Mining Best of Text and Data
Knowledge Management and Technology for Today’s Legal Professional L. Keith Lipman, Esquire Director, Advanced Technology Solutions.
SemTech Text Analytics Evaluation Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Taxonomies and Faceted Navigation Getting the Best of Both
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Academic Research to Support Arguments.
Controlled Vocabulary & Thesaurus Design Planning & Maintenance.
Content Categorization Tools Taxonomies & Technologies for Infrastructure Solutions Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture.
Text Analytics Summit Text Analytics Evaluation Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics Software Choosing the Right Fit Tom Reamy Chief Knowledge Architect KAPS Group Text Analytics World October 20.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Faceted Navigation Design Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Metadata and Taxonomies The Best of Both Worlds Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
Integrating an Enterprise Taxonomy with Local Variations Tom Reamy Chief Knowledge Architect KAPS Group Taxonomy Boot Camp.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
Enterprise Semantic Infrastructure Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Folksonomy Folktales Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Selecting Taxonomy Software Who, Why, How Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Text Analytics Workshop Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
MIS 2000 Chapter 15 Knowledge Management. Outline Knowledge Explicit and Tacit Knowledge Knowledge Management Activities Computer-Aided Design/Manufacturing.
Advanced Semantics and Search Beyond Tag Clouds and Taxonomies Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services.
Text Analytics A Tool for Taxonomy Development Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture.
Text Analytics Workshop Applications Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Subject Headings Objective: Students will understand that both books and articles are assigned words to describe their contents. These terms are referred.
The NIH Enterprise Information Portal IMPAC II GM Lead Users Group April 10, 2002.
Knowledge Retrieval Taxonomies & Auto-Categorization Tom Reamy Knowledge Architect Intranet Consultant.
Bringing Order to the Web : Automatically Categorizing Search Results Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Hao Chen Susan Dumais.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Taxonomy Development An Infrastructure Model Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Information Organization: Overview
Taxonomies, Lexicons and Organizing Knowledge
Introduction into Knowledge and information
Information Organization: Overview
Presentation transcript:

Cyborg Categorization Salvation for Search? Tom Reamy Information Architect Charles Schwab © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

Categorization Explosion l Autonomy l Semio l Verity l Inxight l Topical Net l Mohomine l Simile l H5Technologies l GammaSite l MetaTagger l Applied Semantics l Sageware l SmartLogik l Quiver l PurpleYogi l Other - Tacit

Categorization: Why Now? l Forrester: Must Search Stink? l Browse and Search l Need a Taxonomy l Problem: Expensive to develop Taxonomies l Buy Search to get Categorization

News Feeds - Corporate Intranets l News Feeds and Content providers –uniform content, size and structure –professional writers –Simple or standard vocabulary l Corporate intranet –Wildly varied content –Mix of good, bad, and ugly writers –Tower of Babel: Acronyms, special meanings © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

Auto-Categorization: the How l Rules l Catalog by Example l Statistical Clustering l Support Vector Machines l Machine Learning l World Knowledge © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

Automatic vs. Humanatic l Humans are better, but not as consistent –General bin, understandable mistakes –Bring outside contexts to the document l Purpose, similar documents, common sense l Computers are faster and cheaper. –Faster yes, Cheaper ? –Cost of poorer quality categorization l Intranet: 20,000 users taking 60 seconds longer = $20,000 a week © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

The Answer is Cyborg l Integration not Assimilation l Human and Computer Integration –Iterative, distributed work flow, ease of use l Cyborg and Content Management –Categorization and keywords by Subject Matter Experts l Cyborg and Search –Computers and people learn from each other

Create the Taxonomy l Top Level Taxonomy Categories –Human intensive, Cluster - random creativity l Grow the Taxonomy - 2nd - 3rd Levels –Humans - create rules, select training sets –Computers - Taxonomy Builders, Refine rules or training sets l Essential Feature –White Box Categorization –Customize algorithm, not just results © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

Refine the Taxonomy l Initial Phase: Information Architect Effort l Suggest –Provisional Categorization, Meta Data –Automatic Summarization l Support –Distributed Work flow –Visualization of taxonomic relationships © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

Maintain the Taxonomy l Intranets - ongoing human efforts –Can’t pass on the cost to your customers - they work for the same company as you l Continue and Improve Refinement –Collaborative Categorization l Features: –Smart Learning categorization –Integration - Content management, Search © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

Apply the Taxonomy l Integration of Search and Categorization –Browse and Search –Real time clustering, customiztion of results –support collaborative filtering l Integration with Content Management –Integrated Distributed Work Flow –Support Taxonomic Publishing Model l Integration with Expertise & Processes © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

Lessons Learned l Out of the Box, Out of Your Mind l Play well with others l Brain surgery is funl l World revolves around you l Quality counts and size matters l Let a Hundred flowers Bloom l The End © 2001 Charles Schwab & Co., Inc., member NYSE/SIPC. All rights reserved. ( )

The END l Really.