Taxonomies, Lexicons and Organizing Knowledge

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Taxonomy as Content Outline, Site Map and Search Aid SLA NWR Vancouver October 6, 2006 Marjorie M.K. Hlava President
Endeca Taking a different path Cindi Holt Information Services Manager September, 2007.
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
IPY and Semantics Siri Jodha S. Khalsa Paul Cooper Peter Pulsifer Paul Overduin Eugeny Vyazilov Heather lane.
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
Taxonomies, Lexicons and Organizing Knowledge Wendi Pohs, IBM Software Group.
Introduction KWizCom Business Card Founded in 2005 Headquartered in Toronto Global provider of add-ons and services customers worldwide Business.
Leveraging Your Taxonomy to Increase User Productivity MAIQuery and TM Navtree.
Information and Business Work
Taxonomies in Electronic Records Management Systems May 21, 2002.
Semantic Search Jiawei Rong Authors Semantic Search, in Proc. Of WWW Author R. Guhua (IBM) Rob McCool (Stanford University) Eric Miller.
Expanding Enterprise Roles for Librarians Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services
Libraries and Institutional Content Management Systems
Sunday May 4 – 5 PM Bradford, Hlava, McNaughton
Implementing Metadata Marjorie M K Hlava, President Access Innovations, Inc. Albuquerque, NM
® IBM Software Group © IBM Corporation IBM Information Server Metadata Management.
Text Analytics And Text Mining Best of Text and Data
Cutting Through the Clutter Searching the Web. There is a wealth of information waiting for you on the internet, if you know the right tools to use and.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Enterprise Asset Management
Landing the Raven: Positioning the Knowledge Discovery System in the Enterprise Wendi Pohs, Iris Associates
Get More Value from Your Reference Data—Make it Meaningful with TopBraid RDM Bob DuCharme Data Governance and Information Quality Conference June 9.
SharePoint Users Group Content Classification Step by Step SharePoint 2007 and 2010.
Controlled Vocabulary & Thesaurus Design Planning & Maintenance.
M ODULE 5 – S HARE P OINT 2010 C ONTENT T YPES.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Enterprise Information Management WITH SHAREPOINT SERVER 2013.
Terminology and Standards Dan Gillman US Bureau of Labor Statistics.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Real World Case Study KM Summer Institute June Rano Joshi, Vorsite.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Text Analytics A Tool for Taxonomy Development Tom Reamy Chief Knowledge Architect KAPS Group Program Chair – Text Analytics World Knowledge Architecture.
IA Tools to Inform IA Summit 2003 Madonnalisa G. Chan.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Search Strategies & Catalog Instruction Frederic Murray Assistant Professor MLIS, University of British Columbia BA, Political Science, University of Iowa.
Module 9 User Profiles and Social Networking. Module Overview Configuring User Profiles Implementing SharePoint 2010 Social Networking Features.
Achieving Semantic Interoperability at the World Bank Designing the Information Architecture and Programmatically Processing Information Denise Bedford.
Controlled Vocabulary & Thesaurus Design Associative Relationships & Thesauri.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Empowering the Knowledge Worker End-User Software Engineering in Knowledge Management Witold Staniszkis The 17th International.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Transportation Agenda 19. Transportation Your Role: Designer Designers organize SharePoint content and determine how to display that content Typical tasks.
Information Literacy University of Namibia Library 2006.
ServiceNow Implementation Workshop CMS Self Service Portal.
DARE: Domain analysis and reuse environment Minwoo Hong William Frakes, Ruben Prieto-Diaz and Christopher Fox Annals of Software Engineering,
Witold Staniszkis Empowering the Knowledge Worker End-User Software Engineering in Knowledge Management Witold Staniszkis
Slides Template for Module 3 Contextual details needed to make data meaningful to others CC BY-NC.
Information Organization: Overview
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Chapter 6 Database Design
IDPro Taxonomy … and Body of Knowledge
Federated & Meta Search
PolyAnalyst Data and Text Mining tool
European Network of e-Lexicography
From a thesaurus standard to a general knowledge organization standard?! 04/12/2018.
Transportation Research Thesaurus:
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Metadata in Digital Preservation: Setting the Scene
Magnet & /facet Zheng Liang
Overview of Oracle Site Hub
KNOWLEDGE MANAGEMENT (KM) Session # 40
Zach Wahl and Tatiana Baquero Project Performance Corporation (PPC)
Product Overview.
Information Organization: Overview
Microsoft Azure Data Catalog
Presentation transcript:

Taxonomies, Lexicons and Organizing Knowledge Wendi Pohs, IBM Software Group, wpohs@us.ibm.com Infotoday 2003 Content Management Symposium May 8, 2003 11/8/2018

Agenda Benefits, business and technical Definitions Planning and Implementation Issues Futures Q&A 11/8/2018

The Mantra Knowledge is in the eye of the beholder, but reflecting end user needs is as critical as representing texts....and it takes work! 11/8/2018

Business Benefits - Lifecycle Integration eLearning Technology, Government, Pharmaceutical Regulatory Compliance Pharmaceutical, Government Corporate accountability Financial, Life Sciences Intellectual Capital Management Consulting, Law firms, Financial Innovation, Discovery Government, Pharmaceutical, Retail, Technology 11/8/2018

Technical Benefits Integration with content management systems Site creation Site navigation Enhance full text search Gap analysis Personalization Defining skills, areas of expertise 11/8/2018

Definitions: Taxonomy “The science, laws or principles of classification” (From the Greek: rules of arrangement) Biology (Linnaeus) Education (Bloom) A hierarchical collection of categories and documents Structure and content 11/8/2018

Definitions: Lexicon A word book or dictionary Vocabulary of a particular field of study Keywords, synonyms, jargon 11/8/2018

Definitions: Directory More general than taxonomy Natural structure Wide vs deep Category structure less controlled File system Yahoo (http://www.yahoo.com) Yellow Pages Corporate Web sites (http://www.ibm.com) 11/8/2018

Definitions: Thesaurus Controlled vocabulary Subject headings, labels Synonyms (U, UF) Relation types (TT, BT, NT,SN, HN, RT, SA) Examples: http://www.loc.gov/flicc/wg/taxonomy.html 11/8/2018

Definitions: Meta-data and Tags Properties, attributes: information describing types of data [Crandall] The ‘energy’ required to keep things organized [Earley] Tags <META>, <Source> Document Properties $CreatedBy 11/8/2018

Definitions: Classification Analyzing documents and assigning them to predefined categories Rule-based vs natural Statistical vs semantic Classification schemes Dewey Library of Congress Industry-specific 11/8/2018

Planning: Initial Analysis Determine user needs thru content, knowledge audits What is the objective of the system? What are typical "day in the life" scenarios? Do you need to comply with existing standards? Select representative content No need to include every document in every source Look for a subset of documents with Good meta-data (Titles, Authors) Rich, representative body text 11/8/2018

Implementation: Creation and strategy Create an initial taxonomy On paper, on a whiteboard, in a spreadsheet Look at existing databases, Web sites, org charts Reuse good, representative categories Prototype taxo structure (flat, hierarchy, associative) Review the initial taxonomy Determine a categorization strategy Rules-based, keyword-based, statistical, others Review taxonomy creation, content management tools Consider resource requirements for taxonomy maintenance 11/8/2018

Implementation: Testing and Maintenance Test the taxonomy Track queries to determine accuracy Enable categorization to test strategies; refine if necessary Test with disparate user groups Maintain taxonomy Establish a workable change-management process Move documents, promote/demote categories, merge/delete as necessary Add more content Iterate 11/8/2018

Issues: Understand the BIG issues Maintenance Content expert or info professional Multiple taxonomies Organizational “perfection complex” [Chait] Categorization strategy Manual, automatic, both 11/8/2018

Issues: Multiple taxonomies Many editors Term approval process, synonyms Standard tools across the enterprise Federated taxonomies Taxonomy links, “cross-connections,” facets, views Taxonomy mapping 11/8/2018

Futures Methods: Feature extraction, statistical analysis, rules-based, better semantics, label generation Starter taxonomies, imports "Plug and play" classifiers Taxonomy mapping Interfaces: Visualization, better training tools Semantic Web 11/8/2018

11/8/2018

Q&A 11/8/2018