Quality Taxonomies Jim Nisbet Senior Vice President of Technology Semio Corporation Knowledge Technologies 2001 March 5 th, 2001.

Slides:



Advertisements
Similar presentations
1 Open Ontology Repository Planning Meeting for Ontology repositories: approaches, technologies, collaboration Ken Baclawski June 15, 2009.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Chapter 1: Introduction
 Copyright 2006 Digital Enterprise Research Institute. All rights reserved. The Future is Now JeromeDL A Digital Library on Social Semantic.
System Integration Verification and Validation
Why, what were the idea ? 1.Create a data infrastructure, 2.Data + the knowledge products that are produced on the basis of data a) Efficiant access to.
Data Modeling and Database Design Chapter 1: Database Systems: Architecture and Components.
Taxonomies of Knowledge: Building a Corporate Taxonomy Wendi Pohs, Iris Associates
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Information and Business Work
TC3 Meeting in Montreal (Montreal/Secretariat)6 page 1 of 10 Structure and purpose of IEC ISO - IEC Specifications for Document Management.
Developed by Reneta Barneva, SUNY Fredonia Product Metrics for Software.
Unit 251 Implementation and Integration Implementation Unit Testing Integration Integration Approaches.
Environmental Terminology System and Services (ETSS) June 2007.
SchemaLogic Workshop Part 2 Tools for Enterprise Metadata Management and Synchronization Prepared for the University of Washington Information School Applied.
Thesaurus Design and Development
SE 555 – Software Requirements & Specifications Introduction
1 Languages for aboutness n Indexing languages: –Terminological tools Thesauri (CV – controlled vocabulary) Subject headings lists (CV) Authority files.
Introduction to Software Testing
CS 4310: Software Engineering
Quality of Information systems. Quality Quality is the degree on which a product satifies the requirements Quality management requires that : that requirements.
Software Quality SEII-Lecture 15
S/W Project Management Software Process Models. Objectives To understand  Software process and process models, including the main characteristics of.
Quality Assurance ITEC Rick Price. Expectations This course is not purely a lecture course – Classroom participation is a large portion – Everyone.
Controlled Vocabulary & Thesaurus Design Planning & Maintenance.
Topics Covered: Software requirement specification(SRS) Software requirement specification(SRS) Authors of SRS Authors of SRS Need of SRS Need of SRS.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Software System Engineering: A tutorial
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
1 Software Quality CIS 375 Bruce R. Maxim UM-Dearborn.
Software Quality Applied throughout SW Engineering Process Encompasses ▫ Analysis, design, coding, testing, tools ▫ Formal tech reviews ▫ Multi-tiered.
 Explain the role of a system analyst.  Identify the important parts of SRS document.  Identify the important problems that an organization would face.
Topic (1)Software Engineering (601321)1 Introduction Complex and large SW. SW crises Expensive HW. Custom SW. Batch execution.
1 Software quality - Definition IEEE 1. The degree to which a system, component, or process meets specified requirements. 2. The degree to which a system,
Software Requirements Engineering CSE 305 Lecture-2.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Software Measurement & Metrics
IT Requirements Management Balancing Needs and Expectations.
Some Sub-Activities within Requirements Engineering 1.Prototyping 2.Requirements Documentation 3.Requirements Validation 4.Requirements Measurements 5.Requirements.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
EPA’s Environmental Terminology System and Services (ETSS) Michael Pendleton Data Standards Branch, EPA/OEI Ecoiformatics Technical Collaborative Indicators.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
BE-SECBS FISA 2003 November 13th 2003 page 1 DSR/SAMS/BASP IRSN BE SECBS – IRSN assessment Context application of IRSN methodology to the reference case.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
Linking Tasks, Data, and Architecture Doug Nebert AR-09-01A May 2010.
OHT 1.1 Galin, SQA from theory to implementation © Pearson Education Limited 2004 The uniqueness of software quality assurance The environments for which.
About Quality Pre paired By: Muhammad Azhar. Scope What is Quality Quality Attributes Conclusion on software Quality Quality Concepts Quality Costs.
1 15 quality goals for requirements  Justified  Correct  Complete  Consistent  Unambiguous  Feasible  Abstract  Traceable  Delimited  Interfaced.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
Quality Models in Software Engineering Literature: An Analytical and Comparative Study Rafa E. Al-Qutaish, PhD Al Ain University of Science and Technology.
1 Quality Attributes of Requirements Documents Lecture # 25.
Chapter 2 Object-Oriented Paradigm Overview. Getting Acquainted with the Class Project Read the requirements specification carefully Make note of any.
Software Testing and Quality Assurance 1. What is the objectives of Software Testing?
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Software Requirements Specification Document (SRS)
Requirements Analysis
Metrics "A science is as mature as its measurement tools."
CS3320-Chap21 Office Hours TR 1:00-2:15 PM W 2:30-3:30 PM By appointment.
Software Engineering Lecture 10: System Engineering.
Semantics and the EPA System of Registries Gail Hodge IIa/ Consultant to the U.S. Environmental Protection Agency 18 April 2007.
Requirements. Outline Definition Requirements Process Requirements Documentation Next Steps 1.
Chapter 2 Object-Oriented Paradigm Overview
16CS202 & Software Engineering
Source & Courtesy: Doc. S. Dapkūnas
Lecture 15: Technical Metrics
Software engineering Lecturer: Nareena.
Introduction to Software Testing
Application of new TQM framework
Presentation transcript:

Quality Taxonomies Jim Nisbet Senior Vice President of Technology Semio Corporation Knowledge Technologies 2001 March 5 th, 2001

Ontology / Taxonomy Root Ontology Taxonomy Generation Static Discovery Dynamic Discovery

What is Quality ?  “Best value for the money”  According to this definition, you are entitled to get high performance from a costly product; likewise a low cost product or service is expected to be a poor delivery. For example, a loose demo delivery is both predictable and acceptable, since its quality is: low conformance / low cost.

What is Quality ?  “Good Quality is Nominal Conformance”  Taxonomy Quality is defined as Taxonomy Conformance to: Valid requirements; Explicitly documented development standards; and, Implicit characteristics that are expected of all professionally developed taxonomies, such as the desire for good maintainability.

Standards  ISO International Organization for Standardization. Documentation—Guidelines for the Establishment and Development of Monolingual Thesauri. 2nd ed. n.p.: ISO, (ISO (E)). (Available in the U.S. from American National Standards Institute)  ISO International Organization for Standardization. Documentation—Guidelines for the Establishment and Development of Multilingual Thesauri. n.p.: ISO, (ISO (E)). (Available in the U.S. from American National Standards Institute)  ANSI/NISO Z National Information Standards Institute. Guidelines for the Construction, Format, and Management of Monolingual Thesauri. Bethesda, MD: NISO Press, p. (ANSI/NISO Z )  SEMIO Quality Plan v  ISO/IEC Topic Maps  RDF Please refer to RDF at and XML at

Project Plan 1.Kick-off 2.Requirements Review 3.Lexicon Review 4.Taxonomy Review 5.Tags Review 6.Final Review

1. Kick-off  Objectives Purpose Scope Scale Users Conditions of receipt  Roles Supplier Customer –Admin –KE –Experts –Users  Planning  Training and Transfer

2. Requirements Review  Sources  Lexicon  Ontology  Install

Sources  Dispersion (Multiplicity, Size, Homogeneity)  Refresh  Access

Typical Patterns  Disparity  Adjust sources  Adjust crawl strategy  Isolate communities / taxonomies

Lexicon  Vocabularies, etc.  Substitutions: Acronyms, Synonyms, etc.  Preferred Keywords: Brand Names, etc.  Banned Keywords

Typical Patterns  Lack of requirements  Use Librarian Resources

Ontology  Thesaurus ?  Is the information domain analysis complete, consistent, and accurate ?  Is the partitioning of the problem complete ?

Typical Patterns  Directory versus Taxonomy  Isolate “directory” branches  Thesaurus versus Taxonomy  Put an ontology on top of thesaurus  Check ASAP match of thesaurus generics with extracted lexicon  Very high level design for top categories requirements  Plan to work bottom-up  See also Taxonomy (functions, combinations, etc.)

Install  Implementation / Integration: Are external and internal interfaces properly defined? Are all requirements traceable to the system level? Has prototyping been conducted for the user/customer? Is performance achievable within the constraints imposed by other system elements? Are requirements consistent with schedule, resources, and budget?

Typical Patterns  Scale  Security  Missing Documents

3. Lexicon Review  Coverage Extracted words / Words (Extracted Index / Index)  Sources bench-marking Coverage Extraction quality Topic distribution  Structure Most Frequent Phrases Most Productive Generics  Substitutions  Exceptions

Typical Patterns  Low level of frequency / quality for the most meaningful content  Increase size of value corpus  Filter and re-import lexicon

4. Taxonomy Review  Taxonomy Operation Correctness Reliability Usability Integrity Efficiency  Taxonomy Revision Maintainability Flexibility Testability  Taxonomy Transition Portability Reusability Interoperability

Tax Liability Loan Term loan Short-term loan Unique Beginner Life Form Generic Specific Varietal Folk Taxonomies Design The Berlin and Kay model: Taxonomy = Nomenclature + Terminology

Correctness  Accuracy  Completeness  Consistency

Accuracy  Precision  Recall

Completeness TaxonomyMapsLexiconCollection

Concentration Works Against Quality Lexicon Document Collection Maps Taxonomy Tagging  Tagging Coverage  Ontology Coverage  Hook Coverage  Map Coverage  Lexical Coverage  Collection Coverage

Consistency: Typical Patterns  Objectivization  Hyperonymy  Speciation  Necessity

Objectivization Employment Firing Hiring Salaries  Avoid functional categories  Don’t mix functions / objects  Exhaust scripts  Match idiomatic phrases

Genericity Parts Air Conditioning Belts and Hoses Body Brake System Chassis Engine Exhaust System Fuel System Glass Ignition  Avoid meronymy  Don’t mix meronymy / hyperonymy  Exhaust prototypes

Speciation Person Unwelcome person Unpleasant person Selfish person Opportunist Backscratcher  Avoid “strings” of categories  Avoid (non-idioms) properties for categories (WordNet)

Necessity  Avoid non-productive categories  Avoid combinations of categories

Nomenclature (Design Structure) Quality Index  Depth  Width  Balance

Complexity Index  Cyclometric complexity increases with number of Cross References within the Taxonomy, giving an indication of complexity and difficulty of testing.  Taxonomy Complexity Index combines: autonomy closure similarity typicality commonality redundancy stability

Maturity index  The IEEE standard suggests a taxonomy maturity index to provide an indication of the stability of the taxonomy.  Maturity Index combines: number of modules in current ontology / taxonomy. number of modules in current ontology / taxonomy that have been changed. number of modules added to current ontology / taxonomy. number of modules deleted from the previous version of the ontology / taxonomy.

5. Tags Review  Document coverage  Concepts coverage Liability Federal Funds 0.746

6. Final Review  Receipt  Maintenance

Quality Taxonomies Jim Nisbet Knowledge Technologies 2001