Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Chapter 11 Designing the User Interface
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Object-Oriented Analysis and Design
Search Engines and Information Retrieval
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Automating Discovery from Biomedical Texts Marti Hearst & Barbara Rosario UC Berkeley Agyinc Visit August 16, 2000.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
A Flexible Workbench for Document Analysis and Text Mining NLDB’2004, Salford, June Gulla, Brasethvik and Kaada A Flexible Workbench for Document.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
Presented by Zeehasham Rasheed
Kari R. Schougaard, PhD Stud. Værktøjer og Teknikker, 2006 UNIVERSITY OF AARHUS Department of Computer Science Unified Modeling Language Visual language.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Chapter 13: Designing the User Interface
Basic Concepts The Unified Modeling Language (UML) SYSC System Analysis and Design.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Search Engines and Information Retrieval Chapter 1.
Slide 1 Wolfram Höpken RMSIG Reference Model Special Interest Group Second RMSIG Workshop Methodology and Process Wolfram Höpken.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Modern Information Retrieval Computer engineering department Fall 2005.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Programming in Java Unit 3. Learning outcome:  LO2:Be able to design Java solutions  LO3:Be able to implement Java solutions Assessment criteria: 
-1- Philipp Heim, Thomas Ertl, Jürgen Ziegler Facet Graphs: Complex Semantic Querying Made Easy Philipp Heim 1, Thomas Ertl 1 and Jürgen Ziegler 2 1 Visualization.
Sharing lessons through effective modelling Hilary Dexter University of Manchester Tom Franklin Franklin Consulting.
TOPIC CENTRIC QUERY ROUTING Research Methods (CS689) 11/21/00 By Anupam Khanal.
UMLS Unified Medical Language System. What is UMLS? A Unified knowledge representation system Project of NLM Large scale Distributed First launched in.
Unified Modeling Language* Keng Siau University of Nebraska-Lincoln *Adapted from “Software Architecture and the UML” by Grady Booch.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Mining the Biomedical Research Literature Ken Baclawski.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
Exam 2 Review Software Engineering CS 561. Outline Requirements Development UML Class Diagrams Design Patterns Users, Usability, and User Interfaces Software.
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
5. 2Object-Oriented Analysis and Design with the Unified Process Objectives  Describe the activities of the requirements discipline  Describe the difference.
Object Oriented Analysis & Design By Rashid Mahmood.
Information Retrieval in Practice
Witold Staniszkis Empowering the Knowledge Worker End-User Software Engineering in Knowledge Management Witold Staniszkis
Advanced Computer Systems
The Semantic Web By: Maulik Parikh.
Visual Information Retrieval
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Sharing lessons through effective modelling
Modern Information Retrieval
Independent Study of Ontologies
Introduction to Unified Modeling Language (UML)
Information Retrieval and Web Search
Presented by: Hassan Sayyadi
Natural Language Processing (NLP)
Systems Analysis and Design in a Changing World, 6th Edition
Information Retrieval and Web Search
Social Knowledge Mining
Informatics 121 Software Design I
University of Houston-Clear Lake
CSE 635 Multimedia Information Retrieval
Introduction to Systems Analysis and Design Stefano Moshi Memorial University College System Analysis & Design BIT
Semantic Markup for Semantic Web Tools:
Ying Dai Faculty of software and information science,
Information Networks: State of the Art
CS246: Information Retrieval
Natural Language Processing (NLP)
Chapter 10: Compilers and Language Translation
Introduction to Search Engines
Natural Language Processing (NLP)
Presentation transcript:

Kenneth Baclawski et. al. PSB 2000 2002/11/7 Sa-Im Shin Knowledge Representation and Indexing using the Unified Medical Language System Kenneth Baclawski et. al. PSB 2000 2002/11/7 Sa-Im Shin

Contents Introduction Related Work Using the UMLS to Express Document Semantics Constructing Knowledge Representations Semantic Based User Interfaces Usability Survey Survey Results Conclusion After introduction and related work, I’ll explain the method of using ~, Constructing ~ and semantic ~. Then we’ll talk about usability ~ and survey ~, finally make conclusion.

Introduction Document Understanding Syntactic understanding Lexical scanning, Morphological Analysis, Parsing Semantic Understanding : Ontology Framework for understanding & representing knowledge Meaning of the vocabulary terms <- Vocabulary in the community Unified Medical Language System (UMLS) Rich & well structured ontology in biomedical domain Need for adaptation for application There are tow kind of document ~. One is the syntactic ~ like lexical ~. Our focused understanding is semantic ~ especially ontology. Ontology is the framework ~. And meaning ~ is represented by vocabulary ~. Unified ~ is the one of the famous ontology in bioinformatics. UMLS is rich ~, but it need some processing for adaptation ~.

Related Work Ontology-based techniques Representing, indexing & retrieving documents First stage (1992 ~): Relatively small ontology World Wide Web (1996~ ) General large ontology OntoSeek, InQuizit Ontology for biological knowledge Hafner, Fridman, Schulze-Kremer UMLS Larger than OntoSeek Knowledge representation by text processing Using typical NLP techniques Since 1990, there have been many ontology-based effort for representing, ~. At first, relatively small ontology is constructed. With the spread of world ~, general and large ~ appears like OntoSeek and InQuizit. Hafner, ~ are ontology ~. And UMLS is larger ~ using ~

Using the UMLS to Express Document Semantics Ontology components for knowledge representation Semantic categories (types) : 130 Semantic relationships : 50 Categorical links : 7000 Semantic concepts : 475,000 Concept maps : Mapping from source data to semantic concept (1,000,000) Categorization : Semantic concept to semantic category (600,000) Conceptual links : Concepts & Relationship (372,808) Explanation Knowledge Representation Rules Ontology ~ are like these. The numbers in this list are the number in the UMLS.

Constructing Knowledge Representations System interface Input : Document Output : visual representation Background knowledge Domain-independent knowledge Syntactic knowledge, POS & grammar rules Domain-dependent knowledge Semantic knowledge, Terminology of the domain This paper classifies background ~ into domain-~ and domain-~. Domain-~ contains syntactic ~, and semantic ~ are domain-.

Constructing Knowledge Representations Lexeme -> conceptual units (tokens) Original source data -> semantic concept of ontology Extracting elementary units (lexeme) Extracting protein & chemical names Parse tree & anaphoric references -> knowledge representation Knowledge representation rules : Condition & action Fragmenting & indexing Distributed high-performance indexing engine Sequence of tokens -> parse tree Anaphora construction : Pronoun -> Anaphoric references Reducing ambiguity by ontology This figure show the whole processing for constructing ~. First scanning module, we ~, used domain independent ~ is word ~ and domain-dependent ~ is the domain-~. Then throughout tokenization, we ~. We use ~ in domain-independent ~ and ~ in domain-dependent ~. In parsing ~, we ~ with domain-independent grammar ~ and domain-dependent relationship ~, and the result of this module are parse ~. With these parse ~ we represent knowledge with knowledge ~ containing various condition ~. And for fragmenting ~, we use distributed ~.

Semantic Based User Interfaces Developing user interface Insight into the queries & retrieved data Keynets Diagrammatic knowledge representation User interface is developed by diagrammatic ~ keynets to help understand queries ~.

Usability Survey Line Box question, description of a pharmacological product Medline keywords in the text appearing in the UMLS Corresponding biomedical keynet representation of biomedical text Line Relationship between two concepts Label : relationship type (property) Dot : Source of relationship (statement) cf) object / predicate This figure shows the shape of user survey. Diagram in the center is keynet representation. Box represents ~ with word ~. Line shows the relationship ~. Label express relationship ~ like “part of ~”, and dot mark source ~. And especially in the case of “part of”, we divide boxes into object ~. Survey contains three example. Biomedical text is a question, ~, biomedical keywords are medline ~, and previous biomedical keynet. Box Representing semantic concept Word or short phrase in UMLS

Usability Survey Feedback Semantic differential Feedback for keynet Interpreted nine questions Gauging the subject’s overall reaction 0 to 9 Feedback for keynet Using semantic differential Presentation of keynet Comparison of three examples Also we feedback our results using semantic ~. Semantic ~ is interpreted ~ to gauge ~ and the scale is 0 ~. Feedback evaluate the presentation ~ by comparison ~.

Survey Results Ease of Understanding Limit complexity High understandability Limit complexity More detailed views when user’s request Keyword vs. Keynet Keynet is more useful than keyword in some cases These survey results help easy understanding and limit complexity for high ~. More ~ are provided when ~. And keynet ~.

Conclusion Role & requirements of ontology Evaluation Constructing knowledge representation NLP tools for Using UMLS ontology Evaluation Usefulness of representing biomedical knowledge Favorable results As a conclusion, We construct knoledge ~ and NLP ~. and we evaluste the usefulness ~ then got favorable ~.

Future Works Short term goal Long term goal Reflecting Feedback Development of new class of semantic-based search engine New usability survey Long term goal Dynamic interaction with user Expanding subject areas Short term future ~ are reflecting ~ to ~ and new ~. Long term goal is implementing dynamic ~ and expanding ~.

Discussion Issue I think this paper lacks of evaluation part. In fact, evaluation for ontology and representation is difficult problem. Do you have good idea about evaluation methods for these topics ? What do you think about important evaluated faces in these topics ? In this paper, system provide user feedback for developing system. This feedback results contains user’s evaluation and grade of various aspects about the system. And I think usefulness of the expression in other application system is also very important factor. So, we can apply this result into input of other application system, and can compare the performance with other representation or without this result.