Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters 1 - 5 Presented by Sole.

Slides:



Advertisements
Similar presentations
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Advertisements

Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.
01 -1 Lecture 01 Artificial Intelligence Topics –Introduction –Knowledge representation –Knowledge reasoning –Machine learning –Applications.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 The Enhanced Entity- Relationship (EER) Model.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Presented by Zeehasham Rasheed
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Machine Learning in Natural Language Processing Noriko Tomuro November 16, 2006.
Data Mining – Intro.
12 -1 Lecture 12 User Modeling Topics –Basics –Example User Model –Construction of User Models –Updating of User Models –Applications.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Introduction to Machine Learning Approach Lecture 5.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Data Mining Techniques
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Chun-Hung Chou
Knowledge Acquisition. Concepts of Knowledge Engineering Knowledge engineering The engineering discipline in which knowledge is integrated into computer.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Knowledge representation
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
Some working definitions…. ‘Data Mining’ and ‘Knowledge Discovery in Databases’ (KDD) are used interchangeably Data mining = –the discovery of interesting,
Dimitrios Skoutas Alkis Simitsis
Learning from observations
Discovering Descriptive Knowledge Lecture 18. Descriptive Knowledge in Science In an earlier lecture, we introduced the representation and use of taxonomies.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Machine Learning Chapter 5. Artificial IntelligenceChapter 52 Learning 1. Rote learning rote( โรท ) n. วิถีทาง, ทางเดิน, วิธีการตามปกติ, (by rote จากความทรงจำ.
Copyright R. Weber Machine Learning, Data Mining INFO 629 Dr. R. Weber.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
KU NLP Machine Learning1 Ch 9. Machine Learning: Symbol- based  9.0 Introduction  9.1 A Framework for Symbol-Based Learning  9.2 Version Space Search.
Prepared by: Mahmoud Rafeek Al-Farra
Use of FCA in the Ontology Extraction Step for the Improvement of the Semantic Information Retrieval Peter Butka TU Košice, Slovakia.
Some questions -What is metadata? -Data about data.
Extracting Keyphrases to Represent Relations in Social Networks from Web Junichiro Mori and Mitsuru Ishizuka Universiry of Tokyo Yutaka Matsuo National.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
1 Knowledge Acquisition and Learning by Experience – The Role of Case-Specific Knowledge Knowledge modeling and acquisition Learning by experience Framework.
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
Data Mining and Decision Support
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
TEXT CLASSIFICATION AND CLASSIFIERS: A SURVEY & ROCCHIO CLASSIFICATION Kezban Demirtas
Biological data representation and data mining Xin Chen
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Information Organization: Overview
Sentiment analysis algorithms and applications: A survey
ece 627 intelligent web: ontology and beyond
Data Mining: Concepts and Techniques Course Outline
Basic Intro Tutorial on Machine Learning and Data Mining
Prepared by: Mahmoud Rafeek Al-Farra
Presented by: Prof. Ali Jaoua
Text Categorization Berlin Chen 2003 Reference:
Information Organization: Overview
Presentation transcript:

Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters 1 - 5 Presented by Sole

Introduction Artificial intelligence Build systems that incorporate knowledge about a domain to reason on the basis of this knowledge and solve problems not encountered before Include explicit and symbolic representation of knowledge about a domain Symbolic representation and procedural aspects are separated so that it can be reused across systems Example for Yoyo the cat Which symbols to use and what they stand for?

Introduction Ontology Defines what is important in a domain and how concepts are related Knowledge-based system: determine which symbols are needed and how they are interpreted Logical level: interpretation can be constraint according to the ontology by axiomatizing symbols Issues Costly to construct Time-consuming Significant coverage of domain is needed Meaning and consistent generalization are required The trade-off between modeling a large amount of knowledge and providing as many abstractions as possible to keep the model concise makes ontology engineering indeed a challenging Knowledge Acquisition Bottleneck

Introduction Solution Automatically learn ontologies from data Goal: bridging the gap between World of symbols (words used in natural language) World of concepts (abstractions of human thought) Challenge Correctness and consistency of the model can not be guaranteed Human post-processing definitely necessary Automatically learned ontologies need to be inspected, validated, and modified by humans before they can be applied for applications relying on logical reasoning Rest of book will be about explaining in more details ontology learning

Ontologies Definition Philosophical discipline Computer Science Science of existence or the study of being Computer Science Formal specifications of a conceptualization Resources representing the conceptual model underlying a certain domain, describing it in a declarative fashion and thus cleanly separating it from procedural aspects

Ontologies Example

Learning from Text Ontology learning Acquire a domain model from data Lifting : XML-DTDs, UML diagrams, databases Semi-structured sources: HTML, XML Unstructured sources: ontology learning from text The task is inherently complex and challenging mainly due to two reasons. * There is typically only a small part of the authors' domain knowledge involved in the creation process, such that the process of reverse engineering can, at best, only partially reconstruct the authors' model. *World knowledge – unless we are considering a text book or dictionary - is rarely mentioned explicitly.

Learning from Text Meaning triangle Every language has symbols that evoke a concept that refers to a concrete individual in the world

Learning from Text Ontology population Learning concepts and relations Knowledge markup or annotation: select text fragments and assign them to an ontological concept Applications Several methods have been developed in recent years Challenge No consensus within ontology learning community on concrete tasks for ontology learning Comparison between approaches is difficult

Learning from Text Ontology learning tasks (layer cake)

Learning from Text Terms: Task: find a set of relevant concepts and relations E.g., words, multi-word compounds State-of-the-art IR methods NLP methods: POS tagger, statistical approaches Linguistic realization of domain specific concepts (keywords in a domain) Input: text Output: concepts

Learning from Text Synonyms: Task: find words which denote the same concept E.g., synsets on WordNet State-of-the-art Semantically-similar words Sense disambiguation and synonym discovery Latent Semantic Indexing (LSI) Statistical information measures defined over the Web to detect synonyms

Learning from Text Concepts: Task: find intentional definitions of concept, their extension, and lexical signs used to refer to them State-of-the-art Clusters of related terms LSI-based techniques Discovery of hierarchies of named entities Know-it-all system OntoLearn system Intension: natural language description of concept The Know-It-All system [Etzioni et al., 2004a] also aims at learning the extension of given concepts, such as, for example, all the actors appearing on the Web. In the approach of Evans [Evans, 2003], the concepts as well as their extensions are thus derived automatically, while Etzioni et al. [Etzioni et al., 2004a] essentially learn the extension of existing concepts. Finally, other systems learn concepts intentionally. The OntoLearn system [Velardi et al., 2005], for example, derives WordNet-like glosses for domain specific concepts on the basis of a compositional interpretation of the meaning of compounds.

Learning from Text Hierarchies: Task: concept hierarchy induction, refinement and lexical extension State-of-the-art Lexico-syntactic patterns Clustering algorithm to automatically derive concept hierarchies Analysis of term co-occurrence in same sentence/document

Learning from Text Relations: Task: learn relations identifiers or labels as well as their appropriate domain and range State-of-the-art Association rules Syntactic-dependencies Very few approaches address the issue of learning ontology relations from text

Learning from Text Axiom schemata instantiations: General axioms Task: learn which concepts, relations, or pair of concepts the axioms in a given system apply to General axioms Task: derive more complex relationships and connections between concepts and relations Logical interpretations constraining the interpretation of concepts and relations

Learning from Text Population: Task: learn instances of concepts and relations State-of-the-art Associated to well-known tasks for which a variety of approaches have been developed Information extraction Named entity recognition

Basics Natural Language Processing Basic formalisms and techniques necessary for understanding the rest of the book NLP-> “green car” green is the color value to describe car Co-references John Adams J. Adams

Syntactic analysis: parsing Basics NLP Pre-processing steps Chunking, also called shallow or partial parsing, applies shallow processing techniques (typically regular expressions and finite automata) to group together words to larger syntactic and meaning-bearing constituents, typically with a head which is modified by other words in the unit. Chunking Syntactic analysis: parsing

Syntactic dependencies Basics NLP Pre-processing The museum houses an impressive collection of medieval and modern art. The building combines geometric abstraction with classical references that allude to the Roman influence on the region. Bank River Financial Institution Contextual features Syntactic dependencies

Basics Similarity measures NLP Context is often represented as vector in high dimensional space E", the dimensions corresponding to words found in the context of the word in question. This vector-based context representation constitutes the core of the so called vector space model used in information retrieval

Basics Similarity measures Binary similarity measures NLP Similarity measures Binary similarity measures Geometric similarity measures

Basics Similarity measures Measures based on probability distribution NLP Similarity measures Measures based on probability distribution Hypothesis testing When assessing the degree of association between words, the HQ hypothesis assumes that the probability of the two words is independent of each other, i.e. Piwi,W2) = Piwi) P{w2) The independence hypothesis is rejected in case the observed probability is found to significantly differ from P{wi,W2) as defined above.

Basics Term relevance Weight the importance of a term in a document NLP Term relevance Weight the importance of a term in a document

Basics NLP WordNet Lexical database for the English language

Basics Formal concept analysis Formal objects: concepts + Formal attributes: characteristics describing objects Incidence relation: information about which attributes hold for each object = Formal context

Basics FCA Example

Basics FCA Example Mention that there exist algorithms that can be used to create FC

Basics Machine learning Automatic recognition/detection of patterns and regularities within sample data Patterns can be used to understand/describe the data or to make predictions Learning process Supervised Predicts the appropriate category for an example from a set of categories represented by a set of labels Unsupervised Search for common and frequent structures within the data (data exploration)

Basics Supervised learning Regression Classification ML Numeric prediction (labels are continue values) Classification Assign proper category to a given example Target value Feature vector

Basics Classifiers Tools Bayesian Classifiers Decision Trees ML Classifiers Bayesian Classifiers Decision Trees Instance-Based Learning Support Vector Machines Artificial Neural Networks Tools WEKA RapidMiner

Basics ML Examples

Basics Unsupervised learning ML Unsupervised learning Clustering: find groups of similar objects in data There is no labeled data to train from Classification Hierarchical vs. non-hierarchical Non-hierarchical algorithms produce a set of groups Hierarchical algorithms order groups in a tree structure Hard vs. soft Hard: elements are assigned to distinct clusters Soft: elements are assigned to clusters with a certain degree of membership

Basics Algorithms K-means Hierarchical clustering ML Algorithms K-means Hierarchical clustering Hierarchical Agglomerative (Bottom-Up) Clustering Divisive (Top-Down) Clustering

Datasets Corpus description

Datasets Concept hierarchies