Drawings as Models of Syntactic Structure: Theory and Algorithms by Mathias Möhl supervised by Marco Kuhlmann final talk of diploma thesis at Programming.

Slides:



Advertisements
Similar presentations
T YPE I SOMORPHISM O LA M AHMOUD S UPERVISED BY : D R. M ARCELO F IORE UNIVERSITY OF CAMBRIDGE Computer Laboratory Theory & Semantics Group I NTRODUCTION.
Advertisements

1 EP2210 Fairness Lecture material: –Bertsekas, Gallager, Data networks, 6.5 –L. Massoulie, J. Roberts, "Bandwidth sharing: objectives and algorithms,“
Based on: Petri Nets and Industrial Applications: A Tutorial
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 – CHAPTER 4 GRAPHS 1.
Transportation Problem (TP) and Assignment Problem (AP)
R Yun-Nung Chen 資工碩一 陳縕儂 1 /39.  Non-projective Dependency Parsing using Spanning Tree Algorithms (HLT/EMNLP 2005)  Ryan McDonald, Fernando.
GRAMMAR & PARSING (Syntactic Analysis) NLP- WEEK 4.
LTAG Semantics on the Derivation Tree Presented by Maria I. Tchalakova.
1 Introduction to Computability Theory Lecture5: Context Free Languages Prof. Amos Israeli.
1 Lecture 4 Topics –Problem solving Subroutine Theme –REC language class The class of solvable problems Closure properties.
CS5371 Theory of Computation
1 Structures and Strategies for State Space Search 3 3.0Introduction 3.1Graph Theory 3.2Strategies for State Space Search 3.3Using the State Space to Represent.
Validating Streaming XML Documents Luc Segoufin & Victor Vianu Presented by Harel Paz.
Semantics with Applications Mooly Sagiv Schrirber html:// Textbooks:Winskel The.
Degree A F H G B E D C. Adjacent A F H G B E D C.
The College of Saint Rose CIS 433 – Programming Languages David Goldschmidt, Ph.D. from Concepts of Programming Languages, 9th edition by Robert W. Sebesta,
Visibility Graphs and Motion Planning Kittiphan Techakittiroj for the Degree of Master of Science Department of Computer Science, Ball State University,
DIPLOMA THESIS Peter Černo Clearing Restarting Automata Supervised by RNDr. František Mráz, CSc.
Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi and introduced in Tree-adjoining grammars are somewhat similar to context-free.
Understanding PML Paulo Pinheiro da Silva. PML PML is a provenance language (a language used to encode provenance knowledge) that has been proudly derived.
LDK R Logics for Data and Knowledge Representation Modeling First version by Alessandro Agostini and Fausto Giunchiglia Second version by Fausto Giunchiglia.
THE BIG PICTURE Basic Assumptions Linguistics is the empirical science that studies language (or linguistic behavior) Linguistics proposes theories (models)
Syntax Analysis The recognition problem: given a grammar G and a string w, is w  L(G)? The parsing problem: if G is a grammar and w  L(G), how can w.
Chapter 19 Implementing Trees and Priority Queues Fundamentals of Java.
Topics in artificial intelligence 1/1 Dr hab. inż. Joanna Józefowska, prof. PP Reasoning and search techniques.
1 Steiner Tree Algorithms and Networks 2014/2015 Hans L. Bodlaender Johan M. M. van Rooij.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 4.
Image task, Set and Lighting Year 11 Revision. Look at the images you have been given: Write 1 comment about the costume Write 3 descriptive words about.
Saeid Pashzadeh Jan 2009 Theory of Computation 1.
Daisy Arias Math 382/Lab November 16, 2010 Fall 2010.
Management of XML and Semistructured Data Lecture 10: Schemas Monday, April 30, 2001.
Computer Science 210 Computer Organization Course Introduction.
Informal Description f(x) is continuous at x=c if and only if there are no holes, jumps, skips or gaps in the graph of f(x) at c.
THEORY OF COMPUTATION Komate AMPHAWAN 1. 2.
Recursive Data Structures and Grammars Themes –Recursive Description of Data Structures –Grammars and Parsing –Recursive Definitions of Properties of Data.
LDK R Logics for Data and Knowledge Representation Propositional Logic Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto Giunchiglia,
Why Study Automata? What the Course is About Administrivia 1 Welcome to CSE309.
CS6234 Advanced Algorithms - Convex hull. Terminologies – ◦ Convex hull of a set Q of points is the smallest convex polygon P for which each point in.
1 The tree data structure Outline In this topic, we will cover: –Definition of a tree data structure and its components –Concepts of: Root, internal, and.
Final Review  Syntax  Semantics/Pragmatics  Sociolinguistics  FINAL will be part open book, and part closed book  Will use similar tasks as Problem.
Machine Learning Lecture 1: Intro + Decision Trees Moshe Koppel Slides adapted from Tom Mitchell and from Dan Roth.
Grammar Set of variables Set of terminal symbols Start variable Set of Production rules.
1. For minimum vertex cover problem in the following graph give
DEFINITIONS FOR DR. HALVERSON’S CLASSES. DEFINING TERMS A correctly formed DEFINITION of a term consists of 2 parts  Noun – statement of the basic nature.
Mid-Terms Exam Scope and Introduction. Format Grades: 100 points -> 20% in the final grade Multiple Choice Questions –8 questions, 7 points each Short.
The Church-Turing Thesis Chapter Are We Done? FSM  PDA  Turing machine Is this the end of the line? There are still problems we cannot solve:
Theory of Computation. Introduction to The Course Lectures: Room ( Sun. & Tue.: 8 am – 9:30 am) Instructor: Dr. Ayman Srour (Ph.D. in Computer Science).
Review : Theory of Computation. Regular Language and Finite Automata Context-free Language and Pushdown Automata Turing Machine and Recursive Enumerable.
CSCE 210 Data Structures and Algorithms
Graphs: Definitions and Basic Properties
Recursively Enumerable and Recursive Languages
Formal Language & Automata Theory
Chapter 3 Context-Free Grammar and Parsing
Review : Theory of Computation
Context Sensitive Grammar & Turing Machines
Minimum Spanning Tree 8/7/2018 4:26 AM
LIMITS OF ALGORITHMIC COMPUTATION
Formal Language.
Managing XML and Semistructured Data
By John E. Hopcroft, Rajeev Motwani and Jeffrey D. Ullman
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Logics for Data and Knowledge Representation
Theory of Computation Lecture #
Ling 566 Oct 14, 2008 How the Grammar Works.
PRESENTATION: GROUP # 5 Roll No: 14,17,25,36,37 TOPIC: STATISTICAL PARSING AND HIDDEN MARKOV MODEL.
Ch. 1 Vocabulary Alice.
Sub: Theoretical Foundations of Computer Sciences
Investigating the Hausdorff Distance
Owen Rambow 6 Minutes.
Computer Science 210 Computer Organization
Presentation transcript:

Drawings as Models of Syntactic Structure: Theory and Algorithms by Mathias Möhl supervised by Marco Kuhlmann final talk of diploma thesis at Programming Systems Lab. Saarland University, Prof. Smolka

2 / 25 Dependency analysis This isa sentence The depedency analysis of a sentence consists of two relations: a tree (  dependencies among words) a total order (  word order) formal model: drawings A drawing is a relational structure (V;S, ), where (V;S) forms a tree and (V; ) is a total order. Definition

3 / 25 The task two relaxations of projectivity: gap degree well-nestedness constraint language for well-nested drawings saturation algorithm for enumeration definition of TAG drawing TAG’ness = well-nestedness + gap 1 structural properties of drawings description language Tree Adjoining Grammar (TAG)

4 / 25 Part I Structural properties of drawings

5 / 25 Some terminology for any node v of a drawing (V;S, ) we define: yield(v) := S*v cover(v) = Convex-Hull(S*v) yield(5)={2,4,5} cover(5)={2,3,4,5} Example A drawing is projective, iff the yield of each node equals its cover. Definition

6 / 25 Example Gaps in drawings A gap of a node v is a maximal convex set in cover(v)-yield(v). The number of gaps of a node is called its gap degree node 1 has two gaps: {3} and {5,6} its gap degree is two The gap degree of a drawing is the maximum of the gap degrees of the nodes gap degree 0  projective The gap degree is a measure for the non-projectivity of a drawing

7 / 25 Contributions related to gaps Algorithm to compute the gap degree O(n*g) (n= number of nodes; g=gap degree ; g<n) A drawing has at most O(n²) gaps and at most O(n*log(n)) different gaps worst case examples O(n*log(n))O(n²)

8 / 25 Well-nestedness A drawing is well-nested if is satisfies the constraint: All disjoint subtrees are non-interleaving. Two subtrees T 1, T 2 interleave, if there are nodes l 1, r 1  T 1 and l 2, r 2  T 2 such that l 1 l 2 r 1 r 2. l1l1 l2l2 r1r1 r2r2 interleaving subtrees l1l1 l2l2 r1r1 r2r2 non-interleaving subtrees l1l1 r1r1 l2l2 r2r2 l1l1 l2l2 r1r1 r2r2 Definition Well-nestedness is a second kind of relaxation of projectivity Orthogonal to the gap degree projective well-nested

9 / 25 Contributions related to well-nesterness A well-nested drawing has at most O(n) different gaps worst case examples O(n*log(n))O(n²) bound for well-nested drawings: O(n) not well-nested

10 / 25 Contributions related to well-nestedness projective planarwell-nested two algorithms to test well-nestedness; both O(n²) first algorithm: tests for sibling nodes, if their subtrees interleave second algorithm: reduction to a cycle test (  details later) projective drawings  planar drawings  well-nested drawings

11 / 25 Contributions related to well-nestedness In well-nested drawings each node has a gap forest The gap forest of a node v describes the relative position among the subtrees rooted at its children If the drawing is not well-nested, some nodes have no gap forest: ?   

12 / 25 Contributions related to well-nestedness abcdefgh a c e f gap forest of node b: Example a c e f b

13 / 25 Part II A description language for drawings

14 / 25 Description language projective drawings are describable as tree structure + local order This isa sentence is subjobj This: subj sentence: obj det a: det for well-nested drawings local order is not sufficient:  goal: find local description for order in well-nested drawings This isa sentence a a

15 / 25 Description language description of order in our approach: extended form of gap forests (contains some additional nodes and edge labels) Example cadb e c a d b e b c self 1 2 ed 22 11 each well-nested drawing has a unique description order is described locally for each node

16 / 25 Description language underspecified description of gap forests with constraint language describes sets of drawings (with the same tree structure) saturation algorithm to enumerate all described gap forests (NP) (related to saturation algorithms for dominance constraints)... Example c a d b e constraints: self is in the first gap of c b before c......

17 / 25 Part III Tree Adjoining Grammar

18 / 25 Tree Adjoining Grammar (TAG) TAG derivation combines elementary trees into derived tree like VB NP1 NP2 * Dan NP2 S VB does what NP1  like VB NP1 NP2 VB Dan S does what derivation tree records the combining operations lexicalised TAG: each elementary tree corresponds to one word does like Dan what derived tree elementary trees derivation tree

19 / 25 TAG drawings consist of derivation tree + leaf-order of derived tree structural characterisation: TAG drawings = wellnested drawings with gap degree ≤ 1 Theorem: like VB NP1 NP2 VB Dan S does what derived tree does like Dan what derivation tree what doesDan like drawing + 

20 / 25 finally a technical detail: Reducing well-nestedness to a cycle-test

21 / 25 b c a d c a d b ■ tree edges ■ gap edges Two types of edges: tree edges and gap edges  Theorem: A drawing is well-nested if and only if its gap graph is acyclic b c a d c a d b ■ tree edges ■ gap edges  x x The gap graph of a drawing

22 / 25 Proof: I. If a drawing is not well-nested, its gap graph contains a cycle. II. If the gap graph contains a cycle, the drawing is not well-nested. Part I. l1l2r1r2 If the drawing is not well-nested, there exist two disjoint subtrees with interleaving nodes: l1r1l2r2  drawing with interleaving subtreesgap graph with cycle The gap graph of a drawing

23 / 25 Proof: II. If the gap graph contains a cycle, the drawing is not well-nested. If the gap graph contains a cycle, it contains a cycle in which all nodes reached by a gap edge are pairwise disjoint y1 x1 y2 x2 yn xn... If x1 and x2 are not disjoint either x1 dominates x2or x2 dominates x1 y1 x1 y2 x2 yn xn... y1 x1 y2 x2 yn xn... The gap graph of a drawing

24 / 25 Assume that the drawing is well- nested. Then the path implies C(x1)  C(x2) y1 x1 y2 x2 yn xn... y1x2 x1 y1x2 x1 The gap graph of a drawing Proof: II. If the gap graph contains a cycle, the drawing is not well-nested. C(x1)  C(x2) ...  C(xn)  C(x1)  C(x1)  C(x1)

25 / 25 Main contributions Formalisation of drawings Measures for non-projectivity of drawings: gap-degree well-nestedness Description language for well-nested drawings. Characterisation of TAG drawings (well-nested + gap 1) Future work: tree bank evaluations grammar formalism based on drawings structural properties of other formalisms

26 / 25 References Manuel Bodirsky, Marco Kuhlmann, and Mathias Möhl. Well-nested drawings as models of syntactic structure. In 10th Conference on Formal Grammar and 9th Meeting on Mathematics of Language, Edinburgh, Scotland, UK, Manuel Bodirsky and Martin Kutz. Pure dominance constraints. In Proceedings of the 19th Annual Symposium on Theoretical Aspects of Computer Science (STACS 2002), Mike Daniels and W. Detmar Meurers. Improving the efficiency of parsing with discontinuous constituents. In Shuly Wintner, editor, Proceedings of NLULP’02: The 7th International Workshop on Natural Language Understanding and Logic Program- ming, number 92 in Datalogiske Skrifter, pages 49–68, Copenhagen, Roskilde Universitetscenter. Denys Duchier and Joachim Niehren. Dominance constraints with set operators. In Proceedings of the First International Conference on Computational Logic (CL2000), volume 1861 of Lecture Notes in Computer Science, pages 326–341. Springer, July 2000.

27 / 25 References Alexander Koller. Constraint-based and graph-based resolution of ambiguities in natural language. PhD thesis, Universität des Saarlandes, Martin Plátek, Tomáš Holan, and Vladislav Kuboˇn. On relax-ability of word-order by d-grammars. In Cristian Calude, Michael Dinneen, and Smaranda Sburlan, editors, Combinatorics, Computability and Logic, Discrete Mathematics and Theoretical Computer Science, pages 159–174. Springer, Berlin, Anssi Yli-Jyrä. Multiplanarity – a model for dependency structures in treebanks. In Second Workshop on Treebanks and Linguistic Theories, Mathematical Modelling in Physics, Engineering and Cognitive Sciences, pages 189–200, Växjö, Sweden, 2003.