DIPLOMA THESIS Peter Černo Clearing Restarting Automata Supervised by RNDr. František Mráz, CSc.

Slides:



Advertisements
Similar presentations
Language and Automata Theory
Advertisements

Chapter 5 Pushdown Automata
C O N T E X T - F R E E LANGUAGES ( use a grammar to describe a language) 1.
Pushdown Automata Chapter 12. Recognizing Context-Free Languages Two notions of recognition: (1) Say yes or no, just like with FSMs (2) Say yes or no,
Pushdown Automata Chapter 12. Recognizing Context-Free Languages We need a device similar to an FSM except that it needs more power. The insight: Precisely.
CFGs and PDAs Sipser 2 (pages ). Long long ago…
Peter Černo and František Mráz. Introduction Δ -Clearing Restarting Automata: Restricted model of Restarting Automata. In one step (based on a limited.
PETER ČERNO FRANTIŠEK MRÁZ Clearing Restarting Automata.
Chapter Chapter Summary Languages and Grammars Finite-State Machines with Output Finite-State Machines with No Output Language Recognition Turing.
Finite Automata Section 1.1 CSC 4170 Theory of Computation.
CFGs and PDAs Sipser 2 (pages ). Last time…
1 Introduction to Computability Theory Lecture3: Regular Expressions Prof. Amos Israeli.
Fall 2006Costas Busch - RPI1 Deterministic Finite Automata And Regular Languages.
CS Master – Introduction to the Theory of Computation Jan Maluszynski - HT Lecture 4 Context-free grammars Jan Maluszynski, IDA, 2007
1 Languages and Finite Automata or how to talk to machines...
Foundations of (Theoretical) Computer Science Chapter 2 Lecture Notes (Section 2.2: Pushdown Automata) Prof. Karen Daniels, Fall 2009 with acknowledgement.
1 Foundations of Software Design Lecture 23: Finite Automata and Context-Free Grammars Marti Hearst Fall 2002.
79 Regular Expression Regular expressions over an alphabet  are defined recursively as follows. (1) Ø, which denotes the empty set, is a regular expression.
Normal forms for Context-Free Grammars
COP4020 Programming Languages
LIMITED CONTEXT RESTARTING AUTOMATA AND MCNAUGHTON FAMILIES OF LANGUAGES Friedrich Otto Peter Černo, František Mráz.
Diploma Thesis Clearing Restarting Automata Peter Černo RNDr. František Mráz, CSc.
Formal Grammars Denning, Sections 3.3 to 3.6. Formal Grammar, Defined A formal grammar G is a four-tuple G = (N,T,P,  ), where N is a finite nonempty.
Languages & Strings String Operations Language Definitions.
Lecture 16 Oct 18 Context-Free Languages (CFL) - basic definitions Examples.
Scattered Context Grammars Alexander Meduna Faculty of Information Technology Brno University of Technology Brno, Czech Republic, Europe.
Tree-adjoining grammar (TAG) is a grammar formalism defined by Aravind Joshi and introduced in Tree-adjoining grammars are somewhat similar to context-free.
Learning Restricted Restarting Automata Presentation for the ABCD workshop in Prague, March 27 – 29, 2009 Peter Černo.
CSCI 2670 Introduction to Theory of Computing August 24, 2005.
Pushdown Automata (PDA) Intro
CLEARING RESTARTING AUTOMATA AND GRAMMATICAL INFERENCE Peter Černo Department of Computer Science Charles University in Prague, Faculty of Mathematics.
1 Chapter 3 Describing Syntax and Semantics. 3.1 Introduction Providing a concise yet understandable description of a programming language is difficult.
Pushdown Automata (PDAs)
Grammars CPSC 5135.
Learning Automata and Grammars Peter Černo.  The problem of learning or inferring automata and grammars has been studied for decades and has connections.
GRAMMATICAL INFERENCE OF LAMBDA-CONFLUENT CONTEXT REWRITING SYSTEMS Peter Černo Department of Computer Science Charles University in Prague, Faculty of.
Introduction to Language Theory
Pushdown Automata Part I: PDAs Chapter Recognizing Context-Free Languages Two notions of recognition: (1) Say yes or no, just like with FSMs (2)
Copyright © by Curt Hill Grammar Types The Chomsky Hierarchy BNF and Derivation Trees.
Part VII. Models for Context-Free Languages 1/50.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 4.
CS 321 Programming Languages and Compilers Lectures 16 & 17 Introduction to Formal Languages Regular Languages Lexical Analysis.
Finite Automata – Definition and Examples Lecture 6 Section 1.1 Mon, Sep 3, 2007.
1 A well-parenthesized string is a string with the same number of (‘s as )’s which has the property that every prefix of the string has at least as many.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
Foundations of (Theoretical) Computer Science Chapter 2 Lecture Notes (Section 2.2: Pushdown Automata) Prof. Karen Daniels, Fall 2010 with acknowledgement.
Grammars A grammar is a 4-tuple G = (V, T, P, S) where 1)V is a set of nonterminal symbols (also called variables or syntactic categories) 2)T is a finite.
Finite Automata Chapter 1. Automatic Door Example Top View.
CSC312 Automata Theory Lecture # 26 Chapter # 12 by Cohen Context Free Grammars.
Chapter 4: Syntax analysis Syntax analysis is done by the parser. –Detects whether the program is written following the grammar rules and reports syntax.
Formal Languages Finite Automata Dr.Hamed Alrjoub 1FA1.
Donghyun (David) Kim Department of Mathematics and Physics North Carolina Central University 1 Chapter 2 Context-Free Languages Some slides are in courtesy.
Algorithms for hard problems Automata and tree automata Juris Viksna, 2015.
CSCI 4325 / 6339 Theory of Computation Zhixiang Chen Department of Computer Science University of Texas-Pan American.
Pushdown Automata Chapter 12. Recognizing Context-Free Languages Two notions of recognition: (1) Say yes or no, just like with FSMs (2) Say yes or no,
PETER ČERNO FRANTIŠEK MRÁZ Clearing Restarting Automata.
Chapter 2. Formal Languages Dept. of Computer Engineering, Hansung University, Sung-Dong Kim.
Costas Busch - LSU1 Deterministic Finite Automata And Regular Languages.
Chapter 1 INTRODUCTION TO THE THEORY OF COMPUTATION.
Modeling Arithmetic, Computation, and Languages Mathematical Structures for Computer Science Chapter 8 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesAlgebraic.
Deterministic Finite-State Machine (or Deterministic Finite Automaton) A DFA is a 5-tuple, (S, Σ, T, s, A), consisting of: S: a finite set of states Σ:
CIS Automata and Formal Languages – Pei Wang
Course 2 Introduction to Formal Languages and Automata Theory (part 2)
A HIERARCHY OF FORMAL LANGUAGES AND AUTOMATA
Closure Properties for Regular Languages
Deterministic Finite Automata And Regular Languages Prof. Busch - LSU.
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Chapter 2 Context-Free Language - 01
Chapter 1 Regular Language
Teori Bahasa dan Automata Lecture 6: Regular Expression
Presentation transcript:

DIPLOMA THESIS Peter Černo Clearing Restarting Automata Supervised by RNDr. František Mráz, CSc.

Clearing Restarting Automata Represent a new restricted model of restarting automata. Can be learned very efficiently from positive examples and the extended model enables to learn effectively a large class of languages. In the thesis we relate the class of languages recognized by these automata to Chomsky hierarchy and study their formal properties.

Diploma Thesis Outline Chapter 1 gives a short introduction to the theory of automata and formal languages. Chapter 2 gives an overview of several selected models related to our model. Chapter 3 introduces our model of clearing restarting automata. Chapter 4 describes two extended models of clearing restarting automata.

Selected Models Contextual Grammars by Solomon Marcus: – Are based on adjoining (inserting) pairs of strings/contexts into a word according to a selection procedure. Pure grammars by Mauer et al.: – Are similar to Chomsky grammars, but they do not use auxiliary symbols – nonterminals. Church-Rosser string rewriting systems: – Recognize set of words which can be reduced to an auxiliary symbol Y. Each maximal sequence of reductions ends with the same irreducible string. Associative language description by Cherubini et al.: – Work on so-called stencil trees which are similar to derivation trees but without nonterminals. The inner nodes are marked by an auxiliary symbol Δ.

Selected Models Restarting automata: – Introduced by Jančar et al. in 1995 in order to model the so-called `analysis by reduction’. – Analysis by reduction is a technique used in linguistics to analyze sentences of natural languages that have free word order.

Formal Definition Let k be a positive integer. k -clearing restarting automaton ( k-cl-RA - automaton for short) is a couple M = (Σ, I) : – Σ is a finite nonempty alphabet, ¢, $ ∉ Σ. – I is a finite set of instructions (x, z, y), z ∊ Σ +, x ∊ LC k = Σ k ∪ ¢.Σ ≤k-1 (left context) y ∊ RC k = Σ k ∪ Σ ≤k-1.$ (right context) – The special symbols: ¢ and $ are called sentinels.

Formal Definition A word w = uzv can be rewritten to uv (denoted uzv ⊢ M uv ) if and only if there exist an instruction i = (x, z, y) ∊ I such that: – x is a suffix of ¢.u – y is a prefix of v.$ A word w is accepted if and only if w ⊢ * M λ where ⊢ * M is reflexive and transitive closure of ⊢ M. The k-cl-RA -automaton M recognizes the language L(M) = {w ∊ Σ* | M accepts w}.

Example Language L = {a n b n | n ≥ 0}. Can be recognized by the 1-cl-RA -automaton M = ({a, b}, I), where the instructions I are: – R1 = (a, ab, b) – R2 = (¢, ab, $) For instance: – aaaabbbb ⊢ R1 aaabbb ⊢ R1 aabb ⊢ R1 ab ⊢ R2 λ. Now we see that the word aaaabbbb is accepted.

Regular Languages Theorem: All regular languages can be recognized by clearing restarting automata using only instructions with left contexts starting with ¢. Theorem: If M = (Σ, I) is a k-cl-RA -automaton such that for each (x, z, y) ∊ I: ¢ is a prefix of x or $ is a suffix of y then L(M) is a regular language.

Non-Context-Free Languages Theorem: The family of languages recognized by 1-cl-RA -automata is strictly included in the family of context-free languages. Theorem: 2-cl-RA -automata can recognize some non-context-free languages.

Problem with cl-RA -automata Theorem: The language: L 1 = {a n cb n | n ≥ 0} ∪ {λ} is not recognized by any cl-RA -automaton.

Extended Models Δ- clearing restarting automata – Can leave a mark – a symbol Δ – at the place of deleting besides rewriting into the empty word. – Can recognize Greibach’s hardest-context-free language. Δ*- clearing restarting automata – Can rewrite a subword w into Δ k where k ≤ |w|. – Can recognize all context-free languages.

Conclusion The main goal of the thesis was successfully achieved. The results of the thesis were presented in: – ABCD workshop, Prague, March 2009 – NCMA workshop, Wroclaw, August 2009 – An extended version of the paper from the NCMA workshop was accepted for publication in Fundamenta Informaticae. Many interesting theoretical questions and problems are still open or under investigation.

Thank You