Regular Expressions: Review

Presentation transcript:

Regular Expressions: Review An FA for a regular expression can be built by composition. Ex: all strings over Σ={a,b} where there exists a “b” preceding an “a”: (a+b)*b(a+b)*a(a+b)* = (a+b)*ba(a+b)* Why? [Figure: FAs for the sub-expressions are chained together with ε-transitions.] Remove previous start/final states.
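The claimed equality of the two expressions can be spot-checked by brute force. The sketch below is only a bounded test, not a proof: it assumes Python's `re` syntax (union written `|` instead of the slide's `+`), and the names `LONG`, `SHORT`, `all_strings` and `agree_up_to` are made up for this illustration.

```python
import re
from itertools import product

# Slide notation (a+b)* becomes (?:a|b)* in Python's re syntax.
LONG = re.compile(r"(?:a|b)*b(?:a|b)*a(?:a|b)*")
SHORT = re.compile(r"(?:a|b)*ba(?:a|b)*")

def all_strings(alphabet, max_len):
    """Yield every string over `alphabet` of length 0..max_len."""
    for n in range(max_len + 1):
        for chars in product(alphabet, repeat=n):
            yield "".join(chars)

def agree_up_to(max_len):
    """True if both expressions accept exactly the same strings up to max_len."""
    return all(
        bool(LONG.fullmatch(w)) == bool(SHORT.fullmatch(w))
        for w in all_strings("ab", max_len)
    )

print(agree_up_to(10))
```

The intuition behind the agreement: if some "b" occurs before some "a", then walking from that "b" to that "a" must cross an adjacent "ba" somewhere.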

FA Minimization: Review Idea: “Equivalent” states can be merged. [Figure: an FA with 16 states is reduced, by repeatedly merging equivalent states, to an FA with 3 states.]

FA Minimization: Review Theorem [Hopcroft 1971]: the number N of states in a deterministic FA can be minimized within time O(N log N). Based on earlier work by [Huffman 1954] and [Moore 1956]. Conjecture: Minimizing the number of states in a nondeterministic FA cannot be done in polynomial time. Theorem: Minimizing the number of states in a pushdown automaton (or TM) is undecidable. Project idea: implement a finite automaton minimization tool. Try to design it to run reasonably efficiently. Consider also including: a regular-expression-to-FA transformer; a nondeterministic-to-deterministic FA converter.
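As a starting point for the project idea, here is a minimal sketch of state merging by partition refinement in the style of Moore's algorithm. It is not Hopcroft's O(N log N) algorithm; it is the simpler quadratic variant. It assumes a complete DFA given as a dictionary `delta[(state, symbol)] -> state`; the function name and the 5-state example machine are invented for illustration.

```python
def minimize_dfa(alphabet, delta, start, finals):
    """Return a dict mapping each reachable state to its equivalence-class id.

    Assumes a complete DFA: delta[(q, a)] exists for every state q and symbol a.
    """
    # Step 1: keep only states reachable from the start state.
    reachable, stack = {start}, [start]
    while stack:
        q = stack.pop()
        for a in alphabet:
            r = delta[(q, a)]
            if r not in reachable:
                reachable.add(r)
                stack.append(r)

    # Step 2: start from the partition {final, non-final} and refine it.
    block = {q: int(q in finals) for q in reachable}
    while True:
        # Two states stay together only if they are in the same block and
        # all their successors are in the same blocks.
        signature = {
            q: (block[q],) + tuple(block[delta[(q, a)]] for a in alphabet)
            for q in reachable
        }
        ids, refined = {}, {}
        for q in sorted(reachable):
            refined[q] = ids.setdefault(signature[q], len(ids))
        if len(ids) == len(set(block.values())):
            return refined  # no block was split: the partition is stable
        block = refined

# Hypothetical 5-state DFA for "contains ba", with redundant states 1 and 4.
example_delta = {
    (0, "a"): 1, (0, "b"): 2,
    (1, "a"): 0, (1, "b"): 2,
    (2, "a"): 3, (2, "b"): 2,
    (3, "a"): 3, (3, "b"): 4,
    (4, "a"): 3, (4, "b"): 4,
}
classes = minimize_dfa(["a", "b"], example_delta, start=0, finals={3, 4})
print(len(set(classes.values())))  # number of states after merging
```

States that end up with the same class id are equivalent and can be merged into one state of the minimized automaton.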

FAs and Regular Expressions: Review Theorem: Any FA accepts a language denoted by some RE. Proof: Use “generalized finite automata” (GFA) where each transition can be a regular expression (not just a symbol), and: there is only one super start state and one (separate) super final state; every state has transitions to all other states (including itself), except the super start state, which has no incoming transitions, and the super final state, which has no outgoing transitions. [Figure: the original FA M is wrapped into a generalized FA M’ by adding ε-transitions from the super start state and to the super final state; missing transitions are labeled Ø.]

FAs and Regular Expressions: Review Now reduce the size of the GFA by one state at each step. A transformation step is as follows: pick a state q’ other than the super-start and super-final states. For every pair of remaining states qi, qj with transitions qi → qj labeled P, qi → q’ labeled R, q’ → q’ labeled S, and q’ → qj labeled T, remove q’ and relabel the transition qi → qj with P + RS*T. Such a transformation step is always possible, until the GFA has only two states, the super-start and super-final states. The label of the last remaining transition is the regular expression corresponding to the language of the original FA! Corollary: FAs and REs denote the same class of languages.
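The elimination step P + RS*T can be sketched in code. This is an illustrative sketch under several assumptions: the FA is deterministic with single-character symbols, regular expressions are built as Python `re` strings (so union is `|`), a missing dictionary entry stands for Ø, and the empty string stands for ε. All function names and the reserved state names `_start` and `_final` are hypothetical.

```python
import re

def union(r, s):
    """R + S, where None stands for the empty language Ø."""
    if r is None:
        return s
    if s is None:
        return r
    return f"(?:{r}|{s})"

def star(r):
    """R*; both Ø* and ε* equal ε, written here as the empty string."""
    return "" if not r else f"(?:{r})*"

def fa_to_regex(states, delta, start, finals):
    """Eliminate the states one at a time; return the label left on start -> final."""
    S, F = "_start", "_final"          # super start / super final states
    label = {}                         # (p, q) -> regex; missing key means Ø

    def add(p, q, r):
        label[(p, q)] = union(label.get((p, q)), r)

    add(S, start, "")                  # ε-transition into the old start state
    for f in finals:
        add(f, F, "")                  # ε-transitions out of the old final states
    for (p, a), q in delta.items():
        add(p, q, re.escape(a))

    for rip in states:                 # remove one state q' per step
        loop = star(label.get((rip, rip)))                                      # S*
        ins = [(p, r) for (p, q), r in label.items() if q == rip and p != rip]  # R
        outs = [(q, t) for (p, q), t in label.items() if p == rip and q != rip] # T
        for p, r in ins:
            for q, t in outs:
                add(p, q, r + loop + t)                                         # P + RS*T
        for key in [k for k in label if rip in k]:
            del label[key]
    return label.get((S, F))

# Hypothetical 3-state DFA accepting the strings that contain "ba".
contains_ba = {
    ("q0", "a"): "q0", ("q0", "b"): "q1",
    ("q1", "a"): "q2", ("q1", "b"): "q1",
    ("q2", "a"): "q2", ("q2", "b"): "q2",
}
regex = fa_to_regex(["q0", "q1", "q2"], contains_ba, "q0", {"q2"})
print(regex)
```

The resulting expression depends on the order in which states are removed; different orders give different but equivalent expressions.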

Regular Expression Identities: Review
R+S = S+R
R(ST) = (RS)T
R(S+T) = RS+RT
(R+S)T = RT+ST
Ø* = ε* = ε
R+Ø = Ø+R = R
Rε = εR = R
(R*)* = R*
(ε+R)* = R*
(R*S*)* = (R+S)*
In general: R+ε ≠ R and RØ ≠ R
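These identities can be sanity-checked on concrete languages. The sketch below represents a language by the finite set of its strings of length at most `n`; truncating this way commutes with union, concatenation and star, so an identity that holds for the full languages also holds for the truncated sets. The sample languages `R`, `S`, `T` and all function names are arbitrary choices made for this illustration.

```python
def concat(A, B, n):
    """Concatenation of two languages, keeping strings of length <= n."""
    return {x + y for x in A for y in B if len(x + y) <= n}

def star(A, n):
    """Kleene star of A, keeping strings of length <= n."""
    result = {""}                      # ε is always in A*
    frontier = {""}
    while frontier:
        frontier = concat(frontier, A, n) - result
        result |= frontier
    return result

n = 6
EMPTY, EPS = set(), {""}               # Ø and {ε}
R, S, T = {"ab"}, {"b", "aa"}, {"a"}   # arbitrary sample languages

checks = {
    "R(S+T) = RS+RT": concat(R, S | T, n) == concat(R, S, n) | concat(R, T, n),
    "Ø* = ε* = ε": star(EMPTY, n) == star(EPS, n) == EPS,
    "Rε = εR = R": concat(R, EPS, n) == concat(EPS, R, n) == R,
    "(R*)* = R*": star(star(R, n), n) == star(R, n),
    "(ε+R)* = R*": star(EPS | R, n) == star(R, n),
    "(R*S*)* = (R+S)*": star(concat(star(R, n), star(S, n), n), n) == star(R | S, n),
    "R+ε ≠ R": (R | EPS) != R,
    "RØ ≠ R": concat(R, EMPTY, n) != R,
}
for name, holds in checks.items():
    print(name, holds)
```

A passing check on one choice of R, S, T is only evidence, not a proof; a failing check, on the other hand, would be a genuine counterexample.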

Extra credit: use this tool! (to implement some nontrivial TMs, PDAs, grammars, etc.)

Why Study Non-determinism? 1. Helps understand the ubiquitous concept of parallelism / concurrency; 2. Illuminates the structure of problems; 3. Can help save time & effort by solving intractable problems more efficiently; 4. Enables vast, deep, and general studies of “completeness” theories; 5. Helps explain why verifying proofs & solutions seems to be easier than constructing them;

Why Study Non-determinism? 6. Gave rise to new and novel mathematical approaches, proofs, and analyses; 7. Robustly decouples / abstracts complexity from underlying computational models; 8. Gives disciplined techniques for identifying “hardest” problems / languages; 9. Forged new unifications between computer science, math & logic; 10. Non-determinism is interesting, fun, and cool!

Problem: compute 111111111^2 in your head. 111111111^2 = 111111111 × 111111111 = 12345678987654321. [Figure: long multiplication with nine shifted copies of 111111111 summed column by column.]

Problem: What is the approximate value of: (1+9^(-(4^(7*6))))^(3^(2^85)) ≈ ? Let N = 3^(2^85). Since 9^(4^(7*6)) = 9^(4^42) = 3^(2*4^42) = 3^(2*2^84) = 3^(2^85) = N, the expression is (1 + 1/N)^N, which equals e to 18,457,734,525,360,901,453,873,570 digits of precision!
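The exponent bookkeeping can be checked with exact integer arithmetic, without ever computing N itself. The sketch below uses Python's built-in integers and the standard `decimal` module; the variable names are made up, and the digit estimate is only approximate (it uses the fact that (1 + 1/N)^N differs from e by roughly e/(2N), so the number of correct digits is close to log10 N).

```python
from decimal import Decimal, getcontext

# 9^(4^(7*6)) = (3^2)^(4^42) = 3^(2 * 4^42); compare the exponents of 3 exactly.
same_exponent = (2 * 4 ** (7 * 6) == 2 ** 85)
print(same_exponent)

# log10(N) for N = 3^(2^85), computed with 60 significant digits.
getcontext().prec = 60
log10_N = Decimal(2 ** 85) * Decimal(3).log10()
print(int(log10_N))  # roughly how many digits of e the expression gets right
```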

Problem: Does the Pythagorean theorem generalize to arbitrary figures on the sides of a right triangle?

Problem: Does every closed simple curve contain the vertices of an equilateral triangle? What approaches fail? What techniques work and why? Lessons and generalizations

Problem: Can an 8x8 board with two opposite corners missing be tiled with 31 dominoes? What approaches fail? What techniques work and why? Lessons and generalizations
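One technique that works here is a coloring (parity) argument, and the counting behind it is easy to check. The sketch below assumes the standard checkerboard coloring and that the missing corners are (0, 0) and (7, 7); the function name is invented for this illustration.

```python
def color_counts(removed, size=8):
    """Count (dark, light) squares left on a size x size checkerboard."""
    squares = [
        (r, c)
        for r in range(size)
        for c in range(size)
        if (r, c) not in removed
    ]
    dark = sum((r + c) % 2 == 0 for r, c in squares)
    return dark, len(squares) - dark

dark, light = color_counts({(0, 0), (7, 7)})
print(dark, light)  # prints "30 32"
```

Every domino covers one dark and one light square, so 31 dominoes would need 31 squares of each color; with 30 and 32 no tiling exists.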

Problem: Explain the apparent discrepancy between the areas of the two arrangements. [Figure: two arrangements of the same pieces, with side lengths labeled 5 and 13.] What approaches fail? What techniques work and why? Lessons and generalizations

Problem: Explain the apparent discrepancy between the areas of the three arrangements. What approaches fail? What techniques work and why? Lessons and generalizations

Turing Machine “Enhancements” Larger alphabet: old: Σ={0,1} new: Σ’={a,b,c,d} Idea: Encode larger alphabet using smaller one. Encoding example: a=00, b=01, c=10, d=11 [Figure: a tape over Σ’ handled by the new δ' and the corresponding binary-encoded tape handled by the old δ.]
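The encoding example can be written out directly. This small sketch uses the slide's code table a=00, b=01, c=10, d=11; the function names `encode` and `decode` are made up for illustration.

```python
CODE = {"a": "00", "b": "01", "c": "10", "d": "11"}   # table from the slide
DECODE = {bits: sym for sym, bits in CODE.items()}

def encode(word):
    """Rewrite a string over {a,b,c,d} as a string over {0,1}."""
    return "".join(CODE[sym] for sym in word)

def decode(bits):
    """Read the binary tape two cells at a time to recover the original symbols."""
    return "".join(DECODE[bits[i:i + 2]] for i in range(0, len(bits), 2))

print(encode("bbadc"))  # prints "0101001110"
```

Because every symbol gets a code of the same length (2 bits), the simulating machine always knows where one encoded symbol ends and the next begins.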

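Turing Machine “Enhancements” Double-sided infinite tape: Idea: Fold into a normal single-sided infinite tape. [Figure: a two-way infinite tape folded at the origin onto a one-way infinite tape.] The old δ moves L/R; the new δ' moves L/R on one half of the folded tape and R/L (directions reversed) on the other half.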
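One common way to realize the fold is to interleave the two halves: non-negative cells go to even positions and negative cells to odd positions. The slide's figure may use a different layout (for example, two tracks per cell), so treat the mapping below as an assumption; `fold` and `unfold` are hypothetical names.

```python
def fold(i):
    """Map a cell index on the two-way tape (..., -2, -1, 0, 1, 2, ...) to 0, 1, 2, ..."""
    return 2 * i if i >= 0 else -2 * i - 1

def unfold(j):
    """Inverse of fold: recover the two-way index from the one-way index."""
    return j // 2 if j % 2 == 0 else -((j + 1) // 2)

print([fold(i) for i in range(-3, 4)])  # prints [5, 3, 1, 0, 2, 4, 6]
```

Moving right on the negative half (from -2 to -1) takes the folded head from position 3 to position 1, that is, leftward, which is why δ' reverses directions on that half.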

Turing Machine “Enhancements” Multiple heads: Idea: Mark head locations on the tape and simulate. [Figure: a tape with several heads, and the same tape with the head positions marked by special symbols.] Modified δ' processes each “virtual” head independently: Each move of δ is simulated by a long scan & update; δ' updates & marks all “virtual” head positions.

Turing Machine “Enhancements” Multiple tapes: Idea: Interlace multiple tapes into a single tape. [Figure: several tapes and the single tape obtained by interlacing their cells.] Modified δ' processes each “virtual” tape independently: Each move of δ is simulated by a long scan & update; δ' updates R/W head positions on all “virtual tapes”.
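Interlacing can be pictured as storing cell j of tape t at position j*k + t of the single tape, where k is the number of tapes. The sketch below shows only this layout, not the head-marking or the scan-and-update simulation; the blank symbol `_` and the function names are assumptions made for illustration.

```python
from itertools import zip_longest

BLANK = "_"  # assumed blank symbol, used to pad shorter tapes

def interlace(tapes):
    """Interleave k tapes cell by cell into a single tape (a list of symbols)."""
    return [
        sym
        for column in zip_longest(*tapes, fillvalue=BLANK)
        for sym in column
    ]

def virtual_cell(single, k, t, j):
    """Read cell j of virtual tape t from the interlaced tape of k tapes."""
    return single[j * k + t]

single = interlace(["101", "11"])
print(single)  # prints ['1', '1', '0', '1', '1', '_']
```

A single step to the right on one virtual tape corresponds to moving k cells to the right on the interlaced tape.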

Turing Machine “Enhancements” Two-dimensional tape: Idea: Flatten 2-D tape into a 1-D tape. This is how compilers implement 2D arrays! [Figure: the rows of the 2-D tape laid out one after another on a 1-D tape, in sections delimited by $.] Modified 1-D δ' simulates the original 2-D δ: Left/right δ moves: δ' moves horizontally; Up/down δ moves: δ' jumps between tape sections.
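The compiler analogy is row-major layout: element (row, col) of a grid with a fixed row width lives at offset row*width + col. The sketch below assumes a fixed width, which a real 2-D Turing machine tape does not have (hence the separately delimited, growable tape sections); the function names are hypothetical.

```python
def flatten(grid):
    """Lay the rows of a 2-D grid out one after another (row-major order)."""
    return [cell for row in grid for cell in row]

def flat_index(row, col, width):
    """Offset of (row, col) in the flattened array."""
    return row * width + col

grid = [[1, 2, 3],
        [4, 5, 6]]
flat = flatten(grid)
print(flat[flat_index(1, 2, width=3)])  # prints 6
```

A left/right move changes the offset by 1, while an up/down move changes it by `width`, which corresponds to jumping to the neighboring tape section.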

Turing Machine “Enhancements” Non-determinism: Idea: Parallel-simulate non-deterministic threads. [Figure: the configurations of the active threads stored side by side on one tape, delimited by $.] Modified deterministic δ' simulates the original ND δ: Each ND move by δ spawns another independent “thread”; All current threads are simulated “in parallel”.
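The same "all threads in parallel" idea is easiest to see on a finite automaton rather than a Turing machine: a deterministic simulator keeps the set of states that some thread is currently in. The sketch below assumes an NFA without ε-transitions, given as `delta[(state, symbol)] -> set of states`; the function name and the example NFA (strings containing "ba") are invented for illustration.

```python
def nfa_accepts(delta, start, finals, word):
    """Deterministically simulate all non-deterministic threads at once."""
    current = {start}                       # one thread per active state
    for ch in word:
        # Each thread follows every move available to it; dead threads vanish.
        current = {r for q in current for r in delta.get((q, ch), ())}
    return bool(current & finals)           # accept if any thread accepts

# Hypothetical NFA for "contains ba": guess where the "ba" starts.
nfa = {
    (0, "a"): {0}, (0, "b"): {0, 1},
    (1, "a"): {2},
    (2, "a"): {2}, (2, "b"): {2},
}
print(nfa_accepts(nfa, 0, {2}, "abba"))  # prints True
```

For Turing machines the simulated objects are whole configurations (tape contents, head position, state) instead of single states, but the bookkeeping has the same shape.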

Turing Machine “Enhancements” Combinations: [Figure: a single machine combining several enhancements: larger alphabets, multiple tapes and heads, a 2-D tape, and non-determinism (ND).] Idea: “Enhancements” are independent (and commutative WRT preserving the language recognized). Theorem: Combinations of “enhancements” do not increase the power of Turing machines.