Jan 2004CSA3050: NLG21 CSA3050: Natural Language Generation 2 Surface Realisation Systemic Grammar Functional Unification Grammar see J&M Chapter 20.3.

Slides:



Advertisements
Similar presentations
Kees van Deemter Matthew Stone Formal Issues in Natural Language Generation Lecture 4 Shieber 1993; van Deemter 2002.
Advertisements

 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Feature Structures and Unification.
Lexical Functional Grammar : Grammar Formalisms Spring Term 2004.
Semantics (Representing Meaning)
APA Style Grammar. Verbs  Use active rather than passive voice, select tense and mood carefully  Poor: The survey was conducted in a controlled setting.
Why study grammar? Knowledge of grammar facilitates language learning
Chapter 4 Syntax.
How Language Use Varies
Natural Language Generation: Discourse Planning
Chapter 20: Natural Language Generation Presented by: Anastasia Gorbunova LING538: Computational Linguistics, Fall 2006 Speech and Language Processing.
Natural Language Generation Research Presentation Presenter Shamima Mithun.
Natural Language Processing - Feature Structures - Feature Structures and Unification.
A Pipelined Architecture Document Planning Microplanning Surface Realisation Document Plan Text Specification.
NLP and Speech 2004 Feature Structures Feature Structures and Unification.
Some Linguistic Tools. Lexical Categories (Parts of Speech)
Some Linguistic Tools. Linguistic features are analysed at the sentence level often to explore: (i) Interpersonal meaning (ii) Ideational meaning (iii)
Features & Unification Ling 571 Deep Processing Techniques for NLP January 31, 2011.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Introduction to Software Design Chapter 1. Chapter 1: Introduction to Software Design2 Chapter Objectives To become familiar with the software challenge.
Natural Language Generation Martin Hassel KTH CSC Royal Institute of Technology Stockholm
 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Feature Structures and Unification.
Let remember from the previous lesson what is Knowledge representation
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
AuxiliariesAuxiliaries. Auxiliaries A verb used to add a functional or grammatical meaning to a clause in which it appears. Functions in a supporting.
Sentences, Paragraphs, and Compositions
Using Rhetorical Grammar in the English 90 Classroom.
Introduction to Natural Language Generation
Language Objectives. Planning Teachers should write both content and language objectives Content objectives are drawn from the subject area standards.
9/8/20151 Natural Language Processing Lecture Notes 1.
Theme 1 Grammar. Kinds of Sentences  Declarative sentence- makes a statement, ends with a period  Interrogative sentence- asks a question, ends with.
October 2006Advanced Topics in NLP1 CSA3050: NLP Algorithms Finite State Transducers for Morphological Parsing.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2007 Lecture4 1 August 2007.
Paraphrasing and Plagiarism. PLAGIARISM Plagiarism is using data, ideas, or words that originated in work by another person without appropriately acknowledging.
IV. SYNTAX. 1.1 What is syntax? Syntax is the study of how sentences are structured, or in other words, it tries to state what words can be combined with.
7. Parsing in functional unification grammar Han gi-deuc.
Grammar Race!. What is a sentence? Sentences express complete thoughts; they have a subject and a predicate. Subjects are nouns or pronouns (or phrases.
Rules, Movement, Ambiguity
Three Basic Functions are generally noted: there is perhaps nothing more subtle than language is, and nothing has as many different uses. Without a doubt,
The Minimalist Program
Parts of Speech Major source: Wikipedia. Adjectives An adjective is a word that modifies a noun or a pronoun, usually by describing it or making its meaning.
November 2003CSA4050: Computational Morphology IV 1 CSA405: Advanced Topics in NLP Computational Morphology IV: xfst.
Topic and the Representation of Discourse Content
Active and Passive Voice Verbs 08/16/13. Active and Passive Voice Verbs State Standard W1.2 Use precise language, action verbs, sensory details, appropriate.
Natural Language Generation Martin Hassel KTH NADA Royal Institute of Technology Stockholm
◦ Process of describing the structure of phrases and sentences Chapter 8 - Phrases and sentences: grammar1.
NLP. Introduction to NLP (U)nderstanding and (G)eneration Language Computer (U) Language (G)
April 2010Semantic Grammar1 A short guide to Blackburn’s Grammar of English.
3.3 A More Detailed Look At Transformations Inversion (revised): Move Infl to C. Do Insertion: Insert interrogative do into an empty.
November 2003Computational Morphology VI1 CSA4050 Advanced Topics in NLP Non-Concatenative Morphology – Reduplication – Interdigitation.
Discourse & Natural Language Generation Martin Hassel KTH NADA Royal Institute of Technology Stockholm
System and the axis of Choice  Systems are list of choices which are available in the grammar of a language.  It could be a list of things b/w which.
L3 THE LINGUISTIC COMPONENTS OF CA.
Integrating Technology into Developmental Writing Sentence Patterns.
Chapter 4 Syntax a branch of linguistics that studies how words are combined to form sentences and the rules that govern the formation of sentences.
Introduction to Logic Common Forms and Functions of Language
Grammar Grammar analysis.
THEMATIC AND INFORMATION STRUCTURES
CSC 594 Topics in AI – Natural Language Processing
System Software Unit-1 (Language Processors) A TOY Compiler
An Introduction to the Government and Binding Theory
Lecture – VIII Monojit Choudhury RS, CSE, IIT Kharagpur
Appendix A: Basic Grammar and Punctuation Reference
AN EXAMPLE OF A VERY MEDIOCRE TO POOR ASSIGNMENT
Introduction to In-Text Citations
Lexical Functional Grammar
Terms Related to Sentences in General
Course content – the syllabus and educational framework
Linguistic aspects of interlanguage
Mariana Berenguer AP Language and Composition
Presentation transcript:

Jan 2004CSA3050: NLG21 CSA3050: Natural Language Generation 2 Surface Realisation Systemic Grammar Functional Unification Grammar see J&M Chapter 20.3

Jan 2004CSA3050: NLG22 Surface Realisation within NLG

Jan 2004CSA3050: NLG23 Revealing Quotations Language understanding is somewhat like counting from one to infinity; language generation is like counting from infinity back to one. [Y. Wilks, 1988] Generation from what?! [C. Longuet- Higgins] Moral: there is no consensus about the level of representation we start from. This applies in particular to surface realisation.

Jan 2004CSA3050: NLG24 Surface Realisation There is, however, some agreement that output should be specified in terms of function rather than syntactic form Consider the case of voice (syntactic form) The system will save the document vs. The document will be saved by the system Output of a discourse planner is not in terms of voice, active v. passive. More likely in terms of notions like topic, focus etc. whose surface realisation is left to a subsequent processing stage.

Jan 2004CSA3050: NLG25 Two Approaches to Surface Realisation Systemic Grammar (Halliday 1985) –Systemic Functional Linguistics –Language as a resource for expressing meaning in context. –Sentences are regarded collections of functions that are mapped to grammatical forms. Functional Unification Grammar (Kay 1979) –Grammar is a "feature structure" (recursive attribute/value pairs). –Input specification is another feature structure –Realisation viewed as further specification of input to yield output sentence via fundamental operation of functional unification.

Jan 2004CSA3050: NLG26 Systemic Grammar: Multi-Layered Sentence Analysis the systemwillsavethe document Moodsubjectfinitepredicatorobject Transitivityactorprocessgoal Themethemerheme Mood layer defines interaction between reader and writer: telling v. asking v. ordering Transitivity layer expresses propositional content Theme layer deals with topic (highlighting function)

Jan 2004CSA3050: NLG27 Systemic Grammar Grammar is represented as a graph called a system network. This comprises and systems (curly braces) –conjunctive features in boldface or systems (straight vertical lines) –disjunctive features in normal face realisation statements (in italic). –specify how disjunctive features are realised

Jan 2004CSA3050: NLG28

Jan 2004CSA3050: NLG29 Realisation Statements +X: insert the function X +predicator X=Y: conflate the functions X and Y goal = subject X>Y: order X somewhere before Y subject > predicator X/A: function X has grammatical feature Y subject/noun phrase X!L assign function X to lexical item L passive!be

Jan 2004CSA3050: NLG210 Generation Procedure 1.Traverse network from left to right, choosing the appropriate features and collecting the appropriate realisation statements. 2.Build an intermediate expression that obeys the constraints set by the realisation statements. 3.Recurse back through the grammar at a lower level for functions not fully specified.

Jan 2004CSA3050: NLG211 Example Input ( :process save-1 :actor system-1 :goal document-1 :speechact assertion :tense future ) N.B. This specification resembles that used by PENMAN system (Mann 1983) based on KL- ONE knowledge base.

Jan 2004CSA3050: NLG212 Generation 1 Start with clause feature –insert predicator and classify as verb Proceed to mood subsystem after interrogating input specification (assertion is specified) choose indicative and declarative features. Insert subject and finite functions with ordering subject > finite > predicator

Jan 2004CSA3050: NLG213 Output of Mood System Moodsubjectfinitepredicator

Jan 2004CSA3050: NLG214 Generation 2 Proceed to Transitivity subsystem. Assuming save-1 is a material process (from KB) –Insert goal and process functions –Conflate process with (finite, predicator) pair –Proceed to Voice subsystem –Choose Active feature and obey realisation statements.

Jan 2004CSA3050: NLG215 Output of Transitivity System Moodsubjectfinitepredicatorobject Transitivityactorprocessgoal

Jan 2004CSA3050: NLG216 Generation 3 Proceed to Theme subsystem Since there is no thematic specification in the input, choose Unmarked Theme –Insert theme and rheme functions –Conflate theme with subject and rheme with predicator,object

Jan 2004CSA3050: NLG217 Output of Mood Subsystem Moodsubjectfinitepredicatorobject Transitivityactorprocessgoal Themethemerheme

Jan 2004CSA3050: NLG218 Generation 4 Generation now recursively enters the grammar at lower levels to fully specify –the phrases (noun phrase network, auxiliary network – not shown here) –lexical items (usually through entries in the knowledge base) –and possibly morphological choices

Jan 2004CSA3050: NLG219 Result of Generation the systemwillsavethe document Moodsubjectfinitepredicatorobject Transitivityactorprocessgoal Themethemerheme

Jan 2004CSA3050: NLG220 Functional Unification Grammar Build the generation grammar as a feature structure (FS) Build input specification as an FS Unify the two together (functional unification operation). Linearize result.

Jan 2004CSA3050: NLG221 Feature Structures Feature structures are essentially sets of attribute value pairs attribute1value1 attribute2value2 : attribute3value3

Jan 2004CSA3050: NLG222 Feature Structure 2 These can be used to express facts of different kinds namemike height156 : eyesgreen numbersing gendermasc : casenom human factsagreement facts

Jan 2004CSA3050: NLG223 Feature Structures 3 Furthermore, values can themselves be feature structures numbersing gendermasc casenom catNP agreement

Jan 2004CSA3050: NLG224 Functional Unification The information in feature structures can be combined together by means of the functional unification (  ) operation. namemike height156 eyesgreen namemike height156 eyesgreen  =

25 Grammar as a Feature Structure

Jan 2004CSA3050: NLG226 Input Specification – as a Feature Structure catS actorheadlexsystem processheadlexsave tensefuture goalheadlexdocument

27 Partial Result catS actorcatNP headlexsystem processcatVP number{actor number} headlexsave tensefuture goalcatNP headlexdocument pattern(actor process goal)

28 Full Result