Natural Language Processing what it does what is involved why is it difficult brief history.

Slides:



Advertisements
Similar presentations
Computational language: week 10 Lexical Knowledge Representation concluded Syntax-based computational language Sentence structure: syntax Context free.
Advertisements

 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Feature Structures and Unification.
Natural Language Processing Lecture 2: Semantics.
Syntax. Definition: a set of rules that govern how words are combined to form longer strings of meaning meaning like sentences.
Statistical NLP: Lecture 3
DEFINITE CLAUSE GRAMMARS Ivan Bratko University of Ljubljana Faculty of Computer and Information Sc.
1 CS 385 Fall 2006 Chapter 14 Understanding Natural Language Problems.
Fall 2008Programming Development Techniques 1 Topic 9 Symbol Manipulation Generating English Sentences Section This is an additional example to symbolic.
LING NLP 1 Introduction to Computational Linguistics Martha Palmer April 19, 2006.
For Monday Read Chapter 23, sections 3-4 Homework –Chapter 23, exercises 1, 6, 14, 19 –Do them in order. Do NOT read ahead.
Natural Language Processing - Feature Structures - Feature Structures and Unification.
1 Words and the Lexicon September 10th 2009 Lecture #3.
Understanding Natural Language
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
NLP and Speech 2004 Feature Structures Feature Structures and Unification.
Artificial Intelligence 2005/06 From Syntax to Semantics.
 Christel Kemke 2007/08 COMP 4060 Natural Language Processing Feature Structures and Unification.
A modern approach input sentence syntax analysis (parsing) semantic analysis pragmatic analysis target representation grammar lexicon semantic rules contextual.
CHAPTER 7 Natural Language Processing. Natural language processing is a branch of AI whose goal is to facilitate communication between humans and computers.
1 Understanding Natural Language The Natural Language Understanding Problem 14.1Deconstructing Language: A Symbolic Analysis 14.2Syntax 14.3Syntax.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
Syntax LING October 11, 2006 Joshua Tauberer.
Knowledge Representation why bother ? what is it ? what do we represent ? how is it represented ? Kn Rep n strategies inferencing example tasks.
Artificial Intelligence Simon Lynch This session what is the subject about? course structure an example AI system.
Context-Free Grammar CSCI-GA.2590 – Lecture 3 Ralph Grishman NYU.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Computing Science, University of Aberdeen1 CS4025: Grammars l Grammars for English l Features l Definite Clause Grammars See J&M Chapter 9 in 1 st ed,
9/8/20151 Natural Language Processing Lecture Notes 1.
CCSB354 ARTIFICIAL INTELLIGENCE (AI)
ICS611 Introduction to Compilers Set 1. What is a Compiler? A compiler is software (a program) that translates a high-level programming language to machine.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
1 CS 385 Fall 2006 Chapter 14 Understanding Natural Language (omit 14.4)
For Friday Finish chapter 23 Homework: –Chapter 22, exercise 9.
Natural Language Processing Lecture 6 : Revision.
CS : Language Technology for the Web/Natural Language Processing Pushpak Bhattacharyya CSE Dept., IIT Bombay Constituent Parsing and Algorithms (with.
Chapter 15 Natural Language Processing (cont)
Natural Language Processing Artificial Intelligence CMSC February 28, 2002.
Introduction to CL & NLP CMSC April 1, 2003.
NLP. Introduction to NLP Is language more than just a “bag of words”? Grammatical rules apply to categories and groups of words, not individual words.
GTRI.ppt-1 NLP Technology Applied to e-discovery Bill Underwood Principal Research Scientist “The Current Status and.
Transition Network Grammars for Natural Language Analysis - W. A. Woods In-Su Yoon Pusan National University School of Electrical and Computer Engineering.
For Wednesday Read chapter 23 Homework: –Chapter 22, exercises 1,4, 7, and 14.
CSA2050 Introduction to Computational Linguistics Lecture 1 Overview.
Parsing with Context-Free Grammars for ASR Julia Hirschberg CS 4706 Slides with contributions from Owen Rambow, Kathy McKeown, Dan Jurafsky and James Martin.
CSA2050 Introduction to Computational Linguistics Lecture 1 What is Computational Linguistics?
Rules, Movement, Ambiguity
Artificial Intelligence: Natural Language
The man bites the dog man bites the dog bites the dog the dog dog Parse Tree NP A N the man bites the dog V N NP S VP A 1. Sentence  noun-phrase verb-phrase.
CSA2050 Introduction to Computational Linguistics Parsing I.
Computing Science, University of Aberdeen1 CS4025: Grammars l Grammars for English l Features l Definite Clause Grammars See J&M Chapter 9 in 1 st ed,
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Spring 2006-Lecture 2.
Natural Language Processing Slides adapted from Pedro Domingos
Natural Language Processing (NLP)
Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.
Human Language...human brains, have been armed by habits and methods, mind-tools and information, drawn from millions of other brains which are not ancestral.
NATURAL LANGUAGE PROCESSING
Natural Language Processing
Statistical NLP: Lecture 3
Basic Parsing with Context Free Grammars Chapter 13
CS : Speech, NLP and the Web/Topics in AI
CSC 594 Topics in AI – Applied Natural Language Processing
BBI 3212 ENGLISH SYNTAX AND MORPHOLOGY
Natural Language - General
Natural Language Processing
CS246: Information Retrieval
Artificial Intelligence 2004 Speech & Natural Language Processing
Presentation transcript:

Natural Language Processing what it does what is involved why is it difficult brief history

sentence  structured rep n s of meaning "how old is my help3.doc file?" Lisp: (query (file-detail 'date "C:/help3.doc")) "the large cat chased the rat" Logic: (  1 s 1  large(s 1 )  feline(s 1 ))  (  1 s 2  rodent(s 2 ))  chased(s 1, s 2 ) "the young boy ate a bad apple" CD Graph...see next page...

CD graph "the young boy ate a bad apple"

what is involved symbolic computation ie: symbols manipulated by symbol processors search & inference knowledge representation techniques

prejudice, politics, etc ambiguity... syntactic semantic pragmatic why is it difficult

example sentences the old man the boats my car drinks petrol I saw the Eiffel Tower flying to Paris he opened the door with the key he opened the door with the squeaking hinge the boy kicked the ball under the tree the boy kicked the wall under the tree put the bottles in the box on the shelf by the door

1950sRussian  English translation 1956Chomsky 1960sPattern matching 1970sParsing & some KnRep 1980sKn & inference 1990sbig dreams small results 2000+quietly promising (brief) history of language processing

matching: Sir

matching: Student

matching: Elisa

a modern approach input sentence syntax analysis (parsing) semantic analysis pragmatic analysis target representation grammar lexicon semantic rules contextual information morphological processing

step 1- morphological processing objective: strip words into roots & modifiers issues inflection(cat pl  cat-s) derivation(happy adj  happiness noun) compounding(toothpaste)

morphological processing - notes all(?) spoken lngs exhibit morphology easier to handle in written lngs if not iconic some morphology describes infm beyond syntax eg:proximity(Tamil, Setswana, etc) case speaker / listener peer relationship

morphology examples

step 2- syntax analysis objectives:1check for correctness 2produce phrase structure uses parsera rule-based search engine grammarcontext-free production rules lexicondictionary of words & their categories

syntax rules parts of speech rules of combination consider the cat chases the mouse all large black dogs chase cats

example 1 - using Lkit (build-lexicon '((a determiner) (cat noun) (dog noun) (the determiner) (chased verb))) (build-grammar '((s1 (sentence -> noun-phrase verb-phrase)) (np (noun-phrase -> determiner noun)) (vp (verb-phrase -> verb noun-phrase)) ))

example 1 - output (parse 'sentence '(the dog chased a cat)) complete-edge 0 5 s1 sentence (the dog...) nil s1 sentence -> (noun-phrase verb-phrase) Syntax (sentence (noun-phrase (determiner the) (noun dog)) (verb-phrase (verb chased) (noun-phrase (determiner a) (noun cat)))) Semantics (sentence)

so what ? we want meaning

Remember: "the young boy ate a bad apple" how can semantics be encoded as symbols? the boy / an apple? young/old, happy/sad, good/bad? how can semantics be generated? what can be inferred from semantics?

Reminder: "the young boy ate a bad apple"

symbolic representation of semantics (actor (root boy) (id boy#732) (tags animate human male) (qual (age (val 5) (approx 3))) (quant specific)) (action (primitve INGEST)) (object (root apple) (id nil) (tags physob veg fruit food) (qual (phy-state -4)) (quant non-specific))

semantics in lexicon a simple example (build-lexicon '((a det any ) (cat noun feline ) (chased verb hunts ) (dog noun canine ) (the det specific) ))

semantics in grammar rules (s1 (sentence -> noun-phrase verb-phrase) (actor. noun-phrase) (action. verb-phrase.action) (object. verb-phrase.object) ) (np (noun-phrase -> det noun) (det. noun) ) (vp (verb-phrase -> verb noun-phrase) (action. verb) (object. noun-phrase) )

semantics - results (parse 'sentence '(the dog chased a cat)) complete-edge 0 5 s1 sentence (the dog...) nil s1 sentence -> (noun-phrase verb-phrase) Syntax (sentence (noun-phrase (det the) (noun dog)) (verb-phrase (verb chased) (noun-phrase (det a) (noun cat)))) Semantics (sentence (actor (specific canine)) (action hunts) (object (any feline)))

semantics in lexicon - checks 1 (a det (sems. any)) (all det (sems. every)) (cat noun (sems. feline) (num. sing)) (cats noun (sems. feline) (num. plur)) (chase verb (sems. hunts) (num. plur)) (chases verb (sems. hunts) (num. sing)) (dog noun (sems. canine) (num. sing)) (dogs noun (sems. canine) (num. plur)) (the det (sems. specific))

semantics in grammar - checks 1 (s1 (sentence -> noun-phrase verb-phrase) (actor. noun-phrase.sems) (action. verb-phrase.action) (object. verb-phrase.object) ; check number of noun-phrase & verb-phrase (if (noun-phrase.number = verb- phrase.number) numeric-agreement-ok numeric-agreement-bad )

semantics - results (parse 'sentence '(the dog chases a cat)) complete-edge 0 5 s1 sentence (the dog...) nil s1 sentence -> (noun-phrase verb-phrase) Syntax (sentence (noun-phrase (det the) (noun dog)) (verb-phrase (verb chases) (noun-phrase (det a) (noun cat)))) Semantics (sentence (actor specific canine) (action. hunts) (object any feline) numeric-agreement-ok)

semantics - results (parse 'sentence '(the dogs chases a cat)) complete-edge 0 5 s1 sentence (the dog...) nil s1 sentence -> (noun-phrase verb-phrase) Syntax (sentence (noun-phrase (det the) (noun dog)) (verb-phrase (verb chases) (noun-phrase (det a) (noun cat)))) Semantics (sentence (actor specific canine) (action. hunts) (object any feline) numeric-agreement-bad)

semantics in grammar - checks 2 (s1 (sentence -> noun-phrase verb-phrase) (fail if noun-phrase.number /= verb-phrase.number) (actor. noun-phrase.sems) (action. verb-phrase.action) (object. verb-phrase.object) )

semantics - results (parse 'sentence '(the dog chases a cat)) Semantics (sentence (actor specific canine) (action. hunts) (object any feline)) (parse 'sentence '(the dogs chases a cat)).... failed....

semantics in grammar - checks 3 (s1 (sentence -> noun-phrase verb-phrase) (glitch numeric-agreement if not noun-phrase.number = verb-phrase.number) (actor. noun-phrase.sems) (action. verb-phrase.action) (object. verb-phrase.object) )

semantics - results (parse 'sentence '(the dogs chases a cat)) complete-edge 0 5 s1 sentence (the dogs...) nil Glitches: (numeric-agreement) s1 sentence -> (noun-phrase verb-phrase) Syntax (sentence (noun-phrase (det the) (noun dogs)) (verb-phrase (verb chases) (noun-phrase (det a) (noun cat)))) Semantics (sentence (actor specific canine) (action. hunts) (object any feline))

example 2 - lexicon (a det any ) (cat noun feline ) (chase verb hunts ) (dog noun canine ) (the det specific) (black adj (color black)) (large adj (size 7/10)) (small adj (size 3/10))

example 2 - grammar (build-grammar '((np (noun-phrase -> ?det *adj noun) (if det (quantification. det) (quantification undefined)) (qualifiers. *.adj) (object. noun) ))

example 2 - results (parse 'noun-phrase '(small black dog)) complete-edge 0 3 np noun-phrase (small...) nil np noun-phrase -> (?det *adj noun) Syntax (noun-phrase (adj small) (adj black) (noun dog)) Semantics (noun-phrase (quantification undefined) (qualifiers ((size. 3/10)) ((color. black))) (object canine))

example 2 - results small dogs chase the small cats and large dogs chase the large cats (sentence conjunction ((actor (quant undefined) (qual (size. 3/10)) (object. canine)) (action. hunts) (object (quant. specific) (qual (size. 3/10)) (object. feline))) ((actor (quant undefined) (qual (size. 7/10)) (object. canine)) (action. hunts) (object (quant. specific) (qual (size. 7/10)) (object. feline))))

semantic processing (one approach) semantic rules in grammar  1st stage case frame verb form  primitive action case frame disambiguate & fill additional case frame slots check references with world and/or dialog do statement level inference integrate with dialog do event sequence dialog

step-1: produce raw case frame verb cases the cat chased the rat in the kitchen the cat chased the rat into the kitchen common cases sourcestart-timeinstrument destinationend-timebeneficiary locationduration

the ambiguity problem eg: the boy kicked the ball under the tree grammar rules S  S PP S  NP VP NP  ?det *adj noun NP  NP PP

example frame #1 actor (quant specific) (tags animate male human) (qual (age (range 3 13))) (root boy) action (root kick) object (root ball) (tags manip) (posn-relative (locator beneath) (object (root tree)...etc... )

example frame #2 actor (quant specific) (tags animate male human) (qual (age (range 3 13))) (root boy) action (root kick) object (root ball) (tags manip) dest (posn-relative (locator beneath) (object (root tree)...etc... )

example verb form #1 primitive strike prohibited object (tags manip) slots instrument (part-of $actor foot) legal start-time, end-time, duration instrument, beneficiary, location illegal source, dest

example verb form #2 primitive push required object (tags manip) slots instrument (part-of $actor foot) legal source, dest, start-time, end-time, instr, beneficiary, locatn, duration

semantic processing (one approach) ×semantic rules in grammar  1st stage case frame ×verb form  primitive action case frame ×disambiguate & fill additional case frame slots Þcheck references with world and/or dialog Þdo statement level inference integrate with dialog do event sequence dialog

integration with dialog dialogs have... players (actors) props (objects) locations (from case frames) themes (derived) event sequences (from themes) plans (from themes and/or derived)

event sequence set of... players (actors) props (objects) series of... semantically encoded activities (matched) escapes, exceptions & alternatives

reading – grammars, etc A good source of links & references... “Computational Analysis of Prepositions” sharaf/computational-analysis-of-prepositions/3hc3uny2z7r41/4# if you only plan to read one article... Baldwin, T. Kordoni, V and Villavicencio, A Prepositions in Applications: A Survey and Introduction to the Special Issue ". Computational Linguistics 35 (2): 119–149. also... Litkowski, Kenneth C. and Orin Hargraves SemEval-2007 task 06: Word- sense disambiguation of prepositions. In Proceedings of the 4th International Workshop on Semantic Evaluations, pages 24–29, Prague. Disambiguation of Preposition Sense Using Linguistically Motivated Features, Stephen Tratz and Dirk Hovy. Proceedings of the NAACL HLT Student Research Workshop and Doctoral Consortium, pages 96–100, Boulder, Colorado, June c 2009 Association for Computational Linguistics

reading – grammars, etc the NLP dictionary: for practical help with building grammars check the following (it is about 10 years old but then so is the English language :o) A Grammar Writer’s Cookbook. Miriam Butt, Tracy Holloway King, Marma-Eugenia Niño and Fridirique Segond also (for writing larger grammars) it is useful to find a book on grammar for tutors and/or students of English as a second language. for a broad (if a little formal) take on semantics try dipping into... Semantics-Oriented Natural Language Processing Mathematical Models and Algorithms. Vladimir Fomichov A. 2010

reading – kn rep for NLP logic and knowledge representation – a guide %20PR.pdf representing events for NLP representing%20events%22&source=web&cd=6&sqi=2&ved=0CEgQFjAF&url=h ttp%3A%2F%2Fwww.aaai.org%2Focs%2Findex.php%2FFSS%2FFSS10%2Fpap er%2Fdownload%2F2183%2F2819&ei=f6oWT_e7DeKC4gTMpaijBA&usg=AFQjC NFYmurwJR9oqfCRBimVprWRK45kew&cad=rja semantic networks & frames (2005) VERL: An Ontology Framework for Video Events (2005)