Intro to Natural Language Processing1 Introduction to Natural Language Processing August 28, 2012 Lecture #1.

Slides:



Advertisements
Similar presentations
Natural Language Processing (or NLP) Reading: Chapter 1 from Jurafsky and Martin, Speech and Language Processing: An Introduction to Natural Language Processing,
Advertisements

Introduction to Computational Linguistics
Semantics (Representing Meaning)
CS 4705 Natural Language Processing Julia Hirschberg COMS 4705 Fall 2010.
Leksička semantika i pragmatika 5. predavanje. Ambiguity Find at least 5 meanings of this sentence: –I made her duck I cooked waterfowl for her benefit.
Introduction to Natural Language Processing A.k.a., “Computational Linguistics”
Syntax and Context-Free Grammars Julia Hirschberg CS 4705 Slides with contributions from Owen Rambow, Kathy McKeown, Dan Jurafsky and James Martin.
Statistical NLP: Lecture 3
Discourse Martin Hassel KTH NADA Royal Institute of Technology Stockholm
Introduction and Jurafsky Model Resource: A Probabilistic Model of Lexical and Syntactic Access and Disambiguation, Jurafsky 1996.
Oct 2009HLT1 Human Language Technology Overview. Oct 2009HLT2 Acknowledgement Material for some of these slides taken from J Nivre, University of Gotheborg,
1 Words and the Lexicon September 10th 2009 Lecture #3.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
Intro to Natural Language Processing1 Introduction to Natural Language Processing September 1, 2009 Lecture #1.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Introduction to Deep Processing Techniques for NLP Deep Processing Techniques for Natural Language Processing Ling 571 January 3, 2011 Gina-Anne Levow.
Introduction to Computational Linguistics Lecture 2.
CS 4705 Lecture 1 CS4705 Introduction to Natural Language Processing.
CS4705 Natural Language Processing Fall What will we study in this course? How can machines recognize and generate text and speech? – Human language.
CS 4705 Natural Language Processing What is Natural Language Processing? The study of human languages and how they can be represented computationally.
CS4705 Natural Language Processing Fall What will we study in this course? How can machines recognize and generate text and speech? – Human language.
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Fall 2005-Lecture 2.
PSY 369: Psycholinguistics Some basic linguistic theory part3.
CS4705 Natural Language Processing Fall  How can machines recognize and generate text and speech? ◦ Human language phenomena ◦ Theories, often.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Lecture 2, 7/22/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 2 22 July 2005.
9/8/20151 Natural Language Processing Lecture Notes 1.
Linguistics, Pragmatics & Natural Grammar
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
1 Natural Language Processing Gholamreza Ghassem-Sani.
Computational Linguistics INTroduction
1 Natural Language Processing Gholamreza Ghassem-Sani Fall 1383.
1 Computational Linguistics Ling 200 Spring 2006.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
Natural Language Processing Rogelio Dávila Pérez Profesor – Investigador
THE BIG PICTURE Basic Assumptions Linguistics is the empirical science that studies language (or linguistic behavior) Linguistics proposes theories (models)
CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.
Discourse Analysis Force Migration and Refugee Studies Program The American University in Cairo Professor Robert S. Williams.
Introduction to CL & NLP CMSC April 1, 2003.
Natural Language Processing Daniele Quercia Fall, 2000.
Levels of Language 6 Levels of Language. Levels of Language Aspect of language are often referred to as 'language levels'. To look carefully at language.
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Linguistic Essentials
CSA2050 Introduction to Computational Linguistics Lecture 1 Overview.
Parsing with Context-Free Grammars for ASR Julia Hirschberg CS 4706 Slides with contributions from Owen Rambow, Kathy McKeown, Dan Jurafsky and James Martin.
CSA2050 Introduction to Computational Linguistics Lecture 1 What is Computational Linguistics?
Rules, Movement, Ambiguity
Artificial Intelligence: Natural Language
1 Introduction to Computational Linguistics Eleni Miltsakaki AUTH Spring 2006-Lecture 1.
ACE TESOL Diploma Program – London Language Institute OBJECTIVES You will understand: 1. The terminology and concepts of semantics, pragmatics and discourse.
Deep structure (semantic) Structure of language Surface structure (grammatical, lexical, phonological) Semantic units have all meaning components such.
CSE467/567 Computational Linguistics Carl Alphonce Computer Science & Engineering University at Buffalo.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
Natural Language Processing Slides adapted from Pedro Domingos
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Natural Language Processing (NLP)
NATURAL LANGUAGE PROCESSING
Introduction to Deep Processing Techniques for NLP Deep Processing Techniques for NLP Ling 571 January 4, 2016 Gina-Anne Levow.
Natural Language Processing [05 hours/week, 09 Credits] [Theory]
Statistical NLP: Lecture 3
Natural Language Processing
Natural Language - General
Linguistic Essentials
Artificial Intelligence 2004 Speech & Natural Language Processing
Presentation transcript:

Intro to Natural Language Processing1 Introduction to Natural Language Processing August 28, 2012 Lecture #1

Intro to Natural Language Processing2 Course Information Instructor: Prof. Kathy McCoy Times: Tues/Thurs 9:30-10:45 Place: 102A Smith Hall Course Syllabus Home page:

Intro to Natural Language Processing3 Text Required Text: Daniel Jurafsky and James H. Martin, Speech and Language Processing, Second Edition, Prentice- Hall.Speech and Language Processing, Second Edition

Intro to Natural Language Processing4 What is Natural Language Processing? The study of human languages and how they can be represented computationally and analyzed and generated algorithmically –The cat is on the mat. --> on (mat, cat) –on (mat, cat) --> The cat is on the mat. Studying NLP involves studying natural language, formal representations, and algorithms for their manipulation

Intro to Natural Language Processing5 What is Natural Language Processing? Building computational models of natural language comprehension and production Other Names: Computational Linguistics (CL) Human Language Technology (HLT) Natural Language Engineering (NLE) Speech and Text Processing

Intro to Natural Language Processing6 Engineering Perspective Use CL as part of a larger application: –Spoken dialogue systems for telephone based information systems –Components of web search engines or document retrieval services Machine translation Question/answering systems Text Summarization –Interface for intelligent tutoring/training systems Emphasis on –Robustness (doesn’t collapse on unexpected input) –Coverage (does something useful with most inputs) –Efficiency (speech; large document collections)

Intro to Natural Language Processing7 Cognitive Science Perspective Goal: gain an understanding of how people comprehend and produce language. Goal: a model that explains actual human behaviour Solution must: explain psycholinguistic data be verified by experimentation

Intro to Natural Language Processing8 Theoretical Linguistics Perspective In principle, coincides with the Cognitive Science Perspective CL can potentially help test the empirical adequacy of theoretical models. Linguistics is typically a descriptive enterprise. Building computational models of the theories allows them to be empirically tested. E.g., does your grammar correctly parse all the grammatical examples in a given test suite, while rejecting all the ungrammatical examples?

Intro to Natural Language Processing9 Orientation of this Class Emphasis on principles and techniques Emphasis on processing textual input (as opposed to speech) More oriented towards symbolic than statistical approaches

Intro to Natural Language Processing10 Language as Goal-Oriented Behaviour We speak for a reason, e.g., –get hearer to believe something –get hearer to perform some action –impress hearer Language generators must determine how to use linguistic strategies to achieve desired effects Language understanders must use linguistic knowledge to recognise speaker’s underlying purpose

Intro to Natural Language Processing11 Examples (1)It’s hot in here, isn’t it? (2)Can you book me a flight to London tomorrow morning? (3) P:What time does the train for Washington, DC leave? C:6:00 from Track 17.

Speech and Language Processing - Jurafsky and Martin 12 Why Should You Care? Two trends amount of is now 1.An enormous amount of knowledge is now available in machine readable form as natural language text 2.Conversational agents are becoming an important form of human- computer communication 3.Much of human-human communication is now mediated by computers

Intro to Natural Language Processing13 Knowledge needed to understand and produce language Phonetics and phonology: how words are related to sounds that realize them Morphology: how words are constructed from more basic meaning units Syntax: how words can be put together to form correct utterances Lexical semantics: what words mean Compositional semantics: how word meanings combine to form larger meanings Pragmatics: how situation affects interpretation of utterance Discourse structure: how preceding utterances affects processing of next utterance

Intro to Natural Language Processing14 What can we learn about language? Phonetics and Phonology: speech sounds, their production, and the rule systems that govern their use –tap, butter –nice white rice; height/hot; kite/cot; night/not... –city hall, parking lot, city hall parking lot –The cat is on the mat. The cat is on the mat?

Intro to Natural Language Processing15 Morphology How words are constructed from more basic units, called morphemes friend + ly = friendly nounSuffix -ly turns noun into an adjective (and verb into an adverb)

Intro to Natural Language Processing16 Morphology: words and their composition –cat, cats, dogs –child, children –undo, union

Intro to Natural Language Processing17 Syntactic Knowledge how words can be put together to form legal sentences in the language what structural role each word plays in the sentence what phrases are subparts of other phrases modifier noun phrase The white book by Jurafsky and Martin is fascinating. prepositional phrase

Intro to Natural Language Processing18 Syntax: the structuring of words into larger phrases –John hit Bill –Bill was hit by John (passive) –Bill, John hit (preposing) –Who John hit was Bill (wh-cleft)

Intro to Natural Language Processing19 Semantic Knowledge What words mean How word meanings combine in sentences to form sentence meanings The sole died. (selectional restrictions) Syntax and semantics work together! (1) What does it taste like? (2) What taste does it like? N.B. Context-independent meaning fishshoe part

Intro to Natural Language Processing20 Semantics: the (truth-functional) meaning of words and phrases –gun(x) & holster(y) & in(x,y) –fake (gun (x)) (compositional semantics) –The king of France is bald (presupposition violation) –bass fishing, bass playing (word sense disambiguation)

Intro to Natural Language Processing21 Pragmatics and Discourse: The influence of Context “Going Home” – A play in one act Scene 1: Pennsylvania Station, NY Bonnie: Long Beach? Passerby: Downstairs, LIRR Station. Scene 2: Ticket Counter, LIRR Station Bonnie: Long Beach? Clerk: $4.50.

Intro to Natural Language Processing22 Scene 3: Information Booth, LIRR Station Bonnie: Long Beach? Clerk: 4:19, Track 17. Scene 4: On the train, vicinity of Forest Hills Bonnie: Long Beach? Clerk: Change at Jamaica. Scene 5: On the next train, vicinity of Lynbrook Bonnie: Long Beach? Clerk: Right after Island Park.

Intro to Natural Language Processing23 Pragmatic Knowledge What utterances mean in different contexts He rushed to the bank. Jon was hot and desperate for a dunk in the river. river bank Jon suddenly realised he didn’t have any cash. financial institution

Intro to Natural Language Processing24 Discourse Structure Much meaning comes from simple conventions that we generally follow in discourse How we refer to entities –Indefinite NPs used to introduce new items into the discourse A woman walked into the cafe. –Definite NPs can be used to refer to subsequent references The woman sat by the window. –Pronouns used to refer to items already known in discourse She ordered a cappuccino.

Intro to Natural Language Processing25 Discourse Relations Relationships we infer between discourse entities Not expressed in either of the propositions, but from their juxtaposition 1. (a) I’m hungry. (b) Let’s go to the Fuji Gardens. 2. (a) Bush supports big business. (b) He’ll vote no on House Bill 1711.

Intro to Natural Language Processing26 Discourse and Temporal Interpretation Syntax and semantics: “him” refers to Max Lexical semantics and discourse: the pushing occurred before the falling. Max fell. John pushed him. explanation

Intro to Natural Language Processing27 Discourse and Temporal Interpretation Max fell. John pushed him. John and Max were struggling at the edge of the cliff. Here discourse knowledge tells us the pushing event occurred after the falling event

Intro to Natural Language Processing28 World knowledge What we know about the world and what we can assume our hearer knows about the world is intimately tied to our ability to use language I took the cake from the plate and ate it.

Intro to Natural Language Processing29 Ambiguity The categories of knowledge of language can be thought of as ambiguity-resolving components How many different interpretations does the above sentence have? How can each ambiguous piece be resolved? Does speech input make the sentence even more ambiguous? I made her duck.

Speech and Language Processing - Jurafsky and Martin 30 Ambiguity Computational linguists are obsessed with ambiguity Ambiguity is a fundamental problem of computational linguistics Resolving ambiguity is a crucial goal

Speech and Language Processing - Jurafsky and Martin 31 Ambiguity Find at least 5 meanings of this sentence: –I made her duck

Speech and Language Processing - Jurafsky and Martin 32 Ambiguity Find at least 5 meanings of this sentence: –I made her duck I cooked waterfowl for her benefit (to eat) I cooked waterfowl belonging to her I created the (plaster?) duck she owns I caused her to quickly lower her head or body I waved my magic wand and turned her into undifferentiated waterfowl

Speech and Language Processing - Jurafsky and Martin 33 Ambiguity is Pervasive I caused her to quickly lower her head or body –Lexical category: “duck” can be a N or V I cooked waterfowl belonging to her. –Lexical category: “her” can be a possessive (“of her”) or dative (“for her”) pronoun I made the (plaster) duck statue she owns –Lexical Semantics: “make” can mean “create” or “cook”

Speech and Language Processing - Jurafsky and Martin 34 Ambiguity is Pervasive Grammar: Make can be: –Transitive: (verb has a noun direct object) I cooked [waterfowl belonging to her] –Ditransitive: (verb has 2 noun objects) I made [her] (into) [undifferentiated waterfowl] –Action-transitive (verb has a direct object and another verb) –I caused [her] [to move her body]

Speech and Language Processing - Jurafsky and Martin 35 Ambiguity is Pervasive Phonetics! –I mate or duck –I’m eight or duck –Eye maid; her duck –Aye mate, her duck –I maid her duck –I’m aid her duck –I mate her duck –I’m ate her duck –I’m ate or duck –I mate or duck

Speech and Language Processing - Jurafsky and Martin 36 Dealing with Ambiguity Four possible approaches : 1.Tightly coupled interaction among processing levels; knowledge from other levels can help decide among choices at ambiguous levels. 2.Pipeline processing that ignores ambiguity as it occurs and hopes that other levels can eliminate incorrect structures.

Speech and Language Processing - Jurafsky and Martin 37 Dealing with Ambiguity 3.Probabilistic approaches based on making the most likely choices 4.Don’t do anything, maybe it won’t matter 1.We’ll leave when the duck is ready to eat. 2.The duck is ready to eat now. –Does the “duck” ambiguity matter with respect to whether we can leave?

Phonological / morphological analyser SYNTACTIC COMPONENT SEMANTIC INTERPRETER CONTEXTUAL REASONER Sequence of words Syntactic structure (parse tree) Logical form Meaning Representation Spoken input For speech understanding Phonological & morphological rules Grammatical Knowledge Semantic rules, Lexical semantics Pragmatic & World Knowledge Indicating relns (e.g., mod) between words Thematic Roles Selectional restrictions Basic Process of NLU “He loves Mary.” Mary He loves  x loves(x, Mary) loves(John, Mary) 38Intro to Natural Language Processing

39 It’s not that simple Syntax affects meaning 1. (a) Flying planes is dangerous. (b) Flying planes are dangerous. Meaning and world knowledge affects syntax 2.  (a) Flying insects is dangerous. (b) Flying insects are dangerous. 3. (a) I saw the Grand Canyon flying to LA. (b) I saw a condor flying to LA.

Words (Input) Syntactic Structure and Logical Form Final Meaning Words (Response) Syntactic Structure and Logical Form of Response Meaning of Response Parsing Contextual Interpretation Realisation Utterance Planning Application Reasoning Lexicon and Grammar Discourse Context Application Context 40Intro to Natural Language Processing

41 Can machines think? Alan Turing: the Turing test (language as test for intelligence) Three participants: a computer and two humans (one is an interrogator) Interrogator’s goal: to tell the machine and human apart Machine’s goal: to fool the interrogator into believing that a person is responding Other human’s goal: to help the interrogator reach his goal

Intro to Natural Language Processing42 Examples Q:Please write me a sonnet on the topic of the Forth Bridge. A:Count me out on this one. I never could write poetry. Q:Add to A: (after a pause)

Intro to Natural Language Processing43 Example (from a famous movie) Dave Bowman: Open the pod bay doors, HAL. HAL: I’m sorry Dave, I’m afraid I can’t do that.

Intro to Natural Language Processing44 Deconstructing HAL Recognizes speech and understands language Decides how to respond and speaks reply With personality Recognizes the user’s goals, adopts them, and helps to achieve them Remembers the conversational history Customizes interaction to different individuals Learns from experience Possesses vast knowledge, and is autonomous

Intro to Natural Language Processing45 The state of the art and the near- term future World-Wide Web (WWW) Sample scenarios: –generate weather reports in two languages –provide tools to help people with SSI to communicate –translate Web pages into different languages –speak to your appliances –find restaurants –answer questions –grade essays (?) –closed-captioning in many languages –automatic description of a soccer gams

Intro to Natural Language Processing46 NLP Applications Speech Synthesis, Speech Recognition, IVR Systems (TOOT: more or less succeeds)Speech Synthesismore less Information Retrieval (SCANMail demo)SCANMail demo Information Extraction –Question Answering (AQUA)AQUA Machine Translation (SYSTRAN)SYSTRAN Summarization (NewsBlaster)NewsBlaster Automated Psychotherapy (Eliza)Eliza

Intro to Natural Language Processing47 Web demos Dialogue –ELIZA –DiaLeague Machine Translation (Systran & Altavista) –Systran –Babel Fish Question-answering –Ask Jeeves Summarization (IBM) – summarizeDemo.html Speech synthesis (CSTR at Edinburgh) –Festival

Intro to Natural Language Processing48 The alphabet soup (NLP vs. CL vs. SP vs. HLT vs. NLE) NLP (Natural Language Processing) CL (Computational Linguistics) SP (Speech Processing) HLT (Human Language Technology) NLE (Natural Language Engineering) Other areas of research: Speech and Text Generation, Speech and Text Understanding, Information Extraction, Information Retrieval, Dialogue Processing, Inference Related areas: Spelling Correction, Grammar Correction, Text Summarization