Introduction to Computational Linguistics Dr. Radhika Mamidi ENG 270 Lecture 2.



CL vs NLP
– CL and NLP are related but differ in focus.
– Computational Linguistics aims to model language as people do.
– Natural Language Processing processes language from a computational point of view in order to build different applications and tools.
– Applications on the computer side

History: ’s
– Development of formal language theory (Chomsky, Kleene, Backus)
  – Formal characterization of classes of grammar (context-free, regular)
  – Association with relevant automata
– Probability theory: language understanding as decoding through noisy channel (Shannon)
  – Use of information-theoretic concepts like entropy to measure success of language models

Symbolic vs. Stochastic
– Symbolic
  – Use of formal grammars as basis for natural language processing and learning systems (Chomsky, Harris)
  – Use of logic and logic-based programming for characterizing syntactic or semantic inference (Kaplan, Kay, Pereira)
  – First toy natural language understanding and generation systems (Woods, Minsky, Schank, Winograd)
  – Discourse Processing: Role of Intention, Focus (Grosz, Sidner, Hobbs)
– Stochastic Modeling
  – Probabilistic methods for early speech recognition, OCR (Bledsoe and Browning, Jelinek, Black, Mercer)

: Return of Empiricism
– Use of stochastic techniques for part of speech tagging, parsing, word sense disambiguation, etc.
– Comparison of stochastic, symbolic and other models for language understanding and learning tasks.

1993-Present
– Advances in software and hardware create NLP needs for information retrieval (web), machine translation, spelling and grammar checking, speech recognition and synthesis.

Language and Intelligence: Turing Test
– Turing test: machine, human, and human judge
– Judge asks questions of computer and human.
  – Machine’s job is to act like a human.
  – Human’s job is to convince judge that he’s not the machine.
– Machine judged “intelligent” if it can fool judge.
– Judgment of “intelligence” linked to appropriate answers to questions from the system.

ELIZA
– A simple “Rogerian Psychologist”
– Uses pattern matching to carry on a limited form of conversation.
– It gives a feeling that it is “human”.
– Seems to pass the “Turing Test”
– It is one of the first chatbots.
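The pattern-matching idea behind ELIZA can be sketched in a few lines. This is a minimal illustration, not Weizenbaum's original script: the rules, the reply templates and the names RULES, DEFAULT_REPLY and respond are all assumptions made up for this example.

```python
import re

# Hypothetical rules for illustration only; not ELIZA's actual script.
# Each rule pairs a pattern with a reply template that reuses the match.
RULES = [
    (re.compile(r"\bI am (.+)", re.IGNORECASE), "Why do you say you are {0}?"),
    (re.compile(r"\bI feel (.+)", re.IGNORECASE), "How long have you felt {0}?"),
    (re.compile(r"\bmy (mother|father)\b", re.IGNORECASE), "Tell me more about your {0}."),
]
DEFAULT_REPLY = "Please go on."


def respond(utterance: str) -> str:
    """Return the reply of the first rule whose pattern matches."""
    text = utterance.strip().rstrip(".!?")
    for pattern, template in RULES:
        match = pattern.search(text)
        if match:
            return template.format(*match.groups())
    # No rule matched: fall back to a content-free prompt.
    return DEFAULT_REPLY


if __name__ == "__main__":
    for line in ["I am sad.", "My mother ignores me", "It rained today"]:
        print(respond(line))
```

The program has no model of meaning. It only echoes fragments of the input, which is why the conversation stays limited.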

Ambiguity - Mental processing
– He showed me the mouse - rodent/object
– The leopard was spotted - verb/adjective
– She hit the boy with the umbrella
– I am reading a book on films - nowadays/right now
– Mary promised Sally(i) to go to her(i) party
– Mary(i) persuaded Sally to go to her(i) party

What’s involved in an “intelligent” Answer?
Analysis: Decomposition of the signal (spoken or written) eventually into meaningful units. This involves …
– Phonology
– Morphology
– Syntax
– Discourse Analysis
– Semantics
– Pragmatics

Levels of Language Processing
– Phonology
– Morphology
– Syntax
– Semantics
– Pragmatics
– Discourse Analysis

Examples
– Pronounce “GHOTI”
– I scream, A nameless man
– change, kite, park, fine
– Fine for parking!
– Flying planes can be dangerous.
– If the baby doesn’t thrive on raw milk, boil it!
– How was it?

Speech/Character Recognition
– Decomposition into words, segmentation of words into appropriate phones or letters
– Requires knowledge of phonological patterns

Applications
– Text to speech
  – Riyadh is the capital city of the Kingdom of Saudi Arabia. Riyadh is a beautiful place. I love living here.
  – Use: Public announcements – airport, railway stations
– Speech Recognition
  – Use: Pronunciation dictionaries, mobile phones, voice commands in PC

Some problems
– Grapheme to Phoneme conversion
– Different spellings – same pronunciation
– Same spellings – different pronunciation
– Example:
  – read, bow, dove
  – reed-read, bear-bare
– Numbers, Names, Acronyms
  – 1980, St., PSU
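The two spelling problems above can be made concrete with a toy pronunciation lexicon. This is only a sketch: the phoneme strings are informal ARPAbet-style transcriptions written for this example rather than taken from a real dictionary, and the names LEXICON, is_homograph and are_homophones are assumptions.

```python
# Toy pronunciation lexicon, assumed for illustration only.
# A spelling maps to a list because one spelling can have several sounds.
LEXICON = {
    "read": ["R IY D", "R EH D"],  # present tense vs. past tense
    "reed": ["R IY D"],
    "bow": ["B OW", "B AW"],       # the weapon vs. the gesture
    "bear": ["B EH R"],
    "bare": ["B EH R"],
}


def pronunciations(word):
    """Return all listed pronunciations of a word (empty if unknown)."""
    return LEXICON.get(word.lower(), [])


def is_homograph(word):
    """Same spelling, more than one pronunciation."""
    return len(pronunciations(word)) > 1


def are_homophones(a, b):
    """Different spellings that share at least one pronunciation."""
    shared = set(pronunciations(a)) & set(pronunciations(b))
    return a.lower() != b.lower() and bool(shared)


if __name__ == "__main__":
    print(is_homograph("read"))            # same spelling, two sounds
    print(are_homophones("bear", "bare"))  # two spellings, one sound
```

A lookup table alone cannot decide which pronunciation of "read" to use in a given sentence; that choice needs context from the later processing levels.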

Heterarchical model of Language Processing (diagram)
– INPUTS
– Processing stages: Lexical Processing, Syntactic Processing, Semantic Processing, Discourse Processing
– Memory: General Knowledge, Lexicon, Syntactic Rules, Semantic Rules, Discourse Rules
– OUTPUTS

Morphological Analysis
– Inflectional morphology
  – Word variation reflects features like tense, number, degree, gender
  – Grammatical category remains the same, e.g. eat-eats, boy-boys, thin-thinner
– Derivational morphology
  – Word variation changes grammatical category, e.g. act-actor, boy-boyish
  – Word variation maintains grammatical category, e.g. fair-unfair, like-dislike
– Inflection follows Derivation: act--actor--actors
– Morphological analyzer identifies roots and affixes
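A morphological analyzer of the kind described above can be sketched as affix stripping against a small lexicon. The root list, the affix lists and the function name analyze are assumptions for this example. The sketch peels off the inflectional suffix first because inflection is the outermost layer (act--actor--actors).

```python
# Toy lexicon and affix lists, assumed for illustration only.
ROOTS = {"eat", "boy", "thin", "act", "fair", "like"}
PREFIXES = ["un", "dis"]
DERIVATIONAL_SUFFIXES = ["or", "ish"]
INFLECTIONAL_SUFFIXES = ["s", "er"]


def analyze(word):
    """Return (prefix, root, derivational suffix, inflectional suffix) or None."""
    word = word.lower()
    # Inflection is outermost, so strip it first, then derivation, then prefix.
    for infl in [""] + INFLECTIONAL_SUFFIXES:
        if infl and not word.endswith(infl):
            continue
        stem = word[: len(word) - len(infl)]
        for deriv in [""] + DERIVATIONAL_SUFFIXES:
            if deriv and not stem.endswith(deriv):
                continue
            base = stem[: len(stem) - len(deriv)]
            for prefix in [""] + PREFIXES:
                if prefix and not base.startswith(prefix):
                    continue
                root = base[len(prefix):]
                if root in ROOTS:
                    return (prefix, root, deriv, infl)
    return None  # no analysis found with this toy lexicon


if __name__ == "__main__":
    for w in ["actors", "unfair", "boyish", "eats"]:
        print(w, analyze(w))
```

The sketch does not handle spelling changes at morpheme boundaries, so a form such as "thinner" (with its doubled consonant) gets no analysis here. Real analyzers add spelling rules for such cases.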

Syntactic Parsing
– Process of identifying syntactic structure of a valid sentence
– Represented by trees, rules and networks
– Syntax Components
  – Phrase Structure Rules
  – Transformational Rules
– Syntactic Parsers
  – e.g. Augmented Transition Networks

Syntax Component
– Chomsky’s (1965) model of language
– Phrase Structure rules generate deep structures
– Deep Structure holds all the syntactic information needed to derive the meaning of a sentence
  – This is fed into the semantic component to obtain acceptable combinations
– Transformational rules map deep structures to surface structure
– Surface Structure has words in the right order
  – The phonetic form is obtained after feeding surface structure into the phonological component

Chomsky’s model (diagram)
– SYNTAX COMPONENT: Phrase Structure Rules, Deep structures, Transformational rules, Surface structures
– SEMANTIC COMPONENT: Selection restriction rules, Lexicon
– PHONOLOGICAL COMPONENT: Phonological rules

Augmented Transition Networks
– Developed by Woods (1970)
– Series of states with arrows (arcs) linking one state to the next
– Works through a sentence from left to right
– The arcs are labelled
– Group of words stored temporarily in ‘register’
  – helps in look ahead - which arc to take next

Transition networks (diagram)
– S: s1 --NP--> s2 --VP--> s3
– NP: s1 --article--> s2 --noun--> s3, with an Empty arc and an Adj loop
– VP: s1 --verb--> s2 --NP--> s3
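The three networks can be run as a small recognizer. This sketch is a plain recursive transition network: it follows labelled arcs from left to right but leaves out the registers that make a network "augmented". The lexicon, the names NETWORKS, traverse and accepts, and the exact placement of the Empty arc (skipping the article) and the Adj loop (on s2) are assumptions, since the diagram does not pin them down.

```python
# Toy lexicon; the word categories are assumptions for illustration.
LEXICON = {
    "riyadh": "noun", "place": "noun",
    "a": "article", "the": "article",
    "beautiful": "adj",
    "is": "verb",
}

# Each network maps a state to its outgoing arcs (label, next_state).
# Lowercase labels consume one word of that category, uppercase labels
# call another network, and "EMPTY" moves on without consuming a word.
# Assumed layout: Empty skips the article; the Adj loop sits on s2.
NETWORKS = {
    "S": {"s1": [("NP", "s2")], "s2": [("VP", "s3")], "s3": []},
    "NP": {"s1": [("article", "s2"), ("EMPTY", "s2")],
           "s2": [("adj", "s2"), ("noun", "s3")], "s3": []},
    "VP": {"s1": [("verb", "s2")], "s2": [("NP", "s3")], "s3": []},
}
FINAL_STATE = "s3"


def traverse(network, state, words, pos):
    """Yield every input position at which `network` can finish."""
    if state == FINAL_STATE:
        yield pos
        return
    for label, nxt in NETWORKS[network][state]:
        if label == "EMPTY":
            yield from traverse(network, nxt, words, pos)
        elif label in NETWORKS:
            # Call the sub-network, then continue from where it ended.
            for end in traverse(label, "s1", words, pos):
                yield from traverse(network, nxt, words, end)
        elif pos < len(words) and LEXICON.get(words[pos]) == label:
            yield from traverse(network, nxt, words, pos + 1)


def accepts(sentence):
    """True if the S network can consume the whole sentence."""
    words = sentence.lower().rstrip(".").split()
    return any(end == len(words) for end in traverse("S", "s1", words, 0))


if __name__ == "__main__":
    print(accepts("Riyadh is a beautiful place."))
    print(accepts("Riyadh is"))
```

Trying every arc and backtracking replaces the look-ahead that registers provide in a full ATN; it is simpler to write but does more work on longer sentences.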

Parse tree (diagram)
S
  NP
    N: Riyadh
  VP
    V: is
    NP
      art: a
      Adj: beautiful
      Noun: place

Example of syntactic analysis by ‘Link parser’.
Riyadh is a beautiful place.
(S (NP Riyadh) (VP is (NP a beautiful place)) .)
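A bracketed analysis of this kind is easy to turn into a tree structure. The reader below is a minimal sketch written for this example, not part of the Link parser; the names parse_brackets and leaves are assumptions, and the input string is the analysis of "Riyadh is a beautiful place." shown above.

```python
def parse_brackets(text):
    """Parse a bracketed constituent string into nested (label, children) tuples."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def node():
        nonlocal pos
        if tokens[pos] != "(":
            raise ValueError("expected '(' at token %d" % pos)
        pos += 1
        label = tokens[pos]  # the constituent label, e.g. S, NP, VP
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(node())       # nested constituent
            else:
                children.append(tokens[pos])  # a word or punctuation mark
                pos += 1
        pos += 1  # consume the closing ")"
        return (label, children)

    return node()


def leaves(tree):
    """Return the words of the tree from left to right."""
    words = []
    for child in tree[1]:
        words.extend(leaves(child) if isinstance(child, tuple) else [child])
    return words


if __name__ == "__main__":
    tree = parse_brackets("(S (NP Riyadh) (VP is (NP a beautiful place)) .)")
    print(tree)
    print(" ".join(leaves(tree)))
```

Reading the leaves back from left to right recovers the original sentence, which is a quick check that the brackets were parsed correctly.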