Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Advertisements

Introduction to Computational Linguistics
Introduction to Computational Linguistics
Leksička semantika i pragmatika 5. predavanje. Ambiguity Find at least 5 meanings of this sentence: –I made her duck I cooked waterfowl for her benefit.
Introduction to Natural Language Processing A.k.a., “Computational Linguistics”
Introduction to the course Day 1 LING Computational Linguistics Harry Howard Tulane University.
Language Perception and Comprehension
Statistical NLP: Lecture 3
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
Oct 2009HLT1 Human Language Technology Overview. Oct 2009HLT2 Acknowledgement Material for some of these slides taken from J Nivre, University of Gotheborg,
Introduction to Semantics and Pragmatics. LING NLP 2 NLP tends to focus on: Syntax – Grammars, parsers, parse trees, dependency structures.
1 Words and the Lexicon September 10th 2009 Lecture #3.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
Natural Language Processing (NLP) Overview and history of the field Knowledge of language The role of ambiguity Models and Algorithms Eliza, Turing, and.
By Rohana Mahmud (NLP week 1-2)
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
PRAGMATICS. 3- Pragmatics is the study of how more gets communicated than is said. It explores how a great deal of what is unsaid is recognized. 4.
Regular Expressions and Automata Chapter 2. Regular Expressions Standard notation for characterizing text sequences Used in all kinds of text processing.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
What is Natural Language Processing (NLP)
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Lecture 2, 7/22/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 2 22 July 2005.
9/8/20151 Natural Language Processing Lecture Notes 1.
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
Search and Decoding in Speech Recognition
Chapter 14: Artificial Intelligence Invitation to Computer Science, C++ Version, Third Edition.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Parser-Driven Games Tool programming © Allan C. Milne Abertay University v
Chapter 10: Compilers and Language Translation Invitation to Computer Science, Java Version, Third Edition.
1 Computational Linguistics Ling 200 Spring 2006.
Introduction to CL & NLP CMSC April 1, 2003.
S1: Chapter 1 Mathematical Models Dr J Frost Last modified: 6 th September 2015.
Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
CS 124/LINGUIST 180 From Languages to Information
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Linguistic Essentials
Introduction to Computational Linguistics
1 The main topics in AI Artificial intelligence can be considered under a number of headings: –Search (includes Game Playing). –Representing Knowledge.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
CSE467/567 Computational Linguistics Carl Alphonce Computer Science & Engineering University at Buffalo.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
NATURAL LANGUAGE PROCESSING
Dialogue Modeling 2. Indirect Requests “Can I have a cup of coffee?”  One approach to dealing with these kinds of requests is by plan-based inference.
Selecting Relevant Documents Assume: –we already have a corpus of documents defined. –goal is to return a subset of those documents. –Individual documents.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
By Kyle McCardle.  Issues with Natural Language  Basic Components  Syntax  The Earley Parser  Transition Network Parsers  Augmented Transition Networks.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
CSC 594 Topics in AI – Natural Language Processing
Natural Language Processing [05 hours/week, 09 Credits] [Theory]
Natural Language Processing
Statistical NLP: Lecture 9
CSCI 5832 Natural Language Processing
CSCI 5832 Natural Language Processing
CSCI 5832 Natural Language Processing
Natural Language Processing
CPSC 503 Computational Linguistics
CS246: Information Retrieval
Artificial Intelligence 2004 Speech & Natural Language Processing
Information Retrieval
Presentation transcript:

Natural Language Processing Introduction

2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting tasks involving human languages. We are also concerned with the insights that such computational work gives us into human processing of language.

3 Why Should You Care? amount of is now 1.An enormous amount of knowledge is now available in machine readable form as natural language text 2.Conversational agents are becoming an important form of human-computer communication 3.Much of human-human communication is now mediated by computers

Commercial World Lots of exciting stuff going on … 4

5 Google Translate

6

7 Web Q/A

8 Weblog Analytics Data-mining of Weblogs, discussion forums, message boards, user groups, and other forms of user generated media  Product marketing information  Political opinion tracking  Social network analysis  Buzz analysis (what’s hot, what topics are people talking about right now).

9 Major Topics 1.Words 2.Syntax 3.Meaning 4. Applications exploiting each

10 Applications First, what makes an application a language processing application (as opposed to any other piece of software)?  An application that requires the use of knowledge about human languages  Example: Is Unix wc (word count) an example of a language processing application?

11 Applications Word count?  When it counts words: Yes  To count words you need to know what a word is. That’s knowledge of language.  When it counts lines and bytes: No  Lines and bytes are computer artifacts, not linguistic entities

12 Caveat NLP has an AI aspect to it.  We’re often dealing with ill-defined problems  We don’t often come up with exact solutions/algorithms  We can’t let either of those facts get in the way of making progress

13 Course Material We’ll be intermingling discussions of:  Linguistic topics  E.g. Morphology, syntax, semantics  Formal systems  E.g. Regular languages, context-free grammars  Applications  E.g. Machine translation, information extraction

14 Topics: Linguistics Word-level processing Syntactic processing Lexical and compositional semantics

15 Topics: Techniques Finite-state methods Context-free methods First order logic Probability models Supervised machine learning methods

16 Ambiguity Computational linguists are obsessed with ambiguity Ambiguity is a fundamental problem of computational linguistics Resolving ambiguity is a crucial goal

17 Ambiguity Find at least 5 meanings of this sentence:  I made her duck

18 Ambiguity Find at least 5 meanings of this sentence:  I made her duck I cooked waterfowl for her benefit (to eat) I cooked waterfowl belonging to her I created the (plaster?) duck she owns I caused her to quickly lower her head or body I waved my magic wand and turned her into undifferentiated waterfowl

19 Ambiguity is Pervasive I caused her to quickly lower her head or body  Lexical category: “duck” can be a N or V I cooked waterfowl belonging to her.  Lexical category: “her” can be a possessive (“of her”) or dative (“for her”) pronoun I made the (plaster) duck statue she owns  Lexical Semantics: “make” can mean “create” or “cook”

20 Ambiguity is Pervasive Grammar: Make can be:  Transitive: (verb has a noun direct object)  I cooked [waterfowl belonging to her]  Ditransitive: (verb has 2 noun objects)  I made [her] (into) [undifferentiated waterfowl]  Action-transitive (verb has a direct object and another verb)  I caused [her] [to move her body]

21 Dealing with Ambiguity Four possible approaches : 1.Tightly coupled interaction among processing levels; knowledge from other levels can help decide among choices at ambiguous levels. 2.Pipeline processing that ignores ambiguity as it occurs and hopes that other levels can eliminate incorrect structures.

22 Dealing with Ambiguity 3.Probabilistic approaches based on making the most likely choices 4.Don’t do anything, maybe it won’t matter 1.We’ll leave when the duck is ready to eat. 2.The duck is ready to eat now. Does the “duck” ambiguity matter with respect to whether we can leave?

23 Models and Algorithms By models we mean the formalisms that are used to capture the various kinds of linguistic knowledge we need. Algorithms are then used to manipulate the knowledge representations needed to tackle the task at hand.

24 Models State machines Rule-based approaches Logical formalisms Probabilistic models

25 Algorithms Many of the algorithms that we’ll study will turn out to be transducers; algorithms that take one kind of structure as input and output another. Unfortunately, ambiguity makes this process difficult. This leads us to employ algorithms that are designed to handle ambiguity of various kinds

26 Paradigms In particular..  State-space search  To manage the problem of making choices during processing when we lack the information needed to make the right choice  Dynamic programming  To avoid having to redo work during the course of a state-space search CKY, Earley, Minimum Edit Distance, Viterbi, Baum-Welch  Classifiers  Machine learning based classifiers that are trained to make decisions based on features extracted from the local context