1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.

Slides:



Advertisements
Similar presentations
Introduction to Computational Linguistics
Advertisements

Corpus Processing and NLP
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Shallow Processing: Summary Shallow Processing Techniques for NLP Ling570 December 7, 2011.
CS4705 Natural Language Processing.  Regular Expressions  Finite State Automata ◦ Determinism v. non-determinism ◦ (Weighted) Finite State Transducers.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
CS4705 Natural Language Processing Fall What will we study in this course? How can machines recognize and generate text and speech? – Human language.
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
Machine Learning in Natural Language Processing Noriko Tomuro November 16, 2006.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Research methods in corpus linguistics Xiaofei Lu.
Natural Language Processing Ellen Back, LIS489, Spring 2015.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Lecture 2, 7/22/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 2 22 July 2005.
ELN – Natural Language Processing Giuseppe Attardi
Introduction to NLP.
9/8/20151 Natural Language Processing Lecture Notes 1.
Introduction to Natural Language Processing Heshaam Faili University of Tehran.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Computational Linguistics Yoad Winter *General overview *Examples: Transducers; Stanford Parser; Google Translate; Word-Sense Disambiguation * Finite State.
COMP 791A: Statistical Language Processing
NLP.
Linguistics & AI1 Linguistics and Artificial Intelligence Linguistics and Artificial Intelligence Frank Van Eynde Center for Computational Linguistics.
CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
Natural Language Processing Guangyan Song. What is NLP  Natural Language processing (NLP) is a field of computer science and linguistics concerned with.
1 Statistical NLP: Lecture 9 Word Sense Disambiguation.
Introduction to CL & NLP CMSC April 1, 2003.
인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.
Introduction to Human Language Technologies Tomaž Erjavec Karl-Franzens-Universität Graz Tomaž Erjavec Lecture 1: Overview
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
For Wednesday No reading Homework –Chapter 23, exercise 15 –Process: 1.Create 5 sentences 2.Select a language 3.Translate each sentence into that language.
ICS 482: Natural language Processing Pre-introduction
30 March – 8 April 2005 Dipartimento di Informatica, Universita di Pisa ML for NLP With Special Focus on Tagging and Parsing Kiril Ribarov.
Lecture 1 Lec. Maha Alwasidi. Branches of Linguistics There are two main branches: Theoretical linguistics and applied linguistics Theoretical linguistics.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
CSE467/567 Computational Linguistics Carl Alphonce Computer Science & Engineering University at Buffalo.
Natural Language Processing Chapter 1 : Introduction.
CS460/IT632 Natural Language Processing/Language Technology for the Web Lecture 1 (03/01/06) Prof. Pushpak Bhattacharyya IIT Bombay Introduction to Natural.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
CS 4705 Natural Language Processing Who am I? Julia Hirschberg –Computational Linguist in CS –Focus: Spoken Language Processing –Lab: The Speech Lab,
SIMS 296a-4 Text Data Mining Marti Hearst UC Berkeley SIMS.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
January 10, Csci 2111: Data and File Structures Instructor: Nathalie Japkowicz Objectives of the Course and Preliminaries.
By Kyle McCardle.  Issues with Natural Language  Basic Components  Syntax  The Earley Parser  Transition Network Parsers  Augmented Transition Networks.
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
LingWear Language Technology for the Information Warrior Alex Waibel, Lori Levin Alon Lavie, Robert Frederking Carnegie Mellon University.
Natural Language Processing [05 hours/week, 09 Credits] [Theory]
CSI 3120 Concepts of Programming Languages
Natural Language Processing (NLP)
Statistical NLP: Lecture 9
CS4705 Natural Language Processing
Natural Language Processing (NLP)
PRESENTATION: GROUP # 5 Roll No: 14,17,25,36,37 TOPIC: STATISTICAL PARSING AND HIDDEN MARKOV MODEL.
Artificial Intelligence 2004 Speech & Natural Language Processing
Information Retrieval
Natural Language Processing (NLP)
Presentation transcript:

1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of the Course and Preliminaries

2 Why study Natural Language Processing (NLP)? 4 NLP is a very important current area of investigation as it is necessary to many useful applications. 4 These applications include: information retrieval, extraction, and filtering; intelligent Web searching; spelling and grammar checking; automatic text summarization; pseudo- understanding and generation of natural language; and multi-lingual systems including machine translation.

3 Why study NLP Statistically? 4 Up until the late 1980’s, NLP was mainly investigated using a rule-based approach. 4 However, rules appear too strict to characterize people’s use of language. 4 This is because people tend to stretch and bend rules in order to meet their communicative needs. 4 Methods for making the modeling of language more accurate are needed and statistical methods appear to provide the necessary flexibility.

4 Subdivisions of NLP 4 Parts of Speech and Morphology (words, their syntactic function in sentences, and the various forms they can take). 4 Phrase Structure and Syntax (regularities and constraints of word order and phrase structure). 4 Semantics (the study of the meaning of words (lexical semantics) and of how word meanings are combined into the meaning of sentences, etc.) 4 Pragmatics (the study of how knowledge about the world and language conventions interact with. literal meaning).

5 Topics Covered in this course 4 Studying Words: –Collocations –Statistical inference –Word sense disambiguation –Lexical Acquisition 4 Studying Grammars: –Markov Models –Part-of-Speech Tagging –Probabilistic Context Free Grammars 4 Application: Statistical Alignment and Machine Translation

6 Tools and Resources Used 4 Probability/Statistical Theory: Statistical Distributions, Bayesian Decision Theory. 4 Linguistics Knowledge: Morphology, Syntax, Semantics and Pragmatics. 4 Corpora: Bodies of marked or unmarked text to which statistical methods and current linguistic knowledge can be applied in order to discover novel linguistic theories or interesting and useful knowledge organization.

7 Course Requirements written and programming assignments (40%) 4 An in-class presentation of a current research paper (10%) 4 A Mid-Term (20%) 4 A Final Project (30%)

8 Textbook and other useful information 4 Foundations of Statistical Natural Language Processing, by Chris Manning and Hinrich Schutze, MIT Press. 4 Current literature available from the Web will be used for class presentations. 4 Class’ Website: csi5180_fall_2003.html 4 Check the class website for a companion website for the textbook and other statistical NLP resources.