CS4705 Natural Language Processing Fall 2008. What will we study in this course? How can machines recognize and generate text and speech? – Human language.

Slides:



Advertisements
Similar presentations
European Masters Program in Language and Communication Technologies Free University.
Advertisements

CS 4705 Natural Language Processing Julia Hirschberg COMS 4705 Fall 2010.
Team Software Project - Ebnenasir - Spring CS 3141: Team Software Project - Introduction Ali Ebnenasir Department of Computer Science Michigan Technological.
Introduction to Computer Programming I CSE 113
Drawing Trees & Ambiguity in Trees. Some Phrase Structure Rules of English S’ -> (Comp) S S’ -> (Comp) S S -> {NP/S’} (T) VP S -> {NP/S’} (T) VP VP 
Course Info Course Topics and approximate Schedule Assignments and Grade Breakdown The usual Stuff including "How to fail this course" Students introduce.
Grammar of English: Introduction Eng August 2014.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
IT 240 Intro to Desktop Databases Introduction. About this course Design a database: Entity Relation (ER) modeling and normalization techniques Create.
CS 4705 Lecture 1 CS4705 Introduction to Natural Language Processing.
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
CS4705 Natural Language Processing Fall What will we study in this course? How can machines recognize and generate text and speech? – Human language.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Adrian Ilie COMP 14 Introduction to Programming Adrian Ilie Summer Session II, 2005 MTWRF 9:45-11:15 am.
1 Natural Language Processing for the Web Prof. Kathleen McKeown 722 CEPSR, Office Hours: Wed, 1-2; Mon 3-4 TA: Fadi Biadsy 702 CEPSR,
UMass Lowell Computer Science Foundations of Computer Science Prof. Karen Daniels Fall, 2009 Lecture 1 Introduction/Overview Th. 9/3/2009.
CS 4705 Natural Language Processing What is Natural Language Processing? The study of human languages and how they can be represented computationally.
COMS W1004 Introduction to Computer Science May 28, 2008.
Statistical Methods in Computer Science Course Introduction Ido Dagan.
CSC 171 – FALL 2004 COMPUTER PROGRAMMING LECTURE 0 ADMINISTRATION.
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
Introduction to Artificial Intelligence Prof. Kathleen McKeown 722 CEPSR, TAs: Kapil Thadani 724 CEPSR, Phong Pham TA Room.
COMP 110 Introduction to Programming Mr. Joshua Stough August 22, 2007 Monday/Wednesday/Friday 3:00-4:15 Gardner Hall 307.
Course Introduction Computer Science Department CS141:Computer Programming 1 Fall 2014 Dr. Hamid Al-Hamadi.
COMS W1004 Introduction to Computer Science May 27, 2009.
CS4705 Natural Language Processing Fall  How can machines recognize and generate text and speech? ◦ Human language phenomena ◦ Theories, often.
Natural Language Processing Prof: Jason Eisner Webpage: syllabus, announcements, slides, homeworks.
Strategic Management BPS Fall 2015
COP4020/CGS5426 Programming languages Syllabus. Instructor Xin Yuan Office: 168 LOV Office hours: T, H 10:00am – 11:30am Class website:
Computer Science 102 Data Structures and Algorithms V Fall 2009 Lecture 1: administrative details Professor: Evan Korth New York University 1.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
CSCI 1301 Principles of Computer Science I
Introduction to Natural Language Processing Heshaam Faili University of Tehran.
COURSE ADDITION CATALOG DESCRIPTION To include credit hours, type of course, term(s) offered, prerequisites and/or restrictions. (75 words maximum.) 4/1/091Course.
Lecture 1 Page 1 CS 111 Summer 2015 Introduction CS 111 Operating System Principles.
CST 229 Introduction to Grammars Dr. Sherry Yang Room 213 (503)
CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.
Presentation by Dianne Smith, MJE. Something went wrong In jet crash, expert says.
CS 390 Introduction to Theoretical Computer Science.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
Language Development: The Course Jan. 6, The Course Designed to give students a comprehensive understanding of language development, primarily in.
CS 4705 Natural Language Processing Fall 2010 What is Natural Language Processing? Designing software to recognize, analyze and generate text and speech.
CHAPTER 13 NATURAL LANGUAGE PROCESSING. Machine Translation.
Computer Science 102 Data Structures and Algorithms CSCI-UA.0102 Fall 2012 Lecture 1: administrative details Professor: Evan Korth New York University.
Discrete Mathematics CS204 Spring CS204 Discrete Mathematics Instructor: Professor Chin-Wan Chung (Office: Rm 3406, Tel:3537) 1.Lecture 1)Time:
Trustworthy Semantic Web Dr. Bhavani Thuraisingham The University of Texas at Dallas Review for the Final Exam December 8, 2008.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
Introduction to computers and programming Instructor: Alex Iskold.
Two Views of Computing Language / Functions Machine / Storage CSCI 312 CSCI 313.
Introduction to ECE 2401 Data Structure Fall 2005 Chapter 0 Chen, Chang-Sheng
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Computational Structures Tim Sheard & James Hook Portland State University Class Preliminaries.
Introduction to Linguistics Ms. Suha Jawabreh Lecture # 1.
Introduction English Language. Course and unit outline: Looking closely at unit 1: AOS 1 Folio task Text book.
Statistics Fall Introduction2 Wed, Aug 22, 2007 Introduction Dr. Robb T. Koether Office: Bagby 114 Office phone: Home phone: (before.
Survey of Linguistic Method and Theory Ling 400. Instructor Instructor: Prof. Toshiyuki OgiharaInstructor: Prof. Toshiyuki Ogihara.
Drawing Trees & Ambiguity in Trees
1 An Introduction to Computational Linguistics Mohammad Bahrani.
CS 4705 Natural Language Processing Who am I? Julia Hirschberg –Computational Linguist in CS –Focus: Spoken Language Processing –Lab: The Speech Lab,
COP4020 INTRODUCTION FALL COURSE DESCRIPTION Programming Languages introduces the fundamentals of the design and implementation of programming languages.
Computer Networks CNT5106C
Introduction: What is AI? CMSC Introduction to Artificial Intelligence January 3, 2002.
Introduction: What is AI? CMSC Introduction to Artificial Intelligence January 7, 2003.
Introduction to Artificial Intelligence Prof. Kathleen McKeown 722 CEPSR Tas: Andrew Rosenberg Speech Lab, 7 th Floor CEPSR Sowmya Vishwanath TA Room.
Computer Network Fundamentals CNT4007C
CS101 Computer Programming I
Computer Networks CNT5106C
Computer Science 102 Data Structures CSCI-UA
Why Study Spoken Language?
Why Study Spoken Language?
CMPUT101: Purpose of the Course
Presentation transcript:

CS4705 Natural Language Processing Fall 2008

What will we study in this course? How can machines recognize and generate text and speech? – Human language phenomena – Theories, often drawn from linguistics, psychology – Algorithms – Applications

Newspaper Titles – "Something Went Wrong In Jet Crash, Expert Says" – "Police Begin Campaign To Run Down Jaywalkers" – "Drunk Gets Nine Months In Violin Case" – "Farmer Bill Dies In House" – "Iraqi Head Seeks Arms" – "Enraged Cow Injures Farmer With Ax" – "Stud Tires Out" – "Eye Drops Off Shelf" – "Teacher Strikes Idle Kids" – "Squad Helps Dog Bite Victim"

Knowledge Needed Morphology: word formation Syntax: word order Semantics: word meaning and word composition Pragmatics: influence of context/situation Goal: Discover what the speaker meant

Morphology “Stud tires out” – “Tires”: a noun or a verb? Internet search: union activities in New York – Union/unions; activities/activity – Active? Action? Actor? – New vs. New York

Syntax Word Order – John hit Bill – Bill was hit by John – Bill hit John – Bill, John hit – Who John hit was Bill Constituent Structure – "Teacher Strikes Idle Kids“ – “Enraged Cow Injures Farmer With Ax”

Syntax Word Order – John hit Bill – Bill was hit by John – Bill, John hit – Who John hit was Bill Constituent Structure – “[Teacher Strikes] [Idle] [Kids]“ – “Enraged Cow Injures Farmer With Ax”

Syntax Word Order – John hit Bill – Bill was hit by John – Bill, John hit – Who John hit was Bill Constituent Structure – “[Teacher] [Strikes] [Idle Kids]“ – “Enraged Cow Injures Farmer With Ax”

Syntax Word Order – John hit Bill – Bill was hit by John – Bill, John hit – Who John hit was Bill Constituent Structure – "Teacher Strikes Idle Kids“ – “[Enraged Cow] [Injures] [Farmer With Ax]”

Syntax Word Order – John hit Bill – Bill was hit by John – Bill, John hit – Who John hit was Bill Constituent Structure – "Teacher Strikes Idle Kids“ – “[Enraged Cow] [Injures] [Farmer] [With Ax]”

Semantics Word meaning – John picked up a bad cold. – John picked up a large rock. – John picked up Radio Netherlands on his radio. Composition of meaning – Squad helps dog bite victim – Enraged cow injures farmer with ax

Pragmatics – The influence of context “Going Home'' - A play in one act Scene 1: Pennsylvania Station, NY Bonnie: Long Beach? Passerby: Downstairs, LIRR Station.

Scene 2: Ticket Counter, LIRR Station Bonnie: Long Beach? Clerk: $4.50.

Scene 3: Information Booth, LIRR Station Bonnie: Long Beach? Clerk: 4:19, Track 17.

Scene 4: On the train, vicinity of Forest Hills Bonnie: Long Beach? Conductor: Change at Jamaica.

Scene 5: On the next train, vicinity of Lynbrook Bonnie: Long Beach? Conductor: Right after Island Park.

Algorithms Rule-based/Symbolic – Parsers – Finite state automata Probabilistic – Learned from observation – Predicting best guess – Statistical

Current Real World Applications Searching very large text and speech corpora: e.g. the Web Question answering over the web Translating between one language and another: e.g. Arabic and English Summarizing very large amounts of text: e.g. your , the news Dialogue systems: e.g. Amtrak’s ‘Julie’Julie

Instructor Kathy McKeown Office: 722 CEPSR Head NLP GroupNLP Group 25 years at Columbia, Department Chair for 6 Research – Summarization – Question Answering – Language Generation – Multimedia Explanation

Logistics Instructor: Kathy McKeown – – Office and hours: CEPSR 722, Tues 4-5, Wed 4-5 Teaching Assistant: Madhav Krishna – – Office and hours: NLP Lab, 7LW, M 4:30-5:30, Thurs 4-5 Syllabus available at

Text: Daniel Jurafsky and James H. Martin, Speech and Language Processing, 2 nd edition, Prentice-Hall, 2000 (available at CU Bookstore) Speech and Language Processing Assignments: – 4 homework assignments – Midterm and final exams – Four ‘free’ late days for homework assignments – After that, 10% off per day late – You must get a CS account Evaluation: 50% homework + 40% exams+ 10% class participation

Academic Integrity Copying or paraphrasing someone's work (code included), or permitting your own work to be copied or paraphrased, even if only in part, is forbidden, and will result in an automatic grade of 0 for the entire assignment or exam in which the copying or paraphrasing was done. Your grade should reflect your own work. If you are going to have trouble completing an assignment, talk to the instructor or TA in advance of the due date please. Everyone: Read/write protect your homework files at all times.

For Next Class Look at syllabus Read Chapters 1-2 of J&M Questions?