Center for Computational Learning Systems


Center for Computational Learning Systems
Independent research center within the Engineering School
NLP people at CCLS: Mona Diab, Nizar Habash, Martin Jansche, Rebecca Passonneau, Owen Rambow
What we do:
  o Researchers
  o Work with Kathy and Julia
  o Our own projects
  o Sometimes teach
  o Supervise students (PhD, Masters, independent studies)
Some of us are in CEPSR, some in the Interchurch Building
Some NLP Group meetings will take place in the Interchurch Center

CLiMB 2: Computational Linguistics for Metadata Building, phase 2
Becky Passonneau (with University of Maryland)
Interactive workbench for image cataloguers/indexers: use NLP to extract descriptive terms from scholarly text
Mellon Foundation

Automated Readers Advisor, Heiskell Talking Books and Braille Library (NYPL)
Becky Passonneau
Replace some of the librarians' tasks in the current over-the-phone borrowing system with an automated dialogue system
Use the Wizard-of-Oz paradigm for data collection
Joint project with CCNY (Esther Levin)
ozVariant.ppt

Tracking Emergent Narrative Skills (TENS)
Becky Passonneau
Current data set: ten-year-olds retelling silent movies
Develop quantitative methods to compare semantic and pragmatic content (e.g., adapt the Pyramid Method for evaluating summary content)
Joint project with University of Connecticut (Elena Levy)
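
The slide does not say how the Pyramid Method is adapted for narrative retellings. For orientation only, here is a minimal sketch of the basic pyramid score as it is usually described for summary evaluation: each content unit is weighted by the number of reference texts that express it, and a new text is scored against the best total achievable with the same number of units. It assumes the content units have already been annotated by hand; the function name and the example labels are hypothetical.

```python
from collections import Counter


def pyramid_score(reference_units, peer_units):
    """Score one text against a pyramid built from reference texts.

    reference_units: list of sets, one set of content-unit labels per reference.
    peer_units: set of content-unit labels found in the text being scored.
    """
    # Weight of a unit = number of references that express it.
    weights = Counter()
    for units in reference_units:
        weights.update(set(units))

    peer = set(peer_units)
    # Units absent from the pyramid contribute weight 0.
    observed = sum(weights[unit] for unit in peer)

    # Ideal total: the highest-weighted units, as many as the peer expresses.
    ideal = sum(sorted(weights.values(), reverse=True)[:len(peer)])
    return observed / ideal if ideal else 0.0


# Hypothetical annotation of three reference retellings.
references = [{"boy", "dog", "chase"}, {"boy", "dog"}, {"boy", "run"}]
score = pyramid_score(references, {"boy", "run"})
```

In this toy data, "boy" has weight 3, "dog" 2, and "chase" and "run" 1 each, so a retelling that mentions "boy" and "run" collects 4 of a possible 5.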

Arabic NLP
CADIM Group: Mona Diab, Nizar Habash, Owen Rambow
Focus on Standard Arabic AND the dialects
NLP tools for Arabic:
  o Morphological analysis (exists)
  o Morphological tagging (exists, best-performing)
      - Tokenization
      - POS tagging (best-performing)
      - Diacritization (best-performing)
  o Word-sense disambiguation (in progress)
  o Sentence-boundary detection for ASR (in progress)
  o Parsing (initial research)
  o Named-entity recognition (joint with Fair Isaac, in progress)
  o …

Machine Translation
Nizar Habash
Focus: Arabic-English MT
Different hybrid MT approaches explored:
  o Linguistic preprocessing for Statistical MT
      - Morphological and syntactic preprocessing
  o Adding statistical resources to rule-based MT systems
      - Automatically extracted phrase tables combined with Generation-Heavy MT
Columbia's first participation in NIST MTEval (2006)
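
To make "morphological preprocessing" concrete, the sketch below shows one way a preprocessing step could split clitics off an Arabic word before it is passed to a statistical MT system. It is an illustration only, not the group's tool: it assumes the word has already been analyzed and disambiguated into morphemes, and the function name, the morpheme kinds ("conj", "part", "stem", "pron"), and the "+" marking convention are hypothetical. The example word is in Buckwalter transliteration.

```python
def split_clitics(morphemes, split_kinds):
    """Turn one analyzed word into tokens, splitting off selected clitics.

    morphemes: list of (form, kind) pairs in surface order.
    split_kinds: set of clitic kinds that become separate tokens.
    Proclitics are marked "form+", enclitics "+form"; anything not
    selected for splitting stays attached to its neighbor.
    """
    tokens = []
    buffer = ""
    seen_stem = False
    for form, kind in morphemes:
        if kind == "stem":
            buffer += form
            seen_stem = True
        elif kind in split_kinds:
            if buffer:
                # Close the material collected so far before the split clitic.
                tokens.append(buffer if seen_stem else buffer + "+")
                buffer = ""
            tokens.append("+" + form if seen_stem else form + "+")
        else:
            buffer += form
    if buffer:
        tokens.append(buffer)
    return tokens


# wsyktbwnhA, "and they will write it": w+ s+ yktbwn +hA
word = [("w", "conj"), ("s", "part"), ("yktbwn", "stem"), ("hA", "pron")]
light = split_clitics(word, {"conj"})
full = split_clitics(word, {"conj", "part", "pron"})
```

Here `light` is `["w+", "syktbwnhA"]` and `full` is `["w+", "s+", "yktbwn", "+hA"]`. Choosing which clitic kinds to split changes how finely the Arabic side is segmented relative to the English side.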

Word Sense Modeling and Disambiguation
Mona Diab
Using corpora (including multilingual parallel and similar) for unsupervised learning
Arabic WordNet
Arabic PropBank

Multilingual Metagrammars
Owen Rambow (with University of Pennsylvania)
Goal: high-level abstract representation of the syntax of (many/all) natural languages, from which we can automatically generate grammars that can be used for NLP
Have: Universal Grammar component and language-specific modules for Korean, German, Yiddish
Next: Icelandic, Mainland Scandinavian, English, Kashmiri, …