1/23 Applications of NLP. 2/23 Applications Text-to-speech, speech-to-text Dialogues sytems / conversation machines NL interfaces to –QA systems –IR systems.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

What is Word Study? PD Presentation: Union 61 Revised ELA guide Supplement (and beyond)
Introduction to Computational Linguistics
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING NLP-AI IIIT-Hyderabad CIIL, Mysore ICON DECEMBER, 2003.
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
Natural Language Processing AI - Weeks 19 & 20 Natural Language Processing Lee McCluskey, room 2/07
1/26 Applications of NLP. 2/26 Applications What uses of the computer involve language? What language use is involved? What are the main problems? How.
Basi di dati distribuite Prof. M.T. PAZIENZA a.a
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
1 Information Retrieval and Web Search Introduction.
ITCS 6010 Natural Language Understanding. Natural Language Processing What is it? Studies the problems inherent in the processing and manipulation of.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
1/16 LELA Language and Computers Harold Somers Professor of Language Engineering.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Overview of Search Engines
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Information Retrieval in Practice
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Copyright R. Weber INFO 629 Concepts in Artificial Intelligence Fall 2004 Professor: Dr. Rosina Weber.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Introduction to NLP.
BTANT 129 w5 Introduction to corpus linguistics. BTANT 129 w5 Corpus The old school concept – A collection of texts especially if complete and self-contained:
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Linguistics & AI1 Linguistics and Artificial Intelligence Linguistics and Artificial Intelligence Frank Van Eynde Center for Computational Linguistics.
1 Computational Linguistics Ling 200 Spring 2006.
Defining Text Mining Preprocessing Transforming unstructured data stored in document collections into a more explicitly structured intermediate format.
Natural Language Processing Rogelio Dávila Pérez Profesor – Investigador
Language Technology I © 2005 Hans Uszkoreit Language Technology I 2005/06 Hans Uszkoreit Universität des Saarlandes and German Research Center for Artificial.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
Research Topics CSC Parallel Computing & Compilers CSC 3990.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Introduction to Computational Linguistics
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
CSE573 Autumn /20/98 Planning/Language Administrative –PS3 due 2/23 –Midterms back today –Next topic: Natural Language Processing reading Chapter.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Friday Finish chapter 24 No written homework.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Natural Language Processing Chapter 1 : Introduction.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
NATURAL LANGUAGE PROCESSING Zachary McNellis. Overview  Background  Areas of NLP  How it works?  Future of NLP  References.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Natural Language Processing Group Computer Sc. & Engg. Department JADAVPUR UNIVERSITY KOLKATA – , INDIA. Professor Sivaji Bandyopadhyay
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Basics of Natural Language Processing Introduction to Computational Linguistics.
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
Chapter 2 Data, Text, and Web Mining. Data Mining Concepts and Applications  Data mining (DM) A process that uses statistical, mathematical, artificial.
INTRODUCTION TO APPLIED LINGUISTICS
Introduction to Machine Translation
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Information Retrieval and Web Search
Information Retrieval and Web Search
Information Retrieval and Web Search
Data Warehousing and Data Mining
CSE 635 Multimedia Information Retrieval
Information Retrieval and Web Search
Artificial Intelligence 2004 Speech & Natural Language Processing
Information Retrieval
Presentation transcript:

1/23 Applications of NLP

2/23 Applications Text-to-speech, speech-to-text Dialogues sytems / conversation machines NL interfaces to –QA systems –IR systems Text summarization and text mining Story understanding – inference, paraphrase Machine Translation Better word processing Language teaching Assistive computing

3/23 Speech applications (apart form the speech processing aspects) Text-to-speech –Homograph disambiguation –Prosody determination Speech-to-text –To support phoneme recognition –Homophone disambiguation –Filtering of performance errors

4/23 Dialogue systems Usually speech-driven, but text also appropriate Modern application is automatic transaction processing Limited domain may simplify language aspect Domain model will play a big part

5/23 Dialogue systems Apart from speech issues, NL components include … Topic tracking Anaphora resolution Reply generation

6/23 (also know as) Conversation machines Another old AI goal (cf. Turing test) Also (amazingly) for amusement Mainly speech, but also text based Early famous approaches include ELIZA, which showed what you could do by cheating Modern versions have a lot of NLP, especially discourse modelling, and focus on the language generation component

7/23 QA systems NL interface to knowledge database Handling queries in a natural way Must understand the domain Even if typed, dialogue must be natural Handling of anaphora e.g. When is the next flight to Sydney? And the one after? What about Melbourne then? OK I’ll take the last one.

8/23 IR systems Like QA systems, but the aim is to retrieve information from textual sources that contain the info, rather than from a structured data base Two aspects –Understanding the query (cf Google, Ask Jeeves)Google –Processing text to find the answer Named Entity Recognition

9/23

10/23

11/23

12/23 Named entity recognition Typical textual sources involve names (people, places, corporations), dates, amounts, etc. NER seeks to identify these strings and label them Clues are often linguistic Also involves recognizing synonyms, and processing anaphora

13/23 Automatic summarization Renewed interest since mid 1990s, probably due to growth of WWW Different types of summary –indicative vs. informative –abstract vs. extract –generic vs. query-oriented –background vs. just-the-news –single-document vs. multi-document

14/23 Automatic summarization topic identification stereotypical text structure cue words high-frequency indicator phrases intratext connectivity discourse structure centrality topic fusion concept generalization semantic association summary generation sentence planning to achieve information compaction

15/23 Text mining Discovery by computer of new, previously unknown information, by automatically extracting information from different written resources (typically Internet) Cf data mining (e.g. using consumer purchasing patterns to predict which products to place close together on shelves), but based on textual information Big application area is biosciences

16/23 Text mining preprocessing of document collections (text categorization, term extraction) storage of the intermediate representations techniques to analyze these intermediate representations (distribution analysis, clustering, trend analysis, association rules, etc.) visualization of the results.

17/23 Story understanding An old AI application Involves … –Inference –Ability to paraphrase (to demonstrate understanding) Requires access to real-world knowledge Often coded in “scripts” and “frames”

18/23 Machine Translation Oldest non-numerical application of computers Involves processing of source-language as in other applications, plus … –Choice of target-language words and structures –Generation of appropriate target-language strings Main difficulty is source-language analysis and/or cross-lingual transfer implies varying levels of “understanding”, depending on similarities between the two languages

19/23 Machine Translation First approaches perhaps most intuitive: look up words and then do local rearrangement “Second generation” took linguistic approach: grammars, rule systems, elements of AI Recent (since 1990) trend to use empirical (statistical) approach based on large corpora of parallel text –Use existing translations to “learn” translation models, either a priori (Statistical MT ≈ machine learning) or on the fly (Example-based MT ≈ case-based reasoning) –Convergence of empirical and rationalist (rule-based) approaches: learn models based on treebanks or similar.

20/23 Better word processing Spell checking for homonyms Grammar checking Especially for non-native users –Interference checking Intelligent word processing –Find/replace that knows about morphology, syntax

21/23 Language teaching CALL As in previous slide (grammar checking) but linked to models of –The topic –The learner –The teaching strategy Grammars (etc) can be used to create language-learning exercises and drills

22/23 Assistive computing Interfaces for disabled Many devices involve language issues, e.g. –Text simplification or summarization for users with low literacy (partially sighted, dyslexic, non-native speaker, illiterate, etc.) –Text completion (predictive or retrospective) Works on basis of probabilities or previous examples

23/23 Conclusion Many different applications But also many common elements –Basic tools (lexicons, grammars) –Ambiguity resolution –Need (but impossibility of having) for real-world knowledge Humans are really very good at language –Can understand noisy or incomplete messages –Good at guessing and inferring