Computational Intelligence 696i Language Lecture 6 Sandiway Fong.

Slides:



Advertisements
Similar presentations
Introduction to Computational Linguistics
Advertisements

Jing-Shin Chang National Chi Nan University, IJCNLP-2013, Nagoya 2013/10/15 ACLCLP – Activities ( ) & Text Corpora.
INTRODUCTION TO ARTIFICIAL INTELLIGENCE Massimo Poesio LECTURE 6 The Cognitive Perspective: Generative Lexicon.
Section 4: Language and Intelligence Overview Instructor: Sandiway Fong Department of Linguistics Department of Computer Science.
Computational Intelligence 696i Language Lecture 7 Sandiway Fong.
LING 581: Advanced Computational Linguistics Lecture Notes April 27th.
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
The Unreasonable Effectiveness of Data Alon Halevy, Peter Norvig, and Fernando Pereira Kristine Monteith May 1, 2009 CS 652.
Search Engines and Information Retrieval
C SC 620 Advanced Topics in Natural Language Processing Sandiway Fong.
Computational Intelligence 696i Language Lecture 5 Sandiway Fong.
Prof. John Nestor ECE Department Lafayette College Easton, Pennsylvania ECE Senior Design I Lecture 14 - Reading Technical.
LING 388 Language and Computers Take-Home Final Examination 12/9/03 Sandiway FONG.
LING 388: Language and Computers Sandiway Fong Lecture 23: 11/15.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
LING 438/538 Computational Linguistics Sandiway Fong Lecture 25: 11/29.
Russell Taylor Lecturer in Computing & Business Studies.
Methodology Conceptual Database Design
Lecture Two Database Environment Based on Chapter Two of this book:
WEB DESIGNING Prof. Jesse A. Role Ph. D TM UEAB 2010.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
What To Tell Students April What To Tell Your Students about the GRE ® revised General Test TM Play video ETS — Listening. Learning. Leading. ®
Microsoft Visual Basic 2012 CHAPTER ONE Introduction to Visual Basic 2012 Programming.
Microsoft Visual Basic 2005 CHAPTER 1 Introduction to Visual Basic 2005 Programming.
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
1 Integrated Development Environment Building Your First Project (A Step-By-Step Approach)
Database Design - Lecture 1
Search Engines and Information Retrieval Chapter 1.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Prepare Yourself for IR Research ChengXiang Zhai Department of Computer.
CS 7: Introduction to Computer Programming Algorithms.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
LING 388: Language and Computers Sandiway Fong Lecture 24: 11/17.
LING 438/538 Computational Linguistics Sandiway Fong Lecture 29.
Natural Language Processing Introduction. 2 Natural Language Processing We’re going to study what goes into getting computers to perform useful and interesting.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Related terms search based on WordNet / Wiktionary and its application in ontology matching RCDL'2009 St. Petersburg Institute for Informatics and Automation.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Artificial Intelligence Programming in Prolog Lecture 1: An Introduction 23/09/04.
Fundamental Programming: Fundamental Programming K.Chinnasarn, Ph.D.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
SSO: THE SYNDROMIC SURVEILLANCE ONTOLOGY Okhmatovskaia A, Chapman WW, Collier N, Espino J, Conway M, Buckeridge DL Ontology Description The SSO was developed.
NLP ? Natural Language is one of fundamental aspects of human behaviors. One of the final aim of human-computer communication. Provide easy interaction.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
CSA2050 Introduction to Computational Linguistics Lecture 1 What is Computational Linguistics?
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
1 Class exercise II: Use Case Implementation Deborah McGuinness and Peter Fox CSCI Week 8, October 20, 2008.
CPSC 422, Lecture 27Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 27 Nov, 16, 2015.
DESIGNING AN ARTICLE Effective Writing 3. Objectives Raising awareness of the format, requirements and features of scientific articles Sharing information.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Microsoft Visual Basic 2015 CHAPTER ONE Introduction to Visual Basic 2015 Programming.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Reading literacy. Definition of reading literacy: “Reading literacy is understanding, using and reflecting on written texts, in order to achieve one’s.
LING 581: Advanced Computational Linguistics
Automatic Writing Evaluation
Lexicons, Concept Networks, and Ontologies
Introduction to Visual Basic 2008 Programming
Exploring and Navigating: Tools for GermaNet
Chapter 2 Database Environment.
HCC class lecture 13 comments
WordNet WordNet, WSD.
Information Retrieval
Presentation transcript:

Computational Intelligence 696i Language Lecture 6 Sandiway Fong

Administriva Reminder: –Homework 1 due today (midnight) –Homework 2 discussed today due in one week (next Tuesday midnight) submit to

Administriva – Graphical User Interface Versions (GUI): –As a Mac OS X application (wnconnect) –As a Windows application –As a Linux application Text-based User Interface Versions: –run under a (free) Prolog interpreter –As Prolog software compiled for Windows (SWI-Prolog) –As platform-independent Prolog software (prologwn) without a GUI

Last Time semantic networks based around language –WordNet Princeton University) handbuilt (ad hoc) network of synonym sets (synsets) connected by semantic relations e.g. isa, part of, antonymy, causation etc. large-scale (and free) lexical resource –139,000 entries (word senses) v1.7 –10,000 verbs (polysemy 2) –20,000 adjectives (1.5)

Last Time semantic networks based around language –WordNet Princeton University) handbuilt (ad hoc) network of synonym sets (synsets) connected by semantic relations e.g. isa, part of, antonymy, causation etc. Example (Semantic Opposition): –an instance of the frame problem –John mended the torn/red dress –mend: x CAUS y BECOME –John CAUS the torn/red dress BECOME –antonym relation between adjective and the end state

Semantic Opposition Event-based Models of Change and Persistence in Language (Pustejovsky, 2000): –John mended the torn dress –John mended the red dress what kind of knowledge is invoked here? –can exploit the network –or –GL Model Generative Lexicon (Pustejovsky 1995)w

Two Problems –linguistically relevant puzzles –outside syntax 1.Semantic Opposition 2.Logical Metonomy

Logical Metonomy –also can be thought of an example of gap filling –with eventive verbs: begin and enjoy –Pustejovsky (1995), Lascarides & Copestake (1995) and Verspoor (1997) Examples: –John began the novel(reading/writing) –John began [reading/writing] the novel –X began Y ⇒ X began V-ing Y –The author began the unfinished novel back in 1962(writing) –The author began [writing] the unfinished novel back in 1962

Logical Metonomy –also can be thought of an example of gap filling... –Pustejovsky (1995), Lascarides & Copestake (1995) and Verspoor (1997) –eventive verbs: begin and enjoy Examples: –John began the novel(reading/writing) –The author began the unfinished novel back in 1962(writing) One idea about the organization of the lexicon (GL): –novel: qualia structure: telic role: read(purpose/function) agentive role: writing(creation) constitutive role: narrative (parts) formal role: book, disk(physical properties)

Logical Metonomy Examples: –John began the novel(reading/writing) –The author began the unfinished novel back in 1962(writing) More Examples: (enjoy) –Mary enjoyed [reading] the novel(reading) –!!The visitor enjoyed [verb] the door(?telic role) –Mary enjoyed [seeing] the garden(seeing...) author ⇔ read author ⇔ write

Logical Metonomy Examples: –John began the novel(reading/writing) –The author began the unfinished novel back in 1962(writing) More Examples: (enjoy) –Mary enjoyed [reading] the novel(reading) –!!The visitor enjoyed [verb] the door(?telic role) –Mary enjoyed [seeing] the garden(seeing...)

Logical Metonomy Multiple telic roles: –Mary enjoyed [seeing] the garden(seeing) –Mary enjoyed inspecting the garden –Mary enjoyed visiting the garden –Mary enjoyed strolling through the garden –Mary enjoyed rollerblading in the garden –Mary enjoyed sitting in the garden –Mary enjoyed dozing in the garden

Logical Metonomy easily defeasible: –He really enjoyed your book(reading) –He really enjoyed [reading] your book – –My goat eats anything. –He really enjoyed [verb] your book(reading) –(eating)

Logical Metonomy easily defeasible: –My dog eats everything. –!He really enjoyed [verb] your shoe(eating) very different in character from the other gap-filling examples we’ve seen: –not defeasible –John is too stubborn [someone] to talk to [John] –John is too stubborn [John] to talk to Bill

WordNet and Telic Role Computation Example: –John enjoyed [verb] the cigarette(smoking)

WordNet and Telic Role Computation Example: –!John enjoyed [verb] the dirt(?telic role)

WordNet and Telic Role Computation Example: –!John enjoyed [verb] the wine(drinking)

WordNet and Telic Role Computation Example: –!John enjoyed [verb] the door(?telic role)

WordNet Applications –WordNet Applications. Morato et al. In Proceedings of the GWC 2004, pp. 270–278. Improvements to WordNet Machine Translation Information Retrieval Conceptual Identification/Disambiguation Document Classification Query Expansion Image Retrieval (Other)

WordNet Applications Examples: –Information retrieval and extraction query term expansion (synonyms etc.) cross-linguistic information retrieval (multilingual WordNet) –Concept identification in natural language word sense disambiguation WordNet senses and ontology (isa-hierarchy) –Semantic distance computation relatedness of words –Document structuring and categorization determine genre of a paper (WordNet verb categories)

Homework 2

GRE Educational Testing Service (ETS) – –13 million standardized tests/year –Graduate Record Examination (GRE) verbal section of the GRE –vocabulary GRE vocabulary –word list Word list retention –word matching exercise (from a GRE prep book) –homework 2

Task: Match each word in the first column with its definition in the second column accolade abate aberrant abscond acumen abscission acerbic accretion abjure abrogate deviating abolish keen insight lessen in intensity sour or bitter building up depart secretly renounce removal praise

Homework 2 (for 10 pts) use WordNet to “solve” the word match puzzle come up with an algorithm or procedure that produces a good assignment for words in the left column to those on the right –minimum threshold for acceptable algorithms: 9/10 right describe your algorithm and show in detail how it works on the given example (you could but you don’t have to turn in a program)

Task: Match each word in the first column with its definition in the second column accolade abate aberrant abscond acumen abscission acerbic accretion abjure abrogate deviation abolish keen insight lessen in intensity sour or bitter building up depart secretly renounce removal praise

Task: Match each word in the first column with its definition in the second column accolade abate aberrant abscond acumen abscission acerbic accretion abjure abrogate deviation abolish keen insight lessen in intensity sour or bitter building up depart secretly renounce removal praise

Discussion

Language and Intelligence if a computer program can be written to do as well as humans on the GRE test, is the program intelligent? Can such a program be written? –Math part: no problem –Verbal part: tougher, but parts can be done right now... homework 2 analogies antonyms two essay sections –(Issue-Perspective, Argument-Analysis)

a look at the future of educational testing... e-rater Interesting things to Google

Interesting things to Google e-rater FAQs Q. What's the technology used in e-rater? –e-rater uses NLP to identify the features of the faculty-scored essays in its sample collection and store them-with their associated weights-in a database. –When e-rater evaluates a new essay, it compares its features to those in the database in order to assign a score. – Because e-rater is not doing any actual reading, the validity of its scoring depends on the scoring of the sample essays from which e-rater 's database is created.

Interesting things to Google e-rater FAQs Q. How often does the computer's score agree with the score of a faculty reader? –Almost all the time. –ETS researchers found exact agreement, or a difference of only one point, in as many as 98 percent of the comparisons between the computer's scores and those of a trained essay- reader using the same scoring guides and scoring system.

Interesting things to Google e-rater FAQs Q. How do students feel about being scored by a machine? –Most of today's students have had experience with instant feedback in computer programs and are becoming more comfortable with the idea of computerized scoring.

Interesting things to Google – CriterionSM: Online essay evaluation: An application for automated evaluation of student essays. –Burstein, J., Chodorow, M., & Leacock, C. (2003) –In Proceedings of the Fifteenth Annual Conference on Innovative Applications of Artificial Intelligence, Acapulco, Mexico. –(This paper received an AAAI Deployed Application Award.) e-rater: –trained on 270 essays scored by human readers –evaluates syntactic variety, discourse, topical content, lexical complexity –50 features critique: –grammar checker (agreement, verb formation, punctuation, typographical errors) –bigram model of English

Interesting things to Google –... recent news on automated bogus paper generation – SCIgen - An Automatic CS Paper Generator –SCIgen is a program that generates random Computer Science research papers, including graphs, figures, and citations. It uses a hand-written context-free grammar to form all elements of the papers. Achievements: –one out of two papers got accepted at the World Multiconference on Systemics, Cybernetics and Informatics (WMSCI) –Rooter: A Methodology for the Typical Unification of Access Points and Redundancy “We implemented our scatter/ gather I/O server in Simula- 67, augmented with opportunistically pipelined extensions. ”