START: Natural Language Access to Information Boris Katz, Gary Borchardt, Sue Felshin, Jimmy Lin, Jerome McFarland, Ali Ibrahim, Luciano Castagnola, Baris.

Slides:



Advertisements
Similar presentations
DAML Queries/Life Cycle SRI International. Parts of Ontologies (used in the examples to follow) Assumptions Researcher String lastName firstName Publication-ref.
Advertisements

Web Passive Voice Tutor: an Intelligent Computer Assisted Language Learning System over the WWW Maria Virvou & Victoria Tsiriga Department of Informatics,
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Augmented Transition Networks
RDF Tutorial.
Sub-constituents of NP in English September 12, 2007.
UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Question Answering Avishek Gosh ( )‏ Abhishek Agarkar ( )‏ Nikhilesh Sharma ( )‏ Harshada Gune ( )‏
Words Words Words! Helping ELL Students Develop Vocabulary.
1 Words and the Lexicon September 10th 2009 Lecture #3.
Browsers and Servers CGI Processing Model ( Common Gateway Interface ) © Norman White, 2013.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
NLDB’04 – June 23 – 25, Salford, Manchester, United Kingdom 0/9 Semantic Tagging and Chunk- Parsing in Dynamic Modeling G. Fliedl, Ch. Kop, H.C. Mayr,
1 CS 502: Computing Methods for Digital Libraries Lecture 20 Multimedia digital libraries.
Natural Language Query Interface Mostafa Karkache & Bryce Wenninger.
1 Extracting RDF Data from Unstructured Sources Based on an RDF Target Schema Tim Chartrand Research Supported By NSF.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
Specialized Application Software
System Analysis and Design
Computing ESSENTIALS     CHAPTER Ch 9Copyright 2003 The McGraw-Hill Companies, Inc Graphics, Multimedia, and Artificial Intelligence computing.
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
1 Chapter 11 Developing Custom Help. 11 Chapter Objectives Use HTML to create customized Help topics for an application Use the HTML Help Workshop to.
Tutorial 7 Working with Multimedia. XP Objectives Explore various multimedia applications on the Web Learn about sound file formats and properties Embed.
1 The World Wide Web. 2  Web Fundamentals  Pages are defined by the Hypertext Markup Language (HTML) and contain text, graphics, audio, video and software.
Multimedia Enabling Software. The Human Perceptual System Since the multimedia systems are intended to be used by human, it is a pragmatic approach to.
March 1, 2009 Dr. Muhammed Al-Mulhem 1 ICS 482 Natural Language Processing INTRODUCTION Muhammed Al-Mulhem March 1, 2009.
The Internet & The World Wide Web Notes
Adding Common Sense into Artificial Intelligence Common Sense Computing Initiative Software Agents Group MIT Media Lab.
Information Retrieval – and projects we have done. Group Members: Aditya Tiwari ( ) Harshit Mittal ( ) Rohit Kumar Saraf ( ) Vinay.
Web Document Analysis: How can Natural Language Processing Help in Determining Correct Content Flow? Hassan Alam, Fuad Rahman and Yuliya Tarnikova Human.
U:/msu/course/cse/103 Day 25, Slide 1 Back-up PHP Files If you have not yet passed the 3.0 BT, make back-up copies of ALL.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
Methods For Web Page Design 6. Methods Why use one? What it covers –Possibly all stages Feasibility Analysis Design Implementation Testing –Maybe just.
2013Dr. Ali Rodan 1 Handout 1 Fundamentals of the Internet.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Using Common Sense Reasoning to Create Intelligent Mobile Applications Software Agents Group MIT Media Lab.
Tutorial 7 Working with Multimedia. XP Objectives Explore various multimedia applications on the Web Learn about sound file formats and properties Embed.
Experiments on Building Language Resources for Multi-Modal Dialogue Systems Goals identification of a methodology for adapting linguistic resources for.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
A Tripartite Question Answering Architecture for Integrating Diverse Knowledge Resources Boris Katz, Gary Borchardt, Sue Felshin and Jimmy Lin MIT Computer.
Chapter 8 Database Applications and Implications.
8 Parts of Speech. Verb A verb asserts something about the subject of the sentence and express actions, events, or states of being. The puppy plays with.
LAS LINKS DATA ANALYSIS. Objectives 1.Analyze the 4 sub-tests in order to understand which academic skills are being tested. 2.Use sample tests to practice.
Tutorial 7 Working with Multimedia. New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition 2 Objectives Explore various multimedia applications.
Metadata for the GPII Liddy Nevile. DRD metadata.
Database Management Systems.  Database management system (DBMS)  Store large collections of data  Organize the data  Becomes a data storage system.
Artificial Intelligence: Natural Language
Introduction to Dialogue Systems. User Input System Output ?
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Data Mining: Text Mining
MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz
Language Arts The Eight Parts of Speech The Eight Parts of Speech.
Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.
NATURAL LANGUAGE PROCESSING
An Architecture for Adaptive Content Extraction in Wireless Networks Phil West Greg Foster Peter Clayton Submitted to the South African Telecommunications.
Aware Discovering characteristics of habitable question answering systems with iterative formative evaluation Bill Ogden
Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.
WHAT'S THE DIFFERENCE BETWEEN A WEB APPLICATION STREAMING NETWORK AND A CDN? INSTART LOGIC.
AQUAINT Mid-Year Workshop: Observations and Comments Jimmy Lin MIT Artificial Intelligence Laboratory.
Unit 2: Verbs, Adverbs, Prepositions, Conjunctions, and Interjections 7 th English.
Visual Information Retrieval
DATA INTEGRATION FOR LANGUAGE DOCUMENTATION
Representing Structure and Behavior with Trees
CS246: Information Retrieval
Artificial Intelligence 2004 Speech & Natural Language Processing
Presentation transcript:

START: Natural Language Access to Information Boris Katz, Gary Borchardt, Sue Felshin, Jimmy Lin, Jerome McFarland, Ali Ibrahim, Luciano Castagnola, Baris Temelkuran, Aaron Fernandes, Alp Simsek, Jonathan Wolfe, Matthew Bilotti MIT Artificial Intelligence Lab

I had a dream... Library of Congress ?

Reality What we can do: Understand ordinary sentences and questions What we can’t do (yet): 1. Full-text NL understanding still beyond reach Common sense implication Intersentential reference Summarization 2. Not all information is language—most Web resources are not textual Maps and Images Sound and Video Multimedia Web resources are distributed across numerous non-traditional databases

Bridging the Gap Library of Congress + In 1492, Columbus sailed the ocean blue. An object at rest tends to remain at rest. Four score and seven years ago our forefathers brought forth

The Solution: Natural Language Annotations Annotations bridge the gap between our ability to analyze natural language sentences and our desire to access the huge amount of data available in our libraries and on the Web. Annotations are collections of natural language sentences and phrases that describe the content of various information segments. START analyzes these annotations creates the necessary representational structures produces special pointers to the information segments summarized by the annotations

Natural Language Annotations Annotation “Mars’s year is long.” + Questions “How long is the Martian year?” “How long is a year on Mars?” “How many days are in a Martian year?” … START knowledge base Annotator User is year long related-to year Mars... one Mars year lasts 687 Earth days.

noun molecule quantity two det Parsing a noun NP N PP NPprep converts VP S A chain of reactions converts each molecule of glucose into two smaller molecules of pyruvate. each NP N PP of glucose into smaller prepNP N PP molecules of pyruvate N V chain noun reactions of

Ternary expressions (T-expressions) A chain of reactions converts each molecule of glucose into two smaller molecules of pyruvate. into molecules-5> into molecules converts chain molecule related-to reactions glucose related-to pyruvate related-to each quantifier two quantity smaller is

T-expression Representation List of node-link-node triples Nouns, adjectives are nodes Links cover: relationships between verbs and their arguments fundamental semantic relationships: “is-a” (for equality, membership, and subclass relationships), “related-to” (for possessives, etc.) modification of nouns: “quantifier”, “quantity”, “is” (for adjectives) prepositions

S-rules for Structural Variation S-rule for the Property Factoring alternation: emotional- reaction- verb someone 1 someone 2 with something related-to someone 1 someone 1 emotional-reaction-verb someone 2 with something someone 1 ’s something emotional-reaction- verb someone 2 emotional- reaction- verb something 1 someone 2 something 1 related-to someone 1 The president impressed the country with his determination. The president’s determination impressed the country. Emotional reaction verbs: surprisestun amazestartle impressplease embarrassannoy etc.

Sample Assertion A chain of reactions converts each molecule of glucose into two smaller molecules of pyruvate. into molecules converts chain molecule related-to reactions glucose related-to pyruvate related-to each quantifier two quantity smaller is into molecules-5>

Sample Query How are the glucose molecules converted into pyruvate molecules? into molecules converts molecules glucose related-to pyruvate related-to something into molecules-5>

Matching Matcher T-expressions from Query T-expressions from Assertion into molecules converts chain molecule related-to reactions glucose related-to pyruvate related-to each quantifier two quantity smaller is something Key: Input Processing Query Processing

A. Reply by Generating A chain of reactions converts each molecule of glucose into two smaller molecules of pyruvate. Generator Displayed Answer Ternary expressions Query: How are the glucose molecules converted into pyruvate molecules? into molecules converts chain molecule related-to reactions glucose related-to pyruvate related-to each quantifier two quantity smaller is Answer:

Reply by Generating: Example

B. Reply from annotation Find resource Displayed Answer Ternary expressions related-to picture Cog Annotated resource + Query: Show me a picture of Cog.

Reply from annotation: Example

C. Reply from annotation with script directs any-person any-IMDb-movie + Gone with the Wind (1939) was directed by George Cukor, Victor Fleming, and Sam Wood. Source: The Internet Movie Database Script get m/Details? match regexp... IMDb T-exps Run script Displayed Answer Find resource Query: Who directed Gone with the Wind?

Reply from annotation with script: Example

NASA POTUS Webster Uniform Access START NL questions Multimedia responses Omnibase Queries Data Local knowledge base of ternary expressions Core vocabulary Uniform interface to multiple database formats (Web, text, etc.) Integration time independent of size of database Extended lexicon U.S. Census IMDb

How START works Web browser START Parser Matcher English Input T-exps Database of T-exps T-exps from KB Generator HTML English Annotations Scripts Omnibase (external knowledge) Native knowledge Scripts WWW Potus IMDb World Factbook U.S. Census

Q. "I'd like to speak to Trevor." Q. "Is Trevor in his office?" A. "Trevor is in his office but he is on the phone." A. "Trevor is in his office but he is talking to Boris now." A. "Trevor is in his office; however, he doesn't want to be disturbed until 2pm." Multi-Modal Interaction