Linguistic Spatial Reasoning Jim Keller Electrical and Computer Engineering Department University of Missouri-Columbia I get by with a lot of help from.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California USA
Advertisements

Image Retrieval: Current Techniques, Promising Directions, and Open Issues Yong Rui, Thomas Huang and Shih-Fu Chang Published in the Journal of Visual.
Rulebase Expert System and Uncertainty. Rule-based ES Rules as a knowledge representation technique Type of rules :- relation, recommendation, directive,
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Computational Biology, Part 23 Biological Imaging II Robert F. Murphy Copyright  1996, 1999, All rights reserved.
Designing Multimedia with Fuzzy Logic Enrique Diaz de Leon * Rene V. Mayorga ** Paul D. Guild *** * ITESM, Guadalajara Campus, Mexico ** Faculty of Engineering,
Computational Intelligence Research in ECE Target detection and Recognition Linguistic Scene Description Sketch Understanding What’s Coming? Jim Keller,
Center for Modeling & Simulation.  A Map is the most effective shorthand to show locations of objects with attributes, which can be physical or cultural.
Proposed concepts illustrated well on sets of face images extracted from video: Face texture and surface are smooth, constraining them to a manifold Recognition.
Spatial Reasoning with Guinness References Acknowledgements University of Missouri, Columbia, MO.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
HIGGINS Error handling strategies in a spoken dialogue system Rolf Carlson, Jens Edlund and Gabriel Skantze Error handling research issues The long term.
Geographic Information Systems GIS Output. 1. Color Theory Additive primaries blue, green, and red Subtractive primaries yellow, cyan, and magenta.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Workshop on Earth Observation for Urban Planning and Management, 20 th November 2006, HK 1 Zhilin Li & Kourosh Khoshelham Dept of Land Surveying & Geo-Informatics.
Spatial Reasoning for Semi-Autonomous Vehicles Using Image and Range Data Marjorie Skubic and James Keller Students: Sam Blisard, George Chronis, Grant.
UMC – HCI seminar series 1 Human Computer Interaction Query by Sketch Chi-Ren Shyu Department of Computer Engineering and Computer Science University of.
1Ellen L. Walker Recognizing Objects in Computer Images Ellen L. Walker Mathematical Sciences Dept Hiram College Hiram, OH 44234
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Object Recognition Using Genetic Algorithms CS773C Advanced Machine Intelligence Applications Spring 2008: Object Recognition.
Automated rule Generation Maryam Mustafa Sarah Karim
Fuzzy Medical Image Segmentation
Symbolic Description and Visual Querying of Image Sequences Using Spatio-Temporal Logic Del Bimbo, E. Vicario and D. Zingoni IEEE Transactions on Knowledge.
June 12, 2001 Jeong-Su Han An Autonomous Vehicle for People with Motor Disabilities by G. Bourhis, O.Horn, O.Habert and A. Pruski Paper Review.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Artificial Intelligence Dr. Paul Wagner Department of Computer Science University of Wisconsin – Eau Claire.
Chapter 14: Artificial Intelligence Invitation to Computer Science, C++ Version, Third Edition.
A Simple Method to Extract Fuzzy Rules by Measure of Fuzziness Jieh-Ren Chang Nai-Jian Wang.
Introduction to Genetic Algorithms and Evolutionary Computation
Interactive Discovery and Semantic Labeling of Patterns in Spatial Data Thomas Funkhouser, Adam Finkelstein, David Blei, and Christiane Fellbaum Princeton.
David L. Chen Fast Online Lexicon Learning for Grounded Language Acquisition The 50th Annual Meeting of the Association for Computational Linguistics (ACL)
Chapter 8 The k-Means Algorithm and Genetic Algorithm.
Department of Electrical Engineering, Southern Taiwan University Robotic Interaction Learning Lab 1 The optimization of the application of fuzzy ant colony.
Describing Images using Inferred Visual Dependency Representations Authors : Desmond Elliot & Arjen P. de Vries Presentation of Paper by : Jantre Sanket.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Digital Image Processing CCS331 Relationships of Pixel 1.
Evolving Virtual Creatures & Evolving 3D Morphology and Behavior by Competition Papers by Karl Sims Presented by Sarah Waziruddin.
David L. Chen and Raymond J. Mooney Department of Computer Science The University of Texas at Austin Learning to Interpret Natural Language Navigation.
Template matching and object recognition. CS8690 Computer Vision University of Missouri at Columbia Matching by relations Idea: –find bits, then say object.
Geographic Information Systems GIS Output. 1. Color Theory ► Additive primaries blue, green, and red ► Subtractive primaries yellow, cyan, and magenta.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
Describing Images using Inferred Visual Dependency Representations Authors : Desmond Elliot & Arjen P. de Vries Presentation of Paper by : Jantre Sanket.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
FORSVARETS FORSKNINGSINSTITUTT Transformation of geographical information into linguistic sentences: Two case studies Jan Terje Bjørke.
Synthesizing Natural Textures Michael Ashikhmin University of Utah.
Graphic Organizers. Introduction Definition Effectiveness Resources.
Image Emotional Semantic Query Based On Color Semantic Description Wei-Ning Wang, Ying-Lin Yu Department of Electronic and Information Engineering, South.
Chapter 10. The Explorer System in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans On, Kyoung-Woon Biointelligence Laboratory.
2004 謝俊瑋 NTU, CSIE, CMLab 1 A Rule-Based Video Annotation System Andres Dorado, Janko Calic, and Ebroul Izquierdo, Senior Member, IEEE.
A Patent Document Retrieval System Addressing Both Semantic and Syntactic Properties Liang Chen*,Naoyuki Tokuda+, Hisahiro Adachi+ *University of Northern.
DATA OUTPUT  maps  tables. DATA OUTPUT output from GIS does not have to be a map many GIS are designed with poor map output capabilities types of output:
Spring, 2005 CSE391 – Lecture 1 1 Introduction to Artificial Intelligence Martha Palmer CSE391 Spring, 2005.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Application of the GA-PSO with the Fuzzy controller to the robot soccer Department of Electrical Engineering, Southern Taiwan University, Tainan, R.O.C.
System Model: Physical Module Constraints: 1)A module must have a minimum of two adjacent free sides in order to move. 2) A module must have an adjacent.
Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
Investigate Plan Design Create Evaluate (Test it to objective evaluation at each stage of the design cycle) state – describe - explain the problem some.
Evolutionary Computing Systems Lab (ECSL), University of Nevada, Reno 1 Authors : Siming Liu, Christopher Ballinger, Sushil Louis
Multimedia Content-Based Retrieval
CSc4730/6730 Scientific Visualization
Information Design and Visualization
Geographic Information Systems
Presentation transcript:

Linguistic Spatial Reasoning Jim Keller Electrical and Computer Engineering Department University of Missouri-Columbia I get by with a lot of help from my friends…. Big group of faculty and students at the University of Missouri, the University of Florida, Notre Dame University, and the University of Guelph

Our Work Language Generation – Modeling of spatial relationships – Natural scene understanding Communication: human and machine Text 2 Sketch Human Geography Summarization – Most of the night, Mary had medium restlessness and low motion Common Theme – Generation of “un-natural” natural language – Modeling and matching based on linguistic statements – Users of natural language understanding systems

Outline Linguistic Scene Description (External) Linguistic Scene Description (Ego Centered) Human/Robot Dialog Sketched Route Understanding Text-to-Sketch – Inverse of above

Scene Description “B is to the right of A”? B A Natural scene understanding is an important aspect of computer vision Spatial relations among image objects play a vital role in the description of a scene

Scene Description Human intuition varies considerably – Vague concepts of what spatial relationships should mean – Uncertainty of how they can model differing human perceptions Using fuzzy set-based definitions supports the “Principle of Least Commitment” of David Marr Considerable amount of argument about the "best" method – Much of the debate centered on human intuition – Interesting results in Pattern Recognition applications Need flexible mechanisms that can be tailored – Perhaps to individual human experts

angle 180° -180° Relative position between 2-D objects The histogram of angles

 v A B +180°-180° angle  Histograms of Forces (Matsakis) Elegant Theory Much more flexible than angles Incorporates Metric information

Choice of the function  force   (d) distance d Gravitational Forces A B

Force-Histograms +180°-180° angle

Truth Membership of “A is RIGHT of B” F AB is called the F  histogram associated with (A,B) Divided into 4 quadrants

A is RIGHTof B The average direction  r (RIGHT) of the effective forces is computed The degree of truth a r (RIGHT) of the proposition “A is to the right of B” is computed as: a r (RIGHT) =  (  r (RIGHT))  b r (RIGHT)

Features for Linguistic Description Compute  1, the Primary Direction [d 1 = a(  1 )] Produces Basic Directional Term (e.g., RIGHT of)  2, the Secondary Direction [d 2 = a(  2 )] Supplements the description (e.g., “but a little above”) m 1 and m 2 specify how good the description is for each direction – combination of F 0 and F 2

gravitational constant -  -    A system of 27 fuzzy rules and meta-rules allows meaningful linguistic descriptions to be produced. 1. Each histogram gives its opinion about the relative position between the objects that are considered. 2. The two opinions are combined. Four numeric and two symbolic features result from this combination. Linguistic Scene Description

m 1 highmedium-highmedium-lowlow high perfectly-- 1 nearly 1 medium-high -- 2 nearly 2 loosely 1 medium-low mostlyloosely 2 3 d 1 low no primary direction no primary direction Examples of Fuzzy Rules and Meta-Rules Each linguistic output uses hedges from a dictionary of about thirty adverbs and other terms that can be tailored to individual users Fuzzy Rule-based System

m 2 highmediumlow high somewhatstrongly medium a littleslightly d 2 low no secondary direction no secondary direction Fuzzy rules for secondary direction Secondary directions

Simple Examples Green = satisfactory; Orange = Rather satisfactory; Red = Unsatisfactory

We used LADAR range images of the power-plant at China Lake, CA. They were processed by applying a median filter and a pseudo-intensity filter. The filtered images were segmented and labeled manually. Linguistic Scene Description Example

To the LEFT, but a little ABOVE. The system handles a rich language to describe the spatial organization of scene regions. It produces good intuitive results. The system here describes the relative position of the red object with respect to the group of buildings (in blue). Linguistic Scene Description Example

Linguistic Spatial Scene Description There are 5 missile launchers (1, 2, 3, 6, 8) They surround a center vehicle (4) The image includes a SAM site A convoy of vehicles (5, 7, 9, 10) is BelowRight of the SAM site Pseudo-intensity image of surface-to-air missile (SAM) site with convoy Objects detected and automatically labeled by extended ATR system Output of Scene Description Fuzzy Rule Base (from initial system)

Horizontal descriptions “ The fish can is to the left of the truck box ” “ The truck box is to the right of the fish can but extends to the front ” “ The candy box is to the left of the truck box ” “ The truck box is to the right of the candy box but extends to the front” “ The candy box is surrounded ” and “ The fish can is surrounded. ” Vertical descriptions “ The fish can is to the left of the truck box ” “ The truck box is to the right of the fish can ” “ The truck box is to the right of the candy box ” “ The candy box is on top of the fish can ” “ The fish can is below the candy box. ” Near descriptions “ The fish can and the candy box are near ” “ The fish can and the truck box are somewhat near ” “ The candy box and the truck box are somewhat near. ” An extension to the third Dimension

Outline Linguistic Scene Description (External) Linguistic Scene Description (Ego Centered) Human/Robot Dialog Sketched Route Understanding Text-to-Sketch – Inverse of above

Human/Robot Dialog Spatial Reasoning incorporated into NRL’s Natural Language Understanding System for mobile robots Sensed data results in a “grid map” that displays occupancy of cells (doesn’t need to be binary) Grid map after component labeling – robot heading towards Object 5

Generating Spatial Descriptions from Robot Sensors (here, NOMAD range sensors) Fuzzy Rules features

DETAILED SPATIAL DESCRIPTIONS for 6 OBJECTS: Object 1 is mostly behind me but somewhat to the right (the description is satisfactory). The object is very close. Object 2 is behind me (the description is satisfactory) The object is very close. Object 3 is to the left of me but extends to the rear relative to me (the description is satisfactory). The object is very close. Object 4 is mostly to the right of me but somewhat forward (the description is satisfactory). The object is very close. Object 5 is in front of me (the description is satisfactory). The object is very close. Object 6 is to the left-front of me (the description is satisfactory). The object is close. Scene 1

Human-Driven Spatial Language for Human-Robot Interaction Investigating spatial language for eldercare scenario in the home – An elderly resident has lost an object; the robot will help the resident to find it. – EXAMPLE: The eyeglasses are behind the lamp on the table to the left of the bed in the bedroom Start with human-subject experiments; Develop spatial language algorithms to match the results NSF project # IIS – U of Missouri (Marge Skubic) – U of Notre Dame (Laura Carlson)

The virtual scene used for the human subject experiments Robot addressee vs. Human addressee Vary the speaker-addressee alignment Where to find it vs. how to find it Vary candidate reference objects Hallway LivingroomBedroom

Example “where” Statements the key is on the white table in the room to his left the book is on the wooden table in the back of the room to his right the wallet's in the right room behind the bed on the table next to the lamp and the plant the glasses case is down the hall in the right room -on the right side of the room as you enter, it's between the two chairs on a table next to a statue the mug's down the hall in the left room on a table on the light brown table in front of the couch next to a purse and a hat

Example “how” Statements move forward into the intersection, look left, move to the table at the far end of the room directly across from where he is and then look down and he'll find the car keys if you turn around and then walk left or walk forward about two steps then walk left into the bedroom, walk forward until you have passed the bed on your left, turn left around the foot of the bed, proceed forward, turn left again, walk forward towards the wall near the head of the bed, look down on the bedside table and there’s the object.

Outline Linguistic Scene Description (External) – Results in Scene Matching Linguistic Scene Description (Ego Centered) Human/Robot Dialog Sketched Route Understanding Text-to-Sketch – Inverse of above

Extracting a Navigation Path from a Sketch Start: Move forward When Object #3 is loosely to the left-front Then Turn right When Object #3 is to the left Then Move forward When Object #4 is mostly in front Then Turn left When Object #4 is to the right Then Move forward When Object #4 is to the right and Object #5 is in front Then Stop Robot path

Outline Linguistic Scene Description (External) – Results in Scene Matching Linguistic Scene Description (Ego Centered) Human/Robot Dialog Sketched Route Understanding Text-to-Sketch – Inverse of above

University of Missouri Consider a frantic (and fictitious) call like this: – I saw terrorist X slip into a building up ahead. I don’t know where I am in the city and I can’t read the signs. I’m walking towards the location. – There is a somewhat long, thin rectangular shaped parking lot that extends forward. – To the immediate right of that parking lot is a parking garage. – I see a moderately small rectangular building close to me that is mostly to my left but partially forward. – Across a 4-way intersection, there is a small rectangular building close to me on my left that extends behind me. – There is another small rectangular building across the street that is mostly to the front of me, but somewhat to the left. – A short distance to the right of that building is a small L-shaped office. – I’ve reached another 4-way intersection, there is a large L-shaped building that extends to the rear. That’s where he entered. Can we automatically pinpoint the location? Motivation: Funded by the National Geospatial Intelligence Agency

ACADEMIC RESEARCH GRANT Histograms of forces for red structure (upper left) Structures into database Histograms of forces + Fuzzy rules Convert to graph Natural Language Build graphics sketch to match Text descriptions IHMC Parser MATCHING Best approach to date: Evolutionary Computation Algorithm New approach: Hybrid Algorithm EC basic structure Subgraph Isomorphism locally in database in addition to mutation MU Text-to-Sketch System

ACADEMIC RESEARCH GRANT Syntactic parse tree for: “There is a large rectangular building is to my left.” Corresponding logical form graph of the deep semantic parse J. Allen, M. Swift and W. de Beaumont, “Deep Semantic Analysis of Text”, Proc., Symp. On Semantics in Systems for Text Processing, J. Allen, Natural Language Understanding, 2 nd Ed. Redwood City, CA, USA: Benjamin-Cummings, Automatically extract objects and relationships from IHMC’s deep semantic parse Right now, language variation is restricted Some Details: Parser

ACADEMIC RESEARCH GRANT Example of “directly to the front,” which produces a constrained search region, combined with the distance descriptor “somewhat close.” A compound linguistic description: “perfectly to the left of the blue building” and “mostly to the right, but somewhat above the red building” An illustration of “somewhat to the right” which produces a rather large search region Building blocks: Intelligent object placement scheme Too expensive to use guided random search only – Directional Fuzzy Templates I. Sledge and J. Keller, "Mapping natural language to imagery: Placing objects intelligently", Proc. IEEE Int. Conf. Fuzzy Syst., Jeju Island, Korea, August, 2009, pp (Best student paper award).

ACADEMIC RESEARCH GRANT “To the immediate right of that parking lot is a large parking garage that is the same length as the parking lot.” Fuzzy Initial Placement template Optimized PlacementInitial Placement To the right but a little bit forward Perfectly to the right Linguistic Descriptions From fuzzy rule base Example of Sketch Creation After Parsing

ACADEMIC RESEARCH GRANT (a)“There is a somewhat long, thin rectangular shaped parking lot that extends forward relative to me on my right.” (b) “To the immediate right of that parking lot is a large parking garage that is the same length as the parking lot.“ (c) “I see a moderately small rectangular building close to me that is mostly to my left but partially forward.” (d) “Travelling to a 4-way intersection, there is a small rectangular building close to me on my left that extends behind me.” (e) “There is another small rectangular building across the street that is mostly to the front of me, but somewhat to the left.” (f) Left image: Final sketch (after a few more descriptions). Right image: Ground truth recovered from matching algorithm Sketch Building Example

ACADEMIC RESEARCH GRANT Evolutionary Computation Approach Sketch converted to chromosome structure: building = gene Each gene attributed with its Histograms of Forces

ACADEMIC RESEARCH GRANT Initial Generation & fitness evaluation of chromosome Add n random chromosomes to population over the Geospatial database Each gene attributed with its Histograms of Forces Blue are query HoFs Red are those of current Chromosome

ACADEMIC RESEARCH GRANT MUTATION for Reproduction Simplest version Example: Building “B” chosen for replacement Take Neighbors of “B” in database as potential replacements and choose one whose histograms best match sketch Green: in the chromosome Blue: chosen replacement To add diversity: every 100 generations, least fit 10% replaced by random chromosomes

ACADEMIC RESEARCH GRANT Iteration & Convergence Fitness: Fitness: Fitness: Red Building replaced by Blue in this chromosome When a chromosome’s match quality exceeds q min, output results

ACADEMIC RESEARCH GRANT A walk by Shakespeare’s Pizza in Columbia

ACADEMIC RESEARCH GRANT A walk by Shakespeare’s Pizza in Columbia: Up close and personal

Human Geography Project University of Missouri and University of Florida HuGeo based knowledge Pattern Recognition Techniques Simulation and Modeling GIS Geospatial representation and reasoning Conflation Confidence assessment Text2Sketch Math models Repast Sampling methods Swarm Intelligence Uncertainty Modeling (Probabilistic, fuzzy, belief ) Classifiers Clustering Ontologies Multi and Hyper Spectral analysis Fusion

HuGeo Project Overview What’s needed/missing in HuGeo-based Knowledge Discovery component? – Extraction of concepts and events from blogs, twitter, news sources, etc – Interpretation of sentences – Placing them geographically i.e., we need NLP

Conclusions We model and utilize spatial language – We’re great at modeling, pattern recognition, fusion Our languages are not very flexible To take the next steps, we need better and deeper language models Help!