Learning Procedural Knowledge through Observation -Michael van Lent, John E. Laird – 인터넷 기술 전공 022ITI02 성유진.

Slides:

Advertisements

Similar presentations

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California

Advertisements

Learning Procedural Planning Knowledge in Complex Environments Douglas Pearson March 2004.

Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.

Automating Software Module Testing for FAA Certification Usha Santhanam The Boeing Company.

Machine Learning: Intro and Supervised Classification

ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS

1 Update on Learning By Observation Learning from Positive Examples Only Tolga Konik University of Michigan.

JSIMS 28-Jan-99 1 JOINT SIMULATION SYSTEM Modeling Command and Control (C2) with Collaborative Planning Agents Randall Hill and Jonathan Gratch University.

What is Software Design?. Systems Development Life- Cycle Planning Analysis Design Implementation Design.

ACCOUNTING INFORMATION SYSTEMS

Chapter 12: Expert Systems Design Examples

Expert System Human expert level performance Limited application area Large component of task specific knowledge Knowledge based system Task specific knowledge.

Learning from Observations Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 18 Spring 2004.

Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.

A Summary of the Article “Intelligence Without Representation” by Rodney A. Brooks (1987) Presented by Dain Finn.

Learning from Observations Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 18 Fall 2005.

Software Testing Using Model Program DESIGN BY HONG NGUYEN & SHAH RAZA Dec 05, 2005.

VLSI Systems--Spring 2009 Introduction: --syllabus; goals --schedule --project --student survey, group formation.

Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.

1 Learning from Behavior Performances vs Abstract Behavior Descriptions Tolga Konik University of Michigan.

DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):

Learning from Observations Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 18 Fall 2004.

SIM5102 Software Evaluation

Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 7: Expert Systems and Artificial Intelligence Decision Support.

Chapter 1 Principles of Programming and Software Engineering.

Ideas for Explainable AI

© 2006 Pearson Addison-Wesley. All rights reserved2-1 Chapter 2 Principles of Programming & Software Engineering.

Chapter 3 Planning Your Solution

EMBEDDED SOFTWARE Team victorious Team Victorious.

Dr. Pedro Mejia Alvarez Software Testing Slide 1 Software Testing: Building Test Cases.

Section 2: Science as a Process

System/Software Testing

1 Shawlands Academy Higher Computing Software Development Unit.

Chapter 2 The process Process, Methods, and Tools

1 BTEC HNC Systems Support Castle College 2007/8 Systems Analysis Lecture 9 Introduction to Design.

Human-Centered Information Visualization Jiajie Zhang, Kathy Johnson, Jack Smith University of Texas at Houston Jane Malin NASA Johnson Space Center July.

INTRODUCTION TO MACHINE LEARNING. $1,000,000 Machine Learning  Learn models from data  Three main types of learning :  Supervised learning  Unsupervised.

Business Analysis and Essential Competencies

Author: James Allen, Nathanael Chambers, etc. By: Rex, Linger, Xiaoyi Nov. 23, 2009.

CHA2555 Week2: Knowledge Representation, Logic and Planning Lee McCluskey First term:

1 The Software Development Process  Systems analysis  Systems design  Implementation  Testing  Documentation  Evaluation  Maintenance.

SOFTWARE DESIGN.

Chapter 12 Evaluating Products, Processes, and Resources.

Synthetic Cognitive Agent Situational Awareness Components Sanford T. Freedman and Julie A. Adams Department of Electrical Engineering and Computer Science.

110/19/2015CS360 AI & Robotics AI Application Areas  Neural Networks and Genetic Algorithms  These model the structure of neurons in the brain  Humans.

University of Windsor School of Computer Science Topics in Artificial Intelligence Fall 2008 Sept 11, 2008.

1 Test Selection for Result Inspection via Mining Predicate Rules Wujie Zheng

1 Introduction to Software Testing. Reading Assignment P. Ammann and J. Offutt “Introduction to Software Testing” ◦ Chapter 1 2.

MODES-650 Advanced System Simulation Presented by Olgun Karademirci VERIFICATION AND VALIDATION OF SIMULATION MODELS.

Introduction to Earth Science Section 2 Section 2: Science as a Process Preview Key Ideas Behavior of Natural Systems Scientific Methods Scientific Measurements.

Learning, page 1 CSI 4106, Winter 2005 Symbolic learning Points Definitions Representation in logic What is an arch? Version spaces Candidate elimination.

The Software Development Process

1 CSCD 326 Data Structures I Software Design. 2 The Software Life Cycle 1. Specification 2. Design 3. Risk Analysis 4. Verification 5. Coding 6. Testing.

MACHINE LEARNING 10 Decision Trees. Motivation  Parametric Estimation  Assume model for class probability or regression  Estimate parameters from all.

Joseph Xu Soar Workshop Learning Modal Continuous Models.

© 2006 Pearson Addison-Wesley. All rights reserved2-1 Chapter 2 Principles of Programming & Software Engineering.

© 2006 Pearson Addison-Wesley. All rights reserved 2-1 Chapter 2 Principles of Programming & Software Engineering.

Structured Programming (4 Credits)

Data Mining and Decision Support

Beyond Chunking: Learning in Soar March 22, 2003 John E. Laird Shelley Nason, Andrew Nuxoll and a cast of many others University of Michigan.

1 Learning through Interactive Behavior Specifications Tolga Konik CSLI, Stanford University Douglas Pearson Three Penny Software John Laird University.

1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.

Rigorous Testing by Merging Structural and Behavioral UML Representations Presented by Chin-Yi Tsai.

Understanding Naturally Conveyed Explanations of Device Behavior Michael Oltmans and Randall Davis MIT Artificial Intelligence Lab.

NTT-MIT Collaboration Meeting, 2001Leslie Pack Kaelbling 1 Learning in Worlds with Objects Leslie Pack Kaelbling MIT Artificial Intelligence Laboratory.

Software Testing.

Profiling based unstructured process logs

Software Life Cycle Models

Verification and Validation

Paul Scerri and Nancy Reed

Presentation transcript:

Learning Procedural Knowledge through Observation -Michael van Lent, John E. Laird – 인터넷 기술 전공 022ITI02 성유진

- C O N T E N T - INTRODUCTION REATED WORK KNOMIC EXPERIMENTS AND RESULTS FUTURE WORK CONCLUSION

Key words. Machine learning. Knowledge acquisition. Rule learning. User modeling Expert effort vs. Research effort for a variety of knowledge acquisition approaches (Figure 1) 1. INTRODUCTION (1/2) Figure 1

Hypothesis. “ Learning procedural knowledge from observations of an expert is more efficient than the standard knowledge-acquisition approach and is a more tractable research problem than unsupervised learning approach ” Primary Goal. This research is to explore is to explore and evaluate observation as a knowledge source for learning procedural knowledge 1. INTRODUCTION (2/2)

2. RELATED WORK (1/3) Behavioral Cloning. Learning by observation - to learning the knowledge necessary to fly a simulated airplane along a specific flight plan in the Silicon Graphics fight simulator (Bain and Sammut). Situation/action examples taken from observation of an expert. Used to build decision trees that decide which action to take based on current sensor input. Advantage - effectiveness in a complex, non-deterministic, dynamic environment. Defect - does not learn knowledge that allows the agent to dynamically select goals and procedures

2. RELATED WORK (2/3) The OBSERVER system. STRIPS-style operator - pre-condition - dynamically select based on the current sensor input - assume : operator action always performed in a single time step and without error Advantage. more expressive knowledge format Defect. problem in more complex task. contain no noise or dynamic change

2. RELATED WORK (3/3) The TRAIL system. learning algorithm and a planning algorithm to create teleo-operators. inductive logic programming to learn TOPs based on positive and negative examples from the traces OBSERVER, TRAIL ’ s TOPs :. difficultly in complex domains with uncertain action

3. K N O M I C (1/5) KNOMIC. learning-by-observation system based on a general framework for learning procedural knowledge from observation of an expert Knowledge Representation. selecting appropriate goals. performing actions to achieve and maintain those goals. knomic operator - non-deterministic action : implemented in the environment determined by the environment may or may not have the expected outcome - recover when the actions and environment don ’ t behave as expected : have multiple procedures for achieving its goals. classification of each operator as homeostatic, one-time or repeatable

3. K N O M I C (2/5) Learning – by-Observation Framework. major advantage : modularity Expert Environmental Interface Environment Observation Generation Execution Architecture Operator Classification Knowledge Formatting Condition Learning Output Commands Parameters& SensorStep1 Annotations Observation TracesStep2 Step3 Operator Conditions Step4 Learned Task Knowledge Step5 Formatted Knowledge -Figure 2: The learning-by-observation framework

3. K N O M I C (3/5) Knomic is one instantiation of the learning-by-observation framework Expert Environmental Interface ModSAF Observation Generation Soar Architecture Operator Classification Knowledge Formatting Specific to General Induction Output Commands Parameters& Sensor Annotations Observation Traces Operator Conditions Learned Task Knowledge Soar Productions

3. K N O M I C (4/5) Observation Trace Generation. sensor inputs the expert receives. actions the expert performs each cycle. annotation : whenever the goal he/she is seeking to achieve changes the task is being performed during a review of his/her behavior to avoid interrupt Specific-to-General Condition Learning. Find-S specific-to-general induction algorithm x 1 =, + x 2 =, + h 0 = h 1 = h 2 = Hypotheses H Specific General Instances X

3. K N O M I C (5/5) Operator Classification. examining if and when the expert reselects an operator as its goal conditions change from achieved to unachieved. homeostatic operator - to maintain its goal conditions as true once they are achieved - become untrue, immediately re-activated to re-achieved. one-time operator - only achieve its goal conditions once and then never be re-activated. repeatable operator - not be immediately re-activated but can be reselected if trigged by another operator Soar production generation. hierarchical, symbolic, propositional representation allowing some numerical relations. three classes of Soar production - operator pre-condition production, goal-achieved productions, action application production

4. EXPERIMENTS AND RESULTS (1/3) 1th experiment. accuracy of the Knomic system when provided with error-free observation 2th experiment. knomic ’ s accuracy with more realistic observation traces generated from observations of a human expert domain : air combat. 31operators in a pour level hierarchy. operator conditions : conjunctive. domain are triggered by external events sensed by the expert Evaluation Criteria. using the learned knowledge by the agent. compared to rules for the same task created by human programmer. fully correct, functionally correct, incorrect correct

4. EXPERIMENTS AND RESULTS (2/3) Experiment 1. four observation traces. 30minutes Result. 1900decision cycles sensor input. 40 goal annotation. 140 output command.101 fully correct, 29 functionally correct,10 incorrect. 10 incorrect production six of 10 : extraneous condition 3 of 10 : due to missing sensors in the environmental interface final incorrect : requires a negated test

4. EXPERIMENTS AND RESULTS (3/3) Experiment 2. two observation traces by a human expert. initialization, takeoff, racetrack. 45 production - 29 correct, 13 functionally correct, 3 incorrect. This experiment shows that KnoMic can successfully learn from observations of human experts but additional research needs to explore a more robust algorithm.

5. FUTURE WORK better learning algorithm. more powerful biases. aid in removing the additional extraneous conditions learning knowledge representation. include structured sensors, operator parameters prove annotation. automatically segmenting the observation traces - one possible based on these preliminary operators, the rest of the traces could be automatically annotated - another possible detect shifts in behavior through statistical analysis of the behavior traces

5. CONCLUSION focuses on the applications of a relatively simple learning algorithm to real world problem “ As demonstrated here, even a simple learning algorithm can be effective when embedded in a carefully designed framework. Hopefully, future research will find that more sophisticated learning algorithms are even more effective. ”