© H. Hajimirsadeghi, School of ECE, University of Tehran Conceptual Imitation Learning Based on Functional Effects of Action Hossein Hajimirsadeghi School.

Slides:

Advertisements

Similar presentations

Viktor Zhumatiya, Faustino Gomeza,

Advertisements

Information Processing Technology Office Learning Workshop April 12, 2004 Seedling Overview Learning Hierarchical Reactive Skills from Reasoning and Experience.

Reinforcement Learning

Dialogue Policy Optimisation

Large Vocabulary Unconstrained Handwriting Recognition J Subrahmonia Pen Technologies IBM T J Watson Research Center.

Perception and Perspective in Robotics Paul Fitzpatrick MIT Computer Science and Artificial Intelligence Laboratory Humanoid Robotics Group Goal To build.

 INTRODUCTION  STEPS OF GESTURE RECOGNITION  TRACKING TECHNOLOGIES  SPEECH WITH GESTURE  APPLICATIONS.

Yiannis Demiris and Anthony Dearden By James Gilbert.

ROBOT BEHAVIOUR CONTROL SUCCESSFUL TRIAL OF MARKERLESS MOTION CAPTURE TECHNOLOGY Student E.E. Shelomentsev Group 8Е00 Scientific supervisor Т.V. Alexandrova.

Hidden Markov Models Theory By Johan Walters (SR 2003)

SA-1 Body Scheme Learning Through Self-Perception Jürgen Sturm, Christian Plagemann, Wolfram Burgard.

SSP Re-hosting System Development: CLBM Overview and Module Recognition SSP Team Department of ECE Stevens Institute of Technology Presented by Hongbing.

Spatio-Temporal Sequence Learning of Visual Place Cells for Robotic Navigation presented by Nguyen Vu Anh date: 20 th July, 2010 Nguyen Vu Anh, Alex Leng-Phuan.

Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.

Fuzzy Inference System Learning By Reinforcement Presented by Alp Sardağ.

Incremental Learning of Temporally-Coherent Gaussian Mixture Models Ognjen Arandjelović, Roberto Cipolla Engineering Department, University of Cambridge.

Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.

Agents to Simulate Social Human Behaviour in a Work Team Agents to Simulate Social Human Behaviour in a Work Team Barcelona, February Arantza Aldea.

 For many years human being has been trying to recreate the complex mechanisms that human body forms & to copy or imitate human systems  As a result.

Cognitive Computer Vision 3R400 Kingsley Sage Room 5C16, Pevensey III

Particle Swarm Optimization Algorithms

Isolated-Word Speech Recognition Using Hidden Markov Models

The free-energy principle: a rough guide to the brain? K Friston Summarized by Joon Shik Kim (Thu) Computational Models of Intelligence.

INTRODUCTION TO MACHINE LEARNING. $1,000,000 Machine Learning  Learn models from data  Three main types of learning :  Supervised learning  Unsupervised.

Graphical models for part of speech tagging

An Architecture for Empathic Agents. Abstract Architecture Planning + Coping Deliberated Actions Agent in the World Body Speech Facial expressions Effectors.

Vinay Papudesi and Manfred Huber.  Staged skill learning involves:  To Begin:  “Skills” are innate reflexes and raw representation of the world. 

REINFORCEMENT LEARNING LEARNING TO PERFORM BEST ACTIONS BY REWARDS Tayfun Gürel.

Stochastic Algorithms Some of the fastest known algorithms for certain tasks rely on chance Stochastic/Randomized Algorithms Two common variations – Monte.

MODULE 23 COGNITION/THINKING. THINKING Thinking is a cognitive process in which the brain uses information from the senses, emotions, and memory to create.

Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.

Segmental Hidden Markov Models with Random Effects for Waveform Modeling Author: Seyoung Kim & Padhraic Smyth Presentor: Lu Ren.

 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.

Fundamentals of Hidden Markov Model Mehmet Yunus Dönmez.

Memory Components, Forgetting, and Strategies

Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles Jin Chen Sep 2012.

Hybrid Behavior Co-evolution and Structure Learning in Behavior-based Systems Amir massoud Farahmand (a,b,c) (

Beyond Gazing, Pointing, and Reaching A Survey of Developmental Robotics Authors: Max Lungarella, Giorgio Metta.

Recognition, Analysis and Synthesis of Gesture Expressivity George Caridakis IVML-ICCS.

Key Centre of Design Computing and Cognition – University of Sydney Concept Formation in a Design Optimization Tool Wei Peng and John S. Gero 7, July,

Hidden Markov Models in Keystroke Dynamics Md Liakat Ali, John V. Monaco, and Charles C. Tappert Seidenberg School of CSIS, Pace University, White Plains,

Curiosity-Driven Exploration with Planning Trajectories Tyler Streeter PhD Student, Human Computer Interaction Iowa State University

Learning Agents MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way.

Project Lachesis: Parsing and Modeling Location Histories Daniel Keeney CS 4440.

Chapter 7. Learning through Imitation and Exploration: Towards Humanoid Robots that Learn from Humans in Creating Brain-like Intelligence. Course: Robots.

ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:

Chapter 1. Cognitive Systems Introduction in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans Park, Sae-Rom Lee, Woo-Jin Statistical.

Performance Comparison of Speaker and Emotion Recognition

Presented by: Fang-Hui Chu Discriminative Models for Speech Recognition M.J.F. Gales Cambridge University Engineering Department 2007.

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Elements of a Discrete Model Evaluation.

Chapter 8. Learning of Gestures by Imitation in a Humanoid Robot in Imitation and Social Learning in Robots, Calinon and Billard. Course: Robots Learning.

1 Hidden Markov Models Hsin-min Wang References: 1.L. R. Rabiner and B. H. Juang, (1993) Fundamentals of Speech Recognition, Chapter.

EEL 6586: AUTOMATIC SPEECH PROCESSING Hidden Markov Model Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida March 31,

An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.

Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.

Visual Recognition Tutorial1 Markov models Hidden Markov models Forward/Backward algorithm Viterbi algorithm Baum-Welch estimation algorithm Hidden.

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.

1 Algoritmos Genéticos aplicados em Machine Learning Controle de um Robo (em inglês)

Traffic Simulation L2 – Introduction to simulation Ing. Ondřej Přibyl, Ph.D.

Introduction to Machine Learning, its potential usage in network area,

EEL 6586: AUTOMATIC SPEECH PROCESSING Hidden Markov Model Lecture

Chapter 6: Temporal Difference Learning

Chapter 3: The Reinforcement Learning Problem

Overview of Machine Learning

The free-energy principle: a rough guide to the brain? K Friston

Chapter 3: The Reinforcement Learning Problem

October 6, 2011 Dr. Itamar Arel College of Engineering

Chapter 6: Temporal Difference Learning

CHAPTER I. of EVOLUTIONARY ROBOTICS Stefano Nolfi and Dario Floreano

Presentation transcript:

© H. Hajimirsadeghi, School of ECE, University of Tehran Conceptual Imitation Learning Based on Functional Effects of Action Hossein Hajimirsadeghi School of Electrical and Computer Engineering, University of Tehran, Iran 28/04/2011

© H. Hajimirsadeghi, School of ECE, University of Tehran Outline Introduction –Imitation Learning –Concepts –Conceptual Imitation Learning –Problem Statement Hidden Markov Models –Definition & Main Problems The Proposed Algorithm Experiments Conclusions 2

© H. Hajimirsadeghi, School of ECE, University of Tehran What is Imitation Learning? Imitation Learning is A Type of Social Learning –Transmitting skills and knowledge from an agent to another agent Why is it Beneficial?: –In General: Safety Increase Speed Increase Energy Consumption Decrease –In Robotics: User-friendly and simple means of programming 3

© H. Hajimirsadeghi, School of ECE, University of Tehran Concept What is a Concept? –A representation of world in agent’s mind (General) –A unit of knowledge or meaning made out of some other units which share some characteristics (Zentall et al., 2002) Example: A Specific Food Example: General Food Concept 4

© H. Hajimirsadeghi, School of ECE, University of Tehran Concept Representations Exemplar Prototype 5

© H. Hajimirsadeghi, School of ECE, University of Tehran Types of Concepts Perceptual Concepts Relational Concepts Associative Concepts 6 A Concept Perceptual Space Needs an external information Perceptual SimilarityPerception & Functional Similarity Functional Similarity

© H. Hajimirsadeghi, School of ECE, University of Tehran A Real Example of Relational Concepts 2 Concept of Respect

© H. Hajimirsadeghi, School of ECE, University of Tehran Conceptual Imitation Learning Low Level Imitation –Mimicking True Imitation –Understanding –Generalization –Recognition –Generation 8 Needs Conceptualization & Abstraction

© H. Hajimirsadeghi, School of ECE, University of Tehran State-of-the-Art Works on Imitation and Conceptual Abstraction 9 Perceptual Concepts Samejima et al. (2002) Cadone & Nakamura (2006) Inamura et al. (2004) Calinon & Billard (2004) Calinon et al. (2005) Billard et al. (2006) Takano & Nakamura (2006) Lee et al. (2008) Kulic et al. (2008, 2009) Relational Concepts Mobahi et al. (2005, 2007) Hajimirsadeghi et al. (2010) Using modular controllers and predictors Stochastic Modeling with Hidden Markov Models Integration of Recognition and Regeneration Using Associative Neural Networks Autonomous & Incremental Concept Learning & Acquisition One-to-one relation between concepts and actions Only for Single Observations Deterministic Modeling Learning Concept through Interaction with the Teacher

© H. Hajimirsadeghi, School of ECE, University of Tehran Our Proposed Model 10 Stochastic Modeling with Hidden Markov Models Integration of Recognition and Regeneration Autonomous & Incremental Concept Learning & Acquisition Each Concept is Represented by All Perceptual Variants of an Action Suitable for Sequence of Observations Relational Concepts Functional Similarity is Identified by the Effects

© H. Hajimirsadeghi, School of ECE, University of Tehran Problem Statement Proposing an Incremental and Gradual Learning Algorithm for Autonomous Acquisition, Generalization, Recognition, and Regeneration of Relational Concepts through perception of Spatio-Temporal demonstrations and Identifying their Functional Effects. Main Ideas: –Using Prototypes (Start From Exemplar, End with Prototypes) –A Prototype Abstracts Perceptually Similar Demonstrations. –A Concept Emerges as a Set of Prototypes which Have Similar Functionalities. –Functional Similarity between Demonstrations is Understood by Recognizing their Functional Effects (External Information). 11

© H. Hajimirsadeghi, School of ECE, University of Tehran Hidden Markov Models 12

© H. Hajimirsadeghi, School of ECE, University of Tehran Main Problems for HMMs Training –Given or Evaluation –Given and Sequence Generation –Give 13 Solution: Forward Algorithm Solution: Baum-Welch Algorithm (Re-estimation Formulas) Solution: Estimation of State Duration + Greedy Selection of Consecutive States and Observations + Curve Fitting HMMs can be used for Both Recognition and Generation Conceptual Imitation Learning

© H. Hajimirsadeghi, School of ECE, University of Tehran The Proposed Algorithm Some Definitions: –An exemplar is an HMM trained by only one demonstration –A prototype is an HMM made out of unifying perceptually the same exemplars –Exemplars are stored in the Working Memory (WM) –Prototypes are stored in the Long- term Memory (LTM) –A concept is a set of HMM exemplars and prototypes, sharing the same functional effects. 14 Concept 1 Concept 2 Concept Concepts Prototype LTM Exemplar WM

15 x := Sense() The effect has an equivalent sensory-motor concept in the memory Find the most probable prototype of concept Make new exemplar with x Make new concept with this exemplar Make new exemplar with x for the concept Yes No There is at least one prototype for concept … is the minimum log likelihood of the sequences previously encoded into the HMM prototype The effect of demonstrated action is recognized A New Action is Demonstrated Effect := the equivalent sensory-motor concept in the memory

Yes No Cluster exemplars and prototypes of the concept Prototyping criteria are satisfied Make new prototypes for the concept Yes No 16 Being Sufficiently Cohered Including Sufficient Number of Elements …

17 After Learning (Recall Phase) C1 Action 1 Concepts Actions C2 Action 2 C3 Action 3 3. Probability of Observation is Computed Against All the Prototypes Prototypes & Exemplars 2. The New Demonstration is Perceived (Perception Sequence) 1. An Action is Demonstrated 4. Most Probable Concept is retrived 5. The action is Executed

© H. Hajimirsadeghi, School of ECE, University of Tehran Experiment: Conceptual Hand Gesture Imitation Based on their Emotional Effects There are a teacher, a humanoid robot, and a human agent The teacher demonstrates a gesture The human agent makes an emotional response (effect of the teacher’s action) The robot perceive the demonstrations and recognize the emotional response 18 Action 3Action 2Action 1 Human Agent’s Response Concept# - Striking from Right Striking from Left Angry FaceAnger1 - Hitting the Chest Hitting the HeadUnhappy FaceUnhappiness2 -- Throwing Fist Up & Down Happy FaceHappiness3 Caressing the Face Sketching Heart Sign Air Kiss Caressing the Robot’s Tactile Sensor Love4 -- Cut-Throat Gesture Disgusted FaceDisgust5

© H. Hajimirsadeghi, School of ECE, University of Tehran Experiment: Conceptual Hand Gesture Imitation Based on their Emotional Effects Kinesthetic Teaching for Making Demonstrations 19 For Facial Expression Recognition, we used Eigen Face Algorithm (Turk 91) Principal Component Analysis 1-Nearest Neighbor

© H. Hajimirsadeghi, School of ECE, University of Tehran 20 Results Perception Sequences are incrementally entered to the learning algorithm K-fold Cross Validation with k=5 Scoring Mechanism: –+1(Hit) –-1(Miss)

© H. Hajimirsadeghi, School of ECE, University of Tehran SumDisgustLoveHappinessUnhappinessAnger Experiment # Results Number of Generated Prototype For Each Experiment 21

© H. Hajimirsadeghi, School of ECE, University of Tehran Robot Gesture Reproduction Results 22

© H. Hajimirsadeghi, School of ECE, University of Tehran Conclusion An Incremental and Gradual Learning Algorithm for Autonomous Acquisition, Generalization, Recognition, and Regeneration of Relational Concepts through perception of Spatio-Temporal demonstrations and their Functional Effects Outcome: An Agent is Trained Who can make Functional Effects in the Environment 23

© H. Hajimirsadeghi, School of ECE, University of Tehran Conclusions Consequences of Imitation Learning by Relational Concepts: –Recognition of Novel Demonstrations of the Learned Concepts –No Need of Motor Learning for Previously Learned Concepts –If Motor Programs are Learned for the Perceptual Variants of A Concept, Flexibility of Choice between the alternatives – Less Concepts Smaller Representation of World Simpler Interaction with World Smaller Memory Simpler Search –Ease of Knowledge Transfer from an Agent to Another Agent from a Situation to Another Situation 24

© H. Hajimirsadeghi, School of ECE, University of Tehran Thanks for Your Attention 28/04/2011

© H. Hajimirsadeghi, School of ECE, University of Tehran Clustering Clustering All HMM Exemplars and Prototypes of A Concept Pseudo-Distance Definition (Rabiner 1989) Agglomerative Hierarchical Clustering 18

© H. Hajimirsadeghi, School of ECE, University of Tehran Proto-Symbol Space of HMM Prototypes (Using Multidimensional Scaling Method) Results 23

© H. Hajimirsadeghi, School of ECE, University of Tehran What is Imitation Learning? Imitation Learning is A Type of Social Learning –Transmitting skills and knowledge from an agent to another agent Why is it Beneficial?: –In General: Safety Increase Speed Increase Energy Consumption Decrease –In Robotics: User-friendly means of programming Better regeneration of human-like movements understanding mechanisms for developmental organization of perception- action integration in animals. 3

© H. Hajimirsadeghi, School of ECE, University of Tehran Conceptual Imitation Learning Low Level Imitation –Mimicking True Imitation –Understanding –Recognition –Generalization –Generation Importance of Conceptual Imitation Learning –Recognition of Novel Demonstrations –No Need of Motor Learning for Previously Learned Concepts –Less Memory, Easy Search –Ease of Knowledge Transfer from Agent to Agent –For Concepts with Functional Abstraction: Less Concept, Smaller Representation of World, Simpler Interaction with World Motor Learning for Only one of the Perceptual Variants –Else: Flexibility of Choice between the alternatives Ease of Knowledge Transfer from a Situation to Another Situation 8 Needs Conceptualization & Abstraction

© H. Hajimirsadeghi, School of ECE, University of Tehran Importance of HMMs for Conceptual Imitation Learning Simultaneous Modeling of the Statistical Variations in –Dynamics of Observation Sequence & –Amplitude of Observations A Unified Mathematical Model for Both –Recognition –Generation 14

© H. Hajimirsadeghi, School of ECE, University of Tehran Clustering Clustering All HMM Exemplars and Prototypes of A Concept Pseudo-Distance Definition (Rabiner 1989) Agglomerative Hierarchical Clustering Conditions For Cluster Selection: –Falling Behind the Threshold Distance –Surpassing Minimum Number of Elements 19

© H. Hajimirsadeghi, School of ECE, University of Tehran Clustering 20 C1 Action 1 Prototypes and ExemplarsConcepts Actions Also Save the value of for the new prototypes Prototyping the Selected Clusters and save in the LTM LTM

© H. Hajimirsadeghi, School of ECE, University of Tehran Experiment: Human-Robot Interaction Task Conceptual Hand Gesture Imitation The concepts are Relational Demonstrations are incrementally entered to the proposed algorithm 19

© H. Hajimirsadeghi, School of ECE, University of Tehran 21 Results Perception Sequence is a 2-D Signal of Changes in the Hand Path of Demonstrator K-fold Cross Validation with k=5 Reinforcement Signals: –+1(reward) –-1(punishment) Parameter Settings:

© H. Hajimirsadeghi, School of ECE, University of Tehran Results Recognition Accuracy After Learning –Use Only Prototypes –Use Prototypes and Exemplars 26

© H. Hajimirsadeghi, School of ECE, University of Tehran Conclusion An Incremental and Gradual Learning Algorithm for Autonomous Acquisition, Generalization, Recognition, and Regeneration of Relational Concepts through perception of Spatio-Temporal demonstrations of the Teacher –Using Prototypes to Represent Concepts –A Prototype Abstracts Perceptually Similar Demonstrations of a Concept –A Concept Comprises a Set of Perceptual Prototypes which Have Similar Functionalities. –Functional Similarity between Demonstrations is understood by Interaction with the Teachers (External Information). 28

© H. Hajimirsadeghi, School of ECE, University of Tehran Conclusions Future Works: –Using HMMs for Multimodal Integration of Heterogeneous Perceptions Representation and Recognition of Multimodal Concepts –Concept Recognition with Incomplete Observation Sequences –Conceptual Imitation Learning Based on Functional Effects of Action E.g., emotional effects of action –Multi-Resolution Representation of Concepts by Hierarchical Organization of Prototypes 30