Fuzzy Reinforcement Learning Agents By Ritesh Kanetkar Systems and Industrial Engineering Lab Presentation May 23, 2003.

Slides:

Advertisements

Similar presentations

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California

Advertisements

Reinforcement Learning

Slides from: Doug Gray, David Poole

On-line learning and Boosting

Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.

Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.

COSC 878 Seminar on Large Scale Statistical Machine Learning 1.

Optimal resampling using machine learning Jesse McCrosky.

GridFlow: Workflow Management for Grid Computing Kavita Shinde.

LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Reinforcement Learning

Fuzzy Logic Based on a system of non-digital (continuous & fuzzy without crisp boundaries) set theory and rules. Developed by Lotfi Zadeh in 1965 Its advantage.

Bayesian Reinforcement Learning with Gaussian Processes Huanren Zhang Electrical and Computer Engineering Purdue University.

Reinforcement Learning Rafy Michaeli Assaf Naor Supervisor: Yaakov Engel Visit project’s home page at: FOR.

Application of Reinforcement Learning in Network Routing By Chaopin Zhu Chaopin Zhu.

Fuzzy Inference System Learning By Reinforcement Presented by Alp Sardağ.

NORM BASED APPROACHES FOR AUTOMATIC TUNING OF MODEL BASED PREDICTIVE CONTROL Pastora Vega, Mario Francisco, Eladio Sanz University of Salamanca – Spain.

1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.

1 Kunstmatige Intelligentie / RuG KI Reinforcement Learning Johan Everts.

LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.

Information Fusion Yu Cai. Research Article “Comparative Analysis of Some Neural Network Architectures for Data Fusion”, Authors: Juan Cires, PA Romo,

Lab 01 Fundamentals SE 405 Discrete Event Simulation

Radial Basis Function Networks

Neuro-fuzzy Systems Xinbo Gao School of Electronic Engineering Xidian University 2004,10.

COGNITIVE RADIO FOR NEXT-GENERATION WIRELESS NETWORKS: AN APPROACH TO OPPORTUNISTIC CHANNEL SELECTION IN IEEE BASED WIRELESS MESH Dusit Niyato,

Soft Computing Colloquium 2 Selection of neural network, Hybrid neural networks.

Machine Learning. Learning agent Any other agent.

FAULT DIAGNOSIS OF THE DAMADICS BENCHMARK ACTUATOR USING NEURO-FUZZY SYSTEMS WITH LOCAL RECURRENT STRUCTURE FAULT DIAGNOSIS OF THE DAMADICS BENCHMARK ACTUATOR.

Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.

Reinforcement Learning

Using Neural Networks in Database Mining Tino Jimenez CS157B MW 9-10:15 February 19, 2009.

Chapter 3 Neural Network Xiu-jun GONG (Ph. D) School of Computer Science and Technology, Tianjin University

Machine Learning Seminar: Support Vector Regression Presented by: Heng Ji 10/08/03.

Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 14/15 – TP19 Neural Networks & SVMs Miguel Tavares.

Department of Electrical Engineering, Southern Taiwan University Robotic Interaction Learning Lab 1 The optimization of the application of fuzzy ant colony.

INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY, P.P , MARCH An ANFIS-based Dispatching Rule For Complex Fuzzy Job Shop Scheduling.

Fuzzy Inference (Expert) System

Stock Price Prediction Using Reinforcement Learning

Methodology of Simulations n CS/PY 399 Lecture Presentation # 19 n February 21, 2001 n Mount Union College.

Neural-Network-Based Fuzzy Logical Control and Decision System 主講人虞台文.

Reinforcement Learning

ANFIS (Adaptive Network Fuzzy Inference system)

Neural Networks Chapter 7

PART 9 Fuzzy Systems 1. Fuzzy controllers 2. Fuzzy systems and NNs 3. Fuzzy neural networks 4. Fuzzy Automata 5. Fuzzy dynamic systems FUZZY SETS AND FUZZY.

CUHK Learning-Based Power Management for Multi-Core Processors YE Rong Nov 15, 2011.

Investigation of Autonomy and Coordination Xiaobing Zhao Computer Integrated Manufacturing (CIM) Lab Systems and Industrial Engineering The University.

Hazırlayan NEURAL NETWORKS Backpropagation Network PROF. DR. YUSUF OYSAL.

Reinforcement Learning AI – Week 22 Sub-symbolic AI Two: An Introduction to Reinforcement Learning Lee McCluskey, room 3/10

CHEE825 Fall 2005J. McLellan1 Nonlinear Empirical Models.

Organic Evolution and Problem Solving Je-Gun Joung.

Reinforcement Learning for Mapping Instructions to Actions S.R.K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay Computer Science and Artificial.

Chapter 10 FUZZY CONTROL Chi-Yuan Yeh.

Reinforcement Learning. Overview Supervised Learning: Immediate feedback (labels provided for every input). Unsupervised Learning: No feedback (no labels.

The article written by Boyarshinova Vera Scientific adviser: Eltyshev Denis THE USE OF NEURO-FUZZY MODELS FOR INTEGRATED ASSESSMENT OF THE CONDITIONS OF.

A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.

Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.

Reinforcement Learning for 3 vs. 2 Keepaway P. Stone, R. S. Sutton, and S. Singh Presented by Brian Light.

School of Industrial and Systems Engineering, Georgia Institute of Technology 1 Defuzzification Filters and Applications to Power System Stabilization.

Introduction to Machine Learning, its potential usage in network area,

Introduction of Reinforcement Learning

Neural Networks Advantages Criticism

Continous-Action Q-Learning

Introduction to Scheduling Chapter 1

The use of Neural Networks to schedule flow-shop with dynamic job arrival ‘A Multi-Neural Network Learning for lot Sizing and Sequencing on a Flow-Shop’

Chapter 1: Introduction

Fuzzy Logic Based on a system of non-digital (continuous & fuzzy without crisp boundaries) set theory and rules. Developed by Lotfi Zadeh in 1965 Its advantage.

What is Artificial Intelligence?

Presentation transcript:

Fuzzy Reinforcement Learning Agents By Ritesh Kanetkar Systems and Industrial Engineering Lab Presentation May 23, 2003

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering What is a agent? An agent is a computer system situated in some environment, and that is capable of autonomous action in this environment in order to meet its design objectives. An autonomous agent should be able to act without the direct intervention of humans or other agents, and should have control over its own actions and internal state.

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Why Agents? Ability to act autonomously Flexibility, scalability and modularity characteristics Real-time performance Suitability for distributed applications Ability to work co-operatively in teams

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Learning in Agents Supervised Learning  Neural Network Unsupervised Learning  Reinforcement Learning

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Supervised vs. Unsupervised Supervised Learning  Learning under a skilled teacher  Learning through presentation of input-output pairs  Given a set of inputs attempts to predict the output values Unsupervised Learning  No supervisor present  Only data available is through feedback  Learning through evaluation of actions

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Reinforcement Learning Maps states to actions Input is current state S1 Output is selected action Action change the state to S2 After evaluating the mapping a reinforcement signal is given to the agent

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Reinforcement Learning Advantages  Less environment oriented programming  Works in changing environment Problems  Large number of possible states  Consider only discrete events ( Real world problems are continuous)

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering b1b1 b3 b2 a1a1 a3 a2 R=0. 5 S1S2 T=30 M1 T=30 M1 T=20 M2 T=10 M3 S3 How RL works?

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Continued Aim 1 : To find the shortest processing time. Ideal Actions : a3 – b3. Assumptions :  Action with highest utility is chosen  Each machine bids for the part as per its utility value (initially all 0. 5).  The winning machine gives a part of its utility to the previous winning agent for successfully creating the state for him.

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Continued ( Rule for reward) Rule for giving reward to previous winning agent t (min) r Reward from state S0 and S1, say 0.25 for our model.

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Continued (Calculations of utility value) IterationStateMachine selectedUtility value 1S1M2( ) = S1M1( ) = S1M3( ) = 0.55

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Continued (Changes in utility value) IterationsM1M2M3M1M2M

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Use of Fuzzy Logic Fuzzy logic to map states (environment) to actions. Problem tackled is of the elimination of discrete events by use of fuzzy logic. Fuzzy logic to integrate the multiple rewards into a single feedback signal. Due to large action space we cannot use traditional lookup tables. So generalization of mapping is required. Incorporation of human language.

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Problems Agents as dynamical systems interacting with the environment Network of agents (Multi-agent system) Multiple reward system Multiple criteria systems Continues events system Large state space in real world problems Bargaining problems

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Fuzzy Inference System (FIS)

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering FIS FIS rule base is made of N rules : R i : If s1 = L 1i and ……and sn = L N1i then y1 = O 1o and ……and yn = O N1o Where, Si = input vector R i = i’th rule L ji = Fuzzy label Yi = Output vector

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Fuzzy Inference System (FIS) Layer 1: Input layer  Defines the input variables needed to describe the states completely. Layer 2: Linguistic Labels  This layer does the fuzzification process. Layer 3: Rules  This layer defines the if-else rules giving rule truth values. Layer 4: Output layer  Gives the FIS output.

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Assumptions Number of input variables and fuzzy labels are selected depending on problem Number of rules is determined by numbers of elements in first two layers. (Product of labels for each input variable) Each have a predefined number of outputs So only most difficult part left is the conclusion of all possible combinations (Rule conclusion)

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering What it does? Maps states to actions. Rules can be formulated in human language. Each rule contains:  Value V i to approx. optimal evaluation function.  Action set U i  Parameter vector w i giving the weight of different action in a rule to approximate policy. Final output is the weighted average of all the actions.

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering FIS Output (Primary reinforcement) (Internal reinforcement through critic)

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Procedure Estimate the evaluation function corresponding to current state. V t (S t+1 )= v t.Ф t+1 Compute the TD error є t+1. Tune the parameters v and w. Estimate the new evaluation function with new conclusion vector v t+1. Learning rate updating. Computing and triggering of global action U t+1

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Problem Single machine scheduling problem 3 parts Each part with individual earliness-tardiness penalties, due dates and processing times 19 time slots on machine Minimize the deviation from due dates reducing the penalties

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering Work in progress Currently working with the single machine scheduling problem with earliness/tardiness penalty and due dates. Identifying the various parameters. Understanding the mathematics behind the FIS. Incorporating bargaining model in FIS.

COMPUTER INTEGRATED MANUFACTURING LAB Department of Systems and Industrial Engineering References Fuzzy Inference System Learning by Reinforcement Methods – Lionel Jouffe (IEEE) Dynamic single machine scheduling under distributed decision making – Pooja Dewan, Sanjay Joshi (IJPR) Evolutionary Learning agents for shop floor control- Bruno Maione, David Naso (IEEE) A fuzzy logic based methodology to rank shop floor dispatching rules – Albert Petroni (IJPE) Multi Agent Reinforcement Learning with bidding for automatic segmentation of action sequence – Ron Sun (IEEE) AI depot - (for RL) RL – An Introduction (Suttons and Barto) Matlab fuzzy logic toolbox tutorials

Thank You