Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lecture 2 Multi-Agent Systems Lecture 2 University “Politehnica” of Bucarest 2004 - 2005 Adina Magda Florea

Similar presentations


Presentation on theme: "Lecture 2 Multi-Agent Systems Lecture 2 University “Politehnica” of Bucarest 2004 - 2005 Adina Magda Florea"— Presentation transcript:

1 Lecture 2 Multi-Agent Systems Lecture 2 University “Politehnica” of Bucarest 2004 - 2005 Adina Magda Florea adina@cs.pub.ro http://turing.cs.pub.ro/blia_2005

2 Models of agency and architectures Lecture outline n Conceptual structures of agents n Cognitive agent architectures n Reactive agent architectures n Layered architectures

3 1. Conceptual structures of agents 1.1 Agent rationality n An agent is said to be rational if it acts so as to obtain the best results when achieving the tasks assigned to it. n How can we measure the agent’s rationality? n A measure of performance, an objective measure if possible, associated to the tasks the agent has to execute. 3

4 n An agent is situated in an environment n An agent perceives its environment through sensors and acts upon the environment through effectors. n Aim: design an agent program = a function that implements the agent mapping from percepts to actions. We assume that this program will run on some computing device which we will call the architecture. Agent = architecture + program n The environment –accessible vs. inaccessible –deterministic vs. non-deterministic –static vs. dynamic –discrete vs. continue 4

5 1.2 Agent modeling E = {e 1,.., e,..} P = {p 1,.., p,..} A = {a 1,.., a,..} Reflex agent see : E  P action : P  A env : E x A  E (env : E x A  P (E)) Environment env Decision component action Execution component action Perception component see Agent 5

6 Agent modeling Several reflex agents see : E  P env : E x A 1 x … A n  P(E) inter : P  I action : P x I  A Environment env Decision component action Execution component action Perception component see Agent (A1) Agent (A2) Agent (A3) Interaction component inter 6 I = {i1,…,i,..}

7 Cognitive agents Agents with states S = {s1,…,s,…} n action : S x I  A i n next : S x P  S n inter : S x P  I n see : E  P n env : E x A 1 x … A n  P(E) Agent modeling 7

8 Agents with states and goals goal : E  {0, 1} Agents with utility utility : E  R Environment non-deterministic env : E x A  P (E) The probability estimated by the agent that the result of an action (a) execution in state e will be the new state e’ Agent modeling 8

9 Agents with utility The expected utility of an action in a state e, from the agent’s point of view The principle of Maximum Expected Utility (MEU) = a rational agent must choose the action that will bring the maximum expected utility Agent modeling 9

10 How to model? n Getting out of a maze –Reflex agent –Cognitive agent –Cognitive agent with utility 3 main problems: n what action to choose if several available n what to do if the outcomes of an action are not known n how to cope with changes in the environment 10

11 2. Cognitive agent architectures 2.1 Rational behaviour AI and Decision theory n AI = models of searching the space of possible actions to compute some sequence of actions that will achieve a particular goal n Decision theory = competing alternatives are taken as given, and the problem is to weight these alternatives and decide on one of them (means-end analysis is implicit in the specification of competing alternatives) n Problem 1 = deliberation/decision vs. action/proactivity n Problem 2 = the agents are resource bounded 11

12 General cognitive agent architecture 12 Information about itself - what it knows - what it believes - what is able to do - how it is able to do - what it wants environment and other agents - knowledge - beliefs Communication Interactions Control Output Input Other agents Environment Scheduler& Executor State Planner Reasoner

13 2.2 FOPL models of agency n Symbolic representation of knowledge + use inferences in FOPL - deduction or theorem proving to determine what actions to execute n Declarative problem solving approach - agent behavior represented as a theory T which can be viewed as an executable specification (a) Deduction rules At(0,0)  Free(0,1)  Exit(east)  Do(move_east) Facts and rules about the environment At(0,0) Wall(1,1)  x  y Wall(x,y)   Free(x,y) Automatically update current state and test for the goal stateAt(0,3) 13

14 FOPL models of agency (b) Use situation calculus = describe change in FOPL n Situation = the state resulting after executing an action Logical terms consisting of the initial state S0 and all situations that are generated by applying an action to a situation Result(Action,State) = NewState n Fluents = functions or predicates that vary from one situation to the next At(location, situation) 14

15 FOPL models of agency At((0,0), S 0 )  Free(0,1)  Exit(east)  At((0,1), Result(move_east,S 0 )) Try to prove the goal At((0,3), _) and determine the actions that lead to it - means-end analysis KB -| {Goal} and keep track o associated actions 15

16 Advantages of FOPL - simple, elegant - executable specifications Disadvantages - difficult to represent changes over time other logics - decision making is deduction and selection of a strategy - intractable - semi-decidable 16

17 2.3 BDI architectures n High-level specifications of a practical component of an architecture for a resource-bounded agent. n It performs means-end analysis, weighting of competing alternatives and interactions between these two forms of reasoning n Beliefs = information the agent has about the world n Desires = state of affairs that the agent would wish to bring about n Intentions = desires (or actions) that the agent has committed to achieve n BDI - a theory of practical reasoning - Bratman, 1988 n intentions play a critical role in practical reasoning - limits options, DM simpler 17

18 18 BDI particularly compelling because: n philosophical component - based on a theory of rational actions in humans n software architecture - it has been implemented and successfully used in a number of complex fielded applications –IRMA - Intelligent Resource-bounded Machine Architecture –PRS - Procedural Reasoning System n logical component - the model has been rigorously formalized in a family of BDI logics –Rao & Georgeff, Wooldrige –(Int A i  )   (Bel A i  )

19 19 BDI Architecture Belief revision Beliefs Knowledge Deliberation process percepts Desires Opportunity analyzer Intentions Filter Means-end reasonner Intentions structured in partial plans Plans Library of plans Executor B = brf(B, p) D = options(B, D, I) I = filter(B, D, I)  = plan(B, I) actions

20 Roles and properties of intentions n Intentions drive means-end analysis n Intentions constraint future deliberation n Intentions persist n Intentions influence beliefs upon which future practical reasoning is based Agent control loop B = B 0 I = I 0 D = D 0 while true do get next perceipt p B = brf(B,p) D = options(B, D, I) I = filter(B, D, I)  = plan(B, I) execute(  ) end while 20

21 Commitment strategies o If an option has successfully passed trough the filter function and is chosen by the agent as an intention, we say that the agent has made a commitment to that option o Commitments implies temporal persistence of intentions; once an intention is adopted, it should not be immediately dropped out. Question: How committed an agent should be to its intentions? o Blind commitment o Single minded commitment o Open minded commitment Note that the agent is committed to both ends and means. 21

22 B = B 0 I = I 0 D = D 0 while true do get next perceipt p B = brf(B,p) D = options(B, D, I) I = filter(B, D, I)  = plan(B, I) while not (empty(  ) or succeeded (I, B) or impossible(I, B)) do  = head(  ) execute(  )  = tail(  ) get next perceipt p B = brf(B,p) if not sound( , I, B) then  = plan(B, I) end while 22 Reactivity, replan Dropping intentions that are impossible or have succeeded Revised BDI agent control loop single-minded commitment

23 B = B 0 I = I 0 D = D 0 while true do get next perceipt p B = brf(B,p) D = options(B, D, I) I = filter(B, D, I)  = plan(B, I) while not (empty(  ) or succeeded (I, B) or impossible(I, B)) do  = head(  ) execute(  )  = tail(  ) get next perceipt p B = brf(B,p) D = options(B, D, I) I = filter(B, D, I)  = plan(B, I) end while 23 Replan if reconsider(I, B) then Revised BDI agent control loop open-minded commitment

24 3. Reactive agent architectures Subsumption architecture - Brooks, 1986 n (1) Decision making = {Task Accomplishing Behaviours} –Each behaviour = a function to perform an action –Brooks defines TAB as finite state machines –Many implementations: situation  action n (2) Many behaviours can fire simultaneously 24

25 Subsumption architecture n A TAB is represented by a competence module (c.m.) n Every c.m. is responsible for a clearly defined, but not particular complex task - concrete behavior n The c.m. are operating in parallel n Lower layers in the hierarchy have higher priority and are able to inhibit operations of higher layers n c.m. at the lower end of the hierarchy - basic, primitive tasks; n c.m. at higher levels - more complex patterns of behaviour and incorporate a subset of the tasks of the subordinate modules  subsumtion architecture 25

26 Module 1 can monitor and influence the inputs and outputs of Module 2 M1 = move around while avoiding obstacles  M0 M2 = explores the environment looking for distant objects of interests while moving around  M1  Incorporating the functionality of a subordinated c.m. by a higher module is performed using suppressors (modify input signals) and inhibitors (inhibit output) 26 Competence Module (1) Move around Competence Module (0) Avoid obstacles Inhibitor nodeSupressor node Sensors Competence Module (2) Explore environ Competence Module (0) Avoid obstacles Effectors Competence Module (1) Move around Input (percepts) Output (actions)

27  More modules can be added: Replenishing energy Optimising paths Making a map of territory Pick up and put down objects Behavior (c, a) – pair of condition-action describing behavior R = { (c, a) | c  P, a  A}- set of behavior rules   R x R - binary inhibition relation on the set of behaviors, total ordering of R function action( p: P) var fired: P(R), selected: A begin fired = {(c, a) | (c, a)  R and p  c} for each (c, a)  fired do if   (c', a')  fired such that (c', a')  (c, a) then return a return null end 27

28 n Every c.m. is described using a subsumption language based on AFSM - Augmented Finite State Machines n An AFSM initiates a response as soon as its input signal exceeds a specific threshold value. n Every AFSM operates independently and asynchronously of other AFSMs and is in continuos competition with the other c.m. for the control of the agent - real distributed internal control n 1990 - Brooks extends the architecture to cope with a large number of c.m. - Behavior Language Other implementations of reactive architectures n Steels - indirect communication - takes into account the social feature of agents n Advantages of reactive architectures n Disadvantages 28

29 4. Layered agent architectures n Combine reactive and pro-active behavior n At least two layers, for each type of behavior n Horizontal layering - i/o flows horizontally n Vertical layering - i/o flows vertically 29 Layer n … Layer 2 Layer 1 perceptual input Action output Layer n … Layer 2 Layer 1 Layer n … Layer 2 Layer 1 Action output Action output perceptual input perceptual input Horizontal Vertical

30 TouringMachine n Horizontal layering - 3 activity producing layers, each layer produces suggestions for actions to be performed n reactive layer - set of situation-action rules, react to precepts from the environment n planning layer - pro-active behavior - uses a library of plan skeletons called schemas - hierarchical structured plans refined in this layer n modeling layer - represents the world, the agent and other agents - set up goals, predicts conflicts - goals are given to the planning layer to be achieved n Control subsystem - centralized component, contains a set of control rules - the rules: suppress info from a lower layer to give control to a higher one - censor actions of layers, so as to control which layer will do the actions 30

31 InteRRaP n Vertically layered two pass agent architecture n Based on a BDI concept but concentrates on the dynamic control process of the agent Design principles n the three layered architecture describes the agent using various degrees of abstraction and complexity n both the control process and the KBs are multi-layered n the control process is bottom-up, that is a layer receives control over a process only when this exceeds the capabilities of the layer beyond n every layer uses the operations primitives of the lower layer to achieve its goals Every control layer consists of two modules: - situation recognition / goal activation module (SG) - planning / scheduling module (PS) 31

32 32 Cooperative planning layer SG PSSGPS Local planning layer Behavior based layer World interface Sensors Effectors Communication Social KB Planning KB World KB actions percepts InteRRaPInteRRaP

33 33 Beliefs Social model Mental model World model Situation Cooperative situation Local planning situation Routine/emergency sit. Goals Cooperative goals Local goals Reactions Options Cooperative option Local option Reaction Operational primitive Joint plans Local plans Behavior patterns Intentions Cooperative intents Local intentions Response Sensors Effectors BDI model in InteRRaP filter plan options PS SG

34  Muller tested InteRRaP in a simulated loading area.  A number of agents act as automatic fork-lifts that move in the loading area, remove and replace stock from various storage bays, and so compete with other agents for resources 34

35  BDI Architectures  First implementation of a BDI architecture: IRMA  [Bratman, Israel, Pollack, 1988] M.E. BRATMAN, D.J. ISRAEL et M. E. POLLACK. Plans and resource-bounded practical reasoning, Computational Intelligence, Vol. 4, No. 4, 1988, p.349-355.  PRS  [Georgeff, Ingrand, 1989] M. P. GEORGEFF et F. F. INGRAND. Decision-making in an embedded reasoning system, dans Proceedings of the Eleventh International Joint Conference on Artificial Intelligence (IJCAI-89), 1989, p.972-978.  Successor of PRS: dMARS  [D'Inverno, 1997] M. D'INVERNO et al. A formal specification of dMARS, dans Intelligent Agents IV, A. Rao, M.P. Singh et M. Wooldrige (eds), LNAI Volume 1365, Springer-Verlag, 1997, p.155-176.   Subsumption architecture  [Brooks, 1991] R. A. BROOKS. Intelligence without reasoning, dans Actes de 12th International Joint Conference on Artificial Intelligence (IJCAI-91), 1991, p.569-595.  35

36  TuringMachine  [Ferguson, 1992] I. A. FERGUSON. TuringMachines: An Architecture for Dynamic, Rational, Mobile Agents, Thèse de doctorat, University of Cambridge, UK, 1992.  InteRRaP  [Muller, 1997] J. MULLER. A cooperation model for autonomous agents, dans Intelligent Agents III, LNAI Volume 1193, J.P. Muller, M. Wooldrige et N.R. Jennings (eds), Springer-Verlag, 1997, p.245-260. BDI Implementations The Agent Oriented Software Group n Third generation BDI agent system using a component based approached. Implemented in Java n http://www.agent-software.com.au/shared/home/ http://www.agent-software.com.au/shared/home/ JASON n http://jason.sourceforge.net/ http://jason.sourceforge.net/ 36


Download ppt "Lecture 2 Multi-Agent Systems Lecture 2 University “Politehnica” of Bucarest 2004 - 2005 Adina Magda Florea"

Similar presentations


Ads by Google