Forward-Chaining Partial-Order Planning Amanda Coles, Andrew Coles, Maria Fox and Derek Long (to appear, ICAPS 2010)

Slides:

Advertisements

Similar presentations

Introduction to Embedded Systems Resource Management - III Lecture 19.

Advertisements

Artificial Intelligence 2005/06 Partial Order Planning.

Minimum Clique Partition Problem with Constrained Weight for Interval Graphs Jianping Li Department of Mathematics Yunnan University Jointed by M.X. Chen.

Planning II: Partial Order Planning

10 October 2006 Foundations of Logic and Constraint Programming 1 Unification An overview Need for Unification Ranked alfabeths and terms. Substitutions.

Causal-link Planning II José Luis Ambite. 2 CS 541 Causal Link Planning II Planning as Search State SpacePlan Space AlgorithmProgression, Regression POP.

Constraint Based Reasoning over Mutex Relations in Graphplan Algorithm Pavel Surynek Charles University, Prague Czech Republic.

CLASSICAL PLANNING What is planning ?  Planning is an AI approach to control  It is deliberation about actions  Key ideas  We have a model of the.

This lecture topic (two lectures) Chapter 6.1 – 6.4, except 6.3.3

Traveling Salesperson Problem

Exploiting Symmetry in Planning Maria Fox Durham Planning Group University of Durham, UK.

Classical Planning via Plan-space search COMP3431 Malcolm Ryan.

Time Constraints in Planning Sudhan Kanitkar

Plan Generation & Causal-Link Planning 1 José Luis Ambite.

Hybrid Systems Presented by: Arnab De Anand S. An Intuitive Introduction to Hybrid Systems Discrete program with an analog environment. What does it mean?

Methods of Proof Chapter 7, second half.. Proof methods Proof methods divide into (roughly) two kinds: Application of inference rules: Legitimate (sound)

Constraint Optimization Presentation by Nathan Stender Chapter 13 of Constraint Processing by Rina Dechter 3/25/20131Constraint Optimization.

Planning with Resources at Multiple Levels of Abstraction Brad Clement, Tony Barrett, Gregg Rabideau Artificial Intelligence Group Jet Propulsion Laboratory.

Graph-based Planning Brian C. Williams Sept. 25 th & 30 th, J/6.834J.

Best-First Search: Agendas

ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.

1 Chapter 16 Planning Methods. 2 Chapter 16 Contents (1) l STRIPS l STRIPS Implementation l Partial Order Planning l The Principle of Least Commitment.

Symmetry as a Prelude to Implied Constraints Alan Frisch, Ian Miguel, Toby Walsh University of York.

Constraint Logic Programming Ryan Kinworthy. Overview Introduction Logic Programming LP as a constraint programming language Constraint Logic Programming.

Artificial Intelligence Constraint satisfaction Chapter 5, AIMA.

Ryan Kinworthy 2/26/20031 Chapter 7- Local Search part 1 Ryan Kinworthy CSCE Advanced Constraint Processing.

Constraint Satisfaction Problems

Jean-Charles REGIN Michel RUEHER ILOG Sophia Antipolis Université de Nice – Sophia Antipolis A global constraint combining.

CS121 Heuristic Search Planning CSPs Adversarial Search Probabilistic Reasoning Probabilistic Belief Learning.

Chapter 5 Outline Formal definition of CSP CSP Examples

Local Search Techniques for Temporal Planning in LPG Paper by Gerevini, Serina, Saetti, Spinoni Presented by Alex.

Distributed Constraint Optimization * some slides courtesy of P. Modi

PLANNING Partial order regression planning Temporal representation 1 Deductive planning in Logic Temporal representation 2.

Finite Capacity Scheduling 6.834J, J. Overview of Presentation What is Finite Capacity Scheduling? Types of Scheduling Problems Background and History.

CRITICAL PATH ANALYSIS aka NETWORK PATH ANALYSIS DESIGNED to help managers who are planning complex projects that involve many interrelated tasks The idea.

Homework 1 ( Written Portion )  Max : 75  Min : 38  Avg : 57.6  Median : 58 (77%)

Using Abstraction in Multi-Rover Scheduling Bradley J. Clement and Anthony C. Barrett Artificial Intelligence Group Jet Propulsion Laboratory {bclement,

22/11/04 AIPP Lecture 16: More Planning and Operators1 More Planning Artificial Intelligence Programming in Prolog.

Chapter 11 Heap. Overview ● The heap is a special type of binary tree. ● It may be used either as a priority queue or as a tool for sorting.

Practical Dynamic Programming in Ljungqvist – Sargent (2004) Presented by Edson Silveira Sobrinho for Dynamic Macro class University of Houston Economics.

CP Summer School Modelling for Constraint Programming Barbara Smith 2. Implied Constraints, Optimization, Dominance Rules.

Hande ÇAKIN IES 503 TERM PROJECT CONSTRAINT SATISFACTION PROBLEMS.

For Friday No reading Homework: –Chapter 11, exercise 4.

CAS 721 Course Project Implementing Branch and Bound, and Tabu search for combinatorial computing problem By Ho Fai Ko ( )

15.053Tuesday, April 9 Branch and Bound Handouts: Lecture Notes.

1 Chapter 16 Planning Methods. 2 Chapter 16 Contents (1) l STRIPS l STRIPS Implementation l Partial Order Planning l The Principle of Least Commitment.

Branch-and-Cut Valid inequality: an inequality satisfied by all feasible solutions Cut: a valid inequality that is not part of the current formulation.

June 6 th, 2005 ICAPS-2005 Workshop on Constraint Programming for Planning and Scheduling 1/12 Stratified Heuristic POCL Temporal Planning based on Planning.

AI Lecture 17 Planning Noémie Elhadad (substituting for Prof. McKeown)

Chapter 2) CSP solving-An overview Overview of CSP solving techniques: problem reduction, search and solution synthesis Analyses of the characteristics.

Problem Reduction So far we have considered search strategies for OR graph. In OR graph, several arcs indicate a variety of ways in which the original.

OR Chapter 8. General LP Problems Converting other forms to general LP problem : min c’x  - max (-c)’x   = by adding a nonnegative slack variable.

Planning I: Total Order Planners Sections

Temporal Planning with Continuous Change J.Scott Penbrethy Daniel S. Weld Presented by - Parag.

By J. Hoffmann and B. Nebel

Heuristic Search Planners. 2 USC INFORMATION SCIENCES INSTITUTE Planning as heuristic search Use standard search techniques, e.g. A*, best-first, hill-climbing.

Global Register Allocation Based on

CSPs: Search and Arc Consistency Computer Science cpsc322, Lecture 12

Inference and search for the propositional satisfiability problem

CSPs: Search and Arc Consistency Computer Science cpsc322, Lecture 12

Class #17 – Thursday, October 27

Study Guide for ES205 Yu-Chi Ho Jonathan T. Lee Nov. 7, 2000

Graph-based Planning Slides based on material from: Prof. Maria Fox

Graphplan/ SATPlan Chapter

Class #19 – Monday, November 3

Chapter 5: General search strategies: Look-ahead

Artificial Intelligence

Graphplan/ SATPlan Chapter

Russell and Norvig: Chapter 11 CS121 – Winter 2003

Graphplan/ SATPlan Chapter

Presentation transcript:

Forward-Chaining Partial-Order Planning Amanda Coles, Andrew Coles, Maria Fox and Derek Long (to appear, ICAPS 2010)

Summary Forward-chaining planning eliminates the threat resolution of POP, at the price of over- commitment. Issues arise in temporal planning, due to needless ordering constraints leading to backtracking. Can modify a forward-chaining approach to construct a partial-order, avoiding this. Further, can modify a TRPG heuristic to encourage search to find lower makespan plans. Implemented and evaluated in the planner POPF

Overview (Temporal) Forward-Chaining Planning Issues with using a Total Order Reducing Commitment Heuristic Guidance for Lower Makespan Plans EvaluationConclusions

Forward Chaining Temporal Planning A state S is a tuple of: Propositional Facts Propositional Facts Values of task variables Values of task variables A Queue of actions that have not yet finished A Queue of actions that have not yet finished The Plan to reach S The Plan to reach S The Constraints on the steps in P The Constraints on the steps in P The plan consists of the starts and ends of actions: A and A denote the start/end of A, resp. A and A denote the start/end of A, resp.

light_match match1 light m1 ¬light m1 mend_fuse fuse1 match1 0: light_match_start match1 1: mend_fuse_start fuse1 match1 2: mend_fuse_end fuse1 match1 3: light_match_end match1 lms mfs1mfe1 lme Epsilon separation (0.01) Simple Example

Overview (Temporal) Forward-Chaining Planning Issues with using a Total Order Reducing Commitment Heuristic Guidance for Lower Makespan Plans EvaluationConclusions

Issues with Using a Total Order To resolve threats, F.C. planning uses a total order. When applying an action A: A cannot violate preconditions of earlier actions, as it comes after them (demotion); A cannot violate preconditions of earlier actions, as it comes after them (demotion); Subsequent actions cannot delete its preconditions, as A comes sooner (promotion) Subsequent actions cannot delete its preconditions, as A comes sooner (promotion) The drawback is that needless ordering constraints are added: If A does not interfere with the preceding step, it still must come after it. If A does not interfere with the preceding step, it still must come after it. Motivates partial-order lifting, but this first needs a solution to be found.

Total Orders of Start/End Actions Two actions, A and B: B is longer than A; B is longer than A; No interaction between A and B ; No interaction between A and B ; But, B must precede A But, B must precede A The planner chooses a (partial) plan: A B B

A B B A A A was added to the plan before B, theyBecause A was added to the plan before B, they are ordered as shown (in a total-order). are ordered as shown (in a total-order). But, Awill not be applicable until after BBut, Awill not be applicable until after B The planner will have to backtrack, over all the intermediateThe planner will have to backtrack, over all the intermediate decisions, and add B to the plan earlier than A decisions, and add B to the plan earlier than A.

Overview (Temporal) Forward-Chaining Planning Issues with using a Total Order Reducing Commitment Heuristic Guidance for Lower Makespan Plans EvaluationConclusions

Reducing Commitment Record additional information at each state concerning which steps achieve / delete / depend on each fact. Use this information to commit to fewer ordering constraints Still resolve threats based on the intuition of forward-chaining expansion: new actions cannot threaten the preconditions of earlier actions.

Extending the State: Propositional To capture ordering information we add: F +, F -, where F + (p) (F - (p)) is the index of the of the step that most recently added (deleted) p FP, where FP(p) is a set of pairs : denotes that step i has an instantaneous condition on p ( at start or at end ) denotes that step i has an instantaneous condition on p ( at start or at end ) denotes that step i marks the end of an action with an over all condition on p denotes that step i marks the end of an action with an over all condition on p

Starting an Action A at Step i For each at start condition p: t(F + (p)) + ε t(i) For each at start del. effect p, assign F - (p) = i, t(F + (p)) + ε t(i), and in FP(P), t(j) + d t(i) For each at start add effect p, assign F + (p) = i, and if F - (p) i, t(F - (p)) + ε t(i) For each over all condition p: If F + (p) i, t(F + (p)) t(i) (To apply the end of an action: similar process, but without over all conditions) A

A B B A A : (action B) [5.00] 3.01: (action A) [2.00]

Extending the State: Numeric For numbers we are a little more strict: V eff, where V eff (v) is the step of the action to most recently have an effect on v VP, where VP(v) contains steps that depend on the value of v, each step i such that: i has a precondition on v, or is the start of an action whose duration constraint contains v; or, i has a precondition on v, or is the start of an action whose duration constraint contains v; or, i has an effect that depends on v i has an effect that depends on v VI, where VI(v) is a set of pairs (s,e), marking the start/end indices of actions in the event queue (Q) with an over all condition depending on v (Also, V cts to handle linear continuous numeric change – see paper for details.)

Starting an Action A at Step i: For each variable v relevant to at start conditions, effects, or the actions duration: t(V eff (v)) + ε t(i) For each v on which A has an at start eff, apply the effect to V, and: (s,e) in VI(v), t(s) + ε t(i) and t(i) + ε t(e) For each variable v relevant to an over all, add (i,j) to VI(v), and if was not relevant to the start of A: t(V eff (v)) + ε t(i) A

Overview (Temporal) Forward-Chaining Planning Issues with using a Total Order Reducing Commitment Heuristic Guidance for Lower Makespan Plans EvaluationConclusions

Heuristic Guidance Have seen how the search space can be modified to reduce excessive ordering constraints; There is still no pressure to prefer choices that lead to a partial-order with a lower makespan Could use partial-order lifting a posteriori for similar quality results? Could use partial-order lifting a posteriori for similar quality results? Given we know the makespan implications of action choices, how can we factor this into the decision making during search?

Revisiting the Temporal RPG The Temporal RPG consists of time-stamped fact and action layers. To evaluate a state S: Fact layer f=0.0 contains the facts in S; Fact layer f=0.0 contains the facts in S; Action layer a=0.00 contains actions whose preconditions are satisfied in f=0.0; Action layer a=0.00 contains actions whose preconditions are satisfied in f=0.0; Effects of actions appear in the next layer; the end of an action A is delayed until dur(A) after A start first appears. Effects of actions appear in the next layer; the end of an action A is delayed until dur(A) after A start first appears. What about the extra information we now have in S?

Bounding Preconditions and Effects on Facts When adding actions to the partial order, for a proposition p: Any action requiring p to satisfy a precondition will need to come after t(F + (p)) and t(F - (p)) Any action requiring p to satisfy a precondition will need to come after t(F + (p)) and t(F - (p)) Any action with an add (delete) effect on p will need to come after t(F - (p)) ( t(F + (p)) resp.) Any action with an add (delete) effect on p will need to come after t(F - (p)) ( t(F + (p)) resp.) From checking temporal constraints, we have a lower-bound on each step, t min (i) Thus, the earliest point we can use p is: l(p) = max { t min (F + (p)), t min (F - (p)) + ε }

Bounding (continued) Similarly, for each numeric precondition/effect referring to a variable set vars, it cannot be used until: L(vars) = max v in vars t min (v eff (v)) With these bounds, for any state S, we can build a TRPG starting at time zero: Delay fact p until layer L(p) Delay fact p until layer L(p) Delay numeric preconditions/effects until L(vars) for their respective variable sets Delay numeric preconditions/effects until L(vars) for their respective variable sets Then, actions which do not interfere with existing choices will appear sooner in the TRPG.

Overview (Temporal) Forward-Chaining Planning Issues with using a Total Order Reducing Commitment Heuristic Guidance for Lower Makespan Plans EvaluationConclusions

Evaluation Planner POPF, based on the code for COLIN (IJCAI09) First test: Control: run COLIN, then apply partial-order lifting to the solution Control: run COLIN, then apply partial-order lifting to the solution POPF, but using the original heuristic from COLIN. POPF, but using the original heuristic from COLIN. Second test, also considering domains with deadlines: COLIN then partial-order lifter COLIN then partial-order lifter POPF, new heuristic. POPF, new heuristic.

Test 1: Time Taken

Test 1: Makespan

Test 2: Time Taken

Test 2: Makespan

Test 2: Time Taken (Deadlines)

Conclusions Have shown how a partial-order can be expanded in a forwards direction; Adapting the heuristic allows one to trade time performance for a reduction in makespan; In domains with deadlines, performance is: substantially improved (fivefold improvement in coverage in the Satellite variants). In domains with deadlines, performance is: substantially improved (fivefold improvement in coverage in the Satellite variants). In the paper: approach also works with domains containing linear-continuous change