RePOP: Reviving Partial Order Planning XuanLong Nguyen & Subbarao Kambhampati Yochan Group:

Slides:



Advertisements
Similar presentations
Forward-Chaining Partial-Order Planning Amanda Coles, Andrew Coles, Maria Fox and Derek Long (to appear, ICAPS 2010)
Advertisements

Informed search algorithms
Constraint Satisfaction Problems Russell and Norvig: Chapter
Causal-link Planning II José Luis Ambite. 2 CS 541 Causal Link Planning II Planning as Search State SpacePlan Space AlgorithmProgression, Regression POP.
CLASSICAL PLANNING What is planning ?  Planning is an AI approach to control  It is deliberation about actions  Key ideas  We have a model of the.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
1 Graphplan José Luis Ambite * [* based in part on slides by Jim Blythe and Dan Weld]
Solving Problem by Searching
Effective Approaches for Partial Satisfaction (Over-subscription) Planning Romeo Sanchez * Menkes van den Briel ** Subbarao Kambhampati * * Department.
Graph-based Planning Brian C. Williams Sept. 25 th & 30 th, J/6.834J.
Planning Graphs * Based on slides by Alan Fern, Berthe Choueiry and Sungwook Yoon.
Fluent Merging: A General Technique to Improve Reachability Heuristics and Factored Planning Menkes van den Briel Department of Industrial Engineering.
Utilizing Problem Structure in Local Search: The Planning Benchmarks as a Case Study Jőrg Hoffmann Alberts-Ludwigs-University Freiburg.
Classical Planning via State-space search COMP3431 Malcolm Ryan.
Planning Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 11.
Ryan Kinworthy 2/26/20031 Chapter 7- Local Search part 1 Ryan Kinworthy CSCE Advanced Constraint Processing.
CSE 5731 Lecture 21 State-Space Search vs. Constraint- Based Planning CSE 573 Artificial Intelligence I Henry Kautz Fall 2001.
A: A Unified Brand-name-Free Introduction to Planning Subbarao Kambhampati Jan 28 th My lab was hacked and the systems are being rebuilt.. Homepage is.
18 th Feb Using reachability heuristics for PO planning Planning using Planning Graphs.
Constraint Satisfaction Problems
CSE 574: Planning & Learning Subbarao Kambhampati 1/17: State Space and Plan-space Planning Office hours: 4:30—5:30pm T/Th.
A: A Unified Brand-name-Free Introduction to Planning Subbarao Kambhampati Jan 28 th My lab was hacked and the systems are being rebuilt.. Homepage is.
Refinement Planing CSE 574 April 15, 2003 Dan Weld.
1 BLACKBOX: A New Paradigm for Planning Bart Selman Cornell University.
Planning Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 11.
Chapter 5 Outline Formal definition of CSP CSP Examples
1 Planning Chapters 11 and 12 Thanks: Professor Dan Weld, University of Washington.
Classical Planning via State-space search COMP3431 Malcolm Ryan.
RePOP: Reviving Partial Order Planning
GraphPlan Alan Fern * * Based in part on slides by Daniel Weld and José Luis Ambite.
For Wednesday Read chapter 12, sections 3-5 Program 2 progress due.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Jonathon Doran. The Planning Domain A domain describes the objects, facts, and actions in the universe. We may have a box and a table in our universe.
Homework 1 ( Written Portion )  Max : 75  Min : 38  Avg : 57.6  Median : 58 (77%)
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
For Monday Read chapter 12, sections 1-2 Homework: –Chapter 10, exercise 3.
Heuristics in Search-Space CSE 574 April 11, 2003 Dan Weld.
Planning as Propositional Satisfiabililty Brian C. Williams Oct. 30 th, J/6.834J GSAT, Graphplan and WalkSAT Based on slides from Bart Selman.
Review: Tree search Initialize the frontier using the starting state While the frontier is not empty – Choose a frontier node to expand according to search.
For Friday No reading Homework: –Chapter 11, exercise 4.
Automated Planning and Decision Making Prof. Ronen Brafman Automated Planning and Decision Making Partial Order Planning Based on slides by: Carmel.
Advanced Algorithm Design and Analysis (Lecture 14) SW5 fall 2004 Simonas Šaltenis E1-215b
June 6 th, 2005 ICAPS-2005 Workshop on Constraint Programming for Planning and Scheduling 1/12 Stratified Heuristic POCL Temporal Planning based on Planning.
Chapter 5 Constraint Satisfaction Problems
AI Lecture 17 Planning Noémie Elhadad (substituting for Prof. McKeown)
Best-first search Idea: use an evaluation function f(n) for each node –estimate of "desirability"  Expand most desirable unexpanded node Implementation:
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
© Daniel S. Weld 1 Logistics Travel Wed class led by Mausam Week’s reading R&N ch17 Project meetings.
Automated Planning and Decision Making Prof. Ronen Brafman Automated Planning and Decision Making Graphplan Based on slides by: Ambite, Blyth and.
Graphplan CSE 574 April 4, 2003 Dan Weld. Schedule BASICS Intro Graphplan SATplan State-space Refinement SPEEDUP EBL & DDB Heuristic Gen TEMPORAL Partial-O.
Temporal Planning with Continuous Change J.Scott Penbrethy Daniel S. Weld Presented by - Parag.
AAAI of 20 Deconstructing Planning as Satisfiability Henry Kautz University of Rochester in collaboration with Bart Selman and Jöerg Hoffmann.
IBM Labs in Haifa © 2005 IBM Corporation Assumption-based Pruning in Conditional CSP Felix Geller and Michael Veksler.
Heuristic Search Planners. 2 USC INFORMATION SCIENCES INSTITUTE Planning as heuristic search Use standard search techniques, e.g. A*, best-first, hill-climbing.
EBL & DDB for Graphplan (P lanning Graph as Dynamic CSP: Exploiting EBL&DDB and other CSP Techniques in Graphplan) Subbarao Kambhampati Arizona State University.
1 Chapter 6 Planning-Graph Techniques. 2 Motivation A big source of inefficiency in search algorithms is the branching factor  the number of children.
Review: Tree search Initialize the frontier using the starting state
CSPs: Search and Arc Consistency Computer Science cpsc322, Lecture 12
Planning as Search State Space Plan Space Algorihtm Progression
Classical Planning via State-space search
CSPs: Search and Arc Consistency Computer Science cpsc322, Lecture 12
RePOP: Reviving Partial Order Planning
CSPs: Search and Arc Consistency Computer Science cpsc322, Lecture 12
Class #17 – Thursday, October 27
Graph-based Planning Slides based on material from: Prof. Maria Fox
Graphplan/ SATPlan Chapter
Class #19 – Monday, November 3
Graphplan/ SATPlan Chapter
Graphplan/ SATPlan Chapter
[* based in part on slides by Jim Blythe and Dan Weld]
Presentation transcript:

RePOP: Reviving Partial Order Planning XuanLong Nguyen & Subbarao Kambhampati Yochan Group:

In the beginning it was all POP.Then it was cruelly UnPOPped The good times return with Re(vived)POP

A recent (turbulent) history of planning 1970s Advent of CSP style compilation approach: Graphplan [Blum & Furst] SATPLAN [Kautz & Selman] Use of reachability analysis and Disjunctive constraints Domination of heuristic state search approach: HSP/R [Bonet & Geffner] UNPOP [McDermott]: POP is dead! Importance of good Domain-independent heuristics UCPOP [Penberthy &Weld] IxTeT [Ghallab et al] The whole world believed in POP and was happy to stack 6 blocks! Hoffman’s FF – a state search planer swept through AIPS-00 competition! NASA’s highly publicized RAX still a POP dinosaur POP believed to be good framework to handle temporal and resource planning [Smith et al, 2000] UCPOPUNPOP RePOP

To show that POP can be made very efficient by exploiting the same ideas that scaled up state search and Graphplan planners –Effective heuristic search control –Use of reachability analysis –Handling of disjunctive constraints RePOP, implemented on top of UCPOP –Dramatically better than all known partial-order planners –Outperforms Graphplan and competitive with state search planners in many (parallel) domains Outline RePOP: A revival for partial order planning

Partial plan representation P = (A,O,L,OC,UL) A: set of action steps in the plan S 0,S 1,S 2 …,S inf O: set of action ordering S i < S j,… L : set of causal links OC: set of open conditions (subgoals remain to be satisfied) UL: set of unsafe links where p is deleted by some action S k p SiSi SjSj p SiSi SjSj S0S0 S1S1 S2S2 S3S3 S inf p ~p g1g1 g2g2 g2g2 oc 1 oc 2 G={g 1,g 2 }I={q 1,q 2 } q1q1 Flaw: Open condition OR unsafe link Solution plan: A partial plan with no remaining flaw Every open condition must be satisfied by some action No unsafe links should exist (i.e. the plan is consistent) POP background

Algorithm 1. Let P be an initial plan 2. Flaw Selection: Choose a flaw f ( either open condition or unsafe link ) 3. Flaw resolution: If f is an open condition, choose an action S that achieves f If f is an unsafe link, choose promotion or demotion Update P Return NULL if no resolution exist 4. If there is no flaw left, return P else go to 2. S0S0 S1S1 S2S2 S3S3 S in f p ~p g1g1 g2g2 g2g2 oc 1 oc 2 q1q1 Choice points Flaw selection (open condition? unsafe link?) Flaw resolution (how to select (rank) partial plan?) Action selection (backtrack point) Unsafe link selection (backtrack point) S0S0 S inf g1g2g1g2 1. Initial plan: 2. Plan refinement (flaw selection and resolution): POP background

Our approach (main ideas) 1. Ranking partial plans: use an effective distance-based heuristic estimator. 2. Exploit reachability analysis: use invariants to discover implicit conflicts in the plan. 3. Unsafe links are resolved by posting disjunctive ordering constraints into the partial plan: avoid unnecessary and exponential multiplication of failures due to promotion/demotion splitting State-space idea of distance heuristic CSP ideas of consistency enforcement

1. Ranking partial plans using distance-based heuristic 1. Ranking Function: f(P) = g(P) + w h(P) g(P): number of actions in P h(P): estimate of number of new actions needed to refine P to become a solution plan w: increase the greediness of the heuristic search 2. Estimating h(P) h(P) is estimated by relaxing some constraints present in the partial plan P Negative effects of actions are relaxed P has no unsafe link flaws h(P) becomes the number of actions (cost(S) ) needed to achieve the set of open condition S from the initial state S0S0 S1S1 S2S2 S3S3 p ~p g1g1 g2g2 g2g2 oc 1 oc 2 q1q1 S inf S4S4 P h(P) = 2 S5S5

Distance-based heuristic estimate Estimate cost(S) 1. Build a planning graph PG from the initial state. 2. Cost(S) := 0 if all subgoals in S are in level Let p be a subgoal in S that appears last in PG. 4. Pick an action a in the graph that first achieves p 5. Update cost(S) := 1 + cost(S+Prec(a) – Eff(a)) 6. Replace S = S+Prec(a) – Eff(a), goto 2 p a 0123 SS+Prec(a)-Eff(a) a + Any state-space heuristic can be adapted + Relaxing negative effects makes the estimate inaccurate in serial domains.

2. Handling unsafe link flaws SiSi SkSk SjSj p ~p q Prec(a) 1. For each unsafe link threatened by another step S k : Add disjunctive constraint to O S k < S i V S i < S j 2. Whenever a new ordering constraint is introduced to O, perform the constraint propagations: S 1 < S 2 V S 3 < S 4 ^ S 4 < S 3  S 1 < S 2 S 1 < S 2 ^ S 2 < S 3  S 1 < S 3 S 1 < S 2 ^ S 2 < S 1  False p SiSi SjSj Avoid the unnecessary exponential multiplication of failing partial plans

3. Detecting indirect conflicts using reachability analysis SkSk Prec(S k ) 1.Reachability analysis to detect invariant: on(a,b) and clear(b) How to get state information in a partial plan 3. Cutset: Set of literals that must be true at some point during execution of plan For each action a, pre-C(S k ) = Prec(S k ) U {p | is a link and S i < S k < S j } post-C(S k ) = Eff(S k ) U {p | is a link and S i < S k < S j } 4. If exists a cutset that violates of a variant, the partial plan is invalid and should be pruned p SiSi SjSj p SiSi SjSj SmSm SnSn q SiSi SjSj p Eff(S k ) Disadvantage: Inconsistency checking is passive and maybe expensive Prec(S k ) + p + qEff(S k ) + p + q

Detecting indirect conflicts using reachability analysis SkSk Prec(S k ) 1.Generalizing unsafe link: S k threatens iff p is mutually exclusive (mutex) with either Prec(S k ) or Eff(S k ) 2.Unsafe link is resolved by posting disjunctive constraints (as before) S k < S i V S i < S j SmSm SnSn q SiSi SjSj p Eff(S k ) Detects indirect conflicts early Derives more disjunctive constraints to be propagated p SiSi SjSj

Experiments on RePOP RePOP is implemented on top of UCPOP planner using the three presented ideas –Written in Lisp, runs on Linux, 500MHz, 250MB Compare RePOP against UCPOP, Graphplan and AltAlt in a number of benchmark domains –Time –Solution quality

Comparing planning time (summary) 1.RePOP is very good in parallel domains (gripper, logistics, rocket, parallel blocks world) Outperforms Graphplan in many domains Competitive with AltAlt Completely dominates UCPOP 2.RePOP still inefficient in serial domains: Travel, Grid, 8-puzzle Repop vs. UCPOP Graphplan AltAlt

Comparing planning time (time in seconds) Repop vs. UCPOP Graphplan AltAlt ProblemUCPOPRePOPGraphplanAltAlt Gripper Gripper min1.15 Gripper Rocket-a Rocket-b Logistics-a Logistics-b Logistics-c Logistics-d Bw-large-a45.78(5.23) Bw-large-b-(18.86) Bw-large-c-(137.84)

Some solution quality metrics 1. Number of actions 2. Makespan: minimum completion time (number of time steps) 3. Flexibility: Average number of actions that do not have ordering constraints with other actions Repop vs. UCPOP Graphplan AltAlt Num_act=4 Makespan=2 Flex = 1 4 Num_act=4 Makespan=2 Flex = Num_act=4 Makespan=4 Flex = 0

Comparing solution quality (summary) RePOP generates partially ordered plans Number of actions: RePOP typically returns shortest plans Number of time steps (makespan): Graphplan produces optimal number of time steps RePOP comes close Flexibility: RePOP typically returns the most flexible plans

Comparing solution quality Number of actions/ time steps Flexibility degree ProblemRePOPGraphplanAltAltRePOPGraphplanAltAlt Gripper-821/ 1523/ 1521/ Gripper-1027/ 1929/ 1927/ Gripper-2059/ 39-59/ Rocket-a35/ 1640/ 736/ Rocket-b34/1530/ 734/ Logistics-a52/ 1380/ 1164/ Logistics-b42/ 1379/ 1353/ Logistics-c50/ 15-70/ Logistics-d69/ 33-85/ Bw-large-a(8/5) -11/ 49/ Bw-large-b(11/8) -18/ 511/ Bw-large-c(17/ 10) - -19/

Ablation studies ProblemUCPOP+ CE+ HP+CE+HP (RePOP) Gripper-8*6557/ 3881*1299/ 698 Gripper-10*11407/ 6642*2215/ 1175 Gripper-12*17628/ 10147*3380/ 1776 Gripper-20***11097/ 5675 Rocket-a**30110/ / 4261 Rocket-b**85316/ / Logistics-a**411/ / 436 Logistics-b**920/ / 271 Logistics-c**4939/ / 4796 Logistics-d***16572/ CE: Consistency enforcement techniques (reachability analysis and disjunctive constraint handling HP: Distance-based heuristic

Conclusion Developed effective techniques for improving partial-order planners: –Ranking partial plan heuristics, –Disjunctive representation for unsafe links, –Use of reachability analysis Presented and evaluated RePOP –Brings POP to the realm of effective planning algorithms –Can now exploit the flexibility of POP without too much efficiency penalty

Future Work Extend RePOP to deal with time and resource constraints Extend RePOP to deal with partially instantiated actions Improve the efficiency of RePOP in serial domains –Serial domains inherent weakness of POP? –Real-world domains tend to admit partially ordered plans Devising effective admissible heuristics for POP