3/25: Leaving STRIPS Planning and going to Sapa. Administrivia 3/25  Homework 4 due next class  Midterm soon after that  Will be take home  Will have.

Slides:



Advertisements
Similar presentations
Learning to Improve the Quality of Plans Produced by Partial-order Planners M. Afzal Upal Intelligent Agents & Multiagent Systems Lab.
Advertisements

Forward-Chaining Partial-Order Planning Amanda Coles, Andrew Coles, Maria Fox and Derek Long (to appear, ICAPS 2010)
Practical Planning: Scheduling and Hierarchical Task Networks Chapter CS 63 Adapted from slides by Tim Finin and Marie desJardins.
Language for planning problems
Planning Module THREE: Planning, Production Systems,Expert Systems, Uncertainty Dr M M Awais.
CLASSICAL PLANNING What is planning ?  Planning is an AI approach to control  It is deliberation about actions  Key ideas  We have a model of the.
Situation Calculus for Action Descriptions We talked about STRIPS representations for actions. Another common representation is called the Situation Calculus.
Time Constraints in Planning Sudhan Kanitkar
Plan Generation & Causal-Link Planning 1 José Luis Ambite.
PROBABILITY. Uncertainty  Let action A t = leave for airport t minutes before flight from Logan Airport  Will A t get me there on time ? Problems :
CS3500 Software Engineering Project Management (1) In 1986 one well-known software engineer (Tom DeMarco) made the simple but important observation: “You.
Project Management Operations -- Prof. Juran. 2 Outline Definition of Project Management –Work Breakdown Structure –Project Control Charts –Structuring.
When is Temporal Planning Really Temporal? William Cushing Subbarao Kambhampati Special thanks to: J. Benton, Menkes van den Briel Mausam Daniel Weld.
Planning and Scheduling. 2 USC INFORMATION SCIENCES INSTITUTE Some background Many planning problems have a time-dependent component –  actions happen.
1 Classical STRIPS Planning Alan Fern * * Based in part on slides by Daniel Weld.
Happy Spring Break!. Integrating Planning & Scheduling Subbarao Kambhampati Scheduling: The State of the Art.
CPSC 322, Lecture 18Slide 1 Planning: Heuristics and CSP Planning Computer Science cpsc322, Lecture 18 (Textbook Chpt 8) February, 12, 2010.
Planning Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 11.
Planning II GraphPlan and Russell and Norvig: ch.12 CMSC421 – Fall 2006 based on material from Jim Blythe, JC Latombe, Marie desJardins and Daphne Koller.
Artificial Intelligence Chapter 11: Planning
3/25  Monday 3/31 st 11:30AM BYENG 210 Talk by Dana Nau Planning for Interactions among Autonomous Agents.
Metric/Temporal Planning 10/11  We will have regular class on Wednesday  There will be a guest lecture on Nov 3 rd  By Kanna Rajan on how NASA used.
4/1: Search Methods and Heuristics  Progression: Sapa (TLPlan; FF)  Regression: TP4  Partial order: Zeno (IxTET)
Extending Graphplan to handle Resources Presenter: Pham Van Cuong Department of Computer Science New Mexico State University.
Planning CSE 473. © Daniel S. Weld Topics Agency Problem Spaces Search Knowledge Representation Reinforcement Learning InferencePlanning Supervised.
Planning Graph-based Heuristics for Cost-sensitive Temporal Planning Minh B. Do & Subbarao Kambhampati CSE Department, Arizona State University
Handling non-determinism and incompleteness. Problems, Solutions, Success Measures: 3 orthogonal dimensions  Incompleteness in the initial state  Un.
Metric/Temporal Planning. Metric Temporal Planning  MTP Adds time and resources to planning Special cases: TP: Temporal planning RP: Resource Planning.
Temporal Planning  action models  Using PDDL2.1 standard  how to model the search  Progression; Regression; PO planning  how to extract good heuristics.
Planning Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 11.
Hash Tables1 Part E Hash Tables  
1 Planning Chapters 11 and 12 Thanks: Professor Dan Weld, University of Washington.
SAPA: A Domain-independent Heuristic Temporal Planner Minh B. Do & Subbarao Kambhampati Arizona State University.
Planning II CSE 573. © Daniel S. Weld 2 Logistics Reading for Wed Ch 18 thru 18.3 Office Hours No Office Hour Today.
Tier I: Mathematical Methods of Optimization
A Framework for Planning in Continuous-time Stochastic Domains Håkan L. S. Younes Carnegie Mellon University David J. MuslinerReid G. Simmons Honeywell.
Lesson №2. is the unique activity that has a beginning and an end time, aimed at achieving a predetermined result/goal, the creation of a specific, unique.
1 Endgame Logistics  Final Project Presentations  Tuesday, March 19, 3-5, KEC2057  Powerpoint suggested ( to me before class)  Can use your own.
Homework 1 ( Written Portion )  Max : 75  Min : 38  Avg : 57.6  Median : 58 (77%)
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Problem Solving with Decisions
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
1 The LPSAT Engine and its Application to Metric Planning Steve Wolfman University of Washington CS&E Advisor: Dan Weld.
Test Taking Tips Test Prep  Preparation for your first test should begin on the first day of class; this includes paying attention.
1 Short Term Scheduling. 2  Planning horizon is short  Multiple unique jobs (tasks) with varying processing times and due dates  Multiple unique jobs.
Cost-Optimal Planning with Constraints and Preferences in Large State Spaces Stefan Edelkamp, Shahid Jabbar, Mohammed Nazih University of Dortmund.
Extending PDDL to Model Stochastic Decision Processes Håkan L. S. Younes Carnegie Mellon University.
PROJECT MANAGEMENT Approaches By- Rahul Jain. Plan your work first…..then work your plan.
Thermochemistry Energy, heat: Uses and implications.
AI Lecture 17 Planning Noémie Elhadad (substituting for Prof. McKeown)
1 CMSC 471 Fall 2004 Class #21 – Thursday, November 11.
1 CMSC 471 Fall 2002 Class #24 – Wednesday, November 20.
Temporal Planning with Continuous Change J.Scott Penbrethy Daniel S. Weld Presented by - Parag.
Planning in FOL Systems sequences of actions to achieve goals.
CSE573 Autumn /11/98 Planning Administrative –PS3 due 2/23 –Midterm Friday Last time –regression planning algorithm –introduction to plan-space.
Heuristic Search Planners. 2 USC INFORMATION SCIENCES INSTITUTE Planning as heuristic search Use standard search techniques, e.g. A*, best-first, hill-climbing.
Heuristic Optimization Methods
Introduction Contents Sungwook Yoon, Postdoctoral Research Associate
Planning: Heuristics and CSP Planning
Planning: Heuristics and CSP Planning
Planning José Luis Ambite.
Graphplan/ SATPlan Chapter
Planning CSE 573 A handful of GENERAL SEARCH TECHNIQUES lie at the heart of practically all work in AI We will encounter the SAME PRINCIPLES again and.
Project Management Operations -- Prof. Juran.
Planning Problems On(C, A)‏ On(A, Table)‏ On(B, Table)‏ Clear(C)‏
Class #19 – Monday, November 3
Class #20 – Wednesday, November 5
Heuristic Planning with Time and Resources
Memory-Based Learning Instance-Based Learning K-Nearest Neighbor
Class #17 – Tuesday, October 30
Presentation transcript:

3/25: Leaving STRIPS Planning and going to Sapa

Administrivia 3/25  Homework 4 due next class  Midterm soon after that  Will be take home  Will have a “Shock and Awe” flavor  You can be an “embedded exam taker” by suggesting problems  Today:  Metric/Temporal planning (MTP)  Representation issues  Modeling MTP in Progression and Regression  And Graphplan and PO planning etc. etc.

Metric Temporal Planning  Time: Durative actions; Temporal constraints on goals (deadlines; inter-goal constraints – eg. Make sure to be in airport 2 hours before you are in the plane); Exogenous events  Durations may be static or “dynamic”  duration depends on the context—eg. Time to fill your gas tank depends on how empty the tank is to begin with  Advanced issues: Uncertain durations…  Modeling issues: When are preconditions needed? How long will they persist? When are effects given?  A default assumption is to say that all preconditions are needed at the beginning and must hold during the entire action’s duration. And that all effects will be available at the end of the action  E.g Consider “Grading homeworks” action—when are the homeworks needed? When are the grades available? What does your teacher tell you?  Planning issues: How to support concurrency?(see next slide) How to support multi- objective (cost/make-span/robustness) optimization  Resources: Actions may consume/produce (continuous quantity) “resources”  Modeling issues: How to model resource availability (especially over the duration of an action)  Planning issues: How to efficiently reason with continuous quantities during planning Special cases: TP: Temporal planning RP: Resource Planning

Concurrency  Suppose I tell you that a plan P contains actions A1… A10, each with duration d1…d10, then what is the makespan (execution duration) of P?  Makespan(P) >= max(d1…d10)  If Makespan(P) = Sum(d1…d10), then it is a strictly serial plan  If Makespan(P) > Sum(d1..d10), then there is idle-time in the plan  Actions don’t need to start right after the preceding action  Think of the bank teller gossiping with his colleague in between servicing each customer  Planned idle/slack time may not always be a bad thing—it can sometimes improve the robustness of the plan  Think of three travel plans involving connections in Minneapolis: Plan 1 schedules 5 min for connection time; plan 2 schedules 1 hour; plan 3 schedules 2 days. Which one is better (all else being equal).

Some Brand Names  Planners that can handle similar types of temporal and resource constraints:  TLPlan, HSTS, IxTexT, Zeno, SAPA  TlPlan, SAPA are progression-based planners  HSTS,IxTET,Zeno are partial-order-based planners  TlPlan,HSTS are domain-customized planners; the rest are domain independent  Planners that can handle a subset of constraints:  Only temporal: TGP, TPG, LPGP  Only resources: LPSAT, GRT-R, Kautz-Walser  Subset of temporal and resource constraints: TP4, Resource-IPP  LPGP and LPSAT are “loosely-coupled” systems. LPSAT connects SAT and LP solvers; LPGP connects Graphplan and LPsolver  Issues of how “tight” is the loose-connection.  TGP,TPG,LPGP are Graphplan-based  LPSAT is based on SAT encodings being sent to LP solvers  Kautz-Walser is based solely on LP encodings

Approaches for MTP  In theory, pretty much every one of the approaches we saw for classical planning can be (and have been) extended to MTP (with varying degrees of scalability)  There are some interesting tradeoffs  PO planners are easiest to extend to support the concurrency needed for durative actions  Have harder time handling resources (because resource consumption depends on exactly what actions occurred before this time point)  Progression planners easiest to extend to support resource consuming actions  But harder time handling concurrency (need to consider “advancing clock” as a separate option in addition to applying one of the actions)

3/27: Our Road Map  Will focus on conjunctive planning approaches— with special attention to Sapa  action models  Using PDDL2.1 standard  how to model the search  Progression; Regression; PO planning  how to extract good heuristics

Action Representation Flying (in-city ?airplane ?city1) (fuel ?airplane) > 0  (in-city ?airplane ?city1) (in-city ?airplane ?city2) consume (fuel ?airplane) Durative with E A = S A + D A Instantaneous effects e at time t e = S A + d, 0  d  D A Preconditions need to be true at the starting point, and protected during a period of time d, 0  d  D A Action can consume or produce continuous amount of some resource Action Conflicts: Consuming the same resource One action’s effect conflicting with other’s precondition or effect

(:durative-action burn_match :parameters () :duration (= ?duration 15) :condition: (and (at start have_match) (at start have_strikepad)) :effect (and (at start have_light) (at end (not have_light)) ) have_match, have strikepad have_light ~have_light (dur: 15) (:durative-action cross_cellar :parameters () :duration (= ?duration 10) :condition (and (at start have_light) (over all have_light) (at start at_steps)) :effect (and (at start (not at_steps)) (at start crossing)(at end at_fuse_box) ) have_light (dur: 10) have_light, at_steps at_fuse_box ~at_steps, crossing PDDL 2.1 (Level 2) Pure Durative Actions

PDDL 2.1 Level 3: Durative actions and numeric quantities (but discrete effects) The entire energy to be consumed is “encumbered” at the very beginning (even though it gets consumed Slowly over the full duration.

PDDL 2.1 Level 4: Durative actions and numeric quantities (with continuous effects: )

Issues in modeling continuous change by discrete vs. continuous effects  Consider the action of boiling a pan of water  The quantity “temperature of water” changes continuously over the duration of the action  We can ignore continuous effects by specifying that temperature is C at the end  Easy to handle; can only access the temperature at the end of the action; Reduces concurrency (what if we also put a blow torch to the pan to “hasten” the process?)  Or we can specify that the temperature of the water raises at a linear rate until it becomes 100  Harder to handle; but allows more concurrency (the total rate of increase is summation of all the individual rates of increase)

PDDL 2.1 Standard: Summary  Durations  Static and dynamic durations allowed  Also allows duration inequalities  Preconditions  Can be “at start” or “over all” (throughout the duration)  Doesn’t model preconditions being needed for arbitrary durations in the middle  Effects  Can be “at start” or “at end”  This makes effects “discrete”  Numeric quantities  Can be present in the preconditions or effects  Presence in the effects can be “discrete” (“at start”/”at end”) or continuous  Continuous change specified by giving a “rate” at which the quantity changes  Non-linear rate harder

State of the Art (as of IPC2002)  At IPC 2002; PDDL 2.1 standard had three levels  Level 1: STRIPS/ADL  Level 2: +Durative Actions  FF, LPG, SAPA….  Level 3: +Numeric quantities discrete change  Sapa, LPG  Level 4: +Continuous change  None at IPC  Some planners can handle it “in theory” but none are scalable

Problem Representation  Achievement Goals are specified as a list where pi needs to hold by time t i  ti is the deadline by which G must hold. It can be metric time (e.g. make clear(b) true by 2pm.)  If ti is omitted we will assume that G is a non-deadline goal (must be true by the time the plan is done.  “Persist Goals” are specified as a condition and an interval over which it must hold  A persist goal may be supported by different actions for the different parts of the duration ( “goal reduction” a la ZENO )  E.g. striking multiple matches to have light over a duration

Plan representation A1 A2 A3 Drive(cityA,cityB) Q At(truck,B) An executable plan must provide -- the actions that need to be executed -- the start times for each of the actions  Or a set of simple temporal constraints on the set of actions (S.T.C. are generalization of partial orders) E.g. A1—[4,5]  A2 (means 4 <= ST(A2) – ST(A1) <= 5 ) Plan views: Pert and Gantt charts GANTT Chart is what is shown on the right PERT shows the Causal links

Plan Quality Measures  Makespan: Clock time for the execution of the plan (more concurrency  lower makespan)  Slack: The difference between the deadline for a goal and the time by which the plan achieves it  Tardiness is negative slack  Optimize max/min/average slack/tardiness measures  Cost: Sum of costs of all the actions  Can be split into multiple dimensions, one corresponding to each resource A1 A2 A3 Drive(cityA,cityB) Q At(truck,B) Can two plans with same make-span have different slack measures?