Presentation is loading. Please wait.

Presentation is loading. Please wait.

© 2003 Warren B. Powell Slide 1 Approximate Dynamic Programming for High Dimensional Resource Allocation NSF Electric Power workshop November 3, 2003 Warren.

Similar presentations


Presentation on theme: "© 2003 Warren B. Powell Slide 1 Approximate Dynamic Programming for High Dimensional Resource Allocation NSF Electric Power workshop November 3, 2003 Warren."— Presentation transcript:

1 © 2003 Warren B. Powell Slide 1 Approximate Dynamic Programming for High Dimensional Resource Allocation NSF Electric Power workshop November 3, 2003 Warren Powell CASTLE Laboratory Princeton University http://www.castlelab.princeton.edu © 2003 Warren B. Powell, Princeton University

2 © 2003 Warren B. Powell Slide 2 Schneider National

3 © 2003 Warren B. Powell Slide 3 Schneider National

4 © 2003 Warren B. Powell Slide 4

5 © 2003 Warren B. Powell Slide 5

6 © 2003 Warren B. Powell Slide 6

7 © 2003 Warren B. Powell Slide 7 Air Mobility Command Air Mobility Command Fuel Cargo Handling Ramp Space Maintenance Cargo Holding

8 © 2003 Warren B. Powell Slide 8 The optimization challenge

9

10 Special equipment

11 © 2003 Warren B. Powell Slide 11 State variables Modeling the military airlift problem: »State variables: »Control variables:

12 © 2003 Warren B. Powell Slide 12 State variables We can formulate the problem of determining what to do with our aircraft as a dynamic program:

13 © 2003 Warren B. Powell Slide 13 State variables If we only have N=1 aircraft:

14 © 2003 Warren B. Powell Slide 14 State variables What if we have N>1 aircraft?

15 © 2003 Warren B. Powell Slide 15 State variables Number of resources Number of attributes Number of zeroes in size of state space

16 © 2003 Warren B. Powell Slide 16 Outline An algorithmic strategy for high-dimensional asset allocation problems

17 © 2003 Warren B. Powell Slide 17 Approximate dynamic programming Systems evolve through a cycle of exogenous and endogenous information Time

18 © 2003 Warren B. Powell Slide 18 Approximate dynamic programming Systems evolve through a cycle of exogenous and endogenous information Time

19 © 2003 Warren B. Powell Slide 19 Approximate dynamic programming Using this state variable, we obtain the optimality equations: Problem: Curse of dimensionality Three curses State space Outcome space Action space (feasible region)

20 © 2003 Warren B. Powell Slide 20 Approximate dynamic programming The computational challenge: How do we find ? How do we compute the expectation? How do we find the optimal solution?

21 © 2003 Warren B. Powell Slide 21 Approximate dynamic programming Approximation methodology: Can’t compute this!!!Don’t know what this is!

22 © 2003 Warren B. Powell Slide 22 Adaptive dynamic programming Alternative: Change the definition of the state variable: Time

23 © 2003 Warren B. Powell Slide 23 Adaptive dynamic programming Now our optimality equation looks like: We drop the expectation and solve the conditional problem: Finally, we substitute in our approximation:

24 © 2003 Warren B. Powell Slide 24 Adaptive dynamic programming Approximating the value function: »We choose approximations of the form:

25 © 2003 Warren B. Powell Slide 25 Approximate dynamic programming This period Future

26 © 2003 Warren B. Powell Slide 26 Approximate dynamic programming Our basic strategy: Separable approximation 0 1 2 3 4 5

27 © 2003 Warren B. Powell Slide 27 Research questions in electric power Special equipment

28 © 2003 Warren B. Powell Slide 28 Research questions in electric power Two-stage resource allocation under uncertainty

29 © 2003 Warren B. Powell Slide 29 Approximate dynamic programming

30 © 2003 Warren B. Powell Slide 30 Approximate dynamic programming

31 © 2003 Warren B. Powell Slide 31 Approximate dynamic programming

32 © 2003 Warren B. Powell Slide 32 Approximate dynamic programming We estimate the functions by sampling from our distributions. Marginal value:

33 © 2003 Warren B. Powell Slide 33 A dynamic network: Approximate dynamic programming t

34 © 2003 Warren B. Powell Slide 34 Approximate dynamic programming Stepping through time:

35 © 2003 Warren B. Powell Slide 35 Approximate dynamic programming Iterative learning:

36 © 2003 Warren B. Powell Slide 36 Nonlinear approximations Number of resources Approximate value function

37 © 2003 Warren B. Powell Slide 37 Competing algorithmic strategies Competing optimal algorithms: »Discrete dynamic programming Cannot handle even small problems Numerical comparisons are meaningless »Stochastic programming Bender’s decomposition is optimal for this problem class

38 © 2003 Warren B. Powell Slide 38 Benders decomposition Variations on Bender’s decomposition SPAR algorithm Deterministic approximation Iterations

39 © 2003 Warren B. Powell Slide 39 Conclusions: »Using sequences of separable, nonlinear approximations conquers the explosive growth with the number of resources. »We are now solving problems with thousands of resources. »But what about the attribute space? Complex equipment and people are typically described by vectors of attributes. We require multidimensional attributes to capture complex assets such as equipment and people. The size of the attribute space grows exponentially in the number of dimensions.

40 © 2003 Warren B. Powell Slide 40 Benders decomposition Variations on Benders decompositionSPAR Percent over optimal Attribute space = 10 Attribute space = 25 Attribute space = 50 Attribute space = 100

41 © 2003 Warren B. Powell Slide 41 Benders decomposition Variations on Benders decompositionSPAR Percent over optimal Increasing problem size makes solution much worse With SPAR, the solution gets better.

42 © 2003 Warren B. Powell Slide 42 Multidimensional attribute spaces decision d

43 © 2003 Warren B. Powell Slide 43 Multidimensional attribute spaces $450

44 © 2003 Warren B. Powell Slide 44 NE region PA TX NY Multidimensional attribute spaces

45 © 2003 Warren B. Powell Slide 45 Hierarchical Aggregation We can use a family of aggregation functions:

46 We can use different levels of aggregation to capture the value of an asset:

47 © 2003 Warren B. Powell Slide 47 Hierarchical aggregation Alternative: »Use multiple levels of aggregation at the same time Estimate at gth level of aggregation Weight on gth level of aggregation

48 © 2003 Warren B. Powell Slide 48 x f(x) Hierarchical aggregation

49 © 2003 Warren B. Powell Slide 49 x f(x) High structure Moderate structure Zero structure Hierarchical aggregation

50 © 2003 Warren B. Powell Slide 50 Bayesian weights Weight on disaggregate level Optimal weights Hierarchical aggregation

51 © 2003 Warren B. Powell Slide 51

52 © 2003 Warren B. Powell Slide 52 Hierarchical aggregation Aggregate Disaggregate Weighted Combination

53 © 2003 Warren B. Powell Slide 53 Hierarchical aggregation Iterations Weights 1324513245 Aggregation level 6767 Weight on most disaggregate level Weight on most aggregate levels Optimal weights change as the algorithm progresses:

54 © 2003 Warren B. Powell Slide 54 Conclusions »Hierarchical aggregation offers a powerful mechanism for handling high dimensional, arbitrary attribute spaces »Combined with the use of separable approximations for handling large numbers of assets, we have a powerful approach for large-scale resource allocation problems.

55 © 2003 Warren B. Powell Slide 55 Research questions Algorithmic questions: »Stepsizes and rate of convergence 1000100 We need to improve our understanding of adaptive stepsizes.

56 © 2003 Warren B. Powell Slide 56 Research questions Algorithmic questions: »Stability: we would like to measure the responsiveness to small changes. The instability limits analyses to big changes. Research: Big simulations will also be unstable. We need to calculate derivatives of simulations. Theory needs to be extended to new problem classes.

57 © 2003 Warren B. Powell Slide 57 Research questions in electric power Application to electric power: »Fuel optimization (continuous assets): What fuel to purchase when we can switch between fuels Design of fuel contracts Determining prices of forward contracts How much and where to store fuel. »Asset management problems (discrete assets): Unit commitment problems –Control of hydro units Positioning of assets for emergency response –Special equipment –People with specialized training

58 © 2003 Warren B. Powell Slide 58 Research questions in electric power Special equipment


Download ppt "© 2003 Warren B. Powell Slide 1 Approximate Dynamic Programming for High Dimensional Resource Allocation NSF Electric Power workshop November 3, 2003 Warren."

Similar presentations


Ads by Google