R&D Portfolio Optimization One Stage R&D Portfolio Optimization with an Application to Solid Oxide Fuel Cells Lauren Hannah 1, Warren Powell 1, Jeffrey.

Slides:



Advertisements
Similar presentations
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Advertisements

Testing Linear Pricing Algorithms for use in Ascending Combinatorial Auctions (A5) Giro Cavallo David Johnson Emrah Kostem.
Modeling of the Current Distribution in Aluminum Anodization Rohan Akolkar and Uziel Landau Department of Chemical Engineering, CWRU, Cleveland OH
IEOR 4004: Introduction to Operations Research Deterministic Models January 22, 2014.
Design of Experiments Lecture I
Design of the fast-pick area Based on Bartholdi & Hackman, Chpt. 7.
13-Optimization Assoc.Prof.Dr. Ahmet Zafer Şenalp Mechanical Engineering Department Gebze Technical.
Dynamic Programming Rahul Mohare Faculty Datta Meghe Institute of Management Studies.
Markov Decision Process
MS&E 211 Quadratic Programming Ashish Goel. A simple quadratic program Minimize (x 1 ) 2 Subject to: -x 1 + x 2 ≥ 3 -x 1 – x 2 ≥ -2.
Monte Carlo Methods and Statistical Physics
SA-1 Probabilistic Robotics Planning and Control: Partially Observable Markov Decision Processes.
3D-STAF: Scalable Temperature and Leakage Aware Floorplanning for Three-Dimensional Integrated Circuits Pingqiang Zhou, Yuchun Ma, Zhouyuan Li, Robert.
Computational Stochastic Optimization:
1 Reinforcement Learning Introduction & Passive Learning Alan Fern * Based in part on slides by Daniel Weld.
Gizem ALAGÖZ. Simulation optimization has received considerable attention from both simulation researchers and practitioners. Both continuous and discrete.
Classification and Prediction: Regression Via Gradient Descent Optimization Bamshad Mobasher DePaul University.
CHAPTER 16 MARKOV CHAIN MONTE CARLO
Markov Decision Processes
Infinite Horizon Problems
Planning under Uncertainty
POMDPs: Partially Observable Markov Decision Processes Advanced AI
Kuang-Hao Liu et al Presented by Xin Che 11/18/09.
© 2003 Warren B. Powell Slide 1 Approximate Dynamic Programming for High Dimensional Resource Allocation NSF Electric Power workshop November 3, 2003 Warren.
Stochastic Differentiation Lecture 3 Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius, Lithuania EURO Working Group on Continuous.
Approximate Dynamic Programming for High-Dimensional Asset Allocation Ohio State April 16, 2004 Warren Powell CASTLE Laboratory Princeton University
Job Release-Time Design in Stochastic Manufacturing Systems Using Perturbation Analysis By: Dongping Song Supervisors: Dr. C.Hicks & Dr. C.F.Earl Department.
1 A Second Stage Network Recourse Problem in Stochastic Airline Crew Scheduling Joyce W. Yen University of Michigan John R. Birge Northwestern University.
Efficient Methodologies for Reliability Based Design Optimization
1 Chapter 5 Introduction to Factorial Designs Basic Definitions and Principles Study the effects of two or more factors. Factorial designs Crossed:
Planning operation start times for the manufacture of capital products with uncertain processing times and resource constraints D.P. Song, Dr. C.Hicks.
Improving Stable Processes Professor Tom Kuczek Purdue University
Radial Basis Function Networks
1 IE 607 Heuristic Optimization Introduction to Optimization.
CHAPTER 15 S IMULATION - B ASED O PTIMIZATION II : S TOCHASTIC G RADIENT AND S AMPLE P ATH M ETHODS Organization of chapter in ISSO –Introduction to gradient.
Stochastic Approximation and Simulated Annealing Lecture 8 Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius, Lithuania EURO Working.
Computational Stochastic Optimization: Bridging communities October 25, 2012 Warren Powell CASTLE Laboratory Princeton University
Myopic Policies for Budgeted Optimization with Constrained Experiments Javad Azimi, Xiaoli Fern, Alan Fern Oregon State University AAAI, July
Process Flowsheet Generation & Design Through a Group Contribution Approach Lo ï c d ’ Anterroches CAPEC Friday Morning Seminar, Spring 2005.
Planning Production of a Set of Semiconductor Components with Uncertain Wafer and Component Yield Frank W. Ciarallo Assistant Professor Biomedical, Industrial.
Slide 1 Tutorial: Optimal Learning in the Laboratory Sciences The knowledge gradient December 10, 2014 Warren B. Powell Kris Reyes Si Chen Princeton University.
Monte Carlo Simulation and Personal Finance Jacob Foley.
A Modified Meta-controlled Boltzmann Machine Tran Duc Minh, Le Hai Khoi (*), Junzo Watada (**), Teruyuki Watanabe (***) (*) Institute Of Information Technology-Viet.
LINEAR CLASSIFICATION. Biological inspirations  Some numbers…  The human brain contains about 10 billion nerve cells ( neurons )  Each neuron is connected.
Stochastic Linear Programming by Series of Monte-Carlo Estimators Leonidas SAKALAUSKAS Institute of Mathematics&Informatics Vilnius, Lithuania
Discriminant Functions
CSE-573 Reinforcement Learning POMDPs. Planning What action next? PerceptsActions Environment Static vs. Dynamic Fully vs. Partially Observable Perfect.
Non-Bayes classifiers. Linear discriminants, neural networks.
Vaida Bartkutė, Leonidas Sakalauskas
The Markov Chain Monte Carlo Method Isabelle Stanton May 8, 2008 Theory Lunch.
Uncertainty Quantification in Climate Prediction Charles Jackson (1) Mrinal Sen (1) Gabriel Huerta (2) Yi Deng (1) Ken Bowman (3) (1)Institute for Geophysics,
Classification Ensemble Methods 1
Chapter 2-OPTIMIZATION G.Anuradha. Contents Derivative-based Optimization –Descent Methods –The Method of Steepest Descent –Classical Newton’s Method.
September 28, 2000 Improved Simultaneous Data Reconciliation, Bias Detection and Identification Using Mixed Integer Optimization Methods Presented by:
Comparison Value vs Policy iteration
Sequential Off-line Learning with Knowledge Gradients Peter Frazier Warren Powell Savas Dayanik Department of Operations Research and Financial Engineering.
Meta-controlled Boltzmann Machine toward Accelerating the Computation Tran Duc Minh (*), Junzo Watada (**) (*) Institute Of Information Technology-Viet.
Data Driven Resource Allocation for Distributed Learning
Making complex decisions
6.5 Stochastic Prog. and Benders’ decomposition
Markov Decision Processes
ME 521 Computer Aided Design 15-Optimization
Estimating Networks With Jumps
Slides for Introduction to Stochastic Search and Optimization (ISSO) by J. C. Spall CHAPTER 15 SIMULATION-BASED OPTIMIZATION II: STOCHASTIC GRADIENT AND.
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
CS 188: Artificial Intelligence Fall 2007
6.5 Stochastic Prog. and Benders’ decomposition
Reinforcement Learning Dealing with Partial Observability
Presentation transcript:

R&D Portfolio Optimization One Stage R&D Portfolio Optimization with an Application to Solid Oxide Fuel Cells Lauren Hannah 1, Warren Powell 1, Jeffrey Stewart 2 1 Princeton University, Department of Operations Research and Financial Engineering 2 Lawrence Livermore National Laboratory

R&D Portfolio Optimization The R&D Portfolio Problem A government or large corporation wants to spend resources on research. What is the best allocation? Examples: –Wind technologies –Storage technologies –Solid oxide fuel cells –Carbon capture and storage projects

R&D Portfolio Optimization The R&D Portfolio Problem The setup: –Choose a set of projects to fund –Research occurs on those projects….which then changes the state of the world –We must live with that new state The constraints: –Budget –0/1 project funding The goal: –Maximize the expectation of a utility function based on the state of the world after research occurs

R&D Portfolio Optimization The R&D Portfolio Problem Challenges: –Projects are not (necessarily) independent Outcomes may be dependent Project areas may overlap Projects may be focused at different levels (wind technologies vs. bipolar plate coating) –Utility function is not a sum of project values –Problem is fundamentally stochastic knapsack optimization

R&D Portfolio Optimization Solid Oxide Fuel Cells Ceramic (solid oxide) electrolyte Used for stationary power production; high running temps allow use of CH 4 and other non-H 2 gases Research areas: anode, cathode, electrolyte, bipolar plates, seal and pressure vessel

R&D Portfolio Optimization Project dependence: what is a project? –May be specific component (ie anode or cathode), or it may be an entire area (ie wind technologies) –Can affect multiple underlying technologies, such as surface area and production cost of anode This allows sophisticated project interactions – Outcomes occur for technologies, not overall projects Lets dependent outcomes be easily modeled The R&D Portfolio Problem

R&D Portfolio Optimization SOFC Components and Technologies ComponentTechnologyParameterTechnologyParameterTechnologyParameter AnodeSurface AreaPower Density Production Cost CathodeSurface AreaPower Density Production Cost ElectrolyteReaction Stability DegradationProduction Cost Bipolar Plates Temperature Stability ConductivityProduction Cost SealTemperature Stability Chemical Stability Production Cost Pressure Vessel Design---- Production Cost

R&D Portfolio Optimization Projects and Technologies Technology changes are additive and denoted by

R&D Portfolio Optimization R&D Portfolios, Mathematically M projects Decision x in {0,1} M There is a fixed budget, b Each project has cost c i, so Σc i x i b Cost is a function of technologies at time 1, C(ρ 1 ). The problem becomes…

R&D Portfolio Optimization SOFC Cost Function Find power output for a fuel cell, assume 1,000cm 2 footprint: Find the capital cost of the fuel cell: For each possible combination of components I = i 1 (anode index), i 2 (cathode index), i 3 (electrolyte), i 4 (bipolar plates), i 5 (seal), and i 6 (pressure vessel), find cost per kWh, that is (production cost) / (lifetime x kW output):

R&D Portfolio Optimization SOFC Cost Function, Continued Find the lifetime for the fuel cell (minimum of component lifetimes): Get an unpenalized cost per kWh: Calculate penalties for not meeting temperature and design specifications. First, calculate minimum operating temperature:

R&D Portfolio Optimization SOFC Cost Function, Continued Create the penalty term: Add penalty to cost per kWh: The cost for the state of technologies is the cost of the best fuel cell:

R&D Portfolio Optimization Mathematical Challenges The number of possible portfolios grows combinatorially –10 projects out of 30 = ~10 million portfolios –20 projects out of 60 = ~ 4.2 x portfolios Cost function may not be convex or separable Expectation of cost function is hard to compute given a portfolio

R&D Portfolio Optimization Previous Approaches R&D literature: –Simplifies problem to use math programming –Does not often address uncertainty or complex project interactions Stochastic Combinatorial Optimization: –Not used for R&D problems –Uses metaheuristics such as branch and bound, simulated annealing, nested partitions, ant colony optimization, etc. –Performance uncertain (doubtful?) for R&D.

R&D Portfolio Optimization Stochastic Gradient Portfolio Optimization Idea: linearly approximate by Iteratively estimate marginal value i at iteration n by Choose portfolio x n+1 by solving

R&D Portfolio Optimization Stochastic Gradient Portfolio Optimization To determine ith stochastic gradient,, create new portfolio, perturbed around ith project If project i is in the old portfolio, take it out. If it is not, add it in.

R&D Portfolio Optimization Stochastic Gradient Portfolio Optimization Get value for original portfolio Get perturbed technology change,, from perturbed portfolio Update technology parameters for by Obtain value for, Smooth into previous estimate,,

R&D Portfolio Optimization

Comparisons SGPO –Stochastic gradient portfolio optimization EPI-MC –Evolutionary Policy Iteration (Chang, Lee, Fu, Markus, 2005) –Modified to avoid assumption that expectation can be computed exactly. –Provably convergent by using increasing number of samples every iteration to estimate expectation. SA –Simulated annealing (Gutjahr and Pflug, 1996)

R&D Portfolio Optimization Results Marginal values vary with portfolio make-up. Marginal cost of a project

R&D Portfolio Optimization Results Value of selected portfolio for as a function of time for single run. SGPO gravitates to a good value quickly..

R&D Portfolio Optimization Results Empirical density function of portfolio selected at algorithm termination in terms of cost per kWh. Fraction of projects

R&D Portfolio Optimization Results Statistics for terminal portfolio, based on problem class and run time. The x choose y problems give all SOFC projects equal costs, the knapsack problems do not.