Robust Optimization Concepts and Examples

Slides:



Advertisements
Similar presentations
Jingming Xu Multimedia Communications Lab University of Waterloo
Advertisements

13/4/20051 Alaa Sadik Ph.D., University of Hull, UK Faculty of Education in Qena جامعة جنوب الوادي مركز تنمية قدرات أعضاء هيئة التدريس Effective Presentation.
Load Balancing in a Cluster-based Active Jiani Guo (Student Member, IEEE) Laxmi Bhuyan (Fellow, IEEE) March 15 th 2005 Seo, Dong Mahn.
Use of administrative data at Statistics Finland Ilkka Hyppönen Statistics Finland.
Lia Toledo Moreira Mota, Alexandre de Assis Mota, Wu, Shin-Ting
Environmental Impact Assessment of public and private projects EIA Directive 85/337/EEC as amended by 97/11/EC and 2003/35/EC Cohesion Fund Workshop
On the Merits of the Open Source Model Sherif El-Kassas Department of Computer Science The American University in Cairo WIPO International Seminar on IP.
Mt. Diablo Unified School District IFAS Training Purchase Requests Using Web Screens Creating PRs and Receiving Items (First Revision)
A small taste of inferential statistics
UKOLN is supported by: Exploring the Global Knowledge Space Dr Liz Lyon, UKOLN, University of Bath, UK SWMLAC ICT Masterclass Bristol, January
UKOLN is supported by: Starting to explore the role of memory institutions within the social fabric of the new Web Dr Liz Lyon, UKOLN, University of Bath,
Value Engineering Study Case Done By: Ameer A. Al-Saleh ( ID# ) For: Dr. Sadi Assaf Summer 2005 Effective Layout : Cost & Schedule Impact on Residential.
ICDT Optimal Distributed Declustering using Replication Keith Frikken Purdue University Jan 5, 2005.
May 9, September 2005, Barcelona, Spain Prioritization of Forestry Themes for the SRA Risto Päivinen.
21 st Century Learning P. Anandan Managing Director Microsoft Research India
CAS: Central Instrument for Managing for Results.
Thank you and welcome Linear Programming (LP) Modeling Application in manufacturing And marketing By M. Dadfar, PhD.
Pflugerville ISD - August Pflugerville Independent School District Integrated Physics & Chemistry Measurement.
A Fast Estimation of SRAM Failure Rate Using Probability Collectives Fang Gong Electrical Engineering Department, UCLA Collaborators:
Load Balancing Hybrid Programming Models for SMP Clusters and Fully Permutable Loops Nikolaos Drosinos and Nectarios Koziris National Technical University.
Computer Networking Lecture 20 – Queue Management and QoS.
MS&E 211 Quadratic Programming Ashish Goel. A simple quadratic program Minimize (x 1 ) 2 Subject to: -x 1 + x 2 ≥ 3 -x 1 – x 2 ≥ -2.
Chapter 2: Modeling with Linear Programming & sensitivity analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or.
LCSC - 01 Monte Carlo Simulation of Radiation Transport, for Stereotactic Radio Surgery Per Kjäll Elekta Instrument AB
1 Linear Optimization Under Uncertainty: Comparisons Weldon A. Lodwick.
Optimisation of Irradiation Directions in IMRT Planning Rick Johnston Matthias Ehrgott Department of Engineering Science University of Auckland M. Ehrgott,
Radiotherapy Planning Stephen C. Billups University of Colorado at Denver
Convex Position Estimation in Wireless Sensor Networks
Optimization in Financial Engineering Yuriy Zinchenko Department of Mathematics and Statistics University of Calgary December 02, 2009.
All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.
Second order cone programming approaches for handing missing and uncertain data P. K. Shivaswamy, C. Bhattacharyya and A. J. Smola Discussion led by Qi.
Effects of Sampling in IMRT Optimization by Ronald L. Rardin Professor of Industrial Engineering Purdue University West Lafayette, Indiana, USA Caesarea.
Visual Recognition Tutorial
Principles of Robust IMRT Optimization Timothy Chan Massachusetts Institute of Technology Physics of Radiation Oncology – Sharpening the Edge Lecture 10.
1 Reliability and Robustness in Engineering Design Zissimos P. Mourelatos, Associate Prof. Jinghong Liang, Graduate Student Mechanical Engineering Department.
1 The Use of Fuzzy Optimization Methods for Radiation Therapy of Tumors Weldon A. Lodwick.
Solver & Optimization Problems n An optimization problem is a problem in which we wish to determine the best values for decision variables that will maximize.
1 1 Slides by John Loucks St. Edward’s University Modifications by A. Asef-Vaziri.
Lecture II-2: Probability Review
Linear Programming Operations Research – Engineering and Math Management Sciences – Business Goals for this section  Modeling situations in a linear environment.
Application of Methods of Queuing Theory to Scheduling in GRID A Queuing Theory-based mathematical model is presented, and an explicit form of the optimal.
Wind Power Scheduling With External Battery. Pinhus Dashevsky Anuj Bansal.
Applications of Calculus. The logarithmic spiral of the Nautilus shell is a classical image used to depict the growth and change related to calculus.
1 CE 530 Molecular Simulation Lecture 7 David A. Kofke Department of Chemical Engineering SUNY Buffalo
1 Statistical Mechanics and Multi- Scale Simulation Methods ChBE Prof. C. Heath Turner Lecture 11 Some materials adapted from Prof. Keith E. Gubbins:
Data Mining to Aid Beam Angle Selection for IMRT Stuart Price-University of Maryland Bruce Golden- University of Maryland Edward Wasil- American University.
Sampling Issues for Optimization in Radiotherapy Michael C. Ferris R. Einarsson Z. Jiang D. Shepard.
MIT and James Orlin © More Linear Programming Models.
Linear Programming.  Linear Programming provides methods for allocating limited resources among competing activities in an optimal way.  Any problem.
Deterministic Operations Research Models J. Paul Brooks Jill R. Hardin Department of Statistical Sciences and Operations Research November 28, 2006.
Strong Bounds for Linear Programs with Cardinality Limited Violation (CLV) Constraint Systems Ronald L. Rardin University of Arkansas
1 Radiotherapy, hadrontherapy and treatment planning systems. Faiza Bourhaleb INFN-Torino University Med 1er-Morocco  Radiotherapy  Optimization techniques.
Risk Analysis & Modelling
Stochastic Optimization
HASE: A Hybrid Approach to Selectivity Estimation for Conjunctive Queries Xiaohui Yu University of Toronto Joint work with Nick Koudas.
Forward and Backward Deviation Measures and Robust Optimization Peng Sun (Duke) with Xin Chen (UIUC) and Melvyn Sim (NUS)
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Solver & Optimization Problems
6.5 Stochastic Prog. and Benders’ decomposition
A Brachytherapy Treatment Planning Software Based on Monte Carlo Simulations and Artificial Neural Network Algorithm Amir Moghadam.
Georgina Hall Princeton, ORFE Joint work with Amir Ali Ahmadi
Chapter 6. Large Scale Optimization
CSCI B609: “Foundations of Data Science”
An optimization approach to automate radiotherapy treatment planning
FOUNDATIONS OF MODERN SCIENTIFIC PROGRAMMING
6.5 Stochastic Prog. and Benders’ decomposition
Chapter 6. Large Scale Optimization
Optimization under Uncertainty
Presentation transcript:

Robust Optimization Concepts and Examples Yuriy Zinchenko Shane G. Henderson ORIE, Cornell University

Zinchenko and Henderson 2005 Outline What can go wrong with LP? A familiar blend problem The general picture Robust linear programming Software, resources, practicalities Radiation therapy for cancer treatment Zinchenko and Henderson 2005

What can go wrong with LP? Tough LP problem: max x + y s/t 1 x  1 1 y  1 x, y  0 ? Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Blend Problem but properties change with time $$ $$$ $ blend to get output properties at minimum cost for any input properties within reason Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Blend constraints Typical constraint looks like Low ≤ 10 x1 + 12 x2 + 7 x3 ≤ High Changes to Low ≤ a1 x1 + a2 x2 + a3 x3 ≤ High for any vector a that is “close” to (10, 12, 7) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 General robust LP min cTx s/t A(1) x  b1 A(2) x  b2 A(3) x  b3 Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 A more detailed view Simple linear constraint a x  1 x  0 with a “close” to 1, namely 0  a  2 Want x to work for all such a How do we deal with it? Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 a x  1, x  0 for all 0  a  2 max a x  1, x  0 0  a  2, x  0 2 x  1 , x  0 x  1/2 , x  0 Zinchenko and Henderson 2005

A slightly more involved example: a x + b y  1 where (a, b) “close” to (1, 1), namely in Ellipsoidal (spherical) “uncertainty” set U (a, b) is in U if (a, b) = (a0, b0) + (Da, Db) with (a0, b0) = (1, 1) and Da2 + Db2  1 Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Ellipsoidal “uncertainty” set U (a, b) = (a0, b0) + (Da, Db) (a0, b0) = (1, 1) Da2 + Db2  1 Want (x, y) to satisfy a x + b y  1, for all (a, b) from U U (a0, b0) Zinchenko and Henderson 2005

What can we say about a x + b y ? a x + b y  1 for all (a, b) in U max a x + b y  1 (a, b) in U What can we say about a x + b y ? a x + b y = (a0 + Da) x + (b0 + Db) y = (a0 x + b0 y) + (Da x + Db y) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 For a moment, think of (x, y) as your objective function (fixed) max a x + b y ( 1 ?) (a, b) in U same as (a0 x + b0 y) + max (Da x + Db y) ( 1 ?) Da2 + Db2  1 (x, y) U (a0, b0) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 max (Da x + Db y) ( 1 - (a0 x + b0 y) ?) Da2 + Db2  1 Here Da x + Db y  ||(x, y)|| = (x2 + y2)1/2 the “length” of (x, y) (x1, y1) U (x2, y2) (a0, b0) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 a x + b y  1 for all (a, b) in U max a x + b y  1 (a, b) in U (a0 x + b0 y) + max (Da x + Db y)  1 Da2 + Db2  1 ||(x, y)||  1 - (a0 x + b0 y) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Good news Can handle constraints of this type ||(x, y)||  1 - (1 x + 1 y) easily (the so-called second-order conic programming (SOCP)) Not much harder than linear programming! Zinchenko and Henderson 2005

General Robust LP formulation max cTx s/t A(i) x  bi, i = 1,…,m where c, x Î Rn, A(i) Î R1 x n, A(i)=A(i)0 + wi Pi with wi Î R1 x ki, ||wi||  1, i=1,…,m, Pi Î Rki x n Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 SOCP equivalent: max cTx s/t || Pi x ||  bi - A(i)0 x, i = 1,…,m Probabilistic interpretation: think of A(i) taken from an a-level set of your favorite probability distribution (e.g. multivariate normal) the robust constraint will read satisfy the constraint with a given probability a Zinchenko and Henderson 2005

Where’d the ellipse come from? Expert opinion Statistics: Averages live in ellipsoids Doesn’t have to be an ellipse. Can be some other shape (e.g., boxes) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Software Commercial: Mosek (http://www.mosek.com/) “Free”: SeDuMi (http://sedumi.mcmaster.ca/) SDPT3.x (http://www.math.nus.edu.sg/~mattohkc/sdpt3.html/) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Practicalities Realistic problem sizes number of variables/constraints on the order of 103 – 104 depends (greatly) on the problem data structure/sparsity Possible to obtain a “good”, “inexpensive” approximation with LP Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Generality Possible to extend this approach to quite a few other convex programming problems Resources Lectures on Modern Convex Optimization: Analysis, Algorithms, and Engineering Applications by A. Ben-Tal, A. S. Nemirovskii Google for Robust Optimization (robust LP etc.) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 In practice... Joint work with Millie Chu (Cornell) and Michael B. Sharpe (Princess Margaret Hospital, Toronto) Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Cancer treatment About 1.3 million new cancer cases in the U.S. each year Nearly 60% receive radiation therapy (in conjunction with surgery, chemotherapy etc) Zinchenko and Henderson 2005

External beam radiation therapy Radiation delivered by a linear accelerator Cancer cells more susceptible than normal cells Overlay beams from different angles Dose given in daily fractions for ~ 6 weeks Zinchenko and Henderson 2005

Intensity Modulated Radiation Therapy Block parts of the radiation beam – discretize the whole beam into a grid of smaller “beamlets” Choose different intensities for each beamlet Intensity Modulated Radiation Therapy Collaborative Working Group, 2001 Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Treatment Planning Goal: Choose beam angles and beamlet intensities that deliver enough radiation to kill all tumor cells, while avoiding healthy organs & tissue as much as possible Take CT scan Delineate target region and healthy structures Discretize body as small cubes, or “voxels” Formulate & solve a mathematical program to find a “good” plan Zinchenko and Henderson 2005 Princess Margaret Hospital

Robust Treatment Planning Setup errors + Patient motion + Structural changes during treatment = uncertainty in geometry Don’t rescan patient much if at all Use RO to “robustify” mathematical program Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Model Formulation • Many different formulations exist – we use a penalty formulation minimize: penalty objective subject to: Pr(Dose to voxel i in healthy structure k ≤ Uk) ≥ 0.95 Pr(Dose to voxel i in tumor ≥ L) ≥ 0.95 x = beamlet intensities ≥ 0 Zinchenko and Henderson 2005

Computational Results Prostate: tumor + 5 healthy regions 5 equi-spaced beams, ~ 225 beamlets from each angle Voxel size = 2 cm, ~ 400 total voxels Solver: Mosek, v. 3.0.1.18 Solve time = 6 seconds (LP), 45 minutes (SOCP) Zinchenko and Henderson 2005

Dose-Volume Histograms % of structure receiving ≥ x Gy deterministic solution’s plan DVH of expected dose stochastic solution’s plan Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Comparison Simulate 10 treatments (45 fractions each) For each of the 10 treatments, and for each solution (deterministic & stochastic), calculated dose delivered to each voxel in each fraction summed over the 45 fractions to get total dose delivered to each voxel plotted DVH Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 1 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 2 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 3 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 4 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 5 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 6 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 7 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 8 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 9 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 DVH – Treatment 10 det stoch Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 Conclusions LP “pushes you into a corner” True situation never same as data Robust LP: Find good solution that is always feasible within reason Efficient solution methods: can solve real problems Software available The End Zinchenko and Henderson 2005

Zinchenko and Henderson 2005 A Bit More Detail Di(x) = Dose delivered to voxel i in N fractions, with intensities x, a random variable Di(x) is the sum of N random variables (N = 45), assume iid, apply CLT, so Di(x) is approximately normally distributed Take n sample shifts, s1,...,sn, with associated probabilities p = (p1,...,pn)T Let ai(∙)T = ai(s1)T ai(s2)T dose delivered to voxel i, shifted by sj, from each beamlet with unit intensity ai(sn)T so that NpTai(∙)Tx = expected total dose delivered to voxel i, for N fractions. Let vi(x) = sample variance of dose delivered to voxel i  Di(x) ~ Normal ( NpTai(·)Tx, Nvi(x) ) … Zinchenko and Henderson 2005

Probabilistic Constraints Want constraints to be violated with low probability (say, δ = .05) Example: maximum dose constraint on voxel i in Hk: Assuming Di(x) ~ Normal ( NpTai(∙)Tx, Nvi(x) ),   mk Want P(Di(x) > mk) ≤ δ     Second order cone program (SOCP) Zinchenko and Henderson 2005

Dose-Volume Constraints Physicians like constraints of form: “<= fraction fk of structure Hk gets >= dk” 0-1 var for each voxel: = 1 if dose is > dk. MIP: Hard to solve! Many voxels get near max allowed dose Alternative: upper bound the “excess” dose. For healthy structure Hk, we require: Linear constraints☺ Zinchenko and Henderson 2005