Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 5. The Duality Theorem

Similar presentations


Presentation on theme: "Chapter 5. The Duality Theorem"β€” Presentation transcript:

1 Chapter 5. The Duality Theorem
Given an LP, we define another LP derived from the same data, but with different structure. This LP is called the dual problem (μŒλŒ€λ¬Έμ œ). The main purpose to consider dual is to obtain an upper bound (estimate) on the optimal objective value of the given LP (for maximization problem) without solving it to optimality. Also dual problem provides optimality conditions of a solution π‘₯ βˆ— for an LP and help to understand the behavior of the simplex method. Very important concept to understand the properties of the LP and the simplex method. OR-1 Opt. 2018

2 Preliminaries Taking nonnegative linear combination of inequality constraints: Consider two constraints π‘₯ 1 +2 π‘₯ 2 ≀3 and 2 π‘₯ 1 + π‘₯ 2 ≀4 …. (1’) In vector notation : π‘Ž 1 β€²π‘₯≀3, π‘Ž 2 β€²π‘₯≀4, where π‘Ž 1 = , π‘Ž 2 = 2 1 If we multiply scalars 𝑦 1 β‰₯0 to both sides of the 1st constraint and 𝑦 2 β‰₯0 to the 2nd constraint and add the l.h.s. and r.h.s. respectively, we get 𝑦 1 +2 𝑦 2 π‘₯ 1 +(2 𝑦 1 + 𝑦 2 ) π‘₯ 2 ≀(3 𝑦 1 +4 𝑦 2 ) ….. (2’) In vector notation, 𝑦 1 π‘Ž 1 β€²+ 𝑦 2 π‘Ž 2 β€² π‘₯≀(3 𝑦 1 +4 𝑦 2 ) Any vector that satisfies (1’) also satisfies (2’), but converse is not true. Moreover, the coefficient vector in l.h.s. of (2’) is obtained by taking the nonnegative linear combination of the coefficient vectors in (1’) OR-1 Opt. 2018

3 x2 2x1+x2ο‚£4 y1(1, 2)+y2(2, 1), y1, y2 ο‚³ 0 x1+2x2ο‚£3 (1, 2) (2, 1)
(5/3, 2/3) (y1a1’+y2a2’)x ο‚£ (3y1+4y2) x1 (0, 0) OR-1 Opt. 2018

4 Getting Dual Problem ex) max 4 π‘₯ 1 + π‘₯ 2 +5 π‘₯ 3 +3 π‘₯ 4
s.t π‘₯ 1 βˆ’ π‘₯ 2 βˆ’ π‘₯ π‘₯ 4 ≀1 5 π‘₯ π‘₯ 2 +3 π‘₯ 3 +8 π‘₯ 4 ≀55 βˆ’ π‘₯ 1 +2 π‘₯ 2 +3 π‘₯ 3 βˆ’5 π‘₯ 4 ≀3 π‘₯ 1 , π‘₯ 2 , π‘₯ 3 , π‘₯ 4 β‰₯0 Lower bound on the optimal value : consider feasible solution (0, 0, 1, 0) οƒž 𝑧 βˆ— β‰₯5 (3, 0, 2, 0) οƒž 𝑧 βˆ— β‰₯22 Upper bound : consider inequality obtained by multiplying 0 to the 1st , 1 to the 2nd, and 1 to the 3rd constraints and add the l.h.s. and r.h.s. respectively 4 π‘₯ 1 + 3π‘₯ 2 +6 π‘₯ 3 +3 π‘₯ 4 ≀ …… (1) OR-1 Opt. 2018

5 Further, any feasible solution to the LP must satisfy
4 π‘₯ 1 + 3π‘₯ 2 +6 π‘₯ 3 +3 π‘₯ 4 ≀ …… (1) Since we multiplied nonnegative numbers on both sides of the inequalities, any vector that satisfies the three constraints (which includes feasible solutions to the LP) also satisfies (1). Hence any feasible solution to LP, which satisfies the three constraints and nonnegativity, also satisfies (1). Note that all the points satisfying 4 π‘₯ 1 +3 π‘₯ 2 +6 π‘₯ 3 +3 π‘₯ 4 ≀58 do not necessarily satisfy the three constraints in the LP. Further, any feasible solution to the LP must satisfy 4 π‘₯ 1 + π‘₯ 2 +5 π‘₯ 3 +3 π‘₯ 4 ≀4 π‘₯ 1 +3 π‘₯ 2 +6 π‘₯ 3 +3 π‘₯ 4 ( ο‚£ 58) since any feasible solution must satisfy nonnegativity constraints on the variables and the coefficients in the second expression is greater than or equal to the corresponding coefficients in the first expression (objective function). So 58 is an upper bound on the optimal value of the LP. OR-1 Opt. 2018

6 Now, we may use nonnegative weight 𝑦 𝑖 for π‘–βˆ’π‘‘β„Ž constraint .
max 4 π‘₯ 1 + π‘₯ 2 +5 π‘₯ 3 +3 π‘₯ 4 s.t π‘₯ 1 βˆ’ π‘₯ 2 βˆ’ π‘₯ π‘₯ 4 ≀1 5 π‘₯ π‘₯ 2 +3 π‘₯ 3 +8 π‘₯ 4 ≀55 βˆ’ π‘₯ 1 +2 π‘₯ 2 +3 π‘₯ 3 βˆ’5 π‘₯ 4 ≀3 π‘₯ 1 , π‘₯ 2 , π‘₯ 3 , π‘₯ 4 β‰₯0 Now, we may use nonnegative weight 𝑦 𝑖 for π‘–βˆ’π‘‘β„Ž constraint . 𝑦 1 +5 𝑦 2 βˆ’ 𝑦 3 π‘₯ 1 + βˆ’ 𝑦 1 + 𝑦 2 +2 𝑦 3 π‘₯ 2 + βˆ’ 𝑦 1 +3 𝑦 2 +3 𝑦 3 π‘₯ 𝑦 1 +8 𝑦 2 βˆ’5 𝑦 3 π‘₯ 4 ≀ 𝑦 𝑦 2 +3 𝑦 3 In vector notation, 𝑦 1 1,βˆ’1,βˆ’1,3 + 𝑦 2 5,1,3,8 + 𝑦 3 βˆ’1,2,3,βˆ’5 β€² π‘₯≀( 𝑦 𝑦 2 +3 𝑦 3 ) Objective function of the LP is 4 π‘₯ 1 + π‘₯ 2 +5 π‘₯ 3 +3 π‘₯ 4 OR-1 Opt. 2018

7 Hence as long as the nonnegative weights 𝑦 𝑖 ′𝑠 satisfy
𝑦 1 +5 𝑦 2 βˆ’ 𝑦 3 β‰₯4, βˆ’ 𝑦 1 + 𝑦 2 +2 𝑦 3 β‰₯1 βˆ’ 𝑦 1 +3 𝑦 2 +3 𝑦 3 β‰₯5, 3 𝑦 1 +8 𝑦 2 βˆ’5 𝑦 3 β‰₯3 we can use 𝑦 𝑦 2 +3 𝑦 3 as an upper bound on optimal value. To find more accurate upper bound (smallest upper bound), we want to solve Dual problem obtained. Note that the objective value of any feasible solution to the dual problem provides an upper bound on the optimal value of the given LP (called primal problem). OR-1 Opt. 2018

8 P: primal problem, D: dual problem
General form P: primal problem, D: dual problem maximize 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 maximize 𝑐 β€² π‘₯ (P) subject to 𝑗=1 𝑛 π‘Ž 𝑖𝑗 π‘₯ 𝑗 ≀ 𝑏 𝑖 , 𝑖=1,…,π‘š subject to 𝐴π‘₯≀𝑏 π‘₯ 𝑗 β‰₯0, 𝑗=1,2,…,𝑛 π‘₯β‰₯0 minimize 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 minimize 𝑏 β€² 𝑦 (D) subject to 𝑖=1 π‘š π‘Ž 𝑖𝑗 𝑦 𝑖 β‰₯ 𝑐 𝑗 , 𝑗=1,…,𝑛 subject to 𝑦 β€² 𝐴β‰₯𝑐′ 𝑦 𝑖 β‰₯0, 𝑖=1,2,…,π‘š 𝑦β‰₯0 OR-1 Opt. 2018

9 Thm: (Weak duality relation)
Suppose ( π‘₯ 1 ,…, π‘₯ 𝑛 ) is a feasible solution to the primal problem (P) and ( 𝑦 1 ,…, 𝑦 π‘š ) is a feasible solution to the dual problem (D), then 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 ≀ 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 (5.4) pf) 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 ≀ 𝑗=1 𝑛 ( 𝑖=1 π‘š π‘Ž 𝑖𝑗 𝑦 𝑖 ) π‘₯ 𝑗 = 𝑖=1 π‘š ( 𝑗=1 𝑛 π‘Ž 𝑖𝑗 π‘₯ 𝑗 ) 𝑦 𝑖 ≀ 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖  Cor: If we can find a feasible π‘₯ βˆ— to (P) and a feasible 𝑦 βˆ— to (D) such that 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ— = 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— , then x* is an optimal solution to (P) and y* is an optimal solution to (D). pf) For all feasible solution π‘₯ to (P), we have 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 ≀ 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— = 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ— . Similarly, for all feasible y to (D), we have 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— = 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ— ≀ 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 . ο‚… OR-1 Opt. 2018

10 (i.e., no duality gap, dual optimal value – primal optimal value = 0)
Thm 5.1: [Strong Duality Theorem] If (P) has an optimal solution ( π‘₯ 1 βˆ— ,…, π‘₯ 𝑛 βˆ— ), then (D) also has an optimal solution, say ( 𝑦 1 βˆ— ,…, 𝑦 π‘š βˆ— ), and 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ— = 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— . (i.e., no duality gap, dual optimal value – primal optimal value = 0) Note that strong duality theorem says that if (P) has an optimal solution, the dual (D) is neither unbounded nor infeasible, but always has an optimal solution. OR-1 Opt. 2018

11 Idea of the proof: Read the optimal solution of the dual problem from the coefficients of the slack variables in the zeroth equation from the optimal dictionary (tableau). ex) Note that the dual variables 𝑦 1 , 𝑦 2 , 𝑦 3 matches naturally with slack variables π‘₯ 5 , π‘₯ 6 , π‘₯ 7 . For example π‘₯ 5 is slack variable for the first constraint and 𝑦 1 is dual variable for the first constraint, and so on. OR-1 Opt. 2018

12 At optimality, the tableau looks
In the zeroth equation of the tableau, the coefficients of the slack variables are –11 for π‘₯ 5 , 0 for π‘₯ 6 , βˆ’6 for π‘₯ 7 . Assigning these values with reversed signs to the corresponding dual variables, we obtain desired optimal solution of the dual: 𝑦 1 =11, 𝑦 2 =0, 𝑦 3 =6. OR-1 Opt. 2018

13 Suppose we performed row operations
Idea of the proof: Note that the coefficients of π‘₯ 5 , π‘₯ 6 , and π‘₯ 7 in the zeroth equation show what numbers we multiplied to the corresponding equation and add them to the zeroth equation in the elementary row operations (net effect of many row operations) ex) Suppose we performed row operations (row2)  2ο‚΄(row 1) + (row 2), and then (row 0)  3ο‚΄(row 2) + (row 0). The net effect in zeroth equation is adding 6ο‚΄(row 1) + 3ο‚΄(row 2) to the zeroth equation and the scalar we multiplied can be read from the coefficients of π‘₯ 5 and π‘₯ 6 in the zeroth equation. OR-1 Opt. 2018

14 (ex-continued) (row 1)ο‚΄2 + (row 2) OR-1 Opt. 2018

15 (ex-continued) (row 2) ο‚΄ 3 + (row 0) OR-1 Opt. 2018

16 Example – Initial tableau
Optimal tableau OR-1 Opt. 2018

17 𝑦 𝑖 ≀0, 𝑖=1,…,π‘š (for slack variables)
Let 𝑦 𝑖 be the scalar we multiplied to the i-th equation and add to the zeroth equation in the net effect of many simplex iterations (elementary row operations). Then the coefficients of slack variables in the zeroth equation represent the 𝑦 𝑖 values we multiplied to the i-th equation for 𝑖=1,…,π‘š. Also the coefficients of structural variables in the zeroth equation are given as 𝑐 𝑗 + 𝑖=1 π‘š 𝑦 𝑖 π‘Ž 𝑖𝑗 , 𝑗=1,…,𝑛 Now in the optimal tableau, all the coefficients in the zeroth equation are ο‚£ 0, which implies 𝑦 𝑖 ≀0, 𝑖=1,…,π‘š (for slack variables) 𝑐 𝑗 + 𝑖=1 π‘š 𝑦 𝑖 π‘Ž 𝑖𝑗 ≀0, 𝑗=1,…,𝑛 (for stuctural variables) If we take (βˆ’ 𝑦 𝑖 ), 𝑖=1,…,π‘š as a dual solution, it is dual feasible. (βˆ’ 𝑦 𝑖 )β‰₯0, 𝑖=1,…,π‘š 𝑖=1 π‘š (βˆ’ 𝑦 𝑖 ) π‘Ž 𝑖𝑗 β‰₯ 𝑐 𝑗 , 𝑗=1,…,𝑛 OR-1 Opt. 2018

18 It is the idea of the proof.
Also the constant term in the zeroth equation gives the value 𝑖=1 π‘š 𝑦 𝑖 𝑏 𝑖 =βˆ’ 𝑖=1 π‘š (βˆ’ 𝑦 𝑖 ) 𝑏 𝑖 . So it is the negative of the dual objective value of the dual solution (βˆ’ 𝑦 𝑖 ), 𝑖=1,…,π‘š. Note that the constant term in the zeroth equation also gives the negative of the objective value of the current primal feasible solution. So we have found a feasible dual solution giving the dual objective value which is same as the objective value of the current primal feasible solution. From previous Corollary, the dual solution and the primal solution are optimal to the dual and the primal problem, respectively. It is the idea of the proof. OR-1 Opt. 2018

19 pf of strong duality theorem) Suppose we introduced slack variables
π‘₯ 𝑛+𝑖 = 𝑏 𝑖 βˆ’ 𝑗=1 𝑛 π‘Ž 𝑖𝑗 π‘₯ 𝑗 (𝑖=1,…,π‘š) and solve the LP by simplex and obtain optimal dictionary with 𝑧= 𝑧 βˆ— + π‘˜=1 𝑛+π‘š 𝑐 π‘˜ π‘₯ π‘˜ , 𝑐 π‘˜ ≀0 βˆ€π‘˜, 𝑧 βˆ— = 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ— . Let 𝑦 𝑖 βˆ— =βˆ’ 𝑐 𝑛+𝑖 , 𝑖=1,…,π‘š. We claim that 𝑦 𝑖 βˆ— , 𝑖=1,…,π‘š is an optimal dual solution, i.e. it satisfies dual constraints and 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— = 𝑧 βˆ— . Equate 𝑧= 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 and 𝑧= 𝑧 βˆ— + 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 + 𝑗=𝑛+1 𝑛+π‘š 𝑐 𝑗 π‘₯ 𝑗 οƒž 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 = 𝑧 βˆ— + 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 + 𝑗=𝑛+1 𝑛+π‘š 𝑐 𝑗 π‘₯ 𝑗 . This equation must be satisfied by any π‘₯ that satisfies the dictionary (excluding the nonnegativity constraints) since the set of feasible solutions satisfying the equations does not change. OR-1 Opt. 2018

20 Have 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 = 𝑧 βˆ— + 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 + 𝑗=𝑛+1 𝑛+π‘š 𝑐 𝑗 π‘₯ 𝑗 .
Substitute π‘₯ 𝑛+𝑖 = 𝑏 𝑖 βˆ’ 𝑗=1 𝑛 π‘Ž 𝑖𝑗 π‘₯ 𝑗 , 𝑖=1,…,π‘š into above. Since any feasible solution to the dictionary should satisfy this. οƒž 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 = 𝑧 βˆ— + 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ’ 𝑖=1 π‘š 𝑦 𝑖 βˆ— ( 𝑏 𝑖 βˆ’ 𝑗=1 𝑛 π‘Ž 𝑖𝑗 π‘₯ 𝑗 ) οƒž 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 = 𝑧 βˆ— βˆ’ 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— + 𝑗=1 𝑛 ( 𝑐 𝑗 + 𝑖=1 π‘š π‘Ž 𝑖𝑗 𝑦 𝑖 βˆ— ) π‘₯ 𝑗 Now this equation should hold for all feasible solutions to the dictionary (disregarding nonnegativity constraints). From the initial dictionary, we know that any feasible solution to the dictionary (disregarding nonnegativity constraints) can be obtained by assigning arbitrary values to π‘₯ 1 ,…, π‘₯ 𝑛 and setting π‘₯ 𝑛+𝑖 = 𝑏 𝑖 βˆ’ 𝑗=1 𝑛 π‘Ž 𝑖𝑗 π‘₯ 𝑗 , (𝑖=1,…,π‘š). Use these solutions. Note that, in the above equation, the variables π‘₯ 𝑛+𝑖 ′𝑠 do not appear. So it must hold for any value of π‘₯ 𝑗 , 𝑗=1,…,𝑛. OR-1 Opt. 2018

21 Equality must hold for all values of π‘₯ 1 ,…, π‘₯ 𝑛 .
𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 = 𝑧 βˆ— βˆ’ 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— + 𝑗=1 𝑛 ( 𝑐 𝑗 + 𝑖=1 π‘š π‘Ž 𝑖𝑗 𝑦 𝑖 βˆ— ) π‘₯ 𝑗 Equality must hold for all values of π‘₯ 1 ,…, π‘₯ 𝑛 . Hence 𝑧 βˆ— = 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— , 𝑐 𝑗 = 𝑐 𝑗 + 𝑖=1 π‘š π‘Ž 𝑖𝑗 𝑦 𝑖 βˆ— , 𝑗=1,…,𝑛 𝑐 π‘˜ ≀0 βˆ€π‘˜ οƒž 𝑐 𝑗 βˆ’ 𝑖=1 π‘š π‘Ž 𝑖𝑗 𝑦 𝑖 βˆ— ≀0 οƒž 𝑖=1 π‘š π‘Ž 𝑖𝑗 𝑦 𝑖 βˆ— β‰₯ 𝑐 𝑗 , 𝑗=1,…,𝑛. βˆ’ 𝑦 𝑖 βˆ— ≀0 οƒž 𝑦 𝑖 βˆ— β‰₯0, 𝑖=1,…,π‘š Hence 𝑦 βˆ— is dual feasible. Also we have that 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— = 𝑧 βˆ— = 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ— . Since 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— = 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ— ≀ 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 for all dual feasible 𝑦 (weak duality), 𝑦 𝑖 βˆ— , 𝑖=1,…,π‘š is an optimal dual solution and 𝑖=1 π‘š 𝑏 𝑖 𝑦 𝑖 βˆ— = 𝑗=1 𝑛 𝑐 𝑗 π‘₯ 𝑗 βˆ—  OR-1 Opt. 2018


Download ppt "Chapter 5. The Duality Theorem"

Similar presentations


Ads by Google