Download presentation
Presentation is loading. Please wait.
1
The Duality Theorem Primal P: Maximize π βΊ π₯ subject to π΄π₯β€π, π₯β₯0. Dual D: Minimize π βΊ π¦ subject to π΄ βΊ π¦β₯π, π¦β₯0. Weak Duality Theorem: If π₯ is feasible for P and π¦ is feasible for D, then: π βΊ π₯β€ π βΊ π¦ (Strong) Dualiy Theorem: If π₯ β is optimal for P and π¦ β is optimal for D, then: π βΊ π₯ β = π βΊ π¦ β
2
Generalized Duality Maximize Subject to Primal: Minimize Subject to
π 1 π₯ 1 + π 2 π₯ 2 + π 3 π₯ 3 Subject to π¦ 1 : π 11 π₯ 1 + π 12 π₯ 2 + π 13 π₯ 3 β€ π 1 π¦ 2 : π 21 π₯ 1 + π 22 π₯ 2 + π 23 π₯ 3 β₯ π 2 π¦ 3 : π 31 π₯ 1 + π 32 π₯ 2 + π 33 π₯ 3 = π 3 π₯ 1 β₯0, π₯ 2 β€0, π₯ 3 free Primal: Minimize π 1 π¦ 1 + π 2 π¦ 2 + π 3 π¦ 3 Subject to π₯ 1 : π 11 π¦ 1 + π 21 π¦ 2 + π 31 π¦ 3 β₯ π 1 π₯ 2 : π 12 π¦ 1 + π 22 π¦ 2 + π 32 π¦ 3 β€ π 2 π₯ 3 : π 13 π¦ 1 + π 23 π¦ 2 + π 33 π¦ 3 = π 3 π¦ 1 β₯0, π¦ 2 β€0, π¦ 3 free Dual:
3
Rules for Taking the Dual
Constraints in primal corresponds to variables in dual (and vice versa). Coefficients of objective function in primal corresponds to constants in dual (and vice versa). Primal (Maximization) Dual (Minimization) β€ for constraint β₯0 for variable β₯ for constraint β€0 for variable = for constraint free for variable
4
Certifying optimality
Suppose we wish to prove to someone that a solution π₯ β is optimal. If we supply both the optimal solution π₯ β and an optimal solution π¦ β to the dual, then one may easily verify that π₯ β is in fact optimal. Can we similarly prove a linear program is infeasible or unbounded?
5
Certifying infeasibility
Farkas Lemma: Exactly one of the following is true: There exist π₯ s.t. π΄π₯β€π. There exist π¦β₯0 s.t. π΄ β€ π¦=0 and π β€ π¦<0. Proof: Consider the following program and its dual P: max 0 s.t. π΄π₯β€π D: min π β€ π¦ s.t. π΄ β€ π¦=0, π¦β₯0
6
Proof of Farkas Lemma P: max 0 s.t. π΄π₯β€π D: min π β€ π¦ s.t. π΄ β€ π¦=0, π¦β₯0 Assume P is infeasible. Then D is either infeasible or unbounded. D is always feasible, hence unbounded. Thus we may find feasible solution π¦ with π β€ π¦<0.
7
The Complementary Slackness Property
Primal P: Maximize π βΊ π₯ subject to π΄π₯β€π, π₯β₯0. Dual D: Minimize π βΊ π¦ subject to π΄ βΊ π¦β₯π, π¦β₯0. Solutions π₯ and π¦ are said to satisfy Complementary Slackness if and only if: for all π=1,2,β¦,π (at least) one of the following holds: π=1 π π ππ π₯ π = π π (πth primal constraint has slack 0) π¦ π = (πth dual variable is 0) for all j=1,2,β¦,π (at least) one of the following holds: π=1 π π ππ π¦ π = π π (πth dual constraint has slack 0) π₯ π = (πth primal variable is 0)
8
Complementary Slackness Theorem
Primal P: Maximize π βΊ π₯ subject to π΄π₯β€π, π₯β₯0. Dual D: Minimize π βΊ π¦ subject to π΄ βΊ π¦β₯π, π¦β₯0. Theorem Let π₯ and π¦ be feasible solutions to P and D. Then: π₯ and π¦ are both optimal if and only if π₯ and π¦ satisfy complementary slackness.
9
Strict Complementary Slackness Property
Primal P: Maximize π βΊ π₯ subject to π΄π₯β€π, π₯β₯0. Dual D: Minimize π βΊ π¦ subject to π΄ βΊ π¦β₯π, π¦β₯0. Solutions π₯ and π¦ are said to satisfy Strict Complementary Slackness if and only if: for all π=1,2,β¦,π exactly one of the following holds: π=1 π π ππ π₯ π = π π (πth primal constraint has slack 0) π¦ π = (πth dual variable is 0) for all j=1,2,β¦,π exactly one of the following holds: π=1 π π ππ π¦ π = π π (πth dual constraint has slack 0) π₯ π = (πth primal variable is 0)
10
Strict Complementary Slackness Theorem
If P has an optimal solution then there exist optimal solution π₯ to P and π¦ to D that satisfy strict complementary slackness. Proof: Let π§ β be value of an optimal solution. Suppose π is such that π₯ π =0 in all optimal solutions. Then Pβ: max x j s.t. π΄π₯β€π, βπ β€ π₯β€β π§ β , π₯β₯0 has optimal value 0. Dβ: min π β€ π¦β π§ β π‘ s.t. π΄ β€ π¦βππ‘β₯ π π ,π¦β₯0,π‘β₯0
11
Proof (contβd) If π₯ π =0 in all optimal solutions, then Dβ: min π β€ π¦β π§ β π‘ s.t. π΄ β€ π¦βππ‘β₯ π π ,π¦β₯0,π‘β₯0 has optimal value 0. Let ( π¦ β , π‘ β ) be any optimal solution to Dβ. Case π‘ β =0: Then we have π β€ π¦ β =0 π΄ β€ π¦ β β₯ π π π¦ β β₯0 Let π¦ be optimal solution to D. Then π¦+ π¦ β is also optimal solution of D, and has nonzero slack in πth constraint.
12
Proof (contβd) Case π‘ β >0: Then we have π β€ π¦ β β π§ β π‘=0 π΄ β€ π¦ β βππ‘β₯ π π π¦ β , π‘ β β₯0 Define π¦= π¦ β / π‘ β . Then we have π β€ π¦= π§ β π΄ β€ π¦β₯π+ π π / π‘ β π¦ β β₯0 Hence π¦ is optimal solution to D and has nonzero slack in πth constraint.
13
Proof (contβd) For each π where all optimal solutions to P satisfies π₯ π =0 we have found an optimal solution π¦ (π) to D with nonzero slack in πth constraint. The average of all these π¦ (π) is an optimal solution to D with nonzero slack in πth constraint for all π we have considered. Finally do the same for the dual finding solution π₯. Then π₯ and π¦ satisfy strict complementary slackness.
14
Game Theory Just last week:
CMU poker AI player Libratus beats top human poker players in heads up no-limit Texas Holdβem. A monumental achievement! (Compare to Chess (1997), Jeopardy (2009), Go (2016)).
15
Guess the coin game Minnie hides in an envelope some money.
It is either a 1 DKK coin, a 5 DKK coin, or a 10 DKK coin. If Max can guess what it is, he gets the coin!
16
How should Max play? - Worst case approach
Max does not know Minnie and has no clue as to what coin she has hidden. He wants to find a strategy with a good worst case guarantee. He cannot get a worst case guarantee on his actual winnings. But he can get a non-trivial worst case guarantee on his expected winnings, if he uses a randomized strategy!
17
Guess the coin game - Minnie's perspective
Somehow Minnie has been convinced to play the game with Max. But she does actually not like to just give her money away. Can we find the randomized strategy for Minnie that minimizes her expected loss?
18
Matrix Games A Matrix Game is given by a payoff matrix π΄ =( π ππ )β π
πΓπ . Row player (Minnie) chooses row πβ 1,β¦,π (without seeing Max's move). Column player (Max) chooses column πβ{1,β¦π} (without seeing Minnies's move). Max gains π ππ βdollarsβ and Minnie loses π ππ βdollarsβ. Matrix games are zero-sum games as Max gains exactly what Minnie loses. Negative numbers can be used to model money transferred in the other direction. Warning: In many texts and old exam problems, the Row player is the maximizer and the Column player is the minimizer.
19
Guess the coin as a matrix game
Guess DKK 1 Guess DKK 5 Guess DKK 10 Hide DKK 1 1 Hide DKK 5 5 Hide DKK 10 10
20
Optimal randomized Strategy (for Max)
Play the game in a randomized way so that the expected gain is as big as possible, assuming worst case behavior of Minnie.
21
Column Players (Max's) Optimal randomized Strategy
Optimal randomized strategy and guaranteed lower bound on exp. gain for Max is ( π 1 , π 2 ,β¦, π π ;π) which is a solution to the LP: max π s.t. π=1 π π ππ π π β₯π π=1,β¦,π π=1 π π π =1 π π β₯0 π=1,β¦,π
22
Row Players (Minnie's) Optimal randomized Strategy
Optimal randomized strategy and guaranteed upper bound on exp. loss for Minnie is ( π 1 , π 2 ,β¦, π π ;π) which is a solution to the LP: min π s.t. π=1 π π ππ π π β€π j=1,β¦,π π=1 π π π =1 π π β₯0 i=1,β¦,π
23
Crucial observation Max's program and Minnie's program are each others duals!
24
Consequence For any Matrix game, Max's guaranteed lower bound on his expected gain when he plays his optimal randomized strategy is equal to Minnie's guaranteed upper bound on her expected loss when she plays her optimal randomized strategy. The common value is thus called the value of the game. A priori, this is not obvious - intuitively, the βoptimal" strategies are very timid and βpessimistic", as they are optimal only when assuming a worst case opponent.
25
"Guess the basis" trick for solving linear programs and matrix games
If we know the basis of a basic solution to an LP in standard form, we can find it directly by solving the system of linear equations setting the non-basic variables to zero. If we can guess the basis of an optimal solution to the primal and guess the basis of an optimal solution to the dual, we can therefore find the corresponding basic solutions by solving two systems of linear equations, and verify that both are optimal by checking that there is no gap between solution values.
26
"Guess the basis" trick for solving linear programs and matrix games
βknowing the basisβ = βknowing which variables are strictly positiveβ (unless the LP is degenerate) βDefault" guess for the bases of the LPs corresponding to a square matrix game: The probabilities of playing the row/columns are all strictly positive (no strategies are "stupid"), so the variables that should be set to zero are the slack variables of all inequalities.
27
Fair and symmetric games
A matrix game is called symmetric if its matrix is antisymmetric(!). Example: Rock/Scissors/Paper. Counterexample: Guess the coin. A matrix game is called fair if its value is 0. Theorem: All symmetric games are fair. The opposite is not necessarily true. Example?
28
Matrix algebra formulation
Note: If Max plays by (not necessarily optimal) randomized strategy π= π 1 ,β¦, π π and Minnie plays by the randomized strategy π= π 1 ,β¦, π π the expected gain of Max is π β€ π΄π. If the value of the game is π£, we have that π£= max π min π π β€ π΄π Proof: π£ is Max's best possible expected gain, assuming a worst case move of Minnie and hence also assuming a worst case randomized strategy of Minnie.
29
von Neuman Min-Max Theorem (Theorem 11.1)
For any real matrix π΄, max π min π π β€ π΄π = min π max π π β€ π΄π where π (resp. π) are arbitrary probability distributions on columns (resp. rows) of π΄.
30
Exploiting weak opponents
Can we play an optimal strategy and still exploit opponents that do not play by the optimal strategy? Exploit: Achieve a better expectation that the value of the game.
31
Bad news: Principle of indifference
Suppose π β is optimal for Max and π β is optimal for Minnie. Let πβ{1,2,β¦,π} be the set of rows to which π β assigns non-zero probability. Suppose Max plays π β . Then his expected payoff is the same, namely the value of the game, no matter which actual row in π Minnie chooses. Proof: Complementary Slackness Theorem! If π π β >0 then πth constraint is satisfied with equality by π β .
32
Exploiting weak opponents - good news!
Let πβ{1,2,β¦,π} be the set of rows to which some optimal strategy π β for Minnie assigns non-zero probability. There exists an optimal strategy π β for Max so that his expected payoff is strictly bigger than the value of the game if Minnie chooses a row that is not in π β . Proof: Strict Complementary Slackness Theorem!
33
Best replies and Nash equilibrium
Let π¦ be a randomized strategy for Minnie. A best reply to π¦ is a randomized strategy for Max that maximizes his expected winnings assuming that Minnie plays π¦. A pair of randomized strategies π₯ β and π¦ β is called a Nash equilibrium if they are best responses to each other. Theorem: A pair of randomized strategies ( π₯ β , π¦ β ) for a matrix game is a Nash equilibrium if and only if they are both optimal.
Similar presentations
© 2025 SlidePlayer.com Inc.
All rights reserved.