CSE 203B: Convex Optimization Week 4 Discuss Session

Similar presentations


Presentation on theme: "CSE 203B: Convex Optimization Week 4 Discuss Session"β€” Presentation transcript:

1 CSE 203B: Convex Optimization Week 4 Discuss Session
Xinyuan Wang 01/30/2019

2 Contents Convex functions (Ref. Chap.3)
Review: definition, first order condition, second order condition, operations that preserve convexity Epigraph Conjugate function

3 Review of Convex Function
Definition ( often simplified by restricting to a line) First order condition: a global underestimator Second order condition Operations that preserve convexity Nonnegative weighted sum Composition with affine function Pointwise maximum and supremum Composition Minimization Perspective See Chap 3.2, try to prove why the convexity is preserved with those operations.

4 Epigraph 𝛼-sublevel set of 𝑓: 𝑅 𝑛 →𝑅 𝐢 𝛼 = π‘₯βˆˆπ‘‘π‘œπ‘š 𝑓 𝑓 π‘₯ ≀𝛼}
𝐢 𝛼 = π‘₯βˆˆπ‘‘π‘œπ‘š 𝑓 𝑓 π‘₯ ≀𝛼} sublevel sets of a convex function are convex for any value of 𝛼. Epigraph of 𝑓: 𝑅 𝑛 →𝑅 is defined as 𝐞𝐩𝐒 𝑓= (π‘₯,𝑑) π‘₯βˆˆπ‘‘π‘œπ‘š 𝑓,𝑓 π‘₯ ≀𝑑}βŠ† 𝑅 𝑛+1 A function is convex iff its epigraph is a convex set.

5 Relation between convex sets and convex functions
A function is convex iff its epigraph is a convex set. Consider a convex function 𝑓 and π‘₯, π‘¦βˆˆπ‘‘π‘œπ‘š 𝑓 𝑑β‰₯𝑓 𝑦 β‰₯𝑓 π‘₯ +𝛻𝑓 π‘₯ 𝑇 (π‘¦βˆ’π‘₯) The hyperplane supports epi 𝒇 at (π‘₯, 𝑓 π‘₯ ), for any 𝑦,𝑑 ∈𝐞𝐩𝐒 𝑓⇒ 𝛻𝑓 π‘₯ 𝑇 π‘¦βˆ’π‘₯ +𝑓 π‘₯ βˆ’π‘‘β‰€0 β‡’ 𝛻𝑓 π‘₯ βˆ’1 𝑇 𝑦 𝑑 βˆ’ π‘₯ 𝑓 π‘₯ ≀0 First order condition for convexity epi 𝒇 𝑑 π‘₯ Supporting hyperplane, derived from first order condition

6 Pointwise Supremum If for each π‘¦βˆˆπ‘ˆ, 𝑓(π‘₯, 𝑦): 𝑅 𝑛 →𝑅 is convex in π‘₯, then function 𝑔(π‘₯)= sup π‘¦βˆˆπ‘ˆ 𝑓(π‘₯,𝑦) is convex in π‘₯. Epigraph of 𝑔 π‘₯ is the intersection of epigraphs with 𝑓 and set π‘ˆ 𝐞𝐩𝐒 𝑔= ∩ π‘¦βˆˆπ‘ˆ 𝐞𝐩𝐒 𝑓(βˆ™ ,𝑦) knowing 𝐞𝐩𝐒 𝑓(βˆ™ ,𝑦) is a convex set (𝑓 is a convex function in π‘₯ and regard 𝑦 as a const), so 𝐞𝐩𝐒 𝑔 is convex. An interesting method to establish convexity of a function: the pointwise supremum of a family of affine functions. (ref. Chap 3.2.4)

7 Conjugate Function Given function 𝑓: 𝑅 𝑛 →𝑅 , the conjugate function
𝑓 βˆ— 𝑦 = sup π‘₯βˆˆπ‘‘π‘œπ‘š 𝑓 𝑦 𝑇 π‘₯ βˆ’π‘“(π‘₯) The dom 𝑓 βˆ— consists π‘¦βˆˆ 𝑅 𝑛 for which sup π‘₯βˆˆπ‘‘π‘œπ‘š 𝑓 𝑦 𝑇 π‘₯ βˆ’π‘“(π‘₯) is bounded Theorem: 𝑓 βˆ— (𝑦) is convex even 𝑓 π‘₯ is not convex. Supporting hyperplane: if 𝑓 is convex in that domain Slope 𝑦=βˆ’1 ( π‘₯ , 𝑓( π‘₯ )) where 𝛻 𝑓 𝑇 π‘₯ =𝑦 if 𝑓 is differentiable Slope 𝑦=0 (0, βˆ’ 𝑓 βˆ— (0)) (Proof with pointwise supremum) π’š 𝑻 𝒙 βˆ’π’‡(𝒙) is affine function in π’š

8 Examples of Conjugates
Derive the conjugates of 𝑓:𝑅→𝑅 𝑓 βˆ— 𝑦 = sup π‘₯βˆˆπ‘‘π‘œπ‘š 𝑓 𝑦π‘₯ βˆ’π‘“(π‘₯) Affine quadratic Norm See the extension to domain 𝑹 𝒏 in Example 3.21

9 Examples of Conjugates
Log-sum-exp: 𝑓 π‘₯ = log ( Ξ£ 𝑖=1 n 𝑒 π‘₯ 𝑖 ) , π‘₯∈ 𝑅 𝑛 . The conjugate is 𝑓 βˆ— 𝑦 = sup π‘₯βˆˆπ‘‘π‘œπ‘š 𝑓 𝑦 𝑇 π‘₯ βˆ’π‘“(π‘₯) Since 𝑓(π‘₯) is differentiable, first calculate the gradient of 𝑔 π‘₯ = 𝑦 𝑇 π‘₯ βˆ’π‘“(π‘₯) 𝛻𝑔 π‘₯ =π‘¦βˆ’ 𝑍 𝟏 𝑇 𝑍 where 𝑍= 𝑒 π‘₯ 1 , β‹― 𝑒 π‘₯ 𝑛 𝑇 . The maximum of 𝑔 π‘₯ is attained at 𝛻𝑔 π‘₯ =0, so that 𝑦= 𝑍 𝟏 𝑇 𝑍 . 𝑦 𝑖 = 𝑒 π‘₯ 𝑖 Ξ£ 𝑖=1 n 𝑒 π‘₯ 𝑖 , 𝑖=1,…, 𝑛 Does the bounded supremum (not β†’βˆž) exist for all π’šβˆˆ 𝑹 𝒏 ?

10 Examples of Conjugates
Does the bounded supremum (not β†’βˆž) exist for all π’šβˆˆ 𝑹 𝒏 ? If 𝑦 𝑖 <0, then the gradient on 𝛻 𝑖 𝑔= 𝑦 𝑖 βˆ’ 𝑒 π‘₯ 𝑖 Ξ£ 𝑖=1 n 𝑒 π‘₯ 𝑖 <0 for all π‘₯ 𝑖 βˆˆπ‘…. Then 𝑔 π‘₯ reaches maximum at ∞ at π‘₯β†’βˆ’βˆž. So 𝑦≽0. To have 𝑦= 𝑍 1 𝑇 𝑍 solvable, we notice that 𝟏 𝑇 𝑦= 𝟏 𝑇 𝑍 1 𝑇 𝑍 =1. If 𝟏 𝑇 𝑦≠1 and 𝑦≽0, we choose π‘₯=π‘‘πŸ s.t. 𝑔 π‘₯ = 𝑦 𝑇 π‘₯ βˆ’π‘“ π‘₯ =𝑑 𝑦 𝑇 πŸβˆ’ log 𝑛 𝑒 𝑑 =𝑑 𝑦 𝑇 πŸβˆ’1 βˆ’ log 𝑛 𝑔 π‘₯ reaches maximum at ∞ when we let π‘‘β†’βˆž(βˆ’βˆž), which is not bounded. The conjugate function 𝑓 βˆ— 𝑦 = Ξ£ 𝑖=1 n 𝑦 𝑖 log 𝑦 𝑖 if 𝟏 𝑇 𝑦=1 and 𝑦≽0 ∞ otherwise


Download ppt "CSE 203B: Convex Optimization Week 4 Discuss Session"

Similar presentations


Ads by Google