• Keine Ergebnisse gefunden

Numerical Optimization

N/A
N/A
Protected

Academic year: 2022

Aktie "Numerical Optimization"

Copied!
18
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Numerical Optimization

CHAPTER 13. ADMM

(2)

Today

ADMM: Alternating Direction Method of Multipliers

[Glowinski, R. & Marroco, A., 1975]

[D. Gabay & B. Mercier, 1976]

For an introduction: [Boyd et al., FnT ML, 2010]

Aim:

•   Understanding the algorithm

•   Convergence

•   Consensus formulation

(3)

Dual Ascent

min

x ∈ R n

f ( x )

s . t

. Ax = b

A ∈ R m × n , f : R n → R convex

Lagrangian L ( x ; y ) = f ( x ) + y T ( Ax − b ) y ∈ R m

Dual objective function: g ( y ) = inf

x

L ( x ; y )

Let x minimizes L (x ; y ) for a given y . Then

g (y ) = L (x ; y ) = f (x ) + y T (Ax − b)

(4)

Dual Problem

max

y

g ( y ) Ax b g ( y ) ,

x ∈ arg min

x

L ( x ; y )

Using subgradient ascent:

α k > 0 is a stepsize

x k +1 ∈ arg min

x

L (x ; y k )

y k +1 = y k + α k ( Ax k +1 − b )

(5)

Dual Subgradient Ascent

When alpha k is chosen carefully, and with additional assumptions, this procedure can produce

However, this requires conditions often do not hold in practice.

x k +1 ∈ arg min

x

L ( x ; y k )

y k +1 = y k + α k ( Ax k +1 − b )

x k → x , y k → y

(6)

Augmented Lagrangian

Lagrangian:

Augmented Lagrangian:

This is the Lagrangian function associated with an equivalent problem:

L ( x ; y ) = f ( x ) + y T ( Ax − b )

L ρ ( x ; y ) = f ( x ) + y T ( Ax − b ) + ρ

2 k Ax − b k 2 2 ρ > 0 : penalty parameter

min

x

f ( x ) + ρ

2 k Ax − b k 2

s . t . Ax = b

(7)

MM: Method of Multipliers

Dual ascent with augmented Lagrangian:

•   Converges in more general conditions than dual ascent

x k +1 ∈ arg min

x

L ρ ( x ; y k )

y k +1 = y k + ρ(Ax k +1 − b)

(8)

ADMM

min

x ∈ R n , z ∈ R m f ( x ) + g ( z ) s . t . Ax + Bz = c

A ∈ R p × n , B ∈ R p × m

Augmented Lagrangian:

L ρ ( x , z ; y ) = f ( x ) + g ( z ) + y T ( Ax + Bz − c ) + ρ

2 k Ax + Bz − c k 2 2

“Alternating direction” method of multipliers

x k +1 ∈ arg min

x

L ρ ( x , z k ; y k )

z k +1 ∈ arg min

z

L ρ ( x k +1 , z ; y k )

y k +1 = y k + ρ( Ax k +1 + Bz k +1 − b )

Coordinate-Wise Minimization

Dual Ascent

(9)

MM vs. ADMM

x k +1 ∈ arg min

x

L ρ ( x , z k ; y k )

z k +1 ∈ arg min

z

L ρ ( x k +1 , z ; y k )

y k +1 = y k + ρ( Ax k +1 + Bz k +1 − b )

( x k +1 , z k +1 ) ∈ arg min

x

L ρ ( x , z ; y k )

y k +1 = y k + ρ ( Ax k +1 + Bz k +1 − b )

MM

ADMM

(10)

Convergence of ADMM

Assumptions:

1. f : R n → R ∪ {∞}, g : R m → R ∪ {∞} are closed, proper, and convex

This implies that x-update and z-update are solvable, i.e., minimizers exist (but may not be unique)

epi f = {( x , t ) ∈ R n × R : f ( x ) ≤ t }

epi g = {( x , t ) ∈ R n × R : g ( x ) ≤ t }

are closed nonempty convex sets

(11)

Assumption 2:

With assumption 1, this implies that

L 0 has a saddle point, i.e. there exists (x , z , y ) s.t.

L 0 (x , z , y ) ≤ L 0 (x , z , y ) ≤ L 0 (x , z , y ), ∀ x , z , y

(x , z ) is a primal solution of min

x , z f (x ) + g (z ) s.t. Ax + Bz = c

y is a dual solution

(12)

Convergence of ADMM

Under assumptions 1 & 2,

r k := Ax k + Bz k − c → 0 as k → ∞ f ( x k ) + g ( z k ) → f + g as k → ∞

y k → y as k → ∞

Residual Objective Dual variable

Primal variables need not converge to optimal values, although such

results can be shown under additional assumptions

(13)

Convergence of ADMM

General convex case

•   Sublinear convergence O(1/k)

•   [He & Yuan, SIAM J Numerical Analysis, 2012]

Strongly convex case

•   Linear convergence

•   [Deng & Yin, Rice Univ. Tech rep, TR12-14, 2012]

(14)

Global Variable Consensus

x min ∈ R n f ( x ) =

N

!

i =1

f i ( x ) f i : R n → R ∪ {∞} convex

A global variable x is shared across f i ’s

x i ∈ R min n , z ∈ R n

N

!

i =1

f i ( x i )

s.t. x i − z = 0 , i = 1 , 2 , . . . , N .

A simple reformulation (global consensus problem):

(15)

ADMM for Global Consensus

Augmented Lagrangian:

L ρ (x 1 , . . . , x N , z ; y ) =

N

!

i =1

"

f i (x i ) + y i T (x i − z ) + ρ

2 "x i − z " 2 2 # ADMM:

x i k +1 = arg min

x

i

! f i ( x i ) + ( y i k ) T ( x i − z k ) + ρ

2 " x i − z k " 2 2 "

z k +1 = 1 N

N

#

i =1

$

x i k +1 + 1 ρ

y i k

%

k +1 k k +1 k +1

(16)

Simplification

⇒ z k +1 = ¯ x k +1 + 1 ρ y ¯ k

¯

x := 1 N

N

!

i =1

x i

⇒ by averaging, ¯ y k +1 = ¯ y k + ρ(¯ x k +1 − z k +1 )

∴ y ¯ k +1 = 0

∴ z k +1 = ¯ x k +1

ADMM:

x i k +1 = arg min

x

i

!

f i (x i ) + (y i k ) T (x i − z k ) + ρ

2 "x i − z k " 2 2 "

z k +1 = 1 N

N

#

i =1

$

x i k +1 + 1 ρ

y i k

%

y i k +1 = y i k + ρ ( x i k +1 − z k +1 )

(17)

Simplification

z k +1 = ¯ x k +1

ADMM:

x i k +1 = arg min

x

i

!

f i ( x i ) + ( y i k ) T ( x i − z k ) + ρ

2 " x i − z k " 2 2 "

z k +1 = 1 N

N

#

i =1

$

x i k +1 + 1 ρ y i k

%

y i k +1 = y i k + ρ ( x i k +1 − z k +1 )

Simplified ADMM:

x i k +1 = arg min

x i

f i ( x i ) + ( y i k ) T ( x i − x ¯ k ) + ρ

2 k x i − x ¯ k k 2 2

(18)

ADMM for Global Consensus

Each function access local data

Simplified ADMM:

x i k +1 = arg min

x i

f i ( x i ) + ( y i k ) T ( x i − x ¯ k ) + ρ

2 k x i − x ¯ k k 2 2 ⌘ y i k +1 = y i k + ρ( x i k +1 − x ¯ k +1 )

x 1 x 2

x N

. . .

. . .

y

f 1

f 2

. . .

f N

x 1

x 2

x N

¯ x x ¯

¯ x

Message-Passing Algorithm

•   Communication cost?

•   Data storage?

Referenzen

ÄHNLICHE DOKUMENTE

…durch die Auflösung der ersten Erde und ihren Sturz in den Abgrund … Und je nachdem, wie die Bruchstücke fielen … wäre die Erde generell voller Kavernen

Das Mädchen, das, glänzend beim Ball und der Assemblée, für Diesen ein Lächeln, für Jenen eine freundlichen Händedruck hat und darnach strebt, sich bewundert, begehrt zu wissen

(zu allen) Bittet doch alle meinen lieben Freund, mein liebes Kalifalkchen, daß er uns sage, wie meiner Tochter Bräutigam beschaffen seyn soll, um meines lieben Barbchens

Graphics and format ©2000-2008 abcteach® Pets May not be sold/redistributed without

Die wesentlichste Bemuehung gerade gegcnwaertlg muss fuer uns dabei sein, eigene Freiheit zu finden, selbstaendige Denker zu worden, die reif werden auch fuer politische Fragen und

If external lines are to be used then the corresponding port pins should be programmed as bit ports with the correct data direction. Finally, theCo~nter/Timer

This manual contains information on the GMX Micro-20 version of Technical Systems Consultants' UniFLEX Disk Operating.. information is specific to the GMX Micro-20

A breather filter on the drive housing cover reduces the ingress of contaminants through the bearings of the disk motor by minimising the pressure drop across