• Keine Ergebnisse gefunden

4.1 Algorithm

4.1.1 Calculation of Trial Steps

vj(k)gj(xk) 1

2σkgj(xk)2

)

=L(xk, vk) + 1

2σk∥g(xk)22 . (4.4) Here the weighted value of the constraint violation is added to the Lagrangian function (2.20) of the equality constrained optimization problem.

The subsequent section specifies the subproblems that are solved in each iteration in order to determine a trial step (dk, wk), wheredk denotes the step in the primal vari-ables andwk denotes the step in the dual variables. Moreover, a feasibility restoration phase is introduced that is entered if inconsistency in the subproblems occurs.

4.1.1 Calculation of Trial Steps

In each iteration k a trial step (dk, wk), which consists of a step dk in the primal variables and a step wk in the dual variables, has to be calculated. The exact deter-mination ofwk is explained later. Let(xk, vk)be the current iterate. The basic idea is to solve the standard SQP subproblem (3.1) extended by the trust region constraint

∥d∥k. The trust region constraint can easily be transformed into simple bound constraints, i.e., ∥d∥k can be replaced by 2n linear inequality constraints, that is

di+ ∆k 0, i= 1, . . . , n , (4.5) and

k−di 0, i= 1, . . . , n . (4.6) The resulting subproblem is a quadratic problem that can be solved by any available solver. No special strategy for handling the trust region constraint has to be added.

Reformulating the trust region constraint leads to the following subproblem formula-tion

minimize

d∈Rn ∇f(xk)Td+1

2dTBkd

subject to gj(xk) +∇gj(xk)Td= 0 , j ∈ E , gj(xk) +∇gj(xk)Td≥0, j ∈ I ,

k≤di k , i= 1, . . . , n ,

(4.7)

where Bk Rn×n is a symmetric matrix that approximates the Hessian of the La-grangian function (2.20) of the underlying problem.

In the algorithm Bk is required to be positive definite. Positive definiteness is suf-ficient for the global convergence theory. Additional requirements are only necessary for the local convergence theory. Then matrix Bk has to be a good approximation of the Hessian of the Lagrangian function in some sense.

If the feasible region of problem (4.7) is not empty, then the solution of (4.7) is de-noted by(dk, uk, µk), whereuk Rmis the Lagrangian multiplier vector corresponding to the linear constraints gj(xk) +∇gj(xk)Tdk, j = 1, . . . , m. To simplify the notation in the remainder of this work, µk Ris introduced and defined by

µk :=

n i=1

(

µ(k)

i +µ(k)i ) , (4.8)

whereµk:= (µ(k)1 , . . . , µ(k)n )T denotes the multipliers corresponding to the lower bounds (4.5) on step dk, and µk := (µ(k)1 , . . . , µ(k)n )T denotes the multipliers corresponding to the upper bounds (4.6), respectively. A more detailed derivation is stated in Section 4.2.

As illustrated by an example in Section 3.4, subproblem (4.7) can be infeasible and no solution exists. In order to overcome this situation, afeasibility restoration phase is introduced. Such a feasibility restoration phase is also used by the filter algorithm, see Algorithm 3.6 by Fletcher, Leyffer, and Toint [43]. This approach differs from other methods by the fact that the standard procedure is to solve the undisturbed subprob-lem (4.7). Approaches as the Vardi-like ones, the Celis-Dennis-Tapia ones or the one by Yuan, see earlier comments in Section 3.4, apply relaxation techniques in each iteration to guarantee consistency of the subproblems during the whole optimization process.

Consequently, one has to take care of additional safeguards to achieve convergence.

The aim of the strategy employed by the new algorithm presented here is to avoid the need of an additional penalty parameter in the subproblems. Therefore, subproblem (4.7) is solved whenever possible. Only in case the problem is inconsistent a switch to a different subproblem is performed. This strategy is also employed by an algorithm addressing equality constrained problems proposed by El-Alem [30]. El-Alem’s algo-rithm also tries to solve the equality constrained formulation of subproblem (4.7) first.

If the problem is infeasible, then a relaxed problem is solved to obtain a trial step.

During the restoration phase the trial steps are determined in two steps. First, the minimum constraint violation that can be achieved within the trust region bound is determined. Thefeasibility restoration problemthat is solved in this situation is defined as

minimize d∈Rn, δ∈R

j∈E∪Ak

gj(xk)2δ2

subject to gj(xk)(1−δ) +∇gj(xk)Td= 0, j ∈ E , gj(xk)(1−δ) +∇gj(xk)Td≥0, j ∈ Ak , gj(xk) +∇gj(xk)Td≥0, j ∈ Bk ,

k≤di k , i= 1, . . . , n , 0≤δ≤1 ,

(4.9)

where the sets Ak and Bk stand for A(xk,0) and B(xk,0) as defined by (2.15) and (2.16), respectively. This problem always has a solution, since (d, δ) = (0,1) is feasi-ble. The problem determines a relaxation parameter δk. Note that inactive inequality constraints, i.e., constraints in set Bk, are not relaxed and the linearized constraints remain satisfied. The solution of problem (4.9) be denoted byd¯k and δk.

After the required relaxation parameterδkhas been calculated, a second subproblem is set up, where the violated constraints are relaxed. It is stated as

minimize

In subproblem (4.10) the parameterδk remains fixed. Subproblem (4.10) is consistent, asd¯k is a feasible point for (4.10). The solution of (4.10) is also denoted by(dk, uk, µk), where uk is the multiplier vector with respect to the m linear approximations of the constraints and µk is obtained according to (4.8) with the corresponding multipliers µk and µk. Since problem (4.7) is inconsistent in this case, there exists at least one linearized constraint that had to be relaxed.

The algorithm employs an additional variable zkRm that measures the violation of the linearized constraints at the solution of (4.7) or (4.10), respectively. In case the standard subproblem (4.7) is feasible, then the m entries ofzk are set to zero since all linearized constraints are satisfied. If subproblem (4.10) is solved, then the vector zk is determined according to the corresponding subproblem, i.e., dk is either the minimizer of problem (4.7), if the subproblem is consistent, or the solution to the relaxed problem (4.10). The step wk in the dual variables is set to

w(k)j := wherezk is either zero, in case problem (4.7) is consistent, or defined by (4.11). Thus, the size of the dual step wk is also controlled in some sense during the feasibility

restoration phase. Definition (4.12) is motivated by the convergence analysis.

The following section introduces the model that is applied to evaluate the quality of the calculated trial steps.