
5.2 Error estimation for DEIM reduced systems

5.2.4 Efficient error estimation

The error estimator introduced in Theorem 5.2.9 makes explicit use of the local logarithmic Lipschitz constant $L_G[f](x_r(t))$, which is (as is its global counterpart $L_G[f]$) not readily available in most practical situations.

Therefore, let $J : \Omega \to \mathbb{R}^{d \times d}$ denote the Jacobian $J(x)$ of $f$ at $x \in \Omega$. With the Taylor expansion of $f$ around $x_r(t)$ we obtain

$$\frac{\langle e(t), f(x(t)) - f(x_r(t)) \rangle_G}{\|e(t)\|_G^2} = \frac{\langle e(t), J(x_r(t))e(t) + \mathcal{O}(\|e(t)\|_G^2) \rangle_G}{\|e(t)\|_G^2} = \frac{\langle e(t), J(x_r(t))e(t) \rangle_G}{\|e(t)\|_G^2} + \mathcal{O}(\|e(t)\|_G) \qquad (5.33)$$

for any $t \in [0, T]$. With Definition 5.2.5 and (5.25, p.178) this directly gives a first order approximation of the local logarithmic Lipschitz constant

$$L_G[f](x_r(t)) = L_G[J(x_r(t))] + \mathcal{O}(\|e(t)\|_G). \qquad (5.34)$$

Using the approximation (5.34) avoids the need to obtain $L_G[f](x_r(t))$, but it comes at additional cost: the computation of the Jacobian logarithmic norm is expensive, as it involves solving an eigenvalue problem of high dimension $d$ due to the representation (5.26, p.178).
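To make this cost concrete, the following is a minimal NumPy sketch of the $d$-dimensional eigenvalue problem behind (5.34), assuming (as in Proposition 5.2.13 below) that the logarithmic norm is realized as the largest eigenvalue of the symmetric part of $C^T J C^{-T}$ with $G = CC^T$; the helper name log_norm_G and its interface are illustrative, not from the text.

import numpy as np
from scipy.linalg import solve_triangular, eigh

def log_norm_G(J, C):
    # L_G[J]: largest eigenvalue of the symmetric part of C^T J C^{-T},
    # a dense d x d eigenvalue problem of O(d^3) cost.
    M = solve_triangular(C, (C.T @ J).T, lower=True).T  # M = C^T J C^{-T}
    S = 0.5 * (M + M.T)                                 # symmetric part
    return eigh(S, eigvals_only=True)[-1]               # eigenvalues ascend

For $G = I$ (so $C = I$) this reduces to $\lambda_{\max}\big((J + J^T)/2\big)$, the standard logarithmic 2-norm.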

We will address this issue in the following discussion, where we propose to apply to the Jacobians a suitable partial similarity transformation designed to preserve the largest eigenvalues of their symmetric parts.

The key ingredient is to perform a POD of their corresponding eigenvectors, which in turn allows us to bound the resulting eigenvalue approximation error in terms of the remaining eigenvalues of their covariance matrix. We explain this idea for general symmetric matrices in the following theorem, and refer to [97, 176] for details on POD and related error estimates. Recall here the MatLab style notation introduced in Chapter 1.

Theorem 5.2.11 (Approximation of eigenvalues for a family of symmetric matrices). Let a continuous family of symmetric matrices $H(t) \in \mathbb{R}^{d \times d}$ over $t \in [a, b]$ be given and let $[\lambda(t), q(t)] := \lambda_{\max}(H(t))$ denote the largest eigenvalue $\lambda(t)$ with corresponding normalized eigenvector $q(t)$ of $H(t)$. Let $\sup_{t \in [a,b]} \|H(t)\| \leq C_H$ hold. Further, define

$$R = \int_a^b q(t)q(t)^T \, dt,$$

and let $Q\Sigma^2 Q^T = R$ be the eigen-decomposition of $R$ with

$$Q^T Q = I, \quad \Sigma = \operatorname{diag}(\sigma_1, \sigma_2, \ldots, \sigma_d), \quad \sigma_1 \geq \sigma_2 \geq \cdots \geq \sigma_d > 0.$$

For $k \leq d$ let $Q_k := Q[:,1:k]$ and $\lambda_k(t) := \lambda_{\max}(Q_k^T H(t) Q_k)$. Then

$$\int_a^b |\lambda(t) - \lambda_k(t)| \, dt \leq 4 C_H \sum_{j>k} \sigma_j^2.$$

Proof. First, define the vector-valued function $w(t) := \Sigma^{-1} Q^T q(t)$, $t \in [a, b]$. Note that

$$\int_a^b w(t)w(t)^T \, dt = \int_a^b \Sigma^{-1} Q^T q(t) q^T(t) Q \Sigma^{-1} \, dt = \Sigma^{-1} Q^T R Q \Sigma^{-1} = I.$$

Thus, $\int_a^b w_i(t) w_j(t) \, dt = \delta_{ij}$ (the Kronecker delta), where $w_i(t)$ is the $i$-th component of $w(t)$. Now, partition

$$Q = [Q_k, \tilde{Q}_k], \quad \Sigma = \operatorname{diag}(\Sigma_k, \tilde{\Sigma}_k), \quad w(t) = [w_k^T(t), \tilde{w}_k^T(t)]^T,$$

with $\tilde{Q}_k := Q(:,(k+1):d)$, with $\Sigma_k, \tilde{\Sigma}_k$ denoting the appropriate diagonal blocks of $\Sigma$ and $w_k(t), \tilde{w}_k(t)$ denoting the corresponding sub-vectors of $w(t)$. Put $q(t) = q_k(t) + \tilde{q}_k(t)$, with $q_k(t) := Q_k \Sigma_k w_k(t)$ and $\tilde{q}_k(t) := \tilde{Q}_k \tilde{\Sigma}_k \tilde{w}_k(t)$, and note that $q_k^T(t)\tilde{q}_k(t) = 0$ for all $t \in [a, b]$. We observe that

$$\int_a^b \tilde{q}_k^T(t)\tilde{q}_k(t) \, dt = \int_a^b \tilde{w}_k^T(t) \tilde{\Sigma}_k^2 \tilde{w}_k(t) \, dt = \sum_{j>k} \sigma_j^2 \int_a^b w_j^2(t) \, dt = \sum_{j>k} \sigma_j^2. \qquad (5.35)$$

In the following we omit the argument $t$ and set $q = q(t)$, $q_k = q_k(t)$, $\tilde{q}_k = \tilde{q}_k(t)$, etc. Then

$$q_k^T H q_k = (q - \tilde{q}_k)^T H (q - \tilde{q}_k) = \lambda - 2 q^T H \tilde{q}_k + \tilde{q}_k^T H \tilde{q}_k = \lambda - 2\lambda q^T \tilde{q}_k + \tilde{q}_k^T H \tilde{q}_k = \lambda - 2\lambda \tilde{q}_k^T \tilde{q}_k + \tilde{q}_k^T H \tilde{q}_k. \qquad (5.36)$$

Moreover, since $\mu \leq \|H\| \leq C_H$ for any eigenvalue $\mu$ of $H$, the definitions of $\lambda = \lambda(t)$ and $\lambda_k = \lambda_k(t)$ imply

$$\lambda = \sup_{v \neq 0} \frac{v^T H v}{v^T v} \geq \sup_{Q_k v_k \neq 0} \frac{v_k^T Q_k^T H Q_k v_k}{v_k^T v_k} = \lambda_k \geq \frac{q_k^T H q_k}{q_k^T q_k}. \qquad (5.37)$$

Combining (5.36) and (5.37), together with $\|q_k\|^2 + \|\tilde{q}_k\|^2 = \|q\|^2 = 1$, yields

$$0 \leq \lambda - \lambda_k \leq \lambda - \frac{q_k^T H q_k}{q_k^T q_k} = \lambda - \frac{\lambda - 2\lambda \tilde{q}_k^T \tilde{q}_k + \tilde{q}_k^T H \tilde{q}_k}{1 - \tilde{q}_k^T \tilde{q}_k} = \frac{\lambda \tilde{q}_k^T \tilde{q}_k - \tilde{q}_k^T H \tilde{q}_k}{1 - \|\tilde{q}_k\|^2} = \left( \lambda - \frac{\tilde{q}_k^T H \tilde{q}_k}{\tilde{q}_k^T \tilde{q}_k} \right) \frac{\|\tilde{q}_k\|^2}{1 - \|\tilde{q}_k\|^2} \leq \frac{2\|H\| \, \|\tilde{q}_k\|^2}{1 - \|\tilde{q}_k\|^2},$$

which is equivalent to

$$0 \leq (\lambda - \lambda_k)(1 - \|\tilde{q}_k\|^2) \leq 2\|H\| \, \|\tilde{q}_k\|^2.$$

Hence,

$$0 \leq \lambda - \lambda_k \leq (2\|H\| + \lambda - \lambda_k)\|\tilde{q}_k\|^2 \leq 4\|H\| \, \|\tilde{q}_k\|^2 \leq 4 C_H \|\tilde{q}_k\|^2,$$

and therefore, using (5.35),

$$\int_a^b |\lambda(t) - \lambda_k(t)| \, dt \leq 4 C_H \int_a^b \|\tilde{q}_k(t)\|^2 \, dt = 4 C_H \sum_{j>k} \sigma_j^2$$

as claimed.

Remark 5.2.12. If $R$ is rank deficient, the above argument is still valid simply by replacing $\Sigma^{-1}$ with the pseudo-inverse $\Sigma^+$.
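As a quick plausibility check of Theorem 5.2.11, the following self-contained NumPy sketch uses an illustrative randomly generated symmetric family $H(t)$, approximates $R$ by quadrature over the samples, and compares $\int_a^b |\lambda(t) - \lambda_k(t)| \, dt$ against the bound $4 C_H \sum_{j>k} \sigma_j^2$; the particular family and all names are assumptions of the sketch.

import numpy as np

rng = np.random.default_rng(0)
d, n, a, b = 20, 400, 0.0, 1.0
ts = np.linspace(a, b, n)

# Illustrative smooth symmetric family H(t)
B0, B1 = rng.standard_normal((d, d)), rng.standard_normal((d, d))
H = lambda t: 0.5 * ((B0 + t * B1) + (B0 + t * B1).T)

lam, q = [], []
for t in ts:
    w, V = np.linalg.eigh(H(t))           # ascending eigenvalues
    lam.append(w[-1]); q.append(V[:, -1])
lam, q = np.array(lam), np.column_stack(q)

C_H = max(np.linalg.norm(H(t), 2) for t in ts)
R = (b - a) / n * q @ q.T                 # quadrature for int q q^T dt
sig2, Q = np.linalg.eigh(R)
sig2, Q = sig2[::-1], Q[:, ::-1]          # sort sigma_j^2 descending

k = 5
Qk = Q[:, :k]
lam_k = np.array([np.linalg.eigvalsh(Qk.T @ H(t) @ Qk)[-1] for t in ts])
lhs = (b - a) / n * np.abs(lam - lam_k).sum()
rhs = 4 * C_H * sig2[k:].sum()
print(f"int |lam - lam_k| dt ~ {lhs:.3e}  vs  bound {rhs:.3e}")

Up to quadrature error, the left-hand side stays below the bound, and both shrink as $k$ increases.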

This result can be applied directly in the context of approximating the logarithmic norms of the local Jacobians.

Proposition 5.2.13 (Jacobian partial similarity transform). Let $CC^T$ denote the Cholesky decomposition of the weighting matrix $G$, and consider the family of symmetric matrices

$$H(t) := \frac{1}{2}\left( C^T J(x_r(t)) C^{-T} + \left( C^T J(x_r(t)) C^{-T} \right)^T \right).$$

Then we have the corresponding values $C_H > 0$, $\{\sigma_i\}_{i=1\ldots d}$ and $Q \in \mathbb{R}^{d \times d}$ from Theorem 5.2.11, which allow us to write

$$\lambda(t) = L_G[J(x_r(t))], \quad \lambda_k(t) = L_{I_k}\left[ Q_k^T C^T J(x_r(t)) C^{-T} Q_k \right]. \qquad (5.38)$$

Now we directly obtain the estimate

$$\int_0^T \left| L_G[J(x_r(t))] - L_{I_k}\left[ Q_k^T C^T J(x_r(t)) C^{-T} Q_k \right] \right| dt \leq 4 C_H \sum_{j>k} \sigma_j^2. \qquad (5.39)$$
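Given $Q_k$, evaluating $\lambda_k(t)$ only requires a $k \times k$ eigenvalue problem. A minimal sketch, reusing the conventions of the hypothetical log_norm_G helper above; note that it still touches the full $d \times d$ Jacobian, which is acceptable offline but is exactly what the MDEIM approximation below removes for the online phase.

import numpy as np
from scipy.linalg import solve_triangular

def log_norm_reduced(J, C, Qk):
    # L_{I_k}[Qk^T C^T J C^{-T} Qk]: k x k eigenproblem instead of d x d
    S = (C @ Qk).T @ J @ solve_triangular(C, Qk, lower=True, trans="T")
    return np.linalg.eigvalsh(0.5 * (S + S.T))[-1]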

Note here that in practice the matrix $R$ in Theorem 5.2.11 is not available. Instead, $Q$ is obtained as the set of left singular vectors of the singular value decomposition (SVD) of a snapshot matrix

$$S_n = \sqrt{\frac{b-a}{n}} \left[ q(t_1) \; \ldots \; q(t_n) \right].$$

Assuming equally spaced points $t_j \in [a, b]$ with $t_1 = a$, $t_n = b$, we have

$$\lim_{n \to \infty} S_n S_n^T = \int_a^b q(t)q(t)^T \, dt = R.$$

Thus, for sufficiently large $n$, the sum of the neglected squared singular values of $S_n$ will be arbitrarily close to the corresponding neglected eigenvalues of $R$ and can safely be used in the estimate. The detailed argument is similar to one given in [97]. For a given set of training data $X = \{x_i \mid i = 1 \ldots n\} \subseteq \Omega$, Algorithm 10 describes how to obtain $Q$ in practice. Notationally, the method $SVD$ performs the singular value decomposition $A = V \Sigma W^T$ for a matrix $A$.

Algorithm 10: Q = getTransformationMatrix(X)

1: $S_n \leftarrow [\,]$
2: for all $x_i \in X$ do
3:   $M \leftarrow C^T J(x_i) C^{-T}$
4:   $[\lambda_i, q_i] \leftarrow \lambda_{\max}\left(\frac{1}{2}(M + M^T)\right)$
5:   $S_n \leftarrow [S_n \; q_i]$
6: end for
7: $[Q, \Sigma, W^T] \leftarrow SVD\left(\sqrt{\frac{b-a}{n}}\, S_n\right)$
8: return $Q$
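For illustration, a direct NumPy translation of Algorithm 10 follows, assuming the Jacobian is available as a callable jac(x) and C is the lower-triangular Cholesky factor of G; the function and argument names are hypothetical.

import numpy as np
from scipy.linalg import solve_triangular

def get_transformation_matrix(X, jac, C, a, b):
    # POD of the dominant eigenvectors of the symmetrized, similarity-
    # transformed training Jacobians (Algorithm 10).
    cols = []
    for x in X:
        M = solve_triangular(C, (C.T @ jac(x)).T, lower=True).T  # C^T J(x) C^{-T}
        w, V = np.linalg.eigh(0.5 * (M + M.T))
        cols.append(V[:, -1])                    # eigenvector of lambda_max
    Sn = np.sqrt((b - a) / len(X)) * np.column_stack(cols)
    Q = np.linalg.svd(Sn, full_matrices=False)[0]  # left singular vectors
    return Q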

Before we can derive an efficient error estimator variant with the transformation introduced above, there is one problem left to deal with. The reduced matrix on the right hand side of (5.38) is of small size $k \times k$; however, its computation involves the transformed Jacobian $C^T J(x_r(t)) C^{-T} \in \mathbb{R}^{d \times d}$, which makes its computation infeasible during reduced simulations. Thus, we propose to apply a Matrix-DEIM approximation, which not only reduces evaluation costs for the Jacobian but also allows an efficient offline/online decomposition of (5.38), which we will discuss in Section 5.3. A similar idea named "Multi-Component EIM" has been formulated and successfully applied in [171, §4.3.2]. Consequently, for any $A \in \mathbb{R}^{d \times d}$ we define the transformation

$$\Upsilon : \mathbb{R}^{d \times d} \to \mathbb{R}^{d^2}, \quad A \mapsto \Upsilon[A] := \left[ A_1^T, A_2^T, \ldots, A_d^T \right]^T,$$

which maps the matrix entries of $A$ column-wise into a vector, where $A_i$ denotes the $i$-th column of $A$ (equivalent to the MatLab operation A(:), also known as the vec-operation).

Proposition 5.2.14 (Matrix DEIM). Choose $M_J \leq d^2$ and let $\hat{U}_{M_J}, \hat{P}_{M_J} \in \mathbb{R}^{d^2 \times M_J}$ denote the corresponding matrices for the DEIM basis and interpolation points obtained by application of the known DEIM approximation procedure from Section 2.3/[23] to the vector-valued function $\Upsilon[J(x)]$. Then, for $m_J \leq M_J$, the $m_J$-th order MDEIM approximation of $J$ is given via

$$\tilde{J}_{m_J}(x) := \Upsilon^{-1}\left[ \hat{U}_{m_J} (\hat{P}_{m_J}^T \hat{U}_{m_J})^{-1} \hat{P}_{m_J}^T \Upsilon[J(x)] \right],$$

where $\hat{U}_{m_J} := \hat{U}_{M_J}(:,1:m_J)$ and $\hat{P}_{m_J} := \hat{P}_{M_J}(:,1:m_J)$.

To this end, no extra assumptions on $f$ need to be made that are not already required by the standard DEIM, as the point-wise evaluations of the Jacobian entries can always be approximated via finite differences and point-wise evaluation of the underlying $f$. Of course, direct computation of Jacobian entries is preferred if possible. Moreover, the evaluation technique for reduced variables $Vz(t)$ carries over directly as proposed in the original work [23, §3.5]. We would also like to mention that any matrix approximation technique using linear combinations of basis matrices could be applied in this context; see [21] for an approach using a POD-basis with least squares weights and structure-preserving constraints.
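To make the interpolation formula of Proposition 5.2.14 concrete, the following NumPy sketch assumes a precomputed DEIM basis U_hat for $\Upsilon[J(x)]$, the interpolation indices idx (the columns of $\hat{P}_{m_J}$ as unit vectors), and a user-supplied callable jac_entries(x, idx) that evaluates only the selected Jacobian entries, e.g. via finite differences on $f$; all names are illustrative.

import numpy as np

def make_mdeim_jacobian(U_hat, idx, jac_entries, d):
    # U_hat: (d^2 x m_J) DEIM basis for vec(J(x)); idx: m_J DEIM indices.
    PU = U_hat[idx, :]                                 # P^T U, m_J x m_J
    def J_tilde(x):
        c = np.linalg.solve(PU, jac_entries(x, idx))   # interpolation coefficients
        return (U_hat @ c).reshape((d, d), order="F")  # invert the column-wise vec
    return J_tilde

The reshape with order="F" undoes the column-wise $\Upsilon$; only $m_J$ Jacobian entries are ever evaluated per call.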

Next we investigate the approximation quality of the logarithmic norm of the similarity transformed MDEIM Jacobian, $L_{I_k}\left[ Q_k^T C^T \tilde{J}_{m_J}(y_r(t)) C^{-T} Q_k \right]$, against that of the true Jacobians, $L_G[J(y_r(t))]$. Similar to Section 5.2.2, this is done outside the scope of the error estimation process, where we shall use the offline data from the 1D viscous Burgers model of Section 3.4.3 again. Figure 5.10 shows the maximum (left) and mean (right) logarithmic norm approximation error over $Y$ for different $m_J$ and $k$ values.

[Figure 5.10: Maximum (left) and mean (right) approximation error of the logarithmic norm over X]

While values of $m_J = 13$, $k = 15$ are sufficient on average to stay below 1% relative error, in the worst case $m_J = 28$, $k = 30$ are needed to ensure the same tolerance over $Y$. But even for the maximal $m_J = k = 50$, the relative errors are as low as $3.162 \times 10^{-4}$ (worst case) and $8.222 \times 10^{-6}$ (average). These results strongly indicate that both the similarity transformation and the MDEIM approximation of Propositions 5.2.13 and 5.2.14, respectively, are suitable for a low-cost but accurate approximation of Jacobian logarithmic norms.

Now, using the above results from Propositions 5.2.13 and 5.2.14, we derive an efficient approximation of the estimator introduced in Theorem 5.2.9, where the computational complexity for obtaining $\beta(t)$ (see (5.31c, p.184)) is independent of the high dimension $d$.

Theorem 5.2.15 (Estimator using local Jacobian logarithmic norms). Let the conditions from Theorem 5.2.8 and Propositions 5.2.13 and 5.2.14 hold and replace (5.31c, p.184) by

$$\beta(t) := L_{I_k}\left[ Q_k^T C^T \tilde{J}_{m_J}(x_r(t)) C^{-T} Q_k \right]. \qquad (5.40)$$

Then the estimator (5.31a, p.184) can be approximated arbitrarily closely with increasing $m_J$ and $k$, and is exactly reproduced for $m_J = d^2$, $k = d$.

Proof. We have $J \equiv \tilde{J}_{m_J}$ for $m_J = d^2$, as each entry of the Jacobian is interpolated. Together with (5.39, p.189) for $k = d$ we obtain equality of (5.31c, p.184) and (5.40).
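To sketch how (5.40) stays independent of $d$ online, the following combines the two previous hypothetical helpers: the $d \times d$ basis matrices hidden in U_hat are projected to $k \times k$ once offline, so each online evaluation needs only the $m_J$ selected Jacobian entries, an $m_J \times m_J$ solve, and a $k \times k$ eigenvalue problem. All names are assumptions of the sketch; the actual offline/online decomposition is the subject of Section 5.3.

import numpy as np
from scipy.linalg import solve_triangular

def make_beta(U_hat, idx, jac_entries, C, Qk, d):
    m = U_hat.shape[1]
    CQk = C @ Qk                                            # Qk^T C^T = CQk.T
    CiTQk = solve_triangular(C, Qk, lower=True, trans="T")  # C^{-T} Qk
    # Offline: project each d x d basis matrix to k x k once
    B = np.array([CQk.T @ U_hat[:, j].reshape((d, d), order="F") @ CiTQk
                  for j in range(m)])
    PU = U_hat[idx, :]
    def beta(x_r):
        c = np.linalg.solve(PU, jac_entries(x_r, idx))      # online, size m_J
        S = np.tensordot(c, B, axes=1)                      # sum_j c_j B_j, k x k
        return np.linalg.eigvalsh(0.5 * (S + S.T))[-1]      # k x k eigenproblem
    return beta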

Remark 5.2.16. One other obvious alternative to obtain an approximation of the Jacobian logarithmic norm is to use the reduced projected Jacobian $W^T J(x_r(t)) V$ and thus only solve an eigenvalue problem of size $r \ll d$. When also applying the MDEIM approximation of Proposition 5.2.14, the only real difference lies in the transformation with $V, W$ instead of $Q_k$, with the advantage of not having to compute $Q$. However, as the projection subspace spanned by $V$ is not designed to preserve any eigenvalues, and the results from Proposition 5.2.13 are "optimal" in a sense, this approach is expected to yield a lower approximation quality of the logarithmic norms.

Addressing this problem by incorporating suitable information into $V, W$ is certainly an interesting question for future work, but it does not essentially avoid the necessity of a training sample set of eigenvectors during subspace computation.