An Introduction to Finite Element Methods for Inverse Coeﬃcient Problems in Elliptic PDEs

(1)

https://doi.org/10.1365/s13291-021-00236-2 S U R V E Y A R T I C L E

An Introduction to Finite Element Methods for Inverse Coeﬃcient Problems in Elliptic PDEs

Bastian Harrach¹

Accepted: 3 May 2021 / Published online: 6 May 2021

Abstract

Several novel imaging and non-destructive testing technologies are based on recon- structing the spatially dependent coefficient in an elliptic partial differential equation from measurements of its solution(s). In practical applications, the unknown coefficient is often assumed to be piecewise constant on a given pixel partition (corresponding to the desired resolution), and only finitely many measurement can be made. This leads to the problem of inverting a finite-dimensional non-linear forward operator F: D(F)⊆Rⁿ→R^m, where evaluatingFrequires one or several PDE solutions.

Numerical inversion methods require the implementation of this forward operator and its Jacobian. We show how to efficiently implement both using a standard FEM package and prove convergence of the FEM approximations against their true- solution counterparts. We present simple example codes for Comsol with the Matlab Livelink package, and numerically demonstrate the challenges that arise from non- uniqueness, non-linearity and instability issues. We also discuss monotonicity and convexity properties of the forward operator that arise for symmetric measurement settings.

This text assumes the reader to have a basic knowledge on Finite Element Methods, including the variational formulation of elliptic PDEs, the Lax-Milgram- theorem, and the Céa-Lemma. Section3also assumes that the reader is familiar with the concept of Fréchet differentiability.

Keywords Finite element methods·Inverse problems·Finitely many measurements·Piecewise-constant coefficient

1 Introduction

Many practical reconstruction problems in the field of medical imaging and non- destructive testing lead to inverse coefficient problems in elliptic partial differential

^{B. Harrach}

harrach@math.uni-frankfurt.de

1 Institute for Mathematics, Goethe-University Frankfurt, Frankfurt am Main, Germany

(2)

equations. This text is meant to be an introductory tutorial for implementing such problems with Finite Element Methods (FEM).

We assume that the unknown coefficient is piecewise-constant on a given resolution, and that finitely many linear measurements of one of several solutions are taken, where different solutions are generated by different linear excitation in the underlying physics model. This leads to the finite-dimensional non-linear inverse problem of determining

σ∈Rⁿ from F(σ )∈R^m withn∈Nunknowns andm∈Nmeasurements.

Iterative numerical solution methods for this inverse problem require evaluating F and its derivatives at each iteration step, which means solving the underlying elliptic PDE. In this work, we will demonstrate how FEM-based implementations for F and its Jacobian can be obtained very efficiently from standard FEM-solvers for the considered elliptic PDE. Roughly speaking, the sensitivity of a measurement with respect to changing the coefficient in one pixel can be simply calculated by multiply- ing FEM-solutions corresponding to the measurement and excitation patterns with so-called pixel stiffness matrices that are obtained from summing up all element stiffness matrices of elements belonging to the pixel where the change occurs. Hence, the FEM-based Jacobian can be obtained without any additional computational burden with just a few lines of extra code. Alternatively, for an even simpler implementation, the pixel stiffness matrices can be easily obtained by subtracting global stiffness matrices without requiring any knowledge about the triangulation details.

This text is meant as a simple-to-read explanation of this approach in a sufficiently general but naturally arising setting. More precisely, we restrict ourselves to coercive and symmetric variational formulations that linearly depend on the unknown coefficients, and to excitations and measurements that correspond to linear functionals. In this setting, we demonstrate how to obtain the Jacobian of the FEM-based forward map with the means of a standard FEM software package such as COMSOL. We also discuss monotonicity and convexity properties arising in symmetric measurement sit- uations that are the basis for recent research on rigorously justified reconstruction methods.

The purpose of this text is of introductory nature, but we proceed in a mathe- matically rigorous fashion to allow this text to also serve as a reference. We prove differentiability of the true-solution forward operator and its FEM-based approximation, and show convergence of the FEM-approximated quantities to their true-solution counterparts.

Section2gives two examples to motivate our general setting: stationary diffusion and Elecrical Impedance Tomography. Section3introduces the forward operator using the exact PDE solution and derives its properties. The FEM-approximation of the forward operator and its Jacobian is studied in Sect.4. In Sect.5we show numerical examples and demonstrate some of the major challenges that arise in solving inverse coefficient problems. The COMSOL/MATLAB source codes for all examples are given in theAppendix.

(3)

Fig. 1 Pixel partition and circular subdomains used for excitations and measurements

2 Motivation and Examples 2.1 Stationary Diﬀusion

We consider the stationary diffusion equation

−∇ ·(σ∇u)=g in (1) with homogeneous Dirichlet boundary conditionu|∂=0 in a Lipschitz bounded domain⊂R^d,d∈N. Foru∈H¹(),σ∈L^∞₊()andg∈L²()the equation is equivalent to findingu∈H₀¹()with

σ∇u· ∇vdx=

gvdx for allv∈H₀¹(), (2) and unique solvability follows from the Lax-Milgram theorem.

We are interested in the inverse coefficient problem of determining the diffusivity coefficient σ in (1) from measurements of the solution for one or several source termsg, cf. [9] for an application in groundwater filtration. In practical applications with finitely many measurements, it is natural to only aim for a certain pixel-based resolution and therefore assume thatσis piecewise constant with respect to a partition =_n

i=1Pi, i.e.

σ (x)= n

i=1

σ_iχ_P_i(x) for allx∈,

where the pixelsPi ⊆are assumed to be measurable subsets. The left image in Fig.1shows a simple example where the unit square=(0,1)²is divided into 3×3 pixels. In the following, with a slight abuse of notation, we writeσ=(σ₁, . . . , σ_n)^T ∈ Rⁿfor the unknown diffusivity.

(4)

Fig. 2 PDE solution for source terms on circular subdomains

The source term g in the diffusion model (1) can be identified with the linear functional on the right hand side of the variational formulation (2)

l∈H⁻¹(), l(v):=

gvdx,

which corresponds to identifyingL²()with a subset ofH⁻¹(). Accordingly, we consider excitations in the form of linear functionals. Also, to emphasize that the solution depends on the diffusion coefficient and the excitation, we writeu^l_σ in the following. The left image in Fig.2illustrates the concentration resulting from a constant source termg=χ_D, i.e.l(v)=

Dvdx, whereD=D2∪D4∪D5∪D7 is a union of four circular subdomains as sketched in the right image of Fig.1. The right image in Fig.2 shows the corresponding plot forD=D₁∪D₃∪D₆∪D₈. Both images show the solution of (1) with constant diffusion coefficientσ=1.

Natural models for measuring the solution of (1) also yield to linear functionals.

Measuring the total concentration in one of the circular subdomainsDj corresponds to measuringr(u):=

Djudx. Hence, the inverse problem of determining finitely many information about the diffusivity coefficient from finitely many measurements of the concentration (possibly but not necessarily resulting from different excitations) leads to the finite-dimensional inverse problem to

determine σ∈Rⁿ₊ from F(σ )∈R^m, where

F: Rⁿ₊→R^m, F(σ ):=(r_j(u^l_σ^j))^m_j₌₁, andu^lσ^j∈H₀¹()solves

n

i=1

σ_ib_i(u^l_σ^j, v)=l_j(v) for allv∈H₀¹(),

(5)

with givenl_j, r_j∈H⁻¹(),j=1, . . . , m, and b_i(u, v):=

Pi

∇u· ∇vdx, i=1, . . . , n.

2.2 Electrical Impedance Tomography (EIT)

We give another example for an application that leads to an inverse elliptic coefficient problem in a similar form as the diffusion example.

EIT aims to image the inner conductivity structure of a subject by current and voltage measurements through electrodes attached to the imaging subject. Let⊆ R^d,d ∈ {2,3}, be a smoothly bounded domain denoting the imaging subject. The electrodesEk,k=1, . . . , K, are assumed to be open connected subsets of∂with disjoint closures.

When currents with strengthJ =(J1, . . . , J_K)∈R^K are driven through the K electrodes (with_K

k=1Jk=0), the resulting electric potentialu∈H¹()inside, and the potentialU∈R^Kon the electrodes, solve

∇ ·(σ∇u)=0 in,

σ ∂_νu=0 on∂\

K

k=1

Ek,

u+zσ ∂_νu=const.=:U_k onEk, k=1, . . . , K,

Ek

σ ∂_νu|_E_kds=J_k onEk, k=1, . . . , K,

whereσ∈L^∞₊()is the conductivity inside, andz >0 is the contact impedance of the electrodes.

Under the gauge conditionU∈R^K:= {V ∈R^K: _K

k=1V_k=0}, one can show (see [19]) that this so-called complete electrode model (CEM) for EIT is equivalent to the variational formulation that(u, U )∈H¹()×R^K solves

σ∇u· ∇wdx+ K

k=1

Ek

1

z(u−U_k)(w−W_k)ds= K

k=1

J_kW_k (3)

for all(w, W )∈H¹()×R^K, and unique solvability follows from the Lax-Milgram theorem.

We assume thatz >0 is known, and that σ (x)=_n

i=1σiχ_P_i(x)is piecewise constant with respect to a pixel partition=_n

i=1Pi, and writeσ=(σ₁, . . . , σ_n)∈ Rⁿfor the unknown conductivity values inside.

The applied current patternsJ =(J₁, . . . , J_K)∈R^K can be identified with the functional

l∈H, l(w, W ):=

K

k=1

J_kW_k for all(w, W )∈H:=H¹()×R^K.

(6)

Likewise, measuring the voltage between the k₁-th and the k₂-th electrode corresponds to measuring the linear functional

r∈H, r(u, U ):=U_k₁−U_k₂, of the solution(u, U )∈H generated by some current pattern.

Hence, the problem of determining the interior conductivity with a fixed finite resolution from finitely many voltage-current measurements in EIT (with CEM) leads to the finite-dimensional inverse problem to

determine σ∈Rⁿ₊ from F(σ )∈R^m,

whereF: Rⁿ₊→R^m,F(σ ):=(rj(u^lσ^j, Uσ^l^j))^m_j₌₁, and(u^lσ^j, Uσ^l^j)∈H solves

b₀((u^l_σ^j, U_σ^l^j), (w, W ))+ n

i=1

σ_ib_i((u^l_σ^j, U_σ^l^j), (w, W ))=l_j(w, W )

for all(w, W )∈H, with givenl_j, r_j ∈H,j=1, . . . , m, and

b₀((u, U ), (w, W )):=

K

k=1

Ek

1

z(u−U_k)(w−W_k)ds, bi((u, U ), (w, W )):=

Pi

∇u· ∇wdx.

Clearly, one could also extend this formulation to cover the case of unknown contact impedances.

3 The True-Solution Setting

The examples in Sect.2lead to inverse problems for a finite-dimensional non-linear forward operatorF: Rⁿ₊→R^m, where evaluations ofFrequire solving an infinite- dimensional linear problem (the PDE). In this section, we will first derive some properties ofFfor the case that it is defined with the true infinite-dimensional PDE solution. The properties of the operatorF≈F, that is defined with a FEM-approximation of the PDE solution, will be studied in Sect.4.

3.1 The True-Solution Forward Operator and Its Derivative

We will study problems that appear in the variational formulation of elliptic PDEs with piecewise constant coefficients on a fixed pixel partition, as in the examples in Sect.2.

(7)

The Variational Setting LetHbe a Hilbert space. We consider the problem of finding u∈Hthat solves

bσ(u, v)=l(v), (4)

whereb_σ: H×H→Ris a bilinear form, andl∈H=L(H,R).b_σ is assumed to linearly depend onnparametersσ=(σ₁, . . . , σ_n)∈Rⁿin the following way

b_σ(u, v)=b0(u, v)+ n

i=1

σ_ib_i(u, v),

whereb0, bi : H×H→Rare bounded, symmetric and positive semidefinite bilinear forms. Writing1:=(1, . . . ,1)^T ∈Rⁿ, we also assume thatb₁is bounded and coercive with constantsβ, C >0, i.e.,

Cv²≥b₁(v, v)=b₀(v, v)+ n

i=1

b_i(v, v)≥βv² ∀v∈H.

Clearly, this yields that for allσ∈Rⁿ₊

Cmax{1, σ1, . . . , σ_n}v²≥b_σ(v, v)≥βmin{1, σ1, . . . , σ_n}v² ∀v∈H, (5) so thatb_σ is symmetric, bounded and coercive. Here and in the followingRⁿ₊denotes the set of allσ ∈Rⁿ withσ >0 and “>” and “≥” are understood elementwise on Rⁿ.

The True-Solution Forward Operator We now characterize the derivative of the solution of (4) with respect toσ.

Lemma 1 Letl∈H. The solution operator

S: Rⁿ₊→H, S(σ ):=u^l_σ, whereu^l_σ∈Hsolves (4), is infinitely often Fréchet differentiable. Its first derivative

S: Rⁿ₊→L(Rⁿ, H )

fulfills that, for allσ∈Rⁿ₊andτ∈Rⁿ,S(σ )τ∈His the unique solution of b_σ(S(σ )τ, w)= −

n

i=1

τ_ib_i(u^l_σ, w) ∀w∈H.

Also, forr∈H,σ∈Rⁿ₊, andτ∈Rⁿ, r(u^l_σ)=b_σ(u^l_σ, u^r_σ) and r

S(σ )τ = − n

i=1

τ_ib_i(u^l_σ, u^r_σ).

(8)

Proof For σ ∈Rⁿ₊, the Riesz theorem yields that there exists a unique operator B(σ )∈L(H, H)associated to the bilinear formb_σ(·,·), i.e.

B(σ )u, vH×H=b_σ(u, v) for allu, v∈H.

Clearly,B(σ ) is symmetric and, by the Lax-Milgram theorem,B(σ ) is invertible with symmetric inverseB(σ )⁻¹∈L(H, H ). Hence, (4) is uniquely solvable, and the solution operatorSis well-defined.

It is easily checked, thatB(σ )is Fréchet differentiable for everyσ∈Rⁿ₊, and that its derivativeB(σ )∈L(Rⁿ,L(H, H))is given by

B(σ )τ= n

i=1

τ_iBi for allσ∈Rⁿ₊, τ∈Rⁿ, whereBi∈L(H, H)is the unique operator fulfilling

(Biu, v)_H×H=b_i(u, v) for allu, v∈H.

Since B(σ ) does not depend on σ, this also shows that B(σ ) is infinitely often Fréchet differentiable with all second and higher derivatives being zero.

Using the derivative of operator inversion and the product and chain rule for the Fréchet derivative, we thus obtain thatS(σ )is infinitely often Fréchet differentiable with

S(σ )τ= −B(σ )⁻¹(B(σ )(τ ))B(σ )⁻¹l= − n

i=1

τ_iB(σ )⁻¹Biu^l_σ.

Hence,v=S(σ )τ∈H solves bσ(v, w)= −

n

i=1

τiBiu^l_σ, wH×H= − n

i=1

τibi(u^l_σ, w) ∀w∈H.

Moreover, we obtain for allr∈H, by using the symmetry ofB(σ ), r(u^l_σ)= B(σ )B(σ )⁻¹r, u^l_σH×H=b_σ(u^l_σ, u^r_σ), and

r

S(σ )τ = r,S(σ )τH×H= B(σ )S(σ )τ,B(σ )⁻¹rH×H

= − n

i=1

τ_iBiu^l_σ, u^r_σ,H×H= − n

i=1

τ_ib_i(u^l_σ, u^r_σ),

which finished the proof.

Corollary 1 Letl, r∈H. Then the mapping

Fl,r: Rⁿ₊→R, Fl,r(σ ):=r(u^l_σ)

(9)

fulfills

Fl,r(σ )=b_σ(u^l_σ, u^r_σ) for allσ∈Rⁿ₊.

Moreover, Fl,r : Rⁿ₊→R is infinitely often differentiable and its first derivatives fulfill

∂

∂σ_iFl,r(σ )= −bi(u^l_σ, u^r_σ).

Proof This follows from Lemma1.

3.2 Convexity and Monotonicity for Symmetric Measurements

A special mathematical structure appears for measurementsFl,r, whenl andr are taken from the same subset ofH, and all combinations are used. In the stationary diffusion example this corresponds to using the same subsets of both for excitations and concentration measurements, in EIT this corresponds to using the same electrodes for voltage and current measurements.

Given a set ofm∈Nexcitations/measurements{l1, . . . , l_m} ⊂H, we combine the measurements into a matrix-valued mapF: Rⁿ₊→R^m^×^m

F(σ )=(Fj,k(σ ))_j,k₌_1,...,m∈R^m^×^m, Fj,k(σ )=Flj,lk(σ )=l_k(u^l_σ^j).

As before, we write “≥” for the elementwise order on Rⁿ. We also writeSm⊆ R^m^×^mfor the subset of symmetricm×m-matrices, and “” for the Loewner order onSm, i.e.BAdenotes thatB−Ais positive semi-definite.

Lemma 2 F: Rⁿ₊→R^m×mhas the following properties:

(a) Fis infinitely often differentiable.

(b) For all σ ∈ Rⁿ₊, F(σ ) ∈ Sm and F(σ ) 0. F(σ ) is positive definite if l₁, . . . , l_m∈Hare linearly independent.

(c) Fis monotonically non-increasing, i.e.

F(σ )τ0 for all σ ∈Rⁿ₊, 0≤τ ∈Rⁿ, (6) and for allσ⁽¹⁾, σ⁽²⁾∈Rⁿ₊

σ⁽¹⁾≤σ⁽²⁾ implies F(σ⁽¹⁾)F(σ⁽²⁾). (7) (d) Fis convex, i.e., for allσ, σ⁽⁰⁾∈Rⁿ₊,

F(σ )−F(σ⁽⁰⁾)F(σ⁽⁰⁾)(σ−σ⁽⁰⁾), (8) and, for allt∈ [0,1],

F((1−t )σ⁽⁰⁾+t σ )(1−t )F(σ⁽⁰⁾)+tF(σ ). (9)

(10)

Proof Corollary1shows that each component ofF is infinitely often differentiable so that (a) is proven.

For the rest of the proof letσ ∈Rⁿ₊,g∈R^m, and setl:=_m

j=1gjlj. By Corol- lary1,

l_k(u^l_σ^j)=b_σ(u^l_σ^j, u^l_σ^k)=b_σ(u^l_σ^k, u^l_σ^j)=l_j(u^l_σ^k), so thatF(σ )is a symmetric matrix. Moreover,

g^TF(σ )g= m

j,k=1

g_jl_k(u^l_σ^j)g_k= m

j,k=1

g_jg_kb_σ(u^l_σ^j, u^l_σ^k)=b_σ(u^l_σ, u^l_σ)≥0, so thatF(σ )0. Ifg=0 andl1, . . . , lm∈H are linearly independent thenl=0, which impliesu^l_σ =0 and thusg^TF(σ )g >0. Hence, (b) is proven.

To prove (c) and (d), we start by using again Corollary1and obtain g^T(F(σ )τ )g= −

n

i=1

τibi(u^l_σ, u^l_σ) for allτ∈Rⁿ. Since the bilinear formsb_i(·,·)are positive semi-definite, this implies (6).

To prove (8), letσ⁽⁰⁾∈Rⁿ₊. For brevity we writeu^l₀:=u^l_σ

0. Using bσ(u^l_σ, u^l₀)=l(u^l₀)=bσ0(u^l₀, u^l₀),

we obtain that

0≤bσ(u^l_σ−u^l₀, u^l_σ −u^l₀)=bσ(u^l_σ, u^l_σ)−2bσ(u^l_σ, u^l₀)+bσ(u^l₀, u^l₀)

=b_σ(u^l_σ, u^l_σ)−2bσ0(u^l₀, u^l₀)+b_σ(u^l₀, u^l₀)

=g^T(F(σ )−F(σ⁽⁰⁾))g+b_σ(u^l₀, u^l₀)−b_σ₀(u^l₀, u^l₀)

=g^T(F(σ )−F(σ⁽⁰⁾)g+ n

i=1

(σi−σ_i⁽⁰⁾)bi(u^l₀, u^l₀).

This shows that

g^T(F(σ )−F(σ⁽⁰⁾))g≥ − n

i=1

(σ_i−σ_i⁽⁰⁾)b_i(u^l₀, u^l₀)=g^TF(σ⁽⁰⁾)(σ−σ⁽⁰⁾)g, so that (8) holds. Together with (6) this also implies (7).

(9) follows from (8) by the following standard argument. Let σ, σ⁽⁰⁾∈Rⁿ₊,t∈ [0,1], and set

σ^{(t )}:=t σ +(1−t )σ⁽⁰⁾∈Rⁿ₊.

Using (8) onF(σ )−F(σ^{(t )})andF(σ⁽⁰⁾)−F(σ^{(t )}), we then obtain that (1−t )F(σ⁽⁰⁾)+tF(σ )−F(σ^{(t )})

(11)

=(1−t )(F(σ⁽⁰⁾)−F(σ^{(t )}))+t (F(σ )−F(σ^{(t )})) (1−t )F(σ^{(t )})(σ⁽⁰⁾−σ^{(t )})+tF(σ^{(t )})(σ−σ^{(t )})

=F(σ^{(t )})((1−t )σ⁽⁰⁾+t σ −σ^{(t )})=0,

which proves (9).

4 The FEM Setting

4.1 The FEM-Approximated Forward Operator and Its Derivative

The Finite Element Method The Finite Element Method numerically approximates the solution of (4) by solving it in a finite-dimensional subspaceV ⊂H, e.g. the subspace of continuous, piecewise linear functions on a fixed triangulation. Let

1, . . . , _N denote a basis of V, e.g. the so-called hat functions for linear finite elements. Then the finite-dimensional variational problem

˜

u^l_σ ∈V solves bσ(u˜^l_σ, v)=l(v) for allv∈V (10) is equivalent to

˜ u^l_σ =

N

j=1

λ_j _j, where B_σλ=y^l, (11)

withλ=(λ_j)^N_j₌₁∈R^N, and the so-called stiffness matrix and load vector B_σ∈R^N^×^N, with(j, k)-th entry given byb_σ( _j, _k),

y^l∈R^N, withj-th entry given byl( j).

It follows from the Lax-Milgram theorem that (10) is uniquely solvable and thatB_σ is a symmetric, positive definite (and thus invertible) matrix. Moreover, the Céa-Lemma yields that the FEM approximationu˜^l_σ ∈V is as good an approximation to the true solutionu^l_σ∈Has elements of the finite-dimensional spaceV can be:

u^l_σ− ˜u^l_σ ≤C_σ β_σ inf

v∈Vu^l_σ−v, (12)

whereCσ:=Cmax{1, σ1, . . . , σn}, andβσ :=βmin{1, σ1, . . . , σn}are the continuity and coercivity constants ofb_σ, cf. (5).

Pixel Stiﬀness Matrices Finite element software packages include triangulation algo- rithms, assembling routines for the global stiffness matrixBσ and the load vectory^l, and efficient solvers for the linear systemB_σλ=y^l. For our setting where

b_σ(u, v)=b₀(u, v)+ n

i=1

σ_ib_i(u, v),

(12)

Fig. 3 A coarser and a finer FEM-mesh for the diffusion example, both complying with the pixel partition and the measurement/excitation subdomains

we will also require the pixel stiffness matrices

Bi∈R^N^×^N, with(j, k)-th entry given by bi( j, k).

The assembling ofBσ is usually done by writing it as a weighted sum of element stiffness matrices. In our setting, it is natural to assume that the pixel partition complies with the FEM triangulation, i.e., that each pixel is a union of triangulation elements. Figure3shows a coarser and a finer FEM mesh for the diffusion example, both complying with the pixel partition and with the subdomains that are used for measurements and excitations. Hence, during the assembly of the global stiffness matrixB_σ, the pixel stiffness matrices can usually be obtained without any additional computational cost by the simple intermediate step of first summing up the element matrices for each pixel, and then summing up the pixel stiffness matrices to obtain B_σ. Alternatively, the pixel stiffness matrixB_i can be conveniently obtained from global stiffness matrices by the simple identities

Bi=B₁₊e_i−B₁, and B0=B₁− n

i=1

Bi,

whereB₁₊e_i andB₁denote the global stiffness matrixBσ forσ=1+eiandσ=1, respectively, ande_i∈Rⁿ is thei-th unit vector. Note that this does not require any knowledge of the triangulation details.

The FEM-Approximated Forward Operator Givenl, r∈H, we approximate the true measurementFl,r(σ )=r(u^l_σ)by

F_l,r(σ ):=r(u˜^l_σ),

whereu˜^l_σ ∈V is the FEM-approximation to the true solutionu^l_σ ∈H, i.e., the solu- tion of (10).

(13)

Algorithm 1 FEM-approximation ofF_l,r(σ )and_∂σ^∂

iF_l,r(σ ),i=1, . . . , n givenl, r∈H,σ∈Rⁿ₊

·use FEM package to calculate load vectorsy^landy^r

·use FEM package to calculate stiffness matricesB₁, andB_1+e_ifor alli=1, . . . , n

·setB_i:=B₁₊_e_i−B₁fori=1, . . . , n, andBσ:=B₁+_n

i=1(σ_i−1)B_i

·solveBσλ^l=y^landBσλ^r=y^rforλ^landλ^r return F_l,r(σ ):=(λ^l)^Ty^r and_∂σ^∂

iF_l,r(σ ):= −(λ^l)^TB_iλ^r,i=1, . . . , n

Lemma 3 Letl, r∈H. Then

F_l,r(σ )=b_σ(u˜^l_σ,u˜^r_σ) for allσ∈Rⁿ₊.

Moreover, Fl,r : Rⁿ₊→R is infinitely often differentiable and its first derivatives fulfill

∂

∂σi

F_l,r(σ )= −b_i(u˜^l_σ,u˜^r_σ), i=1, . . . , n.

Proof This follows by applying Corollary1to the Hilbert spaceV. From Lemma3, we obtain a simple FEM-based implementation of the forward operator and its derivative.

Corollary 2 With

˜ u^l_σ=

N

j=1

λ^l_j _j, λ^l=(λ^l_j)^N_j₌₁∈R^N,

˜ u^r_σ=

N

j=1

λ^r_j _j, λ^r=(λ^r_j)^N_j₌₁∈R^N,

we have that

Fl,r(σ )=(λ^l)^TBσλ^r=(λ^l)^Ty^r, and ∂

∂σ_iFl,r(σ )= −(λ^l)^TBiλ^r.

Proof This follows from Lemma3.

We summarize the consequences of Corollary2 in Algorithm 1. Using a FEM package that is capable of solving the considered PDE, and that allows access to the stiffness matrix and the load vector, one can simply implement the FEM- approximated forward operator and all its first derivatives by a few lines of extra code. This calculation merely requires solving two linear systems with the stiffness matrix (which is equivalent to two PDE solutions).

(14)

Convergence of the FEM-Approximated Forward Operator The following lemma shows that the FEM-approximated operator and its first derivatives agree with their true-solution counterparts as good as the FEM solution agrees with the true solution. Hence, by the Céa-Lemma (12), F_l,r(σ ) and _∂σ^∂

iF_l,r(σ ) will be as good an approximation toFl,r(σ )and _∂σ^∂

iFl,r(σ )as the true solutions can be approximated by elements of the finite-dimensional spaceV.

Lemma 4 For alll, r∈Handσ∈Rⁿ₊, we have that:

Fl,r(σ )−F_l,r(σ )=b_σ(u^l_σ − ˜u^l_σ, u^r_σ− ˜u^r_σ), (13)

∂

∂σ_iFl,r(σ )− ∂

∂σ_iF_l,r(σ )=b_i(u˜^l_σ,u˜^r_σ−u^r_σ)+b_i(u˜^l_σ−u^l_σ, u^r_σ). (14) Hence, by the Céa-Lemma (12),

0≤Fl,r(σ )−Fl,r(σ )≤Cσu^l_σ − ˜u^l_σ u^r_σ − ˜u^r_σ

≤ C_σ³ β_σ² inf

v∈Vu^l_σ−v inf

v∈Vu^r_σ−v,

and

∂

∂σ_iFl,r(σ )

≤C_i ˜u^l_σ ˜u^r_σ−u^r_σ +C_iu^r_σ ˜u^l_σ −u^l_σ

≤C_iC_σ β_σ

˜u^l_σ inf

v∈Vu^r_σ−v + u^r_σ inf

v∈V u^l_σ −v

, whereCi>0 is the continuity constant ofbi(·,·).

Proof Using

b_σ(u˜^l_σ,u˜^r_σ)=l(u˜^r_σ)=b_σ(u^l_σ,u˜^r_σ), and b_σ(u˜^l_σ,u˜^r_σ)=r(u˜^l_σ)=b_σ(u˜^l_σ, u^r_σ), we obtain (13) from

Fl,r(σ )−F_l,r(σ )=b_σ(u^l_σ, u^r_σ)−b_σ(u˜^l_σ,u˜^r_σ)=b_σ(u^l_σ, u^r_σ− ˜u^r_σ)

=bσ(u^l_σ, u^r_σ− ˜u^r_σ)−bσ(u˜^l_σ, u^r_σ− ˜u^r_σ)

=b_σ(u^l_σ− ˜u^l_σ, u^r_σ− ˜u^r_σ).

Also,

∂

∂σ_iF_l,r(σ )=b_i(u˜^l_σ,u˜^r_σ)−b_i(u^l_σ, u^r_σ)

=bi(˜u^l_σ,u˜^r_σ−u^r_σ)+bi(u˜^l_σ−u^l_σ, u^r_σ),

which shows (14).

(15)

4.2 Convexity and Monotonicity for Symmetric Measurements

As in Sect.3.2we now consider the symmetric measurement case, wherelandrare taken from the same subset ofH (and all combinations are used). Given a set of m∈Nexcitations/measurements {l1, . . . , l_m} ⊂H, we combine the measurements into a matrix-valued mapF : Rⁿ₊→R^m^×^m

F (σ )=(Fj,k(σ ))j,k=1,...,m∈R^m^×^m, Fj,k(σ )=Fl_j,l_k(σ )=lk(u^lσ^j).

The entries ofF (σ ) and its first derivatives _∂σ^∂

iF (σ ), i=1, . . . , ncan be calculated as in Algorithm1. Let us stress that this approach is particularly efficient in this symmetric case as it requires onlymlinear system solutions with the stiffness matrix (i.e., the equivalent ofmPDE solutions) for calculating allm²entries of F (σ )∈R^m^×^mand allnm²entries of thenmatrices _∂σ^∂

iF (σ )∈R^m^×^m.

As in Sect.3.2, the FEM-approximated forward operator is monotonically non- increasing and convex in the sense of the elementwise order “≥” on Rⁿ, and the Loewner order “” on the set of symmetricm×m-matrices.

Lemma 5 F : Rⁿ₊→R^m×mhas the following properties:

(a) F is infinitely often differentiable.

(b) For allσ∈Rⁿ₊,F (σ )∈SmandF (σ )0.F (σ )is positive definite ifl1, . . . , lm∈ Hare linearly independent.

(c) F is monotonically non-increasing, i.e.

F(σ )τ0 for all σ∈Rⁿ₊, 0≤τ∈Rⁿ, (15) and for allσ⁽¹⁾, σ⁽²⁾∈Rⁿ₊

σ⁽¹⁾≤σ⁽²⁾ implies F (σ⁽¹⁾)F (σ⁽²⁾). (16) (d) F is convex, i.e.

F (σ )−F (σ⁽⁰⁾)F(σ⁽⁰⁾)(σ−σ⁽⁰⁾) for all σ, σ⁽⁰⁾∈Rⁿ₊, (17) and, for allt∈ [0,1],

F ((1−t )σ⁽⁰⁾+t σ )(1−t )F (σ⁽⁰⁾)+t F (σ ). (18) (e) F(σ )F (σ ).

Proof (a)–(d) follow from applying Lemma2on the Hilbert spaceV. (e) was proven

in Lemma4.

5 Numerical Examples and Inverse Problem Challenges

In this section, we will show some numerical results for the stationary diffusion example from Sect.2.1and demonstrate some major challenges that arise in solving

(16)

Fig. 4 Single measurement in the top right pixel for a source term in the lower left pixel as a function of changing the diffusivity in each of the 3×3 pixels

the inverse coefficient problem of recoveringσˆ ∈Rⁿ fromF (σ )ˆ ∈R^m, or from a noisy versionY^δ≈F (σ ). The source codes for the following examples (and also forˆ generating Fig.1and2) are given in the appendix for the reader’s reference.

5.1 Non-uniqueness

Even for m≥n, and a noise-free measurement Yˆ =F (σ )ˆ ∈R^m, it is not clear whether the measurements uniquely determine the unknownσˆ ∈Rⁿ. To demonstrate this on a simple one-dimensional example, let us consider the stationary diffusion example with 3×3 pixels and circular excitation/measurement subdomains in each boundary pixel as in Fig.1. We apply a source term inD₁in the lower left pixel, and measure the resulting total concentration in D₈ in the top right pixel, so that l=χ₁∈H⁻¹()andr=χ₈∈H⁻¹(), where we writeχ_j:=χ_D_j for the ease of notation. We chooseσ=1 in all pixels exceptPi, and onPi we vary the diffusivity in steps of 0.01 up to 3. Figure4showsF_l,r(σ )for alli=1, . . . ,9, in the same order as the pixels, e.g., the lower left image showsF_l,r(σ )forσ =(σ₁,1, . . . ,1)for varyingσ₁.

Intuitively speaking, one can see that rising the diffusivity in the middle pixel increases the measurement since particles can easier diffuse through the middle pixel