Parallel Solution of Large Sparse Linear Systems, SS 2015 Exercise sheet 3 Prof. Dr. Peter Bastian, Marian Piatkowski Deadline 1st June 2015 IWR, Universit¨at Heidelberg
EXERCISE6 THE PARALLELRICHARDSON ITERATION
We want to solve the linear systemAx“bwith the Richardson iteration xpk`1q“xpkq`ωpb´Axpkqq.
LetAbe the stiffness matrix coming from the discretization of the Poisson equation on the unitsquare withP1Finite Elements. We use a structured simplicial mesh withN “n2degrees of freedom.
To accelerate the computational time we want to do the iteration in parallel with pprocessors.
Therefore we subdivide the unitsquare into psmaller squares and the degrees of freedom are dis- tributed accordingly. With this partitioning we assume thatp is a square number such that every processor haspn{?
pq2 degrees of freedom. The index set of the degrees of freedom is denoted byI, the index set of thei-th processor byIi. Every processor stores the entries ofxpkqcorresponding to its degrees of freedom and relevant rows ofA.
One iteration of the parallel Richardson iteration consists of the following steps:
• Communication of required entries ofxpkqfrom the neighbouring processors.
• Calculation ofxpk`1q.
1. Describe the index setsIiand specify which entries ofxpkqthe processorihas to communicate with which processor.
2. The computation time for an arbitrary arithmetic operation (addition, subtraction or multipli- cation) is denoted bytop, the time needed for sending one byte to another processor bytbyte and the time needed to set up a message to another processor is denoted bytmsg.
Derive a formula for the total computational time of one iteration withpprocessors. The entries ofxpkqare stored in double precision such that every entry requires 8 byte of memory.
The formula has to be only asymptotically correct. The matrix rows of the nodes next to the boundary which have less entries can be considered as interiour nodes.
3. Present the speedup of the parallel iteration in a table using the following parameters:
top“2ns tbyte“20ns tmsg“5000ns
nP t1024,4096u pP t1,4,16,256,4096u
12 Points
EXERCISE7 DOMAIN DECOMPOSITION
In the lecture you have learned the following theorem:
LetΩĂRdbe Lipschitz domain (open, bounded and connected) and letf PL2pΩq. Then the Poisson problem
´∆upxq “fpxq @xPΩ
upxq “0 @xP BΩ (1)
is equivalent to the solution of the two subproblems
´∆u1pxq “fpxq @xPΩ1
u1pxq “0 @xP BΩ1zΓ u1pxq “u2pxq @xPΓ Bu1pxq
Bn1 “ ´Bu2pxq Bn2
@xPΓ
´∆u2pxq “fpxq @xPΩ2
u2pxq “0 @xP BΩ2zΓ
(2)
with the non-overlapping decomposition
Ω“Ω1YΩ2, Ω1XΩ2 “ H, Γ“ BΩ1X BΩ2, µpBΩiq ą0, such that theBΩiare Lipschitz continuous.
The theory for the Poisson equation can be formulated for right-hand sidesf P H´1pΩqas well.
But for the general assumptionf PH´1pΩq, the equivalence (1)ô(2) does not hold. To see this we are going to consider the following counter example in one dimension forΩ“ p´1,1q:
´∆upxq “ ´2δpxq inΩ up´1q “up1q “0
whereδpxqdenotes theDirac delta function.
1. Find the unique weak solutionuPH1pΩq.
2. What are the transmission conditions ofupxqonΓ? Compare them to the transmissions condi- tions given in (2).
3. What changes in the derivation for the transmission conditions if we take again a right-hand sidef PL2pΩq?Hint:Cauchy-Schwarz inequality.
8 Points