• Keine Ergebnisse gefunden

One iteration of the parallel Richardson iteration consists of the following steps

N/A
N/A
Protected

Academic year: 2021

Aktie "One iteration of the parallel Richardson iteration consists of the following steps"

Copied!
2
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Parallel Solution of Large Sparse Linear Systems, SS 2015 Exercise sheet 3 Prof. Dr. Peter Bastian, Marian Piatkowski Deadline 1st June 2015 IWR, Universit¨at Heidelberg

EXERCISE6 THE PARALLELRICHARDSON ITERATION

We want to solve the linear systemAx“bwith the Richardson iteration xpk`1q“xpkq`ωpb´Axpkqq.

LetAbe the stiffness matrix coming from the discretization of the Poisson equation on the unitsquare withP1Finite Elements. We use a structured simplicial mesh withN “n2degrees of freedom.

To accelerate the computational time we want to do the iteration in parallel with pprocessors.

Therefore we subdivide the unitsquare into psmaller squares and the degrees of freedom are dis- tributed accordingly. With this partitioning we assume thatp is a square number such that every processor haspn{?

pq2 degrees of freedom. The index set of the degrees of freedom is denoted byI, the index set of thei-th processor byIi. Every processor stores the entries ofxpkqcorresponding to its degrees of freedom and relevant rows ofA.

One iteration of the parallel Richardson iteration consists of the following steps:

• Communication of required entries ofxpkqfrom the neighbouring processors.

• Calculation ofxpk`1q.

1. Describe the index setsIiand specify which entries ofxpkqthe processorihas to communicate with which processor.

2. The computation time for an arbitrary arithmetic operation (addition, subtraction or multipli- cation) is denoted bytop, the time needed for sending one byte to another processor bytbyte and the time needed to set up a message to another processor is denoted bytmsg.

Derive a formula for the total computational time of one iteration withpprocessors. The entries ofxpkqare stored in double precision such that every entry requires 8 byte of memory.

The formula has to be only asymptotically correct. The matrix rows of the nodes next to the boundary which have less entries can be considered as interiour nodes.

3. Present the speedup of the parallel iteration in a table using the following parameters:

top“2ns tbyte“20ns tmsg“5000ns

nP t1024,4096u pP t1,4,16,256,4096u

12 Points

(2)

EXERCISE7 DOMAIN DECOMPOSITION

In the lecture you have learned the following theorem:

LetΩĂRdbe Lipschitz domain (open, bounded and connected) and letf PL2pΩq. Then the Poisson problem

´∆upxq “fpxq @xPΩ

upxq “0 @xP BΩ (1)

is equivalent to the solution of the two subproblems

´∆u1pxq “fpxq @xPΩ1

u1pxq “0 @xP BΩ1zΓ u1pxq “u2pxq @xPΓ Bu1pxq

Bn1 “ ´Bu2pxq Bn2

@xPΓ

´∆u2pxq “fpxq @xPΩ2

u2pxq “0 @xP BΩ2

(2)

with the non-overlapping decomposition

Ω“Ω1YΩ2, Ω1XΩ2 “ H, Γ“ BΩ1X BΩ2, µpBΩiq ą0, such that theBΩiare Lipschitz continuous.

The theory for the Poisson equation can be formulated for right-hand sidesf P H´1pΩqas well.

But for the general assumptionf PH´1pΩq, the equivalence (1)ô(2) does not hold. To see this we are going to consider the following counter example in one dimension forΩ“ p´1,1q:

´∆upxq “ ´2δpxq inΩ up´1q “up1q “0

whereδpxqdenotes theDirac delta function.

1. Find the unique weak solutionuPH1pΩq.

2. What are the transmission conditions ofupxqonΓ? Compare them to the transmissions condi- tions given in (2).

3. What changes in the derivation for the transmission conditions if we take again a right-hand sidef PL2pΩq?Hint:Cauchy-Schwarz inequality.

8 Points

Referenzen

ÄHNLICHE DOKUMENTE

Since 2010, the Council of Europe and its most visible institution, the European Court of Human Rights, have been undergoing reform.. The success of these reforms will also

replace kann add1 nach innen verschieben.. Data constructor Typ constructor

In the current work we present an idea for accelerating the convergence of the resulted sequence to the solution of the problem by choosing a suitable initial term. The efficiency of

Further evidence is offered by the loss of 126 Da in both spectra from fragments containing the N-terminus indicating the presence of the octanoic acid amide structure, and by

In the 2009 Copenhagen accord 7 and the 2010 Cancun agreement 8 , countries collec- tively pledged to limit warming to below 2 °C, and 73 countries individually pledged emis-

Niklas Höhne, Michel den Elzen, Joeri Rogelj, Bert Metz, Taryn Fransen, Takeshi Kuramochi, Anne Olhoff, Joseph Alcamo, Harald Winkler, Sha Fu, Michiel Schaeffer, Roberto

We prove an affine regularization theorem: these iterations in higher dimensions also deliver generations Q k approaching the affine shape of regular planar

In this paper, the author develops a dual forest iteration method for the stochastic transportation problem.. In this sense it is simple compared with the primal forest