• Keine Ergebnisse gefunden

Scalable Sequential Data Assimilation with the Parallel Data Assimilation Framework PDAF

N/A
N/A
Protected

Academic year: 2022

Aktie "Scalable Sequential Data Assimilation with the Parallel Data Assimilation Framework PDAF"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Alfred Wegener Institute Helmholtz Center for Polar and

Marine Research

Scalable Sequential Data Assimilation

with the Parallel Data Assimilation Framework PDAF

Lars Nerger, Wolfgang Hiller, and Jens Schr ¨oter

Alfred Wegener Institute, Helmholtz Center for Polar and Marine Research, Bremerhaven, Germany Contact: Lars.Nerger@awi.de

·

http://www.awi.de

Data assimilation applications with high-dimensional nu- merical models exhibit extreme requirements on compu- tational resources. Good scalability of the assimilation system is necessary to make these applications feasible.

Sequential data assimilation methods based on ensem- ble forecasts, like ensemble-based Kalman filters, pro- vide such good scalability. This parallelism has to be combined with the parallelization of both the numerical model and the data assimilation algorithm.

The Parallel Data Assimilation Framework PDAF has been developed to simplify the implementation of scal- able data assimilation systems based on existing numer- ical models. PDAF is suitable for educational use with toy models but also for high-dimensional applications and operational use. PDAF is distributed as open source software.

PDAF is configured for sequential data assimilation with ensemble-based filters. A selection of filter and smoother algorithms is fully implemented and optimized in PDAF including parallelization, e.g.

EnKF – Ensemble Kalman Filter [1]

LESTKF – Local Error Subspace Transform Kalman Filter [2]

LETKF – Local Ensemble Transform Kalman Filter [3]

LSEIK – Local Singular Evolutive Interpolated Kalman filter [4]

smoother extensions of the filters above

Common fixes and tuning options like covariance infla- tion are also implemented. Further, a selection of ad- vanced localization options are available.

Top: Principle of sequential data assimilation with a filter algorithm.

The state estimate of the assimilation is given by the ensemble mean. The analysis estimate lies typically between the forecast es- timate and the observation, hence closer to the true state.

Initialize ensembles

Forecast ensemble Perform filter

analysis step

Aaaaaaaa Aaaaaaaa aaaaaaaa a

Start

Stop

Initialize Model

generate mesh Initialize fields

Time stepper

consider BC Consider forcing

Post-processing init_parallel_pdaf

Do i=1, nsteps init_pdaf

Aaaaaaaa Aaaaaaaa aaaaaaaaa

Start

Stop

Do i=1, nsteps

Initialize Model

generate mesh Initialize fields

Time stepper

consider BC Consider forcing

Post-processing

Model

Extension for data assimilation

Legend:

Add 2nd-level parallelization

Model Assimilation System

assimilate_pdaf

Logical separation of the assimilation system

2-level parallelization of the assimilation system

Left: PDAF uses a logical separation of the compo- nents of the data assimilation system: Model, filter algo- rithm, and observations. The filter algorithms are part of PDAF’s core, while the model routines and routines to handle observations are provided by the user. A stan- dard interface for all filter algorithms connects the three components. All user-supplied routines can be imple- mented like model routines.

Right: The assimilation system is implemented with PDAF [5,6] by extending the model source code and utilizing parallelization. Three calls to subroutines are added. In contrast to other frameworks, the model does not need to exist as a separate subroutine. The ensem- ble forecast is controlled by user-supplied routines that are called through PDAF. Implementations using this online coupling have been performed for models like NEMO, FEOM, BSHcmod, MIPOM, NOBM, ADCIRC, and PARODY.

Left: PDAF provides support for a 2-level parallelization for the assimilation system:

1. Each model task can be parallelized.

2. All model tasks are executed concurrently.

Thus, ensemble integrations can be done fully parallel.

In addition, the filter analysis step uses parallelization.

All components are combined in a single program.

PDAF is coded in Fortran with MPI parallelization. It is available as free software. Further information and the

source code of PDAF are available on the web site:

http://pdaf.awi.de

0 10 20 30 40 50 60

1 2 3 4 5 6 7 8

processes per model task

speedup

Speedup with number of processes per model task

ensemble size 8 ensemble size 64

ideal 512 proc.

4096 proc.

64/512 proc.

Speedup with number of processes per model task

0 10 20 30 40 50 60

1 1.02 1.04 1.06 1.08

ensemble size

time relative to ensemble size 8

Time increase with increasing ensemble size

8 processes per model task 64 processes per model task

64/512 proc.

4096 proc.

512 proc.

Scalability: Time increase with increasing ensemble size

The parallel performance has been tested with an imple- mentation of PDAF with the finite-element ocean model FEOM. About 94 to 99% of the computing time are used for the ensemble integrations.

Speedup is accessed with a constant ensemble size. Due to the parallel properties of the model, a speedup of 6 is

obtained when the number of processors is increased by a factor of 8 (left panel).

The scalability of the assimilation system is visible when the number of processes per model task is kept constant.

Increasing the ensemble size by a factor of eight results in a time increase between only 1% and 7% (right panel).

PDAF has been developed to simplify the implementation of data assimilation systems. It is aimed for large-scale data assimilation applications but can also be used to test or teach assimilation methods with small models.

Very good scalability is provided through the complete paral- lelism of all parts of the assimilation system (ensemble inte- gration, filter algorithms, and perhaps the model itself).

Only minimal changes to the model source code are required when combining a model with PDAF in its online mode. An offline mode is also supported with separate programs for model and filtering. The offline mode avoids changes to the model code, but leads to a smaller computing performance.

PDAF is currently used in several research projects with a variety of models. It is in pre-operational use for forecasting in the North Sea (see poster E-P10 by S. Losa et al.).

[1] Evensen, G. (1994). Sequential data assimilation with a nonlinear quasi- geostrophic model using Monte Carlo methods to forecast error statistics. J.

Geophys. Res. 99C: 10143

[2] Nerger, L., T. Janji´c, J. Schr ¨oter, J., and W. Hiller (2012). A unification of en- semble square root Kalman filters. Mon.

Wea. Rev. 140, 2335–2345

[3] Hunt, B.R., E.J. Kostelich, and I. Szun- yogh (2007). Efficient data assimilation for spatiotemporal chaos: A local ensem- ble transform Kalman filter. Physica D 230: 112–126

[4] L. Nerger, S. Danilov, W. Hiller, and J. Schr ¨oter (2006). Using sea-level data to constrain a finite-element primitive- equation ocean model with a local SEIK filter. Ocean Dynamics 56: 634–649

[5] Nerger, L., W. Hiller, and J. Schr ¨oter (2005).

PDAF - The Parallel Data Assimilation Framework:

Experiences with Kalman Filtering, in Use of High Performance Computing in Meteorology - Proceed- ings of the 11th ECMWF Workshop / Eds. W. Zwiefl- hofer, G. Mozdzynski. World Scientific, pp. 63–83

[6] Nerger, L. and W. Hiller (2012). Software for Ensemble-based Data Assimilation Sys- tems – Implementation Strategies and Scal- ability. Computers & Geosciences. 55, 110–

118

Introduction Sequential Data Assimilation

PDAF’s Implementation Concept

Parallel Performance Summary

References

Referenzen

ÄHNLICHE DOKUMENTE

2.2 The Finite Element Sea Ice-Ocean Model (FESOM) The sea ice-ocean component in the coupled system is represented by FESOM, which allows one to simulate ocean and

• Overview of ensemble data assimilation • Data assimilation software PDAF Parallel Data Assimilation Framework • Implementation example MITgcm.. Tutorial: Ensemble Data

In this step the model state vectors are collected on the filter communicator with the help of the coupling communicator (see Fig. Then the observation data are read from netCDF

2.2 The Finite Element Sea Ice-Ocean Model (FESOM) The sea ice-ocean component in the coupled system is represented by FESOM, which allows one to simulate ocean and

The coupled model AWI-CM [2] consists of ECHAM6 for the atmosphere including the land model JSBACH, and the finite-element sea ice-ocean model FESOM for the ocean compartment..

Large scale data assimilation: Global ocean model. •  Finite-element sea-ice ocean

Using only a subset of processes for the analysis requires to collect the forecast states from all model tasks on the processes of the single model task used for the analysis..

Left: PDAF is based on a consistent logical separation of the components of the data assimilation system: model, fil- ter algorithm, and observations.. The filter algorithms are part