• Keine Ergebnisse gefunden

Automatic data quality control

N/A
N/A
Protected

Academic year: 2022

Aktie "Automatic data quality control"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Brenner Silva < bsilva@awi.de >, Roland Koppe, Antonie Haas, Christian Schäfer-Neth, Philipp Fischer, Sebastian Immoor, Peter Gerchow, Bernadette Fritzsch & Stephan Frickenhaus

Automatic data quality control

for understanding extreme climate events

References

[1] https://sensor.awi.de

[2] https://dashboard.awi.de/data-xxl/overview.jsp

[3] OOI, Consortium for Ocean Leadership, 2012. Document Control Number 1341-10006.

[4] ARGO Data Management Team, 2019. doi:10.13155/33951

[5] FZJ, Kaffashzadehet al. 2019. Preprint FZJ-2019-03979. BREMERHAVEN

Am Handelshafen 12 27570 Bremerhaven Telefon 0471 4831-0 www.awi.de

The understanding of extreme events strongly depends on knowledge gained from data. Data integration of

multiple sources, scales and earth compartments is the focus of the project Digital Earth, which also join efforts on the quality control of data. Automatic quality control is embedded in the ingest component of the O2A, the observation-to-archive data flow framework of the

Alfred-Wegener-Institute. (Fig.1).

Spike: |median[Ti-2,.., Ti+2]| + |SD[Ti-2,.., Ti+2]| < Spikemax

– Temperature, Spikemax = 1°C – Pressure, Spikemax = 1 dbar – Salinity, Spikemax = 0.9 PSU

– Conductivity, Spikemax = 0.75 mS/cm

Gradient: | T-(Ti+1 + Ti-1)/2 | < Gradientmax

– Temperature, Gradientmax = 1.5°C – Pressure, Gradientmax = 1.5 dbar – Salinity, Gradientmax = 1.25 PSU

– Conductivity, Gradientmax = 1.0 mS/cm

Manufacturer range: Ti [am, bm]

– Temperature [−2.5, 35.]°C – Pressure  [-5, 3140*] dbar – Salinity [2.,41.] PSU

– Conductivity [0.0,65.] mS/cm

Operation range: = Ti [a, b]

– Temperature  [−1.9, 10.]°C – Pressure [2, 314*] dbar – Salinity [30., 37.] PSU

– Conductivity [25.,45.] mS/cm

At present the O2A-automatic quality control follows a

procedural approach to implement formulations found in the literature and other observatory networks. A set of

plausibility tests including range, spike and gradient tests are currently operational (Fig 3.).

Figure 1. The automatic quality control is embedded in the O2A-Ingest and request for observation properties from the O2A-Sensor to deliver quality-flagged data.

Within Digital Earth, data on flood and drought events along the Elbe River and methane emissions in the

North Sea are to be reviewed using automatic quality control. Fast and scalable automatic quality control will disentangle uncertainty raised by quality issues and

thus improve our understanding of extreme events in those cases.

The automatic quality control scans the ingesting data in near-real-time (NRT) format, builds a table of

devices, and search - either by absolute or derivative values - for correctness and validity of observations.

The availability of observation properties at O2A-

Sensor1, for instance operation ranges, triggers the

automatic quality control, which in turn iterates through the table of devices to set the quality flag for each

sample and observation. Quality flags are monitored using the O2A-dashboard2 (Fig. 2)

Figure 3. Some of the formulations currently operational in the O2A.

Figure 2. The automatic quality control is embedded in the O2A-Ingest and request for observation properties from the O2A-Sensor to deliver quality-flagged data.

Figure 4. Example of results applied to conductivity

(mS/cm) using currently operational tests at O2A-ingest In the technical documentation, equally named quality

tests differ in their algorithms. For instance, where

“spike” is a deviant from the adjacent values, OOI3 uses ranges, while ARGO4 uses arithmetic mean, and FZJ5

also limits to differences of same signal. To standardize such procedures a review of existing formulations is

under construction, that should also help to overcome limitations of the current approach (Fig 4.).

To date, the quality flags in use are sequential and

qualitative, i.e. it describes a level of quality in the data.

A new flagging system is under development to include a descriptive characteristic that will comprise technical and user interpretation.

Referenzen

ÄHNLICHE DOKUMENTE

This is depending on the purpose of each digital library: Does it only help identify literature (e.g., DBLP), aim to summarize existing research and provide an overview on

Figure 5: Normal probability plot (left) and the differences in structure factors against resolution (right). For charge density studies a difference of about 5 % is still accepted

The intramembrane degron recognized by Doa10 resides in the tail-anchor (TA) region of the integral membrane protein Sbh2, the β-subunit of the heterotrimeric

That these criteria result in high-resolution cutoffs that are too conservative is illustrated here using an exa mple data set (EXP) collected for a cyste

We propose a framework that considers different aspects of quality control across three main phases of VGI data collection: the conception phase; the acquisition phase; and the

pring values. In the central Weddell region, all seasons negative bias indicates that drift is underestimated by satellite data. A clear seasonal d.. and further investigations

O2A: A Generic Framework for Enabling the Flow of Sensor Observations to Archives and Publications, OCEANS 2015 Genova.

Wi th the development of such a magnetic ha.ndling system for large data work loads based on the philosophy of human filtering and quick look presentation to