• Keine Ergebnisse gefunden

Break signal

N/A
N/A
Protected

Academic year: 2021

Aktie "Break signal "

Copied!
21
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

The break signal in climate records:

Random walk or random deviations?

Ralf Lindau

(2)

Dipdoc Seminar – 30. May 2016

Break signal

Climate records are affected by

breaks resulting from relocations or changes in the measuring

techniques.

For the detection, differences of neighboring stations are considered to reduce the dominating natural variance.

Homogenization algorithms identify breaks by searching for the

maximum external variance (explained by the jumps).

(3)

Benchmark datasets

Benchmarking data sets are used to assess the skill of homogenization algorithms.

These are artificial data sets with known breaks so that an evaluation of the algorithms is possible.

However, benchmark datasets should reflect as much as possible the statistical properties of real data .

An important question is how to model the breaks:

1. As free random walk (Brownian motion)

2. As random deviation from a fixed level (random noise)

Dipdoc Seminar – 30. May 2016

(4)

Conceptual model

Same signal, two approaches:

Which of the two DT is assumed to be an independent random variable?

The deviations or

the jumps?

Depending on our choice

different statistical properties of

break signal will result.

Dipdoc Seminar – 30. May 2016

Random deviations

Brownian motion

(5)

Different effects of identical s

Dipdoc Seminar – 30. May 2016

Difference:

The introduced break variance of random deviations is larger by a factor of 2 compared to Brownian motion.

Reason:

All jumps are created by the sum of two random numbers, while it is only one in case of Brownian motion.

Preliminary:

𝑽𝒂𝒓 𝑩𝑴 = 𝟏

𝟐 𝑽𝒂𝒓 𝑹𝑫

Random deviations

Brownian motion

(6)

Linearly growing variance of BM

Dipdoc Seminar – 30. May 2016

The variance of a Brownian motion grows linearly in time.

At the end of a BM time series the variance is:

Var = k s2 with k: number of breaks and s2: break variance

The average variance (over time) is VarBM = k/2 s2

For RD the variance is (shown before):

VarRD = 2 s2

k is in the order of 5, for difference time series twice: 10.

Thus k/4 is in the order of 2.5.

Brownian motion created by the same s is much easier to detect.

𝑽𝒂𝒓 𝑩𝑴 = π’Œ

πŸ’ 𝑽𝒂𝒓 𝑹𝑫

(7)

Which type is more realistic?

Dipdoc Seminar – 30. May 2016

There are indications for both of the two break types:

For random deviations:

Relocations are bound to fixed position.

Stations have geographical names and their positions are not free to fluctuate away.

For random walk:

Changes in measuring techniques can be seen as elimination of error sources one after the other.

Ideal case: Today most errors are eliminated.

Then the break signal can be seen as Brownian motion backward in time.

(8)

Different β€œschools”

For a long time we were not aware that there are these two approaches.

Williams et al. (2012) modelled random walk.

Venema et al. (2012) modelled random deviations.

Only the standard deviations applied were communicated.

But these are not comparable for RD and BM.

Dipdoc Seminar – 30. May 2016

(9)

Platforms & Stairs

Venema et al. (2012) analyzed the statistics of the retrieved signal to decide whether breaks are BM or RD type.

Platforms Stairs

p (RD) = 0.67 p (actual) = 0.59 p (BM) = 0.50

But, the result was hardly significant due to the small number.

And (more important):

The result is dependent on the performance of the homogenization algorithm.

Dipdoc Seminar – 30. May 2016

T3 T1

T2 T3

T1

T2

(10)

Platforms are difficult to detect

Running a homogenization algorithm with artificial pure RD data results in 0.62 – 0.64 platform frequency ( < 0.67 ).

In the retrieved signal, the platforms are underestimated.

The detected frequency is not suited as independent indication parameter to distinguish RD from BM.

Therefore, it would be convenient to be independent from the retrieved break signal and instead able to derive break parameters directly from the data.

Dipdoc Seminar – 30. May 2016

(11)

Two superimposed signals

We assume that the climate time series consists of two superimposed signals:

Inhomogeneities and noise

π‘₯ 𝑖 = πœ€π‘ 𝑆 𝑖 + πœ€π‘› 𝑖 , πœ€π‘ ~ 𝑁 0, πœŽπ‘2 , πœ€π‘› ~ 𝑁 0, πœŽπ‘›2

Each yearly value can be thought as the sum of two random numbers, eb and en, where eb depends on segment number S, which is defined as the number of breaks lying temporally behind.

Dipdoc Seminar – 30. May 2016

(12)

Random deviation breaks

In case of random deviation breaks we calculate the

Lag-covariance C(L):

𝐢 𝐿 = 1

𝑛 βˆ’ 𝐿 π‘₯ 𝑖 βˆ’ π‘₯ π‘₯ 𝑖 + 𝐿 βˆ’ π‘₯

π‘›βˆ’πΏ

𝑖=1

For external pairs E(C(L)) = 0 For internal pairs E(C(L)) = sb2

𝐢 𝐿 = 𝑝𝑖𝑛𝑑 βˆ™ πœŽπ‘2

Dipdoc Seminar – 30. May 2016

(13)

Probability of internal pairs

π‘π‘¦π‘’π‘Žπ‘Ÿ(𝑙) = 𝑙 π‘˜ + 1 𝑛

𝑛 βˆ’ 1 βˆ’ 𝑙 π‘˜ βˆ’ 1 𝑛 βˆ’ 1

π‘˜

π‘π‘’π‘Žπ‘Ÿπ‘™π‘¦ 𝑙 = 𝑙 βˆ’ min(𝑙, 𝐿) 𝑙

𝑝𝑖𝑛𝑑(𝐿) = π‘π‘¦π‘’π‘Žπ‘Ÿ(𝑙) βˆ™ π‘π‘’π‘Žπ‘Ÿπ‘™π‘¦(𝑙, 𝐿)

π‘›βˆ’π‘˜

𝑙=1

Dipdoc Seminar – 30. May 2016

Probability of a specific year to belong to segment of length l:

Probability of a specific year to have sufficient spacing to the next break:

Probability of internal pairs is the sum over all length of the product.

The probability for internal pairs increase with segment length l and decrease with time lag L.

(14)

Probability of internal pairs

𝑝𝑖𝑛𝑑 = 𝑙 π‘˜ + 1 𝑛

π‘›βˆ’π‘˜

𝑙=1

βˆ™

𝑛 βˆ’ 1 βˆ’ 𝑙 π‘˜ βˆ’ 1 𝑛 βˆ’ 1

π‘˜

βˆ™ 𝑙 βˆ’ min 𝑙, 𝐿

𝑙

𝑝𝑖𝑛𝑑 =

𝑛 βˆ’ 1 βˆ’ 𝐿 𝑛 βˆ’ 1π‘˜

π‘˜

𝑝𝑖𝑛𝑑 = π‘’βˆ’ π‘›βˆ’π‘˜π‘˜πΏ

Dipdoc Seminar – 30. May 2016

The long version of the product :

By some purely arithmetic transformations we get:

By some further approximations we get:

(15)

Lag covariance for RD

Dipdoc Seminar – 30. May 2016

The covariance is an

exponential function of the time lag.

C(L) = a exp (-bL)

break

a = sb2 strength sb b = k/(n-k) number k

As byproduct we have a nice method to retrieve also

strength and number of breaks directly from the data.

Input:

sb = 1.000 k = 5.000 Output:

sb = 1.000 k = 4.984

(16)

Brownian motion type

Dipdoc Seminar – 30. May 2016

For Brownian motion type breaks the covariance depends only on the segment number of the earlier of the two years , because they have all random numbers eb constituting the break signal at x(i) in common.

πΆπ‘œπ‘£ π‘₯(𝑖), π‘₯ 𝑗 = 𝑆 𝑖 + 1 πœŽπ‘2 , 𝑖 < 𝑗

The segment number is a stochastic variable growing linearly in time:

𝑆 𝑖 = 𝑖 βˆ’ 1

𝑛 βˆ’ 1 π‘˜ , 𝑖 ≀ 𝑛

Consequently, also the covariance grows linearly with time:

πΆπ‘œπ‘£ π‘₯ 𝑖 , π‘₯(𝑗) = 1 + 𝑖 βˆ’ 1

𝑛 βˆ’ 1 π‘˜ πœŽπ‘2 , 𝑖 < 𝑗 ≀ 𝑛

(17)

Time dependent Cov for BM

Dipdoc Seminar – 30. May 2016

The covariance is a linear function in time.

C(i) = a i + b

a = k/(n-1) sb2 b = ( 1 - k/(n-1)) sb2

Input:

sb = 1.000 k = 5.000 Output:

sb = 1.005 k = 4.920

(18)

Mixed applications

Dipdoc Seminar – 30. May 2016

Very small break size.

Input:

sb = 1.000 k = 5.000 Output:

sb = 0.046 k = 14.95

Input:

sb = 1.000 k = 5.000 Output:

sb = 3.471 k = 0.729

Temporal covariance

Random deviations

Lag covariance Brownian motion

Very small break number.

(19)

Conclusion

Brownian motion and random deviation break types can be distinguished by calculating:

1. Lag covariance C(L)

2. Time dependent covariance C(i)

For Random deviations C(L) is decreasing with L.

For Brownian motion C(i) is increasing with j.

The two other combinations yield either small size or small number.

As byproduct we get an estimate for break size and number without running a full homogenization algorithm.

Dipdoc Seminar – 30. May 2016

(20)

Platforms & Stairs

Venema et al. (2012) analyzed the statistics of the retrieved signal to decide whether breaks are BM or RD type.

They distinguish platforms:

from stairs:

Dipdoc Seminar – 30. May 2016

T3 T1

T2

T3

T1

T2

(21)

Platform probability for RD

For RD break types T1, T2, T3 are iid random variables (not the case for BM).

There are 6 possibilities of rank order, which all have the same probability:

Dipdoc Seminar – 30. May 2016

T1 < T2 < T3 T1 < T3 < T2 T2 < T1 < T3 T2 < T3 < T1 T3 < T1 < T2 T3 < T2 < T1

Upward and downward stairs have both the probability 1/6.

Every other combination is a

platform. (Either T2 is the smallest or T2 is the largest element of the

triple.)

Downward stair Upward stair

For RD break types the probability of platforms is 2/3.

Referenzen

Γ„HNLICHE DOKUMENTE

In the second part, seafloor spreading anomaly data, measured by ship, aircraft and helicopter, in the South At- lantic region, are used to constrain a geodynamic model for the

A simple method was proposed to reduce the parameter search space of the Lorenz system, based on the determination of the system’s equilibrium points from the analysis of the

A sprinkler with three arms is supplied by a large tank and rotates with the angular velocity Ο‰ = const... The center of gravity of the system has the distance h to

In order to be able to do that, the Iraqi government and armed forces will need the support of the international community, particularly cooperation between Iran, the

With a foreword by Tiff Macklem, outgoing senior deputy governor of the Bank of Canada, Crisis and Reform examines Canada’s role in the formative years of the international

021 &#34;Nonparametric Estimation of Risk-Neutral Densities&#34; by Maria Grith, Wolfgang Karl HΓ€rdle and Melanie Schienle, March 2010.. SFB 649 Discussion Paper

PROPHETIC VISION IN MODERN ARABIC POETRY.. Von As'ad

β€œFurther deterioration is likely unless there is an immediate and lasting end to the conflict” IPC Indicative Acute Food Insecurity Situation Overview in Yemen, June 2015, Yemen Food