
5. Spot volatility estimation in the semimartingale model

5.2. Estimation of the wavelet coefficients

The content of this section is subdivided into two steps. First, we modify the pre-averaging procedure in order to make it suitable for our purposes. Second, we define and discuss the estimators of the wavelet coefficients.

First step: Let us begin with a definition.

Definition 18 (Pre-average function). A function $\lambda : [0,2] \to \mathbb{R}$ that is piecewise Lipschitz continuous and satisfies $\lambda(t) = -\lambda(2-t)$ is called a pre-average function. Given a pre-average function $\lambda$, let
$$\overline{\lambda} := \Big( 2 \int_0^1 \Big( \int_0^s \lambda(u)\, du \Big)^2 ds \Big)^{1/2}$$
and define the (normalized) pre-average function $\widetilde{\lambda} := \lambda / \overline{\lambda}$.

Note that the graph of λ is point symmetric with respect to (1,0).
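To make Definition 18 concrete, the constant $\overline{\lambda}$ can be approximated numerically; a minimal sketch (the helper name `lambda_bar` and the trapezoidal discretization are illustrative, not part of the text):

```python
import numpy as np

def lambda_bar(lam, num=200_001):
    """Approximate ( 2 * int_0^1 ( int_0^s lam(u) du )^2 ds )^(1/2)
    from Definition 18 by the trapezoidal rule."""
    s = np.linspace(0.0, 1.0, num)
    v = lam(s)
    # Lambda(s) = int_0^s lam(u) du via cumulative trapezoidal sums
    Lam = np.concatenate(([0.0], np.cumsum((v[1:] + v[:-1]) / 2.0) * (s[1] - s[0])))
    return float((2.0 * np.mean(Lam ** 2)) ** 0.5)

# the step function lam = I_[0,1) - I_(1,2] (Example 3 (ii) before normalization)
haar = lambda s: np.where(s < 1.0, 1.0, -1.0)
# lambda_bar(haar) is close to sqrt(2/3), so the normalized version is sqrt(3/2)*haar
```

Dividing a candidate $\lambda$ by this constant yields the normalized pre-average function $\widetilde{\lambda}$.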

Example 3. Let us give a few examples of normalized pre-average functions.

(i) $\widetilde{\lambda}(s) = (k+1/2)\pi \cos\big(s(k+1/2)\pi\big)$, $k = 0,1,\ldots$
(ii) $\widetilde{\lambda}(s) = \sqrt{3/2}\,\big(\mathbb{I}_{[0,1)}(s) - \mathbb{I}_{(1,2]}(s)\big)$. This leads us to the generalized quadratic variation that has already been discussed in Section 5.1.
(iii) $\widetilde{\lambda}(s) = 3^{-1/2}\, k\pi \sin(k\pi s)$, $k = 1,2,\ldots$
(iv) $\widetilde{\lambda}(s) = 2^{-1}\sqrt{(2k+3)(4k+5)}\,(1-s)^{2k+1}$, $k = 0,1,\ldots$

As in Chapter 4 we set
$$M = \lfloor c\, n^{1/2} \rfloor. \tag{5.2.1}$$
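The normalization of the four families in Example 3 can be spot-checked numerically: each should have normalizing constant $1$ in the sense of Definition 18. A small sketch (the names and the choice $k=2$ are illustrative):

```python
import numpy as np

def lambda_bar(lam, num=200_001):
    # ( 2 * int_0^1 ( int_0^s lam(u) du )^2 ds )^(1/2), trapezoidal rule
    s = np.linspace(0.0, 1.0, num)
    v = lam(s)
    Lam = np.concatenate(([0.0], np.cumsum((v[1:] + v[:-1]) / 2.0) * (s[1] - s[0])))
    return float((2.0 * np.mean(Lam ** 2)) ** 0.5)

k = 2  # any admissible k serves as a spot check
examples = {
    "(i)":   lambda s: (k + 0.5) * np.pi * np.cos(s * (k + 0.5) * np.pi),
    "(ii)":  lambda s: np.sqrt(1.5) * np.where(s < 1.0, 1.0, -1.0),
    "(iii)": lambda s: 3 ** -0.5 * k * np.pi * np.sin(k * np.pi * s),
    "(iv)":  lambda s: 0.5 * ((2 * k + 3) * (4 * k + 5)) ** 0.5 * (1 - s) ** (2 * k + 1),
}
norms = {name: lambda_bar(f) for name, f in examples.items()}   # all close to 1
```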

For the observation vector $Y$, we introduce the modified pre-averaged observations by
$$\overline{Y}_{i,M}(\lambda) := \frac{M}{n} \sum_{j=1}^{n} \widetilde{\lambda}\Big(M\frac{j}{n} - (i-2)\Big)\, Y_{j,n}, \quad \text{for } i = 2, \ldots, M. \tag{5.2.2}$$
There are two changes compared to the original pre-averaging procedure defined in (5.1.1). First, a weighted binning is used and, second, we directly consider differences over successive averaged blocks, without first defining local means. Hence, following the observations of Section 5.1, it is plausible that
$$\overline{Y}_{i,M}(\lambda) = -\big(X_{(i-1)/M} - X_{(i-2)/M}\big) + O_p\big(M^{-1/2} + (M/n)^{1/2}\big) \tag{5.2.3}$$
and the increments of $X$ are of the same order as the noise.
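The heuristic (5.2.3) can be illustrated by simulation. The sketch below assumes the weighted-sum form $\overline{Y}_{i,M}(\lambda) = \frac{M}{n}\sum_j \widetilde{\lambda}(Mj/n-(i-2))\,Y_{j,n}$ for the pre-averaged observations and uses the Haar-type weight from Example 3 (ii); all concrete parameter values are illustrative. With $\sigma \equiv 1$ and small noise, $M\,\overline{Y}_{i,M}^2$ should fluctuate around $\sigma^2 = 1$:

```python
import numpy as np

rng = np.random.default_rng(0)
n, c, tau = 40_000, 1.0, 0.005
M = int(c * n ** 0.5)                    # number of blocks, cf. (5.2.1)

def lam_tilde(t):
    # normalized Haar-type pre-average function, Example 3 (ii)
    t = np.asarray(t)
    return np.sqrt(1.5) * (((t >= 0) & (t < 1)).astype(float)
                           - ((t >= 1) & (t <= 2)).astype(float))

# X: Brownian motion (sigma = 1) on the grid j/n, observed with additive noise
X = np.concatenate(([0.0], np.cumsum(rng.normal(0.0, (1.0 / n) ** 0.5, n))))
Y = X + tau * rng.normal(size=n + 1)

j = np.arange(n + 1)
Ybar = np.array([(M / n) * np.sum(lam_tilde(M * j / n - (i - 2)) * Y)
                 for i in range(2, M + 1)])

# heuristic (5.2.3): M * Ybar_i^2 fluctuates around sigma^2 = 1
est = M * np.mean(Ybar ** 2)
```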

On the other hand, $\overline{Y}_{i,M}(\lambda)$ may also be represented (up to a small error) by weighted increments of $Y$.

Second step: Let $\phi$ be an $L^2$-function. The estimator for the scalar product $\langle \phi, \sigma^2 \rangle$ is given by
$$\widehat{\langle \phi, \sigma^2 \rangle} := \sum_{i=2}^{M} \phi\Big(\frac{i-1}{M}\Big) \Big( \overline{Y}_{i,M}^{\,2}(\lambda) - b(\lambda, Y)_{i,M} \Big), \tag{5.2.4}$$
where $b(\lambda, Y)_{i,M}$ denotes the correction for the bias induced by the pre-averaged noise.

In particular, given a multiresolution analysis $((V_j)_j, \phi)$ as well as the corresponding wavelet $\psi$, the estimators of the basis coefficients $\langle \phi_{j_0,k}, \sigma^2 \rangle$ and $\langle \psi_{j,k}, \sigma^2 \rangle$ are given by $\widehat{\langle \phi_{j_0,k}, \sigma^2 \rangle}$ and $\widehat{\langle \psi_{j,k}, \sigma^2 \rangle}$, respectively.

Explanation of (5.2.4): Going back to (5.2.3) and the discussions in Section 5.1, it is clear that
$$\overline{Y}_{i,M}^{\,2} = \big(X_{(i-1)/M} - X_{(i-2)/M}\big)^2 + O_p\big(M^{-1} + M/n\big) = \frac{1}{M}\,\sigma^2\Big(\frac{i-1}{M}\Big) + O_p\big(M^{-1} + M/n\big).$$
Now, the noise is of the same order as the signal. However, by the definition of a pre-average function and by imposing smoothness on $\sigma$, the influence of the bias due to the approximation can be reduced to smaller order, such that we only need to adjust for the bias induced by the pre-averaged noise. Careful calculations reveal that this can be accomplished by subtracting $b(\lambda, Y)_{i,M}$. Let us mention that, if $\tau$ and $\phi$ are sufficiently smooth, we might approximate

$b(\lambda, Y)_{i,M}$ by a simpler expression; hence (5.2.4) can be written, up to some small approximation error, in a form that can be compared directly to (4.2.10).
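The role of the correction term can be summarized in one line. Writing $\overline{Y}_{i,M} = \overline{X}_{i,M} + \overline{\epsilon}_{i,M}$ for the pre-averaged signal and noise parts (a heuristic sketch; since the noise is centered conditionally on $X$, the cross term has mean zero):

```latex
\mathbb{E}\big[\overline{Y}_{i,M}^{\,2}\big]
  = \mathbb{E}\big[\overline{X}_{i,M}^{\,2}\big]
    + 2\,\mathbb{E}\big[\overline{X}_{i,M}\,\overline{\epsilon}_{i,M}\big]
    + \mathbb{E}\big[\overline{\epsilon}_{i,M}^{\,2}\big]
  \approx \frac{1}{M}\,\sigma^2\Big(\frac{i-1}{M}\Big)
    + \mathbb{E}\big[\overline{\epsilon}_{i,M}^{\,2}\big],
```

so the pre-averaged noise contributes an additive bias, which is exactly what subtracting $b(\lambda, Y)_{i,M}$ removes.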

Furthermore, since $\overline{Y}_{i,M}^{\,2} - b(\lambda, Y)_{i,M}$ has mean $\sigma^2\big(\frac{i-1}{M}\big)/M$ and variance of order $n^{-1}$ (as shown later), we may think of $M\big(\overline{Y}_{i,M}^{\,2} - b(\lambda, Y)_{i,M}\big)$ as observations coming from a nonparametric regression model (2.1.3), with regression function $\sigma^2$ and almost centered (but dependent) errors. As mentioned in (2.1.4),
$$\frac{1}{M} \sum_{i=2}^{M} \phi\Big(\frac{i-1}{M}\Big)\, M\Big( \overline{Y}_{i,M}^{\,2} - b(\lambda, Y)_{i,M} \Big)$$
is then the natural estimator for the scalar product $\langle \phi, \sigma^2 \rangle$.
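The regression-type reading can be illustrated in the noiseless case $\tau = 0$, where the bias correction may be taken to be zero and the estimator with $\phi \equiv 1$ should recover the integrated volatility $\int_0^1 \sigma_s^2\, ds$. A simulation sketch, again assuming the weighted-sum form $\overline{Y}_{i,M}(\lambda) = \frac{M}{n}\sum_j \widetilde{\lambda}(Mj/n-(i-2))\,Y_{j,n}$ with the Haar-type weight (all parameter values illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 40_000
M = int(n ** 0.5)

sigma2 = lambda t: (1.0 + 0.5 * np.sin(2 * np.pi * t)) ** 2   # spot variance

# noiseless observations (tau = 0), so the bias correction is taken as zero
t = np.arange(1, n + 1) / n
X = np.concatenate(([0.0], np.cumsum((sigma2(t) / n) ** 0.5 * rng.normal(size=n))))
Y = X

def lam_tilde(u):
    # normalized Haar-type pre-average function, Example 3 (ii)
    u = np.asarray(u)
    return np.sqrt(1.5) * (((u >= 0) & (u < 1)).astype(float)
                           - ((u >= 1) & (u <= 2)).astype(float))

j = np.arange(n + 1)
Ybar = np.array([(M / n) * np.sum(lam_tilde(M * j / n - (i - 2)) * Y)
                 for i in range(2, M + 1)])

# estimator with phi = 1 and b = 0: targets int_0^1 sigma^2
est = np.sum(Ybar ** 2)
target = 1.125   # int_0^1 (1 + 0.5 sin(2 pi t))^2 dt
```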

Since we will deal with wavelet and approximation coefficients simultaneously, let us introduce $h_{\ell k}(\cdot) := 2^{\ell/2}\, h(2^\ell \cdot - k)$ for a given function $h$ (for which we set $h = \phi$ and $h = \psi$ later on).

Assumption 4 (Assumption on h). Suppose that the function h:R→ R is compactly supported, bounded, and has piecewise Lipschitz derivative.

Furthermore, for a function class $\mathcal{D}$, we define $E_{\mathcal{D}}[\,\cdot\,] := E[\,\cdot\ \mathbb{I}_{\{\sigma^2 \in \mathcal{D}\}}]$, provided $\{\sigma^2 \in \mathcal{D}\}$ is measurable. In particular, $\mathcal{D}_1 \subset \mathcal{D}_2$ implies
$$E_{\mathcal{D}_1}[U] \le E_{\mathcal{D}_2}[U], \quad \text{for non-negative random variables } U. \tag{5.2.6}$$
First, we evaluate the (thresholded) moments of $\widehat{\langle h_{\ell k}, \sigma^2 \rangle}$. This result will allow us to obtain rates of convergence in the sense of Definition 11 for estimation of the spot volatility. Before we can do so, the precise conditions on the noise process are given.

Assumption 5 (Refinement of the noise assumption for model (1.1.3)). Let $\epsilon_{i,n}$ satisfy Assumption 1. Additionally, assume that $E[|\eta_{i,n}|^p] < \infty$ for any $p > 0$ and that the function $(x_1, x_2) \mapsto \tau(x_1, x_2)$ is continuous and bounded.

The following assumption will allow us to remove the drift in the proofs by a change of measure. It is of interest to note that this assumption is not essential for our proof. In fact, it is imposed in order to reduce the number of terms we need to estimate when we prove moment bounds later. Recall that by Definition 2, the processes $\sigma$ and $b$ are càdlàg and $\mathcal{F}_t$-adapted.

Assumption 6. Suppose that a weak solution of (1.1.1) is unique and well defined. Moreover, a weak solution to $\widetilde{X}_t = \int_0^t \sigma_s\, dW_s$ is also unique and well defined, the laws of $X$ and $\widetilde{X}$ are equivalent on $\mathcal{F}_1$, and we have, for some $\rho > 1$,
$$E\Big[ \exp\Big( \rho \int_0^1 \frac{b_s}{\sigma_s}\, dW_s \Big) \Big] < \infty.$$
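The drift removal behind Assumption 6 is a standard Girsanov argument; as a sketch (the exact form used in the proofs may differ), consider the density

```latex
\frac{d\widetilde{\mathbb{P}}}{d\mathbb{P}}
  = \exp\Big( -\int_0^1 \frac{b_s}{\sigma_s}\, dW_s
              \;-\; \frac{1}{2} \int_0^1 \frac{b_s^2}{\sigma_s^2}\, ds \Big).
```

Under $\widetilde{\mathbb{P}}$, the process $W_t + \int_0^t (b_s/\sigma_s)\, ds$ is a Brownian motion, so $X$ has the law of the driftless process $\widetilde{X}$; the exponential moment condition in Assumption 6 keeps the moments of this density under control.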

In order to state the following result, we must first introduce the empirical $L^p[0,1]$-norms with respect to the uniform measure on $\{i/M : i = 1, \ldots, M\}$, defined by
$$\|f\|_{p,M} := \Big( \frac{1}{M} \sum_{i=1}^{M} \Big|f\Big(\frac{i}{M}\Big)\Big|^p \Big)^{1/p}. \tag{5.2.7}$$
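The empirical norm (5.2.7) is straightforward to compute; a minimal helper (the name is illustrative):

```python
import numpy as np

def empirical_norm(f, p, M):
    """Empirical L^p[0,1]-norm (5.2.7) with respect to the uniform
    measure on {i/M : i = 1, ..., M}."""
    i = np.arange(1, M + 1)
    return float(np.mean(np.abs(f(i / M)) ** p) ** (1.0 / p))
```

For instance, for constant $f \equiv 1$ the norm is $1$ for every $p$ and $M$, while for $f(x) = x$ and large $M$ it approaches the continuous $L^2$-norm $3^{-1/2}$.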

Proposition 3 (Moment bounds). Suppose that Assumptions 5 and 6 hold and let $\widehat{\langle h_{\ell k}, \sigma^2 \rangle}$ be as in (5.2.4). Assume further that $h$ satisfies Assumption 4 and $2^\ell \le M = \lfloor c n^{1/2} \rfloor$. Let $s > 1/\pi$. Then, for any $p \ge 1$, $C > 0$,
$$E_{B^s_{\pi,\infty}(C)}\Big[ \big| \widehat{\langle h_{\ell k}, \sigma^2 \rangle} - \langle h_{\ell k}, \sigma^2 \rangle \big|^p \Big] \lesssim M^{-p/2} + M^{-\min\{s-1/\pi,\,1/2\}\,p}\, \|h_{\ell k}\|_{1,M}^p,$$
uniformly over $\ell, k$.

Proof. Let us first introduce some notation. In the following, $\widetilde{\lambda}$ always denotes the normalized version of a pre-average function (in the sense of Definition 18). We define the functions $\Lambda, \overline{\Lambda} : \mathbb{R} \to \mathbb{R}$, and, by using Lemma B.4, also
$$\big\| \Lambda\big( M \cdot - (i-2) \big) \big\|_2 = M^{-1/2}.$$
Moreover, for $C > 0$, we define the ball
$$L(C) := \{ f : [0,1] \to \mathbb{R},\ \|f\| \le C \}. \tag{5.2.11}$$
Some properties deduced from Assumption 4 that will be used extensively can be found in Lemma B.1. In the spirit of (5.2.2),
$$\overline{X}_{i,M} := \overline{X}_{i,M}(\lambda) \quad \text{and} \quad \overline{\epsilon}_{i,M} := \overline{\epsilon}_{i,M}(\lambda),$$
obtained by replacing $Y$ in (5.2.2) by $X$ and $\epsilon$, are the natural extensions of applying pre-averaging to $X$ and $\epsilon$.

Bounding I: In a first step we will show the moment bound (5.2.14).

Note that by the continuous embedding (2.4.8) and the identity (2.4.9) it follows that
$$B^s_{\pi,\infty}(C) \subset \mathcal{C}^{\min(s-1/\pi,\,1/2)},$$
where the last inequality follows from Lemma B.4. Let $|\operatorname{supp}(h_{\ell k})|$ denote the support length of $h_{\ell k}$. The corresponding terms are then bounded by Hölder's inequality and Lemma B.1, and further by the triangle inequality. Therefore, on the event $\sigma^2 \in B^s_{\pi,\infty}(C)$, Equation (5.2.20) applies together with Lemma B.1 (iii). Recall that by the continuous Sobolev embedding (2.4.8), $B^s_{\pi,\infty} \subset B^{s-1/\pi}_{\infty,\infty}$. The moment bound on $I$, i.e. (5.2.14), now follows by successively applying (5.2.19), (5.2.22), (5.2.23) and (5.2.24).

Bounding II: Combining Lemmas B.6, B.8, B.9 and B.10, we obtain
$$E_{B^s_{\pi,\infty}}\big[ |II|^p \big] \lesssim \|h_{\ell k}\|_{1,M}^p\, M^p n^{-p} + \|h_{\ell k}\|_{2,M}^p\, M^{-3p/2} n^{-p} + \|h_{\ell k}\|_{p,M}^p\, M^{p+1} n^{-p} \lesssim M^{-p/2},$$
where Lemma B.1 is applied for the last inequality.

Bounding III: Lemma B.7 gives the corresponding bound for $III$. By combining the estimates on parts $I$–$III$, the proof of Proposition 3 is complete.

In order to apply Theorem 2, we further need a result of the type (2.2.3). This is given in the next proposition.

Proposition 4 (Deviation bounds). Suppose that Assumptions 5 and 6 hold. Let us further suppose that $h$ satisfies Assumption 4, $s > 1/\pi$, and $M = \lfloor c n^{1/2} \rfloor$. Further assume that

(i) $M 2^{-\ell} \ge M^q$ for some $q > 0$, and
(ii) $M^{-(s-1/\pi)}\, \|h_{\ell k}\|_{1,M} \lesssim M^{-1/2}$.

Then, for $C > 0$ and $p \ge 1$, we have
$$P\Big[ \big| \widehat{\langle h_{\ell k}, \sigma^2 \rangle} - \langle \sigma^2, h_{\ell k} \rangle_{L^2} \big| \ge \kappa\, \sqrt{\log M}\, M^{-1/2} \ \text{and}\ \sigma^2 \in B^s_{\pi,\infty}(C) \Big] \lesssim M^{-\max(2,p)}$$
for a sufficiently large constant $\kappa$, where
$$\overline{C} := \sup_{\sigma^2 \in B^s_{\pi,\infty}(C)} \|\sigma^2\|_{L^\infty}.$$

If $X$ is a driftless continuous Itô semimartingale, i.e. $b = 0$, then $\kappa$ can be chosen as
$$\kappa > 4\overline{C} + 4\sqrt{2\overline{C}}\, \|\tau\|_\infty\, c\, \|\lambda\|_2\, \overline{\lambda}^{-1} + 4 \|\tau\|_\infty^2\, c^2\, \|\lambda\|_2^2\, \overline{\lambda}^{-2}. \tag{5.2.25}$$
Remark 2. Indeed $\overline{C} < \infty$, as follows from the continuous embedding (5.2.16). Moreover, in the case of high smoothness, i.e. $s - 1/\pi > 1/2$, assumption (ii) in Proposition 4 becomes trivial.