Prediction methods - Genomic Prediction for Quantitative Traits: Using Kernel Methods and Whole

parametric RBF regression methods can outperform BayesA when predicting total GVs in the presence of non-additive effects using SNP markers.

In this chapter we will demonstrate the potential of the kriging approaches applied to genomic data: As a novelty, we will suggest the family of Matérn covariance functions to reflect the functional dependency of the observed covariances from the distance of genotypes expressed as Euclidean norm. Based on this model and the assumed covariance function, we will suggest two kriging approaches. Under both models, parameters and hidden variables are estimated via maximum likelihood (ML) and BLUP of the unknowns is established by solving the corresponding linear kriging systems. All predictions can also be implemented in the form of the so-called mixed model equations (Henderson, 1973). The predictive performance of the two models will be compared to a common genomic BLUP as a reference method in a whole-genome simulation study considering various gene-action models.

Furthermore, we will show that in a limiting case the genomic covariance structure proposed byVanRaden(2008) can be considered as a covariance function with corresponding quadratic variogram. Besides we will prove theoretically that predicted GVs are only scaled by a factor if the covariance structures are linearly transformed. Finally, we will discuss further options for a more differentiated modeling using the suggested methodological approach.

3.2 Prediction methods

3.2.1 Kriging

The term kriging stems from the prediction of ore concentrations in deposits and was mainly developed by Matheron (1962, 1963) based on the master’s thesis of Krige (1951). In geostatistics, kriging is nowadays the standard approach whenever spatial prediction of a so-called regionalized variable (Matheron,1989), e.g. temperature, ozone concentration or soil moisture, has to be performed based on a few isolated measurements of the quantity. It is assumed that the regionalized variable is a realization of a random function with a certain covariance structure. Mostly, the latter is given by a parameterized covariance function (Cressie,1993), and the random function is assumed to be Gaussian.

The kriging approach consists of two steps: (i) estimation of the unknown parameters and hidden variables (in particular by ML or REML) and (ii) prediction of the values of the regionalized variables by performing a BLUP, under the auxiliary assumption that the parameter values and hidden variables estimated in the first step are the true ones.

Many variants of the general kriging principle have been discussed (Cressie,1993). The type of kriging is implied by the unbiasedness condition: In “simple kriging” it is assumed that the underlying regionalized variable has zero-mean, whereas in “universal kriging” a linear model for the mean of the underlying regionalized variable is assumed.

3.2.2 The model for polygenic and genomic data

In our further studies, we assume to have q individuals with pedigree information, n of them being genotyped and having phenotype measurements of a certain quantitative trait.

Typically, GVs have to be predicted for individuals that are genotyped, but have no phenotype data.

We use the following model for the given data:

yi=w^T_i β+z^T_i u+g(xi) +ei, i= 1, . . . n,

whereyi is a measurement of the phenotype for individuali,β is an f-vector of nuisance location parameters,xi is ap-vector of dummy SNP instance variates (genotype) observed on individuali, andg is an unknown, random function as described below. Let u∼ N(0, σ_u²A) be a q-vector of additive genetic effects of q individuals where σ_u² is the additive genetic variance due to unmarked polygenes, and A is the numerator relationship matrix. The entries of the numerator relationship matrix are twice the coefficients of coancestry between individuals. The vectorsw^T_i andz^T_i are known incidence vectors;zi is a unit vector with one component being 1 and all the others zero, indicating the respective position in the pedigree.

Let e = (e₁, . . . , e_n)^T be the vector of environmental residual effects with e ∼ N(0, σ_e²I), whereσ²_e is the environmental variance.

We assume that{g(xi),xi ∈R^p}is a Gaussian random field (Lifshits,1995) withE(g(xi)) = 0 and covariance structure given by Cov(g(x_i),g(x_j)) = E(g(x_i)g(x_j)) = K_ν,h,σ_K(x_i,x_j), where Kν,h,σK(·,·) is a covariance function depending on parameters ν, h, and σK. Let K_ν,h,σ_K = (K_ν,h,σ_K(xi,xj))1≤i,j≤n be the corresponding covariance matrix.

The family of Matérn covariance functions: For the covariance structure we suggest to use the so-called family of Matérn covariance functions, which was introduced byMatérn(1960) andHandcock & Wallis (1994), and which is defined by

Cov(g(x_i),g(x_j)) =K_ν,h,σ_K(x_i,x_j) =σ_K² · 2^1−ν Γ(ν)

√

2νkx_i−x_jk/h^νK_ν

√2νkx_i−x_jk/h.

Here,k · kis the Euclidean norm,ν >0 is a smoothness parameter,his a scale parameter,σ²_K is the variance parameter andK_ν(·) is a modified Bessel function of the second kind of order ν (Abramowitz & Stegun, 1984). The Matérn function is isotropic, in that Cov(g(x_i),g(x_j)) only depends on the Euclidean norm of the separation vectorx_i−x_j.

Matérn covariance functions build a very general class of covariance functions including special cases like the exponential (ν = 1/2) and the Gaussian (ν=∞) covariance function, the ones that have also been used byPiepho (2009). If the smoothness parameter ν is of the formm+ 1/2, where mis an integer, the Matérn function factorizes into the product of an exponential function and a polynomial of degreem,cf. Table 3.1 and Figure3.1. The best fitting parameter valueν is determined through the model-fitting approaches described below.

In matrix notation, the statistical model is

y=Wβ+Zu+g(X) +e, (3.1) where W = (w^T₁, . . . ,w^T_n)^T is an (n×f)- and Z= (z^T₁, . . . ,z^T_n)^T is an (n×q)-incidence matrix andg(X) = (g(x1), . . . , g(xn))^T. Finally, we assume that the random vectors u,e

3.2 Prediction methods 21

0.0 0.5 1.0 1.5 2.0 2.5 3.0

0.2 0.4 0.6 0.8 1.0

Matern covariance functions K with h = 1, σ_K²=1

Euclidean distance

covariance

Figure 3.1: Matérn covariance functions forh= 1, σ²_K = 1 and different values of ν. From top to bottomν=∞,10,2.5,1.5,0.5.

Table 3.1: Special cases of Matérn covariance functions

ν h Kν,h,σ_K(xi,xj) Exponential 0.5 1 σ²_K·exp(−kx_i−x_jk)

1.5 1 σ²_K·exp(−√

3kx_i−x_jk)·1 +√

3kx_i−x_jk 2.5 1 σ²_K·exp(−√

5kx_i−x_jk)·1 +√

5kx_i−x_jk+⁵₃kx_i−x_jk² Gaussian ∞ 1 exp(−¹₂kx_i−xjk²)

and g(X) are independent.

3.2.3 Two kriging approaches and a reference model

We consider two models to predict the total genetic valuez^T₀u+g(x0) of a certain genotyped individual indexed by 0. This individual belongs to the set of q individuals, but it does not have to be phenotyped. The models differ in the size of the sets of quantities that are estimated in the first kriging step and subsequently used for predictions.

Universal Kriging: Modeling of y: We exploit the fact that y has a multivariate normal distribution,

y∼ N(Wβ, σ_u²ZAZ^T +K_ν,h,σ_K +σ_e²I),

and estimate the parametersβ, σ_u, σ_e, ν, h andσ_K by maximizing the loglikelihood of the corresponding density function.

Then, we perform a best linear unbiased prediction ofg(x0) andz^T₀u,i.e. we apply the BLUP principle: To obtain ˆg(x₀) we minimize

E(ˆg(x0)−g(x0))² −→ min!

with the linear predictor ˆg(x0) = a^T_gy under the condition a_g^TW = 0. This approach is called “universal kriging” in other areas of research (Cressie,1993). In fact, the condition assuresa^T_gWβ= 0 and thereforeEg(x₀) = 0 =a_g^TWβ=Eˆg(x₀),i.e.gˆ(x₀) is unbiased. Let K₀ = (K_ν,h,σ_K(x₁,x₀), . . . , K_ν,h,σ_K(x_n,x₀))^T. The approach results in the following kriging system of equations:

W σ²_uZAZ^T +K_ν,h,σ_K +σ_e²I

0 W^T

λ a_g

K₀ 0

# .

Note that this linear system does not depend onβ. Analogously,z^T₀u can be predicted by the universal kriging estimatorz^d^T₀u=a^T_uy, whereau satisfies

W σ²_uZAZ^T +K_ν,h,σ_K +σ_e²I

0 W^T

λ au

σ_u²ZAz₀ 0

# ,

and one getsz^d^T₀u+ ˆg(x₀) as BLUP ofz^T₀u+g(x₀).

Mixed Model Equations (MME).In the animal breeding context it is well-known that a BLUP-approach for the modely=Wβ+Zu+g(X) +eis equivalent to solving the MME







W^TW W^TZ W^T Z^TW Z^TZ+^σ_σ²^e2

uA⁻¹ Z^T

W Z I+σ_e²K⁻¹_ν,h,σ





·





 βˆ

ˆ u g[(X)





=





 W^Ty

Z^Ty y





 (3.2)

for given variance components estimatede.g.by ML. For a derivation of the MME from the kriging system compare section2.2orDempfle (1982).

Simple Kriging: Joint modeling ofy, u and g(X): In the second approach we model the hidden variablesu andg(X) explicitly and consider the joint density functionfy,u,g ofy,u andg(X) which equals

f_y,u,g(X)(y,u,g(X)) =fy|u,g(X)(y)·fu(u)·fg(g(X))

=f_e(y−Wβ−Zu−g(X))·f_u(u)·f_g(g(X))

3.2 Prediction methods 23

=c·exp−1 2 ·

σ_e²ky−Wβ−Zu−g(X)k²

·exp−1 2·

σ²_uu^TA⁻¹u

·exp−1

2·^hg(X)^TK⁻¹_ν,h,σ

Kg(X)ⁱ with

c⁻¹ = (2π)^n+q/2σ_eⁿ·σ_u^q(detA)^1/2·(detK_ν,h,σ_K)^1/2.

Here, we have to estimate the parameters β, σ_u, σ_e, ν, h, σ_K and the hidden variablesu and g(X). Note that in this approach we consider u andg(X) to be parameters that have to be estimated via ML in the first kriging step. Therefore, we maximize the loglikelihoodJ of the density functionf_y,u,g,i.e. we maximize

J = log(c)−1 2 ·

σ_e²ky−Wβ−Zu−g(X)k²+ 1

σ²_uu^TA⁻¹u+g(X)^TK⁻¹_ν,h,σ

Kg(X) (3.3) with respect to β,u andg(X). Taking the derivatives with respect toβ,u andg(X) leads to the linear system given in eq. (3.2) which yields estimators for β,u and g(X). When using these estimates in eq. (3.3), the value of J depends only onσ_u, σ_e, ν, hand σ_K. Thus, J can be maximized numerically with respect to these parameters, leading to estimates forβ, σu, σe, ν, h, σK,uand g(X). According to the kriging philosophy, we now assume the values of the estimators (especially the value of the estimator for g(X)) to be the true ones, and g(x₀) is predicted via ˆg(x₀) =a_g^Tg(X) by the BLUP principle. That is, we minimize

E(ˆg(x₀)−g(x₀))² −→ min!

with the linear estimator

g(x₀) =a^T_gg(X).

This approach is called “simple kriging” (Cressie, 1990, 1993; Chilès & Delfiner, 1999). Note that ˆg(x₀) is always unbiased. The solution is

g(x₀) =K^T₀K⁻¹_ν,h,σ

Kg(X). (3.4)

Finally, the predicted GV is given byg(x\₀) +z^T₀u= ˆg(x₀) +z^T₀uˆ, where ˆu is the estimator obtained in the iterative procedure described above.

Reference model (genomic BLUP): This approach performs a genomic BLUP based on the model

y=Wβ+Zu+ ˜Xg+e, which leads to the kriging system

W σ²_uZAZ^T +σ_g²XG˜ X˜^T +σ_e²I

0 W^T

λ a

σ_u²ZAz₀+σ²_gXG˜ x˜₀ 0

and predictingz^T₀u\+ ˜x₀^Tg =a^Ty.

Here,β,e∼ N(0,σ²_eI),u∼ N(0, σ_u²A),WandZare defined as in the previous approaches.

The vector g ∼ N(0, σ_g²G) is multivariate normal with G being a genomic relationship matrix calculated by using the approach of VanRaden (2008). (For the definition of the genomic relationship matrix see the formulas in section 3.6.) The matrix ˜X is a known incidence matrix whose rows consist of unit vectors with one component being 1 and all the others zero, indicating the respective position in theg-vector. Variance components for this model are estimated via ML.

Im Dokument Genomic Prediction for Quantitative Traits: Using Kernel Methods and Whole Genome Sequence Based Approaches (Seite 29-34)