Altogether there are four different sources of errors: the statistical one stemming from the data, the error of the lattice spacingπ, the error of the literature value ofπΉ (that is needed in the fit to the HadSpec data), and the error due to the truncation of the chiral expansion. In the following, we address these kinds of errors one by one.
6.4.1 Statistical error of the data
To determine the statistical error stemming from the lattice data, jackknife samples are drawn from the data and the fit is repeated on all those samples. If several ensembles with a different number of underlying bootstrap samples are fitted at once, the data first need to be resampled via a parametric jackknife, such that each ensemble has the same number of samples.
To be precise, first focus on a single ensemble. It containsπtwo-particle energy levelsππΈπβlat, π = 1, β¦ , π, compare Eq. (6.7). In fact, as illustrated in Fig. 5.2 each of these energy levels consists of πΊ β β values, where each value corresponds to a bootstrap sample of the underlying gauge configurations: πΌπ β {ππΈβ1π , β¦ , ππΈπβπΊ}, withππΈπβlat = βπΊπ=1ππΈπβπ/πΊ the mean of those values. The corresponding entries of the covariance matrix πΆappearing in Eq. (6.8) are computed using the standard estimator
πΆππ = 1 πΊ β 1
πΊ
β
π=1
(ππΈπβπβ ππΈπβlat) (ππΈβππ β ππΈπβlat) . (6.16) To discuss the error analysis in detail, it is expedient to focus on one two-particle energy level and abstract away the details. That is, instead of πΌπ, consider for the time being a set π β {π1, β¦ , ππΊ} containing πΊoutcomes of a random variable π. Usually, to attempt an error analy-sis the bootstrap is applied. That is, π΅ β βbootstrap samplesππ β {π1π, β¦ ππΊπ}, π = 1, β¦ , π΅are generated, where eachπππis drawn randomly and uniformly with replacement fromπ(that is, the a priori probability that πππ equalsππ is1/πΊfor all π, π β 1, β¦ , πΊ, andπ β 1 β¦ π΅). On each boot-strap sample the mean ππ β βπΊπ=1πππ/πΊis computed. The variance of the bootstrap means is an
estimator of the variance of the mean. That is, for sufficiently highπ΅, 1
π΅ β 1
π΅
β
π=1
(ππβ 1 π΅
π΅
β
π=1
ππ)
2
βVar[π] (6.17)
holds.
Coming back to the fit at hand, the aforementioned procedure would be applied to each energy level πΌπ, yieldingπ΅outcomes{ππΈβlat,π1 , β¦ , ππΈπβlat,π},π = 1, β¦ , π΅. On each such outcome the fit can be repeated, each time resulting in different values of the fit parameters. Their errors can then be estimated via the covariance matrix of theπ΅different sets of parameter values.
However, in the scenario at hand this approach fails. This is due to the fact that the values ππΈπβlat are often very close to a pole of the quantization condition. The underlying setπΌπ might contain a significant number of values located on the other side of the nearby pole compared to the central valueππΈπβlat, see Fig. 5.2. Hence, some of the bootstrap samples will yield valuesππΈπβlat,π that are separated from the true meanππΈπβlat by a pole, thereby being associated with completely different values of the scattering phase. In this way, the error would be drastically overestimated.
To circumvent this problem, the bootstrap is replaced by a jackknife. Consider the general random variableπinvestigated before. TheπΊjackknife samples are generated via deleting single values ππ: Xπ β {π1, π2, β¦ , ππβ1, ππ+1, ππ+2, β¦ , ππΊ}. Again the meansππare computed and used instead of the bootstrap meansππ. The following results are readily established [167]: the mean of the means of the jackknife samples equals the mean of the original data,
1 πΊ
πΊ
β
π=1
ππ= π. (6.18)
Furthermore,
ππβ π = 1
πΊ β 1(π β ππ) (6.19)
holds, which can be used to compute the variance of the jackknife means, resulting in 1
πΊ β 1
πΊ
β
π=1
(ππβ π)2 = 1
(πΊ β 1)2Var[π]
= πΊ
(πΊ β 1)2Var[π] .
(6.20)
Here Var[π] = Var[π ]/πΊis used. Hence, to obtain the variance of the mean, the variance of the jackknife means needs to be multiplied by a factor (πΊ β 1)2/πΊ β π’. Comparison with Eq. (6.17) shows that this is precisely the factor that relates the variance of the jackknife means with the variance of the bootstrap means.
Carrying this over to the fit at hand, the procedure reads as follows: instead of bootstrap samples, πΊ jackknife samples of each energy level are drawn. That is, there are πΊ outcomes {ππΈ1,πβlat, β¦ , ππΈπ ,πβlat}, π = 1, β¦ , πΊ, where ππΈπ,πβlat denotes the mean obtained using the π-th jackknife sample. Since the variance of these means is a factor1/π’ β 1/πΊsmaller than the variance of the bootstrap means, the risk of jumping over poles in the quantization condition is drasticly reduced.
The fit is repeated on allπΊdifferent jackknife samples, the covariance matrix of the fit parameters
can then be computed in the standard way. However, according to the foregoing discussion the matrix obtained in this way needs to be multiplied by the factorπ’.
In addition to the energy levels, the pion decay constants and masses are fitted, if available.
Often, the individual values of these parameters on theπΊbootstrap samples of gauge configurations underlying the two-particle energy levels are not available. Instead, a central value with an error is given, e.g.,ππΉπlatΒ± ΞππΉπlat. To include this error in the analysis, for each jackknife sampleπ, a value ππΉπ,πlatis drawn from a normal distribution with meanππΉπlatand standard deviationΞππΉπlat/βπ’. This value is subsequently used in the fit. That is, a parametric bootstrap is employed, but with the standard deviation downscaled to match the discrepancy between Eq. (6.20) and Eq. (6.17). This is dubbed parametric jackknife in the following. The same procedure is applied to the masses πππlatΒ± Ξπππlat.
In a simultaneous fit to several ensembles, the situation is slightly more complicated. This is due to the fact that the ensembles often differ in the numberπΊof underlying bootstrap samples of gauge configurations. However, this number is hardwired into the jackknife, as the omnipresence of the factor π’ illustrates. To obtain the same factor on all ensembles, the strategy described in Ref. [87] is used: on each ensemble the covariance matrixπΆis computed via Eq. (6.16). Then, one fixes a number π½ β β of desired jackknife samples. Subsequently, on each ensemble π½ samples {ππΈ1,πβlat, β¦ , ππΈπ ,πβlat},π = 1, β¦ , π½are drawn from a multivariate normal distribution with covariance matrixπΆ/π₯ /πΊ, withπ₯ β (π½ β1)2/π½. That is, a parametric jackknife is applied again. The additional division by πΊ is necessary, for the diagonal entries of πΆ correspond (in the language developed previously in the discussion of the general random variableπ) to Var[π ], but here we are interested in Var[π], see also Eq. (6.17). The fit is then repeated on allπ½ samples and the results obtained in this way need to be rescaled by the appropriate power ofπ₯.
6.4.2 Error of the lattice spacing
As already alluded to in Sec. 6.2, the numerical value of the lattice spacingπenters the fit only via the renormalization scaleπ, if the pion decay constant is fitted. If the decay constant has not been computed on the lattice (as it is the case for the HadSpec data), the lattice spacing also enters in the translation of the literature value ofπΉinto lattice units.
Since in lattice-QCD computations everything is computed in powers of π, the determination ofπ, called scale setting, is a non-trivial task. As a matter of fact,πcannot be determined exactly, but carries an error on its own. This is why we have deliberately designed the fit in a way that the impact of the lattice spacing is as small as possible. There are several ways to set the scale.
The CLS collaboration employs two different methods [166]. Strategy 1 uses the Wilson flow as suggested by LΓΌscher [171], while strategy 2 sets the scale via combinations of decay constants.
The resulting values are incompatible within their errors, already hinting at how error-prone scale setting is. Strategy 1, the one preferred by the authors of Ref. [166], is based on a dimension two quantity π‘0 evaluated at the symmetrical point ππ’ = ππ = ππ (withππ the mass of the strange quark). In Ref. [166] the reference value is determined asπ0ref β β8π‘0 = 0.413(5)(2)fm, where the first error is statistical and the second systematical. In addition, for each lattice spacing, the value of π‘0/π2is given, denoted in the following asπ0,πlat,π = 1, 2, 3. These values are only weakly correlated, however, the correlations of the resulting values ππ = π0ref/β8π0,πlat are significant, because the reference value is common to all of them. To take this correlation into account, we parametrically draw samples of the form {π0,πref, π0,1πlat, π0,2πlat, π0,3πlat}(where π indicates the sample), compute on each sample the three lattice spacings and estimate their covariance in the standard way. In doing so,
we draw the samples from a normal distribution, adding the statistical and systematical errors in quadrature. Contrarily, we approximate the covariance matrix of strategy 2 as diagonal, with the errors given in Ref. [166]. It is worth noting that strategy 1 requires a slight shift in both the pion masses and the decay constants. However, in the computation of the CLS ππenergy levels in Ref. [36] the shifts of the pion masses were not taken into account, and thus throughout our analysis of CLS data we use non-shifted values. Strategy 2 does not suffer from this problem.
HadSpec determines the two lattice spacings via the mass πΞ© of the Ξ© baryon. To that end, ππΞ© is computed on each ensemble [82, 172]; dividing by the experimental value [9] yields the value of the lattice spacing. Again both values are correlated due to the common experimental value, which we take into account by resampling, although the resulting off-diagonal entries of the covariance matrix are suppressed by an order of magnitude compared to the diagonal ones, for the error of the experimentally determinedΞ©mass is small.
To incorporate the error of the lattice spacingsππ,π = 1, β¦ , πΎ, a corresponding fit parameter πfitπ is introduced for each lattice spacing. This fit parameter is used instead of the value obtained from the lattice to translate the renormalization scale into lattice units (and the literature value of πΉ, if required). Furthermore, the term
ππ2 β
πΎ
β
π,π=1
(ππβ ππfit) (πΆπβ1)ππ(ππβ ππfit) (6.21) is added to theπ2, withπΆπthe covariance matrix of the lattice spacings on the different ensembles.
That is, the replacement
π2β¦ π2|πβ¦πfit + ππ2 (6.22)
is performed. Via a parametric boostrap several samples {πππ}πΎπ=1 (withπlabeling the sample) are drawn from a multivariate normal distribution with covariance matrix πΆπ and the fit is repeated for each such sample. From the different fit results the errors can be obtained via the standard estimators.
6.4.3 Error of π
In the fit to the HadSpec data,πΉπ/πΉis set to theπf = 2 + 1FLAG average [47, 173β177], whileπΉπis fixed to its Particle Data Group (PDG) value [9] to obtainπΉ = 86.89(58)MeV. To include this error in the fit, a fit parameterπΉfitis introduced and the replacement
π2 β¦ π2|πΉ β¦πΉfit + (πΉ β πΉfit ΞπΉ )
2
(6.23) is performed.
6.4.4 Truncation error
Lastly, we address the error arising due to the truncation of the chiral expansion at NLO or NNLO.
To this end, we utilize the approach of Ref. [178]. Consider a perturbative expansion of some quantityπin powers of an expansion parameterπΌ βͺ 1:
π = π0+ π1+ π2+ β― =
β
β
π=0
ππ, ππ = π (πΌπ) . (6.24)
In practice, the expansion often needs to be truncated. The question arises how to estimate the error Ξππassociated with the truncation at orderπwithout calculating terms of higher order. Noting that missing terms after a truncation are at least one order inπΌhigher, the idea is to use the recursion
Ξππ =max{πΌΞππβ1, πΌ |ππβ ππβ1|} , π β₯ 0, (6.25) withΞπβ1 β 0andπβ1 β 0for notational convenience. This recursion can be solved to obtain
Ξππ =max
π
β
π=0
{πΌπβπ+1|ππβ ππβ1|} . (6.26)
That is, we compute all differences of two adjacent terms multiplied by the appropriate power ofπΌ and maximize, with the absolute values inserted to obtain positive quantities.
In the scenario at hand, the ChPT expansion of the π wave has two expansion parameters:
πΌ1 β ππ2/ππ2 and πΌ2 β π /ππ2. Here the breakdown scale is set to ππ, for theπ is the lightest resonance in this partial wave and plain ChPT does not incorporate resonances. While the IAM improves theπ dependence via unitarization and allows for a description of theπ, it does nothing to improve theππdependence of the amplitude. Hence, we take onlyπΌ1into account. Noting that the NLO IAM corresponds to the lowest order in our formalism and the NNLO IAM to the subsequent order, Eq. (6.26) yields
ΞπNLO = πΌ1|πNLO| ,
ΞπNNLO =max{πΌ12πNLO, πΌ1|πNLOβ πNNLO|} , (6.27) with π(N)NLO the quantity as obtained from the (N)NLO IAM. This is an educated guess of the truncation error; in particular, due to the non-perturbative nature of the IAM an observable does not really decompose likeπ = πNLO+ πNNLO+ β¦