A Numerical Approach - Numerical Nonlinear Algebra

In this section, we explain our use of numerical nonlinear algebra to obtain Theorem* 6.1 below. We refrain from stating this result as a theorem since we currently cannot certify the last step of our computation. We add the asterisk to acknowledge this gap.

Theorem* 6.1. The degree of the orbit closure of a general cubic surface under the action ofPGL(C,4)is 96120.

All computations performed to arrive at this result are available from the authors upon request.

To compute the degree of the orbit closure, we sample a general cubic surface f ∈P¹⁹ by drawing the real and imaginary parts of each of its coordinates independently from a univariate normal distribution. We then solve the polynomial system (6.2) encoding the enumerative geometry problem. A naive strategy is to sample 15 points p1, . . . , p15 ∈ P³ in general position and use a total degree homotopy, but in this case the Bézout bound is 3¹⁵= 14,348,907. Here, the monodromy method described in Chapter 3.5 is substantially more efficient.

To apply the monodromy method, we consider (6.2) as a polynomial system on the entries ofφ parameterized by 15 pointsp₁, . . . , p₁₅ in P³. We consider the incidence variety

V ={(φ,(p₁, . . . , p₁₅))∈P¹⁵×(P³)¹⁵|F(φ(p_i)) = 0, i= 1, . . . ,15}

and we denote byπ the projectionP¹⁵×(P³)¹⁵→(P³)¹⁵ restricted to V.

We find a start pair (φ₀;p₁, . . . , p₁₅) ∈ V and then we use the monodromy action on the fiber π⁻¹(p1, . . . , p15) to find all solutions in this fiber. Such a start pair can be found by exchanging the role of variables and parameters. First, we sample a φ₀ ∈ P¹⁵ and the first three coordinates of 15 points p_i ∈ P³ in general position. This yields a system of 15 polynomials each depending only on a unique variable: The ith polynomial depends only on the fourth coordinate of pi. Such a system is easy to solve. Solving it yields a start pair (φ₀;p₁, . . . , p₁₅) ∈V, on which we run the monodromy method implemented in HomotopyContinuation.jl. In less than an hour, this method found 96120 approximate solutions corresponding to the start points p1, . . . , p15 ∈ P³. Applying the certification routine implemented in HomotopyContinuation.jland described in Chapter 5, we certify in less than 5 minutes that our approximate solutions correspond to 96120 distinct isolated solutions of the system.

Remark 6.2. The article [BIMTW20] on which this chapter is based performed the certifi-cation using the software alphaCertified [HS12]. However, we were only able to obtain a certificate using floating-point arithmetic due to computational limits. This certificate is not rigorous due to possible floating-point errors. For a rigorous certificate, it would have been necessary to use rational arithmetic. The computation for the floating-point certificate needed 5 hours and 80GB of memory. The new certification routine described in Chapter 5 allowed us to close this gap. In contrast to the alphaCertified computation, it only needed around one minute and substantially less memory.

The certification process establishes a lower bound on the degree of the orbit closure.

As the last step, we run a trace test to verify that we have indeed found all solutions. For this, we construct a linear subspace Lfrom the 15 pointsp₁, . . . , p₁₅such that our solutions from the monodromy computation are also solutions to (6.1). The result of the trace test is theoretically zero if we found all solutions. In our numerical computation, we obtained a value on the order of the machine precision giving us very high confidence that we found indeed all solutions. We conclude that the degree of the orbit closure of a general cubic surface under the action PGL(C,4)is 96120.

We note that as a test of our methods, we confirmed known degrees of other varieties. In agreement with a theoretical result of Aluffi and Faber [AF93], we computed that the degree of the orbit closure of a general quartic curve in the plane is 14280. Additionally, we computed that the degree of the orbit closure of the Cayley cubic, defined by the equationyzw+xzw+ xyw+xyz = 0, is 305. Due to the symmetry of the variables in the Cayley cubic, there are 4! matrices corresponding to every polynomial in the orbit. As expected, we computed 7320 = 4!·305solutions. This coincides with a theoretical result of Vainsencher [Vai03].

6.3 Conclusion

In this chapter, we computed the degree of the orbit closure of the action of the projective linear group PGL(C,4) on cubic surfaces parameterized by points in P¹⁹ using methods from numerical nonlinear algebra. Our result is that the degree is 96120. To compute and verify this degree, we used the monodromy method, a trace test and the certification of isolated solutions. The monodromy method allowed us to quickly compute the 96120 isolated solutions corresponding to the degree of the orbit closure. The certification of the computed 96120 isolated solutions established a rigorous lower bound on the degree of the orbit closure. To strengthen our result, we also employed a trace test to check that we did not miss any solutions with the monodromy method. Unfortunately, the trace test is not rigorous as discussed in Section 3.6. This chapter illustrates the importance of finding a rigorous trace test. Given a rigorous trace test, we could drop the asterisks from Theorem* 6.1.

7 Estimating Linear Covariance Models

This chapter is based on the article “Estimating linear covariance models with numerical nonlinear algebra” [STZ20] by Bernd Sturmfels, Sascha Timme and Piotr Zwiernik. The article appeared in Algebraic Statisticsvolume 11 number 1.

In this chapter, we demonstrate the application of numerical nonlinear algebra in statistics.

In many statistical applications, the covariance matrix Σ has a special structure. A natural setting is that one imposes linear constraints on Σ or its inverse Σ⁻¹. We here study models for Gaussians whose covariance matrix Σ lies in a given linear space. Such linear Gaussian covariance models were introduced by Anderson [And70]. He was motivated by the Toeplitz structure of Σ in time series analysis. Recent applications of such models include repeated time series, longitudinal data, and a range of engineering problems [Pou99]. Other occurrences are Brownian motion tree models [SUZ20], as well as pairwise independence models, where some entries of Σare set to zero.

The literature on estimating a covariance matrix is extremely rich. Its development has been particularly dynamic in high-dimensional statistics under sparsity assumption on Σ or its inverse; see [FLL16] for an overview. Although the sample covariance matrix is known to have poor statistical properties, for many Gaussian models the maximum likelihood estimator (MLE) remains an important reference point.

Maximum likelihood estimation for linear covariance models is a nonlinear algebraic op-timization problem over a spectrahedral cone, namely the convex cone of positive definite matricesΣthat satisfy the linear constraints of interest. The objective function is not convex and can have multiple local maxima. Yet, if the sample size is large relative to the dimension, then the problem is essentially convex. This was shown in [ZUR17]. In general, however, the MLE problem is poorly understood, and there is a need for accurate methods that reliably identify all local maxima.

Nonlinear algebra furnishes such a method, namely solving the score equations [Sul18, Sec-tion 7.1] using numerical homotopy continuaSec-tion [SW05]. This is guaranteed to find all critical points of the likelihood function and hence all local maxima. A key step is the knowledge of the maximum likelihood degree (ML degree). This is the number of complex critical points. The ML degree of a linear covariance model is an invariant of a linear space of symmetric matrices which is of interest in its own right.

Our presentation is organized as follows. In Section 7.1, we introduce various models to be studied, ranging from generic linear equations to colored graph models. In Section 7.2, we discuss the maximum likelihood estimator as well as the dual maximum likelihood estimator.

Starting from [Sul18, Proposition 7.1.10], we derive a convenient form of the score equations.

The natural point of entry for an algebraic geometer is the study of generic linear constraints.

This is our topic in Section 7.3. We compute a range of ML degrees and we compare them

to the dual degrees in [SU10, Section 2.2].

In Section 7.4, we present our software LinearCovarianceModels.jl. This is written in Julia and it is easy to use. It computes the ML degree and the dual ML degree for a given subspaceL, and it determines all complex critical points for a given sample covariance matrix S. Among these, it identifies the real and positive definite solutions, and it then selects those that are local maxima. The package is available at

https://github.com/saschatimme/LinearCovarianceModels.jl (7.1) and rests onHomotopyContinuation.jl.

Section 7.5 discusses instances where the likelihood function has multiple local maxima.

This is meant to underscore the strength of our approach. We then turn to models where the maximum is unique and the MLE is a rational function.

In Section 7.6, we examine Brownian motion tree models. Here the linear constraints are determined by a rooted phylogenetic tree. We study the ML degree and dual ML degree. We show that the latter equals one for binary trees, and we derive the explicit rational formula for their MLE. A census of these degrees is found in Table 7.6.

Im Dokument Numerical Nonlinear Algebra (Seite 78-82)