• Keine Ergebnisse gefunden

Definition 2.6.6. Let A ⊆ Rn≥0 be a subset and let w ∈ Nn be a weight vector.

(a) Letα∈A. If|α|w <|β|w for allβ ∈A\{α}, then we say thatwisolates α in A.

(b) If there exists α ∈ A such that w isolates α in A, then w is called isolating for A.

Let d ≥0 and let f ∈ K[x] be a non-zero polynomial of degree at most d. Then the logarithmic support A := LSupp(f) ⊂ Nn is non-empty. If a weight vector w∈Nn isolates some α∈A, then the univariate polynomial

f zw1, . . . , zwn

∈K[z],

is non-zero, because it has a non-zero monomial of degree|α|w. Note that the Kronecker substitution (2.6.1) yields a weight vector w := 1, D, . . . , Dn−1 for A, though, with entries exponential in d. We are interested in weights of magnitude poly(n, d). The following lemma demonstrates that a weight vector which is randomly chosen from [2nd]n is isolating for A with high probability.

Lemma 2.6.7 (Isolating Lemma, [KS01, Lemma 4]). Let d, N ≥1, and let A⊂Nn such that |α| ≤d for all α∈A. Then we have

Pr

w∈[N]n

w is isolating for A

≥1−nd N .

A suitable derandomization of the Isolating Lemma (see for example [AM08]) would imply a deterministic polynomial-time identity test for arith-metic circuits of polynomial degree.

Finally, we remark that it is easy to obtain an isolating weight vector for A ⊂ Nn if the convex polytope Conv(A) has few vertices. We will exploit this fact in Section 3.2.3.

given arithmetic circuit. Algorithms of this kind are referred to as blackbox algorithms. Blackbox algorithms require the computation of a hitting set according to the following definition.

Definition 2.7.1. Let C ⊆K[x] be a set of polynomials. A set H ⊆Kn is called a hitting set for C if for all non-zero C ∈ C there exists a ∈ H such that C(a)6= 0.

Example 2.7.2. Let us give two examples of hitting sets.

(a) Letd≥0 and letS ⊆K be a subset such that|S| ≥d+ 1. ThenSn is a hitting set forK[x]≤dby Theorem 2.5.4. The size of this hitting set is ex-ponential in the general setting. However, for polynomial-degree circuits with constantly many variables, we obtain a polynomial-size hitting set.

(b) LetK =Qand letCn,s be the set of arithmetic circuitsC overQ[x] such that size(C) ≤ s. Then the proof of Theorem 2.6.3 yields a hitting set for Cn,s consisting of a single point. The coordinates of this point have bit-size exponential in s.

Existence of small hitting sets

The existence of small hitting sets was proven by Heintz & Schnorr [HS80a]

(in their paper, hitting sets are called “correct test sequences”) for fields of characteristic zero. Here we reproduce their proof, but replace a result they use from [HS80b] by a simpler argument that works for arbitrary fields. This argument is inspired by the proof of [SY10, Theorem 3.1]. We will require some machinery from algebraic geometry which we cover in Appendix A.4.

Theorem 2.7.3. Let 1 ≤ n ≤ s and let d ≥ 1. Let K be a field and let S ⊆K be an arbitrary subset with |S| ≥(2sd+ 2)2. Denote by Cn,d,s the set of arithmetic circuits C over K[x]such that fdeg(C)≤d and |C| ≤s. Then there exists a hitting set Hn,d,s ⊆Sn for Cn,d,s such that |Hn,d,s| ≤9s.

Proof. Let y = {y1, . . . , ys} be new variables and let Sn,d,s be the set of constant-free arithmetic circuits C over K[x,y] such that fdeg(C) ≤ d and

|C| ≤ s. Obviously, every circuit in Cn,d,s can be obtained from a cir-cuit in Sn,d,s by substituting constants for the y-variables. There are at most s2s connected, directed multigraphs C with V(C) ⊆ [s] and |C| ≤ s, and the vertices of each such multigraph can be labeled by the symbols {+,×, x1, . . . , xn, y1, . . . , ys} in at most (2s+ 2)s different ways. Therefore, we have |Sn,d,s| ≤(2s+ 2)3s.

Set t:= n+dd

and letxα1, . . .xαt ∈T(x) be the terms of degree at most d. We identify a polynomial f = Pt

i=1ci·xαi ∈ K[x]≤d with its vector of

coefficients (c1, . . . , ct)∈Kt, henceCn,d,s ⊆Kt. LetC ∈ Sn,d,s be a constant-free circuit. WriteC =Pt

i=1ci·xαi withci ∈K[y]. The coefficientsci define a morphism

ϕC: Ks →Kt, a7→(c1(a), . . . , ct(a))

with deg(ϕC) ≤ d. Let YC ⊆ Kt be the Zariski closure of ϕC(Ks). Since dim(Ks) = s, we have dim(YC) ≤ s. By Lemma 2.7.4 below, we obtain degKt(YC)≤ds. The affine variety

Yn,d,s:= [

C∈Sn,d,s

YC ⊆Kt

containsCn,d,s and satisfies dim(Yn,d,s)≤max{dim(YC)|C ∈ Sn,d,s} ≤s and degKt(Yn,d,s)≤ X

C∈Sn,d,s

degKt(YC)

≤ |Sn,d,s| ·max

degKt(YC)|C ∈ Sn,d,s

≤(2s+ 2)3sds.

Set m := 9s. We want to show that there exists a tuple of points (a1, . . . ,am)∈Smn such that for all non-zeroC ∈ Cn,d,s there exists i∈[m]

such thatC(ai)6= 0. This tuple will then constitute a desired hitting set.

Consider the affine variety X :=

(f,a1, . . . ,am)∈Kt+mn|f ∈Yn,d,s and f(ai) = 0 for all i∈[m] . Fori∈[t] and (j, k)∈[m]×[n], letzi and zj,k be the coordinates ofKt+mn. Then X is defined by the polynomial equations for Yn,d,s and

t

X

i=1

zi·zj,1αi,1· · ·zj,nαi,n, j ∈[m].

By Theorem A.4.7, we have

degKt+mn(X)≤degKt+mn(Yn,d,s)·(d+ 1)m ≤(2s+ 2)3sds(d+ 1)m. Define the projections π1: Kt+mn Kt and π2: Kt+mn Kmn to the first t and last mn coordinates, respectively. Let C1, . . . , C` ⊆ Kt+mn be all irreducible components C ⊆ X such that π1(C) contains a non-zero polynomial, and set C := S`

i=1Ci. Then π2(C)∩ Smn contains all tuples (a1, . . . ,am)∈Smn that do not constitute a hitting set for Cn,d,s.

Let i∈[`] and let f ∈π1(Ci) such that f 6= 0. Then π1−1(f) ={f} × VKn(f)× · · · × VKn(f)

| {z }

mtimes

,

hence dim(π1−1(f)) = m(n−1). Applying Lemma A.4.2 to the morphism π1: Ci →Yn,d,s, we obtain

dim(Ci)≤dim(π−11 (f)) + dim(Yn,d,s)≤m(n−1) +s.

This implies dim(C)≤max{dim(Ci)|i∈[`]} ≤m(n−1) +s.

Now define the hypersurfaces

Hj,k :=VKt+mn Q

c∈S(zj,k−c) for all j ∈[m] andk ∈[n], and set H :=T

(j,k)∈[m]×[n]Hj,k. Then we have

2(C)∩Smn|=|π2(C∩H)|

≤degKt+mn(C∩H)

≤degKt+mn(C)·max{degKt+mn(Hj,k)|j ∈[m], k ∈[n]}dim(C)

≤degKt+mn(X)· |S|dim(C)

≤(2s+ 2)3sds(d+ 1)m· |S|m(n−1)+s

≤ |S|2s+m/2· |S|m(n−1)+s

=|S|−m/6· |S|mn,

where the second inequality follows from Corollary A.4.8. This implies the existence of a tuple (a1, . . . ,am) ∈ Smn that constitutes a hitting set for Cn,d,s.

In the proof of Theorem 2.7.3 we used the following lemma for bounding the degree of the image of a morphism.

Lemma 2.7.4. Let X ⊆ Ks be an irreducible affine variety, let Y ⊆Kt be an affine variety, and letϕ:X →Y be a dominant morphism. Then we have

degKt(Y)≤degKs(X)·deg(ϕ)dim(Y).

Proof. The following argument is contained in the proof of [HS80b, Lemma 1]. Set r := dim(Y). Since ϕ is dominant, Y is irreducible. By Theorem A.4.4, ϕ(X) is a constructible set, so by Lemma A.4.3 it contains a non-empty open subset of Y. Therefore, by Lemma A.4.6, there exist affine hyperplanes H1, . . . , Hr⊂Kt such that degKt(Y) = |ϕ(X)∩H1∩ · · · ∩Hr|.

Thenϕ−1(Hi)⊂Ks is an affine hypersurface with degKs−1(Hi))≤deg(ϕ) for all i∈[r]. By Theorem A.4.7, we obtain

degKs X∩ϕ−1(H1)∩ · · · ∩ϕ−1(Hr)

≤degKs(X)·deg(ϕ)r.

Let C1, . . . , Cm ⊆ Ks be the irreducible components of the affine variety X∩ϕ−1(H1)∩ · · · ∩ϕ−1(Hr). Since the map

ϕ: X∩ϕ−1(H1)∩ · · · ∩ϕ−1(Hr)→ϕ(X)∩H1∩ · · · ∩Hr is surjective andϕ(Ci) is a singleton for all i∈[m], we get

degKt(Y) =|ϕ(X)∩H1∩ · · · ∩Hr|

≤m

m

X

i=1

degKs(Ci)

= degKs X∩ϕ−1(H1)∩ · · · ∩ϕ−1(Hr)

≤degKs(X)·deg(ϕ)r, finishing the proof.

Polynomial-space computation of hitting sets

Using quantifier elimination, the proof of Theorem 2.7.3 can be turned into a polynomial-space algorithm for the computation of small hitting sets. For an introduction to quantifier elimination, see [BPR06, Chapter 1].

Theorem 2.7.5. Let K = Q or K = Fq for some prime power q. Then there exists a Turing machine that, given 1≤n ≤s and d ≥1, computes in poly(s)-space a hitting set Hn,d,s ⊆ Sn for Cn,d,s of size |Hn,d,s| ≤ 9s, where S ⊂K is a subset such that bs(c) = poly(logs,logd) for all c∈S.

Proof sketch. The description of the asserted Turing machineM is as follows.

If K = Q, then M sets S ← [(2sd + 2)2] ⊂ K. If K = Fq for some prime power q, then M constructs the smallest field extension L/K such that |L| ≥ (2sd + 2)2 and picks a subset S ⊆ L of size |S| = (2sd+ 2)2. In both cases, we obtain a subset S ⊆ K such that |S| = (2sd + 2)2 and bs(c) = poly(logs,logd) for all c∈S.

Next, M sets m ← 9s and checks for all m-subsets H ⊆ Sn whether H is a hitting set for Cn,d,s as follows. As in the proof of Theorem 2.7.3, let y={y1, . . . , ys}be new variables and letSn,d,s be the set of all constant-free arithmetic circuits C over K[x,y] such that fdeg(C) ≤ d and |C| ≤ s. Let

Cn,d,s be the set of arithmetic circuits C over K[x] such that fdeg(C) ≤ d and |C| ≤s, thusCn,d,s ⊆ Cn,d,s. ThenH is a hitting set forCn,d,s if and only if the sentence

^

C∈Sn,d,s

∀c∈Ks

∀b∈Kn C(b,c) = 0

∨ _

a∈H

C(a,c)6= 0

(in the first-order theory of algebraically closed fields) is true. Note that the sentence in the innermost parentheses is just another way of saying that C(x,c) ∈ K[x] is the zero polynomial. Using quantifier elimination, M checks the truth of the sentence

∀c∈Ks

∀b∈Kn C(b,c) = 0

∨ _

a∈H

C(a,c)6= 0

for all C ∈ Sn,d,s. Since the number of quantifier alternations is constant, this can be done in poly(s)-space [Ier89].

By Theorem 2.7.3, M will eventually find a hitting set Hn,d,s ⊆ Sn for Cn,d,s of size |Hn,d,s| ≤ 9s. By reusing space, the algorithm can be imple-mented to run in poly(s)-space.

Connections to lower bounds

The following simple theorem demonstrates that small hitting sets imply lower bounds (cf. [HS80a, Theorem 4.5]). See also [Agr05] for a similar result.

Theorem 2.7.6. Let 1≤ n ≤s and let d ≥1. Let K be a field, let S ⊆ K be a subset, and let K0 ⊆ K be the prime field of K. Denote by Cn,d,s the set of arithmetic circuits C over K[x] such that fdeg(C) ≤ d and |C| ≤ s.

Assume that Hn,d,s ⊆Sn is a hitting set for Cn,d,s of size m :=|Hn,d,s|.

If m≤ n+dd

−1, then there exists a non-zero polynomial f ∈K0(S)[x]≤d

with sp(f)≤m+ 1 such that f /∈ Cn,d,s.

Proof. The proof is by interpolation. Denote Hn,d,s = {a1, . . . ,am}, let t1, . . . , tm+1 ∈ T(x)≤d be distinct terms, and let y = {y1, . . . , ym+1} be new variables. Consider the homogeneous system of linear equations

t1(ai)·y1+· · ·+tm+1(ai)·ym+1 = 0, i∈[m],

with indeterminatesy and coefficients inK0(S). Since this system has more variables than equations, there exists a non-zero solution (c1, . . . , cm+1) ∈ K0(S)m+1. The polynomialf :=Pm+1

i=1 ci·ti has the desired properties.

Linear Independence Techniques

This chapter deals with the theme of linear independence. First we present the Alternant Criterion for linear independence of polynomials. Using tech-niques from the existing literature, we give constructions of rank-preserving homomorphisms for linear forms, sparse polynomials, and products of linear forms. On the way, we encounter hitting set constructions for sparse poly-nomials and ΣΠΣ-circuits with constant top fan-in. Using isolating weight vectors, we generalize the hitting sets for sparse polynomials to polynomi-als whose Newton polytope can be decomposed into sparse polytopes. All constructions will be independent of the field of constants. Finally, we out-line that out-linear independence testing and the computation of out-linear rela-tions is (more or less) equivalent to PIT. In this context, we extend the polynomial-time PIT algorithm [RS05] for set-multilinear ΣΠΣ-circuits (with unbounded top fan-in) to an algorithm for computing the linear relations of set-multilinear ΠΣ-circuits.

Chapter outline

This chapter is organized as follows. Section 3.1 contains a criterion for lin-ear independence of polynomials. In Section 3.2 we define rank-preserving homomorphisms and give explicit constructions of rank-preserving homo-morphisms and hitting sets for several circuit classes. We summarize those results in Section 3.2.5. Section 3.3 deals with the linear independence testing problem. Finally, in Section 3.4, we investigate the complexity of computing linear relations.

41

3.1 Linear Independence

In this section we introduce a bit of notation connected with linear indepen-dence and present a criterion for linear indepenindepen-dence of polynomials.

LetK be a field, letAbe aK-vector space, and leta1, . . . , am ∈A. Then LinRelK(a1, . . . , am) :=

λ∈Km1a1+· · ·+λmam = 0 (3.1.1) is a K-subspace of Km and is called the subspace of linear relations of a1, . . . , am over K. It is the kernel of the K-linear epimorphism

Km → ha1, . . . , amiK, λ 7→λ1a1+· · ·+λmam. For a subset S⊆A, we define therank of S over K as

rkK(S) := dimK hSiK

∈N∪ {∞}. (3.1.2) We are primarily interested in the case whereAis a polynomial ring overK.

3.1.1 The Alternant Criterion

Let K be a field and let K[x] = K[x1, . . . , xn] be a polynomial ring over K. The following theorem contains a criterion for linear independence of polynomials in K[x] if the field K is sufficiently large.

Theorem 3.1.1 (Alternant Criterion). Let K be an infinite field and let f1, . . . , fm ∈ K[x] be polynomials. Then f1, . . . , fm are K-linearly indepen-dent if and only if there exist points a1, . . . ,am ∈Kn such that

det fi(aj)

1≤i,j≤m 6= 0.

Proof. By Theorem 2.5.4, this follows from Lemma 3.1.2 below.

The Alternant Criterion is based on the following assertion which ap-peared in the proof of [Kay10, Lemma 8].

Lemma 3.1.2. Let f1, . . . , fm ∈K[x] be polynomials. Define the matrix

A:=

f1(t1,1, . . . , t1,n) · · · fm(t1,1, . . . , t1,n)

... ...

f1(tm,1, . . . , tm,n) · · · fm(tm,1, . . . , tm,n)

∈K[t]m×m,

where t ={ti,j| i∈[m] and j ∈[n]} are new variables. Then f1, . . . , fm are K-linearly independent if and only if det(A)6= 0.

Proof. By a linear algebra argument, f1, . . . , fm are K-linearly independent if and only if they are K-linearly independent. Therefore, we may assume that K is infinite.

First let f1, . . . , fm be K-linearly dependent. Then the columns ofA are K(t)-linearly dependent, hence det(A) = 0.

Conversely, assume thatf1, . . . , fm are K-linearly independent. We show det(A) 6= 0 by induction on m. The case m = 1 is obvious, so let m ≥ 2.

Expanding det(A) by the last row, we get det(A) =

m

X

j=1

(−1)j+m·fj(tm,1, . . . , tm,n)·det(Am,j), (3.1.3)

whereAm,j ∈K[t\{tm,1, . . . , tm,n}](m−1)×(m−1) is obtained fromAby deleting them-th row andj-th column. By induction hypothesis, we have det(Am,1)6=

0. Since K is infinite, Theorem 2.5.4 implies that there exist ci,k ∈ K for i∈[m−1] andj ∈[n] such that (det(Am,1))(c)6= 0, where c= (ci,k). Since f1(tm,1, . . . , tm,n), . . . , fm(tm,1, . . . , tm,n) are K-linearly independent, (3.1.3) implies (det(A))(c)6= 0, hence det(A)6= 0.