This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The 4 ×\times 4 orthostochastic variety

Justin Chen School of Mathematics, Georgia Institute of Technology, Atlanta, Georgia, 30332 U.S.A. [email protected]  and  Papri Dey Department of Mathematics, University of Missouri, Columbia, Missouri, 65211 U.S.A. [email protected]
Abstract.

Orthostochastic matrices are the entrywise squares of orthogonal matrices, and naturally arise in various contexts, including notably definite symmetric determinantal representations of real polynomials. However, defining equations for the real variety were previously known only for 3×33\times 3 matrices. We study the real variety of 4×44\times 4 orthostochastic matrices, and find a minimal defining set of equations consisting of 6 quintics and 3 octics. The techniques used here involve a wide range of both symbolic and computational methods, in computer algebra and numerical algebraic geometry.

Key words and phrases:
orthostochastic matrices, real algebraic varieties, numerical algebraic geometry
2010 Mathematics Subject Classification:
14Q15, 14P05, 15B51, 68W30

1. Introduction

A real square matrix is called orthostochastic if it is the entrywise square of an orthogonal matrix. Explicitly, A=(aij)n×nA=(a_{ij})\in{\mathbb{R}}^{n\times n} is orthostochastic if there exists an orthogonal matrix V=(vij)V=(v_{ij}) with aij=vij2a_{ij}=v_{ij}^{2} for all i,j=1,,ni,j=1,\ldots,n. It follows immediately that an orthostochastic matrix is doubly stochastic (i.e. has nonnegative entries and all row and column sums equal to 1), as all rows and columns of an orthogonal matrix are unit vectors.

As an interesting and special class of doubly stochastic matrices, orthostochastic (and their unitary generalization, unistochastic) matrices arise naturally in a number of contexts, including spectral theory [13, 14], convex analysis [1, 11], and physics [3, 7]. More recently – and of interest in algebraic geometry – it has been shown that orthostochastic matrices are deeply connected to definite symmetric determinantal representations of real polynomials. Indeed, for hyperbolic plane curves, every monic symmetric determinantal representation arises from certain associated orthostochastic matrices, which yields an effective algorithm [5, 8] for computing such representations for cubic curves.

It is thus of interest to find intrinsic characterizations of the set of orthostochastic matrices. One approach to this is: by definition, the set of orthostochastic matrices is the image of an algebraic variety (the real orthogonal group) under a polynomial map (coordinate-wise squaring) – thus the Zariski closure of the image is a real algebraic variety. Our goal then is to find equations for this Zariski closure, which we refer to as finding equations for the orthostochastic variety. The equations of the 3×33\times 3 orthostochastic variety are known, which made the computation of determinantal representations of cubic curves possible, but for matrices of size 4\geq 4 no set of equations which define the orthostochastic variety were previously known. In view of this, our main result is:

Theorem 1.

The 4×44\times 4 orthostochastic variety is defined set-theoretically by 6 quintics and 3 octics, which are known explicitly and defined over {\mathbb{Z}}.

We outline the remainder of the article: in Section 2 we formalize the set-up and notation, and review some basic invariants of the varieties under consideration. In Section 3 we give a procedure for obtaining a naive set of equations which always cut out a superset of the orthostochastic variety, and see how dimension counts give a simple proof of equality in the case n=3n=3. In Section 4 we consider the case n=4n=4, and detail the techniques used to find the equations in Theorem 1. Section 5 concludes with some remarks and remaining questions.

2. Setup and basic invariants

We begin by setting up some notation which will be used in the remainder of the article. We reserve nn to denote the size of a square matrix, and all matrices are considered to have real entries. For nn\in{\mathbb{N}}, let O(n)\operatorname{O}(n) (resp. SO(n)\operatorname{SO}(n)) denote the group of n×nn\times n orthogonal (resp. special orthogonal) matrices, which is a real algebraic variety.

Next, the set of doubly stochastic n×nn\times n matrices is defined by the linear conditions iaij=jaij=1\sum_{i}a_{ij}=\sum_{j}a_{ij}=1, aij0a_{ij}\geq 0, for all i,j=1,,ni,j=1,\ldots,n. This can be interpreted as saying that an n×nn\times n doubly stochastic matrix is uniquely determined by any (n1)×(n1)(n-1)\times(n-1) submatrix obtained by deleting a row and column. Moreover, recall that the set of n×nn\times n doubly stochastic matrices equals the convex hull of all n×nn\times n permutation matrices, which is the so-called Birkhoff polytope n{\mathcal{B}}_{n}, of dimension (n1)2(n-1)^{2}.

We now introduce the varieties in question. Identifying the space of n×nn\times n real matrices with n2n^{2}-dimensional affine space 𝔸n2{\mathbb{A}}^{n^{2}}_{{\mathbb{R}}}, we have the coordinate-wise squaring map

ϕ:𝔸n2𝔸n2\displaystyle\phi:{\mathbb{A}}^{n^{2}}\to{\mathbb{A}}^{n^{2}}
(aij)(aij2)\displaystyle(a_{ij})\mapsto(a_{ij}^{2})

Restriction to O(n)\operatorname{O}(n) gives a map O(n)𝔸n2\operatorname{O}(n)\to{\mathbb{A}}^{n^{2}}, whose image lies in n{\mathcal{B}}_{n} (note that the image of ϕ\phi is contained in the non-negative orthant 𝔸0n2{\mathbb{A}}^{n^{2}}_{\geq 0}). Next, the coordinate projection π:𝔸n2𝔸(n1)2\pi:{\mathbb{A}}^{n^{2}}\to{\mathbb{A}}^{(n-1)^{2}}, given by projecting onto the upper-left (n1)×(n1)(n-1)\times(n-1) submatrix, is injective on n{\mathcal{B}}_{n}, so we may compose with π\pi to obtain πϕ|O(n):O(n)𝔸(n1)2\pi\circ\phi\big{|}_{\operatorname{O}(n)}:\operatorname{O}(n)\to{\mathbb{A}}^{(n-1)^{2}}. Finally, for both theoretical and practical reasons it is convenient to work in projective space, so taking the projective closure of the image yields the map

φ:O(n)(n1)2\varphi:\operatorname{O}(n)\to{\mathbb{P}}^{(n-1)^{2}}_{{\mathbb{R}}}

We set Zn:=φ(O(n))¯Z_{n}:=\overline{\varphi(\operatorname{O}(n))}, the Zariski-closure of the image of φ\varphi, which is a projective variety in (n1)2{\mathbb{P}}^{(n-1)^{2}}_{{\mathbb{R}}}. Concretely, we view ZnZ_{n} as the projective closure of the set of (n1)×(n1)(n-1)\times(n-1) matrices which are the upper-left submatrix of an n×nn\times n orthostochastic matrix. In this way the linear equations which define (the linear span of) n{\mathcal{B}}_{n} are already accounted for in ZnZ_{n}, and furthermore, the reduction of variables from n2n^{2} to (n1)2+1(n-1)^{2}+1 will be valuable for computation.

Introducing coordinates, we have that the map φ\varphi above corresponds algebraically to the ring map

F:[y(1,1),,y(n1,n1),s][x1,1,,xn,n]/IO(n)F:{\mathbb{R}}[y_{(1,1)},\ldots,y_{(n-1,n-1)},s]\to{\mathbb{R}}[x_{1,1},\ldots,x_{n,n}]/I_{\operatorname{O}(n)}
y(i,j)x(i,j)2y_{(i,j)}\mapsto x_{(i,j)}^{2}
s1s\mapsto 1

Here [x1,1,,xn,n]/IO(n){\mathbb{R}}[x_{1,1},\ldots,x_{n,n}]/I_{\operatorname{O}(n)} is the affine coordinate ring of O(n)\operatorname{O}(n), and [y(1,1),,y(n1,n1),s]{\mathbb{R}}[y_{(1,1)},\ldots,y_{(n-1,n-1)},s] is the homogeneous coordinate ring of (n1)2{\mathbb{P}}^{(n-1)^{2}}_{\mathbb{R}}. Equations for ZnZ_{n} thus correspond to homogeneous forms in kerF\ker F, and computing these will be our primary goal, since we can then obtain equations for the orthostochastic variety by dehomogenizing with respect to ss, i.e. setting s=1s=1 (although one caveat arises with the hyperplane at infinity s=0s=0 – this will be dealt with later).

We next give the dimension and degree of ZnZ_{n}. Note that ϕ\phi is a finite map, with general fibers of size 2n22^{n^{2}}, corresponding to sign choices on each of the n2n^{2} entries of a potential preimage. This implies that dimZn=dimO(n)\dim Z_{n}=\dim\operatorname{O}(n), which we now recall: a matrix is orthogonal iff for all i,ji,j, the dot product of the ithi^{\text{th}} and jthj^{\text{th}} rows equals δij\delta_{ij}, so

IO(n)=[x(i,1)x(i,n)][x(j,1)x(j,n)]Tδij| 1ijnI_{\operatorname{O}(n)}=\langle\begin{bmatrix}x_{(i,1)}\ldots x_{(i,n)}\end{bmatrix}\begin{bmatrix}x_{(j,1)}\ldots x_{(j,n)}\end{bmatrix}^{T}-\delta_{ij}\;\Big{|}\;1\leq i\leq j\leq n\rangle

In fact, these (n2)+n=(n+12){n\choose 2}+n={n+1\choose 2} quadrics form a regular sequence, so dimO(n)=n2codimIO(n)=n2(n+12)=(n2)\dim\operatorname{O}(n)=n^{2}-\operatorname{codim}I_{\operatorname{O}(n)}=n^{2}-{n+1\choose 2}={n\choose 2}.

The degree of ZnZ_{n} is also known (albeit much more recently): the degree of SO(n)\operatorname{SO}(n) was computed in [4], and in [9], a general formula for the degree of a coordinate-wise power of a variety is given, and combining these yields:

Proposition 2 (cf. [4], Theorem 1.1, and [9], Prop 2.4).

For any n2n\geq 2,

degZn=2(n1)2(n+12)degO(n)=2(n12)det[(2n2i2jn2i)]i,j=1n/2\deg Z_{n}=2^{(n-1)^{2}-{n+1\choose 2}}\deg\operatorname{O}(n)=2^{{n-1\choose 2}}\det\begin{bmatrix}{2n-2i-2j\choose n-2i}\end{bmatrix}_{i,j=1}^{\lfloor n/2\rfloor}
Remark 3.

1) Although IO(n)I_{\operatorname{O}(n)} is a complete intersection of the (n+12){n+1\choose 2} quadric generators given above, the degree of O(n)\operatorname{O}(n) is not the product of the generator degrees, namely 2(n+12)2^{{n+1\choose 2}}, since the quadrics defining O(n)\operatorname{O}(n) are not homogeneous (it is true that the homogenizations of the quadrics define a homogeneous complete intersection of degree 2(n+12)2^{{n+1\choose 2}}, but that variety contains many more components besides those arising from O(n)\operatorname{O}(n)).

2) Every orthostochastic matrix is in fact the image under ϕ\phi of a special orthogonal matrix – e.g. one may negate the first row without changing the coordinate-wise square. This shows that ZnZ_{n} is irreducible, being (the projective closure of) an image of an irreducible variety SO(n)\operatorname{SO}(n).

For reference, we tabulate the values of dimZn,degZn\dim Z_{n},\deg Z_{n} for small values of nn:

nn 2 3 4 5
dimZn\dim Z_{n} 1 3 6 10
degZn\deg Z_{n} 1 4 40 1536

3. n=3n=3 and naive equations

We now review what is known about the defining equations of ZnZ_{n}, for n3n\leq 3. For n=2n=2, every doubly stochastic matrix is orthostochastic: indeed, a 2×22\times 2 doubly stochastic matrix is of the form A=(a1a1aa)A=\begin{pmatrix}a&1-a\\ 1-a&a\end{pmatrix} for some 0a10\leq a\leq 1, and writing a=:cos2θa=:\cos^{2}\theta gives 1a=sin2θ1-a=\sin^{2}\theta, so AA is the coordinate-wise square of the rotation matrix (cosθsinθsinθcosθ)SO(2)\begin{pmatrix}\cos\theta&-\sin\theta\\ \sin\theta&\cos\theta\end{pmatrix}\in\operatorname{SO}(2). Thus no equations are needed to define Z2=1Z_{2}={\mathbb{P}}^{1}.

For n=3n=3, the variety Z3Z_{3} is a 33-dimensional variety in 4{\mathbb{P}}^{4}, hence is a hypersurface, and is defined by a single equation. We now show how to find this equation, first found in [12], following the presentation in Section 3 of [7], which will in fact give a set of “naive” equations for any nn. Consider a 3×33\times 3 doubly stochastic matrix

A=[y(1,1)y(1,2)y(1,3)y(2,1)y(2,2)y(2,3)y(3,1)y(3,2)y(3,3)]A=\begin{bmatrix}y_{(1,1)}&y_{(1,2)}&y_{(1,3)}\\ y_{(2,1)}&y_{(2,2)}&y_{(2,3)}\\ y_{(3,1)}&y_{(3,2)}&y_{(3,3)}\end{bmatrix}

In order for AA to be orthostochastic, there must exist sign choices ϵ1,ϵ2{±1}\epsilon_{1},\epsilon_{2}\in\{\pm 1\} such that with these sign choices, the entrywise square roots of the first two columns are orthogonal:

y(1,1)y(1,2)+ϵ1y(2,1)y(2,2)+ϵ2y(3,1)y(3,2)=0\sqrt{y_{(1,1)}}\sqrt{y_{(1,2)}}+\epsilon_{1}\sqrt{y_{(2,1)}}\sqrt{y_{(2,2)}}+\epsilon_{2}\sqrt{y_{(3,1)}}\sqrt{y_{(3,2)}}=0

By rearranging the above equation and squaring, we can eliminate one sign choice and all but one square root:

y(1,1)y(1,2)+2ϵ1y(1,1)y(1,2)y(2,1)y(2,2)+y(2,1)y(2,2)=y(3,1)y(3,2)y_{(1,1)}y_{(1,2)}+2\epsilon_{1}\sqrt{y_{(1,1)}y_{(1,2)}y_{(2,1)}y_{(2,2)}}+y_{(2,1)}y_{(2,2)}=y_{(3,1)}y_{(3,2)}

and another rearrangement and squaring produces a polynomial relation without ϵi\epsilon_{i}:

(y(1,1)y(1,2)+y(2,1)y(2,2)y(3,1)y(3,2))2=4y(1,1)y(1,2)y(2,1)y(2,2)(y_{(1,1)}y_{(1,2)}+y_{(2,1)}y_{(2,2)}-y_{(3,1)}y_{(3,2)})^{2}=4y_{(1,1)}y_{(1,2)}y_{(2,1)}y_{(2,2)}

and finally, using the fact that AA is doubly stochastic, we may express y(3,1)y_{(3,1)} (resp. y(3,2)y_{(3,2)}) in terms of y(1,1),y(2,1)y_{(1,1)},y_{(2,1)} (resp. y(1,2),y(2,2)y_{(1,2)},y_{(2,2)}) and homogenize with respect to ss to obtain

(y(1,1)y(1,2)+y(2,1)y(2,2)(sy(1,1)y(2,1))(sy(1,2)y(2,2)))2=4y(1,1)y(1,2)y(2,1)y(2,2)(y_{(1,1)}y_{(1,2)}+y_{(2,1)}y_{(2,2)}-(s-y_{(1,1)}-y_{(2,1)})(s-y_{(1,2)}-y_{(2,2)}))^{2}=4y_{(1,1)}y_{(1,2)}y_{(2,1)}y_{(2,2)}

This defines a degree 44 hypersurface in 4{\mathbb{P}}^{4} which contains Z3Z_{3}, and therefore equals Z3Z_{3} by dimension and degree considerations.

In general, one can perform the same procedure for any pair of columns or rows of an n×nn\times n doubly stochastic matrix, to obtain a set of equations which any n×nn\times n orthostochastic matrix must satisfy:

Definition 4.

For 1i<jn1\leq i<j\leq n, let Ci,jC_{i,j} be the polynomial obtained by eliminating ϵ1,,ϵn1,y(n,i),y(n,j)\epsilon_{1},\ldots,\epsilon_{n-1},y_{(n,i)},y_{(n,j)} from the relations

y(1,i)y(1,j)+k=2nϵk1y(k,i)y(k,j)=0,ϵ12==ϵn12=1,y(n,i)=1k=1n1y(k,i),y(n,j)=1k=1n1y(k,j)\sqrt{y_{(1,i)}y_{(1,j)}}+\sum_{k=2}^{n}\epsilon_{k-1}\sqrt{y_{(k,i)}y_{(k,j)}}=0,\quad\epsilon_{1}^{2}=\ldots=\epsilon_{n-1}^{2}=1,\quad y_{(n,i)}=1-\sum_{k=1}^{n-1}y_{(k,i)},\;y_{(n,j)}=1-\sum_{k=1}^{n-1}y_{(k,j)}

Note that Ci,jC_{i,j} is a polynomial of degree 2n12^{n-1} in [y(1,1),,y(n1,n1),s]{\mathbb{R}}[y_{(1,1)},\ldots,y_{(n-1,n-1)},s], obtained by repeatedly squaring (and rearranging) the first relation listed n1n-1 times (as in the case of n=3n=3 above), and that Ci,jC_{i,j} only involves variables in the ithi^{\text{th}} and jthj^{\text{th}} columns of a generic doubly stochastic matrix (y(i,j))(y_{(i,j)}). Similarly, by transposing indices we define Ri,jR_{i,j} which only involves variables in the ithi^{\text{th}} and jthj^{\text{th}} rows, and refer to {Ci,j,Ri,j1i<jn}\{C_{i,j},R_{i,j}\mid 1\leq i<j\leq n\} as the set of naive equations, of which there are 2(n2)=n(n1)2{n\choose 2}=n(n-1) equations in total.

It follows from the discussion above that every n×nn\times n orthostochastic matrix must satisfy the naive equations Ci,j,Ri,jC_{i,j},R_{i,j}. More precisely, a doubly stochastic matrix AA satisfies Ci,j=0C_{i,j}=0 if and only if there exist choices of signs ϵ1,,ϵn1\epsilon_{1},\ldots,\epsilon_{n-1} such that with these choices of signs, the entrywise square roots of the ithi^{\text{th}} and jthj^{\text{th}} columns of AA become orthogonal. A natural question is:

Question 5 (cf. [7], Remark 3.4).

Do the naive equations define the orthostochastic variety inside n{\mathcal{B}}_{n}? In other words, if a doubly stochastic matrix satisfies Ci,j,Ri,j=0C_{i,j},R_{i,j}=0 for all i<ji<j, must it be orthostochastic?

Stated another way: satisfying a single equation Ci,jC_{i,j} only guarantees sign choices which work for a single pair of columns. 5 asks whether existence of local sign choices for each pair of columns (and rows), implies existence of a single global sign choice which simultaneously makes all pairs of columns orthogonal.

As an example of what could go wrong, it is conceivable that (after fixing sign choices on the first two columns as required by C1,2C_{1,2}) the sign choices imposed on the third column by C1,3C_{1,3} could differ from those imposed by C2,3C_{2,3}. This turns out not to happen in the case of n=3n=3, as Z3Z_{3} is defined by the vanishing of (the homogenization of) C1,2C_{1,2}. However, as noted in [7], this good behavior is indeed special only to small values of nn, as evidenced by the following proposition (which justifies our use of the term naive):

Proposition 6.

For n6n\geq 6, there exists a doubly stochastic matrix which satisfies Ci,j,Ri,j=0C_{i,j},R_{i,j}=0 for all i<ji<j, but is not orthostochastic. Thus 5 has a negative answer for n6n\geq 6.

Proof.

Let A:=(16J6)In6A:=(\frac{1}{6}J_{6})\oplus I_{n-6} be block diagonal, where J6J_{6} is the 6×66\times 6 matrix of all 11’s, and In6I_{n-6} is the identity matrix of size n6n-6. Then AA is doubly stochastic, and it is straightforward to check that AA satisfies all Ci,j,Ri,jC_{i,j},R_{i,j}: since AA is symmetric, it suffices to check that Ri,jR_{i,j} is satisfied, i.e. there exist local sign choices which make the entrywise square roots of the ithi^{\text{th}} and jthj^{\text{th}} rows orthogonal. If j>6j>6 then the ithi^{\text{th}} and jthj^{\text{th}} rows are already orthogonal, and if i<j6i<j\leq 6 then choosing all positive signs for the ithi^{\text{th}} row and half positive, half negative for the jthj^{\text{th}} row shows that Ri,jR_{i,j} is satisfied. On the other hand, 16J6\frac{1}{6}J_{6} is not orthostochastic, since there does not exist a 6×66\times 6 Hadamard matrix, and therefore AA is not orthostochastic, since a direct sum of square matrices is orthostochastic iff each matrix is orthostochastic. ∎

Remark 7.

It was noted in [7] that if n6n\geq 6 satisfies n2(mod4)n\equiv 2\pmod{4}, then JnJ_{n} satisfies Ci,j,Ri,j=0C_{i,j},R_{i,j}=0 but is not orthostochastic, and it was also proven that counterexamples exist for all n16n\geq 16, using Hurwitz-Radon theory. However, the simple explicit argument with direct sums given in Proposition 6 seems to have gone unnoticed.

4. n=4n=4

We now arrive at the main focus of this paper, the case n=4n=4. In this section, most of the methods and arguments we use will be primarily computational in nature, so we first remark on the techniques used.

Computations were performed in Macaulay2 [10] and Bertini [2], and involve a mix of symbolic and numerical methods. All symbolic computations were done in exact arithmetic, using Gröbner bases over the rational numbers – this is possible since all the varieties in question are defined over {\mathbb{Q}}. For the purposes of this article, we regard results obtained by symbolic computation to be on par with those obtained by theoretical proof.

On the other hand, numerical computations were done in floating point, to 53 bits of precision (i.e. standard double-precision floating-point). Results obtained by numerical methods are regarded as correct “with probability 1”: the fact that these programs give reproducible results agreeing with theory in thousands of cases allows us to say with overwhelming confidence that the results obtained in this way are correct. Throughout the section we indicate when a computation was used, as well as the type (symbolic or numerical). We encourage the interested reader to confirm the results of the computations themselves, included in the appendix/auxiliary files.

Returning to the variety at hand: Z4Z_{4} is a 66-dimensional variety in 9{\mathbb{P}}^{9} of degree 4040 (we remark that by coincidence, this is the unique value for n2n\geq 2 for which degZn=degSO(n)\deg Z_{n}=\deg\operatorname{SO}(n)). As before, we have the naive equations Ci,j,Ri,jC_{i,j},R_{i,j} for 1i<j41\leq i<j\leq 4, which constitute 1212 octics whose vanishing locus (after taking projective closure) contains Z4Z_{4}.

Definition 8.

We set Yc:=V(Ci,j~:1i<j4)Y_{c}:=V(\widetilde{C_{i,j}}:1\leq i<j\leq 4) and Yr:=V(Ri,j~:1i<j4)Y_{r}:=V(\widetilde{R_{i,j}}:1\leq i<j\leq 4) as the varieties defined by the homogenizations of 66 naive octics corresponding to (all pairs of) columns and rows, respectively. We also define Y:=YcYrY:=Y_{c}\cap Y_{r}. Note that these are all subvarieties of the same ambient space 9{\mathbb{P}}^{9} as Z4Z_{4}.

We know that Z4YZ_{4}\subseteq Y, and in fact the two agree generically:

Proposition 9.

If L9L\subseteq{\mathbb{P}}^{9} is a general linear space of codimension 66, then YLY\cap L consists of 4040 points.

Proof.

Since the claim is for a general linear space LL, we may show this by direct computation. Choosing a linear slice LL at random (i.e. 66 linear forms in 1010 variables with random rational coefficients), upon computing a minimal presentation of the ideal of YLY\cap L we arrive at an ideal generated by 1212 octics in 44 variables, whose dimension and degree can easily be computed symbolically to be 0 and 40, respectively. ∎

Proposition 9 shows that YY has the same dimension and degree as Z4Z_{4}. In particular, Z4Z_{4} (being irreducible) is the unique top-dimensional component of YY, and so YZ4Y\neq Z_{4} if and only if YY contains additional components of lower dimension. Thus, if I(Y)=(Ci,j,Ri,j)I(Y)=(C_{i,j},R_{i,j}) denotes the ideal generated by the 12 naive octics, then showing that I(Y)I(Y) has only one minimal prime (or equivalently, that the radical of I(Y)I(Y) is prime) would imply Y=Z4Y=Z_{4}.

However, the basic issue is that I(Y)I(Y) is too large to handle directly: half of the octics have 967 terms, and the other half have 6760 terms. Neither symbolic nor numerical methods (via numerical irreducible decomposition) were able to determine the number of minimal primes of I(Y)I(Y). This motivates our search for other, “smaller” equations of Z4Z_{4}.

How does one find equations for a parametrized variety? This procedure is known as implicitization, and amounts to computing the kernel of a ring map. Classically, this is accomplished with Gröbner bases, but in the n=4n=4 case, this is infeasible (as of yet). Thus, we try a numerical approach, using interpolation (cf. Theorem 3 in [6]). The method is as follows: since Z4Z_{4} is a projective variety, its defining ideal I(Z4)I(Z_{4}) has homogeous generators, so we may search degree by degree. Fixing a degree dd, we may find a basis of I(Z4)dI(Z_{4})_{d} (the degree dd part of I(Z4)I(Z_{4})) by sampling a large number of points on Z4Z_{4}, evaluating all monomials of degree dd in 9{\mathbb{P}}^{9} at these points, and then computing the numerical kernel of the resulting matrix (with rows indexed by points, and columns indexed by monomials). The kernel vectors are then coefficients of degree dd forms (linearly independent over {\mathbb{R}}) which vanish at all of the sampled points, and if the number of points is large, one expects such a form to vanish on all of Z4Z_{4}. There are no linear forms in I(Z4)I(Z_{4}) (as these are already accounted for by restricting to 4{\mathcal{B}}_{4}), and similarly we find no forms of degrees 2,32,3, and 44. However, in degree 55 we find:

Proposition 10.

The space of quintics in I(Z4)I(Z_{4}) is 66-dimensional, with explicitly known basis.

Proof.

Note that sampling points on Z4Z_{4} is easily accomplished: one can generate a random 4×44\times 4 orthogonal matrix, e.g. via the Cayley parameterization of SO(n)\operatorname{SO}(n), and then apply the map φ\varphi to obtain a point on Z4Z_{4}. In total, if AA is a 4×44\times 4 matrix with entries chosen at random, then the final output of AB:=12(AAT)C:=IBI+Bφ(C)A\mapsto B:=\frac{1}{2}(A-A^{T})\mapsto C:=\frac{I-B}{I+B}\mapsto\varphi(C) is a (random) point on Z4Z_{4}.

With this, determining numerically the dimension of I(Z4)5I(Z_{4})_{5} by the method outlined above is straightforward and relatively quick, using the Macaulay2 package NumericalImplicitization, which returns an answer of 66 within 11 minute. However, this is only a numerical computation, which provides approximate quintics with floating-point coefficients.

To obtain explicit quintics over {\mathbb{Z}} from the approximate quintics, we use the LLL algorithm, again implemented in NumericalImplicitization (which in turn calls the native LLL implementation in Macaulay2). This calculation is significantly longer, taking around 3030 hours to finish, but results in a 2002×62002\times 6 integer matrix, whose columns are coefficients of degree 5 forms in 9{\mathbb{P}}^{9}. Once these quintics are known, they are easily checked to be in I(Z4)I(Z_{4}): verifying symbolically that the integer quintics lie in kerF\ker F takes <1<1 second. ∎

Definition 11.

We set JJ to be the ideal generated by the 66 quintics found in Proposition 10, and let X:=V(J)X:=V(J) be the variety they define.

Although still too large to comfortably reproduce here, the quintics in JJ are significantly more manageable than the octics Ci,j,Ri,jC_{i,j},R_{i,j}: 4 of the quintics have 284 terms, and the most complicated involves 454 terms. In fact a Gröbner basis of JJ can be computed (taking around 8 hours), which yields dimX=6\dim X=6, degX=40\deg X=40 (the same as Z4Z_{4}). Since we also know Z4XZ_{4}\subseteq X as well, we may again test whether equality holds by computing the number of minimal primes of JJ, i.e. irreducible components of XX. Although symbolic computation of minimal primes of JJ is still too slow to be practical, the key difference from the previous case with YY is that XX is small enough for a numerical irreducible decomposition to be feasible. The result, though initially negative, is immediately promising and leads to a resolution of the problem.

Proposition 12.

The numerical irreducible decomposition of XX consists of:

  1. (1)

    1616 components of dimension 44, each of degree 11,

  2. (2)

    1515 components of dimension 55, each of degree 11,

  3. (3)

    11 component of dimension 66, of degree 4040.

Proof.

Using Bertini [2], we may compute a numerical irreducible decomposition of JJ, which finishes within 3030 minutes. Note that the result is consistent with the Gröbner basis of JJ computed symbolically, which implies that Z4Z_{4} is the unique top-dimensional component of XX. ∎

The data of a numerical irreducible decomposition of a variety (given by defining equations) consists of a collection of witness sets, each representing an irreducible component of the variety, along with linear equations defining a complementary-dimensional slice of each witness set, and the finitely many intersection points of each witness set with its linear slice, thus encoding the dimension and degree of each irreducible component. Since all the lower-dimensional components of XX obtained in the numerical irreducible decomposition were themselves linear spaces (being degree 1), one should expect that equations for them can be found and moreover interpreted as certain classes of 4×44\times 4 matrices. This indeed turns out to be the case:

  1. (1)

    The 4-dimensional components correspond to the 16 ways of specifying that one entry of a 4×44\times 4 doubly stochastic matrix be equal to 11: this is a codimension 55 constraint in 4{\mathcal{B}}_{4}, as it forces the remaining entries in that row and column to be 0. For instance, one such component is defined by the ideal (y(2,1),y(2,2),y(1,3),y(3,3),y(2,3)s)(y_{(2,1)},y_{(2,2)},y_{(1,3)},y_{(3,3)},y_{(2,3)}-s), which (after dehomogenizing s=1s=1) requires that the (2,3)(2,3)-entry is 11. Up to row and column permutations, such matrices are a direct sum of a 3×33\times 3 matrix with the 1×11\times 1 identity.

  2. (2)

    The 5-dimensional components are slightly more complicated: of these, 6 are contained in the hyperplane at infinity s=0s=0, and thus are irrelevant for the orthostochastic variety. The 9 relevant components are as follows: fix a pair of indices (a,a)(a,a^{\prime}) with a,a{2,3,4}a,a^{\prime}\in\{2,3,4\}, and write {2,3,4}{a}=:{b,c},{2,3,4}{a}=:{b,c}\{2,3,4\}\setminus\{a\}=:\{b,c\},\{2,3,4\}\setminus\{a^{\prime}\}=:\{b^{\prime},c^{\prime}\}. Then the ideal (y(1,1)y(a,a),y(a,1)y(1,a),y(b,1)y(c,a),y(1,b)y(a,c))(y_{(1,1)}-y_{(a,a^{\prime})},y_{(a,1)}-y_{(1,a^{\prime})},y_{(b,1)}-y_{(c,a^{\prime})},y_{(1,b^{\prime})}-y_{(a,c^{\prime})}) defines a 5-dimensional component of XX. Note that these components can be described by partitioning a 4×44\times 4 doubly stochastic matrix into 4 disjoint 2×22\times 2 submatrices, and requiring each submatrix to be circulant (i.e. has equal cross terms).

These results were obtained by following the same procedure as in the proof of Proposition 10 (sampling points, obtaining approximate linear equations, and then extracting exact linear equations via LLL), and the resulting equations are symbolically verified to define subvarieties of XX. In the case of the 5-dimensional components, an additional, carefully chosen, change-of-basis was necessary to obtain binomial linear generators, and with it the ensuing description as matrices. The explicit linear ideals are recorded (as o22 and o24) in the appendix below.

Each of the relevant lower-dimensional components of XX can thus be defined over {\mathbb{Z}} and interpreted as a particular class of matrices. With these descriptions via integer equations at hand, it is also straightforward to confirm that each of these classes of matrices defines an actual component of XX, via checking dimensions of tangent spaces. Finally, we may use this to obtain Theorem 1:

Proof of Theorem 1.

Let K:=(C1,2,C1,3,C2,3)K:=(C_{1,2},C_{1,3},C_{2,3}) be the ideal generated by 3 of the octics, corresponding to (pairs of) the first 3 columns. We claim that JJ and KK together cut out the 4×44\times 4 orthostochastic variety set-theoretically, i.e. (Z4)s=1=(XV(K))s=1(Z_{4})_{s=1}=(X\cap V(K))_{s=1} as sets (here (¯)s=1(\underline{\hskip 5.69046pt})_{s=1} means dehomogenize with respect to ss). In view of the numerical irreducible decomposition of XX computed in Proposition 12, it suffices to show that for each of the relevant lower-dimensional components CC of XX, we have (CZ4)s=1=(CV(K))s=1(C\cap Z_{4})_{s=1}=(C\cap V(K))_{s=1}.

For QQ a 44-dimensional component of XX, we have that a point p(QZ4)s=1p\in(Q\cap Z_{4})_{s=1} is (up to row and column permutation) a direct sum of a 1×11\times 1 and a 3×33\times 3 orthostochastic matrix. The case n=3n=3 in Section 3 implies that QZ4Q\cap Z_{4} is defined by a single form of degree 44, and we check symbolically that QV(K)Q\cap V(K) is in fact defined by a single octic, which is precisely the square of the quartic defining QZ4Q\cap Z_{4}.

For PP a relevant 55-dimensional component of XX, we again check that PV(K)P\cap V(K) is defined by a single octic. In this case we do not have equations for PZ4P\cap Z_{4} a priori – however, the structure of the matrices in PP is special enough to make symbolic implicitization feasible. Indeed, we may compute symbolically the kernel of the ring map

F(mod(I(P)+(s1))):[y(1,1),,y(3,3)]/I(P)[x(1,1),,x(4,4)]/(IO(4)+F(I(P)))F\!\!\!\!\pmod{(I(P)+(s-1))}:{\mathbb{R}}[y_{(1,1)},\ldots,y_{(3,3)}]/I(P)\to{\mathbb{R}}[x_{(1,1)},\ldots,x_{(4,4)}]/(I_{\operatorname{O}(4)}+F(I(P)))

which takes roughly 8 minutes – this is made possible by the fact that after minimizing the presentation, the source is really a polynomial ring in 5 variables. This gives equations for (PZ4)s=1(P\cap Z_{4})_{s=1}, which turns out to be precisely the dehomogenization of the octic defining PV(K)P\cap V(K).

Putting this all together, we have that if l1,,l16l_{1},\ldots,l_{16} are the 44-dimensional components of XX, and L1,,L15L_{1},\ldots,L_{15} are the 55-dimensional components of XX, where L10,,L15L_{10},\ldots,L_{15} are contained in the hyperplane s=0s=0, then

(XV(K))s=1\displaystyle(X\cap V(K))_{s=1} =((Z4i=116lii=115Li)V(K))s=1\displaystyle=\left(\Big{(}Z_{4}\cup\bigcup_{i=1}^{16}l_{i}\cup\bigcup_{i=1}^{15}L_{i}\Big{)}\cap V(K)\right)_{s=1}
=(Z4V(K))s=1i=116(liV(K))s=1j=19(LiV(K))s=1\displaystyle=(Z_{4}\cap V(K))_{s=1}\cup\bigcup_{i=1}^{16}(l_{i}\cap V(K))_{s=1}\cup\bigcup_{j=1}^{9}(L_{i}\cap V(K))_{s=1}
=(Z4)s=1i=116(liZ4)s=1j=19(LiZ4)s=1=(Z4)s=1\displaystyle=(Z_{4})_{s=1}\cup\bigcup_{i=1}^{16}(l_{i}\cap Z_{4})_{s=1}\cup\bigcup_{j=1}^{9}(L_{i}\cap Z_{4})_{s=1}=(Z_{4})_{s=1}\qed

5. Conclusion

A few remarks on the methodology used in Section 4 are in order. First, the only part of the proof of Theorem which relies on numerical computation (i.e. for which there is no symbolic verification) is the correctness of the numerical irreducible decomposition Proposition 12. Specifically, what is required is that there are no other irreducible components of XX other than those listed in Proposition 12. For readers concerned about the random choices involved in path tracking and homotopy continuation, we mention that the numerical irreducible decomposition of XX has been calculated multiple times, each time giving the same consistent result. Moreover, the numerical irreducible decomposition of the affine variety Xs=1X_{s=1} (the dehomogenization of XX) has also been computed, and is consistent with Proposition 12 (e.g. among the 55-dimensional components, only the 99 relevant components not at infinity are present).

One may also ask about the utility of set-theoretic equations for Z4Z_{4}. After all, given a particular 4×44\times 4 matrix, the brute-force check to determine if it is orthostochastic is still feasible (namely checking all possible sign patterns in a fiber of ϕ\phi. Since (/2)n(/2)n({\mathbb{Z}}/2{\mathbb{Z}})^{n}\oplus({\mathbb{Z}}/2{\mathbb{Z}})^{n} acts on fibers of ϕ\phi by row and column sign changes, one may assume the first row and column are all nonnegative, so there are only 2(41)2=5122^{(4-1)^{2}}=512 sign patterns to check). In response to this, we note that in addition to their intrinsic interest, knowledge of set-theoretic equations is extremely useful for many other purposes. For instance, given a variety which meets the set of 4×44\times 4 orthostochastic matrices, one can now describe their common intersection locus (which was previously not possible to do). We plan to use this in a forthcoming upgrade to [5], to compute monic symmetric determinantal representations of hyperbolic plane quartic curves.

As for the equations themselves, it is natural to ask about minimality. For degree reasons, any set of minimal generators of I(Z4)I(Z_{4}) must include the 66 quintics JJ in Definition 11. It has been checked that the spaces of sextics and 7-forms in I(Z4)I(Z_{4}) are 6060- and 330330-dimensional respectively, and moreover that the quintics in JJ have no quadratic (or linear) syzygies. It follows that there are no forms of degree 66 or 77 in I(Z4)I(Z_{4}) other than those already in JJ. For degree 88, it is not difficult to see that 22 of the naive octics (along with JJ) would not be sufficient to cut out Z4Z_{4}, so the generating set in Theorem 1 is both inclusion-wise and degree-wise minimal.

Finally, we list some questions that remain unanswered with this work. First, it would be interesting to have a combinatorial description of the 66 quintics JJ, as well as a theoretical proof for why they vanish on 4×44\times 4 orthostochastic matrices. Next, for n=4,5n=4,5 it remains open whether the naive equations cut out the orthostochastic variety inside the Birkhoff polytope set-theoretically. Lastly, we know that for n6n\geq 6 additional equations are needed beyond the naive ones. However, finding where these come from is related to the first question listed here, and even a computational proof of sufficiency may only be possible with some advances in computing technology.

6. Acknowledgements

The first author would like to thank Anton Leykin for helpful discussions, especially related to verifying components via tangent spaces. The second author would like to thank Paul Breiding, Mateusz Michalek and Bernd Sturmfels for their encouragement towards work on this project.

References

  • [1] Au-Yeung, Yik-Hoi, and Yiu-Tung Poon. 3×33\times 3 orthostochastic matrices and the convexity of generalized numerical ranges. Linear Algebra Appl. 27 (1979): 69–79.
  • [2] Bates, Daniel J., Jonathan D. Hauenstein, Andrew J. Sommese, and Charles W. Wampler. Bertini: Software for Numerical Algebraic Geometry. Available at bertini.nd.edu with permanent doi: dx.doi.org/10.7274/R0H41PB5.
  • [3] Bengtsson, Ingemar, Åsa Ericsson, Marek Kuś, Wojciech Tadej, and Karol Życzkowski. Birkhoff’s polytope and unistochastic matrices, N=3N=3 and N=4N=4. Comm. Math. Phys. 259, no. 2 (2005): 307–324.
  • [4] Brandt, Madeline, Juliette Bruce, Taylor Brysiewicz, Robert Krone, and Elina Robeva. The Degree of SO(n,)\mathop{\mathrm{SO}}\nolimits(n,\mathbb{C}). Combinatorial Algebraic Geometry, pp. 229–246. Springer, New York, NY, 2017.
  • [5] Chen, Justin and Papri Dey. Computing symmetric determinantal representations. arXiv:1905.07035.
  • [6] Chen, Justin and Joe Kileel. Numerical implicitization. J. Softw. Algebra Geom. 9, no. 1 (2019): 55–63.
  • [7] Chterental, Oleg, and Dragomir Ž. Doković. On orthostochastic, unistochastic and qustochastic matrices. Linear Algebra Appl. 428, no. 4 (2008): 1178–1201.
  • [8] Dey, Papri. Definite Determinantal Representations via Orthostochastic Matrices. arXiv:1708.09559.
  • [9] Dey, Papri, Paul Görlach, and Nidhi Kaihnsa. Coordinate-wise powers of algebraic varieties. Beitr. Algebra Geom. (2020). https://doi.org/10.1007/s13366-019-00481-8.
  • [10] Grayson, Daniel R., and Michael E. Stillman. Macaulay2, a software system for research in algebraic geometry. Available at https://faculty.math.illinois.edu/Macaulay2/.
  • [11] Horn, Alfred. Doubly stochastic matrices and the diagonal of a rotation matrix. Amer. J. Math. 76, no. 3 (1954): 620–630.
  • [12] Nakazato, Hiroshi. Set of 3×33\times 3 orthostochastic matrices. Nihonkai Math. J. 7, no. 2 (1996): 83–100.
  • [13] Nylen, Peter, Tin-Yau Tam, and Frank Uhlig. On the eigenvalues of principal submatrices of normal, Hermitian and symmetric matrices. Linear Multilinear Algebra 36, no. 1 (1993): 69–78.
  • [14] Thompson, Robert. C. Principal submatrices IV. On the independence of the eigenvalues of different principal submatrices. Linear Algebra Appl. 2, no. 3 (1969): 355–374.

Appendix A M2 session

The following is an example Macaulay2 session that demonstrates the main results in this paper. For technical reasons, it is more convenient to use y0,,y9y_{0},\ldots,y_{9} as the variables of 9{\mathbb{P}}^{9} rather than y(1,1),,y(3,3),sy_{(1,1)},\ldots,y_{(3,3)},s.

Macaulay2, version 1.15
i1 : needsPackage "DeterminantalRepresentations"
i2 : needsPackage "NumericalImplicitization"
i3 : S = RR[x_(1,1)..x_(4,4)]; R = QQ[y_0..y_9]; s = y_9;
i6 : A = genericMatrix(S,4,4); IO4 = minors(1, A*transpose A - id_(S^4));
i8 : F = matrix{flatten entries submatrix(hadamard(A,A),{0,1,2},{0,1,2})} | matrix{{1_S}}
-- Sample random points on Z_4
i9 : pts = apply(binomial(9+5,5), i -> sub(F,matrix{flatten entries randomOrthogonal(4,RR)}));
-- Compute dimension of deg 5 part of I(Z_4): takes ~30 seconds
i10 : HF5 = numericalHilbertFunction(F, IO4, pts, 5, UseSLP => true, Precondition => false)
-- Get integer quintics via LLL: takes ~30 hours
i11 : time E = extractImageEquations(HF5, AttemptZZ => true);
-- Alternatively, after downloading the auxiliary files, one can load the equations with:
-- i11 : E = first lines get "ortho4-quintics.txt";
i12 : J = ideal value toString E;
-- Create the s-homogenization of a generic 4x4 doubly stochastic matrix
i13 : V = genericMatrix(R,3,3); V = transpose(V || matrix{toList(3:s) - sum entries V})
i15 : V = V || matrix{toList(4:s) - sum entries V}
-- 3 naive octics C_{1,2}, C_{1,3}, C_{2,3}
i16 : K = ideal apply(subsets(3, 2), p -> (
        (i,j) = (p#0, p#1);
        ((V_(i,0)*V_(j,0) + V_(i,1)*V_(j,1) - V_(i,2)*V_(j,2) -
        V_(i,3)*V_(j,3))^2 - 4*V_(i,0)*V_(j,0)*V_(i,1)*V_(j,1) -
        4*V_(i,2)*V_(j,2)*V_(i,3)*V_(j,3))^2 -
        64*V_(i,0)*V_(j,0)*V_(i,1)*V_(j,1)*V_(i,2)*V_(j,2)*V_(i,3)*V_(j,3)
      )); -- cf. equation (3.5) in [7], esp. the coefficient of 64
i17 : time numericalIrreducibleDecomposition(J, Software => BERTINI) -- takes ~25 minutes
-- Note: the dimensions shown here are of the affine cones,
-- which are 1 more than that of the corresponding projective variety
i18 : (X, eps) = (oo, 1e-10);
-- These define some helper functions to obtain descriptions of linear components
i19 : realPartMatrix := A -> matrix apply(entries A, r -> r/realPart)
i20 : imPartMatrix := A -> matrix apply(entries A, r -> r/imaginaryPart)
i21 : getLinEqs = (A, mons, c, n) -> (
        B = random(RR)*realPartMatrix A + random(RR)*imPartMatrix A;
        C = matrix apply(entries B, r -> r/(e -> lift(round(10^(1+n)*round(n, e)), ZZ)));
        mons*submatrix(LLL(id_(ZZ^(numcols C))||C), toList(0..<numcols mons), toList(0..<c))
      )
-- The following are the 4-dim (= codim 5) components of X, recorded in o22
i22 : L5 = apply(X#5, W -> ideal getLinEqs(clean(eps, matrix W#Points#0), basis(1,R), 5, 10))
o22 = {ideal(y_0, y_3, y_6, -y_2-y_5-y_8+y_9, -y_1-y_4-y_7+y_9),
      ideal(y_0, y_2, y_4, y_7, -y_1+y_9),
      ideal(y_2, y_5, y_6, y_7, -y_8+y_9),
      ideal(y_2, y_3, y_4, y_8, -y_5+y_9),
      ideal(y_1, y_2, y_3, y_6, -y_0+y_9),
      ideal(y_0, y_1, y_2, -y_6-y_7-y_8+y_9, -y_3-y_4-y_5+y_9),
      ideal(y_0+y_3+y_6-y_9,y_0+y_1+y_2-y_9,y_6+y_7+y_8-y_9,y_1+y_4+y_7-y_9,y_2+y_5+y_8-y_9),
      ideal(y_0, y_3, y_7, y_8, -y_6+y_9),
      ideal(y_6, y_7, y_8, -y_3-y_4-y_5+y_9, -y_0-y_1-y_2+y_9),
      ideal(y_1, y_3, y_5, y_7, -y_4+y_9),
      ideal(y_0, y_4, y_5, y_6, -y_3+y_9),
      ideal(y_3, y_4, y_5, -y_6-y_7-y_8+y_9, -y_0-y_1-y_2+y_9),
      ideal(y_0, y_1, y_5, y_8, -y_2+y_9),
      ideal(y_1, y_4, y_7, -y_0-y_3-y_6+y_9, y_2+y_5+y_8-y_9),
      ideal(y_2, y_5, y_8, -y_0-y_3-y_6+y_9, y_1+y_4+y_7-y_9),
      ideal(y_1, y_4, y_6, y_8, -y_7+y_9)}
-- Check: description as direct sums of 3x3 ++ 1x1 matrices, Q \cap Z_4 = Q \cap V(K)
i23 : all((0,0)..(3,3), p -> (
      (i,j) = p;
      L = ideal(V_(i,j) - s, V_((i+1)%4,j), V_((i+2)%4,j), V_(i,(j+1)%4), V_(i,(j+2)%4));
      any(L5, l -> l == L) and isSubset(J, L) and (
        Q = minimalPresentation sub(K, R/L);
        T = ring Q;
        f = (T_4^2 - T_4*(T_0+T_1+T_2+T_3) + (T_0*T_3+T_1*T_2))^2 - 4*T_0*T_1*T_2*T_3;
        ideal(f^2) == Q
      )))
-- The following are the 5-dim (= codim 4) components of X, recorded in o24
i24 : L4 = apply(X#6, W -> ideal getLinEqs(clean(eps, matrix W#Points#0), basis(1,R), 4, 10))
o24 = {ideal(-y_1+y_6, -y_0+y_7, -y_1-y_3-y_4-y_7+y_9, y_2-y_3-y_4+y_8),
      ideal(-y_2+y_6, -y_0+y_8, y_1-y_3-y_5+y_7, -y_1-y_2-y_7-y_8+y_9),
      ideal(-y_1+y_3, -y_0+y_4, -y_2-y_5+y_6+y_7, -y_2-y_3-y_4-y_5+y_9),
      ideal(y_9, y_0+y_1, y_3+y_4, y_6+y_7),
      ideal(-y_4+y_6, -y_3+y_7, -y_0-y_1+y_5+y_8, -y_3-y_5-y_6-y_8+y_9),
      ideal(y_9, y_0+y_6, y_1+y_7, y_2+y_8),
      ideal(-y_5+y_6, -y_3+y_8, -y_0-y_2-y_5-y_8+y_9, -y_0-y_2+y_4+y_7),
      ideal(-y_2+y_4, -y_1+y_5, -y_0-y_3+y_7+y_8, -y_1-y_2-y_7-y_8+y_9),
      ideal(y_9, y_3+y_6, y_4+y_7, y_5+y_8),
      ideal(-y_5+y_7, -y_4+y_8, -y_1-y_2+y_3+y_6, -y_1-y_2-y_5-y_8+y_9),
      ideal(-y_2+y_3, -y_0+y_5, -y_1-y_4+y_6+y_8, -y_2-y_5-y_6-y_8+y_9),
      ideal(y_9, y_1+y_2, y_4+y_5, y_7+y_8),
      ideal(-y_2+y_7, -y_1+y_8, y_0-y_4-y_5+y_6, -y_1-y_2-y_4-y_5+y_9),
      ideal(y_9, y_0+y_2, y_3+y_5, y_6+y_8),
      ideal(y_9, y_0+y_3, y_1+y_4, y_2+y_5)}
-- Remove 6 components at infinity
i25 : L4 = select(L4, l -> s % l != 0); #L4
o26 = 9
-- Check: description as block 2x2 circulant matrices, and containment in X
i27 : all((1,1)..(3,3), p -> (
        (a,a’) = p;
        (b,c) = toSequence({1,2,3} - set{a});
        (b’,c’) = toSequence({1,2,3} - set{a’});
        L = ideal(V_(0,0)-V_(a,a’),V_(a,0)-V_(0,a’),V_(b,0)-V_(c,a’),V_(0,b’)-V_(a,c’));
        any(L4, l -> l == L) and isSubset(J, L)
      ))
-- Work over QQ for exact implicitization
i28 : S = QQ[x_(1,1)..x_(4,4)]; A = genericMatrix(S,4,4);
i30 : IO4 = minors(1, A*transpose A - id_(S^4));
i31 : H = matrix{flatten entries submatrix(hadamard(A,A),{0,1,2},{0,1,2})};
i32 : F = H | matrix{{1_S}}
-- Check: (P \cap Z_4)_{y_9=1} = (P \cap V(K))_{y_9=1}
i33 : all(L4, I -> (
        P = minimalPresentation sub(K + ideal(s - 1), R/I);
        time ker map(S/(IO4 + sub(I, F)), ring P, H_((gens ring P)/baseName/last)) == P
      )) -- takes ~9*8 minutes
-- Check: J is contained in ker F
i34 : sub(gens J, F) % IO4 == 0
-- Check: J has no quadratic syzygies
i35 : syz(gens J, DegreeLimit => 7) == 0
-- To compute the dimension of the space of 7-forms in I(Z_4):
-- repeat lines i3 - i10, replacing everywhere 5 with 7 (takes ~7 hours, ~34 GB RAM)