
Symmetric Tensor Decompositions On Varieties

Jiawang Nie (Department of Mathematics, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA, 92093; [email protected]), Ke Ye and Lihong Zhi (KLMM, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; [email protected], [email protected])
Abstract.

This paper discusses the problem of symmetric tensor decomposition on a given variety $X$: decomposing a symmetric tensor into a sum of tensor powers of vectors contained in $X$. We first study geometric and algebraic properties of such decomposable tensors, which are crucial for the practical computation of such decompositions. For a given tensor, we also develop a criterion for the existence of a symmetric decomposition on $X$. Secondly, and most importantly, we propose a method for computing symmetric tensor decompositions on an arbitrary variety $X$. As a specific application, Vandermonde decompositions for nonsymmetric tensors can be computed by the proposed algorithm.

Key words and phrases:
symmetric tensor, numerical algorithm, decomposition, generating polynomial, generating matrix
2010 Mathematics Subject Classification:
15A69, 65F99

1. Introduction

Let $n,d>0$ be integers and $\mathbb{C}$ be the complex field. Denote by $\operatorname{T}^d(\mathbb{C}^{n+1})$ the space of $(n+1)$-dimensional complex tensors of order $d$. For $\mathcal{A}\in\operatorname{T}^d(\mathbb{C}^{n+1})$ and an integral tuple $(i_1,\ldots,i_d)$, $\mathcal{A}_{i_1\ldots i_d}$ denotes the $(i_1,\ldots,i_d)$th entry of $\mathcal{A}$, where $0\leq i_1,\ldots,i_d\leq n$. The tensor $\mathcal{A}$ is symmetric if

$\mathcal{A}_{i_1\ldots i_d}=\mathcal{A}_{j_1\ldots j_d}$

whenever $(j_1,\ldots,j_d)$ is a permutation of $(i_1,\ldots,i_d)$. Let $\operatorname{S}^d(\mathbb{C}^{n+1})$ be the subspace of all symmetric tensors in $\operatorname{T}^d(\mathbb{C}^{n+1})$. For a vector $u$, denote by $u^{\otimes d}$ the $d$th tensor power of $u$, i.e., $u^{\otimes d}$ is the tensor such that $(u^{\otimes d})_{i_1\ldots i_d}=u_{i_1}\cdots u_{i_d}$. As shown in [9], for each $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$, there exist vectors $u_1,\ldots,u_r\in\mathbb{C}^{n+1}$ such that

(1.1) $\mathcal{A}=(u_1)^{\otimes d}+\cdots+(u_r)^{\otimes d}.$

The above is called a symmetric tensor decomposition (STD) of $\mathcal{A}$. The smallest such $r$ is called the symmetric rank of $\mathcal{A}$, denoted $\operatorname{srank}(\mathcal{A})$. If $\operatorname{srank}(\mathcal{A})=r$, then $\mathcal{A}$ is called a rank-$r$ tensor and (1.1) is called a symmetric rank decomposition, which is also called a Waring decomposition in some references. The rank of a generic symmetric tensor is given by a formula of Alexander-Hirschowitz [2]. We refer to [9] for symmetric tensors and their symmetric ranks, and to [27, 30] for general tensors and their ranks.

This paper concerns symmetric tensor decompositions on a given set. Let $X\subseteq\mathbb{C}^{n+1}$ be a homogeneous set, i.e., $tx\in X$ for all $t\in\mathbb{C}$ and $x\in X$. If each $u_j\in X$, then (1.1) is called a symmetric tensor decomposition on $X$ (STDX) of $\mathcal{A}$. The STDX problem has been studied in applications for various choices of $X$. Symmetric tensor decompositions have broad applications in quantum physics [50], algebraic complexity theory [7, 26, 46, 51], and numerical analysis [34, 52]. More tensor applications can be found in [24].

When $X=\mathbb{C}^{n+1}$, the STDX is just the classical symmetric tensor decomposition, which has been studied extensively in the literature. Binary tensor (i.e., $n=1$) decomposition problems were discussed in [8]. For higher dimensional tensors, catalecticant type methods [23] are often used when the ranks are low. For general symmetric tensors, Brachat et al. [5] proposed a method using Hankel (and truncated Hankel) operators. It is equivalent to computing a new tensor whose order is higher but whose rank is the same as that of the original one. Oeding and Ottaviani [37] proposed to compute symmetric decompositions by Koszul flattenings, tensor eigenvectors and vector bundles. Other related work on computing symmetric tensor decompositions can be found in [3, 4]. For generic tensors of certain ranks, the symmetric tensor decomposition is unique. As shown in [16], a generic $\mathcal{A}\in\operatorname{S}^m(\mathbb{C}^{n+1})$ has a unique Waring decomposition if and only if, with $r$ denoting the generic rank,

$(n,m,r)\in\big\{(1,2k-1,k),\,(3,3,5),\,(2,5,7)\big\}.$

When $\mathcal{A}\in\operatorname{S}^m(\mathbb{C}^{n+1})$ is a generic tensor of a subgeneric rank $r$ (i.e., $r$ is smaller than the value given by the Alexander-Hirschowitz formula; see [2, 9]) and $m\geq 3$, the Waring decomposition is unique, with only three exceptions [6].

When $X=\{(a^n,a^{n-1}b,\ldots,ab^{n-1},b^n):\,a,b\in\mathbb{C}\}\subseteq\mathbb{C}^{n+1}$, i.e., $X$ is the affine cone of a rational normal curve in the projective space $\mathbb{P}^n$, the STDX becomes a Vandermonde decomposition for symmetric tensors. It exists only for Hankel tensors, which were introduced in [38] for studying the harmonic retrieval problem. Hankel tensors were further discussed in [39]. Relations among various ranks of Hankel tensors were studied in [35]. More applications of Hankel tensors can be found in [44, 49].

When $X=\{a_1\otimes\cdots\otimes a_k:a_1,\ldots,a_k\in\mathbb{C}^m\}\subseteq\mathbb{C}^{n+1}$ with $n+1=m^k$, i.e., $X$ is a Segre variety, the STDX becomes a Vandermonde decomposition for nonsymmetric tensors. This has broad applications in signal processing [12, 29, 36, 47]. Vandermonde decompositions for nonsymmetric tensors are closely related to secant varieties of Segre-Veronese varieties, which have been studied extensively [1, 25, 40]. In Subsection 3.3, we discuss this question in more detail. Interestingly, it can be shown that every nonsymmetric tensor has a Vandermonde decomposition, which is different from the symmetric case.

Contributions

This paper focuses on computing symmetric tensor decompositions on a given set $X$. We assume that $X\subseteq\mathbb{C}^{n+1}$ is a variety given by homogeneous polynomial equations. Generally, symmetric tensor decompositions on $X$ can be studied theoretically through secant varieties of the Veronese embedding of $X$ [28, 42, 43]. From this viewpoint, one may expect to obtain polynomials which characterize the symmetric $X$-rank of a given symmetric tensor. In this paper, we give a method for computing symmetric $X$-rank decompositions. It is based on the tool of generating polynomials, which were recently introduced in [33]. The work [33] only discussed the case $X=\mathbb{C}^{n+1}$. When $X$ is a proper subvariety of $\mathbb{C}^{n+1}$, the method in [33] does not work directly, because $u_i\in X$ is required in (1.1). We need to modify the approach in [33] by posing additional conditions on the generating polynomials. For this purpose, we give a unified framework for computing symmetric tensor decompositions on $X$, which sheds light on both theoretical and computational aspects of tensor decompositions.

The paper is organized as follows. Section 2 gives some basics for tensor decompositions. Section 3 studies some properties of symmetric tensor decompositions on $X$. Section 4 defines generating polynomials and generating matrices, and gives conditions ensuring that the computed vectors belong to the given set in symmetric tensor decompositions. Section 5 presents a unified framework to do the computation. Last, Section 6 gives various examples to show how the proposed method works.

2. Preliminaries

Notation. The symbol $\mathbb{N}$ (resp., $\mathbb{R}$, $\mathbb{C}$) denotes the set of nonnegative integers (resp., real numbers, complex numbers). For $\alpha:=(\alpha_1,\ldots,\alpha_n)\in\mathbb{N}^n$, define $|\alpha|:=\alpha_1+\cdots+\alpha_n$. For a degree $d>0$, denote the index set

$\mathbb{N}^n_d=\{\alpha:=(\alpha_1,\ldots,\alpha_n)\in\mathbb{N}^n \mid |\alpha|\leq d\}.$

For a real number $t\in\mathbb{R}$, we denote by $\lceil t\rceil$ the smallest integer $n$ such that $n\geq t$. Let $x:=(x_0,\ldots,x_n)$ and let $\mathbb{C}[x]:=\mathbb{C}[x_0,\ldots,x_n]$ denote the ring of polynomials in $x$ with complex coefficients. For a degree $m$, $\mathbb{C}[x]_m$ denotes the subset of polynomials whose degrees are less than or equal to $m$, and $\mathbb{C}[x]_m^h$ denotes the subset of forms whose degrees are equal to $m$. The cardinality of a finite set $T$ is denoted as $|T|$. For a finite set $\mathbb{B}\subseteq\mathbb{C}[x]$ and a vector $v\in\mathbb{C}^n$, denote

(2.1) $[v]_{\mathbb{B}}:=\big(p(v)\big)_{p\in\mathbb{B}},$

the vector of the polynomials in $\mathbb{B}$ evaluated at $v$. For a complex matrix $A$, $A^T$ denotes its transpose and $A^*$ denotes its conjugate transpose. For a complex vector $u$, $\|u\|_2=\sqrt{u^*u}$ denotes the standard Euclidean norm. The symbol $e_i$ denotes the $i$th standard unit vector in $\mathbb{N}^n$. For two square matrices $X,Y$ of the same dimension, their commutator is $[X,Y]:=XY-YX$.

2.1. Equivalent descriptions for symmetric tensors

There is a one-to-one correspondence between symmetric tensors in $\operatorname{S}^d(\mathbb{C}^{n+1})$ and homogeneous polynomials of degree $d$ in $n+1$ variables. When $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$ is symmetric, we can equivalently use $\alpha=(\alpha_1,\ldots,\alpha_n)\in\mathbb{N}^n_d$ to label $\mathcal{A}$ in the way that

(2.2) $\mathcal{A}_\alpha:=\mathcal{A}_{j_1\ldots j_d},\quad\text{if}\quad x_0^{d-|\alpha|}x_1^{\alpha_1}\cdots x_n^{\alpha_n}=x_{j_1}\cdots x_{j_d}.$

The symmetry guarantees that the labeling $\mathcal{A}_\alpha$ is well defined. For $\mathcal{A}$, define the homogeneous polynomial (i.e., a form) in $x:=(x_0,\ldots,x_n)$ and of degree $d$:

(2.3) $\mathcal{A}(x):=\sum_{j_1,\ldots,j_d=0}^{n}\mathcal{A}_{j_1\ldots j_d}\,x_{j_1}\cdots x_{j_d}.$

If $\mathcal{A}$ is labeled as in (2.2), then

$\mathcal{A}(x)=\sum_{\alpha\in\mathbb{N}^n_d}\binom{d}{d-|\alpha|,\alpha_1,\ldots,\alpha_n}\mathcal{A}_\alpha\, x_0^{d-|\alpha|}x_1^{\alpha_1}\cdots x_n^{\alpha_n},$

where $\binom{d}{\alpha_0,\ldots,\alpha_n}:=\frac{d!}{\alpha_0!\cdots\alpha_n!}$. The decomposition (1.1) is equivalent to

$\mathcal{A}(x)=\sum_{j=1}^{r}\big((u_j)_0x_0+\cdots+(u_j)_nx_n\big)^d.$

For a polynomial $p=\sum_{\alpha\in\mathbb{N}^n_d}p_\alpha y_1^{\alpha_1}\cdots y_n^{\alpha_n}$ and $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$, define the operation

(2.4) $\langle p,\mathcal{A}\rangle=\sum_{\alpha\in\mathbb{N}^n_d}p_\alpha\mathcal{A}_\alpha,$

where $\mathcal{A}$ is labeled as in (2.2). For fixed $p$, $\langle p,\cdot\rangle$ is a linear functional on $\operatorname{S}^d(\mathbb{C}^{n+1})$, while for fixed $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$, $\langle\cdot,\mathcal{A}\rangle$ is a linear functional on $\mathbb{C}[y_1,\ldots,y_n]_d$.
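As a small illustration of the labeling (2.2) and the pairing (2.4), take $n=1$ and $d=2$. Then $\mathcal{A}\in\operatorname{S}^2(\mathbb{C}^2)$ is labeled by $\mathcal{A}_{(0)}=\mathcal{A}_{00}$, $\mathcal{A}_{(1)}=\mathcal{A}_{01}$ and $\mathcal{A}_{(2)}=\mathcal{A}_{11}$, and for $p=p_0+p_1y_1+p_2y_1^2$ one has

$\langle p,\mathcal{A}\rangle=p_0\mathcal{A}_{00}+p_1\mathcal{A}_{01}+p_2\mathcal{A}_{11}.$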

2.2. Algebraic varieties

A set $I\subseteq\mathbb{C}[x]:=\mathbb{C}[x_0,\ldots,x_n]$ is an ideal if $I\cdot\mathbb{C}[x]\subseteq I$ and $I+I\subseteq I$. For polynomials $f_1,\ldots,f_s\in\mathbb{C}[x]$, let $\langle f_1,\ldots,f_s\rangle$ denote the smallest ideal that contains $f_1,\ldots,f_s$. A subset $X\subseteq\mathbb{C}^{n+1}$ is called an affine variety if $X$ is the set of common zeros of some polynomials in $\mathbb{C}[x]$. The vanishing ideal of $X$ is the ideal consisting of all polynomials that vanish identically on $X$.

Two nonzero vectors in $\mathbb{C}^{n+1}$ are equivalent if they are parallel to each other. Denote by $[u]$ the set of all nonzero vectors that are equivalent to $u$; the set $[u]$ is called the equivalence class of $u$. The set of all equivalence classes $[u]$ with $0\neq u\in\mathbb{C}^{n+1}$ is the projective space $\mathbb{P}^n$, or equivalently, $\mathbb{P}^n=\{[u]:\,0\neq u\in\mathbb{C}^{n+1}\}$. A subset $Z\subseteq\mathbb{P}^n$ is said to be a projective variety if there are homogeneous polynomials $h_1,\ldots,h_t\in\mathbb{C}[x]$ such that

$Z=\{[u]\in\mathbb{P}^n:\,h_1(u)=\cdots=h_t(u)=0\}.$

The vanishing ideal $\mathcal{I}(Z)$ is defined to be the ideal consisting of all polynomials that vanish identically on $Z$. For each nonnegative integer $m$, we denote by $\mathcal{I}_m(Z)$ the linear subspace of polynomials of degree $m$ in $\mathcal{I}(Z)$. A projective variety $Z\subseteq\mathbb{P}^n$ is said to be nondegenerate if $Z$ is not contained in any proper linear subspace of $\mathbb{P}^n$, i.e., $\mathcal{I}_1(Z)=\{0\}$.

In the Zariski topology on $\mathbb{C}^{n+1}$ and $\mathbb{P}^n$, the closed sets are varieties and the open sets are complements of varieties. For a projective variety $Z\subseteq\mathbb{P}^n$, its Hilbert function $h_Z:\mathbb{N}\to\mathbb{N}$ is defined as $h_Z(d):=\dim\mathbb{C}[Z]_d$, the dimension of the degree-$d$ part of the homogeneous coordinate ring of $Z$. As in [11, 20, 21], when $d$ is sufficiently large, $h_Z(d)$ is given by a polynomial and

$h_Z(d)=\frac{e}{m!}d^m+O(d^{m-1}),\quad e=\deg(Z),\quad m=\dim(Z),$

where $O(d^{m-1})$ denotes terms of order at most $m-1$.

For an affine variety $X\subseteq\mathbb{C}^{n+1}$, we denote by $\mathbb{P}X$ the projective set of equivalence classes of nonzero vectors in $X$, i.e., $\mathbb{P}X=\{[u]:\,0\neq u\in X\}$. If $\mathbb{P}X=Z$, then $X$ is called the affine cone of $Z$. Clearly, the vanishing ideal of $X$ is the same as that of $\mathbb{P}X$, i.e., $\mathcal{I}(X)=\mathcal{I}(\mathbb{P}X)$. For a degree $m>0$, the set

$\mathcal{I}_m(X):=\mathbb{C}[x]^h_m\cap\mathcal{I}(X)$

is the space of all forms of degree $m$ vanishing on $X$.

Veronese maps

For an affine variety $X\subseteq\mathbb{C}^{n+1}$, let $v_d(X)\subseteq\operatorname{S}^d(\mathbb{C}^{n+1})$ be the image of $X$ under the Veronese map:

$v_d:\mathbb{C}^{n+1}\to\operatorname{S}^d(\mathbb{C}^{n+1}),\quad u\mapsto u^{\otimes d}.$

Note that $v_d^{-1}(u^{\otimes d})=\{\omega^i u:i=0,\ldots,d-1\}$, where $\omega$ is a primitive $d$th root of $1$. Therefore, the dimension of $v_d(X)$ is the same as that of $X$. In particular, for

$C:=\{(x_0,x_1,\ldots,x_n)\in\mathbb{C}^{n+1}:x_ix_j=x_kx_l,\;\forall\,i+j=k+l\},$

the set $v_d(C)$ is a variety of tensors $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^2)$ that is defined by the equations

$\mathcal{A}_\alpha\mathcal{A}_\beta-\mathcal{A}_\gamma\mathcal{A}_\tau=0\quad(\alpha+\beta=\gamma+\tau,\;\alpha,\beta,\gamma,\tau\in\mathbb{N}_d^2).$

In the above, the tensors in $\operatorname{S}^d(\mathbb{C}^2)$ are labelled by vectors in $\mathbb{N}_d^2$. The projectivization $\mathbb{P}v_d(C)$ is called the rational normal curve in the projective space $\mathbb{P}\operatorname{S}^d(\mathbb{C}^2)\simeq\mathbb{P}^d$. For a projective variety $Z\subseteq\mathbb{P}^n$, the Veronese embedding map $v_d$ is defined in the same way as

$v_d:\mathbb{P}^n\to\mathbb{P}\operatorname{S}^d(\mathbb{C}^{n+1}),\quad[u]\mapsto[u^{\otimes d}].$

Note that $\mathbb{P}v_d(X)=v_d(\mathbb{P}X)$ for every affine variety $X$.

Segre varieties

For projective spaces $\mathbb{P}^{n_1},\ldots,\mathbb{P}^{n_k}$, their Segre product, denoted as $\operatorname{Seg}(\mathbb{P}^{n_1}\times\cdots\times\mathbb{P}^{n_k})$, is the image of the Segre map:

$\operatorname{Seg}:\;([u_1],\ldots,[u_k])\mapsto[u_1\otimes\cdots\otimes u_k].$

The dimension of $\operatorname{Seg}(\mathbb{P}^{n_1}\times\cdots\times\mathbb{P}^{n_k})$ is the sum $n_1+\cdots+n_k$. The Segre product of $\mathbb{P}^{n_1},\ldots,\mathbb{P}^{n_k}$ is defined by equations of the form

$\mathcal{A}_\alpha\mathcal{A}_\beta-\mathcal{A}_\gamma\mathcal{A}_\tau=0\quad(\alpha+\beta=\gamma+\tau,\;\alpha,\beta,\gamma,\tau\in\prod_{j=1}^k\{0,\ldots,n_j\}).$

Here, tensors in $\mathbb{P}(\bigotimes_{j=1}^k\mathbb{C}^{n_j+1})$ are labelled by integral tuples in $\prod_{j=1}^k\{0,\ldots,n_j\}$.

Secant varieties

Let $X\subseteq\mathbb{C}^{n+1}$ be an affine variety and let $v_d(X)$ be its image under the $d$th Veronese map $v_d$. Define the set

$\sigma^{\circ}_r(v_d(X)):=\big\{(u_1)^{\otimes d}+\cdots+(u_r)^{\otimes d}:u_1,\ldots,u_r\in X\big\}.$

The Zariski closure of $\sigma^{\circ}_r(v_d(X))$, which we denote as $\sigma_r(v_d(X))\subseteq\operatorname{S}^d(\mathbb{C}^{n+1})$, is called the $r$th secant variety of $v_d(X)$. The closure $\sigma_r(v_d(X))$ is an affine variety in $\operatorname{S}^d(\mathbb{C}^{n+1})$, while $\sigma^{\circ}_r(v_d(X))$ usually is not. However, it holds that

$\dim\sigma^{\circ}_r(v_d(X))=\dim\sigma_r(v_d(X)),$

because $\sigma^{\circ}_r(v_d(X))$ is a dense subset of $\sigma_r(v_d(X))$ in the Zariski topology. When $v_d(X)$ is replaced by a general variety $Y$, the sets $\sigma^{\circ}_r(Y)$ and $\sigma_r(Y)$ can be defined in the same way. We refer to [27] for secant varieties.

3. Properties of STDX

Let $X\subseteq\mathbb{C}^{n+1}$ be a set that is given by homogeneous polynomial equations. For a given tensor $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$, a symmetric $X$-decomposition is

(3.1) $\mathcal{A}=(u_1)^{\otimes d}+\cdots+(u_r)^{\otimes d},\quad u_1,\ldots,u_r\in X.$

The smallest such $r$ is called the symmetric $X$-rank of $\mathcal{A}$, which we denote as $\operatorname{srank}_X(\mathcal{A})$, or equivalently,

(3.2) $\operatorname{srank}_X(\mathcal{A})=\min\{r:\,\mathcal{A}=\sum_{i=1}^r(u_i)^{\otimes d},\;u_i\in X\}.$

When $r$ is the smallest, (3.1) is called a rank-retaining symmetric $X$-decomposition for $\mathcal{A}$. It is possible that the decomposition (3.1) does not exist; in such a case, we define $\operatorname{srank}_X(\mathcal{A})=+\infty$. For instance, a symmetric tensor $\mathcal{A}$ admits a Vandermonde decomposition if and only if $\mathcal{A}$ is a Hankel tensor; so, if $\mathcal{A}$ is not Hankel, then $\operatorname{srank}_X(\mathcal{A})=+\infty$ for that choice of $X$. Interested readers are referred to [35, 39] for more details. We denote by $\operatorname{S}^d(X)$ the subspace of tensors which admit a symmetric $X$-decomposition as in (3.1). As a counterpart of the symmetric border rank, the symmetric border $X$-rank of $\mathcal{A}$ is similarly defined as

(3.3) $\operatorname{sbrank}_X(\mathcal{A}):=\min\{r:\mathcal{A}\in\sigma_r(v_d(X))\},$

where $\sigma_r(v_d(X))$ is the secant variety of $v_d(X)$, defined in Section 2. When $\mathbb{P}X$ is an irreducible variety, $\operatorname{sbrank}_X(\mathcal{A})$ is also equal to the smallest integer $r$ such that $\mathcal{A}$ is the limit of a sequence of tensors whose symmetric $X$-rank is $r$ (see [27, Sec. 5.1.1] or [32, Theorem 2.33]). The generic symmetric $X$-rank of $\operatorname{S}^d(X)$ is the smallest $r$ such that $\sigma_r(v_d(X))=\operatorname{S}^d(X)$. When $X=\mathbb{C}^{n+1}$, the symmetric $X$-rank becomes the usual symmetric rank (or Waring rank). If $\mathbb{P}X=v_d(\mathbb{P}^1)$ is the rational normal curve, the symmetric $X$-rank becomes the Vandermonde rank for Hankel tensors [35, 39]. How can we characterize tensors that have a symmetric $X$-decomposition as in (3.1)? How can we tell whether $\operatorname{srank}_X(\mathcal{A})<+\infty$ or $\operatorname{srank}_X(\mathcal{A})=+\infty$? These questions are the focus of this section.

3.1. Existence of symmetric $X$-decompositions

Let $\mathbb{P}X\subseteq\mathbb{P}^n$ be the projectivization of $X$. Its vanishing ideal is $\mathcal{I}(X)$, the ideal of polynomials that vanish identically on $X$. The space of degree-$d$ forms in $\mathcal{I}(X)$ is

$\mathcal{I}_d(X):=\mathcal{I}(X)\cap\mathbb{C}[x]^h_d.$

Let $c:=\dim\mathcal{I}_d(X)$. Choose a basis $\{f_1,\ldots,f_c\}$ for $\mathcal{I}_d(X)$. Let $l_1,\ldots,l_c$ be linear functionals on $\operatorname{S}^d(\mathbb{C}^{n+1})$ such that

(3.4) $l_i\Big(\sum_j(u_j)^{\otimes d}\Big)=\sum_j f_i(u_j).$

They are linearly independent functionals on $\operatorname{S}^d(\mathbb{C}^{n+1})$. If we use $\hat{f}_i$ to denote the dehomogenization of $f_i$, i.e., $\hat{f}_i(x_1,\ldots,x_n):=f_i(1,x_1,\ldots,x_n)$, then $l_i(\mathcal{A})=\langle\hat{f}_i,\mathcal{A}\rangle$ for all $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$. See (2.4) for the definition of the operation $\langle\cdot,\cdot\rangle$.

Proposition 3.1.

Let $X,c,f_i,l_i$ be as above. Then, a tensor $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$ belongs to $\operatorname{S}^d(X)$ if and only if $l_1(\mathcal{A})=\cdots=l_c(\mathcal{A})=0$. Consequently, the codimension of $\operatorname{S}^d(X)$ is $c$, i.e., $\dim\operatorname{S}^d(X)=\binom{n+d}{d}-c$.

Proof.

Clearly, if $\mathcal{A}\in\operatorname{S}^d(X)$, then $l_1(\mathcal{A})=\cdots=l_c(\mathcal{A})=0$. Next, we prove that the converse is also true. Suppose $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$ is such that $l_1(\mathcal{A})=\cdots=l_c(\mathcal{A})=0$. To show $\mathcal{A}\in\operatorname{S}^d(X)$, it is enough to show that if $l$ is a linear functional vanishing on $\operatorname{S}^d(X)$ then $l(\mathcal{A})=0$. Each $l_i$ vanishes on $v_d(X)$, and $l_1,\ldots,l_c$ are linearly independent as vectors in $\operatorname{S}^d(\mathbb{C}^{n+1})^{\ast}$ (the superscript denotes the dual space). Note that $l\in\operatorname{S}^d(\mathbb{C}^{n+1})^{\ast}$ and it vanishes on $v_d(X)$. So, there is a form $f\in\mathbb{C}[x]^h_d$ such that $l(u^{\otimes d})=f(u)$ for all $u\in\mathbb{C}^{n+1}$. Since $l\equiv 0$ on $v_d(X)$, $f$ also vanishes on $X$. So, $f$ is a linear combination of $f_1,\ldots,f_c$, and hence $l$ is a linear combination of $l_1,\ldots,l_c$. This implies that $l(\mathcal{A})=0$ and $\mathcal{A}\in\operatorname{S}^d(X)$.

Since $l_1,\ldots,l_c$ are linearly independent, the subspace $\operatorname{S}^d(X)$ is defined by $c$ linearly independent linear equations, so its codimension is $c$. Since the dimension of $\operatorname{S}^d(\mathbb{C}^{n+1})$ is $\binom{n+d}{d}$, the dimension of $\operatorname{S}^d(X)$ follows from the codimension. ∎

The first part of Proposition 3.1 is a high dimensional analogue of the apolarity lemma, which can be found in [23, Theorem 5.3], [41, Section 1.3], and [48, Section 3]. For convenience of referencing, we state this result here and give a straightforward proof. We would like to thank Zach Teitler for pointing out the relationship between Proposition 3.1 and the apolarity lemma.

By Proposition 3.1, we get the following algorithm for checking whether $\mathcal{A}\in\operatorname{S}^d(X)$ or not. Suppose the vanishing ideal $\mathcal{I}(X)$ is generated by the forms $f_1,\ldots,f_s$, with degrees $d_1\leq\cdots\leq d_s$ respectively.

Algorithm 3.2.

For a given tensor 𝒜Sd(n+1)\mathcal{A}\in\operatorname{S}^{d}(\mathbb{C}^{n+1}), do the following:

  • Step 1:

    Find the integer $k\geq 0$ such that $d_k\leq d<d_{k+1}$. (Here we adopt the convention that $d_0=0$ and $d_{s+1}=\infty$.)

  • Step 2:

    For each $t=1,\ldots,k$ and $\beta\in\mathbb{N}^{n+1}_{d-d_t}$, let $f_{t,\beta}:=f_t\cdot x_0^{\beta_0}\cdots x_n^{\beta_n}$.

  • Step 3:

    Check whether or not $\langle f_{t,\beta},\mathcal{A}\rangle=0$ for all $t$ and $\beta$ in Step 2. If it is, then $\mathcal{A}\in\operatorname{S}^d(X)$; otherwise, $\mathcal{A}\notin\operatorname{S}^d(X)$.

The above algorithm can be easily applied to detect tensors in $\operatorname{S}^d(X)$. For instance, if $X$ is defined by linear equations $\sum_{j=0}^n f_{ij}x_j=0$ ($i=1,\ldots,c$), then $\mathcal{A}\in\operatorname{S}^d(X)$ if and only if for all $1\leq i\leq c$ and $0\leq k_2,\ldots,k_d\leq n$,

$\sum_{j=0}^n f_{ij}\mathcal{A}_{jk_2\ldots k_d}=0.$

If $X$ is a hypersurface defined by a single homogeneous polynomial $f(x)=0$ with $\deg(f)\leq d$, then $\mathcal{A}\in\operatorname{S}^d(X)$ if and only if

$\langle f\cdot x^{\alpha},\mathcal{A}\rangle=0$

for all monomials $x^{\alpha}$ with $\deg(f)+|\alpha|=d$.
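To make the hypersurface criterion concrete, the following is a minimal numerical sketch (illustrative Python, not the Maple code used in Section 6). It assumes $X$ is the cone over a conic, defined by $f=x_0x_2-x_1^2$ with $n=2$ and $d=3$, and that $\mathcal{A}$ is stored as a full numpy array of shape $(n+1)^d$; the tensor is built from two points of $X$, so the test should report membership.

import itertools
import numpy as np

n, d = 2, 3                      # tensors in S^3(C^3)
deg_f = 2                        # f = x0*x2 - x1^2 defines the cone over a conic

def tensor_power(u, d):
    T = u
    for _ in range(d - 1):
        T = np.multiply.outer(T, u)
    return T

def point_on_X(a, b):            # points of X have the form (a^2, a*b, b^2)
    return np.array([a * a, a * b, b * b], dtype=complex)

A = tensor_power(point_on_X(1.0, 2.0), d) + tensor_power(point_on_X(2.0, -1.0), d)

f = {(1, 0, 1): 1.0, (0, 2, 0): -1.0}   # the form f as {exponent tuple: coefficient}

def pairing(form, A):
    """<g, A> for a homogeneous degree-d form g, using the labeling (2.2)."""
    val = 0.0
    for expo, coef in form.items():
        idx = tuple(j for j, e in enumerate(expo) for _ in range(e))
        val += coef * A[idx]
    return val

def times_monomial(form, mono):
    return {tuple(e + m for e, m in zip(expo, mono)): c for expo, c in form.items()}

in_SdX = all(
    abs(pairing(times_monomial(f, mono), A)) < 1e-8
    for mono in itertools.product(range(d - deg_f + 1), repeat=n + 1)
    if sum(mono) == d - deg_f
)
print("A lies in S^d(X):", in_SdX)      # expected output: True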

When does $\operatorname{S}^d(X)=\operatorname{S}^d(\mathbb{C}^{n+1})$, i.e., when does every tensor admit a symmetric $X$-decomposition? By Proposition 3.1, we get the following characterization.

Corollary 3.3.

Let $X\subseteq\mathbb{C}^{n+1}$ be as above. Then, the equality $\operatorname{S}^d(X)=\operatorname{S}^d(\mathbb{C}^{n+1})$ holds if and only if $\mathcal{I}_d(X)=\{0\}$, which is equivalent to $X$ not being contained in any hypersurface of degree $d$.

The above corollary implies that if $X$ is a hypersurface of degree bigger than $d$, then every tensor in $\operatorname{S}^d(\mathbb{C}^{n+1})$ has a symmetric $X$-decomposition. Moreover, if $X=\mathbb{C}^{n+1}$, then obviously we have $\mathcal{I}_d(X)=\{0\}$ for any $d\geq 1$, which implies the well-known fact [9] that every symmetric tensor admits a symmetric decomposition.

3.2. The dimension and expected rank

By Proposition 3.1, $\dim\operatorname{S}^d(X)=h_{\mathbb{P}X}(d)$, where $h_{\mathbb{P}X}(\cdot)$ is the Hilbert function of the projective variety $\mathbb{P}X$. For the Veronese map $v_d$, we have $\dim v_d(X)=\dim X$. Therefore, the expected dimension of the secant variety $\sigma_r(v_d(X))$ is

$\operatorname{exp.dim}\sigma_r(v_d(X))=\min\{r\dim X,\;h_{\mathbb{P}X}(d)\}.$

The expected generic symmetric $X$-rank of $\operatorname{S}^d(X)$ is therefore

(3.5) $\operatorname{exp.grank}=\big\lceil h_{\mathbb{P}X}(d)/\dim X\big\rceil.$
Example 3.4.

(i) If $\mathbb{P}X=\operatorname{Seg}(\mathbb{P}^{n_1}\times\cdots\times\mathbb{P}^{n_k})$, the Segre variety, then $\dim X=n_1+\cdots+n_k+1$. Its Hilbert function is $h_{\mathbb{P}X}(d)=\prod_{j=1}^k\binom{n_j+d}{n_j}$ [20, Example 18.15]. So

$\dim\operatorname{S}^d(X)=\prod_{j=1}^k\binom{n_j+d}{n_j},\quad\operatorname{exp.grank}=\Bigg\lceil\frac{\prod_{j=1}^k\binom{n_j+d}{n_j}}{\sum_{j=1}^k n_j+1}\Bigg\rceil.$

(ii) If $\mathbb{P}X\subseteq\mathbb{P}^n$ is a hypersurface defined by a form of degree $t$, the Hilbert function of $\mathbb{P}X$ is

$h_{\mathbb{P}X}(d)=\begin{cases}\binom{n+d}{n}, & \text{if } d<t,\\ \binom{n+d}{n}-\binom{n+d-t}{n}, & \text{otherwise}.\end{cases}$

Then, $\dim\operatorname{S}^d(X)=h_{\mathbb{P}X}(d)$ and the $\operatorname{exp.grank}$ can be obtained accordingly.
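As a quick numerical instance of case (ii) and formula (3.5): for a quadric hypersurface $\mathbb{P}X\subseteq\mathbb{P}^3$ (so $n=3$, $t=2$) and $d=3$, one gets $h_{\mathbb{P}X}(3)=\binom{6}{3}-\binom{4}{3}=20-4=16$ and $\dim X=3$, hence $\operatorname{exp.grank}=\lceil 16/3\rceil=6$.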

When $\mathbb{P}X$ is a curve, we can get the dimension of $\sigma_r(v_d(X))$ as follows.

Proposition 3.5.

If $\mathbb{P}X$ is a non-degenerate curve (i.e., $\dim\mathbb{P}X=1$ and $\mathbb{P}X$ is not contained in any proper linear subspace of $\mathbb{P}^n$), then

$\dim\sigma_r(v_d(X))=\min\{2r,\,h_{\mathbb{P}X}(d)\}.$

Therefore, the generic symmetric $X$-rank is $\lceil h_{\mathbb{P}X}(d)/2\rceil$. Moreover, if $\mathbb{P}X$ is nonsingular of genus $g$ and degree $t$, then there exists an integer $d_0$ such that for all $d\geq d_0$,

$\dim\sigma_r(v_d(X))=\min\{2r,\,dt-g+1\},\quad\operatorname{grank}=\Big\lceil\frac{dt-g+1}{2}\Big\rceil.$
Proof.

The first part follows directly from [20, Example 11.30]. The "moreover" part follows from the Riemann-Roch theorem [21, Chapter 4]. ∎

By [20, Example 13.7], if $\mathbb{P}X$ is a space curve of degree $e$, i.e., $\mathbb{P}X$ is a curve in $\mathbb{P}^3$ which intersects a generic plane in $e$ points, then $d_0=e-2$ in Proposition 3.5. For an arbitrary curve $\mathbb{P}X$, however, not much is known about $d_0$. The Hilbert functions are also known for Veronese varieties, Grassmann varieties and flag varieties; see [19, 20]. Hence the expected value of the generic rank for them may be calculated by (3.5).

3.3. Vandermonde decompositions for nonsymmetric tensors

A nonsymmetric tensor $\mathcal{A}\in(\mathbb{C}^{d+1})^{\otimes k}$ is said to admit a Vandermonde decomposition if there exist vectors ($s=1,\ldots,k$, $j=1,\ldots,r$)

$v_s^{(j)}:=\Big((a_{sj})^{d},(a_{sj})^{d-1}b_{sj},\ldots,a_{sj}(b_{sj})^{d-1},(b_{sj})^{d}\Big)\in\mathbb{C}^{d+1}$

such that

$\mathcal{A}=\sum_{j=1}^{r}v_1^{(j)}\otimes\cdots\otimes v_k^{(j)}.$

The smallest integer $r$ in the above is called the Vandermonde rank of $\mathcal{A}$, which we denote as $\operatorname{vrank}(\mathcal{A})$. Since $v_s^{(j)}=(a_{sj},b_{sj})^{\otimes d}\in\operatorname{S}^d(\mathbb{C}^2)\simeq\mathbb{C}^{d+1}$, we can rewrite the above decomposition equivalently as

(3.6) $\mathcal{A}=\sum_{j=1}^{r}\Big((a_{1j},b_{1j})\otimes\cdots\otimes(a_{kj},b_{kj})\Big)^{\otimes d}.$

The Vandermonde decomposition can be thought of as a symmetric tensor decomposition on the set $X\subseteq\mathbb{C}^{2^k}$ such that

$\mathbb{P}X:=\mathbb{P}^1\times\cdots\times\mathbb{P}^1\quad\text{($k$ times)}.$

The variety $\sigma_r(v_d(X))\subseteq\mathbb{P}\operatorname{S}^d(\mathbb{C}^{2^k})$ is exactly the Zariski closure of the set of tensors whose Vandermonde rank is at most $r$. The vanishing ideal of the Segre variety $\mathbb{P}^{n_1}\times\cdots\times\mathbb{P}^{n_k}$ is generated by the $2\times 2$ minors of its flattenings [18]. So $\mathcal{I}_d(\mathbb{P}^{n_1}\times\cdots\times\mathbb{P}^{n_k})\neq 0$ for all $d\geq 2$. By Corollary 3.3, $\operatorname{S}^d(X)$ is a proper subspace of $\operatorname{S}^d(\mathbb{C}^{2^k})$. However, every $\mathcal{A}\in(\mathbb{C}^{d+1})^{\otimes k}$ has a Vandermonde decomposition.

Theorem 3.6.

Every tensor in $(\mathbb{C}^{d+1})^{\otimes k}$ has a Vandermonde decomposition.

Proof.

Each $\mathcal{A}\in(\mathbb{C}^{d+1})^{\otimes k}$ admits a general tensor decomposition, say,

$\mathcal{A}=\sum_{j=1}^{L}u_{j,1}\otimes\cdots\otimes u_{j,k}$

for vectors $u_{j,i}\in\mathbb{C}^{d+1}$. Choose distinct numbers $t_0,t_1,\ldots,t_d$ and let

$v_l:=(1,t_l,t_l^2,\ldots,t_l^d),$

for $l=0,1,\ldots,d$. Clearly, $v_0,v_1,\ldots,v_d$ are linearly independent and they span $\mathbb{C}^{d+1}$. For each $u_{j,i}$, there exist numbers $\lambda_{j,i,0},\ldots,\lambda_{j,i,d}$ such that

$u_{j,i}=\lambda_{j,i,0}v_0+\cdots+\lambda_{j,i,d}v_d.$

Plugging the above expression of $u_{j,i}$ into the decomposition of $\mathcal{A}$, we get

$\mathcal{A}=\sum_{j_1,\ldots,j_k=0}^{d}c_{j_1\ldots j_k}\,v_{j_1}\otimes\cdots\otimes v_{j_k}$

for some scalars $c_{j_1\ldots j_k}$. Since each $v_l=(1,t_l)^{\otimes d}$, every term $c_{j_1\ldots j_k}v_{j_1}\otimes\cdots\otimes v_{j_k}$ becomes a term of the required Vandermonde form after absorbing the scalar $c_{j_1\ldots j_k}$ into one of its factors (taking a $d$th root in $\mathbb{C}$), so the above is a Vandermonde decomposition of $\mathcal{A}$. ∎
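The proof is constructive, and the change of basis it uses can be checked numerically. The following sketch (illustrative Python only; the sizes $d=3$, $k=2$ and the nodes $t_l$ are arbitrary choices) expands a random tensor in the basis $v_0,\ldots,v_d$ by applying $V^{-1}$ along every mode, where the rows of $V$ are the vectors $v_l$, and then reconstructs it.

import numpy as np

d, k = 3, 2
rng = np.random.default_rng(1)
A = rng.standard_normal((d + 1,) * k)

t = np.linspace(0.0, 1.0, d + 1)             # distinct nodes t_0, ..., t_d
V = np.vander(t, N=d + 1, increasing=True)   # row l of V is v_l = (1, t_l, ..., t_l^d)

def mode_product(T, M, mode):
    """Multiply the tensor T by the matrix M along the given mode."""
    return np.moveaxis(np.tensordot(M, T, axes=([1], [mode])), 0, mode)

# coefficients c_{j1...jk} in A = sum_{j1...jk} c_{j1...jk} v_{j1} x ... x v_{jk}
C = A.copy()
for mode in range(k):
    C = mode_product(C, np.linalg.inv(V.T), mode)

# reconstruct A from the coefficients and the Vandermonde-structured vectors v_l
R = C.copy()
for mode in range(k):
    R = mode_product(R, V.T, mode)
print(np.allclose(A, R))                     # True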

4. Generating polynomials

Assume that $X\subseteq\mathbb{C}^{n+1}$ is a set defined by homogeneous polynomial equations. For a given tensor $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$ with $\operatorname{srank}_X(\mathcal{A})=r$, we discuss how to compute the symmetric $X$-decomposition

(4.1) $\mathcal{A}=(u_1)^{\otimes d}+\cdots+(u_r)^{\otimes d},\quad u_1,\ldots,u_r\in X.$

Suppose $X$ is given as

(4.2) $X=\{x\in\mathbb{C}^{n+1}:\,h_i(x)=0,\;i=1,\ldots,N\},$

with homogeneous polynomials $h_i$ in $x:=(x_0,x_1,\ldots,x_n)$. Denote $y:=(y_1,\ldots,y_n)$. The dehomogenization of $X$ is the set

(4.3) $Y=\{y\in\mathbb{C}^n:\,h_i(1,y_1,\ldots,y_n)=0,\;i=1,\ldots,N\}.$

If each $(u_j)_0\neq 0$, then the decomposition (4.1) is equivalent to

(4.4) $\mathcal{A}=\lambda_1(1,v_1)^{\otimes d}+\cdots+\lambda_r(1,v_r)^{\otimes d},$

for scalars $\lambda_j\in\mathbb{C}$ and vectors $v_j\in Y$. In fact, they are

$\lambda_j=((u_j)_0)^d,\quad v_j=\frac{1}{(u_j)_0}\big((u_j)_1,\ldots,(u_j)_n\big)^{\mathsf{T}}.$

The assumption that $(u_j)_0\neq 0$ is generic. For the rare case that it is zero, we can apply a generic coordinate change to $\mathcal{A}$ so that this assumption holds. Throughout this section, we assume the rank $r$ is given.

4.1. Generating polynomials

Denote the quotient ring $\mathbb{C}[Y]:=\mathbb{C}[y]/\operatorname{I}(Y)$, where $\operatorname{I}(Y)$ is the vanishing ideal of $Y$. The ring $\mathbb{C}[Y]$ is also called the coordinate ring of $Y$. We list the monomials in $\mathbb{C}[y]$ with respect to the graded lexicographic order, i.e.,

$1<y_1<\cdots<y_n<y_1^2<y_1y_2<\cdots.$

We choose $B_0$ to be the set of the first $r$ monomials whose images in $\mathbb{C}[Y]$ are linearly independent, say,

$B_0=\{y^{\beta_1},\ldots,y^{\beta_r}\}.$

The border set of $B_0$ is $B_1:=B_0\cup y_1B_0\cup\cdots\cup y_nB_0$. The boundary of $B_1$ is

$\partial B_1:=\big(B_0\cup y_1B_0\cup\cdots\cup y_nB_0\big)\setminus B_0.$

For a matrix $G\in\mathbb{C}^{r\times|\partial B_1|}$, we label its entries as

$G=(c_{i,\alpha})_{i\in[r],\,\alpha\in\partial B_1}.$

For each $\alpha\in\partial B_1$, denote the polynomial in $y$

(4.5) $\varphi[G,\alpha]:=\sum_{i=1}^{r}c_{i,\alpha}y^{\beta_i}-y^{\alpha}.$

Denote the tuple of all such polynomials as

(4.6) $\varphi[G]:=\big(\varphi[G,\alpha](y):\alpha\in\partial B_1\big).$

Let $J_G$ be the ideal generated by $\varphi[G]$. The set $\varphi[G]$ is called a border basis of $J_G$ with respect to $B_0$. We refer to [45, 14] for border sets and border bases.

In the following, we give the definition of generating polynomials, which were introduced in [33]. For $\alpha\in\partial B_1$, the polynomial $\varphi[G,\alpha]$ is called a generating polynomial for $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$ if

(4.7) $\langle y^{\gamma}\varphi[G,\alpha],\mathcal{A}\rangle=0\quad\forall\,\gamma\in\mathbb{N}^n_{d-|\alpha|}.$

(See (2.4) for the operation $\langle\cdot,\cdot\rangle$.) If $\varphi[G,\alpha]$ is a generating polynomial for all $\alpha\in\partial B_1$, then $G$ is called a generating matrix for $\mathcal{A}$. The set of all generating matrices for $\mathcal{A}$ is denoted as $\mathcal{G}(\mathcal{A})$. The condition (4.7) is equivalent to

(4.8) $\sum_{i=1}^{r}c_{i,\alpha}\mathcal{A}_{\beta_i+\gamma}=\mathcal{A}_{\alpha+\gamma}\quad(\gamma\in\mathbb{N}^n_{d-|\alpha|}).$

We use $G(:,\alpha)$ to denote the $\alpha$th column of $G$. Then (4.8) can be rewritten as

$A[\mathcal{A},\alpha]\,G(:,\alpha)=b[\mathcal{A},\alpha],$

where $A[\mathcal{A},\alpha]$ and $b[\mathcal{A},\alpha]$ are given as

$\big(A[\mathcal{A},\alpha]\big)_{\gamma,\beta}=\mathcal{A}_{\beta+\gamma},\quad\big(b[\mathcal{A},\alpha]\big)_{\gamma}=\mathcal{A}_{\alpha+\gamma}\quad\big(\beta\in B_0,\;\gamma\in\mathbb{N}^n_{d-|\alpha|}\big).$

The solutions to (4.8) can be parameterized as $c_{\alpha}+N_{\alpha}w_{\alpha}$, where $c_{\alpha}$ is a particular solution to (4.8), $N_{\alpha}$ is a basis matrix for the nullspace of $A[\mathcal{A},\alpha]$, and $w_{\alpha}$ is the vector of free parameters. So, every generating matrix can be parameterized as

(4.9) $G(w)=C+N(w),$

where $C$ is a constant matrix and $N(w)\in\mathbb{C}^{[r]\times\partial B_1}$ depends linearly on $w$. The following is an example of parameterizing $G(w)$.

Example 4.1.

Consider the tensor $\mathcal{A}\in\operatorname{S}^3(\mathbb{C}^4)$ such that

$\mathcal{A}(1,y_1,y_2,y_3)=32y_3^3-12y_3^2y_2+126y_3^2y_1-48y_3y_2^2+36y_3y_2y_1+150y_3y_1^2-20y_2^3-18y_2^2y_1+42y_2y_1^2+51y_1^3+18y_3^2-36y_3y_2+84y_3y_1-30y_2^2+12y_2y_1+45y_1^2+6y_3-6y_2+9y_1-1.$

For $r=3$, if we choose $B_0=\{1,y_1,y_2\}$, then

$\partial B_1=\{y_3,\,y_1^2,\,y_1y_2,\,y_2^2,\,y_1y_3,\,y_2y_3\},\quad|\partial B_1|=6,$

$\alpha_1=(0,0,1),\quad\alpha_2=(2,0,0),\quad\alpha_3=(1,1,0),\quad\alpha_4=(0,2,0),\quad\alpha_5=(1,0,1),\quad\alpha_6=(0,1,1).$

The $A[\mathcal{A},\alpha_i]$ and $b[\mathcal{A},\alpha_i]$ can be formulated accordingly. For instance, with rows labeled by $\gamma\in\mathbb{N}^3_2$ and the columns of $A[\mathcal{A},\alpha_1]$ labeled by $\beta\in B_0$,

$A[\mathcal{A},\alpha_1]=\begin{array}{c|ccc} & (0,0,0) & (1,0,0) & (0,1,0)\\\hline (0,0,0) & -1 & 3 & -2\\ (1,0,0) & 3 & 15 & 2\\ (0,1,0) & -2 & 2 & -10\\ (0,0,1) & 2 & 14 & -6\\ (2,0,0) & 15 & 51 & 14\\ (1,1,0) & 2 & 14 & -6\\ (0,2,0) & -10 & -6 & -20\\ (1,0,1) & 14 & 50 & 6\\ (0,1,1) & -6 & 6 & -16\\ (0,0,2) & 6 & 42 & -4 \end{array}\,,\qquad b[\mathcal{A},\alpha_1]=\begin{array}{c|c} (0,0,0) & 2\\ (1,0,0) & 14\\ (0,1,0) & -6\\ (0,0,1) & 6\\ (2,0,0) & 50\\ (1,1,0) & 6\\ (0,2,0) & -16\\ (1,0,1) & 42\\ (0,1,1) & -4\\ (0,0,2) & 32 \end{array}\,.$

The generating matrix is uniquely determined, i.e.,

$G(w)=\left[\begin{array}{cccccc} -1 & -3 & -1 & \frac{83}{20} & -4 & \frac{63}{20}\\ 1 & 4 & 1 & -\frac{27}{20} & 4 & -\frac{7}{20}\\ 1 & 0 & 1 & \frac{9}{10} & 1 & \frac{9}{10} \end{array}\right].$

For $r=4$, if we choose $B_0=\{1,y_1,y_2,y_3\}$, then $\partial B_1=\{y_1^2,y_1y_2,y_2^2,y_1y_3,y_2y_3,y_3^2\}$. The generating matrix has $6$ parameters, i.e.,

$G(w)=\begin{bmatrix} -3+w_1 & -1+w_2 & 83/20+w_3 & -4+w_4 & 63/20+w_5 & 3/20+w_6\\ 4-w_1 & 1-w_2 & -27/20-w_3 & 4-w_4 & -7/20-w_5 & 53/20-w_6\\ -w_1 & 1-w_2 & 9/10-w_3 & 1-w_4 & 9/10-w_5 & 9/10-w_6\\ w_1 & w_2 & w_3 & w_4 & w_5 & w_6 \end{bmatrix}.$
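The linear algebra behind (4.8)-(4.9) can also be sketched in code (a minimal illustration, not the authors' implementation; it assumes $\mathcal{A}$ is stored as a full numpy array of shape $(n+1)^d$, and that $B_0$ and $\alpha$ are given as exponent tuples in $\mathbb{N}^n$).

import itertools
import numpy as np

def entry(A, alpha, d):
    """A_alpha under the labeling (2.2), for alpha in N^n with |alpha| <= d."""
    idx = [j for j, e in enumerate(alpha, start=1) for _ in range(e)]
    return A[tuple([0] * (d - len(idx)) + idx)]

def linear_system(A, alpha, B0, d):
    """Assemble the matrix A[A,alpha] and the vector b[A,alpha] from below (4.8)."""
    n = len(alpha)
    gammas = [g for g in itertools.product(range(d + 1), repeat=n)
              if sum(g) <= d - sum(alpha)]
    M = np.array([[entry(A, tuple(b + g for b, g in zip(beta, gamma)), d)
                   for beta in B0] for gamma in gammas])
    rhs = np.array([entry(A, tuple(a + g for a, g in zip(alpha, gamma)), d)
                    for gamma in gammas])
    return M, rhs

def column_parameterization(M, rhs, tol=1e-10):
    """A particular solution c_alpha and a nullspace basis N_alpha, cf. (4.9)."""
    c, *_ = np.linalg.lstsq(M, rhs, rcond=None)
    _, s, Vh = np.linalg.svd(M)
    rank = int((s > tol).sum())
    return c, Vh[rank:].conj().T            # columns of the second output span ker(M)

For the tensor of Example 4.1 with $B_0=\{(0,0,0),(1,0,0),(0,1,0)\}$ and $\alpha_1=(0,0,1)$, the call linear_system(A, (0,0,1), B0, 3) reproduces, up to the ordering of the rows, the matrix $A[\mathcal{A},\alpha_1]$ and the vector $b[\mathcal{A},\alpha_1]$ displayed above.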

4.2. Polynomial division by $\varphi[G]$

From now on, suppose the vanishing ideal

$\operatorname{I}(Y)=\langle g_1,\ldots,g_N\rangle,$

with generators $g_1,\ldots,g_N\in\mathbb{C}[y]$. For $p\in\mathbb{C}[y]$, denote by $\operatorname{NF}(p;G)$ the normal form of $p$ with respect to $\varphi[G]$, i.e., $\operatorname{NF}(p;G)$ is the remainder of $p$ upon division by $\varphi[G]$, obtained by the Border Division Algorithm [22, Proposition 6.4.11]. Formally, $\operatorname{NF}(p;G)$ is the polynomial such that $p-\operatorname{NF}(p;G)\in J_G$, the ideal generated by the polynomials in $\varphi[G]$. Note that $\operatorname{NF}(p;G)$ is a polynomial in $y:=(y_1,\ldots,y_n)$ whose coefficients are parameterized by the entries of $G$.

Proposition 4.2.

Suppose $B_0$ is the set of the first $r$ monomials $y^{\beta_1},\ldots,y^{\beta_r}$ in the graded lexicographic ordering such that their images in $\mathbb{C}[y]/\operatorname{I}(Y)$ are linearly independent. (We remark that, instead of the first $r$ monomials, one may choose any $r$ monomials which are connected to one; interested readers are referred to [31] for a detailed discussion. For simplicity, we use in this paper the set of the first $r$ monomials, which is obviously connected to one. This choice of basis is convenient in practical computations like those in Section 6.) Then, a polynomial $p\in\mathbb{C}[y]$ belongs to $J_G$ if and only if $\operatorname{NF}(p;G)=0$.

Proof.

By the construction, $\varphi[G]$ is a border basis of $J_G$, so it contains a Gröbner basis (say, $S$) of $J_G$ with respect to the graded lexicographic ordering. Indeed, those elements of $\varphi[G]$ which are associated to the corner of $B_0$ form a Gröbner basis; see [45, Proposition 2.30] or [14, Proposition 4.3.9] for the definition of the corner of $B_0$ and more details. Since $S\subseteq\varphi[G]$, the normal form of $p$ with respect to $S$ is the same as the normal form with respect to $\varphi[G]$, which is $\operatorname{NF}(p;G)$. Therefore, $p\in J_G$ if and only if $\operatorname{NF}(p;G)=0$. ∎

The condition $\operatorname{NF}(p;G)=0$ is equivalent to all of its coefficients being identically zero. The coefficients of $\operatorname{NF}(p;G)$ are polynomials in the entries $c_{i,\alpha}$. Their degrees can be bounded as follows. For a degree $k>1$, define the set $B_k$ recursively as

$B_k:=B_{k-1}\cup y_1B_{k-1}\cup\cdots\cup y_nB_{k-1}.$

The monomials in $B_k$ generate a subspace, which we denote as $\operatorname{Span}B_k$.

Lemma 4.3.

Let $B_k$, $p(y)$ and $\operatorname{NF}(p;G)$ be as above. Write $p=p_1+p_2$, where $p_1\in\bigcup_{i\geq 1}\operatorname{Span}B_i$ and $p_2$ is a polynomial whose monomials are not contained in any $B_i$. If $p_1\in\operatorname{Span}B_k$, then the coefficients of $\operatorname{NF}(p;G)$ are polynomials of degree at most $k$ in $G$.

Proof.

Note that $\operatorname{NF}(p;G)=\operatorname{NF}(p_1;G)+p_2$, because $p_2$ is not reducible by $\varphi[G]$. The coefficients of $p_2$ do not depend on $G$.

For the case $k=1$, we can write $p_1\in\operatorname{Span}B_1$ as

$p_1=\sum_{\beta\in\partial B_1}a_{\beta}y^{\beta}+\sum_{\gamma\in B_0}a_{\gamma}y^{\gamma},$

with coefficients $a_{\beta},a_{\gamma}\in\mathbb{C}$. The reduction of $p_1$ by $\varphi[G]$ gives

$\operatorname{NF}(p_1;G)=p_1+\sum_{\beta\in\partial B_1}a_{\beta}\varphi[G,\beta]=\sum_{\gamma\in B_0}a'_{\gamma}(G)\,y^{\gamma},$

where each $a'_{\gamma}(G)$ is affine linear in the entries of $G$ and in the coefficients of $p_1$. So, the coefficients of $\operatorname{NF}(p;G)$ are affine linear in $G$.

For the case $k>1$, we can write $p_1$ as

$p_1=p_0+\sum_{\beta\in\partial B_k}a_{\beta}y^{\beta},$

with $p_0\in\operatorname{Span}B_{k-1}$ and coefficients $a_{\beta}\in\mathbb{C}$. For each $\beta\in\partial B_k$, denote by $i(\beta)$ the smallest $i\in[n]$ such that $y^{\beta}\in y_iB_{k-1}$. Then

$p_1=p_0+\sum_{\beta\in\partial B_k}a_{\beta}\,y_{i(\beta)}\,y^{\beta-e_{i(\beta)}},$

$\operatorname{NF}(p_1;G)=\operatorname{NF}(p_0;G)+\sum_{\beta\in\partial B_k}a_{\beta}\,\operatorname{NF}\Big(y_{i(\beta)}\operatorname{NF}(y^{\beta-e_{i(\beta)}};G);G\Big).$

By induction, the coefficients of $\operatorname{NF}(p_1;G)$ are of degree at most $k$ in $G$. ∎

Example 4.4.

Suppose that $X\subseteq\mathbb{C}^4$ is the affine cone of the Segre variety $\mathbb{P}^1\times\mathbb{P}^1\subseteq\mathbb{P}^3$. Then $Y$ is the surface in $\mathbb{C}^3$ defined by $g(y):=y_3-y_1y_2=0$, where $y_1,y_2$ and $y_3$ are the coordinates of $\mathbb{C}^3$. (Using the labeling of Segre products by integral tuples, the coordinates of $\mathbb{C}^3$ would be $y_{1,0}$, $y_{0,1}$ and $y_{1,1}$.) If we choose $B_0=\{1,y_1,y_2,y_3\}$, then

$\partial B_1=\{y_1^2,\,y_1y_2,\,y_2^2,\,y_1y_3,\,y_2y_3,\,y_3^2\}.$

As in Example 4.1, we can get

$\varphi[G,(2,0,0)]=(-3+w_1)+(4-w_1)y_1-w_1y_2+w_1y_3-y_1^2,$
$\varphi[G,(1,1,0)]=(-1+w_2)+(1-w_2)y_1+(1-w_2)y_2+w_2y_3-y_1y_2,$
$\varphi[G,(0,2,0)]=(83/20+w_4)+(-27/20-w_4)y_1+(9/10-w_4)y_2+w_4y_3-y_2^2,$
$\varphi[G,(1,0,1)]=(-4+w_3)+(4-w_3)y_1+(1-w_3)y_2+w_3y_3-y_1y_3,$
$\varphi[G,(0,1,1)]=(63/20+w_5)+(-7/20-w_5)y_1+(9/10-w_5)y_2+w_5y_3-y_2y_3,$
$\varphi[G,(0,0,2)]=(3/20+w_6)+(53/20-w_6)y_1+(9/10-w_6)y_2+w_6y_3-y_3^2.$

The normal form $\operatorname{NF}(g;G)$ of $g$ is

$\operatorname{NF}(g;G)=(1-w_2)-(1-w_2)y_1-(1-w_2)y_2+(1-w_2)y_3.$

The condition $\operatorname{NF}(g;G)=0$ requires that $1-w_2=0$.

4.3. Commutativity conditions

For each $1\leq i\leq n$ and $G\in\mathbb{C}^{[r]\times\partial B_1}$, define the matrix $M_i(G)\in\mathbb{C}^{r\times r}$ as

$(M_i(G))_{s,t}=\begin{cases}1, & \text{if }\beta_s=\beta_t+e_i\in B_0,\\ 0, & \text{if }\beta_s\neq\beta_t+e_i\in B_0,\\ c_{s,\beta_t+e_i}, & \text{if }\beta_t+e_i\in\partial B_1.\end{cases}$

They are also called multiplication matrices for the ideal $J_G$.
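A small sketch of how these matrices can be assembled and the commutativity conditions of the next theorem checked numerically (illustrative Python, not the authors' Maple code; $B_0$ is assumed to be a list of exponent tuples and the columns of $G$ a dictionary indexed by border exponents):

import itertools
import numpy as np

def multiplication_matrix(i, G_cols, B0):
    """M_i(G) as defined above; here i is 0-based, i.e., it refers to y_{i+1}."""
    r, n = len(B0), len(B0[0])
    e_i = tuple(1 if j == i else 0 for j in range(n))
    M = np.zeros((r, r), dtype=complex)
    for t, beta in enumerate(B0):
        shifted = tuple(b + e for b, e in zip(beta, e_i))
        if shifted in B0:
            M[B0.index(shifted), t] = 1.0
        else:                               # shifted exponent lies in the border dB_1
            M[:, t] = G_cols[shifted]
    return M

def commutativity_holds(Ms, tol=1e-8):
    """Check the commutator conditions [M_i(G), M_j(G)] = 0 of (4.11) below."""
    return all(np.linalg.norm(Mi @ Mj - Mj @ Mi) < tol
               for Mi, Mj in itertools.combinations(Ms, 2))

With the generating matrix of Example 4.1 (the case $r=3$), this construction reproduces the matrices $M_1,M_2,M_3$ displayed in Example 5.4.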

Theorem 4.5.

Let $B_0$ be the set of the first $r$ monomials whose images in $\mathbb{C}[Y]$ are linearly independent. Then, for $G\in\mathbb{C}^{[r]\times\partial B_1}$, the polynomial system

(4.10) $\varphi[G](y)=0$

has $r$ solutions (counting multiplicities) and they all belong to $Y$ if and only if

(4.11) $\big[M_i(G),M_j(G)\big]=0\quad(1\leq i<j\leq n),$

(4.12) $\operatorname{NF}(g_i;G)=0\quad(i=1,\ldots,N),$

where $\langle g_1,\ldots,g_N\rangle=\operatorname{I}(Y)$.

Proof.

By Theorem 2.4 of [33], (4.10) has $r$ solutions (counting multiplicities) if and only if (4.11) holds. All the solutions are contained in $Y$ if and only if $\operatorname{I}(Y)\subseteq J_G$, which is equivalent to each normal form $\operatorname{NF}(g_i;G)$ being zero, by Proposition 4.2. ∎

5. An algorithm for computing STDXs

Let $X,Y$ be the varieties as in (4.2) and (4.3). This section discusses how to compute a symmetric tensor decomposition on $X$ for a given tensor $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$ whose symmetric $X$-rank is $r$. To compute (4.1), it is enough to compute

(5.1) $\mathcal{A}=\lambda_1(1,v_1)^{\otimes d}+\cdots+\lambda_r(1,v_r)^{\otimes d}$

for vectors $v_1,\ldots,v_r\in Y$ and scalars $\lambda_1,\ldots,\lambda_r$.

Choose $B_0=\{y^{\beta_1},\ldots,y^{\beta_r}\}$ to be the set of the first $r$ monomials that are linearly independent in $\mathbb{C}[Y]$. For a point $v\in\mathbb{C}^n$, denote by $B_0(v)\in\mathbb{C}^r$ the column vector whose $i$th entry is $v^{\beta_i}$. Denote the Zariski open subset $D$ of $(\mathbb{C}^n)^r$

$D=\{(v_1,\ldots,v_r)\in(\mathbb{C}^n)^r:\,\det\begin{bmatrix}B_0(v_1)&\cdots&B_0(v_r)\end{bmatrix}\neq 0\}.$

Recall that $\varphi[G]$ is the tuple of generating polynomials as in (4.6), and $\mathcal{G}(\mathcal{A})$ denotes the set of generating matrices for $\mathcal{A}$.

Theorem 5.1.

Let $X,Y,D$ be as above. For each $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$, we have:

  • If $G\in\mathcal{G}(\mathcal{A})$ and $v_1,\ldots,v_r\in Y$ are distinct zeros of $\varphi[G]$, then there exist scalars $\lambda_1,\ldots,\lambda_r$ satisfying (5.1).

  • If the decomposition (5.1) holds for $\mathcal{A}$ and $(v_1,\ldots,v_r)\in D$, then there exists a unique $G\in\mathcal{G}(\mathcal{A})$ such that $v_1,\ldots,v_r$ are zeros of $\varphi[G]$.

Proof.

The conclusions mostly follow from Theorem 3.2 of [33]. The difference is that here we additionally require the points $v_1,\ldots,v_r$ to lie in $Y$. ∎

According to Theorem 5.1, to compute a symmetric $X$-rank decomposition for $\mathcal{A}$, we need to find a generating matrix $G\in\mathcal{G}(\mathcal{A})$ such that:

  1. $\varphi[G]$ has $r$ distinct zeros;

  2. the zeros of $\varphi[G]$ are contained in $Y$, i.e., $\operatorname{I}(Y)\subseteq J_G$.

Conditions (1) and (2) are equivalent to (4.11) and (4.12), respectively, by Theorem 4.5. Suppose the vanishing ideal $\operatorname{I}(Y)=\langle g_1,\ldots,g_N\rangle$. We have the following algorithm.

Algorithm 5.2.

For a given $\mathcal{A}\in\operatorname{S}^d(\mathbb{C}^{n+1})$ with $\operatorname{srank}_X(\mathcal{A})\leq r$, do the following:

  • Step 0

    Choose the set $B_0$ of the first $r$ monomials $y^{\beta_1},\ldots,y^{\beta_r}$ that are linearly independent in the quotient ring $\mathbb{C}[Y]:=\mathbb{C}[y]/\operatorname{I}(Y)$, with respect to the graded lexicographic monomial order.

  • Step 1

    Parameterize the generating matrix $G(w)=C+N(w)$ as in (4.9).

  • Step 2

    For each $g_i$, compute $\operatorname{NF}(g_i;G(w))$ with respect to $\varphi[G(w)]$.

  • Step 3

    Compute a solution of the polynomial system

    (5.2) $\left\{\begin{array}{rl}\big[M_i(G(w)),M_j(G(w))\big]&=0\quad(1\leq i<j\leq n),\\ \operatorname{NF}(g_i;G(w))&=0\quad(1\leq i\leq N).\end{array}\right.$
  • Step 4

    Compute $r$ zeros $v_1,\ldots,v_r\in\mathbb{C}^n$ of the polynomial system

    $\varphi[G(w),\alpha]=0\quad(\alpha\in\partial B_1).$
  • Step 5

    Determine scalars $\lambda_1,\ldots,\lambda_r$ satisfying (5.1).

The major task of Algorithm 5.2 is Step 3. It requires solving a polynomial system given by the commutativity equations and the normal forms. The commutativity equations are quadratic in the parameter $w$. The equations $\operatorname{NF}(g_i;G)=0$ are polynomial in $w$, with degrees bounded as in Lemma 4.3. One can apply existing symbolic or numerical methods for solving polynomial equations. In Step 4, the polynomials $\varphi[G(w),\alpha]$ have special structure; one can get a Gröbner basis quickly, hence their common zeros can be computed efficiently. We refer to [10] and [33, Sec. 2.4] for how to compute their common zeros.
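For concreteness, one possible numerical realization of Steps 4 and 5 is sketched below (illustrative Python under the assumption that the $r$ common zeros are distinct; it is not the Maple implementation used in Section 6). The common zeros of $\varphi[G]$ are read off from the joint eigenvectors of a random combination of the commuting multiplication matrices, and the scalars $\lambda_j$ are then obtained from a linear least-squares problem.

import numpy as np

def zeros_from_multiplication_matrices(Ms, seed=0):
    """Step 4: recover the common zeros v_1,...,v_r of phi[G] from M_1,...,M_n."""
    rng = np.random.default_rng(seed)
    xi = rng.standard_normal(len(Ms))
    _, W = np.linalg.eig(sum(x * M for x, M in zip(xi, Ms)))   # joint eigenvectors
    points = []
    for j in range(W.shape[1]):
        w = W[:, j]
        # for a joint eigenvector w, the Rayleigh quotient of M_i gives the
        # i-th coordinate of one common zero (eigenvalue theorem for J_G)
        points.append([(w.conj() @ (M @ w)) / (w.conj() @ w) for M in Ms])
    return np.array(points)

def weights(A, points, d):
    """Step 5: solve A = sum_j lambda_j (1, v_j)^{otimes d} in the least-squares sense."""
    cols = []
    for v in points:
        u = np.concatenate(([1.0 + 0.0j], v))
        T = u
        for _ in range(d - 1):
            T = np.multiply.outer(T, u)
        cols.append(T.reshape(-1))
    lam, *_ = np.linalg.lstsq(np.array(cols).T, A.reshape(-1), rcond=None)
    return lam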

Remark 5.3.

In Algorithm 5.2, we need to know a value of $r$ with $r\geq\operatorname{srank}_X(\mathcal{A})$. Typically, such an $r$ is not known. In practice, we can choose $r$ heuristically. For instance, we can choose $r$ to be the expected generic rank given in Subsection 3.2. If the flattening matrices of $\mathcal{A}$ have low ranks, we can choose $r$ to be the maximum of their ranks. In any case, if the system (5.2) cannot be solved, we can increase the value of $r$ and repeat the algorithm.

We conclude this section with an example of applying Algorithm 5.2.

Example 5.4.

Let $\mathcal{A}\in\operatorname{S}^3(\mathbb{C}^4)$ be the same tensor as in Example 4.1. Let $X\subseteq\mathbb{C}^4$ be the set whose dehomogenization $Y\subseteq\mathbb{C}^3$ is the surface whose defining ideal is $\operatorname{I}(Y)=\langle y_3-y_1y_2\rangle$. The maximum rank of the flattening matrices of $\mathcal{A}$ is $3$, so we apply Algorithm 5.2 with $r=3$. Choose $B_0=\{1,y_1,y_2\}$. From the calculation in Example 4.1, we can get

$M_1(G(w))=\begin{bmatrix}0&-3&-1\\1&4&1\\0&0&1\end{bmatrix},\quad M_2(G(w))=\begin{bmatrix}0&-1&\frac{83}{20}\\0&1&-\frac{27}{20}\\1&1&\frac{9}{10}\end{bmatrix},\quad M_3(G(w))=\begin{bmatrix}-1&-4&\frac{63}{20}\\1&4&-\frac{7}{20}\\1&1&\frac{9}{10}\end{bmatrix}.$

The matrices M1(G(w)),M2(G(w))M_{1}(G(w)),M_{2}(G(w)) and M3(G(w))M_{3}(G(w)) commute. The generating polynomials are

φ[G,(0,0,1)]\displaystyle\varphi[G,(0,0,1)] =(1+y1+y2)y3,\displaystyle=(-1+y_{1}+y_{2})-y_{3},
φ[G,(2,0,0)]\displaystyle\varphi[G,(2,0,0)] =(3+4y1)y12,\displaystyle=(-3+4y_{1})-y_{1}^{2},
φ[G,(1,1,0)]\displaystyle\varphi[G,(1,1,0)] =(1+y1+y2)y1y2,\displaystyle=(-1+y_{1}+y_{2})-y_{1}y_{2},
φ[G,(0,2,0)]\displaystyle\varphi[G,(0,2,0)] =(83/2027/20y1+9/10y2)y22,\displaystyle=(83/20-27/20y_{1}+9/10y_{2})-y_{2}^{2},
φ[G,(1,0,1)]\displaystyle\varphi[G,(1,0,1)] =(4+4y1+y2)y1y3,\displaystyle=(-4+4y_{1}+y_{2})-y_{1}y_{3},
φ[G,(0,1,1)]\displaystyle\varphi[G,(0,1,1)] =(63/207/20y1+9/10y2)y2y3.\displaystyle=(63/20-7/20y_{1}+9/10y_{2})-y_{2}y_{3}.

They have three common zeros. These zeros do not have simple closed formulae, but they can be computed numerically as

(3,1,3),(1,1.283,1.283),(1,2.183,2.183).(3,1,3),\quad(1,-1.283,-1.283),\quad(1,2.183,2.183).

This gives the symmetric X-decomposition for \mathcal{A} as

𝒜~(y)\displaystyle\widetilde{\mathcal{A}}(y) 2(1+3y1+y2+3y3)30.7353(1+y11.283y21.283y3)3\displaystyle\coloneqq 2\,\left(1+3\,y_{{1}}+\,y_{{2}}+3\,y_{{3}}\right)^{3}-0.7353\,\left(1+\,y_{{1}}-1.283\,y_{{2}}-1.283\,y_{{3}}\right)^{3}
2.265(1+y1+2.183y2+2.183y3)3.\displaystyle-2.265\,\left(1+\,y_{{1}}+2.183\,y_{{2}}+2.183\,y_{{3}}\right)^{3}.

Because of numerical errors, we do not have \widetilde{\mathcal{A}}=\mathcal{A} exactly, but the round-off error is \lVert\widetilde{\mathcal{A}}-\mathcal{A}\rVert\approx 6.84\cdot 10^{-16}.
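The computation in Example 5.4 can be double-checked numerically. The following Python sketch (illustrative only, with the matrices above hard-coded) verifies that the multiplication matrices commute and that the recovered zeros lie on the surface y_3=y_1y_2:

```python
import numpy as np

M1 = np.array([[0., -3., -1.], [1., 4., 1.], [0., 0., 1.]])
M2 = np.array([[0., -1., 83/20], [0., 1., -27/20], [1., 1., 9/10]])
M3 = np.array([[-1., -4., 63/20], [1., 4., -7/20], [1., 1., 9/10]])
Ms = [M1, M2, M3]

# pairwise commutators vanish
for i in range(3):
    for j in range(i + 1, 3):
        assert np.allclose(Ms[i] @ Ms[j], Ms[j] @ Ms[i])

# eigenvectors of a generic combination are common eigenvectors; the
# eigenvalue of M_i on each of them is the i-th coordinate of a common zero
_, W = np.linalg.eig(M1 + 2 * M2 + 3 * M3)
for k in range(3):
    w = W[:, k]
    y1, y2, y3 = [np.real((w.conj() @ Mi @ w) / (w.conj() @ w)) for Mi in Ms]
    print([y1, y2, y3], abs(y3 - y1 * y2))  # the defect y3 - y1*y2 is ~ 0
```

The printed triples agree with the zeros listed above up to rounding and ordering.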

6. Numerical experiments

In this section, we present examples of applying Algorithm 5.2 to compute symmetric X-rank decompositions. The computation is implemented on a laptop with a 2.5 GHz Intel Core i7 processor. The numerical experiments are carried out in Maple 2017. We solve the system (5.2) directly by the built-in function fsolve. The algorithm returns a decomposition

𝒜~λ~1(1,v~1)d++λ~r(1,v~r)d.\widetilde{\mathcal{A}}\coloneqq\widetilde{\lambda}_{1}(1,\widetilde{v}_{1})^{\otimes d}+\cdots+\widetilde{\lambda}_{r}(1,\widetilde{v}_{r})^{\otimes d}.

Because of round-off errors, the equation 𝒜~=𝒜\widetilde{\mathcal{A}}=\mathcal{A} does not hold exactly. We use the absolute error 𝒜𝒜~\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert or the relative one 𝒜𝒜~/𝒜\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert/\lVert\mathcal{A}\rVert to verify the correctness. Here, the Hilbert-Schmidt norm of 𝒜\mathcal{A} is used, i.e.,

\|\mathcal{A}\|=\Big(\sum_{i_{1},\ldots,i_{d}}|\mathcal{A}_{i_{1}\ldots i_{d}}|^{2}\Big)^{1/2}.
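For concreteness, this verification step can be sketched in Python as follows (assuming the tensor is stored as a full numpy array and the decomposition is given by the weights \widetilde{\lambda}_{j} and affine points \widetilde{v}_{j}; the function name is ours):

```python
import numpy as np

def decomposition_error(A, lambdas, V):
    """Rebuild A_tilde = sum_j lambda_j * (1, v_j)^{otimes d} and return the
    absolute and relative errors in the Hilbert-Schmidt norm."""
    d = A.ndim
    A_tilde = np.zeros(A.shape, dtype=complex)
    for lam, v in zip(lambdas, V):
        u = np.concatenate(([1.0], np.asarray(v, dtype=complex)))
        T = u
        for _ in range(d - 1):          # form the d-th tensor power of (1, v_j)
            T = np.multiply.outer(T, u)
        A_tilde += lam * T
    abs_err = np.linalg.norm(A - A_tilde)  # Frobenius = Hilbert-Schmidt norm
    return abs_err, abs_err / np.linalg.norm(A)
```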

We display the computed decompositions by showing

V~=[v~1v~r],Λ~=[λ~1λ~r].\widetilde{V}=\begin{bmatrix}\widetilde{v}_{1}\\ \vdots\\ \widetilde{v}_{r}\end{bmatrix},\quad\widetilde{\Lambda}=\begin{bmatrix}\widetilde{\lambda}_{1}\\ \vdots\\ \widetilde{\lambda}_{r}\end{bmatrix}.

For neatness, only four decimal digits are shown, and i denotes the imaginary unit. If the real or imaginary part of a complex number is smaller than 10^{-10} in magnitude, we treat it as zero and do not display it. To apply Algorithm 5.2, we need a value for the rank r; this issue is discussed in Remark 5.3. In our computation, we initially choose r to be the maximum rank of the flattening matrices of the given tensor \mathcal{A}. If the equations (4.8) or (5.2) are inconsistent, we increase the value of r by one and run the algorithm again, until Algorithm 5.2 successfully returns a decomposition.

For the set Y dehomogenized from X as in (4.3), we need generators of its vanishing ideal I(Y). For some Y, the generators of I(Y) are easy to compute; for others, they may be difficult to obtain. This is a classical, standard problem in symbolic computation; we refer to [13, 15, 17] for related work, and do not focus on how to compute generators of I(Y) in this paper. In our examples, the generators of I(Y) are known or can be computed easily.

First, we illustrate how to apply Algorithm 3.2 to detect the existence of an STDX for a given \mathcal{A} and \mathbb{P}X.

Example 6.1.

We consider 𝒜S3(4)\mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{4}) that is given as

𝒜ijk=i+j+k\mathcal{A}_{ijk}=i+j+k

and X3X\subseteq\mathbb{P}^{3} that is defined by x22=x12+x02x_{2}^{2}=x_{1}^{2}+x_{0}^{2}, x32=x22+x12x_{3}^{2}=x_{2}^{2}+x_{1}^{2}. According to Algorithm 3.2, we have

f1=x22x12x02,f2=x32x22x12.f_{1}=x_{2}^{2}-x_{1}^{2}-x_{0}^{2},\quad f_{2}=x_{3}^{2}-x_{2}^{2}-x_{1}^{2}.

We let β14\beta\in\mathbb{N}^{4}_{1} be β=(1,0,0,0)\beta=(1,0,0,0) and hence we have f1,β=f1x0f_{1,\beta}=f_{1}x_{0}. It is straightforward to verify that f1,β,𝒜0\langle f_{1,\beta},\mathcal{A}\rangle\neq 0 and hence 𝒜\mathcal{A} has no STDX for XX. Another example is

𝒜S4(3),𝒜ijkl=(i+j+k+l)2(i2+j2+k2+l2)\mathcal{A}\in\operatorname{S}^{4}(\mathbb{C}^{3}),\quad\mathcal{A}_{ijkl}=(i+j+k+l)^{2}-(i^{2}+j^{2}+k^{2}+l^{2})

and X\subseteq\mathbb{P}^{2} is defined by x_{0}x_{1}+x_{1}x_{2}+x_{0}x_{2}=0,\ x_{0}^{2}x_{1}+x_{1}^{2}x_{2}+x_{2}^{2}x_{0}=0. By Algorithm 3.2 again, we can easily show that \mathcal{A} does not admit an STDX for such an X.
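The verification in these two cases amounts to evaluating a pairing between a degree-d form and \mathcal{A}. Assuming the natural pairing \langle p,\mathcal{A}\rangle=\sum_{|\gamma|=d}p_{\gamma}\mathcal{A}_{\gamma} for p=\sum_{\gamma}p_{\gamma}x^{\gamma} (so that \langle x^{\gamma},u^{\otimes d}\rangle=u^{\gamma}; we use this form only for illustration), the first check can be sketched in Python as:

```python
import sympy as sp

x0, x1, x2, x3 = sp.symbols('x0 x1 x2 x3')
f1 = x2**2 - x1**2 - x0**2
p = sp.expand(f1 * x0)          # f_{1,beta} with beta = (1,0,0,0)

def A_entry(alpha):
    """A_{i1 i2 i3} = i1 + i2 + i3, addressed through the exponent alpha."""
    idx = []
    for i, a in enumerate(alpha):
        idx += [i] * int(a)
    return sum(idx)

pairing = sum(coeff * A_entry(alpha)
              for alpha, coeff in sp.Poly(p, x0, x1, x2, x3).terms())
print(pairing)                  # a nonzero value rules out an STDX on this X
```

Under the assumed pairing, the printed value is nonzero, which is consistent with the claim above.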

The remaining examples in this section are devoted to exhibiting the validity and efficiency of Algorithm 5.2. To this end, we make the following convention on the representation of tensors. Recall that a tensor \mathcal{A}\in\operatorname{S}^{d}(\mathbb{C}^{n+1}) is an array of numbers whose elements are indexed by (i_{1},\dots,i_{d}), i.e., \mathcal{A}=(\mathcal{A}_{i_{1},\dots,i_{d}}), where 0\leq i_{1},\dots,i_{d}\leq n. As in Section 2.1, we can equivalently represent \mathcal{A} by the entries \mathcal{A}_{\alpha}, where \alpha=(\alpha_{0},\dots,\alpha_{n})\in\mathbb{N}_{d}^{n+1} satisfies \lvert\alpha\rvert=d. To be more precise, for each element in \{(i_{1},\dots,i_{d}):0\leq i_{1},\dots,i_{d}\leq n\}, we have

𝒜i1,,id=𝒜α,\mathcal{A}_{i_{1},\dots,i_{d}}=\mathcal{A}_{\alpha},

where \alpha\in\mathbb{N}_{d}^{n+1} is the exponent such that \lvert\alpha\rvert=d and x_{0}^{\alpha_{0}}\cdots x_{n}^{\alpha_{n}}=x_{i_{1}}\cdots x_{i_{d}}. We list the entries \mathcal{A}_{\alpha} with respect to the lexicographic order, i.e., \mathcal{A}_{\alpha} precedes \mathcal{A}_{\beta} if and only if the leftmost nonzero entry of \alpha-\beta is positive. For instance, a binary cubic tensor \mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{2}) is displayed as \mathcal{A}_{30},\mathcal{A}_{21},\mathcal{A}_{12},\mathcal{A}_{03}. In the following examples, we represent a symmetric tensor \mathcal{A} in this way.
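A short Python sketch of this listing convention (the helper name is ours): it enumerates the exponents \alpha with \lvert\alpha\rvert=d in the order just described; e.g., for \mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{2}) it returns (3,0),(2,1),(1,2),(0,3).

```python
from itertools import combinations_with_replacement

def lex_exponents(n, d):
    """Exponents alpha in N^{n+1} with |alpha| = d, listed so that alpha
    precedes beta iff the leftmost nonzero entry of alpha - beta is positive."""
    exps = []
    for idx in combinations_with_replacement(range(n + 1), d):
        alpha = [0] * (n + 1)
        for i in idx:            # x_{i_1} ... x_{i_d}  ->  exponent alpha
            alpha[i] += 1
        exps.append(tuple(alpha))
    exps.sort(reverse=True)      # descending lexicographic order
    return exps

# lex_exponents(1, 3) == [(3, 0), (2, 1), (1, 2), (0, 3)]
```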

Example 6.2.

Let X2\mathbb{P}X\subseteq\mathbb{P}^{2} be the parabola defined by x22x0x1+x02=0x_{2}^{2}-x_{0}x_{1}+x_{0}^{2}=0. Let 𝒜S3(3)\mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{3}) be the symmetric tensor such that

𝒜300\displaystyle\mathcal{A}_{300} =15,𝒜210=81,𝒜201=6,𝒜120=621,𝒜111=108,𝒜102=66,\displaystyle=15,\mathcal{A}_{210}=81,\mathcal{A}_{201}=-6,\mathcal{A}_{120}=621,\mathcal{A}_{111}=-108,\mathcal{A}_{102}=66,
𝒜030\displaystyle\mathcal{A}_{030} =5541,𝒜021=1296,𝒜012=540,𝒜003=102.\displaystyle=5541,\mathcal{A}_{021}=-1296,\mathcal{A}_{012}=540,\mathcal{A}_{003}=-102.

The vanishing ideal of Y2Y\subseteq\mathbb{C}^{2} is

I(Y)=y22y1+1.I(Y)=\langle y_{2}^{2}-y_{1}+1\rangle.

The maximum rank of the flattening matrices of \mathcal{A} is 3. When we run Algorithm 5.2 with r=3, it does not give a desired tensor decomposition. So, we apply Algorithm 5.2 with r=4 and get the symmetric X-decomposition with

\widetilde{V}=\begin{bmatrix}9.874-0.002786\,i&-2.979+0.0004677\,i\\ 1.151-0.006293\,i&-0.3886+0.008096\,i\\ 4.124+0.02027\,i&1.768+0.005734\,i\\ 16.62+1.251\,i&3.956+0.1581\,i\end{bmatrix},\quad \widetilde{\Lambda}=\begin{bmatrix}5.184+0.0036\,i\\ 3.694+0.0184\,i\\ 6.098-0.0154\,i\\ 0.0246-0.0066\,i\end{bmatrix}.

We have 𝒜=7241.79\lVert{\mathcal{A}}\rVert=7241.79 and the error 𝒜𝒜~=81014\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert=8\cdot 10^{-14}.

Example 6.3.

Let X2\mathbb{P}X\subseteq\mathbb{P}^{2} be the nodal curve defined by x13+x0x12x0x22=0.x_{1}^{3}+x_{0}x_{1}^{2}-x_{0}x_{2}^{2}=0. The vanishing ideal of Y2Y\subseteq\mathbb{C}^{2} is

I(Y)=y13+y12y22.I(Y)=\langle y_{1}^{3}+y_{1}^{2}-y_{2}^{2}\rangle.

Let 𝒜S3(3)\mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{3}) be the symmetric tensor such that

𝒜300\displaystyle\mathcal{A}_{300} =3,𝒜210=24,𝒜201=72,𝒜120=144,𝒜111=456,𝒜102=1224,\displaystyle=3,\mathcal{A}_{210}=24,\mathcal{A}_{201}=72,\mathcal{A}_{120}=144,\mathcal{A}_{111}=456,\mathcal{A}_{102}=1224,
𝒜030\displaystyle\mathcal{A}_{030} =1080,𝒜021=3288,𝒜012=9432,𝒜003=28512.\displaystyle=1080,\mathcal{A}_{021}=3288,\mathcal{A}_{012}=9432,\mathcal{A}_{003}=28512.

The maximum rank of flattening matrices of 𝒜\mathcal{A} is 33. When we run Algorithm 5.2 with r=3,4r=3,4, it fails to give a desired tensor decomposition. So, we apply Algorithm 5.2 with r=5r=5 and get the symmetric XX-decomposition with

\widetilde{V}=\begin{bmatrix}3.029-0.07505\,i&-6.079+0.2073\,i\\ -1.047+0.1114\,i&0.2338+0.2814\,i\\ 2.615-0.09868\,i&4.970-0.2555\,i\\ 7.927-0.008981\,i&23.68-0.03874\,i\\ -5.685-3.079\,i&10.84-10.80\,i\end{bmatrix},\quad \widetilde{\Lambda}=\begin{bmatrix}-0.9769-0.06497\,i\\ -1.4650-0.1107\,i\\ 3.3470+0.1587\,i\\ 2.0980+0.01425\,i\\ -0.002438+0.002715\,i\end{bmatrix}.

We have 𝒜=41632.56\lVert{\mathcal{A}}\rVert=41632.56 and the error 𝒜𝒜~=41013\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert=4\cdot 10^{-13}.

Example 6.4.

Let X3\mathbb{P}X\subseteq\mathbb{P}^{3} be the union of two planes defined by (x3x2)(x1x0)=0(x_{3}-x_{2})(x_{1}-x_{0})=0. Let 𝒜S3(4)\mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{4}) be the symmetric tensor such that

𝒜3000\displaystyle\mathcal{A}_{3000} =2,𝒜2100=1,𝒜2010=5,𝒜2001=3,𝒜1200=5,𝒜1110=10,𝒜1020=9,\displaystyle=2,\mathcal{A}_{2100}=1,\mathcal{A}_{2010}=5,\mathcal{A}_{2001}=-3,\mathcal{A}_{1200}=5,\mathcal{A}_{1110}=10,\mathcal{A}_{1020}=9,
𝒜1101\displaystyle\mathcal{A}_{1101} =2,𝒜1011=7,𝒜1002=5,𝒜0300=5,𝒜0210=2,𝒜0120=8,𝒜0030=29,\displaystyle=2,\mathcal{A}_{1011}=7,\mathcal{A}_{1002}=5,\mathcal{A}_{0300}=-5,\mathcal{A}_{0210}=2,\mathcal{A}_{0120}=8,\mathcal{A}_{0030}=29,
𝒜0201\displaystyle\mathcal{A}_{0201} =6,𝒜0111=6,𝒜0021=7,𝒜0102=4,𝒜0012=5,𝒜0003=9.\displaystyle=-6,\mathcal{A}_{0111}=6,\mathcal{A}_{0021}=7,\mathcal{A}_{0102}=4,\mathcal{A}_{0012}=5,\mathcal{A}_{0003}=-9.

The vanishing ideal of Y3Y\subseteq\mathbb{C}^{3} is

I(Y)=(y3y2)(y11).I(Y)=\langle(y_{3}-y_{2})(y_{1}-1)\rangle.

The maximum rank of flattening matrices of 𝒜\mathcal{A} is 44. When we run Algorithm 5.2 with r=4r=4, it fails to give a desired tensor decomposition. So, we use r=5r=5 and apply Algorithm 5.2. It returns the symmetric XX-decomposition

\widetilde{V}=\begin{bmatrix}-2.0&-1.0&-1.0\\ 1.0&-1.487&1.0\\ 1.0&2.287&1.0\\ -1.0&1.0&1.0\\ 1.0&0&-2.0\end{bmatrix},\quad \widetilde{\Lambda}=\begin{bmatrix}1.0\\ -1.249\\ 2.249\\ -1.0\\ 1.0\end{bmatrix}.

We have 𝒜=104.86\lVert\mathcal{A}\rVert=104.86 and the error 𝒜𝒜~=1015\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert=10^{-15}.

Example 6.5.

Let \mathbb{P}X\subseteq\mathbb{P}^{3} be the surface defined by -3x_{1}x_{2}^{2}+x_{1}^{3}-x_{0}^{2}x_{3}=0. Then Y\subseteq\mathbb{C}^{3} is the monkey saddle surface whose vanishing ideal is

I(Y)=3y1y22+y13y3.I(Y)=\langle-3y_{1}y_{2}^{2}+y_{1}^{3}-y_{3}\rangle.

Let 𝒜S3(4)\mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{4}) be the symmetric tensor such that

𝒜3000\displaystyle\mathcal{A}_{3000} =5,𝒜2100=1,𝒜2010=6,𝒜2001=13,𝒜1200=9,𝒜1110=8,𝒜1020=6,\displaystyle=5,\mathcal{A}_{2100}=-1,\mathcal{A}_{2010}=6,\mathcal{A}_{2001}=-13,\mathcal{A}_{1200}=9,\mathcal{A}_{1110}=8,\mathcal{A}_{1020}=6,
𝒜1101\displaystyle\mathcal{A}_{1101} =33,𝒜1011=16,𝒜1002=87,𝒜0300=17,𝒜0210=24,𝒜0120=10,𝒜0030=12,\displaystyle=-33,\mathcal{A}_{1011}=-16,\mathcal{A}_{1002}=87,\mathcal{A}_{0300}=17,\mathcal{A}_{0210}=24,\mathcal{A}_{0120}=10,\mathcal{A}_{0030}=12,
𝒜0201\displaystyle\mathcal{A}_{0201} =91,𝒜0111=54,𝒜0021=38,𝒜0102=233,𝒜0012=5,𝒜0003=739.\displaystyle=-91,\mathcal{A}_{0111}=-54,\mathcal{A}_{0021}=-38,\mathcal{A}_{0102}=233,\mathcal{A}_{0012}=5,\mathcal{A}_{0003}=-739.

The maximum rank of flattening matrices of 𝒜\mathcal{A} is 44. When we run Algorithm 5.2 with r=4,5r=4,5, it fails to give a desired tensor decomposition. So, we apply Algorithm 5.2 with r=6r=6 and get the symmetric XX-decomposition

\widetilde{V}=\begin{bmatrix}5.936-0.8124\,i&3.582-0.4510\,i&-19.62+2.978\,i\\ -0.4115-0.5223\,i&-1.801+1.185\,i&9.226-2.511\,i\\ 2.042-0.01150\,i&-1.031+0.008333\,i&2.008-0.001995\,i\\ -0.6422-0.02100\,i&0.9440+0.001783\,i&1.453+0.03664\,i\\ 2.157-0.08836\,i&1.576-0.04318\,i&-6.043+0.3063\,i\\ -1.0-1.505\cdot 10^{-18}\,i&-3.163\cdot 10^{-18}-1.004\cdot 10^{-18}\,i&-1.0+1.519\cdot 10^{-18}\,i\end{bmatrix},

\widetilde{\Lambda}=\begin{bmatrix}0.03454+0.02496\,i\\ 0.001235+0.00440\,i\\ -0.9564-0.01231\,i\\ 2.0480-0.05064\,i\\ 1.8730+0.03358\,i\\ 2.0+1.8560\cdot 10^{-17}\,i\end{bmatrix},\quad \lVert\mathcal{A}\rVert=1275.93,\quad \lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert=2\cdot 10^{-15},\quad \frac{\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert}{\lVert\widetilde{\mathcal{A}}\rVert}=2\cdot 10^{-18}.

In the next two examples, we still display the tensor entries 𝒜α\mathcal{A}_{\alpha} according to the lexicographic order, but we drop the labelling indices, for cleanness of the paper.

Example 6.6.

Let X4\mathbb{P}X\subseteq\mathbb{P}^{4} be the curve defined by

x_{1}x_{3}-x_{0}x_{2}-x_{0}x_{1}+x_{0}x_{3}=0,\quad x_{3}^{2}-x_{0}x_{1}-x_{0}^{2}=0,
x_{1}x_{4}+4x_{1}x_{3}-x_{1}x_{2}-x_{1}^{2}+5x_{0}x_{1}=0.

The vanishing ideal of Y4Y\subseteq\mathbb{C}^{4} is then

I(Y)=\langle y_{1}y_{3}-y_{2}-y_{1}+y_{3},\ y_{3}^{2}-y_{1}-1,\ y_{1}y_{4}+4y_{1}y_{3}-y_{1}y_{2}-y_{1}^{2}+5y_{1}\rangle.

Let 𝒜S3(5)\mathcal{A}\in\operatorname{S}^{3}(\mathbb{C}^{5}) be the symmetric tensor whose 3535 entries are

7,2,87,25,20,26,334,233,60,45,9,130,144,74,182,406,1754,1150,13647,300,156,2353,24,421,85,830,610,6500,60,1050,150,550,3630,880,250.-7,-2,87,25,20,26,334,-233,60,-45,-9,130,-144,-74,182,406,1754,1150,\\ 13647,300,156,2353,24,421,85,830,610,6500,60,1050,150,550,3630,880,-250.

The maximum rank of flattening matrices of 𝒜\mathcal{A} is 55. When we run Algorithm 5.2 with r=5r=5, it fails to give a desired tensor decomposition. So, we apply Algorithm 5.2 with r=6r=6 and get the symmetric XX-decomposition

V~=[680.917130.026.1117700.07.71718.022.9528.92601.01.010.528.239i0.22720.45210.87912.1630.75261.5681.3247.9752.94510.781.9864.891],Λ~=[01.172006.7803.8715.263].\widetilde{V}=\left[\begin{array}[]{cccc}680.9&17130.0&26.11&17700.0\\ 7.717&18.02&2.952&8.926\\ 0&-1.0&-1.0&-10.52-8.239\,i\\ -0.2272&-0.4521&-0.8791&-2.163\\ 0.7526&1.568&1.324&-7.975\\ 2.945&-10.78&-1.986&-4.891\end{array}\right],\quad\widetilde{\Lambda}=\left[\begin{array}[]{c}0\\ 1.1720\\ 0\\ -6.780\\ 3.871\\ -5.263\end{array}\right].

We have the norm 𝒜=29222.16\lVert{\mathcal{A}}\rVert=29222.16 and the error 𝒜𝒜~=1010\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert=10^{-10}.

Example 6.7.

Let X4\mathbb{P}X\subseteq\mathbb{P}^{4} be the surface defined by

x32+x42x0x1=0,x3x4x0x2=0.\displaystyle x_{3}^{2}+x_{4}^{2}-x_{0}x_{1}=0,\quad x_{3}x_{4}-x_{0}x_{2}=0.

Then the vanishing ideal of the variety Y4Y\subseteq\mathbb{C}^{4} is

I(Y)=y32+y42y1,y3y4y2.I(Y)=\langle y_{3}^{2}+y_{4}^{2}-y_{1},y_{3}y_{4}-y_{2}\rangle.

Let 𝒜S4(5)\mathcal{A}\in\operatorname{S}^{4}(\mathbb{C}^{5}) be the symmetric tensor whose 7070 entries are respectively

22,38,89,6,34,220,490,79,119,65,32,165,71,89,6,2216,3044,686,653,1029,490,239,195,173,48,1111,574,257,490,79,65,25,317,71,100,21424,20440,6028,4570,1615,8033,3788,1918,929,1553,1162,415,455,233,116,8187,4316,1954,1007,3044,686,653,490,239,173,663,1882,271,574,257,79,621,335,317,54.22,38,89,6,34,220,490,79,119,65,32,165,71,89,6,2216,3044,686,653,1029,\\ 490,239,195,173,48,1111,574,257,490,79,65,25,317,71,100,21424,20440,\\ 6028,4570,1615,8033,3788,1918,929,1553,1162,415,455,233,116,8187,4316,1954,\\ 1007,3044,686,653,490,239,173,663,1882,271,574,257,79,621,335,317,-54.

The maximum rank of flattening matrices of 𝒜\mathcal{A} is 1010. When we run Algorithm 5.2 with r=10r=10, we get the symmetric XX-decomposition

V~\displaystyle\widetilde{V} =[5.02.01.02.00.7596+0.2348i0.2187+0.5369i1.00.2187+0.5369i1.937+0.1658i0.9718+0.08532i1.00.9718+0.08532i2.01.01.01.01.0001.02.065+0.07655i1.0330.03706i1.01.0330.03706i5.003+0.006546i2.0010.001636i1.02.0010.001636i2.01.01.01.04.999+0.01109i2.0+0.002774i1.02.0+0.002774i8.04.02.02.0],\displaystyle=\left[\begin{array}[]{cccc}5.0&-2.0&-1.0&2.0\\ 0.7596+0.2348\,i&0.2187+0.5369\,i&1.0&0.2187+0.5369\,i\\ 1.937+0.1658\,i&0.9718+0.08532\,i&1.0&0.9718+0.08532\,i\\ 2.0&1.0&-1.0&-1.0\\ 1.0&0&0&1.0\\ 2.065+0.07655\,i&-1.033-0.03706\,i&1.0&-1.033-0.03706\,i\\ 5.003+0.006546\,i&-2.001-0.001636\,i&1.0&-2.001-0.001636\,i\\ 2.0&-1.0&-1.0&1.0\\ 4.999+0.01109\,i&2.0+0.002774\,i&1.0&2.0+0.002774\,i\\ 8.0&4.0&2.0&2.0\end{array}\right],
Λ~\displaystyle\widetilde{\Lambda} =[10.00.505+0.297i3.2320.413i11.011.03.778+0.132i7.970+0.053i5.06.010.069i7.0],𝒜=147394.37,𝒜𝒜~=81011.\displaystyle=\left[\begin{array}[]{c}-10.0\\ 0.505+0.297\,i\\ 3.232-0.413\,i\\ 11.0\\ 11.0\\ -3.778+0.132\,i\\ -7.970+0.053\,i\\ 5.0\\ 6.01-0.069\,i\\ 7.0\end{array}\right],\quad\lVert{\mathcal{A}}\rVert=147394.37,\quad\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert=8\cdot 10^{-11}.

We conclude this section by considering various examples on Segre varieties.

Example 6.8.

(Vandermonde decompositions of nonsymmetric tensors) Each tensor 𝒜(d+1)k\mathcal{A}\in(\mathbb{C}^{d+1})^{\otimes k} has a Vandermonde decomposition as in (3.6), which is proved in Theorem 3.6. We can view 𝒜\mathcal{A} as a tensor in Sd(2k)\operatorname{S}^{d}(\mathbb{C}^{2^{k}}) with the set X2kX\subseteq\mathbb{C}^{2^{k}} such that X=1××1\mathbb{P}X=\mathbb{P}^{1}\times\cdots\times\mathbb{P}^{1} (1\mathbb{P}^{1} is repeated kk times). Let

n=2k1.n=2^{k}-1.

A vector x\in\mathbb{C}^{2^{k}} can be labelled as x=(x_{\nu}), with binary vectors \nu\in\{0,1\}^{k}. Under this labelling, the set X is defined by the homogeneous equations (cf. [20, Example 2.11] or [27, Section 4.3.5])

(6.1) xμxνxηxθ=0x_{\mu}x_{\nu}-x_{\eta}x_{\theta}=0

for all μ,ν,η,θ{0,1}k\mu,\nu,\eta,\theta\in\{0,1\}^{k} such that for some 1i<jk1\leq i<j\leq k,

μl=νl=ηl=θl(li,j),μi+νj=ηj+θi.\mu_{l}=\nu_{l}=\eta_{l}=\theta_{l}\,(l\neq i,j),\quad\mu_{i}+\nu_{j}=\eta_{j}+\theta_{i}.

Under the dehomogenization x00=1x_{0\ldots 0}=1, the corresponding affine variety Y2k1Y\subseteq\mathbb{C}^{2^{k}-1} consists of vectors yy, labelled as y=(xμ)y=(x_{\mu}) with 0μ{0,1}k0\neq\mu\in\{0,1\}^{k}, defined by the vanishing ideal

I(Y)=yμyνyηyθ,I(Y)=\langle y_{\mu}y_{\nu}-y_{\eta}y_{\theta}\rangle,

where μ,ν,η,θ{0,1}k\mu,\nu,\eta,\theta\in\{0,1\}^{k} are as above and y00=1y_{0\ldots 0}=1.
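To make the labelling concrete: a point of the cone over \mathbb{P}^{1}\times\cdots\times\mathbb{P}^{1} is x=a^{(1)}\otimes\cdots\otimes a^{(k)} with each a^{(j)}\in\mathbb{C}^{2}, i.e., x_{\nu}=a^{(1)}_{\nu_{1}}\cdots a^{(k)}_{\nu_{k}}, and such points are exactly the vectors whose 2\times\cdots\times 2 reshaping has rank at most one, which is where the quadratic equations (6.1) vanish. A small numpy sketch of this membership test (illustrative only; the binary labels are read off in a fixed order, and the function name is ours):

```python
import numpy as np

def on_segre_cone(x, k, tol=1e-10):
    """Test whether x in C^{2^k}, labelled by binary multi-indices, lies on
    the cone over P^1 x ... x P^1, i.e. its 2 x ... x 2 reshaping has rank
    at most one."""
    T = np.asarray(x).reshape((2,) * k)
    for i in range(k):
        F = np.moveaxis(T, i, 0).reshape(2, -1)   # mode-i flattening
        if np.linalg.matrix_rank(F, tol=tol) > 1:
            return False
    return True

# a product vector x = a1 (x) a2 (x) a3 lies on the cone
rng = np.random.default_rng(0)
a1, a2, a3 = (rng.standard_normal(2) + 1j * rng.standard_normal(2) for _ in range(3))
x = np.multiply.outer(np.multiply.outer(a1, a2), a3).reshape(-1)
assert on_segre_cone(x, 3)
```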

We apply Algorithm 5.2 to compute Vandermonde decompositions for tensors \mathcal{A}\in(\mathbb{C}^{d+1})^{\otimes k} whose entries are randomly generated from the normal distribution. For all the instances, Algorithm 5.2 successfully obtained Vandermonde rank decompositions. The relative errors \frac{\lVert\mathcal{A}-\widetilde{\mathcal{A}}\rVert}{\lVert\mathcal{A}\rVert} are all of the magnitude O(10^{-16}). The consumed computational time (in seconds) is also reported. The results are displayed in Table 1.

Table 1. Computational results on symmetric XX-decompositions on Segre varieties
kk nn dd rr time kk nn dd rr time
2 3 3 4 7.07 3 7 3 7 7.45
2 3 4 4 7.08 3 7 3 9 9.74
2 3 4 5 7.11 3 7 3 10 19.04
2 3 4 6 7.31 3 7 4 8 6.85
2 3 4 7 7.45 3 7 4 9 6.88
2 3 4 8 8.11 3 7 4 10 6.54
2 3 5 5 7.10 3 7 4 11 8.35
2 3 5 6 7.08 3 7 4 12 11.37
2 3 5 7 7.22 3 7 4 13 22.84
2 3 5 8 7.17 3 7 4 14 41.45
2 3 5 9 7.37 3 7 4 15 69.26
2 3 5 10 7.76 3 7 4 16 134.47
2 3 5 11 8.87 3 7 4 17 224.97
2 3 6 12 7.84 3 7 4 18 382.00
2 3 6 13 7.97 3 7 5 19 8.35
2 3 6 14 8.50 3 7 5 20 8.45
2 3 6 15 9.26 3 7 5 21 8.65
2 3 6 16 12.33 3 7 5 22 8.65
2 3 7 17 8.27 3 7 5 23 8.80
2 3 7 18 12.11 3 7 5 24 9.08
2 3 7 19 34.24 3 7 5 25 8.96
2 3 7 20 722.98 3 7 6 26 9.10

7. Conclusion

In this paper, we discuss how to compute symmetric X-decompositions of symmetric tensors on a given variety X. The tool of generating polynomials is used to do the computation. Based on it, we give an algorithm for computing symmetric X-decompositions. Various examples are given to demonstrate the correctness and efficiency of the proposed method.

References

  • [1] H. Abo and M. Brambilla. On the dimensions of secant varieties of Segre–Veronese varieties. Annali di Matematica Pura ed Applicata, 192(1):61–92, 2013.
  • [2] J. Alexander and A. Hirschowitz. Polynomial interpolation in several variables. Journal of Algebraic Geometry, 4 (1995), pp. 201–222.
  • [3] E. Ballico and A. Bernardi. Decomposition of homogeneous polynomials with low rank. Math. Z., 271, 1141–1149, 2012.
  • [4] A. Bernardi, A. Gimigliano and M. Idà. Computing symmetric rank for symmetric tensors. Journal of Symbolic Computation   46, (2011), 34-53.
  • [5] J. Brachat, P. Comon, B. Mourrain, and E. Tsigaridas. Symmetric tensor decomposition. Linear Algebra and its Applications, 433(11):1851–1872, 2010.
  • [6] L. Chiantini, G. Ottaviani, and N. Vannieuwenhoven. On generic identifiability of symmetric tensors of subgeneric rank. Trans. Amer. Math. Soc., 369 (2017), 4021-4042.
  • [7] L. Chiantini, J. Hauenstein, C. Ikenmeyer, J. Landsberg, and G. Ottaviani. Polynomials and the exponent of matrix multiplication. arXiv preprint arXiv:1706.05074, 2017.
  • [8] G. Comas and M. Seiguer. On the rank of a binary form. Foundations of Computational Mathematics, Vol. 11, No. 1, pp. 65-78, 2011.
  • [9] P. Comon, G. Golub, L.-H. Lim, and B. Mourrain. Symmetric tensors and symmetric tensor rank. SIAM Journal on Matrix Analysis and Applications, 30(3):1254–1279, 2008.
  • [10] R. Corless, P. Gianni, and B. Trager. A reordered Schur factorization method for zero-dimensional polynomial systems with multiple roots. In International Symposium on Symbolic and Algebraic Computation, pp. 133–140, 1997.
  • [11] D. Cox, J. Little and D. O’Shea. Ideals, varieties, and algorithms: an introduction to computational algebraic geometry and commutative algebra, Springer, 2007.
  • [12] L. De Lathauwer and B. De Moor. From matrix to tensor: Multilinear algebra and signal processing. In Institute of Mathematics and Its Applications Conference Series, vol. 67, pp. 1–16, 1998.
  • [13] D. Eisenbud, C. Huneke and W. Vasconcelos. Direct methods for primary decomposition. Inventiones Mathematicae, 110(1):207–235, 1992.
  • [14] A. Kehrein, M. Kreuzer and L. Robbiano. An algebraist’s view on border bases. In: A. Dickenstein et al. (eds) Solving Polynomial Equations: Foundations, Algorithms, and Applications. Algorithms and Computation in Mathematics, vol. 14, pp. 169–202. Springer-Verlag, New York-Heidelberg, 2005.
  • [15] E. Fortuna, P. Gianni and B. Trager. Derivations and radicals of polynomial ideals over fields of arbitrary characteristic. Journal of Symbolic Computation, 33.5 (2002): 609-625.
  • [16] F. Galuppi and M. Mella. Identifiability of homogeneous polynomials and Cremona Transformations. Preprint, 2016. arXiv:1606.06895v2[math.AG]
  • [17] P. Gianni, B. Trager, and G. Zacharias. Gröbner bases and primary decomposition of polynomial ideals. Journal of Symbolic Computation, 6(2-3):149–167, 1988.
  • [18] R. Grone. Decomposable tensors as a quadratic variety. Proceedings of the American Mathematical Society, 64(2):227–230, 1977.
  • [19] B. Gross and N. Wallach. On the Hilbert polynomials and Hilbert series of homogeneous projective varieties. Arithmetic geometry and automorphic forms 19 (2011): 253-263.
  • [20] J. Harris. Algebraic geometry, Graduate Texts in Mathematics, vol. 133. Springer-Verlag, New York, 1992.
  • [21] R. Hartshorne. Algebraic geometry. Graduate Texts in Mathematics, vol. 52. Springer-Verlag, New York-Heidelberg, 1977.
  • [22] M. Kreuzer and L. Robbiano. Computational commutative algebra 2. Springer Science & Business Media, 2005.
  • [23] A. Iarrobino and V. Kanev. Power Sums, Gorenstein algebras, and determinantal varieties. Lecture Notes in Mathematics #1721, Springer, 1999.
  • [24] T. Kolda and B. Bader. Tensor decompositions and applications. SIAM Rev.   vol. 51, no. 3, 455-500, 2009.
  • [25] A. Laface and E. Postinghel. Secant varieties of Segre–Veronese embeddings of (\mathbb{P}^{1})^{\times r}. Mathematische Annalen, 356(4):1455–1470, 2013.
  • [26] J. Landsberg. The border rank of the multiplication of 2×22\times 2 matrices is seven. Journal of the American Mathematical Society, 19(2):447–459, 2006.
  • [27] J. Landsberg. Tensors: geometry and applications. Graduate Studies in Mathematics, 128, AMS, Providence, RI, 2012.
  • [28] J. Landsberg and G. Ottaviani. Equations for secant varieties of Veronese and other varieties. Annali di Matematica Pura ed Applicata, 192(4):569–606, 2013.
  • [29] L.-H. Lim and P. Comon. Multiarray signal processing: Tensor decomposition meets compressed sensing. Comptes Rendus Mecanique, 338(6):311–320, 2010.
  • [30] L.-H. Lim. Tensors and hypermatrices, in: L. Hogben (Ed.), Handbook of linear algebra, 2nd Ed., CRC Press, Boca Raton, FL, 2013.
  • [31] B. Mourrain. A new criterion for normal form algorithms. International Symposium on Applied Algebra, Algebraic Algorithms, and Error-Correcting Codes, Springer, Berlin, Heidelberg, 1999.
  • [32] D. Mumford. Algebraic Geometry I: Complex Projective Varieties. Springer Verlag, Berlin, 1995.
  • [33] J. Nie. Generating polynomials and symmetric tensor decompositions. Foundations of Computational Mathematics, 17(2):423–465, 2017.
  • [34] J. Nie. Low rank symmetric tensor approximations, SIAM J. Matrix Anal. Appl., 38 (no. 4), pp. 1517–1540, 2017.
  • [35] J. Nie and K. Ye. Hankel tensor decompositions and ranks. SIAM J. Matrix Anal. Appl., 40 (no. 2), 486–516, 2019.
  • [36] D. Nion and N. Sidiropoulos. Tensor algebra and multidimensional harmonic retrieval in signal processing for MIMO radar. IEEE Transactions on Signal Processing, 58(11):5693–5705, 2010.
  • [37] L. Oeding and G. Ottaviani. Eigenvectors of tensors and algorithms for Waring decomposition. Journal of Symbolic Computation, 54:9–35, 2013.
  • [38] J. Papy, L. De Lathauwer, and S. Van Huffel. Exponential data fitting using multilinear algebra: the single-channel and multi-channel case. Numerical Linear Algebra with Applications, 12(8):809–826, 2005.
  • [39] L. Qi. Hankel tensors: Associated Hankel matrices and Vandermonde decomposition. Communications in Mathematical Sciences, 13(1):113–125, 2015.
  • [40] C. Raicu. Secant varieties of Segre–Veronese varieties. Algebra & Number Theory, 6(8):1817–1868, 2012.
  • [41] K. Ranestad and F. O. Schreyer. Varieties of sums of powers. Journal für die reine und angewandte Mathematik, (2000): 147–182.
  • [42] S. Sam. Ideals of bounded rank symmetric tensors are generated in bounded degree. Inventiones mathematicae, 207(1):1–21, 2017.
  • [43] S. Sam. Syzygies of bounded rank symmetric tensors are generated in bounded degree. Mathematische Annalen, 368(3-4):1095–1108, 2017.
  • [44] M. Signoretto, L. De Lathauwer, and J. Suykens. A kernel-based framework to tensorial data analysis. Neural networks, 24(8):861–874, 2011.
  • [45] H. Stetter. Numerical Polynomial Algebra. SIAM, Philadelphia, 2004.
  • [46] V. Strassen. Gaussian elimination is not optimal. Numerische mathematik, 13(4):354–356, 1969.
  • [47] W. Sun and H. So. Accurate and computationally efficient tensor-based subspace approach for multidimensional harmonic retrieval. IEEE Transactions on Signal Processing, 60(10):5077–5088, 2012.
  • [48] Z. Teitler. Sufficient conditions for Strassen’s additivity conjecture. Illinois Journal of Mathematics, 59.4 (2015): 1071–1085.
  • [49] S. Trickett, L. Burroughs, A. Milton, et al. Interpolation using Hankel tensor completion. In 2013 SEG Annual Meeting. Society of Exploration Geophysicists, 2013.
  • [50] W. Uemura and O. Sugino. Symmetric tensor decomposition description of fermionic many-body wave functions. Physical Review Letters, 109(25):253001, 2012.
  • [51] K. Ye and L.-H. Lim. Fast structured matrix computations: tensor rank and Cohn–Umans method. Foundations of Computational Mathematics, 18(1), pp. 45–95, 2018.
  • [52] K. Ye and L.-H. Lim. Tensor network ranks. arXiv preprint arXiv:1801.02662, 2018.