This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Algorithms yield upper bounds
in differential algebra

Wei Li KLMM, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, No.55 Zhongguancun East Road, 100190, Beijing, China [email protected] Alexey Ovchinnikov CUNY Queens College, Department of Mathematics, 65-30 Kissena Blvd, Queens, NY 11367, USA and CUNY Graduate Center, Mathematics and Computer Science, 365 Fifth Avenue, New York, NY 10016, USA [email protected] Gleb Pogudin LIX, CNRS, École Polytechnique, Institut Polytechnique de Paris, 1 rue Honoré d’Estienne d’Orves, 91120, Palaiseau, France and National Research University Higher School of Economics, Moscow, Russia [email protected]  and  Thomas Scanlon University of California, Berkeley, Department of Mathematics, Berkeley, CA 94720-3840, USA [email protected]
Abstract.

Consider an algorithm computing in a differential field with several commuting derivations such that the only operations it performs with the elements of the field are arithmetic operations, differentiation, and zero testing. We show that, if the algorithm is guaranteed to terminate on every input, then there is a computable upper bound for the size of the output of the algorithm in terms of the size of the input. We also generalize this to algorithms working with models of good enough theories (including for example, difference fields).

We then apply this to differential algebraic geometry to show that there exists a computable uniform upper bound for the number of components of any variety defined by a system of polynomial PDEs. We then use this bound to show the existence of a computable uniform upper bound for the elimination problem in systems of polynomial PDEs with delays.

2010 Mathematics Subject Classification:
Primary 12H05, 12H10, 03C10; Secondary 03C60, 03D15

1. Introduction

Finding uniform bounds for problems and quantities (e.g., consistency testing or counting of solutions) is one of the central questions in differential algebra. In [27], it was demonstrated that, in commutative algebra, one can show the existence of such bounds as a consequence of theorems about nonstandard extensions of standard algebraic objects. This approach was successfully applied in the differential algebra context in [11] and [8, Section 6] for establishing, for example, the existence of a uniform bound in the differential Nullstellensatz. Furthermore, in [26], the authors used methods of proof theory to extract explicit bounds based on nonstandard existence proofs.

The present paper can be viewed as an alternative approach, in which we derive the existence of a computable uniform bound for an object from the existence of an algorithm for computing the object. More precisely, let TT be a complete decidable theory. The most relevant examples for us would be the theory of differentially closed fields in zero characteristic with mm commuting derivations and the theory of existentially closed difference fields, others include algebraically closed and real closed fields. Consider an algorithm AA performing computations in a model of TT that is restricted to using only definable functions when working with elements of the model (for formal definition, we refer to Section 4.1) and required to terminate for every input.

We show that there is a computable upper bound for the size of the output of AA in terms of the input size of AA. We apply this to the Rosenfeld-Gröbner algorithm [3] that decomposes a solution set of a system of polynomial PDEs into components and is such an algorithm. This allows us to show that there is a uniform upper bound for the number of components of any differential-algebraic variety defined by a system of polynomial PDEs. We also show how this bound for the number of components leads to a uniform upper bound for the elimination problem in systems of polynomial PDEs with delays.

A bound for the number of components of varieties defined by polynomial ODEs appeared in [18], as did a bound for the elimination problem for polynomial ODEs with delays. These bounds are based on the application of the Rosenfeld-Gröbner algorithm, which, if applied in this situation to ODEs, outputs equations whose order does not exceed the order of the input. This allowed to restrict to a finitely generated subring of the ring of differential polynomials and use tools from algebraic geometry. It is non-trivial to generalize this to polynomial PDEs because the orders in the output of the Rosenfeld-Gröbner can be greater than the orders of the input. Another key ingredient in the ODE case to obtain the bound in [18] was an analysis of differential dimension polynomials. A significant difference of our present PDE context with the ordinary case that these polynomials behave less predictably under projections of varieties (compare [18, Lemma 6.16] and Lemma 6.6). To overcome this difficulty, we use again our bound for the Rosenfeld-Gröbner algorithm.

We believe that our method can also be applied to obtain bounds for other algorithms in differential algebra such as [1, Algorithm 3.6] and for algorithms from other theories, e.g. [7, Algorithm 3] for systems of difference equations. Since the reducibility of a polynomial can be expressed as a first-order existential formula, it seems plausible that the same methods could be applied to other algorithms dealing with difference [5] and differential-difference [6] equations that use factorization because the corresponding theories satisfy the requirements of our approach [14, 16, 23]. However, we leave these for future research.

The paper is organized as follows. Section 2 contains definitions and notation used in Section 3 to state the main results. Bounds for an algorithm working with a model of a theory TT are established in Section 4. These results are applied to differential algebra in Section 5. Further applications to delay PDEs are given in Section 6.

2. Basic notions and notaiton

Definition 2.1 (Differential-difference rings).
  • A Δ\Delta-σ\sigma-ring (,Δ,σ)(\mathcal{R},\Delta,\sigma) is a commutative ring \mathcal{R} endowed with a finite set Δ={1,,m}\Delta=\{\partial_{1},\ldots,\partial_{m}\} of commuting derivations of \mathcal{R} and an endomorphism σ\sigma of \mathcal{R} such that, for all ii, iσ=σi\partial_{i}\sigma=\sigma\partial_{i}.

  • When \mathcal{R} is additionally a field, it is called a Δ\Delta-σ\sigma-field.

  • If σ\sigma is an automorphism of \mathcal{R}, \mathcal{R} is called a Δ\Delta-σ\sigma^{\ast}-ring.

  • If σ=id\sigma=\operatorname{id}, \mathcal{R} is called a Δ\Delta-ring or differential ring.

  • For a commutative ring \mathcal{R}, F\langle F\rangle denotes the ideal generated by FF\subset\mathcal{R} in \mathcal{R}.

  • For Δ={1,,m}\Delta=\{\partial_{1},\ldots,\partial_{m}\}, let ΘΔ={1i1mimij0, 1jm}\Theta_{\Delta}=\{\partial_{1}^{i_{1}}\cdot\ldots\cdot\partial_{m}^{i_{m}}\mid i_{j}\geqslant 0,\,1\leqslant j\leqslant m\}.

  • For θ=1i1mimΘΔ\theta=\partial_{1}^{i_{1}}\cdot\ldots\cdot\partial_{m}^{i_{m}}\in\Theta_{\Delta}, we let ordθ=i1++im\operatorname{ord}\theta=i_{1}+\ldots+i_{m}. For a non-negative integer BB, we denote ΘΔ(B):={θΘΔordθB}\Theta_{\Delta}(B):=\{\theta\in\Theta_{\Delta}\mid\,\operatorname{ord}\theta\leqslant B\}.

  • For a Δ\Delta-ring \mathcal{R}, the differential ideal generated by FF\subset\mathcal{R} in \mathcal{R} is denoted by F()\langle F\rangle^{(\infty)}; for a non-negative integer BB, we introduce the following ideal of \mathcal{R}:

    F(B):=θ(F)θΘΔ(B).\langle F\rangle^{(B)}:=\langle\theta(F)\mid\,\theta\in\Theta_{\Delta}(B)\rangle.
Definition 2.2 (Differential polynomials).

Let \mathcal{R} be a Δ\Delta-ring. The differential polynomial ring over \mathcal{R} in 𝒚=y1,,yn\bm{y}=y_{1},\ldots,y_{n} is defined as

{𝒚}Δ:=[θysθΘΔ; 1sn].\mathcal{R}\{\bm{y}\}_{\Delta}:=\mathcal{R}[\theta y_{s}\mid\theta\in\Theta_{\Delta};\,1\leqslant s\leqslant n].

The structure of a Δ\Delta-ring is defined by i(θys):=(iθ)ys\partial_{i}(\theta y_{s}):=(\partial_{i}\theta)y_{s} for every θΘΔ\theta\in\Theta_{\Delta}.

Definition 2.3 (Differential-difference polynomials).

Let \mathcal{R} be a Δ\Delta-σ\sigma-ring. The differential-difference polynomial ring over \mathcal{R} in 𝒚=y1,,yn\bm{y}=y_{1},\ldots,y_{n} is defined as

[𝒚]:=[θσiysθΘΔ;i0; 1sn].\mathcal{R}[\bm{y}_{\infty}]:=\mathcal{R}[\theta\sigma^{i}y_{s}\mid\theta\in\Theta_{\Delta};\,i\geqslant 0;\,1\leqslant s\leqslant n].

The structure of Δ\Delta-σ\sigma ring is defined by σ(θσjys):=θσj+1ys\sigma(\theta\sigma^{j}y_{s}):=\theta\sigma^{j+1}y_{s} and i(θσjys):=(iθ)σjys\partial_{i}(\theta\sigma^{j}y_{s}):=(\partial_{i}\theta)\sigma^{j}y_{s} for every θΘΔ\theta\in\Theta_{\Delta} and j0j\geqslant 0.

A Δ\Delta-σ\sigma-polynomial is an element of [𝒚]\mathcal{R}[\bm{y}_{\infty}]. Given BB\in\mathbb{N}, let [𝒚B]\mathcal{R}[\bm{y}_{B}] denote the polynomial ring

[θσjysθΘΔ(B);0jB;1sn].\mathcal{R}[\theta\sigma^{j}y_{s}\mid\theta\in\Theta_{\Delta}(B);0\leqslant j\leqslant B;1\leqslant s\leqslant n].

The notions from logic that we use are described in detail in [19]. In particular, we will use the notions of a first-order language [19, Definition 1.1.1], structure [19, Definition 1.1.2], formula [19, Definition 1.1.5], theory [19, Section 1.2, page 14], model [19, Section 1.2, page 14], compactness [19, Section 2.1], complete theory [19, Definition 2.2.1], decidable theory [19, Definition 2.2.7], quantifier elimination [19, Definition 3.1.1], and 0\aleph_{0}-saturation [19, Definition 4.3.1].

3. Main results

For clarity, we gather our main results in one section.

Theorem 3.1 (Upper bound for irreducible components for PDEs).

There exists a computable function Components(m,n)\operatorname{Components}(m,n) such that, for every differential field kk of zero characteristic with a set of mm commuting derivations Δ\Delta and finite Fk{y1,,yn}ΔF\subset k\{y_{1},\ldots,y_{n}\}_{\Delta} with max{ordF,degF}s\max\{\operatorname{ord}F,\deg F\}\leqslant s, the number of components in the variety defined by F=0F=0 does not exceed Components(m,max{n,s})\operatorname{Components}(m,\max\{n,s\}).

Additional details and proof are given in Theorem 5.13.

Theorem 3.2 (Upper bound for elimination in delay PDEs).

For all non-negative integers rr, mm and ss, there exists a computable B=B(r,m,s)B=B(r,m,s) such that, for all:

  • non-negative integers qq and tt,

  • a Δ\Delta-σ\sigma-field kk with chark=0\operatorname{char}k=0 and |Δ|=m|\Delta|=m,

  • sets of Δ\Delta-σ\sigma-polynomials Fk[𝒙t,𝒚s]F\subset k[\bm{x}_{t},\bm{y}_{s}], where 𝒙=x1,,xq\bm{x}=x_{1},\ldots,x_{q}, 𝒚=y1,,yr\bm{y}=y_{1},\ldots,y_{r}, and deg𝒚Fs\deg_{\bm{y}}F\leqslant s,

we have

σi(F)i0()k[𝒙]{0}σi(F)i[0,B](B)k[𝒙B+t]{0}.\big{\langle}\sigma^{i}(F)\mid i\in\mathbb{Z}_{\geqslant 0}\big{\rangle}^{(\infty)}\cap k[\bm{x}_{\infty}]\neq\{0\}\\ \iff\langle\sigma^{i}(F)\mid i\in[0,B]\big{\rangle}^{(B)}\cap k[\bm{x}_{B+t}]\neq\{0\}.
Corollary 3.3 (Effective Nullstellensatz for delay PDEs).

For all non-negative integers rr, mm and ss, there exists a computable B=B(r,m,s)B=B(r,m,s) such that, for all:

  • Δ\Delta-σ\sigma-fields kk with chark=0\operatorname{char}k=0 and |Δ|=m|\Delta|=m,

  • sets of Δ\Delta-σ\sigma-polynomials Fk[𝒚s]F\subset k[\bm{y}_{s}], where 𝒚=y1,,yr\bm{y}=y_{1},\ldots,y_{r}, and degFs\deg F\leqslant s,

the following statements are equivalent:

  1. (1)

    There exists a Δ\Delta-σ\sigma^{*} field LL extending kk such that F=0F=0 has a sequence solution in LL.

  2. (2)

    1σi(F)i[0,B](B)1\notin\langle\sigma^{i}(F)\mid i\in[0,B]\big{\rangle}^{(B)}.

  3. (3)

    There exists a field extension LL of kk such that the polynomial system {σi(F)(j)=0i,j[0,B]}\big{\{}\sigma^{i}(F)^{(j)}=0\mid i,j\in[0,B]\big{\}} in the finitely many unknowns 𝒚B+s\bm{y}_{B+s} has a solution in LL.

The two preceding theorems are proved using our main technical result about algorithms performing computations in complete decidable theories. Stating it precisely requires defining admissible algorithms carefully, so we postpone it until Section 4 and give here a simplified and informal version of the statement.

Theorem 3.4 (Algorithm yields a bound, stated precisely as Theorem 4.5).

There exists a computable function with input

  • a complete decidable theory TT;

  • an algorithm 𝒜\mathcal{A} performing computations in a model of TT restricted to using only definable functions when working with elements of the model;

  • positive integer \ell

that computes a number NN such that for every model MM of TT and every 𝐚M\mathbf{a}\in M^{\ell} the size of the output of 𝒜\mathcal{A} with input 𝐚\mathbf{a} does not exceed NN.

For the application of this to the Rosenfeld-Gröbner algorithm, see Theorem 5.10.

4. Bounds for the output size of algorithms over complete theories

In this section, we will use the formalism of oracle Turing machines [24, § 14.3]. Roughly speaking, an oracle Turing machine is a Turing machine with an extra tape for performing queries to an external oracle. An oracle is not considered to be a part of the machine.

4.1. Setup

To consider an algorithm dealing with elements of a (not necessarily computable) model of a theory TT, we will “encapsulate” the elements of the model given to the algorithm into an oracle that allows to perform only first-order operations with them as defined below. Alternatively, one could adapt other approaches used to formalize computations in real numbers [2, Section 3] or in arbitrary structures (see [9, §1] and [4, §2.2]).

Definition 4.1 (TT-oracle).

Let \mathcal{L} be a language and TT be a theory in \mathcal{L}. For elements a1,,aa_{1},\ldots,a_{\ell} of a model MM of TT, any oracle that supports the following queries: given a formula φ(x1,,x)\varphi(x_{1},\ldots,x_{\ell}), the oracle returns the value φ(a1,,a)\varphi(a_{1},\ldots,a_{\ell}) in MM (can be true or false), will be denoted by 𝒪M(a1,,a)\mathcal{O}_{M}(a_{1},\ldots,a_{\ell}) and called an evaluation oracle.

Definition 4.2 (Total algorithm over TT).

An oracle Turing machine 𝒜\mathcal{A} will be called a total algorithm over TT if, for all positive integers \ell, every model MM of TT and every a1,,aMa_{1},\ldots,a_{\ell}\in M, the machine with every input and oracle 𝒪M(a1,,a)\mathcal{O}_{M}(a_{1},\ldots,a_{\ell}) is guaranteed to terminate.

4.2. Auxiliary bound and result

Lemma 4.3.

There is an algorithm that takes as input:

  • language \mathcal{L};

  • a complete decidable theory TT given by a Turing machine checking correctness of sentences in the theory;

  • a total algorithm 𝒜\mathcal{A} over TT;

  • positive integers \ell and NN;

  • a string 𝒮\mathcal{S} in the input alphabet of 𝒜\mathcal{A};

and computes

  • a first-order formula φ=φT,𝒜(,𝒮,N)\varphi=\varphi_{T,\mathcal{A}}(\ell,\mathcal{S},N) in \mathcal{L} in \ell variables and

  • a number 𝒩:=𝒩T,𝒜(,𝒮,N)\mathcal{N}:=\mathcal{N}_{T,\mathcal{A}}(\ell,\mathcal{S},N)

such that, for any model MM of TT and tuple 𝐚M\mathbf{a}\in M^{\ell}, the following are equivalent:

  1. (1)

    the sentence φ(𝐚)\varphi(\mathbf{a}) is true in MM;

  2. (2)

    algorithm 𝒜\mathcal{A} with input 𝒮\mathcal{S} and oracle 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}) terminates after performing at most NN queries to the oracle

and if these statements are true, then the number of steps performed by 𝒜\mathcal{A} with input 𝒮\mathcal{S} and oracle 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}) does not exceed 𝒩\mathcal{N}.

Proof.

We describe an algorithm for computing φT,𝒜(,𝒮,N)\varphi_{T,\mathcal{A}}(\ell,\mathcal{S},N) and 𝒩T,𝒜(,𝒮,N)\mathcal{N}_{T,\mathcal{A}}(\ell,\mathcal{S},N). Fix some ,T,𝒜,\mathcal{L},T,\mathcal{A}, \ell, and 𝒮\mathcal{S}.

We will describe an algorithm that, for a given positive integer ss, computes first-order formulas ψs\psi_{s} and qsq_{s} in \mathcal{L} in the variables 𝐱=(x1,,x)\mathbf{x}=(x_{1},\ldots,x_{\ell}) and a positive integer 𝒩s\mathcal{N}_{s} such that, for every model MM of TT and every 𝐚T\mathbf{a}\in T^{\ell}

  • ψs(𝐚)\psi_{s}(\mathbf{a}) is true in MM iff algorithm 𝒜\mathcal{A} with input 𝒮\mathcal{S} and oracle 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}) will perform at least ss queries;

  • if ψs(𝐚)\psi_{s}(\mathbf{a}) is true in MM, then the result of the ss-th query will be qs(𝐚)q_{s}(\mathbf{a});

  • if algorithm 𝒜\mathcal{A} with input 𝒮\mathcal{S} and oracle 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}) performs at most ss queries, then the number of steps performed does not exceed 𝒩s\mathcal{N}_{s}.

Fix some s1s\geqslant 1 and assume that the algorithm have computed ψ1,,ψs1\psi_{1},\ldots,\psi_{s-1}, q1,,qs1q_{1},\ldots,q_{s-1}, and 𝒩0,,𝒩s2\mathcal{N}_{0},\ldots,\mathcal{N}_{s-2}. Assume that 𝒜\mathcal{A} with input 𝒮\mathcal{S} has performed s1s-1 queries. Then whether or not an ss-th query will be performed is determined by the results of the first s1s-1 queries. Fix some 𝐫{True,False}s1\mathbf{r}\in\{\mathrm{True},\mathrm{False}\}^{s-1}. It will represent possible results of the first s1s-1 queries. Consider the following formula in \mathcal{L}:

ψ𝐫(𝐱):=ψs1(𝐱)i=1s1(qi(𝐱)ri),\psi_{\mathbf{r}}(\mathbf{x}):=\psi_{s-1}(\mathbf{x})\wedge\bigwedge\limits_{i=1}^{s-1}\left(q_{i}(\mathbf{x})\iff r_{i}\right),

where we assume ψ0=True\psi_{0}=\mathrm{True}. The algorithm uses the algorithm for checking correctness of sentences in TT to check whether the sentence 𝐱ψ𝐫(𝐱)\exists\mathbf{x}\;\psi_{\mathbf{r}}(\mathbf{x}) is false in TT. If it is, then there is no oracle of the form 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}) such that 𝒜\mathcal{A} will perform at least s1s-1 queries on it with the results being r1,,rs1r_{1},\ldots,r_{s-1}.

In the case of 𝐱ψ𝐫(𝐱)\exists\mathbf{x}\;\psi_{\mathbf{r}}(\mathbf{x}) is true in TT, the algorithm will run 𝒜\mathcal{A} with input 𝒮\mathcal{S} and an oracle 𝒪𝐫\mathcal{O}_{\mathbf{r}} that works as follows. For the first s1s-1 queries, 𝒪𝐫\mathcal{O}_{\mathbf{r}} will return r1,,rs1r_{1},\ldots,r_{s-1}. For all subsequent queries, it always returns True. The algorithm will stop the execution of 𝒜\mathcal{A} if 𝒜\mathcal{A} makes an ss-th query to the oracle, and denote the formula in the query by q𝐫q_{\mathbf{r}}.

Since 𝐱ψ𝐫(𝐱)\exists\mathbf{x}\;\psi_{\mathbf{r}}(\mathbf{x}) is true in TT, 𝒪𝐫\mathcal{O}_{\mathbf{r}} gives the same responses to the first s1s-1 queries as some oracle of the form 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}). Since 𝒜\mathcal{A} must terminate in finite time for every such oracle, one of the following must happen:

  1. (1)

    𝒜\mathcal{A} will perform an ss-th query.

  2. (2)

    𝒜\mathcal{A} will terminate after performing only s1s-1 queries.

In the former case, as described above, the algorithm will define a formula q𝐫q_{\mathbf{r}} to be the ss-th query. In the latter case, the algorithm will define 𝒩𝐫\mathcal{N}_{\mathbf{r}} to be the number of steps performed by 𝒜\mathcal{A}. Then the algorithm computes

ψs(𝐱):=q𝐫 is definedψ𝐫(𝐱),qs(𝐱):=q𝐫 is defined(ψ𝐫(𝐱)q𝐫(𝐱)),\displaystyle\psi_{s}(\mathbf{x}):=\bigvee\limits_{q_{\mathbf{r}}\text{ is defined}}\psi_{\mathbf{r}}(\mathbf{x}),\quad q_{s}(\mathbf{x}):=\bigwedge\limits_{q_{\mathbf{r}}\text{ is defined}}(\psi_{\mathbf{r}}(\mathbf{x})\implies q_{\mathbf{r}}(\mathbf{x})),
𝒩s1:=max(𝒩s2,𝒩𝐫 is defined𝒩𝐫),\displaystyle\mathcal{N}_{s-1}:=\max\left(\mathcal{N}_{s-2},\sum\limits_{\mathcal{N}_{\mathbf{r}}\text{ is defined}}\mathcal{N}_{\mathbf{r}}\right),

where we assume 𝒩1=\mathcal{N}_{-1}=-\infty. If the set {𝐫q𝐫 is defined}\{\mathbf{r}\mid q_{\mathbf{r}}\text{ is defined}\} is empty, the algorithm sets ψs(𝐱)=False\psi_{s}(\mathbf{x})=\operatorname{False} and qs(𝐱)=Trueq_{s}(\mathbf{x})=\operatorname{True}. Finally, the algorithm returns φT,𝒜(,𝒮,N):=¬ψN+1\varphi_{T,\mathcal{A}}(\ell,\mathcal{S},N):=\lnot\psi_{N+1} and 𝒩T,𝒜(,𝒮,N):=𝒩N\mathcal{N}_{T,\mathcal{A}}(\ell,\mathcal{S},N):=\mathcal{N}_{N}. ∎

Lemma 4.4.

Let TT be a theory and MM an 0\aleph_{0}-saturated model. Let U1U2U3U_{1}\supset U_{2}\supset U_{3}\supset\ldots be a sequence of definable sets in MnM^{n} such that i=1Ui=\bigcap\limits_{i=1}^{\infty}U_{i}=\varnothing. Then there exists NN such that UN=U_{N}=\varnothing.

Proof.

Assume the contrary, that is, that UiU_{i}\neq\varnothing for every i1i\geqslant 1. We will show that i=1Ui\bigcap\limits_{i=1}^{\infty}U_{i}\neq\varnothing.

We show that a collection of formulas {xUi}i=1\{x\in U_{i}\}_{i=1}^{\infty} is finitely satisfiable. Indeed, let S>0S\subset\mathbb{Z}_{>0} be a finite set and N=maxSN=\max S. Then iSUi=UN\bigcap_{i\in S}U_{i}=U_{N}\neq\varnothing. Due to compactness, the countable collection {xUi}i=1\{x\in U_{i}\}_{i=1}^{\infty} is satisfiable in some elementary extension of MM. Since MM is 0\aleph_{0}-saturated, this collection is satisfiable in MM. Therefore, i=1Ui\bigcap\limits_{i=1}^{\infty}U_{i}\neq\varnothing. ∎

4.3. Main result

Theorem 4.5.

There exists a computable function StepsT,𝒜(,r)\operatorname{Steps}_{T,\mathcal{A}}(\ell,r) with input

  • a complete decidable theory TT (given by an algorithm for checking correctness of sentences);

  • a total algorithm 𝒜\mathcal{A} over TT;

  • positive integers \ell and rr

that computes a number NN such that for every model MM of TT, every 𝐚M\mathbf{a}\in M^{\ell}, and every string 𝒮\mathcal{S} in the alphabet of 𝒜\mathcal{A} of size at most rr, the number of steps performed by 𝒜\mathcal{A} with input 𝒮\mathcal{S} and oracle 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}) does not exceed NN.

Remark 4.6.

Let the intermediate result at step nn for a total algorithm 𝒜\mathcal{A} with given input and oracle be the content of all the cells of the tape that have been read by the Turing machine. Since a Turing machine can read at most one cell at each step, the number of these cells cannot exceed nn. Therefore, the intermediate result at step nn can be encoded using nlogn\log\ell bits, where \ell is the cardinality of the alphabet of 𝒜\mathcal{A}. In particular, if a binary alphabet is used, the bitsize of the intermediate result never exceeds the total number of steps in the algorithm.

Proof.

We will describe an algorithm for computing StepsT,𝒜(,r)\operatorname{Steps}_{T,\mathcal{A}}(\ell,r). We fix TT, 𝒜\mathcal{A}, \ell, and rr. We will consider 𝒮\mathcal{S} of length at most rr and describe how to compute a bound for the number of steps given that the input is 𝒮\mathcal{S}. Taking the maximum over all 𝒮\mathcal{S} of length at most rr (there are finitely many of them), we obtain Steps𝒜,T(,r)\operatorname{Steps}_{\mathcal{A},T}(\ell,r).

The algorithm will compute φi:=φT,𝒜(,𝒮,i)\varphi_{i}:=\varphi_{T,\mathcal{A}}(\ell,\mathcal{S},i) for i=1,2,i=1,2,\ldots using the algorithm from Lemma 4.3. For each φi\varphi_{i}, the algorithm will check whether the formula is equivalent to True\mathrm{True} in TT using the decidability of TT.

If this is true, the algorithm stops and returns 𝒩T,𝒜(,𝒮,i)\mathcal{N}_{T,\mathcal{A}}(\ell,\mathcal{S},i) (see Lemma 4.3). It remains to show that the described procedure terminates in finitely many steps. Let MM be an 0\aleph_{0}-saturated model of TT (it exists, for example, due to [19, Theorem 4.3.12]). For every i=1,2,i=1,2,\ldots, we introduce a definable set

Ui:={𝐚Mφi(𝐚)=False}.U_{i}:=\{\mathbf{a}\in M^{\ell}\mid\varphi_{i}(\mathbf{a})=\mathrm{False}\}.

Notice that Ui=U_{i}=\varnothing if and only if (φiTrue)(\varphi_{i}\iff\mathrm{True}) in TT. Then the definition of φi\varphi_{i}’s implies that U1U2U_{1}\supset U_{2}\supset\ldots. Assume that i=1Ui\bigcap_{i=1}^{\infty}U_{i} is not empty and choose an element 𝐚\mathbf{a} in it. Then 𝒜\mathcal{A} will not terminate in finitely many steps with input 𝒮\mathcal{S} and oracle 𝒪M(𝐚)\mathcal{O}_{M}(\mathbf{a}). Thus, i=1Ui=\bigcap_{i=1}^{\infty}U_{i}=\varnothing. Lemma 4.4 implies that there exists NN such that UN=U_{N}=\varnothing. Then our algorithm will terminate after checking whether φN\varphi_{N} is equivalent to True. ∎

5. Applications to differential algebra

In this section, we will apply the results of Section 4 to the theory of differentially closed fields with several commuting derivations.

5.1. Preparation

Notation 5.1.

Let mm be a positive integer.

  • The language of partial differential rings with mm commuting derivation is denoted by m:={+,,,0,1,1,,m}\mathcal{L}_{m}:=\{+,-,\cdot,0,1,\partial_{1},\ldots,\partial_{m}\}. We add a separate functional symbol for subtraction for convenience.

  • The theory of partial differentially closed fields with mm commuting derivations of characteristic zero is denoted by DCFm\operatorname{DCF}_{m}. Recall that DCFm\operatorname{DCF}_{m} is complete [21, Corollary 3.1.9 ] and, with this, is decidable by [19, Lemma 2.2.8] and [21, Lemma 3.1.2 and page 890].

Notation 5.2.

Let m,n,hm,n,h be positive integers and kk a differential field with a set of mm commuting derivations Δ={1,,m}\Delta=\{\partial_{1},\ldots,\partial_{m}\}.

  • Polk(m,n,h)\operatorname{Pol}_{k}(m,n,h) denotes the space of all differential polynomials over kk in nn variables of order at most hh and degree at most hh.

  • The dimension of Polk(m,n,h)\operatorname{Pol}_{k}(m,n,h) (which does not depend on kk) will be denoted by PolDim(m,n,h)\operatorname{PolDim}(m,n,h).

Notation 5.3.

Let mm, \ell and nn be positive integers.

  • Let m(x1,,x){y1,,yn}Δ\mathcal{L}_{m}(x_{1},\ldots,x_{\ell})\{y_{1},\ldots,y_{n}\}_{\Delta} denote the ring of differential polynomials in differential variables y1,,yny_{1},\ldots,y_{n} with respect to mm derivations with the coefficients being terms in the language m\mathcal{L}_{m} in x1,,xx_{1},\ldots,x_{\ell} (that is, elements of {x1,,x}Δ)\mathbb{Z}\{x_{1},\ldots,x_{\ell}\}_{\Delta}).

    This is a computable differential ring with mm commuting derivations. In what follows, we will assume that the algorithms use dense representation to store these polynomials (that is, store all the coefficients up to certain order and certain degree).

  • Let kk be a differential field with mm derivations and 𝐚k\mathbf{a}\in k^{\ell}. Then, for Tm(x1,,x){y1,,yn}ΔT\in\mathcal{L}_{m}(x_{1},\ldots,x_{\ell})\{y_{1},\ldots,y_{n}\}_{\Delta}, we define T(𝐚)k{y1,,yn}ΔT(\mathbf{a})\in k\{y_{1},\ldots,y_{n}\}_{\Delta} to be the result of evaluating the coefficients of TT at 𝐚\mathbf{a}.

Definition 5.4.

A differential ranking for k{z1,,zn}Δk\{z_{1},\ldots,z_{n}\}_{\Delta} is a total order >> on Z:={θziθΘΔ, 1in}Z:=\{\theta z_{i}\mid\theta\in\Theta_{\Delta},\,1\leqslant i\leqslant n\} satisfying, for all ii, 1im1\leqslant i\leqslant m:

  • for all xZx\in Z, i(x)>x\partial_{i}(x)>x and

  • for all x,yZx,y\in Z, if x>yx>y, then i(x)>i(y)\partial_{i}(x)>\partial_{i}(y).

Notation 5.5.

For a Δ\Delta-field kk and fk{z1,,zn}Δ\kf\in k\{z_{1},\ldots,z_{n}\}_{\Delta}\backslash k and differential ranking >>,

  • lead(f)\operatorname{lead}(f) is the element of ZZ of the highest rank appearing in ff.

  • The leading coefficient of ff considered as a polynomial in lead(f)\operatorname{lead}(f) is denoted by in(f)\operatorname{in}(f) and called the initial of ff.

  • The separant of ff is flead(f)\frac{\partial f}{\partial\operatorname{lead}(f)}.

  • The rank of ff is rank(f)=lead(f)deglead(f)f\operatorname{rank}(f)=\operatorname{lead}(f)^{\deg_{\operatorname{lead}(f)}f}. The ranks are compared first with respect to lead\operatorname{lead}, and in the case of equality with respect to deg\deg.

  • For Sk{z1,,zn}Δ\kS\subset k\{z_{1},\ldots,z_{n}\}_{\Delta}\backslash k, the set of initials and separants of SS is denoted by HSH_{S}.

Remark 5.6 (Defining a ranking).

In general, there are uncountable many differential rankings already for m=2m=2 and n=1n=1. However, [25, Theorem 29] implies that any differential ranking can be defined by m(m+1)nm(m+1)n real numbers together with n2n^{2} integers not exceeding mm and one permutation on nn elements. We define a function RKm,n(𝜶,𝒮)\operatorname{RK}_{m,n}(\bm{\alpha},\mathcal{S}) taking as input a tuple 𝜶\bm{\alpha} of m(m+1)nm(m+1)n real numbers and a binary string 𝒮\mathcal{S} (of length at most (n2+n)log2(max(n,m))(n^{2}+n)\log_{2}(\max(n,m))) encoding the integers and the permutation and returning the corresponding binary predicate on the derivatives as in [25, Definition 28]. The relevant properties of this encoding for us will be that, for fixed 𝒮\mathcal{S}:

  1. (1)

    the statement that RKm,n(𝜶,𝒮)\operatorname{RK}_{m,n}(\bm{\alpha},\mathcal{S}) defines a ranking is a first-order formula in 𝜶\bm{\alpha} in the language of ordered fields;

  2. (2)

    for every two derivatives θ1zi\theta_{1}z_{i} and θ2zj\theta_{2}z_{j}, the fact that θ1zi<θ2zj\theta_{1}z_{i}<\theta_{2}z_{j} with respect to RKm,n(𝜶,𝒮)\operatorname{RK}_{m,n}(\bm{\alpha},\mathcal{S}) is also a first-order formula in 𝜶\bm{\alpha} in the language of ordered fields.

Definition 5.7 (Characteristic sets).
  • For f,gk{z1,,zn}Δ\kf,g\in k\{z_{1},\ldots,z_{n}\}_{\Delta}\backslash k, ff is said to be reduced w.r.t. gg if no proper derivative of lead(g)\operatorname{lead}(g) appears in ff and deglead(g)f<deglead(g)g\deg_{\operatorname{lead}(g)}f<\deg_{\operatorname{lead}(g)}g.

  • A subset 𝒜k{z1,,zn}Δ\k\mathcal{A}\subset k\{z_{1},\ldots,z_{n}\}_{\Delta}\backslash k is called autoreduced if, for all p𝒜p\in\mathcal{A}, pp is reduced w.r.t. every element of 𝒜{p}\mathcal{A}\setminus\{p\}. One can show that every autoreduced set is finite [13, Section I.9].

  • Let 𝒜=A1<<Ar\mathcal{A}=A_{1}<\ldots<A_{r} and =B1<<Bs\mathcal{B}=B_{1}<\ldots<B_{s} be autoreduced sets ordered by their ranks (see Notation 5.5). We say that 𝒜<\mathcal{A}<\mathcal{B} if

    • r>sr>s and rank(Ai)=rank(Bi)\operatorname{rank}(A_{i})=\operatorname{rank}(B_{i}), 1is1\leqslant i\leqslant s, or

    • there exists qq such that rank(Aq)<rank(Bq)\operatorname{rank}(A_{q})<\operatorname{rank}(B_{q}) and, for all ii, 1i<q1\leqslant i<q, rank(Ai)=rank(Bi)\operatorname{rank}(A_{i})=\operatorname{rank}(B_{i}).

  • An autoreduced subset of the smallest rank of a differential ideal Ik{z1,,zn}ΔI\subset k\{z_{1},\ldots,z_{n}\}_{\Delta} is called a characteristic set of II. One can show that every non-zero differential ideal in k{z1,,zn}Δk\{z_{1},\ldots,z_{n}\}_{\Delta} has a characteristic set.

  • A radical differential ideal II of k{z1,,zn}Δk\{z_{1},\ldots,z_{n}\}_{\Delta} is said to be characterizable if II has a characteristic set CC such that I=C():HCI=\langle C\rangle^{(\infty)}:H_{C}^{\infty}.

The Rosenfeld-Gröbner algorithm [3, Theorem 9] takes as input a finite set FF of differential polynomials and a differential ranking and outputs autoreduced sets 𝒞1,,𝒞N\mathcal{C}_{1},\ldots,\mathcal{C}_{N} such that

(1) F()=i=1N𝒞i():H𝒞i\sqrt{\langle F\rangle^{(\infty)}}=\bigcap_{i=1}^{N}\langle\mathcal{C}_{i}\rangle^{(\infty)}:H_{\mathcal{C}_{i}}^{\infty}

and that, for each ii, 1iN1\leqslant i\leqslant N, 𝒞i\mathcal{C}_{i} is a characteristic set of 𝒞i():H𝒞i\langle\mathcal{C}_{i}\rangle^{(\infty)}:H_{\mathcal{C}_{i}}^{\infty}. The representation (1) can be used, for example, for membership testing, estimating the number of irreducible components (used in Theorem 5.13) or the Kolchin polynomial (used in Section 6) of a differential-algebraic variety.

With the next Proposition 5.9 we express how we will call the Rosenfeld-Gröbner algorithm. This algorithm depends on the choice of a differential ranking. The reader may wish to make one such choice once and for all, thereby ignoring the potential ambiguity. However, since such a choice may affect the size of the output and the efficiency of any given implementation of the algorithm, one may prefer to allow for these other orderings.

We will express this dependence by seeing the algorithm as a total algorithm relative to the two-sorted theory DCFmRCF\operatorname{DCF}_{m}\oplus\operatorname{RCF} which is a disjoint union of DCFm\operatorname{DCF}_{m} and the complete decidable theory with quantifier elimination of real closed fields RCF\operatorname{RCF} [19, Theorem 3.3.15 and Corollary 3.3.16]. Then we will use the characterization of differential rankings via real numbers from Remark 5.6.

Lemma 5.8.

Theory DCFmRCF\operatorname{DCF}_{m}\oplus\operatorname{RCF} is decidable and complete.

Proof.

In order to prove the completeness and decidability, we will prove that there is an algorithm for quantifier elimination in DCFmRCF\operatorname{DCF}_{m}\oplus\operatorname{RCF} based on the existence of such algorithms for DCFm\operatorname{DCF}_{m} (follows from decidability, see Notation 5.1, and quantifier elimination [21, Theorem 3.1.7]) and RCF\operatorname{RCF}. It is sufficient to perform quantifier elimination for a formula of the form

xS:L1LN,\exists x\in S\colon L_{1}\wedge\ldots\wedge L_{N},

where SS is one of the sorts (corresponding to DCFm\operatorname{DCF}_{m} or RCF\operatorname{RCF}) and L1,,LNL_{1},\ldots,L_{N} are literals. (See [19, Lemma 3.1.5].) By reordering L1,,LNL_{1},\ldots,L_{N} if necessary, we will further assume that there exists N0N_{0} such that L1,,LN0L_{1},\ldots,L_{N_{0}} are in the signature of the sort SS and LN0+1,,LNL_{N_{0}+1},\ldots,L_{N} are in the signature of the other sort. Then

(xS:L1LN)(xS:L1LN0)(LN0+1LN),\left(\exists x\in S\colon L_{1}\wedge\ldots L_{N}\right)\iff\left(\exists x\in S\colon L_{1}\wedge\ldots\wedge L_{N_{0}}\right)\wedge\left(L_{N_{0}+1}\wedge\ldots\wedge L_{N}\right),

and, for xS:L1LN0\exists x\in S\colon L_{1}\wedge\ldots\wedge L_{N_{0}}, the algorithm for the corresponding sort SS can compute an equivalent quantifier-free formula.

The resulting theory is decidable because the correctness of each sentence can be checked by performing quantifier elimination after which the formula will become just true/false. ∎

Proposition 5.9.

There is a computable function that, for a given positive integer mm, computes a total algorithm 𝒢m,\mathcal{RG}_{m,} over DCFmRCF\operatorname{DCF}_{m}\oplus\operatorname{RCF} such that, for every differential field kk with mm derivations and 𝐚k\mathbf{a}\in k^{\ell} and any 𝐛s\mathbf{b}\in\mathbb{R}^{s}, the input-output specification of 𝒢m\mathcal{RG}_{m} with oracle 𝒪k(𝐚,𝐛)\mathcal{O}_{k\oplus\mathbb{R}}(\mathbf{a},\mathbf{b}) is the following:

Input:

finite subsets AA and SS of m(x1,,x){y1,,yn}Δ\mathcal{L}_{m}(x_{1},\ldots,x_{\ell})\{y_{1},\ldots,y_{n}\}_{\Delta} and a binary string 𝒮\mathcal{S};

Output:

if RKm,n(𝐛,𝒮)\operatorname{RK}_{m,n}(\mathbf{b},\mathcal{S}) (see Remark 5.6) defines a differential ranking, return a list of tuples C1,,CNC_{1},\ldots,C_{N} from m(x1,,x){y1,,yn}Δ\mathcal{L}_{m}(x_{1},\ldots,x_{\ell})\{y_{1},\ldots,y_{n}\}_{\Delta} such that

C1(𝐚),,CN(𝐚)C_{1}(\mathbf{a}),\ldots,C_{N}(\mathbf{a})

is the output of the Rosenfeld-Gröbner algorithm [3, Theorem 9] with input (A(𝐚),S(𝐚))(A(\mathbf{a}),S(\mathbf{a})) with respect to the ranking RKm,n(𝐛,𝒮)\operatorname{RK}_{m,n}(\mathbf{b},\mathcal{S}). Otherwise, return \varnothing.

Proof.

[3, Theorem 9] states that the only operations performed by the Rosenfeld-Gröbner algorithm with the elements of the ground differential field are arithmetic operations, differentiation, and zero testing. Algorithm 𝒢m\mathcal{RG}_{m} is constructed to work exactly in the same way as the Rosenfeld-Gröbner algorithm with the only difference that the elements of the ground differential field will be represented as L(𝐚)L(\mathbf{a}), where Lm(x1,,x){y1,,yn}ΔL\in\mathcal{L}_{m}(x_{1},\ldots,x_{\ell})\{y_{1},\ldots,y_{n}\}_{\Delta}. The arithmetic operations and differentiations can be performed with LL, zero testing can be performed using the kk-component of the oracle, and the queries to the ranking can be performed using the \mathbb{R}-component of the oracle, so 𝒢\mathcal{RG} will be able to perform the same computations as the Rosenfeld-Gröbner algorithm.

Due to [3, Theorem 5], the Rosenfeld-Gröbner algorithm is guaranteed to terminate on every input. Hence, the same is true for 𝒢m\mathcal{RG}_{m}. ∎

5.2. Bounds

Theorem 5.10 (Upper bound for Rosenfeld-Gröbner algorithm).

There exists a computable function RG(m,n,)\operatorname{RG}(m,n,\ell) such that, for every differential field kk with mm derivations and subsets A,SPolk(m,n,n)A,S\subset\operatorname{Pol}_{k}(m,n,n) with |A|,|S||A|,|S|\leqslant\ell, and every differential ranking, the Rosenfeld-Gröbner algorithm [3, Theorem 9] on AA and SS will produce at most RG(m,n,)\operatorname{RG}(m,n,\ell) components with all the orders and degrees of the differential polynomials occurring in the algorithm not exceeding RG(m,n,)\operatorname{RG}(m,n,\ell).

Proof.

We fix integers mm, nn, and \ell and compute the total algorithm 𝒢m\mathcal{RG}_{m} over DCFmRCF\operatorname{DCF}_{m}\oplus\operatorname{RCF} from Proposition 5.9. Let 𝐚\mathbf{a} be the set of all the coefficients of AA and SS. Then |𝐚|N:=2PolDim(m,n,n)|\mathbf{a}|\leqslant N:=2\ell\operatorname{PolDim}(m,n,n). The sets AA and SS can be presented as evaluations of subsets A~,S~m(x1,,xN){y1,,yn}Δ\widetilde{A},\widetilde{S}\subset\mathcal{L}_{m}(x_{1},\ldots,x_{N})\{y_{1},\ldots,y_{n}\}_{\Delta} at 𝐚\mathbf{a} such that the orders and degrees of A~,S~\widetilde{A},\widetilde{S} in y1,,yny_{1},\ldots,y_{n} do not exceed nn and every coefficient is a single variable xix_{i}. Let the ranking be defined as RK(𝐛,𝒮)\operatorname{RK}(\mathbf{b},\mathcal{S}) (see Remark 5.6), where 𝐛\mathbf{b} is a tuple of m(m+1)nm(m+1)n real numbers and 𝒮\mathcal{S} is a binary string of length at most (n2+n)log2max(n,m)(n^{2}+n)\log_{2}\max(n,m). Then the the tuple (A~,S~,𝒮)(\widetilde{A},\widetilde{S},\mathcal{S}) can be encoded as a binary string of the length bounded by a computable function S(m,n,N)S(m,n,N).

We run 𝒢m\mathcal{RG}_{m} with the input =(A~,S~,𝒮)\mathcal{I}=(\widetilde{A},\widetilde{S},\mathcal{S}) and oracle 𝒪(𝐚,𝐛)\mathcal{O}(\mathbf{a},\mathbf{b}). Theorem 4.5 implies that the number of steps and, consequently, the bitsize of of all the intermediate results (see Remark 4.6) will not exceed Steps𝒢m,DCFmRCF(N,S(m,n,N))\operatorname{Steps}_{\mathcal{RG}_{m},\operatorname{DCF}_{m}\oplus\operatorname{RCF}}(N,S(m,n,N)).

Since each component takes at least one bit, a polynomial of degree dd or order dd has at least dd coefficients (due to the dense representation of the polynomials, see Notation 5.2) requiring at least one bit each, the number of components, the degrees and orders do not exceed the bitsize of the intermediate results. Therefore, we can set RG(m,n,)=Steps𝒢m,DCFmRCF(N,S(m,n,N))\operatorname{RG}(m,n,\ell)=\operatorname{Steps}_{\mathcal{RG}_{m},\operatorname{DCF}_{m}\oplus\operatorname{RCF}}(N,S(m,n,N)). ∎

Corollary 5.11.

There exists a computable function CharSet(m,n,)\operatorname{CharSet}(m,n,\ell) such that, for every computable differential field kk with mm derivations and subsets A,SPolk(m,n,n)A,S\subset\operatorname{Pol}_{k}(m,n,n) with |A|,|S||A|,|S|\leqslant\ell, and every differential ranking, the ideal A():S\sqrt{\langle A\rangle^{(\infty)}\colon S^{\infty}} can be written as an intersection of at most CharSet(m,n,)\operatorname{CharSet}(m,n,\ell) characterizable differential ideals defined by their characteristic sets with respect to the ranking of order and degree not exceeding CharSet(m,n,)\operatorname{CharSet}(m,n,\ell).

Proof.

Theorem 5.10 implies that there exists a representation

A():S=(C1():HC1)(CN():HCN),\sqrt{\langle A\rangle^{(\infty)}\colon S^{\infty}}=(\langle C_{1}\rangle^{(\infty)}\colon H_{C_{1}})\cap\ldots\cap(\langle C_{N}\rangle^{(\infty)}\colon H_{C_{N}}),

where HCiH_{C_{i}} is the product of the initials and separants of CiC_{i}, and CiC_{i} is the characteristic presentation [3, Definition 8] of Ci():HCi\langle C_{i}\rangle^{(\infty)}\colon H_{C_{i}}^{\infty} for every 1iN1\leqslant i\leqslant N. As noted in [3, p. 108] a characteristic set of Ci():HCi\langle C_{i}\rangle^{(\infty)}\colon H_{C_{i}}^{\infty} can be obtained from CiC_{i} by performing reductions until it will become autoreduced. Since differential reduction is a part of the Rosenfeld-Gröbner algorithm, it can also be performed by a total algorithm over DCFmRCF\operatorname{DCF}_{m}\oplus\operatorname{RCF}. Therefore, as in the proof of Theorem 5.10, Lemma 4.5 implies that Ci():HCi\langle C_{i}\rangle^{(\infty)}\colon H_{C_{i}}^{\infty} has a characteristic set with degrees and order bounded by a computable function of the degrees and orders of CiC_{i}. The latter are bounded by a computable function RG\operatorname{RG} due to Theorem 5.10. Composing these two bounds, we obtain a desired function CharSet(m,n,)\operatorname{CharSet}(m,n,\ell). ∎

Lemma 5.12.

There exists a computable function PrimeComp(m,n)\operatorname{PrimeComp}(m,n) such that for every partial differential field kk with mm derivations, every ranking, and every characterizable differential ideal II defined by a characteristic set CPolk(m,n,n)C\subset\operatorname{Pol}_{k}(m,n,n) with respect to this ranking, we have

  1. (1)

    the number of prime components of II does not exceed PrimeComp(m,n)\operatorname{PrimeComp}(m,n);

  2. (2)

    every prime component of II has a characteristic set with respect to the ranking with orders and degrees bounded by PrimeComp(m,n)\operatorname{PrimeComp}(m,n).

Proof.

Let HH be the product of the initials and separants of CC. [3, Theorem 4] implies that the number of prime components of C():H\langle C\rangle^{(\infty)}\colon H^{\infty} is equal to the number of prime components of the algebraic ideal (C():H)Rn(\langle C\rangle^{(\infty)}\colon H^{\infty})\cap R_{n}, where RnR_{n} is the ring of differential polynomials of order at most nn. Since the degrees of elements of CC are bounded by nn, the Bézout inequality implies that there is a computable bound DD for the degree of the radical ideal IRnI\cap R_{n} (defined, e.g., as the degree of the corresponding affine variety [12, page 246]) in terms of mm and nn, so this gives a bound for the number of components.

Let P1,,PP_{1},\ldots,P_{\ell} be the prime components of II. For every 1i1\leqslant i\leqslant\ell, PiRnP_{i}\cap R_{n} is a prime algebraic ideal, and its zero set can be defined by equations of degree at most deg(PiRn)\deg(P_{i}\cap R_{n}) due to [12, Proposition 3]. Therefore, for each 2i2\leqslant i\leqslant\ell, we can choose a polynomial in (P1Pi)Rn(P_{1}\setminus P_{i})\cap R_{n} of degree at most deg(PiRn)\deg(P_{i}\cap R_{n}). Their product QQ has degree at most deg(IRn)D\deg(I\cap R_{n})\leqslant D. Observe that

P1=P1:QI:Q=(P1:Q)(P:Q)=P1.P_{1}=P_{1}\colon Q^{\infty}\subset I\colon Q^{\infty}=(P_{1}\colon Q^{\infty})\cap\ldots\cap(P_{\ell}\colon Q^{\infty})=P_{1}.

Thus, applying Corollary 5.11 to a pair (C,HQ)(C,HQ) and using that |C|PolDim(m,n)|C|\leqslant\operatorname{PolDim}(m,n), we show that P1P_{1} has a characteristic set with orders and degrees bounded by CharSet(m,D+n,PolDim(m,n))\operatorname{CharSet}(m,D+n,\operatorname{PolDim}(m,n)). ∎

Theorem 5.13 (Upper bound for the components of a differential variety and their number).

There exists a computable function Components(m,n)\operatorname{Components}(m,n) such that, for all non-negative integers mm, nn and hh and a partial differential field kk with mm derivations and finite set FPolk(m,n,h)F\subset\operatorname{Pol}_{k}(m,n,h):

  1. (1)

    the number of components in the variety defined by F=0F=0 does not exceed Components(m,max{n,h})\operatorname{Components}(m,\max\{n,h\});

  2. (2)

    for every differential ranking and every component XX of the variety F=0F=0, XX has a characteristic set with respect to the ranking with orders and degrees bounded by Components(m,max{n,h})\operatorname{Components}(m,\max\{n,h\}).

Proof.

Consider any differential ranking. By replacing FF with the basis of its linear span, we will further assume that |F|PolDim(m,n,h)|F|\leqslant\operatorname{PolDim}(m,n,h) (see Notation 5.2). Corollary 5.11 implies that F()\sqrt{\langle F\rangle^{(\infty)}} can be represented as an intersection of at most NN characterizable ideals with characteristic sets C1,,CNC_{1},\ldots,C_{N} of order and degree at most NN, where

N:=CharSet(m,max{n,h},PolDim(m,n,h)).N:=\operatorname{CharSet}(m,\max\{n,h\},\operatorname{PolDim}(m,n,h)).

Lemma 5.12 applied to each of C1,,CNC_{1},\ldots,C_{N} implies that the number of components of the variety defined by F=0F=0 does not exceed NPrimeComp(m,N)N\cdot\operatorname{PrimeComp}(m,N), and each of them has a characteristic set with orders and degrees not exceeding PrimeComp(m,N)\operatorname{PrimeComp}(m,N). ∎

Remark 5.14.

It was shown in [11, Theorem 6.1] that there exists a (not necessarily computable) bound for the degrees and orders a characteristic set of a prime differential ideal. The second part of Theorem 5.13 implies that there is a computable bound.

6. Application to delay PDEs

In this section, we will show how Theorem 5.13 applies to the problem of elimination of unknowns in delay PDEs.

The proof of the main result of this section, Theorem 6.23 (Effective elimination theorem for delay PDEs) inherited from [18] had only two missing ingredients closely related to each other: the bound on the number of components of the variety defined by a system of differential algebraic PDEs and bounds on the coefficients of Kolchin polynomials under projection in the PDE case. Now that we have obtained the former in Theorem 5.13 together with a bound for characteristic sets, it is possible to obtain the latter in Lemma 6.6 and finish the proof. Therefore, Section 6 can be thought of as a motivation for the rest of the paper and is an interesting example of a problem from differential-difference algebra that motivated a purely differential algebraic result.

6.1. Bounds for Kolchin polynomials for algebraic PDEs

Definition 6.1.

Let KK be a differentially closed Δ\Delta-field containing a Δ\Delta-field kk. We say that XKnX\subset K^{n} is a Δ\Delta-variety over kk if there exists Fk{y1,,yn}ΔF\subset k\{y_{1},\ldots,y_{n}\}_{\Delta} such that

X={𝒂KnfFf(𝒂)=0}.X=\{\bm{a}\in K^{n}\mid\forall\,f\in F\ f(\bm{a})=0\}.

We write X=𝕍(F)X=\mathbb{V}(F). The Δ\Delta-variety KnK^{n} is denoted by 𝔸n\mathbb{A}^{n}. A Δ\Delta-variety 𝕍(F)\mathbb{V}(F) is called irreducible if the differential ideal F()\sqrt{\langle F\rangle^{(\infty)}} is prime.

For a subset YKnY\subset K^{n}, the smallest Δ\Delta-variety XKnX\subset K^{n} containing YY is called the Kolchin closure of YY and denoted by Y¯Kol\overline{Y}^{\operatorname{Kol}}.

Definition 6.2.

We will say that a Δ\Delta-variety X𝔸nX\subset\mathbb{A}^{n} is bounded by NN if Nmax(n,m)N\geqslant\max(n,m) (m=|Δ|m=|\Delta|) and XX can be defined by equations of order and degree at most NN.

Notation 6.3.

For a numeric polynomial ω(t)=i=0mai(t+ii)\omega(t)=\sum\limits_{i=0}^{m}a_{i}\binom{t+i}{i}, we set

|ω|:=i=0m|ai|.|\omega|:=\sum\limits_{i=0}^{m}|a_{i}|.
Definition 6.4.

The generic point (a1,,an)(a_{1},\ldots,a_{n}) of an irreducible Δ\Delta-variety X=𝕍(F)X=\mathbb{V}(F), where Fk{𝒚}ΔF\subset k\{\bm{y}\}_{\Delta}, is the image of 𝒚\bm{y} under the homomorphism K{𝒚}ΔK{𝒚}Δ/F()K\{\bm{y}\}_{\Delta}\to K\{\bm{y}\}_{\Delta}\big{/}\sqrt{\langle F\rangle^{(\infty)}}.

Definition 6.5.

The Kolchin polynomial of an irreducible Δ\Delta-variety V=𝕍(F)V=\mathbb{V}(F), where Fk{𝒚}ΔF\subset k\{\bm{y}\}_{\Delta}, is the unique numerical polynomial ωV(t)\omega_{V}(t) such that there exists t00t_{0}\geqslant 0 such that, for all tt0t\geqslant t_{0} and the generic point 𝒂\bm{a} of VV, ωV(t)=trdegk(𝒂t)/k\omega_{V}(t)=\operatorname{trdeg}k(\bm{a}_{t})/k, where 𝒂t=(θ(𝒂):θΘΔ(t))\bm{a}_{t}=\big{(}\theta(\bm{a}):\theta\in\Theta_{\Delta}(t)\big{)}. For the proof of the existence, see [22, Theorem 5.4.1].

Lemma 6.6.

There exists a computable function KolchinProj(N)\operatorname{KolchinProj}(N) such that for every

  • differential variety X𝔸nX\subset\mathbb{A}^{n} bounded by NN,

  • irreducible component X0XX_{0}\subset X,

  • and linear projection π:𝔸n𝔸\pi\colon\mathbb{A}^{n}\to\mathbb{A}^{\ell},

we have |ωY|KolchinProj(N)|\omega_{Y}|\leqslant\operatorname{KolchinProj}(N), where Y:=π(X0)¯KolY:=\overline{\pi(X_{0})}^{\operatorname{Kol}}.

Proof.

By performing a linear change of variables, we reduce the problem to the case in which π\pi is the projection to the first \ell coordinates. Consider a ranking such that

  • y+iy_{\ell+i} is greater than every derivative of yjy_{j} for every i>0i>0 and 1j1\leqslant j\leqslant\ell;

  • the restriction of the ranking on y1,,yy_{1},\ldots,y_{\ell} is an orderly ranking (that is, a ranking such that ordθ1>ordθ2\operatorname{ord}\theta_{1}>\operatorname{ord}\theta_{2} implies, for all ii and jj, θ1yi>θ2yj\theta_{1}y_{i}>\theta_{2}y_{j}).

Theorem 5.13 implies that X0X_{0} has a characteristic set 𝒞\mathcal{C} with respect to this ranking with the order bounded by a computable function of NN. Since a characteristic set of YY can be obtained from 𝒞\mathcal{C} by selecting the polynomials only in the first \ell variables, there is a charactersitic set of YY with respect to the orderly ranking with the order bounded by a computable function of NN. Then [15, Proposition 3.1] and [15, Fact 2.1] imply that |ωY||\omega_{Y}| is bounded by a computable function of NN. ∎

Proposition 6.7.

There exists an algorithm that, for every computable function g(n):00g(n)\colon\mathbb{Z}_{\geqslant 0}\to\mathbb{Z}_{\geqslant 0}, produces a number Leng\operatorname{Len}_{g} such that, for every sequence of Kolchin polynomials

ω0>ω1>>ω\omega_{0}>\omega_{1}>\ldots>\omega_{\ell}

such that |ωi|<g(i)|\omega_{i}|<g(i) for every 0i0\leqslant i\leqslant\ell, we have <Leng\ell<\operatorname{Len}_{g}.

Proof.

By replacing g(n)g(n) with n+max0sng(s)n+\max\limits_{0\leqslant s\leqslant n}g(s), we can further assume that g(n)g(n) is increasing and g(n)ng(n)\geqslant n. [22, Definition 2.4.9 and Lemma 2.4.12] define a computable order-preserving map cc from the set of all Kolchin polynomials 𝒦\mathcal{K} to 0m+1\mathbb{Z}_{\geqslant 0}^{m+1} (considered with respect to the lexicographic ordering). For v=(v0,,vm)0m+1v=(v_{0},\ldots,v_{m})\in\mathbb{Z}_{\geqslant 0}^{m+1}, we define |v|=v0++vm|v|=v_{0}+\ldots+v_{m}. For every function g:00g\colon\mathbb{Z}_{\geqslant 0}\to\mathbb{Z}_{\geqslant 0}, we define

g~(n):=maxω𝒦,|ω|g(n)|c(ω)|.\widetilde{g}(n):=\max\limits_{\omega\in\mathcal{K},\;|\omega|\leqslant g(n)}|c(\omega)|.

Note that if g(n)g(n) was computable, then g~(n)\widetilde{g}(n) is also computable.

The sequence ω0>ω1>\omega_{0}>\omega_{1}>\ldots gives rise to a sequence c(ω0)>lexc(ω1)>lexc(\omega_{0})>_{\operatorname{lex}}c(\omega_{1})>_{\operatorname{lex}}\ldots in 0m+1\mathbb{Z}_{\geqslant 0}^{m+1} with |c(ωi)|g~(i)|c(\omega_{i})|\leqslant\widetilde{g}(i) for every ii. [20, Main Lemma] implies that there is an algorithm to compute the maximal length of such a sequence, so there is an algorithm to compute a bound on \ell from gg. ∎

6.2. Trains of varieties, partial solutions, and their upper bounds

Lemma 6.8.

For every Δ\Delta-σ\sigma-field kk of characteristic zero, there exists an extension kKk\subset K of Δ\Delta-σ\sigma-fields, where KK is a differentially closed Δ\Delta-σ\sigma^{*}-field.

Proof.

The proof follows [18, Lemma 6.1] mutatis mutandis as follows. We will show that there exists a Δ\Delta-σ\sigma^{\ast}-field K0K_{0} containing kk. The proof of [17, Proposition 2.1.7] implies that one can build an ascending chain of σ\sigma-fields

(2) k0k1k2k_{0}\subset k_{1}\subset k_{2}\subset\ldots

such that, for every ii\in\mathbb{N}, there exists an isomorphism φi:kki\varphi_{i}\colon k\to k_{i} of σ\sigma-fields, σ(ki+1)=ki\sigma(k_{i+1})=k_{i}, and φi=σφi+1\varphi_{i}=\sigma\circ\varphi_{i+1} for every ii\in\mathbb{N}. We transfer the Δ\Delta-σ\sigma-structure from kk to kik_{i}’s via φi\varphi_{i}’s. Then φi=σφi+1\varphi_{i}=\sigma\circ\varphi_{i+1} implies that the restriction of Δ\Delta on ki+1k_{i+1} to kik_{i} coincides with the action of Δ\Delta on kik_{i}. We set K0:=ikiK_{0}:=\bigcup\limits_{i\in\mathbb{N}}k_{i}. Since the action Δ\Delta and σ\sigma is consistent with the ascending chain (2), K0K_{0} is a Δ\Delta-σ\sigma-extension of k0kk_{0}\cong k. It is shown in [17, Proposition 2.1.7] that the action of σ\sigma on K0K_{0} is surjective. [14, Corollary 2.4] implies that K0K_{0} can be embedded in a differentially closed Δ\Delta-σ\sigma^{\ast}-field KK. ∎

Notation 6.9.

Within Sections 6.2 and 6.3, we fix a ground Δ\Delta-σ\sigma field kk and a differentially closed Δ\Delta-σ\sigma^{\ast}-field KK given by Lemma 6.8 applied to kk. All varieties in Sections 6.2 and 6.3 are considered over KK.

Definition 6.10 (Partial solutions).
  • For Δ\Delta-σ\sigma-rings 1\mathcal{R}_{1} and 2\mathcal{R}_{2}, a homomorphism ϕ:12\phi:\mathcal{R}_{1}\longrightarrow\mathcal{R}_{2} is called a Δ\Delta-σ\sigma-homomorphism if, for all ii, ϕi=iϕ\phi\partial_{i}=\partial_{i}\phi and ϕσ=σϕ\phi\sigma=\sigma\phi.

  • Let \mathcal{R} be a Δ\Delta-σ\sigma-ring containing a Δ\Delta-σ\sigma-field kk. Let k[𝒚]k[\bm{y}_{\infty}] be the Δ\Delta-σ\sigma-polynomial ring over kk in 𝒚=y1,,yr\bm{y}=y_{1},\ldots,y_{r}. Given a point 𝒂=(a1,,ar)r\bm{a}=(a_{1},\dots,a_{r})\in\mathcal{R}^{r}, there exists a unique Δ\Delta-σ\sigma-homomorphism over kk,

    ϕ𝒂:k[𝒚]withϕ𝒂(yi)=aiandϕ𝒂|k=id.\phi_{\bm{a}}:k[\bm{y}_{\infty}]\longrightarrow\mathcal{R}\quad\text{with}\ \ \phi_{\bm{a}}(y_{i})=a_{i}\ \text{and}\ \phi_{\bm{a}}|_{k}=\operatorname{id}.

    Given fk[𝒚]f\in k[\bm{y}_{\infty}], 𝒂\bm{a} is called a solution of ff in \mathcal{R} if fKer(ϕ𝒂)f\in\operatorname{Ker}(\phi_{\bm{a}}).

  • For a Δ\Delta-σ\sigma-kk-algebra \mathcal{R} and I=I=\mathbb{N} or \mathbb{Z}, the sequence ring I\mathcal{R}^{I} has the following structure of a Δ\Delta-σ\sigma-ring (Δ\Delta-σ\sigma^{\ast}-ring for I=I=\mathbb{Z}) with σ\sigma and Δ\Delta defined by

    σ((xi)iI):=(xi+1)iIandj((xi)iI):=(j(xi))iI.\sigma\big{(}(x_{i})_{i\in I}\big{)}:=(x_{i+1})_{i\in I}\quad\text{and}\quad\partial_{j}\big{(}(x_{i})_{i\in I}\big{)}:=(\partial_{j}(x_{i}))_{i\in I}.

    For a kk-Δ\Delta-σ\sigma-algebra \mathcal{R}, I\mathcal{R}^{I} can be considered a kk-Δ\Delta-σ\sigma-algebra by embedding kk into I\mathcal{R}^{I} in the following way:

    a(σi(a))iI,ak.a\mapsto(\sigma^{i}(a))_{i\in I},\ \ a\in k.

    For fk[𝒚]f\in k[\bm{y}_{\infty}], a solution of ff with components in I\mathcal{R}^{I} is called a sequence solution of ff in \mathcal{R}.

  • Given f[𝒚]f\in\mathcal{R}[\bm{y}_{\infty}], the order of ff is defined to be the maximal ordθ+j\operatorname{ord}\theta+j such that θσjys\theta\sigma^{j}y_{s} effectively appears in ff for some ss, denoted by ord(f)\operatorname{ord}(f).

  • The relative order of ff with respect to Δ\Delta (resp. σ\sigma), denoted by ordΔ(f)\operatorname{ord}_{\Delta}(f) (resp. ordσ(f)\operatorname{ord}_{\sigma}(f)), is defined as the maximal ordθ\operatorname{ord}\theta (resp. jj) such that θσjys\theta\sigma^{j}y_{s} effectively appears in ff for some ss.

  • Let F={f1,,fN}k[𝒚]F=\{f_{1},\ldots,f_{N}\}\subset k[\bm{y}_{\infty}], where 𝒚=y1,,yr\bm{y}=y_{1},\ldots,y_{r}, be a set of Δ\Delta-σ\sigma-polynomials. Suppose h=max{ordσ(f)fF}h=\max\{\operatorname{ord}_{\sigma}(f)\mid f\in F\}. A sequence of tuples (a¯1,,a¯r)K+h××K+h(\overline{a}_{1},\ldots,\overline{a}_{r})\in K^{\ell+h}\times\cdots\times K^{\ell+h} is called a partial solution of FF of length \ell if (a¯1,,a¯r)(\overline{a}_{1},\ldots,\overline{a}_{r}) is a Δ\Delta-solution of the system in 𝒚,+h1\bm{y}_{\infty,\ell+h-1}:

    {σi(F)=0| 0i1},\{\sigma^{i}(F)=0\;|\;0\leqslant i\leqslant\ell-1\},

    where 𝒚,+h1={θσiysθΘΔ; 0i+h1; 1sr}\bm{y}_{\infty,\ell+h-1}=\{\theta\sigma^{i}y_{s}\mid\theta\in\Theta_{\Delta};\,0\leqslant i\leqslant\ell+h-1;\,1\leqslant s\leqslant r\}.

We associate the following geometric data with the above set FF of Δ\Delta-σ\sigma-polynomials:

  • the Δ\Delta-variety X𝔸HX\subset\mathbb{A}^{H} defined by f1=0,,fN=0f_{1}=0,\ldots,f_{N}=0 regarded as Δ\Delta-equations in k[𝒚,h]k[\bm{y}_{\infty,h}] with H=r(h+1)H={r}(h+1), and

  • two projections π1,π2:𝔸H𝔸Hr\pi_{1},\pi_{2}:\mathbb{A}^{H}\longrightarrow\mathbb{A}^{H-r} defined by

    π1(a1,,σh(a1);;\displaystyle\pi_{1}(a_{1},\ldots,\sigma^{h}(a_{1});\ldots; ar,,σh(ar))\displaystyle a_{r},\ldots,\sigma^{h}(a_{r}))
    :=(a1,σ(a1),,σh1(a1);;ar,,σh1(ar)),\displaystyle:=(a_{1},\sigma(a_{1}),\ldots,\sigma^{h-1}(a_{1});\ldots;a_{r},\ldots,\sigma^{h-1}(a_{r})),
    π2(a1,,σh(a1);;\displaystyle\pi_{2}(a_{1},\ldots,\sigma^{h}(a_{1});\ldots; ar,,σh(ar))\displaystyle a_{r},\ldots,\sigma^{h}(a_{r}))
    :=(σ(a1),,σh(a1);;σ(ar),,σh(ar)).\displaystyle:=(\sigma(a_{1}),\ldots,\sigma^{h}(a_{1});\ldots;\sigma(a_{r}),\ldots,\sigma^{h}(a_{r})).

Let σ(X)\sigma(X) denote the Δ\Delta-variety in 𝔸H\mathbb{A}^{H} defined by f1σ,,fNσf_{1}^{\sigma},\ldots,f_{N}^{\sigma}, where fiσf_{i}^{\sigma} is the result by applying σ\sigma to the coefficients of fif_{i}.

Definition 6.11.

A sequence p1,,p𝔸Hp_{1},\ldots,p_{\ell}\in\mathbb{A}^{H} is a partial solution of the triple (X,π1,π2)(X,\pi_{1},\pi_{2}) if

  1. (1)

    for all ii, 1i1\leqslant i\leqslant\ell, we have piσi1(X)p_{i}\in\sigma^{i-1}(X) and

  2. (2)

    for all ii, 1i<1\leqslant i<\ell, we have π1(pi+1)=π2(pi)\pi_{1}(p_{i+1})=\pi_{2}(p_{i}).

A two-sided infinite sequence with such a property is called a solution of the triple (X,π1,π2)(X,\pi_{1},\pi_{2}).

Lemma 6.12.

For every positive integer \ell, FF has a partial solution of length \ell if and only if the triple (X,π1,π2)(X,\pi_{1},\pi_{2}) has a partial solution of length .\ell. The system FF has a sequence solution in KK^{\mathbb{Z}} if and only if the triple (X,π1,π2)(X,\pi_{1},\pi_{2}) has a solution.

Proof.

As in [18, Lemma 6.5]. ∎

Definition 6.13.

For \ell\in\mathbb{N} or ++\infty, a sequence of irreducible Δ\Delta-subvarieties (Y1,,Y)(Y_{1},\ldots,Y_{\ell}) in 𝔸H\mathbb{A}^{H} is said to be a train of length \ell in XX if

  1. (1)

    for all ii, 1i1\leqslant i\leqslant\ell, we have Yiσi1(X)Y_{i}\subseteq\sigma^{i-1}(X) and

  2. (2)

    for all ii, 1i<1\leqslant i<\ell, we have π1(Yi+1)¯Kol=π2(Yi)¯Kol\overline{\pi_{1}(Y_{i+1})}^{\operatorname{Kol}}=\overline{\pi_{2}(Y_{i})}^{\operatorname{Kol}}.

Lemma 6.14.

For every train (Y1,,Y)(Y_{1},\ldots,Y_{\ell}) in XX, there exists a partial solution p1,,pp_{1},\ldots,p_{\ell} of (X,π1,π2)(X,\pi_{1},\pi_{2}) such that for all ii, we have piYip_{i}\in Y_{i}. In particular, if there is an infinite train in XX, then there is a solution of the triple (X,π1,π2)(X,\pi_{1},\pi_{2}).

Proof.

We prove it as in [18, Lemma 6.7], as follows. To prove the existence of a partial solution of (X,π1,π2)(X,\pi_{1},\pi_{2}) with the desired property, it suffices to prove the following:

Claim.

There exists a nonempty open (in the sense of the Kolchin topology) subset UYU\subseteq Y_{\ell} such that for each pUp_{\ell}\in U, pp_{\ell} can be extended to a partial solution p1,,pp_{1},\ldots,p_{\ell} of (X,π1,π2)(X,\pi_{1},\pi_{2}) with piYip_{i}\in Y_{i} for every 1i1\leqslant i\leqslant\ell.

We will prove the Claim by induction on \ell. For =1\ell=1, take U=Y1U=Y_{1}. Since each point in Y1Y_{1} is a partial solution of (X,π1,π2)(X,\pi_{1},\pi_{2}) of length 1, the Claim holds for =1\ell=1. Now suppose we have proved the Claim for 1\ell-1. So there exists a nonempty open subset U0Y1U_{0}\subseteq Y_{\ell-1} satisfying the desired property. Since Y1Y_{\ell-1} is irreducible, U0U_{0} is dense in Y1.Y_{\ell-1}. So, π2(U0)\pi_{2}(U_{0}) is dense in π2(Y1)¯Kol=π1(Y)¯Kol\overline{\pi_{2}(Y_{\ell-1})}^{\operatorname{Kol}}=\overline{\pi_{1}(Y_{\ell})}^{\operatorname{Kol}}. Since U0U_{0} is Δ\Delta-constructible (that is, solution set of a quantifier-free formula with parameters in KK or, equivalently, a finite union of Δ\Delta-closed and Δ\Delta-open sets), π2(U0)\pi_{2}(U_{0}) is Δ\Delta-constructible too. So, π2(U0)\pi_{2}(U_{0}) contains a nonempty open subset of π1(Y)¯Kol\overline{\pi_{1}(Y_{\ell})}^{\operatorname{Kol}}.

Since π1(Y)\pi_{1}(Y_{\ell}) is Δ\Delta-constructible and dense in π1(Y)¯Kol\overline{\pi_{1}(Y_{\ell})}^{\operatorname{Kol}}, π2(U0)π1(Y)\pi_{2}(U_{0})\cap\pi_{1}(Y_{\ell})\neq\varnothing is Δ\Delta-constructible and dense in π1(Y)¯Kol\overline{\pi_{1}(Y_{\ell})}^{\operatorname{Kol}}. Let U1U_{1} be a nonempty open subset of π1(Y)¯Kol\overline{\pi_{1}(Y_{\ell})}^{\operatorname{Kol}} contained in π2(U0)π1(Y)\pi_{2}(U_{0})\cap\pi_{1}(Y_{\ell}) and

U2=π11(U1)Y.U_{2}=\pi_{1}^{-1}(U_{1})\cap Y_{\ell}.

Then U2U_{2} is a nonempty open subset of YY_{\ell}. We will show that for each pU2p_{\ell}\in U_{2}, there exists piYip_{i}\in Y_{i} for i=1,,1i=1,\ldots,\ell-1 such that p1,,pp_{1},\ldots,p_{\ell} is a partial solution of (X,π1,π2).(X,\pi_{1},\pi_{2}).

Since π1(p)U1π2(U0)\pi_{1}(p_{\ell})\in U_{1}\subset\pi_{2}(U_{0}), there exists p1U0p_{\ell-1}\in U_{0} such that π1(p)=π2(p1).\pi_{1}(p_{\ell})=\pi_{2}(p_{\ell-1}). Since p1U0p_{\ell-1}\in U_{0}, by the inductive hypothesis, there exists piYip_{i}\in Y_{i} for i=1,,1i=1,\ldots,\ell-1 such that p1,,p1p_{1},\ldots,p_{\ell-1} is a partial solution of (X,π1,π2)(X,\pi_{1},\pi_{2}) of length 1\ell-1. So p1,,pp_{1},\ldots,p_{\ell} is a partial solution of (X,π1,π2)(X,\pi_{1},\pi_{2}) of length \ell. ∎

For two trains Y=(Y1,,Y)Y=(Y_{1},\ldots,Y_{\ell}) and Y=(Y1,,Y)Y^{\prime}=(Y^{\prime}_{1},\ldots,Y^{\prime}_{\ell}), denote YYY\subseteq Y^{\prime} if YiYiY_{i}\subseteq Y^{\prime}_{i} for each ii. Given an increasing chain of trains Yi=(Yi,1,,Yi,)Y_{i}=(Y_{i,1},\ldots,Y_{i,\ell}),

(iYi,1¯Kol,,iYi,¯Kol)\big{(}\overline{\cup_{i}Y_{i,1}}^{\operatorname{Kol}},\ldots,\overline{\cup_{i}Y_{i,\ell}}^{\operatorname{Kol}}\big{)}

is a train in XX that is an upper bound for this chain. (For each jj, iYi,j¯Kol\overline{\cup_{i}Y_{i,j}}^{\operatorname{Kol}} is an irreducible Δ\Delta-variety in σj1\sigma^{j-1}(X).) So by Zorn’s lemma, maximal trains of length \ell always exist in XX.

For \ell\in\mathbb{N}, consider the product

X:=X×σ(X)××σ1(X)\textbf{X}_{\ell}:=X\times\sigma(X)\times\ldots\times\sigma^{\ell-1}(X)

and denote the projection of X\textbf{X}_{\ell} onto σi1(X)\sigma^{i-1}(X) by φ,i\varphi_{\ell,i}. Let

𝐖(X,π1,π2):={pX:π2(φ,i(p))=π1(φ,i+1(p)),i=1,,1}.\mathbf{W}_{\ell}(X,\pi_{1},\pi_{2}):=\{p\in\textbf{X}_{\ell}:\pi_{2}(\varphi_{\ell,i}(p))=\pi_{1}(\varphi_{\ell,i+1}(p)),i=1,\ldots,\ell-1\}.
Lemma 6.15.

For every irreducible Δ\Delta-subvariety W𝐖W\subset\mathbf{W}_{\ell},

(φ,1(W)¯Kol,,φ,(W)¯Kol)\big{(}\overline{\varphi_{\ell,1}(W)}^{\operatorname{Kol}},\ldots,\overline{\varphi_{\ell,\ell}(W)}^{\operatorname{Kol}}\big{)}

is a train in XX of length \ell. Conversely, for each train (Y1,,Y)(Y_{1},\ldots,Y_{\ell}) in XX, there exists an irreducible Δ\Delta-subvariety W𝐖W\subseteq\mathbf{W}_{\ell} such that Yi=φ,i(W)¯KolY_{i}=\overline{\varphi_{\ell,i}(W)}^{\operatorname{Kol}} for each i=1,,i=1,\ldots,\ell.

Proof.

The proof follows [18, Lemma 6.8]. The first assertion is straightforward. We will prove the second assertion by induction on \ell. For =1\ell=1, 𝐖1=X\mathbf{W}_{1}=X, and we can set W=Y1W=Y_{1}.

Let >1\ell>1. Apply the inductive hypothesis to the train (Y1,,Y1)(Y_{1},\ldots,Y_{\ell-1}) and obtain an irreducible subvariety Y𝐖1𝐗1Y^{\prime}\subset\mathbf{W}_{\ell-1}\subset\mathbf{X}_{\ell-1}. Then there is a natural embedding of Y×YY^{\prime}\times Y_{\ell} into 𝐗\mathbf{X}_{\ell}. Denote (Y×Y)𝐖(Y^{\prime}\times Y_{\ell})\cap\mathbf{W}_{\ell} by Y~\widetilde{Y}. Since Y𝐖1Y^{\prime}\subset\mathbf{W}_{\ell-1},

Y~={pY×Y|π2(φ,1(p))=π1(φ,(p))}.\widetilde{Y}=\{p\in Y^{\prime}\times Y_{\ell}\>|\>\pi_{2}\left(\varphi_{\ell,\ell-1}(p)\right)=\pi_{1}\left(\varphi_{\ell,\ell}(p)\right)\}.

Let

(3) Z:=π2(φ1,1(Y))¯=π1(Y)¯.Z:=\overline{\pi_{2}\left(\varphi_{\ell-1,\ell-1}(Y^{\prime})\right)}=\overline{\pi_{1}(Y_{\ell})}.

Then we have a (k,Δ)(k,\Delta)-isomorphism

RYRZRYRY~R_{Y^{\prime}}\otimes_{R_{Z}}R_{Y_{\ell}}\to R_{\widetilde{Y}}

under the (k,Δ)(k,\Delta)-algebra homomorphisms i1:RZRYi_{1}:R_{Z}\to R_{Y^{\prime}} and i2:RZRYi_{2}:R_{Z}\to R_{Y_{\ell}} induced by π2φ1,1\pi_{2}\circ\varphi_{\ell-1,\ell-1} and π1\pi_{1}, respectively. Equality (3) implies that i1i_{1} and i2i_{2} are injective. Denote the fields of fractions of RYR_{Y^{\prime}}, RYR_{Y_{\ell}}, and RZR_{Z} by EE, FF, and LL, respectively. Let 𝔭\mathfrak{p} be any prime differential ideal in ELFE\otimes_{L}F,

R:=(ELF)/𝔭,R:=(E\otimes_{L}F)/\mathfrak{p},

and π:ELFR\pi\colon E\otimes_{L}F\to R be the canonical homomorphism. Consider the natural homomorphism i:RYRZRYELFi\colon R_{Y^{\prime}}\otimes_{R_{Z}}R_{Y_{\ell}}\to E\otimes_{L}F. Since 1i(RYRZRY)1\in i(R_{Y^{\prime}}\otimes_{R_{Z}}R_{Y_{\ell}}), the composition πi\pi\circ i is a nonzero homomorphism. Since i1i_{1} and i2i_{2} are injective, the natural homomorphisms iY:RYRYRZRYi_{Y^{\prime}}\colon R_{Y^{\prime}}\to R_{Y^{\prime}}\otimes_{R_{Z}}R_{Y_{\ell}} and iY:RYRYRZRYi_{Y_{\ell}}\colon R_{Y_{\ell}}\to R_{Y^{\prime}}\otimes_{R_{Z}}R_{Y_{\ell}} are injective as well. We will show that the compositions

πiiY:RYRandπiiY:RYR\pi\circ i\circ i_{Y^{\prime}}\colon R_{Y^{\prime}}\to R\quad\text{and}\quad\pi\circ i\circ i_{Y_{\ell}}\colon R_{Y_{\ell}}\to R

are injective. Introducing the natural embeddings iE:EELFi_{E}\colon E\to E\otimes_{L}F and jY:RYEj_{Y^{\prime}}\colon R_{Y^{\prime}}\to E, we can rewrite

πiiY=πiEjY.\pi\circ i\circ i_{Y^{\prime}}=\pi\circ i_{E}\circ j_{Y^{\prime}}.

The homomorphisms iEi_{E} and jYj_{Y^{\prime}} are injective. The restriction of π\pi to iE(E)i_{E}(E) is also injective since EE is a field. Hence, the whole composition πiEjY\pi\circ i_{E}\circ j_{Y^{\prime}} is injective. The argument for πiiY\pi\circ i\circ i_{Y_{\ell}} is analogous. Let

S:=(RYRZRY)/(𝔭(RYRZRY)),S:=\big{(}R_{Y^{\prime}}\otimes_{R_{Z}}R_{Y_{\ell}}\big{)}\big{/}\bigl{(}\mathfrak{p}\cap\big{(}R_{Y^{\prime}}\otimes_{R_{Z}}R_{Y_{\ell}}\big{)}\bigr{)},

which is a domain, and the homomorphisms πiiY:RYS\pi\circ i\circ i_{Y^{\prime}}:R_{Y^{\prime}}\to S and πiiY:RYS\pi\circ i\circ i_{Y_{\ell}}:R_{Y_{\ell}}\to S are injective. Let Fk{W}F\subset k\{\textbf{W}_{\ell}\} be such that S=k{W}/F()S=k\{\textbf{W}_{\ell}\}/\sqrt{\langle F\rangle}^{(\infty)}. We now let WW be the Δ\Delta-subvariety of W\textbf{W}_{\ell} defined by F=0F=0. For every ii, 1i<1\leqslant i<\ell, the homomorphism

φ,i=(πiiY)φ1,i:RYiRYS\varphi_{\ell,i}^{\sharp}=(\pi\circ i\circ i_{Y^{\prime}})\circ\varphi_{\ell-1,i}^{\sharp}:R_{Y_{i}}\to R_{Y^{\prime}}\to S

is injective as a composition of two injective homomorphisms. Hence, the restriction φ,i:WYi\varphi_{\ell,i}\colon W\to Y_{i} is dominant. ∎

Lemma 6.16.

Let (X,π1,π2)(X,\pi_{1},\pi_{2}) be a triple with XX bounded by nn. Then, for every \ell, the number of maximal trains of length \ell in XX does not exceed Components(m,n)\operatorname{Components}(m,\ell n).

Proof.

By Lemma 6.15, the number of maximal trains of length \ell in XX is equal to the number of irreducible components of W\textbf{W}_{\ell}. By Theorem 5.13, this number does not exceed Components(m,n)\operatorname{Components}(m,\ell n). ∎

Definition 6.17.

Let (X,π1,π2)(X,\pi_{1},\pi_{2}) be a triple and ω(t)\omega(t) be a numeric polynomial. We define B(X,ω){}B(X,\omega)\in\mathbb{Z}\cup\{\infty\} as the smallest value that is greater than the length of any train in XX with Kolchin polynomials at least ω\omega.

Lemma 6.18.

Let XX be a differential variety bounded by nn such that B(X,0)<B(X,0)<\infty. Then B(X,ωX)B(X,\omega_{X}) does not exceed the number of components of XX plus one.

Proof.

Denote the number of components in XX by NN and assume that there is a train (Y1,,YN+1)(Y_{1},\ldots,Y_{N+1}) with the Kolchin polynomial at least ωX\omega_{X}. Then each of Y1,σ1(Y2),,σN(YN+1)Y_{1},\sigma^{-1}(Y_{2}),\ldots,\sigma^{-N}(Y_{N+1}) must be a component of XX, so there exist 1i<jN+11\leqslant i<j\leqslant N+1 such that Yj=σjiYiY_{j}=\sigma^{j-i}Y_{i}. Thus, there exists an infinite train (Y1,,Yi,Yi+1,,Yj1,σji(Yi),σji(Yi+1),)(Y_{1},\ldots,Y_{i},Y_{i+1},\ldots,Y_{j-1},\sigma^{j-i}(Y_{i}),\sigma^{j-i}(Y_{i+1}),\ldots) in XX. This contradicts to B(X,0)<B(X,0)<\infty. ∎

Lemma 6.19.

There exists a computable function Iter(n,D)\operatorname{Iter}(n,D) such that, for every triple (X,π1,π2)(X,\pi_{1},\pi_{2}) such that

  • B(X,0)<B(X,0)<\infty

  • XX is bounded by nn

and every numeric polynomial ω1(t)>0\omega_{1}(t)>0, there exists a numeric polynomial ω2(t)0\omega_{2}(t)\geqslant 0 such that

  • ω2(t)<ω1(t)\omega_{2}(t)<\omega_{1}(t);

  • |ω2|Iter(n,B(X,ω1))|\omega_{2}|\leqslant\operatorname{Iter}(n,B(X,\omega_{1}));

  • B(X,ω2)Iter(n,B(X,ω1))B(X,\omega_{2})\leqslant\operatorname{Iter}(n,B(X,\omega_{1})).

Proof.

The proof follows [18, Lemma 6.20]. Let B1:=B(X,ω1)B_{1}:=B(X,\omega_{1}), and let TT be the number of maximal trains of length B1B_{1} in XX. We set B2:=B1+TB_{2}:=B_{1}+T. Lemma 6.16 implies that TT is bounded by Components(m,nB1)\operatorname{Components}(m,nB_{1}). Consider the fibered product 𝐖B1(X,π1,π2)\mathbf{W}_{B_{1}}(X,\pi_{1},\pi_{2}), and, for each irreducible component WW in it, denote the corresponding train by YWY_{W}. We set (assuming max=0\max\varnothing=0)

ω2:=max{ωYWωYW<ω1,W is a component of 𝐖B1}.\omega_{2}:=\max\bigl{\{}\omega_{Y_{W}}\mid\omega_{Y_{W}}<\omega_{1},W\text{ is a component of }\mathbf{W}_{B_{1}}\bigr{\}}.

We will show that B(X,ω2)B1+TB(X,\omega_{2})\leqslant B_{1}+T. Assume that there is a maximal train (Y1,,YB2)(Y_{1},\ldots,Y_{B_{2}}) in XX with the Kolchin polynomial at least ω2\omega_{2}. Introduce T+1T+1 trains Z(1),,Z(T+1)Z^{(1)},\ldots,Z^{(T+1)} of length B1B_{1} in X,σ(X),,σT(X)X,\sigma(X),\ldots,\sigma^{T}(X), respectively, such that for each jj,

Z(j)=(Z1(j),,ZT(j)):=(Yj,,Yj+B11).Z^{(j)}=\big{(}Z^{(j)}_{1},\ldots,Z^{(j)}_{T}\big{)}:=(Y_{j},\ldots,Y_{j+B_{1}-1}).

Then for each jj, consider a maximal train Z~(j)\tilde{Z}^{(j)} of length B1B_{1} containing Z(j)Z^{(j)}. So σj+1(Z~(j))\sigma^{-j+1}(\tilde{Z}^{(j)}) is a maximal train of length B1B_{1} in XX. There are two cases to consider:

(Case 1) {ωYW(t)|ωYW(t)<ω1(t),W is a component of 𝐖B1}=.\big{\{}\omega_{Y_{W}}(t)\>\big{|}\>\omega_{Y_{W}}(t)<\omega_{1}(t),\;W\text{ is a component of }\mathbf{W}_{B_{1}}\big{\}}=\varnothing.

In this case, Z~(1)\tilde{Z}^{(1)} is a train in XX with Kolchin polynomial at least ω1\omega_{1}. This contradicts the definition of B(X,ω1)B(X,\omega_{1}).

(Case 2) {ωYW(t)|ωYW(t)<ω1(t),W is a component of 𝐖B1}.\big{\{}\omega_{Y_{W}}(t)\>\big{|}\>\omega_{Y_{W}}(t)<\omega_{1}(t),\;W\text{ is a component of }\mathbf{W}_{B_{1}}\big{\}}\neq\varnothing.

By the definition of B(X,ω1)B(X,\omega_{1}), for every jj, ωσj+1(Z~(j))(t)<ω1(t)\omega_{\sigma^{-j+1}(\tilde{Z}^{(j)})}(t)<\omega_{1}(t). This implies that, for each jj,

ωσj+1(Z~(j))(t)=ω2(t).\omega_{\sigma^{-j+1}(\tilde{Z}^{(j)})}(t)=\omega_{2}(t).

Since there are only TT maximal trains in XX of length B1B_{1}, there exist a<ba<b such that

σa+1(Z~(a))=σb+1(Z~(b))=:Z.\sigma^{-a+1}(\tilde{Z}^{(a)})=\sigma^{-b+1}(\tilde{Z}^{(b)})=:Z.

Since ωZ=ω2\omega_{Z}=\omega_{2}, there exists \ell such that ωZ=ω2\omega_{Z_{\ell}}=\omega_{2}. Since

ωσa+1(Z(a))=ω2 and σa+1(Z(a))Z\omega_{\sigma^{-a+1}({Z}^{(a)}_{\ell})}=\omega_{2}\quad\text{ and }\quad\sigma^{-a+1}({Z}^{(a)}_{\ell})\subseteq Z_{\ell}

we have σa+1(Z(a))=Z\sigma^{-a+1}({Z}^{(a)}_{\ell})=Z_{\ell}. Similarly, we can show σb+1(Z(b))=Z\sigma^{-b+1}({Z}^{(b)}_{\ell})=Z_{\ell}. Hence,

σa+1(Ya+1)=σa+1(Z(a))=σb+1(Z(b))=σb+1(Yb+1).\sigma^{-a+1}(Y_{a+\ell-1})=\sigma^{-a+1}({Z}^{(a)}_{\ell})=\sigma^{-b+1}({Z}^{(b)}_{\ell})=\sigma^{-b+1}(Y_{b+\ell-1}).

Thus, we have Yb+1=σba(Ya+1)Y_{b+\ell-1}=\sigma^{b-a}(Y_{a+\ell-1}). This contradicts the fact that B(X,0)<B(X,0)<\infty.

It remains to show that |ω2||\omega_{2}| is bounded by a computable function of nn and B1B_{1}. Let WW be a component of 𝐖B1\mathbf{W}_{B_{1}} such that ωYW=ω2\omega_{Y_{W}}=\omega_{2}. Let YW=(YW,1,,YW,B1)Y_{W}=(Y_{W,1},\ldots,Y_{W,B_{1}}). There exists 1iB11\leqslant i\leqslant B_{1} such that ωYi=ω2\omega_{Y_{i}}=\omega_{2}. Since YiY_{i} is the Kolchin closure of a linear projection of a component of 𝐖B1\mathbf{W}_{B_{1}} and 𝐖B1\mathbf{W}_{B_{1}} is bounded by B1nB_{1}n, Lemma 6.6 implies that |ω2||\omega_{2}| is bounded by a computable function of nn and B1B_{1}.

Taking Iter(n,D)\operatorname{Iter}(n,D) to be the maximum of the computable bounds for B(X,ω2)B(X,\omega_{2}) and |ω2||\omega_{2}|, we conclude the proof. ∎

Definition 6.20.

Let nn be a positive integer and ω(t)\omega(t) be a numeric polynomial such that ω>0\omega>0. We define B(n,ω){}B(n,\omega)\in\mathbb{Z}\cup\{\infty\} as the smallest value such that, for every affine differential variety XX bounded by nn, if there exists a train in XX with Kolchin polynomial at least ω\omega of length at least B(n,ω)B(n,\omega), then there exists an infinite train in XX.

Proposition 6.21.

B(n,0)B(n,0) is bounded by a computable function A(n)A(n).

Proof.

We recursively define the following function G(n)G(n) on nonnegative integers

G(0):=max(Components(n,n)+1,KolchinProj(n)),\displaystyle G(0):=\max\bigl{(}\operatorname{Components}(n,n)+1,\operatorname{KolchinProj}(n)\bigr{)},
G(j+1):=Iter(n,G(j)),j0.\displaystyle G(j+1):=\operatorname{Iter}(n,G(j)),\ \ j\geqslant 0.

Consider a variety XX bounded by nn such that there is no infinite train in XX, that is B(X,0)<B(X,0)<\infty. Lemma 6.18 implies that B(X,ωX)1B(X,\omega_{X})-1 does not exceed the number of components of XX. Hence, Theorem 5.13 implies that B(X,ωX)Components(n,n)+1B(X,\omega_{X})\leqslant\operatorname{Components}(n,n)+1. Lemma 6.6 implies that |ωX|KolchinProj(n)|\omega_{X}|\leqslant\operatorname{KolchinProj}(n). Repeatedly applying Lemma 6.19, we obtain a sequence of numeric polynomials

ω0:=ωX>ω1>ω2>\omega_{0}:=\omega_{X}>\omega_{1}>\omega_{2}>\ldots

such that, for every 1iL1\leqslant i\leqslant L, we have B(X,ωi)G(i)B(X,\omega_{i})\leqslant G(i) and |ωi|G(i)|\omega_{i}|\leqslant G(i). Since the Kolchin polynomial are well-ordered, there exists LL such that ωL=0\omega_{L}=0. Proposition 6.7 implies that LLenGL\leqslant\operatorname{Len}_{G}. Hence, B(X,0)G(LenG)B(X,0)\leqslant G(\operatorname{Len}_{G}), where the right-hand side is a computable function of nn. Set A(n):=G(LenG)A(n):=G(\operatorname{Len}_{G}), then B(n,0)A(n)B(n,0)\leqslant A(n). ∎

Corollary 6.22.

For all rr, mm and s0s\in\mathbb{Z}_{\geqslant 0}, and a set of Δ\Delta-σ\sigma polynomials Fk[𝐲s]F\subset k[\bm{y}_{s}] with |Δ|=m|\Delta|=m, degFs\deg F\leqslant s and |𝐲|=r|\bm{y}|=r, F=0F=0 has a sequence solution in KK^{\mathbb{Z}} if and only if F=0F=0 has a partial solution of computable length A(max{r,m,s})A(\max\{r,m,s\}).

Proof.

The proof is as in [18, Corollary 6.21], as follows. Let X𝔸HX\subset\mathbb{A}^{H} be the Δ\Delta-variety defined by F=0F=0 regarded as a system of Δ\Delta-equations in 𝒚,σ(𝒚),,σh(𝒚)\bm{y},\sigma(\bm{y}),\ldots,\sigma^{h}(\bm{y}), where H=n(h+1)H=n(h+1). By Lemmas 6.12 and 6.14, F=0F=0 has a partial solution of length DD (resp. F=0F=0 has a solution in KK^{\mathbb{Z}} ) if and only if there exists a train of length DD in XX (resp., there exists an infinite train in XX). By Proposition 6.21, if there exists a train of length D:=A(max{r,m,s})D:=A(\max\{r,m,s\}) in XX, then there exists a infinite train in XX. So the assertion holds. ∎

6.3. Upper bound for delay PDEs

We now state and prove the main result of this section which generalizes [18, Theorem 3.1] to delay PDEs.

Theorem 6.23 (Effective elimination for delay PDEs).

For all non-negative integers rr, mm, and ss, there exists a computable B=B(r,m,s)B=B(r,m,s) such that, for all:

  • non-negative integers qq and tt,

  • Δ\Delta-σ\sigma-fields kk with chark=0\operatorname{char}k=0 and |Δ|=m|\Delta|=m,

  • sets of Δ\Delta-σ\sigma-polynomials Fk[𝒙t,𝒚s]F\subset k[\bm{x}_{t},\bm{y}_{s}], where 𝒙=x1,,xq\bm{x}=x_{1},\ldots,x_{q}, 𝒚=y1,,yr\bm{y}=y_{1},\ldots,y_{r}, and deg𝒚Fs\deg_{\bm{y}}F\leqslant s,

we have

σi(F)i0()k[𝒙]{0}σi(F)i[0,B](B)k[𝒙B+t]{0}.\big{\langle}\sigma^{i}(F)\mid i\in\mathbb{Z}_{\geqslant 0}\big{\rangle}^{(\infty)}\cap k[\bm{x}_{\infty}]\neq\{0\}\iff\langle\sigma^{i}(F)\mid i\in[0,B]\big{\rangle}^{(B)}\cap k[\bm{x}_{B+t}]\neq\{0\}.
Proof.

The proof closely follows [18, Theorem 6.22]. The “\impliedby” implication is straightforward. We will prove the “\implies” implication. For this, let A:=A(max{r,m,s})A:=A(\max\{r,m,s\}) from Corollary 6.22, and let BB be a computable bound obtained from [10, Theorem 3.4] with

mm,nr(A+s+1),hs,andDs.m\leftarrow m,\ \ n\leftarrow r(A+s+1),\ \ h\leftarrow s,\ \ \text{and}\ \ D\leftarrow s.

By assumption,

(4) 1σi(F)i0()k(𝒙)[𝒚].1\in\big{\langle}\sigma^{i}(F)\mid i\in\mathbb{Z}_{\geqslant 0}\big{\rangle}^{(\infty)}\cdot k(\bm{x}_{\infty})[\bm{y}_{\infty}].

Suppose that

(5) σi(F)i[0,A](B)k[𝒙B+t]={0}.\langle\sigma^{i}(F)\mid i\in[0,A]\big{\rangle}^{(B)}\cap k[\bm{x}_{B+t}]=\{0\}.

If

1σi(F)i[0,A](B)k(𝒙B+t)[𝒚,A+s],1\in\big{\langle}\sigma^{i}(F)\mid i\in[0,A]\big{\rangle}^{({B})}\cdot k(\bm{x}_{B+t})[\bm{y}_{\infty,A+s}],

then there would exist ci,jk(𝒙B+t)[𝒚,A+s]c_{i,j}\in k(\bm{x}_{B+t})[\bm{y}_{\infty,A+s}] such that

(6) 1=θΘΔ(B)j=0AfFci,jθ(σj(f)).1=\sum\limits_{\theta\in\Theta_{\Delta}(B)}\sum\limits_{j=0}^{A}\sum\limits_{f\in F}c_{i,j}\theta(\sigma^{j}(f)).

Multiplying equation (6) by the common denominator in the variables 𝒙B+t\bm{x}_{B+t}, we obtain a contradiction with (5). Hence, by [10, Theorem 3.4],

1σi(F)i[0,A]()k(𝒙B+t)[𝒚,A+s].1\notin\big{\langle}\sigma^{i}(F)\mid i\in[0,A]\big{\rangle}^{(\infty)}\cdot k(\bm{x}_{B+t})[\bm{y}_{\infty,A+s}].

By Lemma 6.8, there exists a differentially closed Δ\Delta-σ\sigma^{*}-field extension Lk(𝒙)k(𝒙B+t)L\supset k(\bm{x}_{\infty})\supset k(\bm{x}_{B+t}). Then differential Nullstellensatz implies that the system of differential equations

{σi(F)=0i[0,A]}\{\sigma^{i}(F)=0\mid i\in[0,A]\}

in the unknowns 𝒚,A+s\bm{y}_{\infty,A+s} has a solution in LL. Then the system F=0F=0 has a partial solution of length A+1A+1 in LL. Now from (4), we see that the system F=0F=0 has no solutions in LL^{\mathbb{Z}}. Together with the existence of a partial solution of length A+1A+1, this contradicts to Corollary 6.22. ∎

Acknowledgments

This work was partially supported by the NSF grants CCF-1564132, CCF-1563942, DMS-1760448, DMS-1760413, DMS-1853650, DMS-1853482, and DMS-1800492; the NSFC grants (11971029, 11688101) and the fund of Youth Innovation Promotion Association of CAS. We are grateful to the referees for numerous insightful comments, which helped us substantially improve the paper.

References

  • Bächler et al. [2012] T. Bächler, V. Gerdt, M. Lange-Hegermann, and D. Robertz. Algorithmic Thomas decomposition of algebraic and differential systems. Journal of Symbolic Computation, 47(10):1233–1266, 2012. URL https://doi.org/10.1016/j.jsc.2011.12.043.
  • Blum et al. [1998] L. Blum, F. Cucker, M. Shub, and S. Smale. Complexity and Real Computation. Springer, 1998. URL https://doi.org/10.1007/978-1-4612-0701-6.
  • Boulier et al. [2009] F. Boulier, D. Lazard, F. Ollivier, and M. Petitot. Computing representations for radicals of finitely generated differential ideals. Applicable Algebra in Engineering, Communication and Computing, 20:73–121, 2009. URL https://doi.org/10.1007/s00200-009-0091-7.
  • Chapuis and Koiran [1999] O. Chapuis and P. Koiran. Saturation and stability in the theory of computation over the reals. Annals of Pure and Applied Logic, 99(1):1–49, 1999. URL https://doi.org/10.1016/S0168-0072(98)00060-8.
  • Gao et al. [2009a] X.-S. Gao, Y. Luo, and C. Yuan. A characteristic set method for ordinary difference polynomial systems. Journal of Symbolic Computation, 44(3):242–260, 2009a. URL https://doi.org/10.1016/j.jsc.2007.05.005.
  • Gao et al. [2009b] X.-S. Gao, J. Van der Hoeven, C.-M. Yuan, and G.-L. Zhang. Characteristic set method for differential-difference polynomial systems. Journal of Symbolic Computation, 44(9):1137–1163, 2009b. URL https://doi.org/10.1016/j.jsc.2008.02.010.
  • Gerdt and Robertz [2019] V. Gerdt and D. Robertz. Algorithmic approach to strong consistency analysis of finite difference approximations to PDE systems. In ISSAC ’19: Proceedings of the 2019 on International Symposium on Symbolic and Algebraic Computation, pages 163–170. ACM Press, 2019. URL https://doi.org/10.1145/3326229.3326255.
  • Golubitsky et al. [2009] O. Golubitsky, M. Kondratieva, A. Ovchinnikov, and A. Szanto. A bound for orders in differential Nullstellensatz. Journal of Algebra, 322(11):3852–3877, 2009. URL https://doi.org/10.1016/j.jalgebra.2009.05.032.
  • Goode [1994] J. B. Goode. Accessible telephone directories. The Journal of Symbolic Logic, 59:92–105, 1994. URL https://www.jstor.org/stable/2275252.
  • Gustavson et al. [2016] R. Gustavson, M. Kondratieva, and A. Ovchinnikov. New effective differential Nullstellensatz. Advances in Mathematics, 290:1138–1158, 2016. URL http://dx.doi.org/10.1016/j.aim.2015.12.021.
  • Harrison-Trainor et al. [2012] M. Harrison-Trainor, J. Klys, and R. Moosa. Nonstandard methods for bounds in differential polynomial rings. Journal of Algebra, 360:71–86, 2012. URL https://doi.org/10.1016/j.jalgebra.2012.03.013.
  • Heintz [1983] J. Heintz. Definability and fast quantifier elimination in algebraically closed fields. Theoretical Computer Science, 24(3):239–277, 1983. URL http://dx.doi.org/10.1016/0304-3975(83)90002-6.
  • Kolchin [1973] E. Kolchin. Differential Algebra and Algebraic Groups. Academic Press, New York, 1973.
  • Léon Sánchez [2016] O. Léon Sánchez. On the model companion of partial differential fields with an automorphism. Israel Journal of Mathematics, 212:419–442, 2016. URL https://doi.org/10.1007/s11856-016-1292-y.
  • León Sánchez [2019] O. León Sánchez. Estimates for the coefficients of differential dimension polynomials. Mathematics of Computation, 88:2959–2985, 2019. URL https://doi.org/10.1090/mcom/3429.
  • Léon Sánchez and Moosa [2016] O. Léon Sánchez and R. Moosa. The model companion of differential fields with free operators. Journal of Symbolic Logic, 81(2):493–509, 2016. URL https://doi.org/10.1017/jsl.2015.76.
  • Levin [2008] A. Levin. Difference algebra, volume 8 of Algebra and Applications. Springer, New York, 2008. URL http://dx.doi.org/10.1007/978-1-4020-6947-5.
  • Li et al. [2018] W. Li, A. Ovchinnikov, G. Pogudin, and T. Scanlon. Elimination of unknowns for systems of algebraic differential-difference equations, 2018. URL https://arxiv.org/abs/1812.11390.
  • Marker [2002] D. Marker. Model theory: An introduction. Springer-Verlag New York, 2002. URL https://doi.org/10.1007/b98860.
  • McAloon [1984] K. McAloon. Petri nets and large finite sets. Theoretical Computer Science, 32(1-2):173–183, 1984. URL https://doi.org/10.1016/0304-3975(84)90029-X.
  • McGrail [2000] T. McGrail. The model theory of differential fields with finitely many commuting derivations. Journal of Symbolic Logic, 65(2):885–913, 2000. URL https://doi.org/10.2307/2586576.
  • Mikhalev et al. [1999] A. V. Mikhalev, A. Levin, E. Pankratiev, and M. Kondratieva. Differential and Difference Dimension Polynomials, volume 461 of Mathematics and Its Applications. Springer Netherlands, 1999. URL https://doi.org/10.1007/978-94-017-1257-6.
  • Moosa and Scanlon [2014] R. Moosa and T. Scanlon. Model theory of fields with free operators in characteristic zero. Journal of Mathematical Logic, 14(2):1450009, 2014. URL https://doi.org/10.1142/s0219061314500093.
  • Papadimitriou [1993] C. H. Papadimitriou. Computational Complexity. Pearson, 1993.
  • Rust and Reid [1997] C. J. Rust and G. J. Reid. Rankings of partial derivatives. In Proceedings of the 1997 international symposium on Symbolic and algebraic computation - ISSAC’97, 1997. URL https://doi.org/10.1145/258726.258737.
  • Simmons and Towsner [2019] W. Simmons and H. Towsner. Proof mining and effective bounds in differential polynomial rings. Advances in Mathematics, 343:567–623, 2019. URL https://doi.org/10.1016/j.aim.2018.11.026.
  • van den Dries and Shmidt [1984] L. van den Dries and K. Shmidt. Bounds in the theory of polynomial rings over fields. a nonstandard approach. Inventiones mathematicae, 76:77–91, 1984. URL https://doi.org/10.1007/BF01388493.