This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Algebraic Analysis of the Hypergeometric
Function F 11\,{{}_{1}F_{\!\!\;1}}\, of a Matrix Argument

Paul Görlach Paul Görlach
Fakultät für Mathematik
Technische Universität Chemnitz
Reichenhainer Straße 39, 09126 Chemnitz, Germany
[email protected]
Christian Lehn Christian Lehn
Fakultät für Mathematik
Technische Universität Chemnitz
Reichenhainer Straße 39, 09126 Chemnitz, Germany
[email protected]
 and  Anna-Laura Sattelberger Anna-Laura Sattelberger
Max-Planck-Institut für Mathematik in den Naturwissenschaften
Inselstraße 22, 04103 Leipzig, Germany
[email protected]
Abstract.

In this article, we investigate Muirhead’s classical system of differential operators for the hypergeometric function F 11\,{{}_{1}F_{\!\!\;1}}\, of a matrix argument. We formulate a conjecture for the combinatorial structure of the characteristic variety of its Weyl closure which is both supported by computational evidence as well as theoretical considerations. In particular, we determine the singular locus of this system.

Key words and phrases:
Algebraic Analysis, hypergeometric function, characteristic variety, singular locus, holonomic function
2020 Mathematics Subject Classification:
34M15, 33C70 (primary), 34M35, 13P10, 14Q15 (secondary).

1. Introduction

Hypergeometric functions are probably the most famous special functions in mathematics and their study dates back to Euler, Pfaff, and Gauß, earlier contributions to the development of the theory are due to Wallis, Newton, and Stirling, we refer to [7]. Around the origin, they have the series expansion

(1.1) Fqp(a1,,ap;c1,,cq)(x)n=0(a1)n(ap)n(c1)n(cq)nxnn!,\,{}_{p}F_{\!q}(a_{1},\ldots,a_{p};c_{1},\ldots,c_{q})(x)\,\coloneqq\,\sum_{n=0}^{\infty}\;{\frac{(a_{1})_{n}\cdots(a_{p})_{n}}{(c_{1})_{n}\cdots(c_{q})_{n}}}\ {\frac{x^{n}}{n!}},

where p,qp,\,q are non-negative integers with q+1p\,q+1\geq p\, and (a)n=a(a+n1)\,(a)_{n}=a\cdots(a+n-1)\, denotes the Pochhammer symbol. Hypergeometric functions are ubiquitous in mathematics and physics: they are intimately related to the theory of differential equations and show up at prominent places in physics such as the hydrogen atom. In recent years, there has been renewed interest in the subject coming from the connection with toric geometry established in [10, 11] and the interplay with mirror symmetry, see also the article [36] in this volume for more details and further references.

A natural generalization are hypergeometric functions of a matrix argument XX as introduced by Herz in [15, Section 2] using the Laplace transform. Herz was building on work of Bochner [2]. Ever since, they have been a recurrent topic in the theory of special functions. In [4, Section 5], Constantine expressed these functions as a series of zonal polynomials, thereby establishing a link with the representation theory of GLn\operatorname{GL}_{n}. This series expansion bears a striking likeness to (1.1) and is usually written as

(1.2) Fqp(a1,,ap;c1,,cq)(X)n=0λn(a1)λ(ap)λ(c1)λ(cq)λCλ(X)n!,{{}_{p}F_{\!q}}(a_{1},\ldots,a_{p};c_{1},\ldots,c_{q})(X)\,\coloneqq\,\sum_{n=0}^{\infty}\sum_{\lambda\,\vdash n}\frac{(a_{1})_{\lambda}\cdots(a_{p})_{\lambda}}{(c_{1})_{\lambda}\cdots(c_{q})_{\lambda}}\frac{C_{\lambda}(X)}{n!},

where the λ\,\lambda\, are partitions of n\,n\, and the (ai)λ(a_{i})_{\lambda}, (cj)λ(c_{j})_{\lambda} are certain generalized Pochhammer symbols, see Definition 3.2.

In this article, we examine the differential equations the hypergeometric function F 11(a;c)\,{{}_{1}F_{\!\!\;1}}(a;c)\, of a matrix argument X\,X\, satisfies from the point of view of algebraic analysis. If X\,X\, is an (m×m)(m\times m)-matrix, the function (1.2) only depends on the eigenvalues x1,,xm\,x_{1},\ldots,x_{m}\, counted with multiplicities. So we may equally well assume that X=diag(x1,,xm)\,X=\operatorname{diag}(x_{1},\ldots,x_{m})\, is a diagonal matrix. In [32], Muirhead showed that the linear partial differential operators

(1.3) gkxkk2+(cxk)k+12(kxxkx(k))a,g_{k}\,\coloneqq\,x_{k}\partial_{k}^{2}\,+\,(c-x_{k})\partial_{k}\,+\,\frac{1}{2}\mathopen{}\mathclose{{}\left(\sum_{\ell\neq k}\frac{x_{\ell}}{x_{k}-x_{\ell}}(\partial_{k}-\partial_{\ell})}\right)\,-\,a,

k=1,,mk=1,\ldots,m, annihilate F 11(a;c)\,{{}_{1}F_{\!\!\;1}}(a;c)\, wherever they are defined. We denote by Pk\,P_{k}\, the differential operator obtained from gk\,g_{k}\, by clearing denominators and consider the left ideal Im(P1,,Pm)\,I_{m}\coloneqq(P_{1},\ldots,P_{m})\, in the Weyl algebra DmD_{m}, see Section 4. We refer to Im\,I_{m}\, as the Muirhead ideal or the Muirhead system of differential equations and denote by W(Im)W(I_{m})\, its Weyl closure. Our main result is:

Theorem 5.1.

The singular locus of Im\,I_{m}\, agrees with the singular locus of W(Im)W(I_{m}). It is the hyperplane arrangement

(1.4) 𝒜{xm|k=1mxkk(xkx)=0}.\,{{\mathscr{A}}}\,\coloneqq\,\mathopen{}\mathclose{{}\left\{x\in\mathbb{C}^{m}\ \middle|\ \prod_{k=1}^{m}x_{k}\prod_{\ell\neq k}(x_{k}-x_{\ell})=0}\right\}.

This leads to a lower bound for the characteristic variety of ImI_{m}, by which we essentially mean the characteristic variety of the DmD_{m}-module Dm/ImD_{m}/I_{m}. We would like to point out that the terminology used in this article is a slight modification and refinement of the usual definition in the theory of DD-modules, taking scheme-theoretic structures into account. For details, see Definition 2.1 and the remarks thereafter.

Corollary 5.7.

The characteristic variety of W(Im)\,W(I_{m})\, contains the zero section and the conormal bundles of the irreducible components of 𝒜{{\mathscr{A}}}, i.e.,

(1.5) Char(W(Im))\displaystyle\operatorname{Char}(W(I_{m}))\,\supseteq V(ξ1,,ξm)iV(xi,ξ1,,ξi^,,ξm)\displaystyle V\mathopen{}\mathclose{{}\left(\xi_{1},\ldots,\xi_{m}}\right)\,\cup\,\bigcup_{i}V(x_{i},\xi_{1},\ldots,\widehat{\xi_{i}},\ldots,\xi_{m})
ijV(xixj,ξi+ξj,ξ1,,ξi^,,ξj^,,ξm).\displaystyle\ \ \cup\bigcup_{i\neq j}V(x_{i}-x_{j},\,\xi_{i}+\xi_{j},\,\xi_{1},\ldots,\widehat{\xi_{i}},\ldots,\widehat{\xi_{j}},\ldots,\xi_{m}).

Here, ()^\widehat{(\cdot)} means that the corresponding entry gets deleted. Note that the varieties on the right hand side of (1.5) are conormal varieties for the natural symplectic structure on T𝔸mT^{*}{\mathbb{A}}^{m}, see Section 2.2. More precisely, they are the conormal varieties to the irreducible components of the divisor 𝒜\,{{\mathscr{A}}}\, of singularities of the Muirhead system. To formulate our conjecture about the structure of the characteristic variety of W(Im)W(I_{m}), we introduce the following notation. Let J0|J1Jk\,J_{0}|J_{1}\ldots J_{k}\, denote a partition of [m]={1,,m}[m]=\{1,\ldots,m\}, such that only J0J_{0} may possibly be empty. We denote by ZJ0|J1JkZ_{J_{0}|J_{1}\ldots J_{k}} the linear subspace given by the vanishing of all xi\,x_{i}\, for iJ0\,i\in J_{0}\, and all xixj\,x_{i}-x_{j}\, for i,jJ\,i,j\in J_{\ell} and [k]\ell\in[k]. For a smooth subvariety Y𝔸mY\subseteq{\mathbb{A}}^{m}, we denote by NYT𝔸m\,N^{*}Y\subseteq T^{*}{\mathbb{A}}^{m}\, the conormal variety to YY. Then our conjecture can be phrased as follows:

Conjecture 6.2.

Let CJ0|J1JkNZJ0|J1JkC_{J_{0}|J_{1}\ldots J_{k}}\coloneqq N^{*}Z_{J_{0}|J_{1}\ldots J_{k}}. The (reduced) characteristic variety of W(Im)\,W(I_{m})\, is the following arrangement of mm-dimensional linear spaces:

Char(W(Im))red=[m]=J0JkCJ0|J1Jk.\operatorname{Char}(W(I_{m}))^{\text{\rm red}}\,=\,\bigcup_{[m]\,=\,J_{0}\sqcup\dots\sqcup J_{k}}C_{J_{0}|J_{1}\ldots J_{k}}.

In particular, it has Bm+1\,B_{m+1}\, many irreducible components, where Bn\,B_{n}\, denotes the nn-th Bell number.

By an explicit analysis of the differential operators in ImI_{m}, we also obtain an upper bound for Char(Im)\operatorname{Char}(I_{m}). For a partition J0|J1JkJ_{0}|J_{1}\dots J_{k}, we define certain subspaces C^J0|J1JkT𝔸m\widehat{C}_{J_{0}|J_{1}\dots J_{k}}\subseteq T^{*}{\mathbb{A}}^{m}\, such that CJ0|J1JkC^J0|J1Jk\,C_{J_{0}|J_{1}\dots J_{k}}\subseteq\widehat{C}_{J_{0}|J_{1}\dots J_{k}}\, with equality if and only if |J|2\,{\mathopen{}\mathclose{{}\left|J_{\ell}}\right|}\leq 2\, for all 1\ell\geq 1, see (6.3) for the precise definition.

Proposition 6.3.

The (reduced) characteristic variety of Im\,I_{m}\, is contained in the arrangement of the linear spaces C^J0|J1Jk\widehat{C}_{J_{0}|J_{1}\dots J_{k}}:

Char(Im)red[m]=J0J1JkC^J0|J1Jk.\operatorname{Char}(I_{m})^{\text{\rm red}}\,\subseteq\,\bigcup_{[m]\,=\,J_{0}\sqcup J_{1}\sqcup\dots\sqcup J_{k}}\widehat{C}_{J_{0}|J_{1}\ldots J_{k}}.

It is the upper and lower bound together with explicit computations in the computer algebra system Singular for small values of mm, see Section 6.3, that led us to formulate Conjecture 6.2. We believe that it may contribute to a better understanding of the hypergeometric function F 11{{}_{1}F_{\!\!\;1}}. As Im\,I_{m}\, turns out to be non-holonomic in general, it seems that one should rather work with its Weyl closure W(Im)W(I_{m}), for which, in general, generators are not known. Clearly, one has Char(W(Im))Char(Im)\operatorname{Char}(W(I_{m}))\subseteq\operatorname{Char}(I_{m}). Therefore, Proposition 6.3 in particular also gives an upper bound for Char(W(Im))\operatorname{Char}(W(I_{m})).

Applications and related work.

Hypergeometric functions of a matrix argument possess a rich structure and are highly fascinating objects. Not surprisingly, there is by now a long list of interesting applications in various areas such as number theory, numerical mathematics, random matrix theory, representation theory, statistics, and others; the following short list does not claim to be exhaustive.

The relation to representation theory and statistics is classical. For the link to representation theory, we refer to [1] and references therein. The connection with multivariate statistics was already present in [15] through the connection to the Wishart distribution, see [15, Section 8].

Unlike in the one-variable case, hypergeometric functions of a matrix argument have been studied from the point of view of holonomic systems only recently. The first instance we know of appeared in arithmetic [20]. Motivated by the study of Siegel modular forms and the computation of special values of LL-functions, the authors of [20] study solutions of certain systems of differential equations. They are equivalent to Muirhead’s system, see e.g. their Proposition 7.4 and Theorem 7.5. Holonomicity is shown explicitly in [20, Theorem 9.1]. Apart from number theory, hypergeometric functions of a matrix argument and holonomic systems also made an appearance in random matrix theory [6].

A large impetus came from numerical analysis with the advent of the holonomic gradient descent and the holonomic gradient method developed in [34]. These methods allowed to numerically evaluate and minimize several functions that are of importance in multivariate statistics. In [34] and [28], these methods are applied to the Fisher–Bingham distribution. In [12], the holonomic gradient method is used to approximate the cumulative distribution function of the largest root of a Wishart matrix. Motivated by this method, several teams, mainly in Japan, have studied Muirhead’s systems from the DD-module point of view such as [12, 13, 35, 40]. This is the starting point for our contribution. We examine the DD-module theoretic properties of Muirhead’s ideal for the hypergeometric function F 11\,{{}_{1}F_{\!\!\;1}}\, of a matrix argument from a completely and consistently algebraic point of view.

Outline.

This article is organized as follows. In Section 2, we recall some basic facts about the Weyl algebra and DmD_{m}-ideals. We recall the notion of holonomic functions and give a characterization that is well suited for testing holonomicity. In Section 3, we discuss hypergeometric functions of a matrix argument. In Section 4, we define the Muirhead ideal Im\,I_{m}\, and collect what is known about holonomicity of Im\,I_{m}\, and its Weyl closure. Section 5 contains our main results. We investigate the Muirhead ideal of operators annihilating F 11\,{{}_{1}F_{\!\!\;1}}\, and determine its singular locus. This section also contains some results about holomorphic and formal solutions of the Muirhead system. The characteristic variety of this ideal and its Weyl closure is investigated in Section 6. Conjecture 6.2 suggests that the characteristic variety of the Weyl closure can be described in a combinatorial way, using partitions of sets. We also discuss some basic computations in low dimensions.

For computations around the characteristic variety, we mainly used the libraries dmod [31], dmodapp [29], and dmodloc [30] in Singular [5]. We also performed some Gröbner basis computations in the rational Weyl algebra, where we used the Mathematica [21] package HolonomicFunctions [27].

Acknowledgments.

We are thankful to András Lőrincz, Christian Sevenheck, Bernd Sturmfels, and Nobuki Takayama for insightful discussions. We are grateful to the anonymous referee for valuable hints on literature and for proposing a strategy that led to an alternative proof of our main theorem using different techniques and enabled us to remove a technical condition on a parameter. We refer to the discussion in Section 5 and Appendix A for details.

P.G. acknowledges partial support by the DFG grant Se 1114/5-2. C.L. was supported by the DFG through the research grants Le 3093/2-2 and Le 3093/3-1.

2. The Weyl algebra

In this section, we recall basic facts about the Weyl algebra, the characteristic variety, and the definition of holonomic functions. We mainly follow the presentation and notation given in [37, 39].

2.1. Ideals and characteristic varieties

We start by introducing some notation and terminology. Throughout this article, {\mathbb{N}} denotes the natural numbers including 0. For m>0m\in{\mathbb{N}}_{>0}, we denote by

Dm[x1,,xm]1,,mD_{m}\,\coloneqq\,\mathbb{C}[x_{1},\ldots,x_{m}]\langle\partial_{1},\ldots,\partial_{m}\rangle

the mm-th Weyl algebra and by

Rm(x1,,xm)1,,mR_{m}\,\coloneqq\,\mathbb{C}\mathopen{}\mathclose{{}\left(x_{1},\ldots,x_{m}}\right)\langle\partial_{1},\ldots,\partial_{m}\rangle

the ring of differential operators with rational functions as coefficients. In this article, we refer to Rm\,R_{m}\, as mm-th rational Weyl algebra. For a commutative ring AA, we will abbreviate A[x]=A[x1,,xm]\,A[x]=A[x_{1},\ldots,x_{m}]\, the polynomial ring and A(x)=A(x1,,xm)\,A(x)=A(x_{1},\ldots,x_{m})\, the field of rational functions. We will also use ξ\,\xi\, as a set of variables so that e.g. (x)[ξ]=(x1,,xm)[ξ1,,ξm]\mathbb{C}(x)[\xi]=\mathbb{C}(x_{1},\ldots,x_{m})[\xi_{1},\ldots,\xi_{m}].

For a vector w=(u,v)2m\,w=(u,v)\in{\mathbb{R}}^{2m}\, with u+v0\,u+v\geq 0\, component-wise, we define a partial order on the monomials xαβ[x1,,xm]1,,m\,x^{\alpha}\partial^{\beta}\in{\mathbb{C}}[x_{1},\ldots,x_{m}]\langle\partial_{1},\ldots,\partial_{m}\rangle\, for α,βm\,\alpha,\beta\in{\mathbb{N}}^{m}\, by comparing the quantity

degw(xαβ)αu+βv=i=1mαiui+βivi,{\textrm{deg}_{w}}(x^{\alpha}\partial^{\beta})\,\coloneqq\,\alpha\cdot u+\beta\cdot v\,=\,\sum_{i=1}^{m}\alpha_{i}u_{i}+\beta_{i}v_{i},

where the indices refer to the coordinates of the vectors. We refer to w\,w\, as a weight vector and to degw\,{\textrm{deg}_{w}}\, as the ww-degree. With the notation e=(1,,1)m\,e=(1,\ldots,1)\in{\mathbb{N}}^{m}\, and w=(0,e)\,w=(0,e)\, we recover the order of a partial differential operator as the leading exponent for this ww-degree.

Given an operator PDm\,P\in D_{m}\, and a weight vector w2mw\in{\mathbb{R}}^{2m}, we define its initial form inw(P)\,\operatorname{in}_{w}(P)\, to be the sum of all terms of maximal ww-degree. Note that one has to write P\,P\, in the basis xαβ\,x^{\alpha}\partial^{\beta}\, in order to compute the ww-degree, i.e., one has to bring all differentials to the right.

The initial form inw(P)\,\operatorname{in}_{w}(P)\, can be viewed as the class of P\,P\, of the associated graded algebra grw(Dm)\,\operatorname{gr}_{w}(D_{m})\, to the filtration of Dm\,D_{m}\, induced by ww. The relation ixixii=1\,\partial_{i}x_{i}-x_{i}\partial_{i}=1 in Dm\,D_{m}\, induces the relation

ixixii={0if ui+vi>01if ui+vi=0in gr(u,v)(Dm).\partial_{i}x_{i}-x_{i}\partial_{i}\,=\,\begin{cases}0&\text{if }\,u_{i}+v_{i}>0\\ 1&\text{if }\,u_{i}+v_{i}=0\end{cases}\qquad\qquad\text{in }\operatorname{gr}_{(u,v)}(D_{m}).

To highlight this commutator relation notationally, one writes ξi\,\xi_{i}\, instead of i\,\partial_{i}\, in gr(u,v)(Dm)\,\operatorname{gr}_{(u,v)}(D_{m})\, for all indices i\,i\, with ui+vi=0u_{i}+v_{i}=0. In particular,

gr(u,v)(Dm)=[x][ξ] if u+v>0 and gr(u,v)(Dm)=Dm if u+v=0.\operatorname{gr}_{(u,v)}(D_{m})\,=\,{\mathbb{C}}[x][\xi]\ \text{ if }\,u+v>0\quad\text{ and }\quad\operatorname{gr}_{(u,v)}(D_{m})\,=\,D_{m}\ \text{ if }\,u+v=0.

A DmD_{m}-ideal is a left DmD_{m}-ideal. For a DmD_{m}-ideal II, the initial ideal with respect to w\,w\, is the left ideal

(2.1) inw(I)(inw(P)|PI)grw(Dm).\operatorname{in}_{w}(I)\,\coloneqq\,\mathopen{}\mathclose{{}\left(\,\operatorname{in}_{w}(P)\,\middle|\,P\in I\,}\right)\,\subseteq\,\operatorname{gr}_{w}(D_{m}).

A DmD_{m}-module is a left DmD_{m}-module. Mod(Dm)\text{Mod}(D_{m}) denotes the category of DmD_{m}-modules. Likewise for RmR_{m}-ideals and RmR_{m}-modules, respectively. Next we recall the important notions of a characteristic variety and of holonomicity.

Definition 2.1.

The characteristic variety of a DmD_{m}-ideal I\,I\, is the subscheme of 𝔸2m\,{\mathbb{A}}^{2m}\, determined by the ideal in(0,e)(I)[x1,,xm][ξ1,,ξm]\,\operatorname{in}_{(0,e)}(I)\subseteq\mathbb{C}[x_{1},\ldots,x_{m}][\xi_{1},\ldots,\xi_{m}]\, and is denoted by Char(I)\operatorname{Char}(I). The DmD_{m}-ideal I\,I\, is called holonomic if in(0,e)(I)\,\operatorname{in}_{(0,e)}(I)\, has dimension mm.

Remark 2.2.
  1. (1)

    Note that (0)\mathopen{}\mathclose{{}\left(0}\right) and Dm\,D_{m}\, are not holonomic. Therefore, if I\,I\, is a holonomic ideal, it is a non-zero, proper DmD_{m}-ideal.

  2. (2)

    Recall that as a consequence of an important theorem of Sato–Kawai–Kashiwara [38], we have dimZm\dim Z\geq m for all irreducible components Z\,Z\, of Char(I)\operatorname{Char}(I), see also the discussion in Section 2.2.

  3. (3)

    It is worthwhile to remark that the scheme structure of the characteristic variety is not uniquely determined by the DmD_{m}-module Dm/ID_{m}/I. Intrinsic invariants of Dm/I\,D_{m}/I\, are the set Char(I)red\,\operatorname{Char}(I)^{\operatorname{red}}\, and the multiplicity of its irreducible components, see e.g. [16, Section 2.2]. The point is that—unlike in the commutative world—II cannot be recovered as the annihilator of the DmD_{m}-module Dm/I\,D_{m}/I, and so there can be IJDm\,I\neq J\subseteq D_{m}\, with Dm/IDm/J\,D_{m}/I{\ \cong\ }D_{m}/J\,.

2.2. Conormality of the characteristic variety

We remark that 𝔸2m=Spec[x1,,xm,ξ1,,ξm]\,{\mathbb{A}}^{2m}=\operatorname{Spec}{\mathbb{C}}[x_{1},\ldots,x_{m},\xi_{1},\ldots,\xi_{m}]\, should actually be considered as the cotangent bundle T𝔸m\,T^{*}{\mathbb{A}}^{m}\, where the ξi\,\xi_{i}\, are the coordinates in the fiber of the canonical morphism T𝔸m𝔸m\,T^{*}{\mathbb{A}}^{m}\xrightarrow{\ \ }{\mathbb{A}}^{m}\, and the xi\,x_{i}\, are the coordinates in the base. Being a cotangent bundle, T𝔸mT^{*}{\mathbb{A}}^{m} carries a natural (algebraic) symplectic form σ\,\sigma\, which can explicitly be described in coordinates as

σ=dx1dξ1++dxmdξm.\sigma\,=\,dx_{1}\wedge d\xi_{1}+\cdots+dx_{m}\wedge d\xi_{m}.

The symplectic structure gives rise to the notion of a Lagrangian subvariety, that is, a subvariety ZT𝔸m\,Z\subseteq T^{*}{\mathbb{A}}^{m}\, such that at every smooth point zZregz\in Z^{\operatorname{reg}}, the tangent space TzZTz(T𝔸m)=T𝔸m\,T_{z}Z\subseteq T_{z}(T^{*}{\mathbb{A}}^{m})\,=\,T^{*}{\mathbb{A}}^{m}\, is isotropic (i.e., σ\sigma vanishes identically on this subspace) and maximal with this property. Note that a Lagrangian subvariety automatically has dimension mm. Examples for Lagrangian subvarieties in T𝔸m\,T^{*}{\mathbb{A}}^{m}\, are conormal varieties. Given a subvariety X𝔸mX\subseteq{\mathbb{A}}^{m}, the associated conormal variety NX\,N^{*}_{X}\, is defined as the Zariski closure of the conormal bundle NXreg/𝔸mT𝔸mN_{X^{\operatorname{reg}}/{\mathbb{A}}^{m}}^{*}\subseteq T^{*}{\mathbb{A}}^{m}. This is always a Lagrangian subvariety. We will make use of the following (special case of) important results due to Sato–Kawai–Kashiwara [38, Theorem 5.3.2], see also Gabber’s article [9, Theorem I] for an algebraic proof.

Theorem 2.3.

Let I\,I\, be a DmD_{m}-ideal. Then Char(I)T𝔸m\,\operatorname{Char}(I)\subseteq T^{*}{\mathbb{A}}^{m}\, is coisotropic. If I\,I\, is holonomic, every irreducible component Z\,Z\, of the characteristic variety Char(I)\,\operatorname{Char}(I)\, is a conormal variety. In particular, ZZ is Lagrangian.

To be more precise, the references above show that Z\,Z\, is Lagrangian. By definition, the characteristic variety is stable under the {\mathbb{C}}^{\ast}-action given by scalar multiplication in the fibers of T𝔸m𝔸mT^{*}{\mathbb{A}}^{m}\xrightarrow{\ \ }{\mathbb{A}}^{m}, and therefore it is conormal by [24, Lemma (3.2)], see also [16, Theorem E.3.6].

2.3. Holonomic functions

In this section, we recall the definition of a holonomic function and give a characterization of this notion which turns out to be very useful in practice.

Definition 2.4.

Let M\,M\, be a DmD_{m}-module and fMf\in M. The annihilator of f\,f\, is the DmD_{m}-ideal

AnnDm(f){PDmPf=0}.\text{Ann}_{D_{m}}\mathopen{}\mathclose{{}\left(f}\right)\,\coloneqq\,\mathopen{}\mathclose{{}\left\{P\in D_{m}\mid P\bullet f=0}\right\}.

An element fMf\in M\, is holonomic if its annihilator is a holonomic DmD_{m}-ideal.

The definition generalizes in an obvious way to arbitrary subsets NMN\subseteq M. If M\,M\, is a space of functions (e.g. holomorphic, multivalued holomorphic, smooth etc.) and fM\,f\in M\, is holonomic, then we refer to f\,f\, as a holonomic function. The definition of a holonomic function first appeared in the article [45] of Zeilberger.

Definition 2.5.

The Weyl closure of a DmD_{m}-ideal I\,I\, is the DmD_{m}-ideal

W(I)(RmI)Dm.W(I)\,\coloneqq\,\mathopen{}\mathclose{{}\left(R_{m}I}\right)\,\cap\,D_{m}.

We clearly have IW(I)I\subseteq W(I). A DmD_{m}-ideal I\,I\, is Weyl closed if I=W(I)\,I=W(I)\, holds.

In general, it is a challenging task to compute the Weyl closure of a DmD_{m}-ideal, see [43] for the one-dimensional case and [44] in general. The following property is in particular shared by spaces of functions.

Definition 2.6.

A DmD_{m}-module M\,M\, is torsion-free if it is torsion-free as module over [x1,,xm]\mathbb{C}[x_{1},\ldots,x_{m}].

This class of DmD_{m}-modules allows to deduce further properties of annihilating DmD_{m}-ideals.

Lemma 2.7.

Let MMod(Dm)\,M\in\text{Mod}\mathopen{}\mathclose{{}\left(D_{m}}\right)\, be torsion-free and NN a subset of MM. Then AnnDm(N)\,\text{Ann}_{D_{m}}\mathopen{}\mathclose{{}\left(N}\right)\, is Weyl closed.

Proof.

Write a given PW(AnnDm(N))\,P\in W(\text{Ann}_{D_{m}}(N))\, as P=iqiPi\,P=\sum_{i}q_{i}P_{i}\, where qiRm\,q_{i}\in R_{m}\, and PiAnnDm(N)P_{i}\in\text{Ann}_{D_{m}}(N). We choose h[x1,,xm]\,h\in\mathbb{C}[x_{1},\ldots,x_{m}]\, such that hPAnnDm(N)hP\in\text{Ann}_{D_{m}}(N). Then for every fN\,f\in N\, we have hPf=0\,hP\bullet f=0\, and therefore Pf=0P\bullet f=0, since M\,M\, is torsion-free. ∎

Definition 2.8.

For a DmD_{m}-ideal II, its singular locus is the set

(2.2) Sing(I)ZChar(I)π(Z)¯𝔸m,\displaystyle\operatorname{Sing}(I)\,\coloneqq\,\bigcup_{Z\,\subseteq\,\operatorname{Char}(I)}\overline{\pi(Z)}\,\subseteq\,{\mathbb{A}}^{m},

where π\,\pi\, denotes the projection T𝔸m𝔸m\,T^{*}{\mathbb{A}}^{m}\xrightarrow{\ \ }{\mathbb{A}}^{m}\, and the union is over all irreducible components Z\,Z\, of Char(I)\,\operatorname{Char}(I)  distinct from the zero section {ξ1==ξm=0}\,\{\xi_{1}=\cdots=\xi_{m}=0\}\, as sets. Moreover, we denote by

(2.3) rank(I)dim(x)((x)[ξ]/(x)[ξ]in(0,e)(I))=dim(x)(Rm/RmI)\displaystyle\operatorname{rank}\mathopen{}\mathclose{{}\left(I}\right)\,\coloneqq\,\dim_{\mathbb{C}(x)}\mathopen{}\mathclose{{}\left(\mathbb{C}(x)[\xi]/\mathbb{C}(x)[\xi]\operatorname{in}_{(0,e)}(I)}\right)\,=\,\dim_{\mathbb{C}(x)}\mathopen{}\mathclose{{}\left(R_{m}/R_{m}I}\right)

the holonomic rank of II.

The second equality is a standard fact, we refer to [37, Section 1.4]. If I\,I\, is a holonomic DmD_{m}-ideal, rank(I)\operatorname{rank}(I) gives the dimension of the space of holomorphic solutions to I\,I\, in a simply connected domain outside the singular locus of I\,I\, by the theorem of Cauchy–Kowalevski–Kashiwara Theorem [25, p. 44], see also [37, Theorem 1.4.19]. The following result clarifies the relationship between the holonomic rank and holonomicity.

Lemma 2.9 ([37], Theorem 1.4.15).

Let I\,I\, be a DmD_{m}-ideal. If I\,I\, has finite holonomic rank, then its Weyl closure W(I)\,W(I)\, is a holonomic DmD_{m}-ideal.

The following characterization of holonomicity is useful.

Proposition 2.10.

Let M\,M\, be a torsion-free DmD_{m}-module and fMf\in M. Then the following statements are equivalent.

  1. (1)

    ff is holonomic.

  2. (2)

    For all k=1,,mk=1,\ldots,m, there exists a natural number m(k)\,m(k)\in{\mathbb{N}}\, and a non-zero differential operator Pk==0m(k)a(x1,,xm)kAnnDm(f).P_{k}=\sum_{\ell=0}^{m(k)}a_{\ell}(x_{1},\ldots,x_{m})\partial_{k}^{\ell}\in\operatorname{Ann}_{D_{m}}(f).

  3. (3)

    The annihilator of f\,f\, has finite holonomic rank.

Proof.

By the elimination property for holonomic ideals in the Weyl algebra (cf. [45, Lemma 4.1], with a proof attributed to Bernstein), (1)\Rightarrow(2) holds. The equivalence (2)\Leftrightarrow(3) is obvious. Finally, (3)\Rightarrow(1) follows from combining Lemma 2.7 with Lemma 2.9. ∎

Without the condition of torsion-freeness, there are counterexamples to the validity of (3)\Rightarrow(1), see e.g. [37, Example 1.4.10].

3. Hypergeometric functions of a matrix argument

In this section, we are going to introduce the hypergeometric functions of a matrix argument in the sense of Herz [15], see Definition 3.2. We will follow Constantine’s approach [4] via zonal polynomials.

3.1. Zonal polynomials

Zonal polynomials are important in multivariate analysis with applications in multivariate statistics. Their theory has been developed by James in [22, 23] and subsequent works, see the introduction of Chapter 12 of Farrell’s monograph [8] for a more complete list. The definition given by James in [23] relies on representation theoretic work of É. Cartan [3] and James also credits Hua [17, 18], see [19] for an English translation. As a general reference, the reader may consult the monographs of Farrell [8, Chapter 12], Takemura [42], and Muirhead [33]. The presentation here follows [33, Chapter 7].

Let m\,m\, be a fixed positive integer. Throughout, we only consider partitions of the form λ=(λ1,,λm)\,\lambda\,=\,(\lambda_{1},\ldots,\lambda_{m})\, of an integer d=|λ|λ1++λm\,d={\mathopen{}\mathclose{{}\left|\lambda}\right|}\coloneqq\lambda_{1}+\cdots+\lambda_{m}\, with λ1λ2λm0\,\lambda_{1}\geq\lambda_{2}\geq\cdots\geq\lambda_{m}\geq 0\, if not explicitly stated otherwise.

Definition 3.1.

For all partitions λ=(λ1,,λm)\,\lambda=(\lambda_{1},\ldots,\lambda_{m})\, of dd, the zonal polynomials Cλ[x1,,xm]\,C_{\lambda}\,\in\,{\mathbb{C}}[x_{1},\ldots,x_{m}]\, are defined to be the unique symmetric homogeneous polynomials of degree d\,d\, satisfying the following three properties.

  1. (1)

    The leading monomial with respect to the lexicographic order lex\,\prec_{\operatorname{lex}}\, with xmlexlexx1\,x_{m}\prec_{\operatorname{lex}}\dots\prec_{\operatorname{lex}}x_{1}\, is LMlex(Cλ)=xλ=x1λ1xmλm\operatorname{LM}_{\prec_{\operatorname{lex}}}(C_{\lambda})=x^{\lambda}=x_{1}^{\lambda_{1}}\cdots x_{m}^{\lambda_{m}}.

  2. (2)

    The functions Cλ\,C_{\lambda}\, are eigenfunctions of the operator

    Δ=i=1mxi2i2+i,j=1ijmxi2xixji,\Delta\,=\,\sum_{i=1}^{m}x_{i}^{2}\partial_{i}^{2}\,+\,\sum_{\begin{subarray}{c}i,j=1\\ i\neq j\end{subarray}}^{m}\frac{x_{i}^{2}}{x_{i}-x_{j}}\partial_{i},

    i.e, ΔCλ=αλCλ\Delta\bullet C_{\lambda}=\alpha_{\lambda}\cdot C_{\lambda} for some αλ\alpha_{\lambda}\in{\mathbb{C}}.

  3. (3)

    We have

    (x1++xm)d=|λ|=dCλ.(x_{1}+\cdots+x_{m})^{d}\,=\,\sum_{{\mathopen{}\mathclose{{}\left|\lambda}\right|}=d}C_{\lambda}.

The uniqueness and existence of course have to be proven, we refer to [33, Section 7.2], where also the eigenvalues αλ\,\alpha_{\lambda}\, are determined to be

αλ=ρλ+d(m1)with ρλ=i=1mλi(λii).\displaystyle\alpha_{\lambda}\,=\,\rho_{\lambda}\,+\,d\cdot(m-1)\quad\textrm{with }\,\,\rho_{\lambda}\,=\,\sum_{i=1}^{m}\lambda_{i}(\lambda_{i}-i).

Zonal polynomials can be explicitly calculated by a recursive formula for the coefficients in a basis of monomial symmetric functions. From this it follows that zonal polynomials have in fact rational coefficients. The space of symmetric polynomials has a basis given by symmetrizations of monomials. We can enumerate this basis by ordered partitions; the partition of a given basis element is its leading exponent in the lexicographic order. For a partition λ=(λ1,,λm)\,\lambda=(\lambda_{1},\ldots,\lambda_{m})\, we put:

Mλxλ+ all permutations =μ𝔖m.λxμ,\displaystyle M_{\lambda}\,\coloneqq\,x^{\lambda}+\textrm{ all permutations }\,=\,\sum_{\mu\in\mathfrak{S}_{m}.\lambda}x^{\mu},

where 𝔖m.λ\,\mathfrak{S}_{m}.\lambda\, denotes the orbit of the mm-th symmetric group 𝔖m\mathfrak{S}_{m}. We write the zonal polynomials with respect to this basis:

Cλ=μλcλ,μMμ.\displaystyle C_{\lambda}\,=\,\sum_{\mu\leq\lambda}c_{\lambda,\mu}M_{\mu}.

Zonal polynomials can now be computed explicitly thanks to the following recursive formula:

cλ,μ=κκiκjρλρμcλ,κ,\displaystyle c_{\lambda,\mu}\,=\,{\sum}_{\kappa}\ \frac{\kappa_{i}-\kappa_{j}}{\rho_{\lambda}-\rho_{\mu}}\ c_{\lambda,\kappa},

where the sum runs over all (not necessarily ordered) partitions κ=(κ1,,κm)\,{\kappa\,=\,(\kappa_{1},\ldots,\kappa_{m})}\, such that there exist i<j\,i<j\, with κk=μk\,\kappa_{k}=\mu_{k}\, for all ki,j\,k\neq i,j\, and κi=μi+t\kappa_{i}=\mu_{i}+t, κj=μjt\kappa_{j}=\mu_{j}-t for some t{1,,μj}\,t\in\{1,\ldots,\mu_{j}\}\, and such that μ<κλ\,\mu<\kappa\leq\lambda\, after reordering κ\kappa.

3.2. Hypergeometric functions of a matrix argument

Let Xm×m\,X\in{\mathbb{C}}^{m\times m}\, be a square matrix and λ=(λ1,,λm)\,\lambda=(\lambda_{1},\ldots,\lambda_{m})\, a partition. One defines the zonal polynomial Cλ(X)\,C_{\lambda}(X)\, as

Cλ(X)Cλ(x1,,xm),C_{\lambda}(X)\,\coloneqq\,C_{\lambda}(x_{1},\ldots,x_{m}),

where x1,,xm\,x_{1},\ldots,x_{m}\, are the eigenvalues of X\,X\, counted with multiplicities. Note that Cλ(X)\,C_{\lambda}(X)\, is well-defined because Cλ\,C_{\lambda}\, is a symmetric polynomial.

Definition 3.2.

The hypergeometric function of a matrix argument X\,X\, is given by

(3.1) Fqp(a1,,ap;c1,,cq)(X)k=0λk(a1)λ(ap)λ(c1)λ(cq)λCλ(X)k!,{{}_{p}F_{\!q}}(a_{1},\ldots,a_{p};c_{1},\ldots,c_{q})(X)\,\coloneqq\,\sum_{k=0}^{\infty}\sum_{\lambda\,\vdash k}\frac{(a_{1})_{\lambda}\cdots(a_{p})_{\lambda}}{(c_{1})_{\lambda}\cdots(c_{q})_{\lambda}}\frac{C_{\lambda}(X)}{k!},

where, for a partition λ=(λ1,,λm)\lambda=(\lambda_{1},\ldots,\lambda_{m}), the symbol (a)λ\,(a)_{\lambda}\, denotes the generalized Pochhammer symbol

(a)λi=1m(ai12)λi.\displaystyle(a)_{\lambda}\,\coloneqq\,\prod_{i=1}^{m}\mathopen{}\mathclose{{}\left(a-\frac{i-1}{2}}\right)_{\lambda_{i}}.

Here, for an integer \ell, the quantity (a)=a(a+1)(a+1)\,(a)_{\ell}=a(a+1)\cdots(a+\ell-1)\, with (a)0=1\,(a)_{0}=1\, is the usual Pochhammer symbol.

The parameters a1,,ap\,a_{1},\ldots,a_{p}\, and c1,,cq\,c_{1},\ldots,c_{q}\, in this definition are allowed to attain all complex values such that all the denominators (ci)λ\,(c_{i})_{\lambda}\, do not vanish. Explicitly,

(3.2) a1,,apandc1,,cq{()if m=1,{k2k,km1}if m2.a_{1},\ldots,a_{p}\in{\mathbb{C}}\ \ \text{and}\ \ c_{1},\ldots,c_{q}\in\begin{cases}{\mathbb{C}}\setminus(-{\mathbb{N}})&\text{if }\,m=1,\\ {\mathbb{C}}\setminus\big{\{}\frac{k}{2}\mid k\in{\mathbb{Z}},\,k\leq m-1\big{\}}&\text{if }\,m\geq 2.\end{cases}
Remark 3.3.

If X=diag(x1,0,,0)X=\text{diag}(x_{1},0,\ldots,0), it follows straight forward from Definition 3.1 of zonal polynomials that Fqp(a1,,ap;c1,,cq)(X){{}_{p}F_{\!q}}(a_{1},\ldots,a_{p};c_{1},\ldots,c_{q})(X) is the classical hypergeometric function Fqp(a1,,ap;c1,,cq)(x1){{}_{p}F_{\!q}}(a_{1},\ldots,a_{p};c_{1},\ldots,c_{q})(x_{1}) in one variable. Therefore, Definition 3.2 is indeed an appropriate generalization of hypergeometric functions in one variable.

The convergence behavior of the hypergeometric function of a matrix argument is analogous to the one-variable case, basically with the same proof. For pq\,p\leq q, this series converges for all XX. For p=q+1p=q+1, this series converges for X<1\|X\|<1, where \,\|\cdot\|\, denotes the maximum of the absolute values of the eigenvalues of XX. If p>q+1\,p>q+1, the series diverges for all X0X\neq 0.

4. Annihilating ideals of F 11{{}_{1}F_{\!\!\;1}}

Let F 11\,{{}_{1}F_{\!\!\;1}}\, be the hypergeometric function of a matrix argument as introduced in Definition 3.2. In this section, we systematically study a certain ideal that annihilates F 11{{}_{1}F_{\!\!\;1}}. This function depends on two complex parameters a,ca,c satisfying condition (3.2), which in this case means

(4.1) {cif m=1,c{k2k,km1}if m2.\begin{cases}c\notin-{\mathbb{N}}&\text{if }m=1,\\ c\notin\{\frac{k}{2}\mid k\in{\mathbb{Z}},k\leq m-1\}&\text{if }m\geq 2.\end{cases}

As discussed in the last section, the value of this function on a symmetric matrix Xm×m\,X\in{\mathbb{C}}^{m\times m}\, is the same as the value on the unique semisimple element in the GLm()\,\operatorname{GL}_{m}({\mathbb{C}})\, (conjugacy) orbit closure of XX. We may thus restrict our attention to the case where X\,X\, is diagonal. Then this hypergeometric function satisfies the following differential equations.

4.1. Setup and known results about the annihilator

Theorem 4.1.

[33, Theorem 7.5.6] Let m>0m\in{\mathbb{N}}_{>0} and let a,ca,c\in{\mathbb{C}} be parameters with cc satisfying (4.1). The function F 11(a;c)\,{{}_{1}F_{\!\!\;1}}(a;c)\, of a diagonal matrix argument X=diag(x1,,xm)\,X=\operatorname{diag}(x_{1},\ldots,x_{m})\, is the unique solution F\,F\, of the system of the m\,m\, linear partial differential equations given by the operators

(4.2) gkxkk2+(cm12xk+12kxkxkx)k12(kxxkx)a,\displaystyle g_{k}\,\coloneqq\,x_{k}\partial_{k}^{2}+\mathopen{}\mathclose{{}\left(c-\frac{m-1}{2}-x_{k}+\frac{1}{2}\sum_{\ell\neq k}\frac{x_{k}}{x_{k}-x_{\ell}}}\right)\partial_{k}-\frac{1}{2}\mathopen{}\mathclose{{}\left(\sum_{\ell\neq k}\frac{x_{\ell}}{x_{k}-x_{\ell}}\partial_{\ell}}\right)-a,

k=1,,m,k=1,\ldots,m, subject to the conditions that F\,F\, is symmetric in x1,,xmx_{1},\ldots,x_{m}, and F\,F\, is analytic at X=0X=0, and F(0)=1F(0)=1.

In fact, we will point out in 5.8 that in this theorem, the condition of symmetry in x1,,xm\,x_{1},\ldots,x_{m}\, can be dropped as it is implied by the other conditions. By using the identity

xkxkx= 1+xxkx,\displaystyle\frac{x_{k}}{x_{k}-x_{\ell}}\,=\,1\,+\,\frac{x_{\ell}}{x_{k}-x_{\ell}},

the operators from (4.2) can be written as

(4.3) gk=xkk2+(cxk)k+12(kxxkx(k))a.\displaystyle g_{k}\,=\,x_{k}\partial_{k}^{2}\,+\,(c-x_{k})\partial_{k}\,+\,\frac{1}{2}\mathopen{}\mathclose{{}\left(\sum_{\ell\neq k}\frac{x_{\ell}}{x_{k}-x_{\ell}}(\partial_{k}-\partial_{\ell})}\right)\,-\,a.

Clearing the denominators in (4.2), we obtain

(4.4) Pk(k(xkx))gkDm,k= 1,,m.P_{k}\,\coloneqq\,\mathopen{}\mathclose{{}\left(\prod_{\ell\neq k}(x_{k}-x_{\ell})}\right)\cdot g_{k}\,\in\,D_{m},\quad k\,=\,1,\ldots,m.
Definition 4.2.

We denote by Im\,I_{m}\, the DmD_{m}-ideal generated by P1,,Pm\,P_{1},\ldots,P_{m}\, and call it the Muirhead ideal.

Note that, by construction,

RmIm=(g1,,gm).R_{m}I_{m}\,=\,(g_{1},\ldots,g_{m}).

Our goal is to systematically study the ideal ImI_{m}. In this direction, Hashiguchi–Numata–Takayama–Takemura obtained the following result in [12].

Theorem 4.3 ([12, Theorem 2]).

For the graded lexicographic term order on RmR_{m}, a Gröbner basis of RmIm\,R_{m}I_{m}\, is given by {gk=xkk2+l.o.t.k=1,,m}\{g_{k}=x_{k}\partial_{k}^{2}+\text{l.o.t.}\mid k=1,\ldots,m\}.

An immediate consequence is:

Corollary 4.4.

The holonomic rank of Im\,I_{m}\, is given by rank(Im)=2m\operatorname{rank}(I_{m})=2^{m}. In particular, the Weyl closure W(Im)\,W(I_{m})\, of Im\,I_{m}\, and the function F 11\,{{}_{1}F_{\!\!\;1}}\, of a diagonal matrix are holonomic.

Proof.

This immediately follows from Theorem 4.3 and Lemma 2.9. ∎

At the end of Section 5 in [12], it is conjectured that Im\,I_{m}\, is holonomic. Via direct computation they show that I2\,I_{2}\, is holonomic in Appendix A of the paper. One can still verify holonomicity of I3\,I_{3}\, for generic parameters a,ca,c through a computation in Singular. It turns out, however, that the above conjecture does not hold. We are thankful to N. Takayama for pointing out that the D4D_{4}-ideal I4\,I_{4}\, was shown to be non-holonomic in the Master’s thesis [26]. We give an easy alternative argument for this in Example 6.6.

5. Analytic solutions to the Muirhead ideal

In this section, we determine the singular locus of the Muirhead ideal Im\,I_{m}\, and of its Weyl closure:

Theorem 5.1.

Let m>0m\in{\mathbb{N}}_{>0} and let a,ca,c\in{\mathbb{C}} be parameters. Then the singular locus of Im\,I_{m}\, agrees with the singular locus of W(Im)W(I_{m}). It is the hyperplane arrangement

(5.1) 𝒜{xm|i=1mxiji(xixj)=0}.\displaystyle{{\mathscr{A}}}\,\coloneqq\,\mathopen{}\mathclose{{}\left\{x\in{\mathbb{C}}^{m}\ \middle|\ \prod_{i=1}^{m}x_{i}\prod_{j\neq i}(x_{i}-x_{j})=0}\right\}.

To be more precise, in this section we will prove the statement under the additional

Assumption 5.2.

The parameter c\,c\, satisfies condition (4.1).

Note that this condition makes the function F11(a;c)\,{{}_{1}F_{1}(a;c)}\, well-defined. However, we would like to point out that this assumption is not necessary; a proof of the stronger statement is given in Appendix A. We are grateful to the anonymous referee for suggesting to investigate restriction modules which are the central tool in the proof presented there. As these are different techniques, we deem it worthwhile to also present our original proof, which is the purpose of this section.

The inclusion Sing(Im)𝒜\,\operatorname{Sing}(I_{m})\subseteq{{\mathscr{A}}}\, is readily seen from

in(0,e)(Pi)=xiji(xixj)i2.\operatorname{in}_{(0,e)}(P_{i})\,=\,x_{i}\prod_{j\neq i}(x_{i}-x_{j})\partial_{i}^{2}.

To prove the reverse containment, we investigate analytic solutions to the Muirhead system locally around points in the components of the arrangement 𝒜{{\mathscr{A}}}. Our main technical tool is the following observation resembling [37, Theorem 2.5.5]:

Lemma 5.3.

Let I\,I\, be a DmD_{m}-ideal and let u0mu\in{\mathbb{R}}_{\geq 0}^{m}. Then

dimSolx(I)dimSolx(in(u,u)(I)),\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(I)\,\leq\,\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(\operatorname{in}_{(-u,u)}(I)),

where Solx()\,\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(\cdot)\, denotes the solution space in the formal power series ring x{\mathbb{C}}\llbracket x\rrbracket.

Proof.

For f=αmλαxαxf=\sum_{\alpha\in{\mathbb{N}}^{m}}\lambda_{\alpha}x^{\alpha}\in{\mathbb{C}}\llbracket x\rrbracket, we denote111Our notation for the initial of a formal power series differs from the one used, among others, in [37, 39]. Ours is more coherent with the definition of initial forms of linear differential operators.

inu(f)uTα min.with λα0λαxαx.\operatorname{in}_{-u}(f)\,\coloneqq\sum_{\begin{subarray}{c}u^{T}\alpha\text{ min.}\\ \text{with }\lambda_{\alpha}\neq 0\end{subarray}}\lambda_{\alpha}x^{\alpha}\in{\mathbb{C}}\llbracket x\rrbracket.

If P=in(u,u)(P)+P~Dm\,P=\operatorname{in}_{(-u,u)}(P)+\tilde{P}\in D_{m}\, annihilates f=inu(f)+f~f=\operatorname{in}_{-u}(f)+\tilde{f}, then

0=Pf=in(u,u)(P)inu(f)+P~f+in(u,u)(P)f~0\,=\,P\bullet f\,=\,\operatorname{in}_{(-u,u)}(P)\bullet\operatorname{in}_{-u}(f)\,+\,\tilde{P}\bullet f\,+\,\operatorname{in}_{(-u,u)}(P)\bullet\tilde{f}

and all monomials appearing in the expanded expression P~f+in(u,u)(P)f\,\tilde{P}\bullet f+\operatorname{in}_{(-u,u)}(P)\bullet f\, are of higher uu-degree than those of in(u,u)(P)inu(f)\operatorname{in}_{(-u,u)}(P)\bullet\operatorname{in}_{-u}(f). Hence, in(u,u)(P)\operatorname{in}_{(-u,u)}(P) annihilates inu(f)\operatorname{in}_{-u}(f). This shows that for every DmD_{m}-ideal II, we have

(5.2) {inu(f)fSolx(I)}Solx(in(u,u)(I)).\mathopen{}\mathclose{{}\left\{\operatorname{in}_{-u}(f)\mid f\in\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(I)}\right\}\,\subseteq\,\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(\operatorname{in}_{(-u,u)}(I)).

Let F\,F\, be a basis of the solution space Solx(I)\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(I). Replacing F\,F\, by a suitable linear combination of its elements, we can assure that the initial forms inu(f)\,\operatorname{in}_{-u}(f)\, for fF\,f\in F\, are linearly independent. Then (5.2) implies

dimSolx(in(u,u)(I))|{inu(f)fF}|=|F|=dimSolx(I).\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(\operatorname{in}_{(-u,u)}(I))\,\geq\,|\{\operatorname{in}_{-u}(f)\mid f\in F\}|\,=\,|F|\,=\,\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(I).\qed

In the following two lemmata, we apply Lemma 5.3 to the Muirhead system and bound the spaces of analytic solutions locally around general points in 𝒜{{\mathscr{A}}}. Note that up to 𝔖m\mathfrak{S}_{m}-symmetry, there are two types of components in 𝒜{{\mathscr{A}}}, namely {xmx1=0}\{x\in{\mathbb{C}}^{m}\mid x_{1}=0\} and {xmx1=x2}\{x\in{\mathbb{C}}^{m}\mid x_{1}=x_{2}\}. Lemma 5.4 considers points that lie in exactly one component of 𝒜\,{{\mathscr{A}}}\, of the first type, while Lemma 5.5 is concerned with the second type.

Lemma 5.4.

Let pm\,p\in{\mathbb{C}}^{m}\, be a point with distinct coordinates, one of which is zero. If a,ca,c\in{\mathbb{C}} with c(m1)/2c\notin(m-1)/2-{\mathbb{N}}, then the space of formal power series solutions to Im\,I_{m}\, centered at p\,p\, is of dimension at most 2m12^{m-1}.

Proof.

Since Im\,I_{m}\, is invariant under the action of the symmetric group 𝔖m\mathfrak{S}_{m}, we may assume that the point p=(p1,,pm)\,p=(p_{1},\ldots,p_{m})\, has the unique zero coordinate p1=0p_{1}=0. Studying formal power series solutions to Im\,I_{m}\, around p\,p\, is equivalent to substituting xi\,x_{i}\, by xi+pi\,x_{i}+p_{i}\, in each of the generators P1,,Pm\,P_{1},\ldots,P_{m}\, and to studying the solutions in x\,{\mathbb{C}}\llbracket x\rrbracket\, of the resulting operators. Let us define u(3,2,,2)mu\coloneqq(3,2,\ldots,2)\in{\mathbb{R}}^{m}. Examining the expression for P1,,PmP_{1},\ldots,P_{m}, we observe that

(5.3) in(u,u)(P1|xx+p)=(1)m1p2p3pm1x1θ1(θ1+cm+12)andin(u,u)(Pi|xx+p)=piji(pipj)1xi2θi(θi1)for all i2,\begin{array}[]{lcl}\operatorname{in}_{(-u,u)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{1}}\right|_{x\,\mapsto x+p}}\big{)}&=&(-1)^{m-1}p_{2}p_{3}\cdots p_{m}\frac{1}{x_{1}}\theta_{1}\Big{(}\theta_{1}+c-\frac{m+1}{2}\Big{)}\quad\ \text{and}\\ \operatorname{in}_{(-u,u)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{i}\,}\right|_{x\,\mapsto x+p}}\big{)}&=&p_{i}\prod_{j\neq i}(p_{i}-p_{j})\frac{1}{x_{i}^{2}}\theta_{i}(\theta_{i}-1)\qquad\text{for all }\,i\geq 2,\end{array}

where θixii\theta_{i}\coloneqq x_{i}\partial_{i} and Pi|xx+p\,{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{i}}\right|_{x\,\mapsto x+p}}\, denotes the operator obtained from Pi\,P_{i}\, by replacing xx with x+px+p. Note that an operator P(θ1,,θm)[θ1,,θm]Dm\,P(\theta_{1},\ldots,\theta_{m})\in{\mathbb{C}}[\theta_{1},\ldots,\theta_{m}]\subseteq D_{m}\, acts on the one-dimensional vector spaces xα\,{\mathbb{C}}\cdot x^{\alpha}\, for αm\,\alpha\in{\mathbb{N}}^{m}\, with eigenvalue P(α)P(\alpha). In particular, the space of solutions in x\,{\mathbb{C}}\llbracket x\rrbracket\, of the operators (5.3) is spanned by the  2m1\,2^{m-1}\, monomials xα\,x^{\alpha}\, with α1=0\,\alpha_{1}=0\, and αi{0,1}\,\alpha_{i}\in\{0,1\}\, for all i2i\geq 2. Here, we have used that m+12c>0\,\frac{m+1}{2}-c\notin{\mathbb{N}}_{>0}\, by 5.2 on cc, which guarantees that formal power series solutions to θ1+cm+12\,\theta_{1}+c-\frac{m+1}{2}\, are constant in x1x_{1}. In particular, from Lemma 5.3, we conclude

dimSolx(Im|xx+p)\displaystyle\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptI_{m}}\right|_{x\,\mapsto x+p}}\big{)} dimSolx(in(u,u)(Im|xx+p))\displaystyle\,\leq\,\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}\big{(}\!\operatorname{in}_{(-u,u)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptI_{m}}\right|_{x\,\mapsto x+p}}\big{)}\big{)}
dimSolx(in(u,u)(Pi|xx+p)i=1,,m)= 2m1.\displaystyle\,\leq\,\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}\big{(}\!\operatorname{in}_{(-u,u)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{i}}\right|_{x\,\mapsto x+p}}\big{)}\mid i=1,\ldots,m\big{)}\,=\,2^{m-1}.

Lemma 5.5.

Let p=(p1,,pm)()m\,p=(p_{1},\ldots,p_{m})\in({\mathbb{C}}^{*})^{m}\, with #{p1,,pm}=m1\,\#\{p_{1},\ldots,p_{m}\}=m-1. For all a,ca,c\in{\mathbb{C}}, the space of formal power series solutions to Im\,I_{m}\, centered at p\,p\, is of dimension at most 2m232^{m-2}\cdot 3.

Proof.

We proceed similar to the proof of Lemma 5.4. By symmetry of ImI_{m}, we may assume that p1=p2p_{1}=p_{2}, while all other pairs of coordinates of p\,p\, are distinct. Denote e:=(1,,1)me:=(1,\ldots,1)\in{\mathbb{N}}^{m}. Then

in(e,e)(P1|xx+p)=12p1j=3m(p1pj)(2(x1x2)12+12),in(e,e)(P2|xx+p)=12p2j=3m(p2pj)(2(x1x2)22+12),in(e,e)(Pi|xx+p)=piji(pipj)1xj2θi(θi1)for i3\begin{array}[]{lcl}\operatorname{in}_{(-e,e)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{1}}\right|_{x\,\mapsto x+p}}\big{)}&\,=&\phantom{-}\frac{1}{2}p_{1}\prod_{j=3}^{m}(p_{1}-p_{j})\cdot(2(x_{1}-x_{2})\partial_{1}^{2}+\partial_{1}-\partial_{2}),\\ \operatorname{in}_{(-e,e)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{2}}\right|_{x\,\mapsto x+p}}\big{)}&\,=&-\frac{1}{2}p_{2}\prod_{j=3}^{m}(p_{2}-p_{j})\cdot(2(x_{1}-x_{2})\partial_{2}^{2}+\partial_{1}-\partial_{2}),\\ \operatorname{in}_{(-e,e)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{i}\,}\right|_{x\,\mapsto x+p}}\big{)}&\,=&\phantom{-\frac{1}{2}}p_{i}\,\prod_{j\neq i}\;(p_{i}-p_{j})\cdot\frac{1}{x_{j}^{2}}\theta_{i}(\theta_{i}-1)\qquad\text{for }\,i\geq 3\\ \end{array}

with θixii\theta_{i}\coloneqq x_{i}\partial_{i}. From the identity θixα=αixα\,\theta_{i}\bullet x^{\alpha}=\alpha_{i}x^{\alpha}\, for all αm\,\alpha\in{\mathbb{N}}^{m}\, we deduce that a basis of Solx({Pi|xx+pi})\,\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}\big{(}\big{\{}{\mathopen{}\mathclose{{}\left.\kern-1.2ptP_{i}}\right|_{x\,\mapsto x+p}}\mid i\big{\}}\big{)}\, is given by f(x1,x2)x3α3x4α4xmαmf(x_{1},x_{2})x_{3}^{\alpha_{3}}x_{4}^{\alpha_{4}}\cdots x_{m}^{\alpha_{m}}, where α3,,αm{0,1}\,{\alpha_{3},\ldots,\alpha_{m}\in\{0,1\}}\, and where f\,f\, varies over a basis of

Solx1,x2(2(x1x2)12+12, 2(x1x2)22+12).\operatorname{Sol}_{{\mathbb{C}}\llbracket x_{1},x_{2}\rrbracket}\Big{(}2(x_{1}-x_{2})\partial_{1}^{2}+\partial_{1}-\partial_{2},\;2(x_{1}-x_{2})\partial_{2}^{2}+\partial_{1}-\partial_{2}\Big{)}.

The latter is a 33-dimensional vector space spanned by {1,x1+x2,x12+6x1x2+x22}\{1,\,x_{1}+x_{2},\,x_{1}^{2}+6x_{1}x_{2}+x_{2}^{2}\}. This can be easily verified as follows. After the change of variables

y1(x1+x2)/2,y2(x1x2)/2,y1=1+2,y2=12,y_{1}\,\coloneqq\,(x_{1}+x_{2})/2,\quad y_{2}\,\coloneqq\,(x_{1}-x_{2})/2,\quad\partial_{y_{1}}\,=\,\partial_{1}+\partial_{2},\quad\partial_{y_{2}}\,=\,\partial_{1}-\partial_{2},

this system becomes

(y2(y1+y2)2+y2)f= 0,(y2(y1y2)2+y2)f= 0\displaystyle\mathopen{}\mathclose{{}\left(y_{2}\mathopen{}\mathclose{{}\left(\partial_{y_{1}}+\partial_{y_{2}}}\right)^{2}+\partial_{y_{2}}}\right)\bullet f\,=\,0,\qquad\mathopen{}\mathclose{{}\left(y_{2}\mathopen{}\mathclose{{}\left(\partial_{y_{1}}-\partial_{y_{2}}}\right)^{2}+\partial_{y_{2}}}\right)\bullet f\,=\,0

From summing these two equations, we observe that a solution fy1,y2f\in{\mathbb{C}}\llbracket y_{1},y_{2}\rrbracket needs to be annihilated by the operator y1y2\partial_{y_{1}}\partial_{y_{2}}. Therefore, we can write any solution as f=i0λiy1i+j1μjy2jf=\sum_{i\geq 0}\lambda_{i}y_{1}^{i}+\sum_{j\geq 1}\mu_{j}y_{2}^{j}. Plugging this into (y2(y1+y2)2+y2)f=0(y_{2}(\partial_{y_{1}}+\partial_{y_{2}})^{2}+\partial_{y_{2}})\bullet f=0, we observe that λi=μi=0\,\lambda_{i}=\mu_{i}=0\, for all i3i\geq 3, μ1=0\mu_{1}=0 and λ2=2μ2\lambda_{2}=-2\mu_{2}, leading to the basis of solutions

{1, 2y1=x1+x2, 8y124y22=x12+6x1x2+x22}.\mathopen{}\mathclose{{}\left\{1,\,2y_{1}=x_{1}+x_{2},\,8y_{1}^{2}-4y_{2}^{2}=x_{1}^{2}+6x_{1}x_{2}+x_{2}^{2}}\right\}.

With this, we have argued that the solution space of in(e,e)(Im|xx+p)\,\operatorname{in}_{(-e,e)}\big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptI_{m}}\right|_{x\,\mapsto x+p}}\big{)}\, is at most 32m23\cdot 2^{m-2}-dimensional. Together with Lemma 5.3, this proves the claim. ∎

Proof of 5.1.

First, we observe that

in(0,e)(Pi)=xiji(xixj)ξi2\operatorname{in}_{(0,e)}(P_{i})\,=\,x_{i}\prod_{j\neq i}(x_{i}-x_{j})\xi_{i}^{2}

and hence

Char(Im)redi=1m(V(ξi)V(xi)jiV(xixj))π1(𝒜)V(ξ1,,ξm),\operatorname{Char}(I_{m})^{\text{\rm red}}\,\subseteq\,\bigcap_{i=1}^{m}\Big{(}V(\xi_{i})\cup V(x_{i})\cup\bigcup_{j\neq i}V(x_{i}-x_{j})\Big{)}\,\subseteq\,\pi^{-1}({{\mathscr{A}}})\cup V(\xi_{1},\ldots,\xi_{m}),

where π:T𝔸m𝔸m\,\pi\colon T^{*}{\mathbb{A}}^{m}\xrightarrow{\ \ }{\mathbb{A}}^{m}\, denotes the natural projection. By definition of the singular locus, this proves the containment

Sing(W(Im))Sing(Im)𝒜.\operatorname{Sing}(W(I_{m}))\,\subseteq\,\operatorname{Sing}(I_{m})\,\subseteq\,{{\mathscr{A}}}.

For the reverse inclusion, consider a point pm\,p\in{\mathbb{C}}^{m}\, contained in exactly one irreducible component of 𝒜{{\mathscr{A}}}. By Lemma 5.4 and Lemma 5.5, the space of formal power series solutions to Im\,I_{m}\, (or, equivalently, to W(Im)W(I_{m})) around p\,p\, is of dimension strictly smaller than 2m=rank(Im)=rank(W(Im))2^{m}=\operatorname{rank}(I_{m})=\operatorname{rank}(W(I_{m})). In particular, pp needs to be a singular point of Im\,I_{m}\, and of W(Im)W(I_{m}), as otherwise the Cauchy–Kowalevski–Kashiwara Theorem implies the existence of  2m\,2^{m}\, linearly independent analytic solutions around pp. In particular, the singular loci of Im\,I_{m}\, and of W(Im)\,W(I_{m})\, must contain those points. Since singular loci are closed, we conclude that they contain the entire arrangement 𝒜{{\mathscr{A}}}. ∎

Remark 5.6.

The condition (4.1) on the parameter c\,c\, is very natural from the point of view of analytic functions, as the hypergeometric function F 11(a;c)\,{{}_{1}F_{\!\!\;1}}(a;c)\, of a diagonal matrix argument is only defined under this condition.222Note however that our proof of 5.1 in this section relies only on the condition that c(m1)/2c\notin(m-1)/2-{\mathbb{N}}, which is slightly weaker than (4.1). However, the Muirhead ideal itself is defined for arbitrary a,c\,a,c\in{\mathbb{C}}\, and is the more interesting object from the point of view of DD-module theory.

The description of the singular locus in 5.1 gives rise to the following lower bound on the characteristic variety. In Section 6, we will also discuss an upper bound and a conjectural description of the characteristic variety.

Corollary 5.7.

The characteristic variety of W(Im)\,W(I_{m})\, contains the zero section and the conormal bundles of the irreducible components of 𝒜{{\mathscr{A}}}, i.e.,

Char(W(Im))\displaystyle\operatorname{Char}(W(I_{m}))\,\supseteq\, V(ξ1,,ξm)iV(xi,ξ1,,ξi^,,ξm)\displaystyle V\mathopen{}\mathclose{{}\left(\xi_{1},\ldots,\xi_{m}}\right)\,\cup\,\bigcup_{i}V(x_{i},\xi_{1},\ldots,\widehat{\xi_{i}},\ldots,\xi_{m})
ijV(xixj,ξi+ξj,ξ1,,ξi^,,ξj^,,ξm).\displaystyle\quad\cup\bigcup_{i\neq j}V(x_{i}-x_{j},\,\xi_{i}+\xi_{j},\,\xi_{1},\ldots,\widehat{\xi_{i}},\ldots,\widehat{\xi_{j}},\ldots,\xi_{m}).
Proof.

As already noted in the introduction after (1.5), the linear spaces on the right hand side of the claimed inclusion are conormal varieties. By Theorem 2.3, the conormal varieties to the irreducible components of Sing(W(Im))\,\operatorname{Sing}(W(I_{m}))\, are contained in Char(W(Im))\operatorname{Char}(W(I_{m})). Moreover, the zero section V(ξ1,,ξm)\,V\mathopen{}\mathclose{{}\left(\xi_{1},\ldots,\xi_{m}}\right)\, is always contained in the characteristic variety. Theorem 5.1 concludes the proof. ∎

Above, we have studied bounds on solutions to the Muirhead system locally around points in m\,{\mathbb{C}}^{m}\, contained in exactly one component of 𝒜{{\mathscr{A}}}, while the Cauchy–Kowalevski–Kashiwara Theorem describes the behavior around points in m𝒜{\mathbb{C}}^{m}\setminus{{\mathscr{A}}}. A more detailed study around special points p𝒜\,p\in{{\mathscr{A}}}\, where several components of 𝒜\,{{\mathscr{A}}}\, intersect may be of interest.

We finish this section by looking at the most degenerate case: p=0p=0. Recall from 4.1 that F 11\,{}_{1}F_{\!\!\;1}\, is the unique analytic solution to Im\,I_{m}\, around  0\,0\, that is symmetric and normalized to attain the value  1\,1\, at the origin. In fact, the restricting factor assuring uniqueness here is not the symmetry, but the analyticity around 0. Namely, using the techniques presented before, we arrive at the following refinement of 4.1:

Proposition 5.8.

Let m>0m\in{\mathbb{N}}_{>0} and let a,ca,c\in{\mathbb{C}} be parameters with cc satisfying (4.1). Then F 11(a;c)\,{}_{1}F_{\!\!\;1}(a;c)\, is the unique formal power series solution to Im\,I_{m}\, around  0\,0\, with F 11(a;c)(0)=1{}_{1}F_{\!\!\;1}(a;c)(0)=1. In particular, F 11(a;c){}_{1}F_{\!\!\;1}(a;c) is the unique convergent power series solution to Im\,I_{m}\, around  0\,0\, with F 11(a;c)(0)=1{}_{1}F_{\!\!\;1}(a;c)(0)=1.

Proof.

Consider any weight vector u0m\,u\in{\mathbb{R}}_{\geq 0}^{m}\, with 0<u1<u2<<um0<u_{1}<u_{2}<\cdots<u_{m}. From the definition of P1,,PmP_{1},\ldots,P_{m}, we see that for all i{1,,m}i\in\{1,\ldots,m\}:

in(u,u)(Pi)=(1)i12x1xi1ximi1(2θi2+(2ci1)θij=i+1mθj),\operatorname{in}_{(-u,u)}(P_{i})\,=\,\frac{(-1)^{i-1}}{2}x_{1}\cdots x_{i-1}\cdot x_{i}^{m-i-1}\cdot\mathopen{}\mathclose{{}\left(2\theta_{i}^{2}+(2c-i-1)\theta_{i}-\sum_{j=i+1}^{m}\theta_{j}}\right),

where θixii\theta_{i}\coloneqq x_{i}\partial_{i}. In particular, the Weyl closure of in(u,u)(I)\,\operatorname{in}_{(-u,u)}(I)\, contains the operators Qi2θi2+(2ci1)θij=i+1mθjQ_{i}\coloneqq 2\theta_{i}^{2}+(2c-i-1)\theta_{i}-\sum_{j=i+1}^{m}\theta_{j}. The action of operators in [θ1,,θm]Dm\,{\mathbb{C}}[\theta_{1},\ldots,\theta_{m}]\subseteq D_{m}\, on x\,{\mathbb{C}}\llbracket x\rrbracket\, diagonalizes with respect to the basis of x\,{\mathbb{C}}\llbracket x\rrbracket\, given by the monomials. In particular, Solx(Q1,,Qm)\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(Q_{1},\ldots,Q_{m}) is a subspace of Solx(in(u,u)(I))\,\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(\operatorname{in}_{(-u,u)}(I))\, spanned by monomials. Therefore, by Lemma 5.3, it suffices to show that the only monomial annihilated by Q1,,Qm\,Q_{1},\ldots,Q_{m}\, is 11.

Let αm\,\alpha\in{\mathbb{N}}^{m}\, be such that xα\,x^{\alpha}\, is annihilated by Q1,,QmQ_{1},\ldots,Q_{m}. Assume for contradiction that α0\,\alpha\neq 0\, and let i{1,,m}\,i\in\{1,\ldots,m\}\, be maximal such that αi0\alpha_{i}\neq 0. Then

0=Qixα= 2αi2+(2ci1)αij=i+1mαj=αi(2αi+2ci1).0\,=\,Q_{i}\bullet x^{\alpha}\,=\,2\alpha_{i}^{2}+(2c-i-1)\alpha_{i}-\sum_{j=i+1}^{m}\alpha_{j}\,=\,\alpha_{i}\cdot(2\alpha_{i}+2c-i-1).

Note that c{k2k,km1}\,c\notin\{\frac{k}{2}\mid k\in{\mathbb{Z}},\,k\leq m-1\}\, guarantees  2+2ci10\,2\ell+2c-i-1\neq 0\, for all positive integers \ell. This contradicts the assumption αi0\alpha_{i}\neq 0. We conclude that

Solx(in(u,u)(I))Solx(Q1,,Qm)={1}\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(\operatorname{in}_{(-u,u)}(I))\,\subseteq\,\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(Q_{1},\ldots,Q_{m})\,=\,{\mathbb{C}}\cdot\{1\}

and therefore dimSolx(Im)1\dim\operatorname{Sol}_{{\mathbb{C}}\llbracket x\rrbracket}(I_{m})\leq 1. The last claim is now immediate. ∎

6. Characteristic variety of the Muirhead ideal

In this section, we give a conjectural description of the (reduced) characteristic variety of the Weyl closure of the Muirhead ideal ImI_{m}, see Conjecture 6.2. The conjecture based on our computations and further evidence is provided by the partial results obtained in Corollary 5.7 and Proposition 6.3. The description of Char(W(Im))\,\operatorname{Char}\mathopen{}\mathclose{{}\left(W(I_{m})}\right)\, is combinatorial in nature and would imply that the number of irreducible components is given by the (m+1)(m+1)-st Bell number Bm+1B_{m+1}.

6.1. Conjectural structure of the characteristic variety

Let us first explain some notations.

Notation 6.1.

We denote [m]={1,,m}[m]=\{1,\ldots,m\}. We consider partitions of this set [m]=J0J1Jk[m]=J_{0}\sqcup J_{1}\sqcup\ldots\sqcup J_{k}, where J0\,J_{0}\, is allowed to be empty, the Ji\,J_{i}\, with i0\,i\neq 0\, are nonempty, and we consider the J1,,Jk\,J_{1},\ldots,J_{k}\, as unordered. Taking into account that J0\,J_{0}\, plays a distinguished role, we denote such a partition by J0J1JkJ_{0}\mid J_{1}\ldots J_{k}.

For a partition [m]=J0J1Jk[m]=J_{0}\mid J_{1}\dots J_{k}, we denote by CJ0|J1Jk\,C_{J_{0}|J_{1}\ldots J_{k}}\, the mm-dimensional linear subspace

(6.1) V({xjjJ0}{iJξi|= 1,,k}=1k{xixji,jJ})\displaystyle V\>\bigg{(}\{x_{j}\mid j\in J_{0}\}\cup\Big{\{}\sum_{i\in J_{\ell}}\xi_{i}\mathopen{}\mathclose{{}\left.\vphantom{\Big{\{}}\;|\;\vphantom{\Big{\}}}}\right.\ell\,=\,1,\ldots,k\Big{\}}\cup\bigcup_{\ell=1}^{k}\{x_{i}-x_{j}\mid i,j\in J_{\ell}\}\bigg{)}

of T𝔸m=𝔸2m=Spec[x1,,xm,ξ1,,ξm]T^{*}{\mathbb{A}}^{m}={\mathbb{A}}^{2m}=\operatorname{Spec}{\mathbb{C}}[x_{1},\ldots,x_{m},\xi_{1},\ldots,\xi_{m}].

Let Bk\,B_{k}\in{\mathbb{N}}\, denote the kk-th Bell number, i.e., the number of partitions of a set of size kk. For example B1=1B_{1}=1, B2=2B_{2}=2, B3=5B_{3}=5, B4=15B_{4}=15, B5=52B_{5}=52, and so on. For the Muirhead ideal Im\,I_{m}, the characteristic variety of its Weyl closure W(Im)\,W(I_{m})\, has the following conjectural description.

Conjecture 6.2.

The (reduced) characteristic variety of W(Im)\,W(I_{m})\, is the following arrangement of mm-dimensional linear spaces:

Char(W(Im))red=[m]=J0JkCJ0|J1Jk.\operatorname{Char}(W(I_{m}))^{\text{\rm red}}\,=\,\bigcup_{[m]\,=\,J_{0}\sqcup\dots\sqcup J_{k}}C_{J_{0}|J_{1}\ldots J_{k}}.

In particular, Char(W(Im))\operatorname{Char}(W(I_{m})) has Bm+1\,B_{m+1}\, many irreducible components.

As I4\,I_{4}\, is not holonomic, it does not seem reasonable to make predictions about Char(Im)\operatorname{Char}(I_{m}). The better object to study is its Weyl closure, which is challenging to compute. The appearance of the Bell numbers in the conjecture is explained by the following observation:

We have a bijection of sets

(6.2) k=1m{Ordered partitions {0,1,,m}=J~1J~k}/𝔖k1:1k=0m{Ordered partitions {1,,m}=J0J1Jk}/𝔖k,\displaystyle\begin{split}&\bigcup_{k=1}^{m}\,\big{\{}\text{Ordered partitions }\{0,1,\ldots,m\}\,=\,\tilde{J}_{1}\sqcup\ldots\sqcup\tilde{J}_{k}\big{\}}/\mathfrak{S}_{k}\\ \stackrel{{\scriptstyle 1:1}}{{\rightleftarrows}}\,\,&\bigcup_{k=0}^{m}\,\big{\{}\text{Ordered partitions }\{1,\ldots,m\}\,=\,J_{0}\sqcup J_{1}\sqcup\ldots\sqcup J_{k}\big{\}}/\mathfrak{S}_{k},\end{split}

defined by J0J~i{0}\,J_{0}\coloneqq\tilde{J}_{i}\setminus\{0\}\, for 0J~i0\in\tilde{J}_{i}, where on the right hand side of (6.2), the symmetric group 𝔖k\,\mathfrak{S}_{k}\, acts on J1JkJ_{1}\sqcup\ldots\sqcup J_{k}. It is important to note that J0\,J_{0}\, is allowed to be empty, and J0\,J_{0}\, is the only set among the Ji\,J_{i}\, and J~j\,\tilde{J}_{j}\, with this property.

6.2. Bounds for the characteristic variety

Next, we give an upper bound for the reduced characteristic variety Char(Im)red\,\operatorname{Char}(I_{m})^{\rm red}\, and hence a fortiori an upper bound for Char(W(Im))red\operatorname{Char}(W(I_{m}))^{\operatorname{red}}. By upper bound, we mean a variety containing the given variety. Note that we already proved a lower bound for Char(W(Im))\,\operatorname{Char}(W(I_{m}))\, in 5.7.

For a partition J0J1Jk\,J_{0}\mid J_{1}\ldots J_{k}\, of [m][m], we defined the linear subspace CJ0|J1Jk\,C_{J_{0}|J_{1}\dots J_{k}}\, of 𝔸2m{\mathbb{A}}^{2m}\, in (6.1). We denote by C^J0|J1Jk\,\widehat{C}_{J_{0}|J_{1}\dots J_{k}}\, the linear space

(6.3) V({xjjJ0}=1k{xixji,jJ}{iJξi=1,,k s.t. |J|2})\displaystyle V\bigg{(}\{x_{j}\mid j\in J_{0}\}\cup\bigcup_{\ell=1}^{k}\{x_{i}-x_{j}\mid i,j\in J_{\ell}\}\cup\Big{\{}\sum_{i\in J_{\ell}}\xi_{i}\mid\ell=1,\ldots,k\text{ s.t.\ }|J_{\ell}|\leq 2\Big{\}}\bigg{)}

of 𝔸2m{\mathbb{A}}^{2m}. Clearly, C^J0|J1JkCJ0|J1Jk\widehat{C}_{J_{0}|J_{1}\dots J_{k}}\supseteq C_{J_{0}|J_{1}\dots J_{k}}, with equality if and only if |J|2\,|J_{\ell}|\leq 2\, for =1,,k\ell=1,\ldots,k. Further evidence for Conjecture 6.2 is given by the following result.

Proposition 6.3.

The (reduced) characteristic variety of Im\,I_{m}\, is contained in the arrangement of the linear spaces C^J0|J1Jk\widehat{C}_{J_{0}|J_{1}\dots J_{k}}:

Char(Im)red[m]=J0J1JkC^J0|J1Jk.\operatorname{Char}(I_{m})^{\text{\rm red}}\,\subseteq\,\bigcup_{[m]\,=\,J_{0}\sqcup J_{1}\sqcup\dots\sqcup J_{k}}\widehat{C}_{J_{0}|J_{1}\ldots J_{k}}.

In particular, this also gives an upper bound for Char(W(Im))red\operatorname{Char}(W(I_{m}))^{\text{\rm red}}.

Proof.

The characteristic variety of Im\,I_{m}\, is defined by the vanishing of the symbols in(0,e)(P)[x][ξ]\,\operatorname{in}_{(0,e)}(P)\in{\mathbb{C}}[x][\xi]\, of all operators PImP\in I_{m}. Hence, describing explicit symbols in in(0,e)(Im)\,\operatorname{in}_{(0,e)}(I_{m})\, bounds Char(Im)\,\operatorname{Char}(I_{m})\, from above. We observe that

in(0,e)(Pi)=xi(ji(xixj))ξi2for i= 1,,m.\displaystyle\operatorname{in}_{(0,e)}(P_{i})\,=\,x_{i}\cdot\mathopen{}\mathclose{{}\left(\prod_{j\neq i}(x_{i}-x_{j})}\right)\cdot\xi_{i}^{2}\qquad\text{for }\,i\,=\,1,\ldots,m.

Moreover, for iji\neq j, consider the following operators in ImI_{m}:

Sijxj(ki,j(xjxk))j2Pi+xi(ki,j(xixk))i2Pj.S_{ij}\,\coloneqq\,x_{j}\cdot\mathopen{}\mathclose{{}\left(\prod_{k\neq i,j}(x_{j}-x_{k})}\right)\cdot\partial_{j}^{2}\cdot P_{i}\,+\,x_{i}\cdot\mathopen{}\mathclose{{}\left(\prod_{k\neq i,j}(x_{i}-x_{k})}\right)\cdot\partial_{i}^{2}\cdot P_{j}.

This expression can be seen as the SS-pair of the operators PiP_{i} and PjP_{j} for graded term orders on RmR_{m}. A straightforward computation by hand reveals that

in(0,e)(Sij)=12xixj(ki,j(xixk)(xjxk))(ξi+ξj)3+(xixj)Qij\displaystyle\operatorname{in}_{(0,e)}(S_{ij})\,=\,-\frac{1}{2}x_{i}x_{j}\Big{(}\prod_{k\neq i,j}(x_{i}-x_{k})(x_{j}-x_{k})\Big{)}\big{(}\xi_{i}+\xi_{j}\big{)}^{3}+(x_{i}-x_{j})Q_{ij}

for some Qij[x][ξ]Q_{ij}\in{\mathbb{C}}[x][\xi].

Since these operators lie in the Muirhead ideal, we have

Char(Im)V(in(0,e)(Pi),in(0,e)(Sij)ij)Z,\operatorname{Char}\mathopen{}\mathclose{{}\left(I_{m}}\right)\,\subseteq\,V\,\mathopen{}\mathclose{{}\left(\operatorname{in}_{(0,e)}(P_{i}),\,\operatorname{in}_{(0,e)}(S_{ij})\mid i\neq j}\right)\,\eqqcolon\,Z,

so it suffices to see that Z\,Z\, is set-theoretically contained in the union of all C^J0|J1Jk\,\widehat{C}_{J_{0}|J_{1}\dots J_{k}}. We prove this by the comparing their fibers over 𝔸m=Spec[x1,,xm]{\mathbb{A}}^{m}=\operatorname{Spec}{\mathbb{C}}[x_{1},\ldots,x_{m}]. Let z=(z1,,zm)𝔸m\,z=(z_{1},\ldots,z_{m})\in{\mathbb{A}}^{m}\, and let [m]=J0J1Jk\,[m]=J_{0}\sqcup J_{1}\sqcup\dots\sqcup J_{k}\, be a partition of [m]\,[m]\, such that

(6.4) zi=0iJ0andzi=zj:i,jJ.\displaystyle z_{i}=0\,\Leftrightarrow\,i\in J_{0}\qquad\text{and}\qquad z_{i}=z_{j}\,\Leftrightarrow\,\exists\ell:i,j\in J_{\ell}.

Note that this partition is uniquely determined by the point z\,z\, up to permuting J1,,JkJ_{1},\ldots,J_{k}. Let F\,F\, denote the fiber of Z\,Z\, over the point zz. We claim that F\,F\, is set-theoretically contained in the fiber of C^J0|J1Jk\,\widehat{C}_{J_{0}|J_{1}\dots J_{k}}\, over zz.

To prove this claim, it suffices to see that for all singletons J={n}\,J_{\ell}=\{n\}\, and two-element sets J={i,j}\,J_{\ell^{\prime}}=\{i,j\}\, in our partition, where 1,k1\leq\ell,\,\ell^{\prime}\leq k, the polynomials ξn2\,\xi_{n}^{2}\, and (ξi+ξj)3\,(\xi_{i}+\xi_{j})^{3}\, vanish on FF. But for those n,i,j,n,i,j, the polynomial

(6.5) in(0,e)(Pn)|𝐱=z=zn(jn(znzj))ξn2{\mathopen{}\mathclose{{}\left.\kern-1.2pt\operatorname{in}_{(0,e)}(P_{n})\,}\right|_{\mathbf{x}\,=\,z}}\,=\,z_{n}\cdot\mathopen{}\mathclose{{}\left(\prod_{j\neq n}(z_{n}-z_{j})}\right)\cdot\xi_{n}^{2}

is a non-zero multiple of ξn2\,\xi_{n}^{2}\, by (6.4), since J\,J_{\ell}\, is a singleton, and

(6.6) in(0,e)(Sij)|𝐱=z=12zizjpi,j(zizp)(zjzp)(ξi+ξj)3{\mathopen{}\mathclose{{}\left.\kern-1.2pt\operatorname{in}_{(0,e)}(S_{ij})\,}\right|_{\mathbf{x}=z}}\,=\,-\frac{1}{2}z_{i}z_{j}\cdot\prod_{p\neq i,j}(z_{i}-z_{p})(z_{j}-z_{p})(\xi_{i}+\xi_{j})^{3}

is a non-zero multiple of (ξi+ξj)3(\xi_{i}+\xi_{j})^{3}. Here, we have used that zi=zj\,z_{i}=z_{j}\, by construction of the partition J0|J1JkJ_{0}|J_{1}\dots J_{k}.

Both (6.5) and (6.6) vanish on F\,F\, by the definition of ZZ, and hence ξn\,\xi_{n}\, and ξi+ξj\,\xi_{i}+\xi_{j}\, vanish on the set FredF^{\text{\rm red}}, disregarding the scheme structure. This shows that FredC^J0|J1JkF^{\rm red}\subseteq\widehat{C}_{J_{0}|J_{1}\dots J_{k}}. In particular,

Char(I)redZred[m]=J0J1JkC^J0|J1Jk,\operatorname{Char}(I)^{\rm red}\,\subseteq\,Z^{\rm red}\,\subseteq\,\bigcup_{[m]\,=\,J_{0}\sqcup J_{1}\sqcup\dots\sqcup J_{k}}\widehat{C}_{J_{0}|J_{1}\ldots J_{k}},

concluding the proof. ∎

6.3. Examples

The computational difficulty of questions concerning the characteristic variety Char(Im)\operatorname{Char}(I_{m}), the Weyl closure W(Im)W(I_{m}), its characteristic variety, irreducible components, and more increases rapidly with the number of variables mm. For m=2,3,m=2,3, we succeed with straightforward computations in Singular to obtain the characteristic variety and its decomposition into irreducible components. For m=2m=2, also the Weyl closure W(Im)\,W(I_{m})\, is computable, but already for m=3m=3 this is no longer feasible. For m=4m=4, none of the computer calculations terminate. We provide more precise information in the following examples.

Example 6.4.

We consider the case m=2m=2. We perform our computations for generic a,ca,c, i.e., in

(a,c)[x1,,xm]1,,m\mathbb{Q}(a,c)[x_{1},\ldots,x_{m}]\langle\partial_{1},\ldots,\partial_{m}\rangle

with indeterminates a,ca,c. Computations in Singular show that the characteristic variety Char(I2)\,\operatorname{Char}\mathopen{}\mathclose{{}\left(I_{2}}\right)\, set-theoretically decomposes into the following five irreducible components

(6.7) V(x1,x2)V(x1,ξ2)V(ξ1,x2)V(ξ1,ξ2)V(ξ1+ξ2,x1x2).\displaystyle V\mathopen{}\mathclose{{}\left(x_{1},x_{2}}\right)\,\cup\,V\mathopen{}\mathclose{{}\left(x_{1},\xi_{2}}\right)\,\cup\,V\mathopen{}\mathclose{{}\left(\xi_{1},x_{2}}\right)\,\cup\,V\mathopen{}\mathclose{{}\left(\xi_{1},\xi_{2}}\right)\,\cup\,V\mathopen{}\mathclose{{}\left(\xi_{1}+\xi_{2},\,x_{1}-x_{2}}\right).

Already for m=2m=2, the ideal Im\,I_{m}\, and its Weyl closure W(Im)\,W(I_{m})\, differ. The operator

P=g1g2=(x112x222)(x11x22)+(c12)(12)P\,=\,g_{1}-g_{2}\,=\,(x_{1}\partial_{1}^{2}-x_{2}\partial_{2}^{2})-(x_{1}\partial_{1}-x_{2}\partial_{2})+(c-\frac{1}{2})(\partial_{1}-\partial_{2})

is clearly in W(I2)I2W(I_{2})\setminus I_{2}. In fact, W(I2)=I2+(P)W(I_{2})=I_{2}+(P). Moreover, Char(I2)red=Char(W(I2))red\operatorname{Char}(I_{2})^{\operatorname{red}}=\operatorname{Char}(W(I_{2}))^{\operatorname{red}}\, but the multiplicities of the irreducible components are different. In the order of appearance in (6.7), the irreducible components have multiplicities  4,2,2,4,3\,4,2,2,4,3\, in I2\,I_{2}\, and  3,2,2,4,1\,3,2,2,4,1\, in W(I2)W(I_{2}).

The decomposition (6.7) will also turn out to be a byproduct of our more general result presented in Proposition 6.3.

Example 6.5.

Next we consider the case m=3m=3. Computations for generic a,ca,c in Singular show that Char(I3)\,\operatorname{Char}\mathopen{}\mathclose{{}\left(I_{3}}\right)\, decomposes into the  15=B4\,15=B_{4}\, irreducible components

V(x1,x2,x3)V(ξ1,x2,x3)V(x1,ξ2,x3)V(x1,x2,ξ3)\displaystyle V(x_{1},x_{2},x_{3})\ \cup\ V(\xi_{1},x_{2},x_{3})\ \cup\ V(x_{1},\xi_{2},x_{3})\ \cup\ V(x_{1},x_{2},\xi_{3})
V(ξ1,ξ2,x3)V(ξ1,x2,ξ3)V(x1,ξ2,ξ3)V(ξ1,ξ2,ξ3)\displaystyle\ \ \cup\ V(\xi_{1},\xi_{2},x_{3})\ \cup\ V(\xi_{1},x_{2},\xi_{3})\ \cup\ V(x_{1},\xi_{2},\xi_{3})\ \cup\ V(\xi_{1},\xi_{2},\xi_{3})
V(x1x2,ξ1+ξ2,x3)V(x1x3,ξ1+ξ3,x2)V(x2x3,ξ2+ξ3,x1)\displaystyle\ \ \cup\ V(x_{1}-x_{2},\,\xi_{1}+\xi_{2},\,x_{3})\ \cup\ V(x_{1}-x_{3},\,\xi_{1}+\xi_{3},\,x_{2})\,\cup\ V(x_{2}-x_{3},\,\xi_{2}+\xi_{3},\,x_{1})
V(x1x2,ξ1+ξ2,ξ3)V(x1x3,ξ1+ξ3,ξ2)V(x2x3,ξ2+ξ3,ξ1)\displaystyle\ \ \cup\ V(x_{1}-x_{2},\,\xi_{1}+\xi_{2},\,\xi_{3})\ \cup\ V(x_{1}-x_{3},\,\xi_{1}+\xi_{3},\,\xi_{2})\ \cup\ V(x_{2}-x_{3},\,\xi_{2}+\xi_{3},\,\xi_{1})
V(x1x2,x1x3,ξ1+ξ2+ξ3).\displaystyle\ \ \cup\ V(x_{1}-x_{2},\,x_{1}-x_{3},\,\xi_{1}+\xi_{2}+\xi_{3}).

If we compare this to our upper bound for the characteristic variety Char(W(I3))\,\operatorname{Char}(W(I_{3}))\, from Proposition 6.3, we see that the only difference between the components in (6.1) and (6.3) is that instead of V(x1,x2,x3)\,V(x_{1},x_{2},x_{3})\, and V(x1x2,x2x3,ξ1+ξ2+ξ3)V(x_{1}-x_{2},\,x_{2}-x_{3},\,\xi_{1}+\xi_{2}+\xi_{3}), we only have the component BV(x1x2,x2x3)T𝔸3\,B\coloneqq V(x_{1}-x_{2},\,x_{2}-x_{3})\subseteq T^{*}{\mathbb{A}}^{3}\, in the upper bound. However, the Weyl closure is holonomic by Lemma 2.9 and thus the components of its characteristic variety are the conormals to their projections to 𝔸3\,{\mathbb{A}}^{3}\, by Theorem 2.3. Such a projection is a closed subvariety of the diagonal V(x1x2,x2x3)𝔸3V(x_{1}-x_{2},\,x_{2}-x_{3})\subseteq{\mathbb{A}}^{3}, hence either equal to it or equal to a point. The corresponding conormal varieties are V(x1x2,x2x3,ξ1+ξ2+ξ3)\,V(x_{1}-x_{2},\,x_{2}-x_{3},\,\xi_{1}+\xi_{2}+\xi_{3})\, and the cotangent spaces to the points pλ(λ,λ,λ)\,p_{\lambda}\coloneqq(\lambda,\lambda,\lambda)\, for some λ\lambda\in{\mathbb{C}}. It turns out that the components V(x1x2,x2x3,ξ1+ξ2+ξ3)\,V(x_{1}-x_{2},\,x_{2}-x_{3},\,\xi_{1}+\xi_{2}+\xi_{3})\, and V(x1,x2,x3)\,V(x_{1},x_{2},x_{3})\, of Char(W(I3))\,\operatorname{Char}(W(I_{3}))\, are the only ones contained in BB. In other words, the cotangent spaces to pλ\,p_{\lambda}\, are not contained in the characteristic variety unless λ=0\lambda=0. It does not seem to be very pleasant to verify this last claim by hand. The operator P\,P\, of lowest order we found in D3\,D_{3}\, whose symbol in(0,e)(P)\,\operatorname{in}_{(0,e)}(P)\, does not vanish on pλ\,p_{\lambda}\, with λ0\,\lambda\neq 0\, has order  4\,4\, and one needs coefficients of order  6\,6\, to show that PI3P\in I_{3}.

It is striking that the components of Char(W(I3))\,\operatorname{Char}(W(I_{3}))\, contained in B\,B\, are exactly those conormal bundles contained in B\,B\, that are bihomogeneous in the xix_{i} and the ξj\xi_{j}. According to Conjecture 6.2, all components should have this property but for the time being we do not see how to deduce bihomogeneity in general, see also Problem 6.8.

Example 6.6.

Computations in Singular for fixed a,ca,c over a finite field suggest that Char(I4)\,\operatorname{Char}\mathopen{}\mathclose{{}\left(I_{4}}\right)\, decomposes into  51=B51\,51=B_{5}-1\, irreducible components. One of them, KV(x1x2,x1x3,x1x4)K\coloneqq V(x_{1}-x_{2},\,x_{1}-x_{3},\,x_{1}-x_{4}), is 55-dimensional. The analogous computations over (a,c)\,{\mathbb{Q}}(a,c)\, do not terminate. We can nevertheless verify its existence via the following trick. Instead of I4I_{4}, we consider the ideal J4I4+(x1x2)J_{4}\coloneqq I_{4}+(x_{1}-x_{2}). Then we clearly have:

Char(I4)Char(I4)V(x1x2)Char(J4).\operatorname{Char}(I_{4})\,\supseteq\,\operatorname{Char}(I_{4})\,\cap\,V(x_{1}-x_{2})\,\supseteq\,\operatorname{Char}(J_{4}).

The computation of Char(J4)\,\operatorname{Char}(J_{4})\, is much simpler and immediately terminates. It turns out that KChar(J4)K\subseteq\operatorname{Char}(J_{4}). Therefore, Char(I4)\,\operatorname{Char}(I_{4})\, contains the 55-dimensional component K\,K\, and we conclude that I4\,I_{4}\, is not holonomic.

6.4. Open problems concerning the characteristic variety

As the examples above indicated, there are a lot of open problems which we would like to put forward.

Problem 6.7.

Compute the Weyl closure W(Im)\,W(I_{m})\, of Im\,I_{m}\, for any mm.

A first step would be to explicitly write down differential operators in W(Im)ImW(I_{m}){\ \setminus\ }I_{m}.

Problem 6.8.

Show that Char(W(Im))\,\operatorname{Char}(W(I_{m}))\, (and possibly Char(Im)\operatorname{Char}(I_{m})) are invariant under the action of ×\,\mathbb{C}^{\ast}\times\mathbb{C}^{\ast}\, on T𝔸m=𝔸m×𝔸m\,T^{*}{\mathbb{A}}^{m}={\mathbb{A}}^{m}\times{\mathbb{A}}^{m}\, given by scalar multiplication on the factors.

This would of course be an immediate consequence of a proof of Conjecture 6.2. It should however be easier to tackle Problem 6.8 directly. One strategy could be to write down a flat one-parameter family of ideals {Jt}t𝔸1\{J_{t}\}_{t\in{\mathbb{A}}^{1}}, such that J1=Im\,J_{1}=I_{m}\, and J0\,J_{0}\, has an action by ×\,\mathbb{C}^{\ast}\times\mathbb{C}^{\ast}\, and then to see how to relate the characteristic varieties in a flat family.

One way to realize such a one-parameter family concretely is to apply a suitable \mathbb{C}^{\ast}-action to Im\,I_{m}\, and take the limit as the parameter t\,t\, of \,\mathbb{C}^{\ast}\, goes to zero. If e.g. we decree the xi\,x_{i}\, to have weight zero and the ξi\,\xi_{i}\, have weight one, the commutator relation of the Weyl algebra is preserved and for each tt we obtain an ideal JtJ_{t} as claimed. The flat limit is stable under the {\mathbb{C}}^{\ast}-action and can be found by applying the action to a Gröbner basis. Note that the action on J0\,J_{0}\, induces an action of ×\,{\mathbb{C}^{\ast}\times\mathbb{C}^{\ast}}\, on Char(J0)\operatorname{Char}(J_{0}), as the latter always has a \mathbb{C}^{\ast}-action given by scalar multiplication on the fibers of T𝔸m𝔸mT^{*}{\mathbb{A}}^{m}\xrightarrow{\ \ }{\mathbb{A}}^{m}.

There are also other instances of annihilating ideals related by one-parameter families. It is classically known that the hypergeometric functions F 10\,{{}_{0}F_{\!\!\;1}}\, and F 11{}_{1}F_{\!\!\;1} are related to one another through a scaling and limit process. More precisely, F 11(a;c)(1aX)F 10(c)(X){{}_{1}F_{\!\!\;1}}(a;c)\big{(}\frac{1}{a}X\big{)}\xrightarrow{\ \ }{{}_{0}F_{\!\!\;1}}(c)(X) as aa\xrightarrow{\ \ }\infty, see [33, Section 7.5]. Also, the hypergeometric function F 10\,{{}_{0}F_{\!\!\;1}}\, is known to be annihilated by the operators

(6.8) xkk2+ck+12(kxxkx(k)) 1,\displaystyle x_{k}\partial_{k}^{2}\,+\,c\partial_{k}\,+\,\frac{1}{2}\mathopen{}\mathclose{{}\left(\sum_{{\ell}\neq k}\frac{x_{\ell}}{x_{k}-x_{\ell}}(\partial_{k}-\partial_{\ell})}\right)\,-\,1,

where k=1,,mk=1,\ldots,m. One directly checks that the gk\,g_{k}\, from (4.2) scale accordingly to give the system (6.8), see [33, Theorem 7.5.6].

Problem 6.9.

Can the scaling relation between F 10\,{{}_{0}F_{\!\!\;1}}\, and F 11\,{}_{1}F_{\!\!\;1}\, be used to deduce a relation between the characteristic varieties of Im\,I_{m}\, and the corresponding ideal generated by the operators (6.8)?

We would like to mention that F 10{{}_{0}F_{\!\!\;1}} naturally appears when investigating the normalizing constant of the Fisher distribution on SO(3), as described in [41].

6.5. Outlook

We think that Conjecture 6.2 deserves further study and that it will be helpful to get a better understanding of the hypergeometric function F 11\,{{}_{1}F_{\!\!\;1}}\, of a matrix argument. The goal of the present article was to put forward this very clear and intriguing conjecture and to provide some evidence for it. The context in which we studied the function F 11\,{{}_{1}F_{\!\!\;1}}\, was rather conceptual, but our methods were mainly ad hoc. We believe that, eventually, the problem should be addressed using more advanced methods from DD-module theory. For this, one should look for a more intrinsic description of the Muirhead ideal—or rather its Weyl closure. In particular, it would be interesting to understand if there is some generalization of GKZ systems and a relation to the hypergeometric function of a matrix argument similar to the one-variable case. We hope to be able to tackle these problems in the future.

Appendix A Singular locus for special parameters

In Section 5, we discussed the singular locus of the Muirhead ideal and of its Weyl closure for those parameters a,ca,c, for which the hypergeometric function F 11(a;c)\,{{}_{1}F_{\!\!\;1}}(a;c)\, of a diagonal matrix argument is defined. In this appendix, we prove Theorem 5.1 without any restriction on the parameter cc\in{\mathbb{C}}. We are grateful to the referee for proposing an approach based on restriction modules, which finally led to the proof presented here. We would like to point out that similar problems have been studied in the literature. In [14], the singular locus of a holonomic system annihilating Lauricella’s hypergeometric function FC\,F_{C}\, was computed using a different technique. Hattori–Takayama used Gröbner bases and syzygies to compute a certain Ext-module whereas we analyze restriction modules on coordinate hyperplanes by a more elementary, computational argument. It would be interesting to compare the two methods more thoroughly.

Even though our approach in Section 5 for studying the singular locus rests only on the differential operators, defined regardless of the value of the parameters, the need to consider non-special parameters shows up in one subtle step of the computations: To prove that the coordinate hyperplanes {xmxi=0}\,\{x\in{\mathbb{C}}^{m}\mid x_{i}=0\}\, lie in the singular locus of W(Im)W(I_{m}), in Lemma 5.4 our proof relied on the condition cm12c\notin\frac{m-1}{2}-{\mathbb{N}}. Note that this is the only step in the proof of 5.1 that does not work for arbitrary cc\in{\mathbb{C}}. In particular, the diagonal hyperplanes were shown to lie in the singular locus for any c\,c\, by Lemma 5.5. Therefore, to prove 5.1, it suffices (by symmetry) to show that the hyperplane H{xmxm=0}\,H\coloneqq\{x\in{\mathbb{C}}^{m}\mid x_{m}=0\}\, is contained in the singular locus of W(Im)W(I_{m}).

For this, we investigate the Dm1D_{m-1}-module Dm/(W(Im)+xmDm)D_{m}/(W(I_{m})+x_{m}D_{m}), which is the restriction module of Dm/W(Im)\,D_{m}/W(I_{m})\, with respect to HH. Its holonomic rank coincides with the dimension of the space of formal power series solutions to W(Im)\,W(I_{m})\, centered at a general point of HH. Hence, recalling that rank(W(Im))=2m\,\operatorname{rank}(W(I_{m}))=2^{m}\, by Corollary 4.4, we can conclude 5.1 from the following result:

Proposition A.1.

Let a,c\,a,c\in{\mathbb{C}}\, be arbitrary parameters. Then the holonomic rank of the restriction module Dm/(W(Im)+xmDm)\,D_{m}/(W(I_{m})+x_{m}D_{m})\, is strictly smaller than 2m2^{m}.

Proof.

Consider the localized Weyl algebra

D(xm)[x1,,xm](xm)[x1,,xm]Dm,D_{(x_{m})}\,\coloneqq\,{\mathbb{C}}[x_{1},\dots,x_{m}]_{(x_{m})}\otimes_{{\mathbb{C}}[x_{1},\dots,x_{m}]}D_{m},

which is the ring of differential operators with rational function coefficients that do not have poles along the hyperplane {xmxm=0}\{x\in{\mathbb{C}}^{m}\mid x_{m}=0\}. Denote by Jm\,J_{m}\, the ideal W(Im)D(xm)D(xm)W(I_{m})\cap D_{(x_{m})}\subseteq D_{(x_{m})}. Then the inclusion DmD(xm)D_{m}\subseteq D_{(x_{m})} induces an isomorphism of Rm1R_{m-1}-modules

Rm1Dm1Dm/(W(Im)+xmDm)D(xm)/(Jm+xmD(xm))M.R_{m-1}\otimes_{D_{m-1}}D_{m}/(W(I_{m})+x_{m}D_{m})\,\cong\,D_{(x_{m})}/(J_{m}+x_{m}D_{(x_{m})})\,\eqqcolon\,M.

By definition, the holonomic rank of the restriction module Dm/(W(Im)+xmDm)\,D_{m}/(W(I_{m})+x_{m}D_{m})\, is the dimension of the Rm1R_{m-1}-module M\,M\, as a vector space over (x1,,xm1){\mathbb{C}}(x_{1},\dots,x_{m-1}). Therefore, our aim is to bound dim(x1,,xm1)M\dim_{{\mathbb{C}}(x_{1},\dots,x_{m-1})}M. Note that

D(xm)/xmD(xm)[x1,,xm](xm)/(xm)[x1,,xm]DmD_{(x_{m})}/x_{m}D_{(x_{m})}\,\cong\,{\mathbb{C}}[x_{1},\dots,x_{m}]_{(x_{m})}/(x_{m})\otimes_{{\mathbb{C}}[x_{1},\dots,x_{m}]}D_{m}

is a free Rm1R_{m-1}-module isomorphic to Rm1\,R_{m-1}^{\oplus\infty}\, with the countable basis {1,m,m2,}\{1,\partial_{m},\partial_{m}^{2},\dots\}. For each operator QD(xm)Q\in D_{(x_{m})}, we write Q|xm=0\,{\mathopen{}\mathclose{{}\left.\kern-1.2ptQ}\right|_{x_{m}=0}}\, for the unique expression i=0kQimi\,\sum_{i=0}^{k}Q_{i}\partial_{m}^{i}\, with QiRm1\,Q_{i}\in R_{m-1}\, representing it in D(xm)/xmD(xm)D_{(x_{m})}/x_{m}D_{(x_{m})}. Then M\,M\, is the quotient of D(xm)/xmD(xm)\,D_{(x_{m})}/x_{m}D_{(x_{m})}\, by the Rm1R_{m-1}-submodule

N{Q|xm=0QJm}.N\,\coloneqq\,\{{\mathopen{}\mathclose{{}\left.\kern-1.2ptQ}\right|_{x_{m}=0}}\mid Q\in J_{m}\}.

We equip D(xm)/xmD(xm)\,D_{(x_{m})}/x_{m}D_{(x_{m})}\, with a total order \,\prec\, on its (x1,,xm1){\mathbb{C}}(x_{1},\dots,x_{m-1})-basis of monomials {α=1α1m1αm1mαmαm}\,\{\partial^{\alpha}=\partial_{1}^{\alpha_{1}}\dots\partial_{m-1}^{\alpha_{m-1}}\partial_{m}^{\alpha_{m}}\mid\alpha\in{\mathbb{N}}^{m}\}\, as follows: For α,βm\alpha,\beta\in{\mathbb{N}}^{m},

αβ:αm<βm or (αm=βm and (α1,,αm1)grlex(β1,,βm1)),\partial^{\alpha}\prec\partial^{\beta}\ \ :\Leftrightarrow\ \ \alpha_{m}<\beta_{m}\text{ or }\big{(}\alpha_{m}=\beta_{m}\text{ and }(\alpha_{1},\dots,\alpha_{m-1})\prec_{\rm grlex}(\beta_{1},\dots,\beta_{m-1})\big{)},

where grlex\,\prec_{\rm grlex}\, denotes the graded lexicographic order on m1{\mathbb{N}}^{m-1}. This is a POT term order (“position over term”) on the free Rm1R_{m-1}-module D(xm)/xmD(xm)Rm1D_{(x_{m})}/x_{m}D_{(x_{m})}\cong R_{m-1}^{\oplus\infty}, cf. [37, §5.2]. The dimension of M\,M\, over (x1,,xm1)\,{\mathbb{C}}(x_{1},\dots,x_{m-1})\, agrees with that of the associated graded module

gr(M)=(x1,,xm1)[ξ1,,ξm1]/in(N),\operatorname{gr}^{\prec}(M)\,=\,{\mathbb{C}}(x_{1},\dots,x_{m-1})[\xi_{1},\dots,\xi_{m-1}]^{\oplus\infty}/\operatorname{in}_{\prec}(N),

where in(N)\,\operatorname{in}_{\prec}(N)\, is the (x1,,xm1)[ξ1,,ξm1]{\mathbb{C}}(x_{1},\dots,x_{m-1})[\xi_{1},\dots,\xi_{m-1}]-submodule generated by the initial forms of elements in N\,N\, with respect to \prec.

Our approach is now to explicitly write out (\prec-initial forms of) elements in NN to bound the holonomic rank of the restriction module. Note that the Muirhead operators g1,,gm\,g_{1},\dots,g_{m}\, from (4.3) lie in JmJ_{m}, hence mkgi|xm=0N\,{\mathopen{}\mathclose{{}\left.\kern-1.2pt\partial_{m}^{k}g_{i}}\right|_{x_{m}=0}}\in N\, for all i=1,,m\,i=1,\dots,m\, and kk\in{\mathbb{N}}. If imi\neq m, one computes that

(A.1) mkgi|xm=0=xii2mk+smaller order terms w.r.t..{\mathopen{}\mathclose{{}\left.\kern-1.2pt\partial_{m}^{k}g_{i}}\right|_{x_{m}=0}}\,=\,x_{i}\partial_{i}^{2}\partial_{m}^{k}\,+\,\text{smaller order terms w.r.t.}\prec.

Hence, xiξi2mkin(N)x_{i}\xi_{i}^{2}\,\partial_{m}^{k}\in\operatorname{in}_{\prec}(N) for all im1i\leq m-1, kk\in{\mathbb{N}}.

Moreover, a straightforward computation reveals that (mkgm)|xm=0\,{\mathopen{}\mathclose{{}\left.\kern-1.2pt(\partial_{m}^{k}g_{m})}\right|_{x_{m}=0}}\, equals

(A.2) (c+km12)mk+1(k+a)mk+12=0kk!!j=1m1xjk(jxj1)m.\big{(}c+k-\frac{m-1}{2}\big{)}\partial_{m}^{k+1}\,-\,(k+a)\partial_{m}^{k}\,+\,\frac{1}{2}\sum_{\ell=0}^{k}\frac{k!}{\ell!}\sum_{j=1}^{m-1}x_{j}^{\ell-k}(\partial_{j}-\ell x_{j}^{-1})\;\partial_{m}^{\ell}.

In particular, we see that mk+1in(N)\,\partial_{m}^{k+1}\in\operatorname{in}_{\prec}(N)\, for all  kk\in{\mathbb{N}}\, with c+km120c+k-\frac{m-1}{2}\neq 0.

For the case that cm12c\notin\frac{m-1}{2}-{\mathbb{N}}, we conclude that gr(M)\,\operatorname{gr}^{\prec}(M)\, is generated over (x1,,xm1)\,{\mathbb{C}}(x_{1},\dots,x_{m-1})\, by the  2m1\,2^{m-1}\, elements ξ1α1ξm1αm1\,\xi_{1}^{\alpha_{1}}\cdots\xi_{m-1}^{\alpha_{m-1}}\, with α{0,1}m1\alpha\in\{0,1\}^{m-1}. This shows

rank(Dm/(W(Im)+xmDm))\displaystyle\operatorname{rank}(D_{m}/(W(I_{m})+x_{m}D_{m})) =dim(x1,,xm1)(M)\displaystyle\,=\,\dim_{{\mathbb{C}}(x_{1},\dots,x_{m-1})}(M)
=dim(x1,,xm1)(gr(M)) 2m1< 2m,\displaystyle\,=\,\dim_{{\mathbb{C}}(x_{1},\dots,x_{m-1})}\big{(}\operatorname{gr}^{\prec}(M)\big{)}\,\leq\,2^{m-1}\,<\,2^{m},

reproving Lemma 5.4.

Now, we turn to the remaining case c=m12s\,c=\frac{m-1}{2}-s\, for some ss\in{\mathbb{N}}. In this case, (A.1) for k{0,s}\,k\in\{0,s\}\, and (A.2) for k{s}\,k\in{\mathbb{N}}\setminus\{s\}\, show that gr(M)\,\operatorname{gr}^{\prec}(M)\, is generated over (x1,,xm1)\,{\mathbb{C}}(x_{1},\dots,x_{m-1})\, by the  2m\,2^{m}\, elements ξ1α1ξm1αm1mr\,\xi_{1}^{\alpha_{1}}\cdots\xi_{m-1}^{\alpha_{m-1}}\,\partial_{m}^{r}\, with α{0,1}m1\,\alpha\in\{0,1\}^{m-1}\, and r{0,s+1}r\in\{0,s+1\}. It suffices to prove that there is a linear dependence among these generators. Then we can conclude that

rank(Dm/(W(Im)+xmDm))=dim(x1,,xm1)(gr(M)) 2m1< 2m.\displaystyle\operatorname{rank}\big{(}D_{m}/(W(I_{m})+x_{m}D_{m})\big{)}\,=\,\dim_{{\mathbb{C}}(x_{1},\dots,x_{m-1})}\big{(}\operatorname{gr}^{\prec}(M)\big{)}\,\leq\,2^{m}-1\,<\,2^{m}.

In the following lemma, we leverage (A.2) for k=s\,k=s\, to show the linear dependence, concluding the proof. ∎

Lemma A.2.

Let aa\in{\mathbb{C}}, ss\in{\mathbb{N}} and c=m12sc=\frac{m-1}{2}-s. For each r{1,,s+1}r\in\{1,\dots,s+1\}, there exists an element in N\,N\, of the form

(A.3) Hr=s(s1)(sr+1)mrτ{0,1}m1qτ(r)1τ1m1τm1,H_{r}\,=\,s(s-1)\cdots(s-r+1)\partial_{m}^{r}\,-\sum_{\tau\,\in\,\{0,1\}^{m-1}}q_{\tau}^{\smash[t]{(r)}}\partial_{1}^{\tau_{1}}\cdots\partial_{m-1}^{\tau_{m-1}},

where qτ(r)(x1,,xm1)\,q_{\tau}^{\smash[t]{(r)}}\in{\mathbb{C}}(x_{1},\dots,x_{m-1})\, and for each r\,r\, there is at least one τ\,\tau\, with qτ(r)0q_{\tau}^{\smash[t]{(r)}}\neq 0. In particular (setting r=s+1r=s+1), the elements {1τ1m1τm1τ{0,1}m1}\,\big{\{}\partial_{1}^{\tau_{1}}\dots\partial_{m-1}^{\tau_{m-1}}\mid\tau\in\{0,1\}^{m-1}\big{\}}\, of M\,M\, are not linearly independent over (x1,,xm1){\mathbb{C}}(x_{1},\dots,x_{m-1}).

Proof.

We fix mm and ss, and for τ{0,1}m1\tau\in\{0,1\}^{m-1}, we denote |τ|:=τ1++τm1|\tau|:=\tau_{1}+\dots+\tau_{m-1}. By induction on rr, we prove the following:

  1. (i)

    x1r|τ|qτ(r)[x1,,xm1](x1)x_{1}^{r-|\tau|}q_{\tau}^{\smash[t]{(r)}}\in{\mathbb{C}}[x_{1},\dots,x_{m-1}]_{(x_{1})} for all τ{0,1}m1\tau\in\{0,1\}^{m-1}.

  2. (ii)

    (x1rq(0,0,,0)(r))|x1=0=0{\mathopen{}\mathclose{{}\left.\kern-1.2pt\bigg{(}x_{1}^{r}q_{(0,0,\dots,0)}^{\smash[t]{(r)}}\bigg{)}}\right|_{x_{1}=0}}=0.

  3. (iii)

    (x1r1q(1,0,,0)(r))|x1=0122r1122r2{\mathopen{}\mathclose{{}\left.\kern-1.2pt\bigg{(}x_{1}^{r-1}q_{(1,0,\dots,0)}^{\smash[t]{(r)}}\bigg{)}}\right|_{x_{1}=0}}\in\frac{1}{2^{2r-1}}{\mathbb{Z}}\setminus\frac{1}{2^{2r-2}}{\mathbb{Z}}.

Note that the expressions in (ii) and (iii) are well-defined because of (i). In particular, condition (iii) guarantees that q(1,0,,0)(r)0q_{(1,0,\dots,0)}^{\smash[t]{(r)}}\neq 0.

For r=1r=1, let H1\,H_{1}\, be the negative of (A.2) with k=0k=0, which is of the desired form with

qτ(1)={aif |τ|=0,1/2if |τ|=1,0if |τ|>2.q_{\tau}^{\smash[t]{(1)}}\,=\,\begin{cases}-a&\text{if }|\tau|=0,\\ 1/2&\text{if }|\tau|=1,\\ 0&\text{if }|\tau|>2.\end{cases}

For the induction step, fix k1\,k\geq 1\, and assume that the operators Hr\,H_{r}\, have been constructed with properties (i) to (iii) for rk\,r\leq k\,. We wish to construct Hk+1H_{k+1}. For this, we start with a suitable multiple of the expression (A.2) and reduce it with respect to \,\prec\, modulo the expressions Hr\,H_{r}\, for rk\,r\leq k\, and the expressions gi|xm=0\,{\mathopen{}\mathclose{{}\left.\kern-1.2ptg_{i}}\right|_{x_{m}=0}}\, for i<m\,i<m\, from (A.1). To verify the desired properties, we do not get around carrying out the calculations. Explicitly, the following element arises after the reductions modulo only the expressions Hr\,H_{r}\, for rkr\leq k:

H~k+1\displaystyle\widetilde{H}_{k+1}\,\coloneqq\,{} i=0k1(si)(mkgm)|xm=0(k+a)Hk\displaystyle-\prod_{i=0}^{k-1}(s-i)\cdot{\mathopen{}\mathclose{{}\left.\kern-1.2pt\big{(}\partial_{m}^{k}g_{m}\big{)}}\right|_{x_{m}=0}}-(k+a)H_{k}
+12r=1kk!r!i=rk1(si)j=1m1xjrk(jrxj1)Hr.\displaystyle+\frac{1}{2}\sum_{r=1}^{k}\frac{k!}{r!}\prod_{i=r}^{k-1}(s-i)\sum_{j=1}^{m-1}x_{j}^{r-k}(\partial_{j}-rx_{j}^{-1})H_{r}.

Note that in H~k+1\widetilde{H}_{k+1}, all terms involving mr\,\partial_{m}^{r}\, for r{0,k+1}\,r\notin\{0,k+1\}\, cancel, the term mk+1\,\partial_{m}^{k+1}\, only occurs with coefficient s(sr+1)s\cdots(s-r+1), and all other terms are of the form p1α1m1αm1\,p\partial_{1}^{\alpha_{1}}\dots\partial_{m-1}^{\alpha_{m-1}}\, with p(x1,,xm1)\,p\in{\mathbb{C}}(x_{1},\dots,x_{m-1})\, and α{0,1,2}m1\alpha\in\{0,1,2\}^{m-1}. We define Hk+1\,H_{k+1}\, as the expression obtained by further reducing H~k+1\,\widetilde{H}_{k+1}\, modulo the expressions gi|xm=0\,{\mathopen{}\mathclose{{}\left.\kern-1.2ptg_{i}}\right|_{x_{m}=0}}\, for im1i\leq m-1. For this, note that gi|xm=0\,{\mathopen{}\mathclose{{}\left.\kern-1.2ptg_{i}}\right|_{x_{m}=0}}\, for i<m\,i<m\, are the Muirhead operators (4.2) in dimension m1m-1, and are in particular a Gröbner basis for \,\prec\, by Theorem 4.3.

Denote by ν:(x1,,xm1){}\,\nu\colon{\mathbb{C}}(x_{1},\dots,x_{m-1})\xrightarrow{\ \ }{\mathbb{Z}}\cup\{\infty\}\, the discrete valuation with valuation ring [x1,,xm1](x1){\mathbb{C}}[x_{1},\dots,x_{m-1}]_{(x_{1})}, i.e.,

ν(p)sup{ix1ip(x1,,xm1)(x1)}.\nu(p)\,\coloneqq\,\sup\mathopen{}\mathclose{{}\left\{i\in{\mathbb{Z}}\mid x_{1}^{-i}p\in{\mathbb{C}}(x_{1},\dots,x_{m-1})_{(x_{1})}}\right\}.

With this notation, property (i) can be reformulated as ν(qτ(r))|τ|r\nu(q_{\tau}^{\smash[t]{(r)}})\geq|\tau|-r.

In Rm1R_{m-1}, a reduction with respect to \,\prec\, modulo the Muirhead operator gi|xm=0\,{\mathopen{}\mathclose{{}\left.\kern-1.2ptg_{i}}\right|_{x_{m}=0}}\, for im\,i\leq m\, replaces

(A.4) i2xi1((s12+xi12jim1xixixj)i+12jim1xjxixjj+a).\partial_{i}^{2}\,\mapsto\,x_{i}^{-1}\bigg{(}\Big{(}s-\frac{1}{2}+x_{i}-\frac{1}{2}\sum_{j\neq i}^{m-1}\frac{x_{i}}{x_{i}-x_{j}}\Big{)}\partial_{i}+\frac{1}{2}\sum_{j\neq i}^{m-1}\frac{x_{j}}{x_{i}-x_{j}}\partial_{j}+a\bigg{)}.

Applying this reduction to pα\,p\partial^{\alpha}\, with p(x1,,xm1)p\in{\mathbb{C}}(x_{1},\dots,x_{m-1}), α{0,1,2}m1\alpha\in\{0,1,2\}^{m-1} yields only terms pα\,p^{\prime}\partial^{\alpha^{\prime}}\, with |α|ν(p)|α|ν(p)|\alpha^{\prime}|-\nu(p^{\prime})\leq|\alpha|-\nu(p), and equality can only hold for i=1i=1. Therefore, to prove property (i) for Hk+1H_{k+1}, it suffices to show that H~k+1\,\widetilde{H}_{k+1}\, has only terms pα\,p\partial^{\alpha}\, with |α|ν(p)k+1|\alpha|-\nu(p)\leq k+1. This can easily be seen by substituting (A.2) and (A.3) (for rkr\leq k) into the definition of H~\widetilde{H}, and using that property (i) holds for rk\,r\leq k\, by the induction hypothesis.

We now turn to verifying properties (ii) and (iii). For this, we denote

q¯τ(r)(x1r|τ|qτ(r))|x1=0(x2,,xm1)\bar{q}_{\tau}^{\smash[t]{(r)}}\,\coloneqq\,{\mathopen{}\mathclose{{}\left.\kern-1.2pt\bigg{(}x_{1}^{r-|\tau|}q_{\tau}^{\smash[t]{(r)}}\bigg{)}}\right|_{x_{1}=0}}\in{\mathbb{C}}(x_{2},\dots,x_{m-1})

for all r{1,,k+1}r\in\{1,\dots,k+1\}. To determine q¯τ(k+1)\bar{q}_{\tau}^{\smash[t]{(k+1)}}, we restrict our attention to those terms pα\,p\partial^{\alpha}\, in Hk+1\,H_{k+1}\, for which the |α|ν(p)\,|\alpha|-\nu(p)\, attains the maximum, namely k+1k+1. As we have seen above in (A.4), for this purpose, the terms of H~k+1\,\widetilde{H}_{k+1}\, that get reduced modulo gi|xm=0\,{\mathopen{}\mathclose{{}\left.\kern-1.2ptg_{i}}\right|_{x_{m}=0}}\, for  2im1\,2\leq i\leq m-1\, can be ignored, and it suffices to carry out the reductions of H~k+1\,\widetilde{H}_{k+1}\, modulo the single Muirhead operator g1|xm=0{\mathopen{}\mathclose{{}\left.\kern-1.2ptg_{1}}\right|_{x_{m}=0}}, which results in

H~k+1+12r=1kk!r!i=rk1(si)x1rkτ:τ1=1qτ(r)x11(0,τ2,,τm1)(g1|xm=0).\widetilde{H}_{k+1}+\frac{1}{2}\sum_{r=1}^{k}\frac{k!}{r!}\prod_{i=r}^{k-1}(s-i)x_{1}^{r-k}\sum_{\tau:\tau_{1}=1}q_{\tau}^{\smash[t]{(r)}}x_{1}^{-1}\partial^{(0,\tau_{2},\dots,\tau_{m-1})}\Big{(}{\mathopen{}\mathclose{{}\left.\kern-1.2ptg_{1}}\right|_{x_{m}=0}}\Big{)}.

Expanding this expression and dismissing all terms pα\,p\partial^{\alpha}\, with |α|ν(p)<k+1\,|\alpha|-\nu(p)<k+1\, or with αi>1\alpha_{i}>1, one reads off for all τ{0,1}m1\,\tau\in\{0,1\}^{m-1}\, the recursion

q¯τ(k+1)={12r=1kk!r!i=rk1(si)((|τ|2r+s12)q¯τ(r)+q¯τe1(r))+j=2m1(τj1)12k!i=0k1(si) if τ1=1,12r=1kk!r!i=rk1(si)((|τ|2r+s1)q¯τ(r)+12j:τj=1q¯τej+e1(r)) if τ1=0,\displaystyle\bar{q}_{\tau}^{\smash[t]{(k+1)}}\,=\,\begin{cases}\scriptsize\begin{aligned} &\frac{1}{2}\sum_{r=1}^{k}\frac{k!}{r!}\prod_{i=r}^{k-1}(s-i)\bigg{(}\Big{(}|\tau|-2r+s-\frac{1}{2}\Big{)}\bar{q}_{\tau}^{\smash[t]{(r)}}+\bar{q}_{\tau-e_{1}}^{\smash[t]{(r)}}\bigg{)}{}+{}\prod_{j=2}^{m-1}(\tau_{j}-1)\cdot\frac{1}{2}k!\prod_{i=0}^{k-1}(s-i)\\ \end{aligned}&\text{ if }\tau_{1}=1,\\[9.00002pt] \scriptsize\begin{aligned} \frac{1}{2}\sum_{r=1}^{k}\frac{k!}{r!}\prod_{i=r}^{k-1}(s-i)\bigg{(}\Big{(}|\tau|-2r+s-1\Big{)}\bar{q}_{\tau}^{\smash[t]{(r)}}+\frac{1}{2}\sum_{j:\tau_{j}=1}\bar{q}_{\tau-e_{j}+e_{1}}^{\smash[t]{(r)}}\bigg{)}\end{aligned}&\text{ if }\tau_{1}=0,\end{cases}

where ej(0,,0,1,0,,0)m1\,e_{j}\coloneqq(0,\dots,0,1,0,\dots,0)\in{\mathbb{N}}^{m-1}\, with entry  1\,1\, at the jj-th position.

In particular, we immediately see that q¯(0,0,0)(k+1)=0\bar{q}_{(0,0\dots,0)}^{\smash[t]{(k+1)}}=0, as q¯(0,0,0)(r)=0\,\bar{q}_{(0,0\dots,0)}^{\smash[t]{(r)}}=0\, for rk\,r\leq k\, by the induction hypothesis, proving property (ii). Now, considering the above formula for τ=e1\tau=e_{1}, we get

q¯e1(k+1)=12(12k+s12)q¯e1(k)+[summands for r<k]+12k!i=0k1(si).\bar{q}_{e_{1}}^{\smash[t]{(k+1)}}\,=\,\frac{1}{2}\Big{(}1-2k+s-\frac{1}{2}\Big{)}\bar{q}_{e_{1}}^{\smash[t]{(k)}}\,+\,\bigg{[}\text{summands for $r<k$}\bigg{]}\,+\,\frac{1}{2}k!\prod_{i=0}^{k-1}(s-i).

From the induction hypothesis, we see that the unique term of smallest 22-adic valuation in this expression is 14q¯e1(k)122k+1122k\frac{1}{4}\bar{q}_{e_{1}}^{\smash[t]{(k)}}\in\frac{1}{2^{2k+1}}{\mathbb{Z}}\setminus\frac{1}{2^{2k}}{\mathbb{Z}}. This shows property (iii) and concludes the proof. ∎

References

  • [1] R. J. Beerends and E. M. Opdam. Certain hypergeometric series related to the root system BCBC. Trans. Amer. Math. Soc., 339(2):581–609, 1993.
  • [2] S. Bochner. Bessel functions and modular relations of higher type and hyperbolic differential equations. Comm. Sém. Math. Univ. Lund [Medd. Lunds Univ. Mat. Sem.], 1952(Tome, Tome Supplémentaire):12–20, 1952.
  • [3] E. Cartan. Sur la détermination d’un système orthogonal complet dans un espace de Riemann symétrique clos. Rend. Circ. Mat. Palermo, 53:217–252, 1929.
  • [4] A. G. Constantine. Some non-central distribution problems in multivariate analysis. Ann. Math. Statist., 34:1270–1285, 1963.
  • [5] W. Decker, G.-M. Greuel, G. Pfister, and H. Schönemann. Singular 4-1-2 — A computer algebra system for polynomial computations. http://www.singular.uni-kl.de, 2019.
  • [6] P. Desrosiers and D.-Z. Liu. Selberg integrals, super-hypergeometric functions and applications to β\beta-ensembles of random matrices. Random Matrices Theory Appl., 4(2):1550007, 59, 2015.
  • [7] J. Dutka. The early history of the hypergeometric function. Arch. Hist. Exact Sci., 31(1):15–34, 1984.
  • [8] R. H. Farrell. Techniques of multivariate calculation. Lecture Notes in Mathematics, Vol. 520. Springer-Verlag, Berlin-New York, 1976.
  • [9] O. Gabber. The integrability of the characteristic variety. Amer. J. Math., 103(3):445–468, 1981.
  • [10] I. M. Gelfand, M. M. Kapranov, and A. V. Zelevinsky. Generalized Euler integrals and AA-hypergeometric functions. Adv. Math., 84(2):255–271, 1990.
  • [11] I. M. Gelfand, A. V. Zelevinskiĭ, and M. M. Kapranov. Hypergeometric functions and toric varieties. Funktsional. Anal. i Prilozhen., 23(2):12–26, 1989.
  • [12] H. Hashiguchi, Y. Numata, N. Takayama, and A. Takemura. The holonomic gradient method for the distribution function of the largest root of a Wishart matrix. J. Multivariate Anal., 117:296–312, 2013.
  • [13] H. Hashiguchi, N. Takayama, and A. Takemura. Distribution of the ratio of two Wishart matrices and cumulative probability evaluation by the holonomic gradient method. J. Multivariate Anal., 165:270–278, 2018.
  • [14] R. Hattori and N. Takayama. The singular locus of Lauricella’s FCF_{C}. J. Math. Soc. Japan, 66(3):981–995, 2014.
  • [15] C. S. Herz. Bessel functions of matrix argument. Ann. of Math. (2), 61:474–523, 1955.
  • [16] R. Hotta, K. Takeuchi, and T. Tanisaki. DD-modules, perverse sheaves, and representation theory, volume 236 of Progress in Mathematics. Birkhäuser Boston, Inc., Boston, MA, 2008. Translated from the 1995 Japanese edition by Takeuchi.
  • [17] L.-K. Hua. On the theory of functions of several complex variables. III. On a complete orthonormal system in the hyperbolic space of symmetric and skew-symmetric matrices. Acta Math. Sinica, 5:205–242, 1955.
  • [18] L.-K. Hua. Harmonic analysis of functions of several complex variables in classical domains (Russian). Translated from the Chinese by M. A. Evgrafov; edited by M. I. Graev. Izdat. Inostr. Lit., Moscow, 1959.
  • [19] L.-K. Hua. Harmonic analysis of functions of several complex variables in the classical domains, volume 6 of Translations of Mathematical Monographs. American Mathematical Society, Providence, R.I., 1979. Translated from the Russian, which was a translation of the Chinese original, by Leo Ebner and Adam Korányi, With a foreword by M. I. Graev, Reprint of the 1963 edition.
  • [20] T. Ibukiyama, T. Kuzumaki, and H. Ochiai. Holonomic systems of Gegenbauer type polynomials of matrix arguments related with Siegel modular forms. J. Math. Soc. Japan, 64(1):273–316, 2012.
  • [21] W. R. Inc. Mathematica, Version 11.2. Champaign, IL, 2017.
  • [22] A. T. James. The distribution of the latent roots of the covariance matrix. Ann. Math. Statist., 31:151–158, 1960.
  • [23] A. T. James. Zonal polynomials of the real positive definite symmetric matrices. Ann. of Math. (2), 74:456–469, 1961.
  • [24] M. Kashiwara. On the maximally overdetermined system of linear differential equations. I. Publ. Res. Inst. Math. Sci., 10:563–579, 1974/75.
  • [25] M. Kashiwara. Systems of microdifferential equations, volume 34 of Progress in Mathematics. Birkhäuser Boston, Inc., Boston, MA, 1983. Based on lecture notes by Teresa Monteiro Fernandes translated from the French, With an introduction by Jean-Luc Brylinski.
  • [26] T. Kondo. On a holonomic system of partial differential equations satisfied by F11{{}_{1}F_{1}} of a matrix argument (in Japanese). Master’s thesis, Kobe university, 2013.
  • [27] C. Koutschan. HolonomicFunctions: A Mathematica package for dealing with multivariate holonomic functions, including closure properties, summation, and integration.
  • [28] T. Koyama, H. Nakayama, K. Nishiyama, and N. Takayama. Holonomic gradient descent for the Fisher–Bingham distribution on the dd-dimensional sphere. Comput. Stat., 29(3-4):661–683, 2014.
  • [29] V. Levandovskyy and D. Andres. dmodapp.lib. A singular 4-1-2 library for applications of algebraic DD-modules. http://www.singular.uni-kl.de, 2013.
  • [30] V. Levandovskyy and D. Andres. dmodloc.lib. A singular 4-1-2 library for localization of algebraic DD-modules and applications. http://www.singular.uni-kl.de, 2013.
  • [31] V. Levandovskyy and J. M. Morales. dmod.lib. A singular 4-1-2 library for algorithms for algebraic DD-modules. http://www.singular.uni-kl.de, 2013.
  • [32] R. J. Muirhead. Systems of partial differential equations for hypergeometric functions of matrix argument. Ann. Math. Statist., 41:991–1001, 1970.
  • [33] R. J. Muirhead. Aspects of multivariate statistical theory. John Wiley & Sons, Inc., New York, 1982. Wiley Series in Probability and Mathematical Statistics.
  • [34] H. Nakayama, K. Nishiyama, M. Noro, K. Ohara, T. Sei, N. Takayama, and A. Takemura. Holonomic gradient descent and its application to the Fisher–Bingham integral. Adv. in Appl. Math., 47(3):639–658, 2011.
  • [35] M. Noro. System of partial differential equations for the hypergeometric function F11{{}_{1}F_{1}} of a matrix argument on diagonal regions. ISSAC ’16: Proceedings of the ACM on International Symposium of Symbolic and Algebraic Computation, pages 381–388, July 2016.
  • [36] T. Reichelt, M. Schulze, C. Sevenheck, and U. Walther. Algebraic aspects of hypergeometric differential equations, 2020.
  • [37] M. Saito, B. Sturmfels, and N. Takayama. Gröbner deformations of hypergeometric differential equations, volume 6 of Algorithms and Computation in Mathematics. Springer-Verlag, Berlin, 2000.
  • [38] M. Sato, T. Kawai, and M. Kashiwara. Microfunctions and pseudo-differential equations. In Hyperfunctions and pseudo-differential equations (Proc. Conf., Katata, 1971; dedicated to the memory of André Martineau), pages 265–529. Lecture Notes in Math., Vol. 287, 1973.
  • [39] A.-L. Sattelberger and B. Sturmfels. DD-modules and holonomic functions. Preprint arXiv:1910.01395 [math.AG], 2019.
  • [40] T. Sei, H. Shibata, A. Takemura, K. Ohara, and N. Takayama. Properties and applications of Fisher distribution on the rotation group. J. Multivariate Anal., 116:440–455, 2013.
  • [41] T. Sei, A. Shibatam H and, Takemura, K. Ohara, and N. Takayama. Properties and applications of the fisher distribution on the rotation group. J. Multivariate Analysis, 116:440–455, 2013.
  • [42] A. Takemura. Zonal polynomials, volume 4 of Institute of Mathematical Statistics Lecture Notes—Monograph Series. Institute of Mathematical Statistics, Hayward, CA, 1984.
  • [43] H. Tsai. Weyl closure of a linear differential operator. J. Symbolic Comput., 29(4-5):747–775, 2000. Symbolic computation in algebra, analysis, and geometry (Berkeley, CA, 1998).
  • [44] H. Tsai. Algorithms for associated primes, Weyl closure, and local cohomology of DD-modules. In Local cohomology and its applications (Guanajuato, 1999), volume 226 of Lecture Notes in Pure and Appl. Math., pages 169–194. Dekker, New York, 2002.
  • [45] D. Zeilberger. A holonomic systems approach to special functions identities. J. Comput. Appl. Math., 32(3):321–368, 1990.