This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

QM abelian varieties, hypergeometric character sums and modular forms

Jerome William Hoffman Department of Mathematics
Louisiana State University
Baton Rouge, Louisiana 70803
 and  Fang-Ting Tu Department of Mathematics
Louisiana State University
Baton Rouge, Louisiana 70803
[email protected] [email protected]
Abstract.

This is a report on recent work, with Wen-Ching Winnie Li and Ling Long. In that work explicit formulas are given, involving hypergeometric character sums, for the traces of Hecke operators TpT_{p} acting spaces of cusp forms Sk(Γ)S_{k}(\Gamma) of weight kk for certain arithmetically defined Fuchsian subgroups ΓSL2(𝐑)\Gamma\subset\mathrm{SL}_{2}(\mathbf{R}). In particular we consider the groups attached to the quaternion division algebra B6B_{6} over 𝐐\mathbf{Q} of discriminant 6.

Key words and phrases:
hypergeometric functions, motive, \ell-adic representations, rigidity
1991 Mathematics Subject Classification:
11T23, 11T24, 11S40, 11F80, 11F85, 33C05, 33C65

1. Introduction.

The forthcoming joint work with Wen-Ching Winnie Li and Ling Long [43] extends some investigations of Wen-Ching Winnie Li, Tong Liu, and Ling Long on 4-dimensional Galois representations with quaternion structures, [6]; see also [63]. Our main results are explicit formulas in the shape (see Section 8; terms to be explained in more detail)

(1) Tr(TpSk(Γ))=an expression in hypergeometric character sums.-\mathrm{Tr}(T_{p}\mid S_{k}(\Gamma))=\text{an expression in hypergeometric character sums.}

Here Tr(TpSk(Γ))\mathrm{Tr}(T_{p}\mid S_{k}(\Gamma)) is the trace of the Hecke operator TpT_{p} acting on the space of weight-kk cusp forms Sk(Γ)S_{k}(\Gamma) on the arithmetic Fuchsian group Γ\Gamma. In fact, the expression will be a sum of terms for each λXΓ(𝐅p)\lambda\in X_{\Gamma}(\mathbf{F}_{p}) of the local Frobenius traces Tr(Frobλ)\mathrm{Tr}(\mathrm{Frob}_{\lambda}) on a constructible 𝐐¯\bar{\mathbf{Q}}_{\ell}-sheaf Vk(Γ)V^{k}(\Gamma) for the étale topology on the curve XΓX_{\Gamma}. The theory of hypergeometric character sums over finite fields was developed largely by Greene [41], Katz [52], Beukers-Cohen-Mellit [12], and Fuselier-Long-Ramakrishna-Swisher-Tu [36]. The groups Γ\Gamma are certain arithmetic triangle groups.

The inspiration for this work can be found in the papers of Ahlgren, Frechette, Fuselier, Lennon, Ono, Papanikolas [1, 2, 31, 37, 61, 62], wherein formulas in terms of hypergeometric character sums for traces of Hecke operators are given, but not in terms of traces of Frobenius in étale cohomology. Rather, their approach combines (1) counting formulas for elliptic curves over finite fields due to Schoof, [79], and (2) the Selberg trace formula. In fact, formulas for the trace of Hecke operators TpT_{p} on S2k(SL2(𝐙))S_{2k}(\mathrm{SL}_{2}(\mathbf{Z})) had already been given by Ihara, [46] by similar methods, but Ihara did not use hypergeometric character sums.

We give a different approach, which is more geometric, and applies not only to the cases treated by these authors, but also to groups Γ\Gamma arising from the units in the quaternion algebra B6B_{6} over 𝐐\mathbf{Q} of discriminant 6. We will explain the geometric viewpoints on modular forms and hypergeometric functions.

2. Background.

This work weaves several major mathematical threads. On the one hand, this is part of the general Langlands program, namely that part concerned with expressing the LL-functions of motives in terms of LL-functions of automorphic forms. In the case at hand, the automorphic forms are classical modular forms on the algebraic group GL(2)𝐐GL(2)_{\mathbf{Q}} or its inner twist B6,𝐐B_{6,\mathbf{Q}} given by the quaternion algebra B6B_{6}. The automorphic LL-functions are attached to local systems on modular curves. When ΓSL2(𝐙)\Gamma\subset\mathrm{SL}_{2}(\mathbf{Z}) is a congruence subgroup, these are the modular curves classifying families of elliptic curves with level structure. When Γ\Gamma comes from a quaternion algebra, these are Shimura curves. Thus, the motives involved belong to the general theory of Shimura varieties, [71].

Another major theme is that of hypergeometric functions. The term hypergeometric includes not only the classical F12\phantom{}{}_{2}F_{1} functions, and their generalizations Fqp\phantom{}{}_{p}F_{q}, Appell-Lauricella functions, etc. There are (at least) two general extensions of the theory of hypergeometric functions: (1) the AA-hypergeometric, or GKZ (Gelfand-Kapranov-Zelevinsky) systems, [39]; (2) the Gabber-Loeser-Sabbah hypergeometric systems, based on earlier works of M. Sato and Ore, [66, 67, 68]. See the paper of Dwork/Loeser: [24].

Generally speaking, a hypergeometric function is one that

  • 1.

    has power-series expansions in special form: Γ\Gamma-series;

  • 2.

    satisfies a (regular) holonomic system of differential equations;

  • 3.

    has Euler integral expressions;

  • 4.

    is attached to a motivic sheaf.

What the last item means is that not only are there complex-analytic functions defined by them, but also \ell-adic and pp-adic versions. The \ell-adic versions give rise to hypergeometric character sums, which are central to this paper. For the GKZ systems see [34]; for the GLS systems see [38]. The pp-adic versions give pp-adic analytic functions, first investigated by Dwork. For the GKZ systems, see [35].

Note that there are irregular differential equations of hypergeometric type, the confluent hypergeometrics. The Hodge-deRham realizations are related to irregular Hodge theory and the periods belong to exponential modules, see [25]. The character sums involve additive as well as multiplicative characters of finite fields, and hence their \ell-adic sheaves have wild ramification at infinity. For a recent example see [33].

3. Modular forms and modular curves

3.1. Modular forms

(Reference: [85]). A Fuchsian subgroup of the first kind is a discrete subgroup ΓSL2(𝐑)\Gamma\subset\mathrm{SL}_{2}(\mathbf{R}) such that the invariant volume Γ\SL2(𝐑)\Gamma\backslash\mathrm{SL}_{2}(\mathbf{R}) is finite. This acts on the upper half-plane \mathfrak{H} by

γ.z=az+bcz+d,γ=(abcd).\gamma.z=\frac{az+b}{cz+d},\quad\gamma=\begin{pmatrix}a&b\\ c&d\end{pmatrix}.

The upper half-plane is conformally equivalent to the open unit disk 𝔇\mathfrak{D}, via the map zw=(zi)/(z+i)z\mapsto w=(z-i)/(z+i). This reflects the isomorphism SU(1,1)SL2(𝐑)\mathrm{SU}(1,1)\cong\mathrm{SL}_{2}(\mathbf{R}), where

SU(1,1)={(abb¯a¯)a,b𝐂,|a|2|b|2=1.},(abb¯a¯).w=aw+bb¯w+a¯.\mathrm{SU}(1,1)=\left\{\begin{pmatrix}a&b\\ \bar{b}&\bar{a}\end{pmatrix}\mid a,b\in\mathbf{C},\ \ |a|^{2}-|b|^{2}=1.\right\},\quad\begin{pmatrix}a&b\\ \bar{b}&\bar{a}\end{pmatrix}.w=\frac{aw+b}{\bar{b}w+\bar{a}}.

There are isomorphisms SL2(𝐑)/SO(2)𝔇\mathrm{SL}_{2}(\mathbf{R})/\mathrm{SO}(2)\cong\mathfrak{H\cong}\mathfrak{D} where

SO(2)={(cos(θ)sin(θ)sin(θ)cos(θ))}SL2(𝐑) is a maximal compact subgroup.\mathrm{SO}(2)=\left\{\begin{pmatrix}\cos(\theta)&\sin(\theta)\\ -\sin(\theta)&\cos(\theta)\end{pmatrix}\right\}\subset\mathrm{SL}_{2}(\mathbf{R})\text{\ \ is a maximal compact subgroup.}

For a Fuchsian subgroup Γ\Gamma of first kind, let XΓan=Γ\X_{\Gamma}^{an}=\Gamma\backslash\mathfrak{H} or Γ\\Gamma\backslash\mathfrak{H}^{*} according to whether Γ\Gamma cocompact or not. Here \mathfrak{H}^{*} is the union of \mathfrak{H} and the cusps of Γ\Gamma, if any. We have a finite set S=ScSeXΓS=S_{c}\cup S_{e}\subset X_{\Gamma} of points which are cusps or elliptic points, that is, their preimages in \mathfrak{H}^{*} are cusps or elliptic points of Γ\Gamma, respectively. Let

XΓ=XΓ(ScSe),YΓ=XΓSc=Γ\,X_{\Gamma}^{\circ}=X_{\Gamma}-(S_{c}\cup S_{e}),\quad\quad Y_{\Gamma}=X_{\Gamma}-S_{c}=\Gamma\backslash\mathfrak{H},

the complement of the set of cusps and elliptic points, and the complement of the set of cusps, respectively. Note that p:{elliptic points of Γ}XΓp:\mathfrak{H}-\{\text{elliptic points of \ }\Gamma\}\to X_{\Gamma}^{\circ} is a covering space in the sense of topology. In particular, if Γ\Gamma has no elliptic points, then p:YΓp:\mathfrak{H}\to Y_{\Gamma} is the universal covering of YΓY_{\Gamma}. The action of Γ\Gamma on \mathfrak{H}^{*} is via the quotient Γ¯\bar{\Gamma} in PSL2(𝐑)\mathrm{PSL}_{2}(\mathbf{R}). For the groups in this paper, Γ¯=Γ/Γ{±I}\bar{\Gamma}=\Gamma/\Gamma\cap\{\pm I\}. Therefore, if Γ\Gamma torsion-free, then p:YΓp:\mathfrak{H}\to Y_{\Gamma} is the universal covering of YΓY_{\Gamma}, and the fundamental group at any base-point xx

π1(YΓ,x)Γ,\pi_{1}(Y_{\Gamma},x)\cong\Gamma,

an isomorphism unique up to inner automorphism. In the quaternion cases this is π1(XΓ,x)Γ\pi_{1}(X_{\Gamma},x)\cong\Gamma for Γ\Gamma without torsion. In general, there is an epimorphism

π1(XΓ,x)Γ¯Aut(({elliptic points of Γ})/XΓ).\pi_{1}(X^{\circ}_{\Gamma},x)\to\bar{\Gamma}\cong\mathrm{Aut}((\mathfrak{H}-\{\text{elliptic points of \ }\Gamma\})/X_{\Gamma}^{\circ}).

If ΓSL2(𝐑)\Gamma\subset\mathrm{SL}_{2}(\mathbf{R}) is a Fuchsian subgroup of the first kind, and k0k\geq 0 is an integer, we recall that a modular form of weight kk is a holomorphic function f(z)f(z) for zz\in\mathfrak{H} such that

f(γ.z)=j(γ,z)kf(z) for all γΓ,j((abcd),z)=cz+df(\gamma.z)=j(\gamma,z)^{k}f(z)\text{\ for all\ }\gamma\in\Gamma,\quad j\left(\begin{pmatrix}a&b\\ c&d\end{pmatrix},z\right)=cz+d

which satisfies a growth condition at the cusps. For each cusp cc of Γ\Gamma there is a parameter qcq_{c} such that a modular form has an expansion at cc: f=a0+a1qc+a2qc2+f=a_{0}+a_{1}q_{c}+a_{2}q_{c}^{2}+.... A cusp form is one that vanishes at every cusp i.e., a0=0a_{0}=0 at every cusp of Γ\Gamma. The space of cusp forms Sk(Γ)S_{k}(\Gamma) is finite dimensional over 𝐂\mathbf{C}. We assume the reader is familiar with the standard subgroups Γ(N),Γ0(N)SL2(𝐙)\Gamma(N),\Gamma_{0}(N)\subset\mathrm{SL}_{2}(\mathbf{Z}) as well as the Hecke operators T(p)T(p) (the latter only defined in the case of arithmetically defined Fuchsian subgroups).

3.2. Modular curves

In our work, we consider those Γ\Gamma which are arithmetically defined subgroups. There are two cases:

  • 1.

    Elliptic modular case. Γ\Gamma is commensurable with SL2(𝐙)\mathrm{SL}_{2}(\mathbf{Z}). Then the cusps of Γ\Gamma is the set of rational numbers and the point \infty.

  • 2.

    Quaternion case. Γ\Gamma is commensurable to the set OB1O_{B}^{1} of norm 1 elements in a maximal order OBO_{B} of an indefinite quaternion algebra BB over 𝐐\mathbf{Q}. That is, we choose once and for all an embedding BM2(𝐑)B\subset M_{2}(\mathbf{R}), which induces an embedding θ:OB1SL2(𝐑)\theta:O_{B}^{1}\subset\mathrm{SL}_{2}(\mathbf{R}). In this case, the set of cusps is empty, so =\mathfrak{H}^{*}=\mathfrak{H}.

In both these cases, Shumura’s theory of canonical models shows that these Riemann surfaces are the 𝐂\mathbf{C}-points of an algebraic curve, denoted XΓX_{\Gamma} defined over a number field kΓk_{\Gamma}. We will use the notation XΓX_{\Gamma} for the corresponding Riemann surface, if there is no ambiguity. If necessary, we use XΓanX^{an}_{\Gamma} to denote the analytic space.

These curves are (in general coarse) moduli spaces. If Γ\Gamma has no torsion, then there are universal families of abelian varieties f:AΓYΓf:A_{\Gamma}\to Y_{\Gamma} with additional structures. In the elliptic modular cases these are families of elliptic curves f:EΓYΓf:E_{\Gamma}\to Y_{\Gamma}. In the quaternion cases, YΓ=XΓY_{\Gamma}=X_{\Gamma}, these are families of 2-dimensional abelian varieties with an action of the quaternion algebra, that is, with a homomorphism

θ:BEnd(A)𝐐\theta:B\to\mathrm{End}(A)_{\mathbf{Q}}

and additional structure.

Example 3.1.

(See [22, 86]). Let

M4=Spec𝐙[i,1/2,σ,(σ(σ41))1].M_{4}=\mathrm{Spec}\,\mathbf{Z}[i,1/2,\sigma,(\sigma(\sigma^{4}-1))^{-1}].

This is the moduli scheme for Γ(4)SL2(𝐙)\Gamma(4)\subset\mathrm{SL}_{2}(\mathbf{Z}). The universal elliptic curve for this is

Eσ:y2=x(x1)(xλ),λ=(σ+σ1)2/4.E_{\sigma}:y^{2}=x(x-1)(x-\lambda),\quad\lambda=(\sigma+\sigma^{-1})^{2}/4.

This curve is isomorphic with the Jacobi quartic

Cσ:y2=(1σ2x2)(1x2/σ2).C_{\sigma}:y^{2}=(1-\sigma^{2}x^{2})(1-x^{2}/\sigma^{2}).

via the change of variables

X=σ2+12σ2xσx1/σ,Y=σ414σ3y(x1/σ)2.X=\frac{\sigma^{2}+1}{2\sigma^{2}}\cdot\frac{x-\sigma}{x-1/\sigma},\ \ Y=\frac{\sigma^{4}-1}{4\sigma^{3}}\cdot\frac{y}{(x-1/\sigma)^{2}}.
Example 3.2.

Let

M3=Spec𝐙[ζ3,1/3,μ,(μ31)1].M_{3}=\mathrm{Spec}\,\mathbf{Z}[\zeta_{3},1/3,\mu,(\mu^{3}-1)^{-1}].

This is the moduli scheme for Γ(3)SL2(𝐙)\Gamma(3)\subset\mathrm{SL}_{2}(\mathbf{Z}). The universal elliptic curve for this is

Eμ:x3+y3+z33μxyz=0.E_{\mu}:x^{3}+y^{3}+z^{3}-3\mu xyz=0.

The quaternion case with D=6D=6 will be discussed in detail in section 7.

3.3. Triangle groups

In this work, we consider those Γ\Gamma that are (or are closely related to) triangle groups. We recall the definition: Let a,b,c𝐙2{}a,b,c\in\mathbf{Z}_{\geq 2}\cup\{\infty\}, with abca\leq b\leq c. Define

χ(a,b,c)=1a+1b+1c1\chi(a,b,c)=\frac{1}{a}+\frac{1}{b}+\frac{1}{c}-1

and refer to (a,b,c)(a,b,c) as spherical, Euclidean, hyperbolic, depending on whether χ(a,b,c)\chi(a,b,c) is >0>0, =0=0 or <0<0 respectively. In each of these three cases we associate a geometry

H={thesphere1(𝐂), if χ(a,b,c)>0;theplane𝐂, if χ(a,b,c)=0;theupperhalfplane, if χ(a,b,c)<0.H=\begin{cases}\mathrm{the\ sphere\ }\mathbb{P}^{1}(\mathbf{C}),\text{\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ if \ }\chi(a,b,c)>0;\\ \mathrm{the\ plane\ }\mathbf{C},\text{\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ if \ }\chi(a,b,c)=0;\\ \mathrm{the\ upper\ half-plane\ }\mathfrak{H},\text{\ \ \ \ if \ }\chi(a,b,c)<0.\end{cases}

Spherical triples: (2,3,3)(2,3,3), (2,3,4)(2,3,4), (2,3,5)(2,3,5), (2,2,c)(2,2,c), for c2c\geq 2. Euclidean triples: (2,2,)(2,2,\infty), (2,3,6)(2,3,6), (2,4,4)(2,4,4), (3,3,3)(3,3,3). The rest are hyperbolic. Among them, there are finitely many arithmetic triangle groups, which were enumerated by Takeuchi, see [89], [90]. To each triple we have the associated triangle group, defined as

Δ(a,b,c)=δa,δb,δcδaa=δbb=δcc=δaδbδc=1.\Delta(a,b,c)=\langle\delta_{a},\delta_{b},\delta_{c}\mid\delta_{a}^{a}=\delta_{b}^{b}=\delta_{c}^{c}=\delta_{a}\delta_{b}\delta_{c}=1\rangle.

We have representation of Δ(a,b,c)\Delta(a,b,c) via isometries of the associated geometry HH as follows. Let TT be a geodesic triangle in HH with angles π/a,π/b,π/c\pi/a,\pi/b,\pi/c. Let τa\tau_{a}, τb\tau_{b}, τc\tau_{c} be reflections in the three sides of TT. The group generated by τa,τb,τc\tau_{a},\tau_{b},\tau_{c} is a discrete group with fundamental domain TT. The subgroup of orientation-preserving isometries is generated by

δa=τcτb,δb=τaτc,δc=τbτa\delta_{a}=\tau_{c}\tau_{b},\quad\delta_{b}=\tau_{a}\tau_{c},\quad\delta_{c}=\tau_{b}\tau_{a}

satisfies the relations of Δ(a,b,c)\Delta(a,b,c) with δp\delta_{p} a counterclockwise rotation at the vertex with angle 2πi/p2\pi i/p. The quotient space X(Δ(a,b,c)):=Δ(a,b,c)\HX(\Delta(a,b,c)):=\Delta(a,b,c)\backslash H is a Riemann orbifold of genus zero. The fundamental domain DΔD_{\Delta} is then a union of TT and a reflected image of TT.

Refer to caption
Figure 1. A normalized triangle and its images under the reflections τa\tau_{a} and τb\tau_{b}, and under the rotations δa=τcτb\delta_{a}=\tau_{c}\tau_{b} and δb=τaτc\delta_{b}=\tau_{a}\tau_{c}. Image from Klug, Musty, Schiavone and Voight, [55].

The relevance of triangle groups for hypergeometric equations arises from the fundamental theorem of H. A. Schwarz [80]. Given two independent solutions y1y_{1} and y2y_{2} to a hypergeometric differential equation, the ratio η=y1/y2\eta=y_{1}/y_{2} maps the complex upper half-plane conformally onto a region TT in the extended complex plane which is bounded by circular arcs. For the function

F12[abc;z]\,{}_{2}F_{1}\left[\begin{matrix}a&b\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &c\end{matrix}\;;\;z\right]

the angles are pπp\pi, qπq\pi, rπr\pi where p=|1c|p=|1-c|, q=|cab|q=|c-a-b|, r=|ab|r=|a-b|. Schwarz observed that the monodromy of the differential equation gave rise to isometries of the associated geometry HH. In some cases, one obtains by analytic continuation a tessellation of HH. For this to extend to a tesselation we must have that p,p,rp,p,r are all in the shape 1/n1/n where n𝐙2{}n\in\mathbf{Z}_{\geq 2}\cup\{\infty\}. We can denote the angles of this triangle by π/a,π/b,π/c\pi/a,\pi/b,\pi/c and classify the triples (a,b,c)(a,b,c) as above. He was especially interested in the spherical case, where the monodromy group is necessarily finite, and he managed to completely classify these.

Given a triangle group Γ=(e0,e1,e)\Gamma=(e_{0},e_{1},e_{\infty}), following Theorem 9 of [92] by Y. Yang we introduce the following hypergeometric parameters:

a=12(11e11e01e),b=12(11e11e0+1e),c=11e0,a=\frac{1}{2}(1-\frac{1}{e_{1}}-\frac{1}{e_{0}}-\frac{1}{e_{\infty}}),\quad b=\frac{1}{2}(1-\frac{1}{e_{1}}-\frac{1}{e_{0}}+\frac{1}{e_{\infty}}),\quad c=1-\frac{1}{e_{0}},

and

a~=12(11e1+1e01e),b~=12(11e1+1e0+1e),c~=1+1e0.\tilde{a}=\frac{1}{2}(1-\frac{1}{e_{1}}+\frac{1}{e_{0}}-\frac{1}{e_{\infty}}),\quad\tilde{b}=\frac{1}{2}(1-\frac{1}{e_{1}}+\frac{1}{e_{0}}+\frac{1}{e_{\infty}}),\quad\tilde{c}=1+\frac{1}{e_{0}}.

Using these Yang wrote down an explicit basis for Sk(Γ)S_{k}(\Gamma) in terms of F12[abc;t]\,{}_{2}F_{1}\left[\begin{matrix}a&b\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &c\end{matrix}\;;\;t\right] and F12[a~b~c~;t]\,{}_{2}F_{1}\left[\begin{matrix}\tilde{a}&\tilde{b}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &\tilde{c}\end{matrix}\;;\;t\right].

Refer to caption
Figure 2. Tesselation of the hyperbolic disk belonging to (4,4,4)(4,4,4) that appears in Gauss’s collected works volume 8.

There are now extensive resources on the web for exploring hyperbolic tesselations. This had several major later developments. On the one hand, Poincaré extended this analysis to conformal maps given by second-order linear differential equations with nn regular singular points. The conformal image is an nn-gon bounded by circular arcs. By studying the transformations of these under analytic continuation, Poincaré made the link with non-Euclidean geometry and initiated the modern theories of hyperbolic geometry and automorphic forms. For hypergeometric differential equations of several variables, including those studied by Picard, and Appell and Lauricella, see the works of Deligne and Mostow, [21].

4. Local systems.

4.1. Complex analytic.

Riemann introduced the idea of monodromy into the study of analytic differential equations. Given a representation of the fundamental group

ρ:π1(1(𝐂){0,1,},x)GL2(𝐂)\rho:\pi_{1}(\mathbb{P}^{1}(\mathbf{C})-\{0,1,\infty\},x)\to\mathrm{GL}_{2}(\mathbf{C})

there is a unique second order rational differential equation with regular singular points z=0,1,z=0,1,\infty with the property that if f1(z),f2(z)f_{1}(z),f_{2}(z) is a basis of holomorphic solutions at xx then analytic continuation around a loop γπ1(1(){0,1,},x)\gamma\in\pi_{1}(\mathbb{P}^{1}(\mathbb{C})-\{0,1,\infty\},x) yields the linear transformation

(f1f2)ρ(γ)(f1f2).\begin{pmatrix}f_{1}\\ f_{2}\end{pmatrix}\to\rho(\gamma)\begin{pmatrix}f_{1}\\ f_{2}\end{pmatrix}.

Since

π1(1(𝐂){0,1,},x)=γ0,γ1,γγ0γ1γ=1,\pi_{1}(\mathbb{P}^{1}(\mathbf{C})-\{0,1,\infty\},x)=\langle\gamma_{0},\gamma_{1},\gamma_{\infty}\mid\gamma_{0}\gamma_{1}\gamma_{\infty}=1\rangle,

to give the monodromy representation is equivalent to giving two-by-two matrices T0,T1,TT_{0},T_{1},T_{\infty} such that T0T1T=1T_{0}T_{1}T_{\infty}=1. These are well-defined up to simultaneous conjugation by an element of GL2(𝐂)\mathrm{GL}_{2}(\mathbf{C}). As Katz observes, Riemann proved a stronger result. Namely, it suffices to give the Jordan canonical forms of T0,T1,TT_{0},T_{1},T_{\infty} to reconstruct the hypergeometric differential equation. Actually Riemann only considered the case when these were semisimple, so equivalent to diagonal matrices, with eigenvalues exp(2πiα)\exp(2\pi i\alpha) and exp(2πiα)\exp(2\pi i\alpha^{\prime}); he called the α,α\alpha,\alpha^{\prime} the exponents at the singular point. They are well-defined up to permutation and adding 𝐙\mathbf{Z}. This stronger property, that a differential equation is determined by the Jordan forms of the monodromies at the singular points, is what Katz ([53]) calls rigidity. Rigidity plays an important role in our work.

If XX is a connected nonsingular algebraic variety over 𝐂\mathbf{C} and

ρ:π1(Xan,x)GLn(𝐂)\rho:\pi_{1}(X^{an},x)\to\mathrm{GL}_{n}(\mathbf{C})

is a representation, we get a local system V on the analytic space XanX^{an}. By the Riemann-Hilbert correspondence, this is the solution sheaf to a differential equation, unique up to isomorphism,

:𝒱ΩX/𝐂1𝒪X𝒱\nabla:\mathcal{V}\to\Omega^{1}_{X/\mathbf{C}}\otimes_{\mathcal{O}_{X}}\mathcal{V}

with regular singular points at infinity ([16], [51]). If XX is a nonsingular algebraic variety over 𝐂\mathbf{C} we let Xan=X(𝐂)X^{an}=X(\mathbf{C}) be the set of complex points with the classical topology. Assume that XX is quasi-projective and connected. Recall the following dictionary: The following categories are equivalent:

  • 1.

    Local systems of finite-dimensional 𝐂\mathbf{C}-vector spaces V on XanX^{an}.

  • 2.

    Representations ρ:π1(Xan,x)GL(V)\rho:\pi_{1}(X^{an},x)\to\mathrm{GL}(V) on finite-dimensional 𝐂\mathbf{C}-vector spaces VV.

  • 3.

    Holomorphic integrable connections

    :𝒱anΩXan/𝐂1𝒪Xan𝒱an\nabla:\mathcal{V}^{an}\to\Omega^{1}_{X^{an}/\mathbf{C}}\otimes_{\mathcal{O}_{X^{an}}}\mathcal{V}^{an}

    where 𝒱an\mathcal{V}^{an} is a locally free 𝒪Xan\mathcal{O}_{X}^{an}-module of finite rank.

  • 4.

    Integrable algebraic connections

    :𝒱ΩX/𝐂1𝒪X𝒱\nabla:\mathcal{V}\to\Omega^{1}_{X/\mathbf{C}}\otimes_{\mathcal{O}_{X}}\mathcal{V}

    where 𝒱\mathcal{V} is a locally free 𝒪X\mathcal{O}_{X}-module of finite rank, and which have regular singular points “at infinity”.

4.2. \ell-adic.

A reference: [18], [20]. Fix a prime number \ell. In this section: scheme = a separated noetherian scheme on which \ell is invertible. We are interested in constructible 𝐐¯\bar{\mathbf{Q}}_{\ell}-sheaves on XX, in particular, those that are lisse. In this section: the étale topology is understood. An \ell-adic representation of a profinite group π\pi on a 𝐐¯\bar{\mathbf{Q}}_{\ell}-vector space VV is a homomorphism

σ:πGL(V)\sigma:\pi\to\mathrm{GL}(V)

such that there is a finite subextension E/𝐐E/\mathbf{Q}_{\ell} and an EE-structure VEV_{E} on VV such that σ\sigma factorizes in a continuous homomorphism πGL(VE)\pi\to\mathrm{GL}(V_{E}). Recall that a geometric point x¯\bar{x} of a scheme XX is a morphism of the spectrum of an algebraically closed field denoted k(x¯)k(\bar{x}) to XX. It is localized in xXx\in X if its image is xx. If XX is connected and pointed by a geometric point x¯\bar{x}, the functor

the π1(X,x¯) module x¯\mathcal{F}\mapsto\text{the \ }\pi_{1}(X,\bar{x})-\text{\ module\ }\mathcal{F}_{\bar{x}}

is an equivalence between the categories of

  • 1.

    lisse 𝐐¯\bar{\mathbf{Q}}_{\ell}-sheaves on XX;

  • 2.

    \ell-adic continuous representations of π1(X,x¯)\pi_{1}(X,\bar{x}).

Here π1(X,x¯)\pi_{1}(X,\bar{x}) is Grothendieck’s fundamental group. Especially if X=Spec(k)X=\mathrm{Spec}(k) is a field, the category of lisse 𝐐¯\bar{\mathbf{Q}}_{\ell}-sheaves on XX is equivalent to the category of \ell-adic representations of Gal(k¯/k)\mathrm{Gal}(\bar{k}/k).

If one is working in the category of schemes XX of finite type over a perfect field kk with algebraic closure k¯\bar{k}, and xX(k)x\in X(k) is a kk-rational point, by convention we let x¯\bar{x} be the geometric point of XX which is the composite Spec(k¯)Spec(k)X\mathrm{Spec}(\bar{k})\to\mathrm{Spec}(k)\to X.

More generally, let SS be an irreducible scheme of finite type over a field kk. Then a lisse sheaf 𝒱\mathcal{V} on SS is equivalent to a representation

ρ:π1(S,η¯)GL(𝒱η¯)GL(n,𝐐¯)\rho:\pi_{1}(S,\bar{\eta})\to GL(\mathcal{V}_{\bar{\eta}})\sim GL(n,\bar{\mathbf{Q}}_{\ell})

where η¯\bar{\eta} is a geometric generic point. That is, η¯=Spec(k(S)¯)\bar{\eta}=\mathrm{Spec}(\overline{k(S)}) where η\eta is the generic point of SS, and k(S)=𝒪S,ηk(S)=\mathcal{O}_{S,\eta} is the function field of SS.

Recall that there is a canonical exact sequence

0π1(Sk¯,η¯)π1(S,η¯)Gal(k¯/k)0.\begin{CD}0@>{}>{}>\pi_{1}(S_{\bar{k}},\bar{\eta})@>{}>{}>\pi_{1}(S,\bar{\eta})@>{}>{}>\mathrm{Gal}(\bar{k}/k)@>{}>{}>0.\end{CD}

Moreover if kk has characteristic 0 and we choose an embedding k𝐂k\hookrightarrow\mathbf{C}, π1(Sk¯,η¯)\pi_{1}(S_{\bar{k}},\bar{\eta}) is isomorphic to the profinite completion of the Poincaré fundamental group, π1(San,x)\pi_{1}(S^{an},x) so this includes the geometric monodromy of \mathcal{F}.

Now let k=𝐅qk=\mathbf{F}_{q} be a be a finite field of characteristic p>0p>0. If X0X_{0} is a scheme of finite type, we define X=X0𝐅q𝐅¯qX=X_{0}\otimes_{\mathbf{F}_{q}}\bar{\mathbf{F}}_{q} and recall that there is a morphism F:XXF:X\to X which sends a point with coordinates xx to the point with coordinates xqx^{q}. The set of closed points |X||X| is canonically identified with X0(𝐅¯q)X_{0}(\bar{\mathbf{F}}_{q}), and the Frobenius fixed points |X|F=X(𝐅q)|X|^{F}=X({\mathbf{F}_{q}}). The set of closed points |X0||X_{0}| is isomorphic to the set of orbits |X|F|X|_{F} of FF; for x0|X0|x_{0}\in|X_{0}| the size of the corresponding orbit ZZ is deg(x0)\mathrm{deg}(x_{0}), which is also the degree of the extension field k(x)/𝐅qk(x)/\mathbf{F}_{q}. If 0\mathcal{F}_{0} is a constructible 𝐐¯\bar{\mathbf{Q}}_{\ell}-sheaf on X0X_{0} for the étale topology, we let (X,)(X,\mathcal{F}) be deduced from (X0,0)(X_{0},\mathcal{F}_{0}) by extension to the algebraic closure. If x¯|X|\bar{x}\in|X| is a geometric point localized in x|X0|x\in|X_{0}|, we define

Frobx:=Frobqdeg(x)Gal(k(x¯)/k(x))=Gal(𝐅¯q/𝐅q).\mathrm{Frob}_{x}:=\mathrm{Frob}_{q^{\mathrm{deg}(x)}}\in\mathrm{Gal}(k(\bar{x})/k(x))=\mathrm{Gal}(\bar{\mathbf{F}}_{q}/\mathbf{F}_{q}).

The stalk x¯\mathcal{F}_{\bar{x}} is a 𝐐¯\bar{\mathbf{Q}}_{\ell}-vector space of finite dimension on which Gal(k(x¯)/k(x))\mathrm{Gal}(k(\bar{x})/k(x)) acts and therefore

det(1t.Frobxx¯)𝐐¯[t]\det\left(1-t.\mathrm{Frob}_{x}\mid\mathcal{F}_{\bar{x}}\right)\in\bar{\mathbf{Q}}_{\ell}[t]

is defined. It is independent of the geometric point x¯\bar{x} localized in xx, so we can simply denote it by det(1t.Frobx0)\det\left(1-t.\mathrm{Frob}_{x}\mid\mathcal{F}_{0}\right). The coefficient of t-t in this polynomial is the trace Tr(Frobx0)\mathrm{Tr}(\mathrm{Frob}_{x}\mid\mathcal{F}_{0}). The Grothendieck-Lefschetz trace formula reads:

(2) i=02d(1)iTr(FrobqHci(X,))=xX(𝐅q)Tr(Frobx0),\sum_{i=0}^{2d}(-1)^{i}\mathrm{Tr}(\mathrm{Frob}_{q}\mid H^{i}_{c}(X,\mathcal{F}))=\sum_{x\in X(\mathbf{F}_{q})}\mathrm{Tr}(\mathrm{Frob}_{x}\mid\mathcal{F}_{0}),

where dd is the dimension of XX and HciH^{i}_{c} is cohomology with proper support. We define the LL-function

Z(X0,0,t):=x|X0|det(1tdeg(x)Frobx0)1.Z(X_{0},\mathcal{F}_{0},t):=\prod_{x\in|X_{0}|}\det\left(1-t^{\mathrm{deg}(x)}\mathrm{Frob}_{x}\mid\mathcal{F}_{0}\right)^{-1}.

The trace formula is equivalent to the factorization

Z(X0,0,t):=i=02ddet(1tFrobqHci(X,))i+1.Z(X_{0},\mathcal{F}_{0},t):=\prod_{i=0}^{2d}\det\left(1-t\mathrm{Frob}_{q}\mid H^{i}_{c}(X,\mathcal{F})\right)^{i+1}.

4.3. Families of varieties.

Let f:XSf:X\to S be a proper smooth morphism of algebraic varieties. Then we get local systems:

  • 1.

    Over 𝐂\mathbf{C}: Rif𝐂=Ker(an)R^{i}f_{*}\mathbf{C}=\mathrm{Ker}(\nabla^{an}) where

    :HdRi(X/S)ΩS/𝐂1𝒪SHdRi(X/S),HdRi(X/S):=𝐑ifΩX/S\nabla:H^{i}_{dR}(X/S)\to\Omega^{1}_{S/\mathbf{C}}\otimes_{\mathcal{O}_{S}}H^{i}_{dR}(X/S),\quad H^{i}_{dR}(X/S):=\mathbf{R}^{i}f_{*}\Omega^{\bullet}_{X/S}

    is the Gauss-Manin connection on the relative deRham cohomology. Then (Rif𝐂)s=Hi(Xsan,𝐂)(R^{i}f_{*}\mathbf{C})_{s}=H^{i}(X_{s}^{an},\mathbf{C}). This has regular singular points (Griffiths; see [49]).

  • 2.

    \ell-adic: Rif𝐐R^{i}f_{*}\mathbf{Q}_{\ell} is a lisse 𝐐\mathbf{Q}_{\ell}-sheaf on the étale topology of SS. The stalk in a geometric point (Rif𝐐)s¯=Heti(Xsκ(s)κ(s¯),𝐐)(R^{i}f_{*}\mathbf{Q}_{\ell})_{\bar{s}}=H^{i}_{et}(X_{s}\otimes_{\kappa(s)}\kappa(\bar{s}),\mathbf{Q}_{\ell}), which has an action of the Galois group Gal(κ(s¯)/κ(s))\mathrm{Gal}(\kappa(\bar{s})/\kappa(s)).

4.4. Example: Legendre curve.

Let S=Spec𝐙[1/2,λ,(λ(1λ))1]S=\mathrm{Spec}\,\mathbf{Z}[1/2,\lambda,(\lambda(1-\lambda))^{-1}]. This is the open subset

𝐙[1/2]1{0,1,}𝐙[1/2]1.\mathbb{P}^{1}_{\mathbf{Z}[1/2]}-\{0,1,\infty\}\subset\mathbb{P}^{1}_{\mathbf{Z}[1/2]}.

For each λ\lambda we let EλE_{\lambda} be the projective nonsingular model of the affine curve

y2=x(x1)(xλ).y^{2}=x(x-1)(x-\lambda).

This is an elliptic curve with origin at infinity (x,y,z)=(0,1,0)(x,y,z)=(0,1,0) in the projective plane. We get a proper smooth morphism f:ESf:E\to S defined over 𝐙[1/2]\mathbf{Z}[1/2] with these fibers.

The local system over 𝐂\mathbf{C} topologically is the flat bundle of H1(Eλ,𝐂)H^{1}(E_{\lambda},\mathbf{C}) for λ1(𝐂){0,1,}\lambda\in\mathbb{P}^{1}(\mathbf{C})-\{0,1,\infty\}. This latter is the sphere with 3 points removed. We have

π1(1(𝐂){0,1,},λ0)γ0,γ1,γγ0γ1γ=1,\pi_{1}(\mathbb{P}^{1}(\mathbf{C})-\{0,1,\infty\},\lambda_{0})\cong\langle\gamma_{0},\gamma_{1},\gamma_{\infty}\mid\gamma_{0}\gamma_{1}\gamma_{\infty}=1\rangle,

isomorphic to a free group on 2 generators. We can represent these by loops starting at the base-point λ0\lambda_{0} and circling once around 0,1,0,1,\infty, respectively. The monodromy can be represented topologically by following the generators of H1(Eλ,𝐙)𝐙2H^{1}(E_{\lambda},\mathbf{Z})\sim\mathbf{Z}^{2} as λ\lambda moves along these loops. This can be viewed by analytically continuing the period matrix of EλE_{\lambda} relative to a basis of differential forms. The deRham cohomology HdR1(E/S)H^{1}_{dR}(E/S) is a free 𝒪S\mathcal{O}_{S}-module of rank 2. We can take as basis

ω1=dxy,ω2=(ddλ)ω1=dx2(xλ)y.\omega_{1}=\frac{dx}{y},\quad\omega_{2}=\nabla\left(\frac{d}{d\lambda}\right)\omega_{1}=\frac{dx}{2(x-\lambda)y}.

which is a basis of meromorphic 1-forms of the first and second kind modulo exact forms.

We have the following equation in HdR1(E/S)H^{1}_{dR}(E/S):

(ddλ)(ω1ω2)=1λ(1λ)(0λ(1λ)1/42λ1)(ω1ω2)mod exact forms.\nabla\left(\frac{d}{d\lambda}\right)\begin{pmatrix}\omega_{1}\\ \omega_{2}\end{pmatrix}=\frac{1}{\lambda(1-\lambda)}\begin{pmatrix}0&\lambda(1-\lambda)\\ 1/4&2\lambda-1\end{pmatrix}\begin{pmatrix}\omega_{1}\\ \omega_{2}\end{pmatrix}\quad\text{mod exact forms.}

Note that HdR1(Ean/San)=0=R1f𝐙𝐂H^{1}_{dR}(E^{an}/S^{an})^{\nabla=0}=R^{1}f_{*}\mathbf{Z}\otimes\mathbf{C}, where R1f𝐙R^{1}f_{*}\mathbf{Z} is a local system of free 𝐙\mathbf{Z}-modules of rank 2 on San=1(𝐂){0,1,}S^{an}=\mathbb{P}^{1}(\mathbf{C})-\{0,1,\infty\} with a nondegenerate symplectic pairing on it. The dual local system R1f𝐙R_{1}f_{*}\mathbf{Z} is the homology local system: H1(Eλan,𝐙)H_{1}(E^{\mathrm{an}}_{\lambda},\mathbf{Z}). For any local horizontal section γ\gamma of the dual R1f𝐙R_{1}f_{*}\mathbf{Z} of R1f𝐙R^{1}f_{*}\mathbf{Z}, it follows easily from this that the period f(λ)=γ𝑑x/yf(\lambda)=\int_{\gamma}dx/y is a solution to the differential equation

λ(1λ)f′′+(12λ)ff/4=0,\lambda(1-\lambda)f^{\prime\prime}+(1-2\lambda)f^{\prime}-f/4=0,

which is the differential equation satisfied by F12[1/21/21;λ]\,{}_{2}F_{1}\left[\begin{matrix}1/2&1/2\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &1\end{matrix}\;;\;\lambda\right]. In fact,

F12[1/21/21;λ]=1π1dxy.\,{}_{2}F_{1}\left[\begin{matrix}1/2&1/2\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &1\end{matrix}\;;\;\lambda\right]=\frac{1}{\pi}\int_{1}^{\infty}\frac{dx}{y}.

We view a small loop around the interval [1,][1,\infty] as topological cycle γ\gamma on the Riemann surface EλE_{\lambda} which is a branched cover of 1(𝐂)\mathbb{P}^{1}(\mathbf{C}) of degree 2.

The monodromy group of this differential equation is projectively equivalent to the principal congruence subgroup Γ(2)SL2(𝐙)\Gamma(2)\subset\mathrm{SL}_{2}(\mathbf{Z}). In a suitable basis (we can take so-called vanishing cycles at 11 and 0) γ,δ\gamma,\delta of H1(Et,𝐙)H^{1}(E_{t},\mathbf{Z}) the monodromy matrices are

T0=(1201),T1=(1021).T_{0}=\begin{pmatrix}1&2\\ 0&1\end{pmatrix},\quad T_{1}=\begin{pmatrix}1&0\\ -2&1\end{pmatrix}.
Refer to caption
Refer to caption
Figure 3. The Riemann surface EtE_{t} is a double cover of this with branch cuts as shown.

These generate a subgroup Γ\Gamma of index 2 inside Γ(2)\Gamma(2) acting on the upper half plane with quotient

Γ\=Γ(2)\1(𝐂){0,1,},τλ(τ).\Gamma\backslash\mathfrak{H}=\Gamma(2)\backslash\mathfrak{H}\overset{\sim}{\longrightarrow}\mathbb{P}^{1}(\mathbf{C})-\{0,1,\infty\},\quad\tau\mapsto\lambda(\tau).

This map is induced by λ:𝐂{}\lambda:\mathfrak{H}\to\mathbf{C}\cup\{\infty\} where λ(τ)\lambda(\tau) is a generator of the field of modular functions for Γ(2)\Gamma(2). The inverse of this is the multivalued function on 1(𝐂){0,1,}\mathbb{P}^{1}(\mathbf{C})-\{0,1,\infty\} given by the period ratio τ(λ)=αω1/βω1\tau(\lambda)=\int_{\alpha}\omega_{1}/\int_{\beta}\omega_{1} of two independent solutions to this differential equation. The above quotient space, denoted M2M_{2}, is the coarse moduli space of elliptic curves with a level 2 structure.

The \ell-adic local system R1f𝐐R^{1}f_{*}\mathbf{Q}_{\ell} gives zeta functions of the elliptic curves in the family. For instance, let λ𝐅p\lambda\in\mathbf{F}_{p} for an odd prime pp, λ0,1\lambda\neq 0,1. Then this gives a geometric point λ¯\bar{\lambda} of the scheme SS, and we can identify the fiber

(R1f𝐐)λ¯=H1(Eλκ(λ¯),𝐐),Gal(κ(λ¯)/κ(λ))=Gal(𝐅¯p/𝐅p)=Frobp(R^{1}f_{*}\mathbf{Q}_{\ell})_{\bar{\lambda}}=H^{1}(E_{\lambda}\otimes\kappa(\bar{\lambda}),\mathbf{Q}_{\ell}),\quad\mathrm{Gal}(\kappa(\bar{\lambda})/\kappa(\lambda))=\mathrm{Gal}(\bar{\mathbf{F}}_{p}/\mathbf{F}_{p})=\langle\mathrm{Frob}_{p}\rangle

which is the dual of the Tate module of the curve EλE_{\lambda}. Then

#Eλ(𝐅p)=p+1Tr(Frobp(R1f𝐐)λ¯)\#E_{\lambda}(\mathbf{F}_{p})=p+1-\mathrm{Tr}(\mathrm{Frob}_{p}\mid(R^{1}f_{*}\mathbf{Q}_{\ell})_{\bar{\lambda}})
Refer to caption
Figure 4. (,,)(\infty,\infty,\infty) hyperbolic tessellation of the unit disk 𝔇\mathfrak{D} corresponding to Γ(2)\Gamma(2). Image thanks to Fricke -Klein.

4.5. Example: Generalized Legendre curves.

For any ring RR, we denote by λ\lambda the coordinate on 𝔸R1\mathbb{A}^{1}_{R} and by SS the open set where λ(1λ)\lambda(1-\lambda) is invertible. Given any elements a,b,cRa,b,c\in R we denote by E(a,b,c)E(a,b,c) the free 𝒪S\mathcal{O}_{S}-module of rank 2 with basis e0e_{0}, e1e_{1}, and integrable RR-connection

(ddλ)(e0)=\displaystyle\nabla\left(\frac{d}{d\lambda}\right)(e_{0})= e1\displaystyle e_{1}
(ddλ)(e1)=\displaystyle\nabla\left(\frac{d}{d\lambda}\right)(e_{1})= (c(a+b+1)λ)λ(1λ)e1+abλ(1λ)e0.\displaystyle-\frac{(c-(a+b+1)\lambda)}{\lambda(1-\lambda)}e_{1}+\frac{ab}{\lambda(1-\lambda)}e_{0}.

Horizontal sections of the dual of E(a,b,c)E(a,b,c) over an open set USU\subset S can be identified with fΓ(U,𝒪U)f\in\Gamma(U,\mathcal{O}_{U}) which satisfy the differential equation

λ(1λ)(ddλ)2f+(c(a+b+1)λ)dfdλabf=0.{\lambda(1-\lambda)}\left(\frac{d}{d\lambda}\right)^{2}f+(c-(a+b+1)\lambda)\frac{df}{d\lambda}-abf=0.

These hypergeometric equations are two-dimensional factors of the the cohomology of the family of curves yN=xa(x1)b(xλ)cy^{N}=x^{a}(x-1)^{b}(x-\lambda)^{c}. In effect, the Euler integral representation

F(α,β;γ;λ)=Γ(γ)Γ(β)Γ(βγ)1xαγ(x1)γβ1(xλ)α𝑑xF(\alpha,\beta;\gamma;\lambda)=\frac{\Gamma(\gamma)}{\Gamma(\beta)\Gamma(\beta-\gamma)}\int_{1}^{\infty}x^{\alpha-\gamma}(x-1)^{\gamma-\beta-1}(x-\lambda)^{-\alpha}dx

shows that the solutions to the differential equation are given by periods of those curves. Given integers N,a,b,cN,a,b,c greater that zero, let Y(N;a,b,c)λY(N;a,b,c)_{\lambda} be the nonsingular projective model of the affine curve in (x,y)(x,y)-space defined by the equation yN=xa(x1)b(xλ)cy^{N}=x^{a}(x-1)^{b}(x-\lambda)^{c}. We consider this as a family of curves

f:Y(N;a,b,c)S:=R1{0,1,},R=RN=𝐙[1/N,ζN].f:Y(N;a,b,c)\to S:=\mathbb{P}^{1}_{R}-\{0,1,\infty\},\quad R=R_{N}=\mathbf{Z}[1/N,\zeta_{N}].

We have the Gauss-Manin connection

:HdR1(Y(N;a,b,c)/S)ΩS/R1𝒪SHdR1(Y(N;a,b,c)/S).\nabla:H^{1}_{dR}(Y(N;a,b,c)/S)\to\Omega^{1}_{S/R}\otimes_{\mathcal{O}_{S}}H^{1}_{dR}(Y(N;a,b,c)/S).

The following theorem gives the structure of this, at least in the generic fiber Spec(𝐂(λ))U\mathrm{Spec}(\mathbf{C}(\lambda))\hookrightarrow U. Choose an embedding R𝐂R\to\mathbf{C}. Let

X(N;a,b,c)=Spec𝐂(λ)[x,y,1/y]/(yNxa(x1)b(xλ)c).X(N;a,b,c)=\mathrm{Spec}\ \mathbf{C}(\lambda)[x,y,1/y]/(y^{N}-x^{a}(x-1)^{b}(x-\lambda)^{c}).

This is the open affine subset where yy is invertible. It is affine and smooth of relative dimension one over 𝐂(λ)\mathbf{C}(\lambda). The map (x,y)x(x,y)\to x is a finite étale covering

π:X(N;a,b,c)𝔸𝐂(λ)1{0,1,λ}.\pi:X(N;a,b,c)\to\mathbb{A}^{1}_{\mathbf{C}(\lambda)}-\{0,1,\lambda\}.

For any root of unity ξμn\xi\in\mu_{n} there is an automorphism of X(N;a,b,c)X(N;a,b,c) given by (x,y)(x,ξy)(x,y)\mapsto(x,\xi y). This gives the Galois group of the covering π\pi. Note that the dx/ymdx/y^{m} defines an element in the character eigenspace HDR1(X(N;a,b,c)/𝐂(λ))χ(m)H^{1}_{DR}(X(N;a,b,c)/\mathbf{C}(\lambda))^{\chi(-m)} where χ(t)(ξ)=ξt\chi(t)(\xi)=\xi^{t}.

Proposition 4.1.

([50, 6.8.6]) Suppose that NN does not divide a,b,c,a+b+ca,b,c,a+b+c. Then for any integer k1k\geq 1 which is invertible modulo NN the map

e0\displaystyle e_{0} theclassofdxyk\displaystyle\mapsto\mathrm{the\ class\ of\ \ }\frac{dx}{y^{k}}
e1\displaystyle e_{1} (ddλ)(theclassofdxyk)\displaystyle\mapsto\nabla\left(\frac{d}{d\lambda}\right)(\mathrm{the\ class\ of\ \ }\frac{dx}{y^{k}})

induces an isomorphism

E(kcN,k(a+b+c)N1,k(a+c)N)HdR1(X(N;a,b,c)/𝐂(λ))χ(k).E\left(\frac{kc}{N},\frac{k(a+b+c)}{N}-1,\frac{k(a+c)}{N}\right)\cong H^{1}_{dR}(X(N;a,b,c)/\mathbf{C}(\lambda))^{\chi(-k)}.

This gives only the part of the cohomology belonging to primitive characters modulo NN. For nn prime to NN, we have HdR1(X(N;a,b,c)/𝐂(λ))χ(n)=HdR1(Y(N;a,b,c)/𝐂(λ))χ(n)H^{1}_{dR}(X(N;a,b,c)/\mathbf{C}(\lambda))^{\chi(n)}=H^{1}_{dR}(Y(N;a,b,c)/\mathbf{C}(\lambda))^{\chi(n)}. The local system R1f𝐂R^{1}f_{*}\mathbf{C} on SanS^{an} underlies a polarized variation of Hodge structures of weight 1. The modules with connection HdR1(X(n;a,b,c)/𝐂(λ))χ(n)H^{1}_{dR}(X(n;a,b,c)/\mathbf{C}(\lambda))^{\chi(n)} correspond to the rank 2 local system (R1f𝐂)χ(n)(R^{1}f_{*}\mathbf{C})^{\chi(n)}. The (R1f𝐂)χ(n)(R^{1}f_{*}\mathbf{C})^{\chi(n)} does not correspond to a variation of Hodge structures unless the character χ(n)\chi(n) is real.

The eigenspaces (R1f𝐐¯)χ(n)(R^{1}f_{*}\bar{\mathbf{Q}}_{\ell})^{\chi(n)} then give the \ell-adic realization, where f:Y(N;a,b,c)Sf:Y(N;a,b,c)\to S is as before but now as schemes over SS for the étale topology.

These results are generalized to families of curves of the form

yN=xa(x1)b(xλ1)c1.(xλr)cr.y^{N}=x^{a}(x-1)^{b}(x-\lambda_{1})^{c_{1}}....(x-\lambda_{r})^{c_{r}}.

in [44]. One gets a regular holonomic system of partial differential equations in the variables λ1,,λr\lambda_{1},...,\lambda_{r}.

5. Hypergeometric motives

In this paper, the word motive (really: motivic sheaf) is used informally. These concepts can be rigorously applied utilizing results of Arapura [4], [5], and Ayoub [7], [8]. For our purpose, the motives on a scheme XX will be viewed as giving

  • 1.

    a constructible sheaf 𝐂\mathcal{F}_{\mathbf{C}} of 𝐂\mathbf{C}-vector spaces on the analytic space Xan=X(𝐂)X^{an}=X(\mathbf{C});

  • 2.

    a constructible 𝐐¯\bar{\mathbf{Q}}_{\ell}- sheaf \mathcal{F}_{\ell} for the étale topology on XX, for a fixed prime number \ell.

In this section we will describe the motivic sheaves attached to hypergeometric data. We utilize the formalism of hypergeometric motives as developed by Roberts and Rodriguez Villegas, see [76]. A hypergeometric datum HDHD is a pair of multi-sets α={a1,,an},β={b1=1,b2,,bn}\alpha=\{a_{1},\cdots,a_{n}\},\beta=\{b_{1}=1,b_{2},\cdots,b_{n}\} with ai,bj𝐐a_{i},b_{j}\in\mathbf{Q}. A datum HD={α={a1,,an},β={b1=1,b2,,bn}}HD=\left\{\alpha=\{a_{1},\cdots,a_{n}\},\beta=\{b_{1}=1,b_{2},\cdots,b_{n}\}\right\} with ai,bj𝐐×a_{i},b_{j}\in\mathbf{Q}^{\times} is said to be defined over 𝐐\mathbf{Q} if the set of column vectors {(a1b1),,(anbn)}\left\{\begin{pmatrix}a_{1}\\ b_{1}\end{pmatrix},\cdots,\begin{pmatrix}a_{n}\\ b_{n}\end{pmatrix}\right\} mod 𝐙\mathbf{Z} is invariant under multiplication by all r(𝐙/M𝐙)×r\in(\mathbf{Z}/M\mathbf{Z})^{\times}, where M=M(HD)M=M(HD), called the level of HDHD, is the least common denominator of a1,,an,b1,,bna_{1},...,a_{n},b_{1},...,b_{n}. See Definition 1 in [63, 2.2]. We mainly consider primitive hypergeometric data HDHD, namely aibj𝐙a_{i}-b_{j}\notin\mathbf{Z} for all i,ji,j.

Let RM=𝐙[ζM,1/M]R_{M}=\mathbf{Z}[\zeta_{M},1/M]. There is a motive (HD)\mathcal{H}(HD) on 𝐆m,RM=SpecRM[x,x1]\mathbf{G}_{m,R_{M}}=\mathrm{Spec}\,R_{M}[x,x^{-1}] which forms a local system of rank nn and is pure of weight n1n-1 on 𝐆m,RM{1}=RM1{0,1,}\mathbf{G}_{m,R_{M}}-\{1\}=\mathbb{P}^{1}_{R_{M}}-\{0,1,\infty\}.

5.1. Over 𝐂\mathbf{C}.

Let Γ(x)\Gamma(x) denote the Gamma function. For k𝐙k\in\mathbf{Z} and a𝐂a\in\mathbf{C}, define the Pochhammer symbol (a)k:=Γ(a+k)/Γ(a)(a)_{k}:=\Gamma(a+k)/\Gamma(a). Given HD={α,β}HD=\{\alpha,\beta\} we associate the hypergeometric function in the variable t𝐂t\in\mathbf{C}:

F(α,β;t)=nFn1[a1a2anb2bn;t]:=k0(a1)k(an)k(b1)k(bn)ktk.\displaystyle F(\alpha,\beta;t)=\,_{n}F_{n-1}\left[\begin{matrix}a_{1}&a_{2}&\cdots&a_{n}\\ &b_{2}&\cdots&b_{n}\end{matrix}\;;\;t\right]:=\sum_{k\geq 0}\frac{(a_{1})_{k}\cdots(a_{n})_{k}}{(b_{1})_{k}\cdots(b_{n})_{k}}t^{k}.

It satisfies the Fuchsian ordinary differential equation with three regular singular points at 0,1,0,1,\infty:

[θ(θ+b21)(θ+bn1)t(θ+a1)(θ+an)]F=0,whereθ:=tddt.\left[\theta\left(\theta+b_{2}-1\right)\cdots\left(\theta+b_{n}-1\right)-t\left(\theta+a_{1}\right)\cdots\left(\theta+a_{n}\right)\right]F=0,\quad\text{where}~{}\theta:=t\frac{d}{dt}.

The local solutions of this equation form a rank-nn local system, denoted by (HD)𝐂\mathcal{H}(HD)_{\mathbf{C}}. The local exponents of the above hypergeometric differential equation are

(3) 0,1b2,,1bn at t=0,a1,a2,,an at t=,0,1,2,,n2,γ at t=1,\begin{split}0,1-b_{2},\cdots,1-b_{n}&\quad\text{ at }t=0,\\ a_{1},\,a_{2},\,\cdots,\,a_{n}&\quad\text{ at }t=\infty,\\ 0,1,2,\cdots,n-2,\gamma&\quad\text{ at }t=1,\end{split}

respectively, where

(4) γ=1+j=1nbjj=1naj.\gamma=-1+\sum_{j=1}^{n}b_{j}-\sum_{j=1}^{n}a_{j}.

See [13, 87] for more details.

5.2. Over a finite field.

Let 𝐅q\mathbf{F}_{q} be a finite field of odd characteristic and use 𝐅q×^\widehat{\mathbf{F}_{q}^{\times}} to denote the group of multiplicative characters of 𝐅q×\mathbf{F}_{q}^{\times}. We use εq\varepsilon_{q} or simply ε\varepsilon to denote the trivial character, and ϕq\phi_{q} or ϕ\phi to denote the quadratic character. For any A𝐅q×^A\in\widehat{\mathbf{F}_{q}^{\times}}, use A¯\overline{A} to denote its inverse, and extend AA to 𝐅q\mathbf{F}_{q} by setting A(0)=0A(0)=0. For any characters AiA_{i}, Bi𝐅q×^B_{i}\in\widehat{\mathbf{F}_{q}^{\times}}, i=1,,ni=1,\cdots,n with B1=εB_{1}=\varepsilon in 𝐅q×^\widehat{\mathbf{F}_{q}^{\times}}, the n1n{}_{n}\mathbb{P}_{n-1}-function (cf. [36]) is defined as follows:

(5) nn1[A1A2AnB2Bn;λ;q]:=i=2n(AiBi(1))(1q1χ𝐅q×^(A1χχ)(A2χB2χ)(AnχBnχ)χ(λ)+δ(λ)i=2n(AiBi)),\,_{n}{\mathbb{P}}_{n-1}\left[\begin{matrix}A_{1}&A_{2}&\cdots&A_{n}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &B_{2}&\cdots&B_{n}\end{matrix}\;;\;\lambda;q\right]\\ :=\prod_{i=2}^{n}\left(-A_{i}B_{i}(-1)\right)\cdot\left(\frac{1}{q-1}\sum_{\chi\in\widehat{\mathbf{F}_{q}^{\times}}}\binom{A_{1}\chi}{\chi}\binom{A_{2}\chi}{B_{2}\chi}\cdots\binom{A_{n}\chi}{B_{n}\chi}\chi(\lambda)+\delta(\lambda)\prod_{i=2}^{n}\binom{A_{i}}{B_{i}}\right),

where

(AB):=B(1)J(A,B¯),J(A,B):=t𝐅qA(t)B(1t),andδ(λ):={1, if λ=0,0, otherwise. \displaystyle\binom{A}{B}:=-B(-1)J(A,\overline{B}),\quad J(A,B):=\sum_{t\in\mathbf{F}_{q}}A(t)B(1-t),\quad\mathrm{and}\quad\delta(\lambda):=\begin{cases}1,&\mbox{ if }\lambda=0,\\ 0,&\mbox{ otherwise. }\end{cases}

It will be useful for us to have some notation for characters of finite fields. First we recall the MMth power residue symbol. Let RM=𝐙[1/M,ζM]R_{M}=\mathbf{Z}[1/M,\zeta_{M}]. For every prime ideal 𝔭RM\mathfrak{p}\subset R_{M}, RM/𝔭R_{M}/\mathfrak{p} is a finite field with q=q(𝔭)=N𝔭q=q(\mathfrak{p})=N\mathfrak{p} elements. Recall that 𝐅q×\mathbf{F}_{q}^{\times} is a cyclic group with q1q-1 elements, and since ζM\zeta_{M} maps to an element of order MM modulo 𝔭\mathfrak{p}, we have q1q\equiv 1 mod MM. For any xRMx\in R_{M} prime to 𝔭\mathfrak{p}, there is a unique MMth root of unity with the property

x(N𝔭1)/M(x𝔭)Mx mod 𝔭,(x𝔭)MμM(𝐐¯).x^{(N\mathfrak{p}-1)/M}\equiv\left(\frac{x}{\mathfrak{p}}\right)_{M}x\text{\ mod\ }\mathfrak{p},\quad\left(\frac{x}{\mathfrak{p}}\right)_{M}\in\mu_{M}(\bar{\mathbf{Q}}).

The map x(x𝔭)Mx\mapsto\left(\frac{x}{\mathfrak{p}}\right)_{M} induces a character (RM/𝔭)×μM(𝐐¯)(R_{M}/\mathfrak{p})^{\times}\to\mu_{M}(\bar{\mathbf{Q}}). Moreover, every such character is of the form (𝔭)Mi\left(\frac{\cdot}{\mathfrak{p}}\right)^{i}_{M} for some i𝐙/Mi\in\mathbf{Z}/M. Given a rational number aa with denominator MM, we can identify it with an element of (1/M)𝐙/𝐙=𝐙/M(1/M)\mathbf{Z}/\mathbf{Z}\ =\mathbf{Z}/M. Then we define a character ι𝔭(a)()=(𝔭)Ma:(RM/𝔭)×μM(𝐐¯)\iota_{\mathfrak{p}}(a)(\cdot)=\left(\frac{\cdot}{\mathfrak{p}}\right)^{a}_{M}:(R_{M}/\mathfrak{p})^{\times}\to\mu_{M}(\bar{\mathbf{Q}}).

We choose an isomorphism of all roots of unity 𝐐(ζ)𝐐¯\mathbf{Q}(\zeta_{\infty})\subset\bar{\mathbf{Q}}_{\ell} with the roots of unity 𝐐(ζ)𝐂\mathbf{Q}(\zeta_{\infty})\subset\mathbf{C}.

Theorem 5.1 (Katz [52, 54]).

Let \ell be a prime. Given a primitive hypergeometric datum HD = {α,β}\{\alpha,\beta\} consisting of α={a1,,an}\alpha=\{a_{1},\cdots,a_{n}\}, β={1,b2,,bn}\beta=\{1,b_{2},\cdots,b_{n}\} with ai,bj𝐐×a_{i},b_{j}\in\mathbf{Q}^{\times} and M:=M(HD)=lcd(αβ)M:=M(HD)=\mathrm{lcd}(\alpha\cup\beta). There exists a constructible 𝐐¯\bar{\mathbf{Q}}_{\ell}-sheaf for the étale topology on 𝐆m,RM[1/]\mathbf{G}_{m,R_{M}}[1/\ell], denoted P(HD)\mathcal{H}^{P}(HD)_{\ell}, with the following properties:

  • 1.

    Let λ|𝐆m,RM[1/]|\lambda\in|\mathbf{G}_{m,R_{M}}[1/\ell]| be a closed point and λ¯\bar{\lambda} a geometric point localized in λ\lambda. The residue field k(λ)k(\lambda) is a finite extension of the field RM/𝔭:=k(𝔭)=𝐅q(𝔭)=𝐅qR_{M}/\mathfrak{p}:=k({\mathfrak{p}})=\mathbf{F}_{q(\mathfrak{p})}=\mathbf{F}_{q}. We let deg(λ)\mathrm{deg}(\lambda) be the degree of this extension. Then

    Tr(FrobλP(HD),λ¯)=nn1[ιλ(a1)ιλ(a2)ιλ(an)ιλ(b2)ιλ(bn); 1/λ;q(λ)]\mathrm{Tr}\left(\mathrm{Frob}_{\lambda}\mid\mathcal{H}^{P}(HD)_{\ell,\bar{\lambda}}\right)=\,_{n}{\mathbb{P}}_{n-1}\left[\begin{matrix}\iota_{\lambda}(a_{1})&\iota_{\lambda}(a_{2})&\cdots&\iota_{\lambda}(a_{n})\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &\iota_{\lambda}(b_{2})&\cdots&\iota_{\lambda}(b_{n})\end{matrix}\;;\;1/\lambda;q(\lambda)\right]

    where q(λ)=qdeg(λ)q(\lambda)=q^{\mathrm{deg}(\lambda)} and ιλ(c)\iota_{\lambda}(c) is the character

    xι𝔭(c)(Nk(λ)/k(𝔭)(x)),k(λ)×μM(𝐐¯)𝐂×.x\mapsto\iota_{\mathfrak{p}}(c)(N_{k(\lambda)/k(\mathfrak{p})}(x)),\quad k(\lambda)^{\times}\to\mu_{M}(\bar{\mathbf{Q}})\subset\mathbf{C}^{\times}.
  • 2.

    When λ1\lambda\neq 1, the stalk (P(HD))λ¯(\mathcal{H}^{P}(HD)_{\ell})_{\bar{\lambda}} has dimension nn and all roots of the characteristic polynomial of Frobλ\mathrm{Frob}_{\lambda} are algebraic numbers and have the same absolute value q(λ)(n1)/2q(\lambda)^{(n-1)/2} under all archimedean embeddings.

There is a variant of these character sums due to Beukers/Cohen/Mellit which has the important property that in many cases we can define these sheaves slightly more generally. For instance, suppose that the data HDHD is defined over 𝐐\mathbf{Q}, then there is a sheaf BCM(HD)\mathcal{H}^{BCM}(HD)_{\ell} on Spec(𝐙[1/M,λ,λ1]\mathrm{Spec}(\mathbf{Z}[1/M\ell,\lambda,\lambda^{-1}] such that

BCM(HD)𝐙[1/M]𝐙[1/M,ζM]=P(HD).\mathcal{H}^{BCM}(HD)_{\ell}\otimes_{\mathbf{Z}[1/M\ell]}\mathbf{Z}[1/M\ell,\zeta_{M}]=\mathcal{H}^{P}(HD)_{\ell}.

The Frobenius traces of BCM(HD)\mathcal{H}^{BCM}(HD)_{\ell} are 𝐐\mathbf{Q}-valued.

5.3. Transformations

There is a huge number of identities relating different hypergeometric functions. These have parallel versions over 𝐂\mathbf{C} and over finite fields. For instance:

The Clausen formula ([3] or [63, Eqn. (39)]):

(6) (1t)123F2[12ab+12ba+12a+b+1232ab;t]=2F1[aba+b+12;t]2F1[1a1b32ab;t]{(1-t)^{-\frac{1}{2}}}\,_{3}F_{2}\left[\begin{matrix}\frac{1}{2}&a-b+\frac{1}{2}&b-a+\frac{1}{2}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &a+b+\frac{1}{2}&\frac{3}{2}-a-b\end{matrix}\;;\;t\right]\\ =\,_{2}F_{1}\left[\begin{matrix}a&b\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &a+b+\frac{1}{2}\end{matrix}\;;\;t\right]\,_{2}F_{1}\left[\begin{matrix}1-a&1-b\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &\frac{3}{2}-a-b\end{matrix}\;;\;t\right]

when both sides are convergent. The finite field analog

Theorem 5.2 (Evans-Greene [30]).

For a fixed finite field 𝐅q\mathbf{F}_{q}, let ϕ\phi be the quadratic character. Assume η,K𝐅q×^\eta,K\in\widehat{\mathbf{F}_{q}^{\times}} such that none of η,Kϕ,ηK,ηK¯\eta,K\phi,\eta K,\eta\overline{K} is trivial. When t=1t=1, we have

23[ϕηη¯KK¯; 1]={0, if ηK is not a square in 𝐅q×^,J(ηK,η¯K)J(ϕ,K¯)(J(SK¯,ϕS¯)2+J(ϕSK¯,S¯)2), if ηK=S2 in 𝐅q×^.\,{}_{3}{\mathbb{P}}_{2}\left[\begin{matrix}\phi&\eta&\overline{\eta}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &K&\overline{K}\end{matrix}\;;\;1\right]=\begin{cases}0,&\mbox{ if $\eta K$ is not a square in $\widehat{\mathbf{F}_{q}^{\times}}$,}\\ \frac{J(\eta K,\overline{\eta}K)}{J(\phi,\overline{K})}\left(J(S\overline{K},\phi\overline{S})^{2}+J(\phi S\overline{K},\overline{S})^{2}\right),&\mbox{ if $\eta K=S^{2}$ in $\widehat{\mathbf{F}_{q}^{\times}}$}.\end{cases}

When t0,1t\neq 0,1, suppose ηK=S2\eta K=S^{2} for some S𝐅q×^S\in\widehat{\mathbf{F}_{q}^{\times}}, we have

ϕ(1t)32[ϕηη¯KK¯;t]\displaystyle\phi(1-t)\,_{3}{\mathbb{P}}_{2}\left[\begin{matrix}\phi&\eta&\overline{\eta}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &K&\overline{K}\end{matrix}\;;\;t\right] =21[ϕKS¯SK;t]21[ϕK¯SS¯K¯;t]q.\displaystyle=\,_{2}{\mathbb{P}}_{1}\left[\begin{matrix}\phi K\overline{S}&S\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &K\end{matrix}\;;\;t\right]\,_{2}{\mathbb{P}}_{1}\left[\begin{matrix}\phi\overline{K}S&\overline{S}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &\overline{K}\end{matrix}\;;\;t\right]-q.

For a finite field analog of the Whipple F67\phantom{}{}_{7}F_{6}-formula see [63]. For an application of a cubic transformation formula for an Appell-Lauricella hypergeometric function over a finite field, see [32].

6. Automorphic motives

There are motives Vn(Γ)V^{n}(\Gamma) on modular curves XΓX_{\Gamma} such that the cohomology H1(XΓ,Vn(Γ))H^{1}(X_{\Gamma},V^{n}(\Gamma)) is related to cusp forms on Γ\Gamma of weight n+2n+2. This is the content of Eichler-Shimura theory. There are two main aspects: the geometric description of modular forms, and the congruence formula relating the trace of Hecke operators to the trace of Frobenius.

6.1. Over 𝐂\mathbf{C}.

The main result here due to Eichler [27] and Shimura [82] is an isomorphism

Sn+2(Γ)Hpar1(Γ,V𝐑n)S_{n+2}(\Gamma)\overset{\sim}{\longrightarrow}H^{1}_{par}(\Gamma,V^{n}_{\mathbf{R}})

between the space of cusp forms of weight n+2n+2 for Γ\Gamma and the parabolic cohomology group, where V𝐑nV^{n}_{\mathbf{R}} is the nn-th symmetric power of the standard representation of SL2(𝐑)\mathrm{SL}_{2}(\mathbf{R}) on 𝐑2\mathbf{R}^{2}. Parabolic cohomology is treated in detail in Shimura’s book, [85].

Here is a brief outline: let (uv)𝐂2\begin{pmatrix}u\\ v\end{pmatrix}\in\mathbf{C}^{2}. For any n0n\geq 0, let (uv)n𝐂n+1\begin{pmatrix}u\\ v\end{pmatrix}^{n}\in\mathbf{C}^{n+1} be the vector whose components are un,un1v,,vnu^{n},u^{n-1}v,...,v^{n} (=1=1 if n=0n=0.) We define a representation ρn\rho_{n} of GL(2,𝐂)\mathrm{GL}(2,\mathbf{C}) of dimension n+1n+1 by the rule:

ρn(α)(uv)n:=(α(uv))n,αGL(2,𝐂).\rho_{n}(\alpha)\begin{pmatrix}u\\ v\end{pmatrix}^{n}:=\left(\alpha\begin{pmatrix}u\\ v\end{pmatrix}\right)^{n},\quad\alpha\in\mathrm{GL}(2,\mathbf{C}).

That is, ρn(α)=Symn(α)\rho_{n}(\alpha)=\mathrm{Sym}^{n}(\alpha). One checks, for αGL(2,𝐑)+\alpha\in\mathrm{GL}(2,\mathbf{R})^{+}:

(α(z)1)n=j(α,z)nρn(α)(z1)n,where z,α(z)=az+bcz+d,α=(abcd),j(α,z)=(cz+d).\begin{pmatrix}\alpha(z)\\ 1\end{pmatrix}^{n}=j(\alpha,z)^{-n}\rho_{n}(\alpha)\begin{pmatrix}z\\ 1\end{pmatrix}^{n},\quad\text{where \ }z\in\mathfrak{H},\alpha(z)=\frac{az+b}{cz+d},\alpha=\begin{pmatrix}a&b\\ c&d\end{pmatrix},j(\alpha,z)=(cz+d).

We have

j(αβ,z)=j(α,β(z))j(β,z).j(\alpha\beta,z)=j(\alpha,\beta(z))j(\beta,z).

Let ΓSL(2,𝐑)\Gamma\subset\mathrm{SL}(2,\mathbf{R}) be a Fuchsian subgroup of the first kind and n0n\geq 0 an integer. If 1Γ-1\in\Gamma we take nn to be even. If fSn+2f\in S_{n+2} is a cusp-form of weight n+2n+2 we define a vector-valued differential form

𝔡(f)=(f(z)zndzf(z)zn1dzf(z)dz).\mathfrak{d}(f)=\begin{pmatrix}f(z)z^{n}dz\\ f(z)z^{n-1}dz\\ \vdots\\ f(z)dz\end{pmatrix}.

We get

𝔡(f)α=ρn(α)𝔡(f),Re(𝔡(f))α=ρn(α)Re(𝔡(f)),αΓ,\mathfrak{d}(f)\circ\alpha=\rho_{n}(\alpha)\mathfrak{d}(f),\quad\mathrm{Re(}\mathfrak{d}(f))\circ\alpha=\rho_{n}(\alpha)\mathrm{Re}(\mathfrak{d}(f)),\quad\alpha\in\Gamma,

the second equality holding because the representation ρn\rho_{n} is real. Now fix any point z0z_{0}\in\mathfrak{H}^{*} and define for fSn+2(Γ)f\in S_{n+2}(\Gamma) the indefinite integral

F(z)=z0z𝔡(f)+v,z,v𝐂n+1,F(z)=\int_{z_{0}}^{z}\mathfrak{d}(f)+v,\quad z\in\mathfrak{H}^{*},v\in\mathbf{C}^{n+1},

for any path in \mathfrak{H} connecting z0z_{0} and zz. If they are both cusps, we can take a geodesic arc in \mathfrak{H} connecting them. Then

F(α(z))=ρn(α)F(z)+t(α)F(\alpha(z))=\rho_{n}(\alpha)F(z)+t(\alpha)

where αt(α)\alpha\mapsto t(\alpha) is a parabolic 1-cocyle of Γ\Gamma with values in the representation V𝐂nV^{n}_{\mathbf{C}}. This means

  • 1.

    t(αβ)=t(α)+ρn(α)t(β)t(\alpha\beta)=t(\alpha)+\rho_{n}(\alpha)t(\beta).

  • 2.

    For each parabolic element σΓ\sigma\in\Gamma there is a vector wV𝐂nw\in V^{n}_{\mathbf{C}} such that t(σ)=wρn(σ)wt(\sigma)=w-\rho_{n}(\sigma)w. The vector ww may depend on the parabolic element σ\sigma.

We define Hpar1(Γ,V𝐂n)H^{1}_{par}(\Gamma,V^{n}_{\mathbf{C}}) as the quotient of all parabolic 1-cocyles (property 1 and 2) modulo all coboundaries, that is, there is a ww such that t(α)=wρn(α)wt(\alpha)=w-\rho_{n}(\alpha)w for all αΓ\alpha\in\Gamma. Similarly, taking real parts defining

F𝐑(z)=z0zRe(𝔡(f))+v,z,v𝐑n+1,F_{\mathbf{R}}(z)=\int_{z_{0}}^{z}\mathrm{Re}(\mathfrak{d}(f))+v,\quad z\in\mathfrak{H}^{*},v\in\mathbf{R}^{n+1},

the corresponding cocycle u(α)u(\alpha) takes values in Hpar1(Γ,V𝐑n)H^{1}_{par}(\Gamma,V^{n}_{\mathbf{R}}) and depends only on ff and not on the auxiliary choices of z0z_{0} and vv. Shimura proved that fu(α)f\mapsto u(\alpha) defines an isomorphism of real vector spaces. Sn+2(Γ)Hpar1(Γ,V𝐑n)S_{n+2}(\Gamma)\cong H^{1}_{par}(\Gamma,V^{n}_{\mathbf{R}}). This implies that dimHpar1(Γ,V𝐂n)=2dimSn+2(Γ)\dim H^{1}_{par}(\Gamma,V^{n}_{\mathbf{C}})=2\dim S_{n+2}(\Gamma). When Γ\Gamma is an arithmetically defined group, there are Hecke operators defined and the Eichler-Shimura isomorphism respects this action. Moreover in that case there is a lattice V𝐙nV𝐑nV_{\mathbf{Z}}^{n}\subset V_{\mathbf{R}}^{n} stabilized by Γ\Gamma and we have parabolic cohomology groups Hpar1(Γ,V𝐙n)H^{1}_{par}(\Gamma,V_{\mathbf{Z}}^{n}). One can define a complex structure and a polarization such that the torus Hpar1(Γ,V𝐑n)/Hpar1(Γ,V𝐙n)H^{1}_{par}(\Gamma,V_{\mathbf{R}}^{n})/H^{1}_{par}(\Gamma,V_{\mathbf{Z}}^{n}) has the structure of an abelian variety A(Γ,n)A(\Gamma,n). Finally, if fSn+2(Γ)f\in S_{n+2}(\Gamma) is a Hecke eigenform the periods of ff are related to the values of the LL-function L(f,s)L(f,s) at integer points in the critical strip for this Dirichlet series. See Manin’s papers, [69], [70] for the connection to modular symbols, pp-adic Hecke series and more. See Greenberg and Voight [40] for algorithms for computing Hecke eigenvalues in parabolic cohomology. See Stiller’s monograph [88] for more information on the connection to differential equations and special values of LL-functions. See the papers of Zagier [93] and Kontsevich/Zagier [56] for more information on differential equations and periods.

Here are examples from [93, 63]:

F34[12121212111; 1]=16π2L(η(2τ)4η(4τ)4,2),\,{}_{4}F_{3}\left[\begin{matrix}\frac{1}{2}&\frac{1}{2}&\frac{1}{2}&\frac{1}{2}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &1&1&1\end{matrix}\;;\;1\right]=\frac{16}{\pi^{2}}L(\eta(2\tau)^{4}\eta(4\tau)^{4},2),
F56[12121212121211111; 1]=16|t2(τ)|=1τ2f8.6.a.a(τ2)𝑑τ,\,{}_{6}F_{5}\left[\begin{matrix}\frac{1}{2}&\frac{1}{2}&\frac{1}{2}&\frac{1}{2}&\frac{1}{2}&\frac{1}{2}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &1&1&1&1&1\end{matrix}\;;\;1\right]=16\oint_{|t_{2}(\tau)|=1}\tau^{2}f_{8.6.a.a}\left(\frac{\tau}{2}\right)d\tau,

where t2(τ)=64η(2τ)24η(τ)24t_{2}(\tau)=-64\frac{\eta(2\tau)^{24}}{\eta(\tau)^{24}} is a Hauptmodul for Γ0(2)\Gamma_{0}(2), and f8.6.a.af_{8.6.a.a} is a normalized cuspidal newform expressed in LMFDB label.

Deligne [17] reinterpreted this in the following manner. Assume that Γ\Gamma has no torsion. First, we have an isomorphism

Sk+2(Γ)H0(XΓ,ωkΩXΓ,1)S_{k+2}(\Gamma)\cong H^{0}(X_{\Gamma},\omega^{k}\otimes\Omega^{1}_{X_{\Gamma},})

for certain coherent sheaves ωk\omega^{k} on the compact Riemann surface XΓanX^{an}_{\Gamma}. These sheaves have the property that

ω2ΩXΓ1(logSΓ),SΓ=the set of cusps of XΓ.\omega^{2}\cong\Omega^{1}_{X_{\Gamma}}(\mathrm{log}S_{\Gamma}),\quad S_{\Gamma}=\text{the set of cusps of\ }X_{\Gamma}.

In fact, over the complement of the cusps, the sheaf ωk\omega^{k} is the holomorphic line bundle attached to the cocycle j(α,z)kj(\alpha,z)^{k}. There are constructible sheaves of 𝐂\mathbf{C}-vector spaces Vk(Γ)𝐂V^{k}(\Gamma)_{\mathbf{C}} on XΓX_{\Gamma} for integers k1k\geq 1 with the following properties:

  • 1.

    Over the open subset XΓX_{\Gamma}^{\circ} these are local systems of rank k+1k+1. They are thus solution sheaves to a system of regular singular differential equations on XΓX_{\Gamma}^{\circ}. For Γ\Gamma considered in this paper, these will be hypergeometric local systems when k=1k=1 or 2.

  • 2.

    The sheaves Vk(Γ)𝐂V^{k}(\Gamma)_{\mathbf{C}} on XΓX_{\Gamma} are extensions of the sheaves on XΓX_{\Gamma}^{\circ} defined above:

    Vk(Γ)𝐂=ι(Vk(Γ)𝐂XΓ)for the inclusion mapι:XΓXΓ.V^{k}(\Gamma)_{\mathbf{C}}=\iota_{*}(V^{k}(\Gamma)_{\mathbf{C}}\mid X_{\Gamma}^{\circ})\quad\text{for~{}the~{}inclusion~{}map}~{}\iota:X_{\Gamma}^{\circ}\to X_{\Gamma}.
  • 3.

    We have:

    Hpar1(Γ,Symk(𝐂2))H1(XΓ,Vk(Γ)𝐂)Sk+2(Γ)Sk+2(Γ)¯,H^{1}_{par}(\Gamma,\mathrm{Sym}^{k}(\mathbf{C}^{2}))\cong H^{1}(X_{\Gamma},V^{k}(\Gamma)_{\mathbf{C}})\cong S_{k+2}(\Gamma)\oplus\overline{S_{k+2}(\Gamma)},

    which is a Hodge decomposition of type (k+1,0),(0,k+1)(k+1,0),(0,k+1). The left-hand side is parabolic cohomology. Moreover

    H1(XΓ,Vk(Γ)𝐂)Im(Hc1(XΓ,Vk(Γ)𝐂)H1(XΓ,Vk(Γ)𝐂))H^{1}(X_{\Gamma},V^{k}(\Gamma)_{\mathbf{C}})\cong\mathrm{Im}(H^{1}_{c}(X_{\Gamma}^{\circ},V^{k}(\Gamma)_{\mathbf{C}})\to H^{1}(X_{\Gamma}^{\circ},V^{k}(\Gamma)_{\mathbf{C}}))

    the image of the compactly supported cohomology. The Hecke operators act on the spaces as geometric correspondences and these isomorphisms are equivariant for the Hecke actions.

When ΓSL2(𝐑)\Gamma\subset\mathrm{SL}_{2}(\mathbf{R}) is an arbitrary Fuchsian subgroup of the first kind, proofs of these theorems can be found in [10]. When Γ\Gamma is torsion-free, these are defined as follows. The group Γ\Gamma acts on ×Symk(𝐂2)\mathfrak{H}^{*}\times\mathrm{Sym}^{k}(\mathbf{C}^{2}), on the first factor by linear fractional transformations, on the second factor via ϱk:=Symk(ϱ1)\varrho_{k}:=\mathrm{Sym}^{k}(\varrho_{1}), where ϱ1\varrho_{1} arises from the inclusion ΓSL2(𝐑)\Gamma\subset\mathrm{SL}_{2}(\mathbf{R}) and the canonical action of SL2(𝐑)\mathrm{SL}_{2}(\mathbf{R}) on 𝐂2\mathbf{C}^{2}. Then

(7) Vk(Γ)𝐂:=pΓ(Symk(𝐂2)¯)=Γ\×Symk(𝐂2).V^{k}(\Gamma)_{\mathbf{C}}:=p_{*}^{\Gamma}(\underline{\mathrm{Sym}^{k}(\mathbf{C}^{2})})=\Gamma\backslash\mathfrak{H}^{*}\times\mathrm{Sym}^{k}(\mathbf{C}^{2}).

When Γ\Gamma has torsion, one takes a normal subgroup ΓΓ\Gamma^{\prime}\subset\Gamma of finite index and defines

Vk(Γ):=pΓ/Γ(Vk(Γ)).V^{k}(\Gamma):=p_{*}^{\Gamma/\Gamma^{\prime}}(V^{k}(\Gamma^{\prime})).

This works with certain restrictions, namely if 1Γ-1\in\Gamma this becomes identically 0 when kk is odd, so one must restrict to even kk (this is no limitation anyway since there are no cusp forms of odd weight if 1Γ-1\in\Gamma).

6.2. \ell-adic.

Original papers: [26], [81], [83]. It is preferable to replace the group SL2\mathrm{SL}_{2} by GL2\mathrm{GL}_{2} and work with the adeles. Let GG be the algebraic group GL2\textsf{GL}_{2} or B×\textsf{B}^{\times} according as Γ\Gamma non-cocompact or cocompact arising from an indefinite quaternion division algebra BB defined over 𝐐\mathbf{Q}. In both cases, G(𝐑)=G(\mathbf{R})= GL(𝐑)2{}_{2}(\mathbf{R}) which has two connected components G(𝐑)±G(\mathbf{R})_{\pm} consisting of elements with positive and negative determinants, respectively. Moreover, G(𝐑)+G(\mathbf{R})_{+} acts transitively on the upper half plane \mathfrak{H} via fractional linear transformations so that we may identify \mathfrak{H} with G(𝐑)+/SO2Z(𝐑)+G(\mathbf{R})_{+}/\rm{SO}_{2}Z(\mathbf{R})_{+}, where ZZ denotes the center of GG. Thus X:=G(𝐑)/SO2Z(𝐑)X:=G(\mathbf{R})/\rm{SO}_{2}Z(\mathbf{R}) may be identified with ±\mathfrak{H}^{\pm}, the disjoint union of upper and lower half plane. The pair (G,X)(G,X) satisfies the axioms of Deligne for Shimura varieties, [19]. Let G(𝐐)+=G(𝐐)G(𝐑)+G(\mathbf{Q})_{+}=G(\mathbf{Q})\cap G(\mathbf{R})_{+}. Write 𝐀\mathbf{A} for the ring of adeles over 𝐐\mathbf{Q}, and 𝐀f\mathbf{A}^{f} the subring of finite adeles. Let UU be a compact-open subgroup of G(𝐀f)G(\mathbf{A}_{f}). Define

YUan:=UM𝐂(G,X)=G(𝐐)+\(X+×G(𝐀f))/U.Y_{U}^{an}:=\phantom{}_{U}M_{\mathbf{C}}(G,X)=G(\mathbf{Q})_{+}\backslash(X^{+}\times G(\mathbf{A}^{f}))/U.

This is a finite disjoint union, indexed by SU=G(𝐐)+\G(𝐀f)/US_{U}=G(\mathbf{Q})_{+}\backslash G(\mathbf{A}^{f})/U, of quotients Γg\X+\Gamma_{g}\backslash X^{+}, for subgroups ΓgGad(𝐑)+\Gamma_{g}\subset G^{\mathrm{ad}}(\mathbf{R})^{+} which are the images of Γg=gUg1G(𝐐)+\Gamma^{\prime}_{g}=gUg^{-1}\cap G(\mathbf{Q})_{+} under G(𝐐)+G(𝐑)+Gad(𝐑)+G(\mathbf{Q})_{+}\to G(\mathbf{R})_{+}\to G^{\mathrm{ad}}(\mathbf{R})^{+}, where Gad=G/ZG^{\mathrm{ad}}=G/Z. Shimura’s theory of canonical models gives a curve YUY_{U} defined over 𝐐\mathbf{Q} whose complex points are these. This curve is irreducible, but not absolutely irreducible. The irreducible components of YU𝐐𝐐¯Y_{U}\otimes_{\mathbf{Q}}\bar{\mathbf{Q}} are indexed by the same finite set as above, and the corresponding analytic spaces are the Γg\X+\Gamma_{g}\backslash X^{+}. These components are defined over specific abelian number fields kΓgk_{\Gamma_{g}}.

We also have the spaces of cusp forms Sk(U)S_{k}(U). We can describe this as

gSUSk(Γg\X+),\bigoplus_{g\in S_{U}}S_{k}(\Gamma_{g}\backslash X^{+}),

but more intrinsically as the set of functions ϕ:G(𝐀)𝐂\phi:G(\mathbf{A})\to\mathbf{C} such that

  • 1.

    For all γG(𝐐)\gamma\in G(\mathbf{Q}), and uUu\in U

    ϕ(γgu)=ϕ(g).\phi(\gamma gu)=\phi(g).
  • 2.

    For any gfG(𝐀f)g_{f}\in G(\mathbf{A}^{f}), the function gϕ(ggf)g_{\infty}\mapsto\phi(g_{\infty}g_{f}) is invariant under Z(𝐑)G(𝐑)+Z(\mathbf{R})\cap G(\mathbf{R})_{+}, smooth on G(𝐑)+G(\mathbf{R})_{+} and satisfies

    ϕ(gkθ)=ϕ(g)(eiθ)k,kθ=(cos(θ)sin(θ)sin(θ)cos(θ))\phi(gk_{\theta})=\phi(g)(e^{i\theta})^{k},\quad k_{\theta}=\begin{pmatrix}\cos(\theta)&\sin(\theta)\\ -\sin(\theta)&\cos(\theta)\end{pmatrix}

    and

    Xϕ=0,X_{-}\phi=0,

    where

    X±=12(1±i±i1)𝔤𝐂=Lie(G(𝐂)).X_{\pm}=\frac{1}{2}\begin{pmatrix}1&\pm i\\ \pm i&-1\end{pmatrix}\in\mathfrak{g}_{\mathbf{C}}=\mathrm{Lie}(G(\mathbf{C})).
  • 3.

    If G=𝖦𝖫2G={\sf GL}_{2}, then for all gG(𝐀)g\in G(\mathbf{A}),

    𝐐\𝐀ϕ((1x01)g)𝑑x=0.\int_{\mathbf{Q}\backslash\mathbf{A}}\phi\left(\begin{pmatrix}1&x\\ 0&1\end{pmatrix}g\right)dx=0.

The relation between this adelic viewpoint and classical cusp forms is explained in several places, see [47]. The above ϕ\phi corresponds to functions f(τ)f(\tau) which are modular with respect to subgroups Γg\Gamma_{g} as above. The second condition expresses the holomorphy of f(τ)f(\tau), and that it has weight kk. The third condition expresses the vanishing of the zeroth Fourier coefficients at the cusps and is only meaningful for us in the case G=GL2G=\textsf{GL}_{2}.

The main theorem is

Theorem 6.1.

Let UU be an open compact subgroup as above, stable under the canonical involution. Let d=dim𝐂Sk(U)d=\dim_{\mathbf{C}}S_{k}(U). For each rational prime \ell there exists an \ell-adic representation

ψU,k:Gal(𝐐¯/𝐐)GL(2d,𝐐)\psi_{U,k}:\mathrm{Gal}(\bar{\mathbf{Q}}/\mathbf{Q})\longrightarrow\mathrm{GL}(2d,\mathbf{Q}_{\ell})

which is unramified outside a finite set SS of prime numbers. For pSp\notin S we have

det(1ψU,k(Frobp)x)=det(1T(p)x+pT(p,p)x2Sk(U))\det\left(1-\psi_{U,k}(\mathrm{Frob}_{p})x\right)=\det\left(1-T(p)x+pT(p,p)x^{2}\mid S_{k}(U)\right)

where T(p)T(p) and T(p,p)T(p,p) are the standard Hecke operators.

For G=𝖦𝖫2G={\sf GL}_{2} this is due to Deligne; for G=𝖡G={\sf B} this is main result of Ohta, stated only for quaternion algebras over 𝐐\mathbf{Q}. For even weights 2k2k and for UU which are sufficiently small, the \ell-adic representation is on the 𝐐\mathbf{Q}_{\ell}-vector space

Het1(XU𝐐𝐐¯,V2k(U))H^{1}_{et}(X_{U}\otimes_{\mathbf{Q}}\bar{\mathbf{Q}},V^{2k}(U)_{\ell})

where V2k(U)V^{2k}(U)_{\ell} are \ell-adic analogs of the local systems denoted V2k(Γ)𝐂V^{2k}(\Gamma)_{\mathbf{C}} in the previous section. They have similar properties: they are constructible 𝐐\mathbf{Q}_{\ell}- sheaves, lisse of rank 2k+12k+1 on the complement of cusps and elliptic points. The situation for odd weights kk is more complicated in general. In the quaternion case, there are sheaves Vk(U)V^{k}(U)_{\ell} but their rank is twice the corresponding Vk(Γ)𝐂V^{k}(\Gamma)_{\mathbf{C}}. Moreover, one must exclude cases where 1U-1\in U.

The curves and sheaves all have good reduction modulo pp for pSp\notin S. Using the same letters to denote them we get as a corollary that

(8) Tr(FrobpHet1(XU𝐅p𝐅¯p,V2k(U)))=Tr(T(p)S2k(U)).\mathrm{Tr}\,(\mathrm{Frob}_{p}\mid H^{1}_{et}(X_{U}\otimes_{\mathbf{F}_{p}}\bar{\mathbf{F}}_{p},V^{2k}(U)_{\ell}))=\mathrm{Tr}\,(T(p)\mid S_{2k}(U)).

Since Heti(XU𝐅p𝐅¯p,V2k(U))=0H^{i}_{et}(X_{U}\otimes_{\mathbf{F}_{p}}\bar{\mathbf{F}}_{p},V^{2k}(U)_{\ell})=0 for all i1i\neq 1, we can use the Grothendieck-Lefschetz trace formula to calculate the left-hand side. This will be explained in section 8.

7. Shimura curves.

7.1. Shimura curves in general

Let FF be a totally real number field of degree gg over 𝐐\mathbf{Q}. Let BB be an indefinite quaternion algebra over FF such that B𝐐𝐑M2(𝐑)×g1B\otimes_{\mathbf{Q}}\mathbf{R}\cong M_{2}(\mathbf{R})\times\mathbb{H}^{g-1} where \mathbb{H} is Hamilton’s quaternions. We let OBO_{B} be a maximal order (unique up to conjugation) and OB1O_{B}^{1} the set of elements of reduced norm 1. If Γ\Gamma is commensurable with OB1O_{B}^{1} then we have a compact Riemann surface arising from Γ\Gamma, say XΓX_{\Gamma}. Shimura proved that this has a canonical model over a number field kΓk_{\Gamma}. Unless g=1g=1 however, this has no simple moduli interpretation. Nonetheless this belongs to the general theory of Shimura varieties (but not of PEL type). When F=𝐐F=\mathbf{Q} Kuga and Shimura studied these and in particular the zeta functions of the fiber spaces of abelian varieties over these curves. See [57], [58], [84]. This was extended by Ohta in a series of papers, [72], [74], [73]. Langlands also established these results by his methods. See [59], [60]. These methods are now part of the standard toolbox in the arithmetic of Shimura varieties. In our paper we follow Kuga, Shimura and Ohta.

In Takeuchi’s list of arithmetic triangle groups is the famous (2,3,7)(2,3,7)-tesselation. As Fricke discovered, this is related to a quaternion algebra over the cubic field F=𝐐(ζ7+ζ¯7)F=\mathbf{Q}(\zeta_{7}+\bar{\zeta}_{7}). He also noted that the corresponding curve is the Klein quartic

x3y+y3z+z3x=0.x^{3}y+y^{3}z+z^{3}x=0.

For a modern exposition, see Elkies’ article, [29].

7.2. Shimura curves over 𝐐\mathbf{Q}.

References for this section: [28], [48], [77], [91]. Let BB be an indefinite quaternion division algebra over 𝐐\mathbf{Q} of discriminant DD. It is known that BB ramifies at an even number primes Σ\Sigma. We let OB=OO_{B}=O be a maximal order in BB (unique up to conjugation). Define

Γ=ΓB:=O1/±1,Γ=ΓB:={xB/𝐐xO=Ox,nr(x)>0}.\Gamma=\Gamma_{B}:=O^{1}/{\pm 1},\quad\Gamma^{*}=\Gamma^{*}_{B}:=\left\{x\in B^{*}/\mathbf{Q}^{*}\mid xO=Ox,\mathrm{nr}(x)>0\right\}.

It is known that Γ\Gamma is a normal subgroup of Γ\Gamma^{*} with quotient an elementary abelian 2-group with #Σ\#\Sigma generators. It is also known that the elements of Γ\Gamma^{*} are the classes mod 𝐐\mathbf{Q}^{*} of elements of O{O} with reduced norm pTp\prod_{p\in T}p for some possibly empty subset TΣT\subset\Sigma. For any subset TΣT\subset\Sigma we let ΓΓTΓ\Gamma\subset\Gamma_{T}\subset\Gamma^{*} be the subgroup corresponding to TT.

Fix an isomorphism θ:B𝐐𝐑M2(𝐑)\theta:B\otimes_{\mathbf{Q}}\mathbf{R}\cong M_{2}(\mathbf{R}). Then the groups Γ,Γ\Gamma,\Gamma^{*} become isomorphic to subgroups of PSL2(𝐑)=Aut()\mathrm{PSL}_{2}(\mathbf{R})=\mathrm{Aut}(\mathfrak{H}). For any subset TΣT\subset\Sigma we have quotient Riemann surfaces XΓ,T=XB,T:=ΓT\X_{\Gamma,T}=X_{B,T}:=\Gamma_{T}\backslash\mathfrak{H}. These are compact (no cusps), but there are elliptic points. Shimura’s theory of canonical models shows that these are the sets of complex points of an algebraic curve, denoted by the same symbol if no confusion is possible, defined over a specific number field kB,Tk_{B,T}. These have interpretations as coarse moduli spaces of polarized abelian surfaces, as we briefly recall.

To define a polarization we choose an element μO\mu\in O with μ2=D\mu^{2}=-D. This defines an anti-involution aaa\mapsto a^{\prime} of BB by the rule a=μ1aμa^{\prime}=\mu^{-1}a^{*}\mu where aa^{*} is the canonical involution: nr(a)=aa,tr(a)=a+a\mathrm{nr}(a)=aa^{*},\mathrm{tr}(a)=a+a^{*}. This is a positive involution in that the quadratic form atr(aa)a\mapsto\mathrm{tr}(a^{\prime}a) is positive definite. We describe the moduli space in the case T=T=\emptyset, i.e., for Γ\Gamma. Namely XB=XΓX_{B}=X_{\Gamma} is the coarse moduli space of triples (A,ρ,ι)(A,\rho,\iota) where AA is an abelian surface, ρ\rho is a principal polarization and ι:OEnd(A)\iota:O\to\mathrm{End}(A) is an embedding such that the Rosati involution defined by ρ\rho on ι(O)\iota(O) is the involution defined by μ\mu. Concretely, for each zz\in\mathfrak{H} we can define a triple (Az,ρz,ιz)(A_{z},\rho_{z},\iota_{z}) as follows.

Az=𝐂2/Λz,Λz:=θ(O)vz,vz=(z1),θ(O)M2(𝐑).A_{z}=\mathbf{C}^{2}/\Lambda_{z},\quad\Lambda_{z}:=\theta(O)v_{z},\quad v_{z}=\begin{pmatrix}z\\ 1\end{pmatrix},\quad\theta(O)\subset M_{2}(\mathbf{R}).

Then Ez:Λz×Λz𝐙E_{z}:\Lambda_{z}\times\Lambda_{z}\to\mathbf{Z} given by

Ez(θ(λ1)vz,θ(λ2)vz)=tr(λ2μλ1)E_{z}(\theta(\lambda_{1})v_{z},\theta(\lambda_{2})v_{z})=\mathrm{tr}(\lambda_{2}^{*}\mu\lambda_{1})

is a Riemann form which defines a principal polarization ρz\rho_{z} on AzA_{z}. Also, each element aOa\in O gives a map θ(a)\theta(a) which maps 𝐂2𝐂2\mathbf{C}^{2}\to\mathbf{C}^{2} and sends Λz\Lambda_{z} to itself, thereby inducing an endomorphism ι(a)\iota(a) of AzA_{z}. See [14]. We can enhance this structure by including a rigidification of the points of order NN for some integer N1N\geq 1, namely an isomorphism

α:A[N]=Ker(N:AA)(𝐙/N)4\alpha:A[N]=\mathrm{Ker}(N:A\to A)\to(\mathbf{Z}/N)^{4}

which carries Weil pairing to the standard symplectic pairing on (𝐙/N)4(\mathbf{Z}/N)^{4}. The resulting moduli problem for (A,ρ,ι,α)(A,\rho,\iota,\alpha) is now representable if N3N\geq 3, so we obtain fine moduli schemes XB(N)X_{B}(N). Complex-analytically this is Γ(N)\\Gamma(N)\backslash\mathfrak{H} where Γ(N)={γΓγ1 mod N}\Gamma(N)=\{\gamma\in\Gamma\mid\gamma\equiv 1\text{\ mod\ \ }N\}.

For the case of a general TT, there are coarse moduli spaces XΓ,TX_{\Gamma,T} that represent equivalence classes of (A,ρ,ι)(A,\rho,\iota) for an equivalence relation arising from stable quadratic twisting rings RTOR_{T}\subset O. We refer to [77] for details. For each prime pp dividing the discriminant DD, there are involutions wpw_{p} (Atkin-Lehner involutions) of the curve XBX_{B}. In fact ΓT=Γ,wp,pT\Gamma_{T}=\langle\Gamma,w_{p},p\in T\rangle and viewing the wpw_{p} as transformations of the moduli problem,

wp(A,ρ,ι)=(A/Ker(Ip),ρ,ι)w_{p}(A,\rho,\iota)=(A/\mathrm{Ker}(I_{p}),\rho^{\prime},\iota^{\prime})

where IpOI_{p}\subset O is the set of elements whose norm is divisible by pp. This is a 2-sided ideal with O/Ip𝐅p2O/I_{p}\cong\mathbf{F}_{p^{2}} and Ip2=pOI_{p}^{2}=pO. Since Ker(Ip)\mathrm{Ker}(I_{p}) is isotropic under the Weil pairing, A/Ker(Ip)A/\mathrm{Ker}(I_{p}) inherits a principal polarization ρ\rho^{\prime}, and IpI_{p} being a 2-sided ideal, it also inherits an action ι\iota^{\prime} of OO. In other words, XΓ,TX_{\Gamma,T} parametrizes (A,ρ,ι)(A,\rho,\iota) up to identification of AA with A/Ker(I)A/\mathrm{Ker}(I) where I=pTIpI=\cap_{p\in T}I_{p} for TΣT\subset\Sigma.

Since these curves parametrize families of principally polarized abelian varieties of dimension 2, there are modular embeddings

f:2,2=Siegel space of genus 2.\displaystyle f:\mathfrak{H}\to\mathfrak{H}_{2},\quad\mathfrak{H}_{2}=\text{Siegel space of genus \ }2.
φ:O1Sp4(𝐙),f(γz)=φ(γ)(f(z))\displaystyle\varphi:O^{1}\to\mathrm{Sp}_{4}(\mathbf{Z}),\quad f(\gamma z)=\varphi(\gamma)(f(z))

inducing maps XΓ,T𝔄2X_{\Gamma,T}\to\mathfrak{A}_{2} to the moduli space of principally-polarized abelian varieties of dimension 2 (see [42]). Because the Torelli map from the moduli space of genus 2 curves 𝔐2𝔄2\mathfrak{M}_{2}\to\mathfrak{A}_{2} is a birational injection, we get algebraic coordinates on 𝔄2\mathfrak{A}_{2} from Igusa-Clebsch invariants, [45]. The images of Shimura curves can then be described in terms of Igusa-Clebsch invariants. See [64] where Borcherds forms are used to calculate these.

7.3. D=6.D=6. The curves and modular forms

References for this section: [9], [11]. There is a unique quaternion algebra 𝖡𝟨\sf{B}_{6} of discriminant D=6D=6 defined over 𝐐\mathbf{Q}. It has generators 11, II, JJ, KK with I2=3I^{2}=3, J2=1J^{2}=-1, K=IJ=JIK=IJ=-JI. We fix the embedding Φ:𝖡𝟨𝖬𝟤(𝐑)\Phi:\sf{B}_{6}\to M_{2}(\mathbf{R})

γ=x+yI+zJ+tK[x+y3z+t3(zt3)xy3].\gamma=x+yI+zJ+tK\mapsto\begin{bmatrix}x+y\sqrt{3}&z+t\sqrt{3}\\ -(z-t\sqrt{3})&x-y\sqrt{3}\end{bmatrix}.

All maximal orders are conjugate and we fix the representative

O6=𝐙[1,I,J,(1+I+J+K)/2].O_{6}=\mathbf{Z}[1,I,J,(1+I+J+K)/2].

The group O61O_{6}^{1} can be identified with its image Γ6SL2(𝐑)\Gamma_{6}\subset\mathrm{SL}_{2}(\mathbf{R}) under Φ\Phi. Then the image of this in PSL2(𝐑)\mathrm{PSL}_{2}(\mathbf{R}) is Γ¯6=Γ6/{±1}\bar{\Gamma}_{6}=\Gamma_{6}/\{\pm 1\} and is a Fuchsian group without parabolic elements. We let X6an=Γ¯6\X^{an}_{6}=\bar{\Gamma}_{6}\backslash\mathfrak{H}, X6an=X6(𝐂)X^{an}_{6}=X_{6}(\mathbf{C}) for a smooth projective curve defined over 𝐐\mathbf{Q}, Shimura’s canonical model. This is a compact Riemann surface of genus 0. In fact, X6X_{6} is the projective conic x2+3y2+z2=0x^{2}+3y^{2}+z^{2}=0.

The curve X6X_{6} sits in a Galois (𝐙/2)2(\mathbf{Z}/2)^{2} tower of curves, defined by Atkin-Lehner quotients with X6(d):=X6/wdX^{(d)}_{6}:=X_{6}/\langle w_{d}\rangle and X6(+):=X6/w2,w3X^{(+)}_{6}:=X_{6}/\langle w_{2},w_{3}\rangle. All these curves have genus zero and the corresponding groups have fundamental half-domains in the upper half-plane (or unit disk) which are hyperbolic polygons with either 3 or 4 sides. The angles at the vertices are (π/a,π/b,π/c,(π/d))(\pi/a,\pi/b,\pi/c,(\pi/d)) where (a,b,c,(d))(a,b,c,(d)) are as follows:

{diagram}{diagram}\begin{diagram}\qquad\qquad\begin{diagram}
Refer to caption
Figure 5. Fundamental domain for Γ6\Gamma_{6}. Image from Bayer-Travesa, [11]. Atkin-Lehner quotients appear as dotted subregions. Special CM points and gluing of boundaries shown.

For the triangle groups listed above, we give a corresponding hypergeometric function. By suitable choices of two solutions to the corresponding differential equations (HDE), the ratios give the Schwarz conformal maps from the upper half-plane to hyperbolic triangles:

curve angles HDE
X6(2)X_{6}^{(2)} ( 3, 4, 4) F12[1121323;t]\,{}_{2}F_{1}\left[\begin{matrix}\frac{1}{12}&\frac{1}{3}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &\frac{2}{3}\end{matrix}\;;\;t\right]
X6(3)X_{6}^{(3)} (2, 6, 6) F12[1121456;t]\,{}_{2}F_{1}\left[\begin{matrix}\frac{1}{12}&\frac{1}{4}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &\frac{5}{6}\end{matrix}\;;\;t\right]
X6(+)X_{6}^{(+)} (2, 4, 6) F12[12472456;t]\,{}_{2}F_{1}\left[\begin{matrix}\frac{1}{24}&\frac{7}{24}\vskip 3.0pt plus 1.0pt minus 1.0pt\\ &\frac{5}{6}\end{matrix}\;;\;t\right]
Refer to caption
Figure 6. 334 hyperbolic tessellation. Image thanks to Helena Verrill.

Let Γ=O61\Gamma=O_{6}^{1}. The graded ring of modular forms is

k=0S2k(Γ)=𝐂[h4(z),h6(z),h12(z)]𝐂[h4,h6,h12]/(h122+3h64+h46)\bigoplus_{k=0}^{\infty}S_{2k}(\Gamma)=\mathbf{C}[h_{4}(z),h_{6}(z),h_{12}(z)]\cong\mathbf{C}[h_{4},h_{6},h_{12}]/(h_{12}^{2}+3h_{6}^{4}+h_{4}^{6})

where zz is the coordinate in \mathfrak{H}, the subscript indicates the weight, and h4(z)h_{4}(z), h6(z)h_{6}(z) are algebraically independent. The Hauptmoduln on various Atkin-Lehner quotients are expressible in terms of these modular functions. For instance the (2,4,6)(2,4,6) curve we have denoted by X6+X_{6}^{+} is canonically isomorphic with 𝐐1\mathbb{P}^{1}_{\mathbf{Q}} with a coordinate denoted j6j_{6}.

j6=16h649h46.j_{6}=\frac{16h_{6}^{4}}{9h_{4}^{6}}.

In the canonical model of the curve X6X_{6} as the projective plane conic x2+3y2+z2=0x^{2}+3y^{2}+z^{2}=0 the Atkin-Lehner involutions are given on p=(x,y,z)p=(x,y,z) by w2(p)=(x,y,z)w_{2}(p)=(x,-y,z), w3(p)=(x,y,z)w_{3}(p)=(-x,y,z), w6=w2w3w_{6}=w_{2}w_{3}. The curves X6(2)X_{6}^{(2)}, X6(3)X_{6}^{(3)}, X6(6)X_{6}^{(6)}, X6+X_{6}^{+} are all isomorphic with 𝐐1\mathbb{P}^{1}_{\mathbf{Q}}, and the projections π2\pi_{2}, π3\pi_{3}, π6\pi_{6}, π+\pi_{+} are given by

π2(p)=(x,z),π2(p)=(y,z),π6(p)=(x,y),π+(p)=(x2,y2)\pi_{2}(p)=(x,z),\quad\pi_{2}(p)=(y,z),\quad\pi_{6}(p)=(x,y),\quad\pi_{+}(p)=(x^{2},y^{2})

where (y/x)2=9j6/16=h64/h46(y/x)^{2}=9j_{6}/16=h_{6}^{4}/h_{4}^{6}. See [9] for detailed discussion.

7.4. D=6.D=6. Models of the abelian varieties

If ΓΓ6\Gamma\subset\Gamma_{6} is a congruence subgroup with no torsion, then there is a universal family of abelian varieties f:AΓXΓf:A_{\Gamma}\to X_{\Gamma}. In fact, the groups in our paper all have torsion so that we do not have universal families. Nonetheless we do have families of abelian varieties with quaternion structure that are “sufficiently close” to universal, and these are utilized for explicit calculations. This is analogous to the situation for SL2(𝐙)\mathrm{SL}_{2}(\mathbf{Z}), where no universal family of elliptic curves exists, but there are families of elliptic curves whose jj-invariant = jj, the coordinate on the modular curve X(SL2(𝐙))=j1X(\mathrm{SL}_{2}(\mathbf{Z}))=\mathbb{P}^{1}_{j}:

(9) j:y2+xy=x336x+1j1728.\mathcal{E}_{j}:\quad y^{2}+xy=x^{3}-\frac{36x+1}{j-1728}.

These curves are not unique, but as Ihara and Scholl [78] have shown, they can be effectively used for calculation of Hecke traces. There are three specific models of 2-dimensional abelian varieties with multiplication by the quaternion algebra B6B_{6} relevant to this paper:

  • 1.

    The Jacobian of the generalized Legendre curve

    y6=x4(x1)3(xλ)y^{6}=x^{4}(x-1)^{3}(x-\lambda)

    decomposes according to the characters of μ6\mu_{6}. The part belonging to the primitive characters is 2-dimensional. It corresponds to a hypergeometric motive and has QM by B6B_{6}. See [15].

  • 2.

    The subfamily of the Picard family of genus 3 curves

    y3=x(x1)(xλ)(xμ)y^{3}=x(x-1)(x-\lambda)(x-\mu)

    with μ=1λ\mu=1-\lambda has the following property: Their Jacobians factor as E×AE\times A where EE is an elliptic curve with CM by 𝐐(ζ3)\mathbf{Q}(\zeta_{3}) and AA is a 2-dimensional abelian variety with endomorphisms by B6B_{6}, [75]

  • 3.

    The Baba-Granath curves. In [9], Baba and Granath construct a family of genus-2 curves whose Jacobians have QM by B6B_{6}. This family is analogous the the family of elliptic curves with jj-invariant = jj. It is defined on a quadratic covering of X(Γ6)=j61X(\Gamma_{6})=\mathbb{P}^{1}_{j_{6}}, not on j61\mathbb{P}^{1}_{j_{6}}. This is because of Mestre’s obstruction: in general you cannot define a genus 2 curve CC over the same field as the moduli of its Jacobian. Let K=𝐐(j)K=\mathbf{Q}(j), j=j6j=j_{6}, and define s=6js=\sqrt{-6j}, t=2(27j+16)t=-2(27j+16). Let CC be the projective nonsingular model of the curve y2=f(x)y^{2}=f(x) where

    f(x)=\displaystyle f(x)= (4+3s)x6+6tx5+3t(28+9s)x44t2x3\displaystyle(-4+3s)x^{6}+6tx^{5}+3t(28+9s)x^{4}-4t^{2}x^{3}
    +3t2(289s)x2+6t3xt3(4+3s).\displaystyle+3t^{2}(28-9s)x^{2}+6t^{3}x-t^{3}(4+3s).

    We can rewrite this in terms of modular forms h4h_{4}, h6h_{6}, and h12h_{12} in the previous section. Let zz\in\mathfrak{H}. Then the curve Cz:y2=gz(x)C_{z}:y^{2}=g_{z}(x) where

    gz(x)=\displaystyle g_{z}(x)= h43(z)(x621x421x2+1)\displaystyle h_{4}^{3}(z)(x^{6}-21x^{4}-21x^{2}+1)
    +6h62(z)(x6+9x49x21)\displaystyle+\sqrt{-6}h_{6}^{2}(z)(x^{6}+9x^{4}-9x^{2}-1)
    +22h12(z)x(3x42x2+3)\displaystyle+2\sqrt{2}h_{12}(z)x(3x^{4}-2x^{2}+3)

    has Jacobian isomorphic to (Az,ρz,ιz)(A_{z},\rho_{z},\iota_{z}). The Igusa-Clebsch invariant of this is

    [A,B,C,D]=[j6+1,j6,j6(1j6),j63][A,B,C,D]=[j_{6}+1,j_{6},j_{6}(1-j_{6}),j_{6}^{3}]

    where

    j6=ABCAB+C=D2B5.j_{6}=\frac{AB-C}{AB+C}=\frac{D^{2}}{B^{5}}.

8. Main Theorem.

8.1. Outline.

To prove the main claim, formula (1), by combining the trace formula (2) with the result of Eichler-Shimura theory (8) we must calculate, for the groups we are considering,

Tr(T(p)S2k(U))=Tr(FrobpHet1(XU𝐅p𝐅¯p,V2k(U)))=xXU(𝐅p)Tr(FrobxV2k(U),x¯).-\mathrm{Tr}\,(T(p)\mid S_{2k}(U))=-\mathrm{Tr}\,(\mathrm{Frob}_{p}\mid H^{1}_{et}(X_{U}\otimes_{\mathbf{F}_{p}}\bar{\mathbf{F}}_{p},V^{2k}(U)_{\ell}))=\sum_{x\in X_{U}(\mathbf{F}_{p})}\mathrm{Tr}(\mathrm{Frob}_{x}\mid V^{2k}(U)_{\ell,\bar{x}}).

The contributions are of two sorts: (1) from cusps and elliptic points and (2) the rest.

  • 1.

    The sheaf V2k(U)V^{2k}(U)_{\ell} can be related recursively to V2m(U)V^{2m}(U)_{\ell} for 1m<k1\leq m<k. This means that the traces of Frobenius at xx on it can be expressed as polynomial functions of the trace on V2(U),x¯V^{2}(U)_{\ell,\bar{x}}. It suffices therefore to relate the Frobenius trace on V2(U),x¯V^{2}(U)_{\ell,\bar{x}} to hypergeometric character sums for xx not a cusp or elliptic point. In fact one shows that

    V2(U)a character of finite ordera Kummer sheafa hypergeometric sheaf (HD(U))V^{2}(U)_{\ell}\cong\text{a character of finite order}\otimes\text{a Kummer sheaf}\otimes\text{a hypergeometric sheaf\ }\mathcal{H}(HD(U))

    where HD(U)HD(U) is hypergeometric data attached to UU.

  • 2.

    V2(U)V^{2}(U)_{\ell} (and V2(U)𝐂V^{2}(U)_{\mathbf{C}}) is a rigid local system. Its local monodromies are easily computed complex-analytically, and this is the same as the local monodromy \ell-adically. The local monodromy of the hypergeometric sheaves are also known. The Kummer twist is there to insure matching. Rigidity then shows that there is an isomorphism

    V2(U)a Kummer sheafa hypergeometric sheaf (HD(U))V^{2}(U)_{\ell}\cong\text{a Kummer sheaf}\otimes\text{a hypergeometric sheaf\ }\mathcal{H}(HD(U))

    for the curves/sheaves over the algebraic closure 𝐐¯\bar{\mathbf{Q}}.

  • 3.

    To get an isomorphism over 𝐐\mathbf{Q} we compare both sides at (one or more) CM point. For this we need the explicit models of the curves and abelian varieties.

  • 4.

    The Frobenius traces of the hypergeometric sheaves at points not corresponding to cusps or elliptic points are given by hypergeometric character sums. For cusps or elliptic points, this is essentially what is done in determining the fiber at places of bad reduction for a family of abelian varieties. For elliptic curves, this is Tate’s algorithm; for abelian varieties of dimension 2 coming from genus 2 curves, this is Qing Liu’s algorithm [65]. Actually, our situation is greatly simplified and we follow the method of Scholl in [78].

8.2. Examples

For the triangle groups Γ\Gamma under consideration, we associate hypergeometric data HD(Γ)={α(Γ),β(Γ)}HD(\Gamma)=\{\alpha(\Gamma),\beta(\Gamma)\} defined over 𝐐\mathbf{Q} for which Beukers-Cohen-Mellit [12] introduced hypergeometric character sums Hp(HD(Γ),λ)H_{p}(HD(\Gamma),\lambda) for λ𝐅p×\lambda\in\mathbf{F}_{p}^{\times}. For each integer m1m\geq 1, let Fm(S,T)F_{m}(S,T) be a degree-mm polynomial which expresses the symmetric polynomial i=02muiv2mi\sum_{i=0}^{2m}u^{i}v^{2m-i} in u,vu,v of degree 2m2m as a polynomial in S=u2+uv+v2S=u^{2}+uv+v^{2} and T=uvT=uv, i.e.,

(10) Fm(u2+uv+v2,uv)=i=02muiv2mi.F_{m}(u^{2}+uv+v^{2},uv)=\sum_{i=0}^{2m}u^{i}v^{2m-i}.
Theorem 8.1 ([43]).

For Γ=(2,4,6),(2,,),(2,3,),(2,4,),(2,6,)\Gamma=~{}(2,4,6),~{}(2,\infty,\infty),(2,3,\infty),(2,4,\infty),(2,6,\infty), the table below describes the hypergeometric datum HD(Γ)={α(Γ),β(Γ)}HD(\Gamma)=\{\alpha(\Gamma),\beta(\Gamma)\} and the choice of a generator λ=λ(Γ)\lambda=\lambda(\Gamma) of the field of 𝐐\mathbf{Q}-rational functions on XΓX_{\Gamma} by its values at each elliptic point of given order and cusp:

Γ\Gamma (2,,)(2,\infty,\infty) (2,3,)(2,3,\infty) (2,4,)(2,4,\infty) (2,6,)(2,6,\infty) (2,4,6)(2,4,6)
λ\lambda (1,0,)(1,0,\infty) (1,,0)(1,\infty,0) (1,,0)(1,\infty,0) (1,,0)(1,\infty,0) (3,,0)(-3,\infty,0)
α(Γ)\alpha(\Gamma) {12,12,12}\{\frac{1}{2},\frac{1}{2},\frac{1}{2}\} {12,16,56}\{\frac{1}{2},\frac{1}{6},\frac{5}{6}\} {12,14,34}\{\frac{1}{2},\frac{1}{4},\frac{3}{4}\} {12,13,23}\{\frac{1}{2},\frac{1}{3},\frac{2}{3}\} {12,14,34}\{\frac{1}{2},\frac{1}{4},\frac{3}{4}\}
β(Γ)\beta(\Gamma) {1,1,1}\{1,1,1\} {1,1,1}\{1,1,1\} {1,1,1}\{1,1,1\} {1,1,1}\{1,1,1\} {1,56,76}\{1,\frac{5}{6},\frac{7}{6}\}

Given k2k\geq 2 an even integer and a fixed prime \ell, the terms on the right-hand side of (8.1) for almost all primes pp\neq\ell where XΓX_{\Gamma} has good reduction are as follows.

For λ\lambda not corresponding to an elliptic point or a cusp

(11) Tr(Frobλ(Vk(Γ))λ¯)=Fk/2(aΓ(λ,p),bΓ(λ,p)),\mathrm{Tr}(\mathrm{Frob}_{\lambda}\mid(V^{k}(\Gamma)_{\ell})_{\bar{\lambda}})=F_{k/2}(a_{\Gamma}(\lambda,p),b_{\Gamma}(\lambda,p)),

where bΓ(λ,p)=pb_{\Gamma}(\lambda,p)=p and

(12) aΓ(λ,p)={(11/λp)Hp(HD(Γ),1/λ) if Γ(2,4,6);(3(1+3/λ)p)pHp(HD(Γ),3/λ) if Γ=(2,4,6).a_{\Gamma}(\lambda,p)=\begin{cases}\left(\frac{1-1/\lambda}{p}\right)H_{p}(HD(\Gamma),1/\lambda)&\text{ if $\Gamma\neq(2,4,6)$};\\ \left(\frac{-3(1+3/\lambda)}{p}\right)pH_{p}(HD(\Gamma),-3/\lambda)&\text{ if $\Gamma=(2,4,6)$}.\end{cases}

The contribution of λ\lambda corresponding to a cusp is 11. There are also explicit formulas for the contributions of the elliptic points.

Here is an example illustrating our main result. For Γ=(2,4,6)\Gamma=(2,4,6) the lowest kk with nontrivial Sk+2(2,4,6)S_{k+2}(2,4,6) is k=6k=6, in which case S8(2,4,6)=h42S_{8}(2,4,6)=\langle h_{4}^{2}\rangle is 11-dimensional, where h4h_{4} generates S4(2,2,3,3)S_{4}(2,2,3,3). By Jacquet-Langlands correspondence [47], h42h_{4}^{2} corresponds to the normalized weight-88 level 66 cuspidal newform f6.8.a.af_{6.8.a.a} in the LMFDB label. For primes p>5p>5, denote by ap(h42)a_{p}(h_{4}^{2}) the eigenvalue of TpT_{p} on h42h_{4}^{2}, which is equal to the ppth Fourier coefficient ap(f6.8.a.a)a_{p}(f_{6.8.a.a}) of f6.8.a.af_{6.8.a.a}. The theorem above gives

ap(h42)=ap(f6.8.a.a)\displaystyle-a_{p}(h_{4}^{2})=-a_{p}(f_{6.8.a.a}) =λ𝐅p,λ0,3(aΓ(λ,p)32paΓ(λ,p)2p2aΓ(λ,p)+p3)\displaystyle=\sum_{\lambda\in\mathbf{F}_{p},\lambda\neq 0,-3}\left(a_{\Gamma}(\lambda,p)^{3}-2pa_{\Gamma}(\lambda,p)^{2}-p^{2}a_{\Gamma}(\lambda,p)+p^{3}\right)
+p((pHp(HD(Γ);1))2p2)+((1p)+(3p)+(6p))p3.\displaystyle+p((pH_{p}(HD(\Gamma);1))^{2}-p^{2})+\left(\left(\frac{-1}{p}\right)+\left(\frac{-3}{p}\right)+\left(\frac{-6}{p}\right)\right)p^{3}.

The term p((pHp(HD(Γ);1))2p2)p((pH_{p}(HD(\Gamma);1))^{2}-p^{2}) on the right side comes from the contribution at the elliptic point of order 22 on X(2,4,6)X_{(2,4,6)}; it also equals pap(f24.5.h.b)pa_{p}(f_{24.5.h.b}), in terms of the ppth Fourier coefficient of the weight-5 CM modular form f24.5.h.bf_{24.5.h.b}.

8.3. Future directions

Here are two:

  • 1.

    Ohta’s results apply to Shimura curves of quaternion algebras over any totally real number field FF. The (5,5,5)(5,5,5) triangle group, which appears in Takeuchi’s list, corresponds to a quaternion algebra BB over F=𝐐(5)F=\mathbf{Q}(\sqrt{5}). The new part of the Jacobian of the curve

    y10=x2(1x)7(1λx)7y^{10}=x^{2}(1-x)^{7}(1-\lambda x)^{7}

    is 4=2[F:𝐐]4=2[F:\mathbf{Q}]-dimensional and has endomorphism algebra the quaternion algebra over 𝐐(5)\mathbf{Q}(\sqrt{5}) with discriminant 𝔭5\mathfrak{p}_{5}, the unique prime over 55. This is a model of the family of abelian varieties over a covering of the Shimura curve for BB. See [15]. The Jacquet-Langlands correspondence in this case will land in Hilbert modular forms for 𝐐(5)\mathbf{Q}(\sqrt{5}).

  • 2.

    The quaternion algebra B10B_{10} over 𝐐\mathbf{Q} of discriminant 10 is also discussed in the paper of Baba-Granath. But this time, the fundamental domains are not related to triangle groups. In fact, the corresponding differential equation now has 4 regular singular points, and in particular is no longer rigid. There is an accessory parameter which was determined by Elkies in [28] using Schwarzian equations. The number theory here will be governed by Heun functions over finite fields, a theory that needs to be developed. See the entry on Heun Functions in [23].

References

  • [1] Scott Ahlgren. Gaussian hypergeometric series and combinatorial congruences. In Symbolic computation, number theory, special functions, physics and combinatorics (Gainesville, FL, 1999), volume 4 of Dev. Math., pages 1–12. Kluwer Acad. Publ., Dordrecht, 2001.
  • [2] Scott Ahlgren and Ken Ono. A Gaussian hypergeometric series evaluation and Apéry number congruences. J. Reine Angew. Math., 518:187–212, 2000.
  • [3] George E. Andrews, Richard Askey, and Ranjan Roy. Special functions, volume 71 of Encyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge, 1999.
  • [4] Donu Arapura. An abelian category of motivic sheaves. Adv. Math., 233:135–195, 2013.
  • [5] Donu Arapura. Motivic sheaves revisited. J. Pure Appl. Algebra, 227(8):Paper No. 107125, 22, 2023.
  • [6] A. O. L. Atkin, Wen-Ching Winnie Li, Tong Liu, and Ling Long. Galois representations with quaternion multiplication associated to noncongruence modular forms. Trans. Amer. Math. Soc., 365(12):6217–6242, 2013.
  • [7] Joseph Ayoub. Les six opérations de Grothendieck et le formalisme des cycles évanescents dans le monde motivique. I. Astérisque, (314):x+466, 2007.
  • [8] Joseph Ayoub. Les six opérations de Grothendieck et le formalisme des cycles évanescents dans le monde motivique. II. Astérisque, (315):vi+364, 2007.
  • [9] Srinath Baba and Håkan Granath. Genus 2 curves with quaternionic multiplication. Canad. J. Math., 60(4):734–757, 2008.
  • [10] Pilar Báyer and Jürgen Neukirch. On automorphic forms and Hodge theory. Math. Ann., 257(2):137–155, 1981.
  • [11] Pilar Báyer and Artur Travesa. Uniformizing functions for certain Shimura curves, in the case D=6D=6. Acta Arith., 126(4):315–339, 2007.
  • [12] Frits Beukers, Henri Cohen, and Anton Mellit. Finite hypergeometric functions. Pure Appl. Math. Q., 11(4):559–589, 2015.
  • [13] Frits Beukers and Gerrit Heckman. Monodromy for the hypergeometric function Fn1n{}_{n}F_{n-1}. Invent. Math., 95(2):325–354, 1989.
  • [14] Christina Birkenhake and Herbert Lange. Complex Abelian Varieties, volume 302 of Grundlehren der mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, second edition, 2004.
  • [15] Alyson Deines, Jenny G. Fuselier, Ling Long, Holly Swisher, and Fang-Ting Tu. Generalized Legendre curves and quaternionic multiplication. J. Number Theory, 161:175–203, 2016.
  • [16] Pierre Deligne. Équations différentielles à points singuliers réguliers. Lecture Notes in Mathematics, Vol. 163. Springer-Verlag, Berlin-New York, 1970.
  • [17] Pierre Deligne. Formes modulaires et représentations ll-adiques. In Séminaire Bourbaki. Vol. 1968/69: Exposés 347–363, volume 175 of Lecture Notes in Math., pages Exp. No. 355, 139–172. Springer, Berlin, 1971.
  • [18] Pierre Deligne. La conjecture de Weil. I. Inst. Hautes Études Sci. Publ. Math., (43):273–307, 1974.
  • [19] Pierre Deligne. Variétés de Shimura: interprétation modulaire, et techniques de construction de modèles canoniques. In Automorphic forms, representations and LL-functions (Proc. Sympos. Pure Math., Oregon State Univ., Corvallis, Ore., 1977), Part 2, Proc. Sympos. Pure Math., XXXIII, pages 247–289. 1979.
  • [20] Pierre Deligne. La conjecture de Weil. II. Inst. Hautes Études Sci. Publ. Math., (52):137–252, 1980.
  • [21] Pierre Deligne and George D. Mostow. Monodromy of hypergeometric functions and nonlattice integral monodromy. Inst. Hautes Études Sci. Publ. Math., (63):5–89, 1986.
  • [22] Pierre Deligne and Michael Rapoport. Les schémas de modules de courbes elliptiques. In Modular functions of one variable, II (Proc. Internat. Summer School, Univ. Antwerp, Antwerp, 1972), Lecture Notes in Math., Vol. 349, pages 143–316. Springer, Berlin, 1973.
  • [23] NIST Digital Library of Mathematical Functions. http://dlmf.nist.gov/, Release 1.1.0 of 2020-12-15. F. W. J. Olver, A. B. Olde Daalhuis, D. W. Lozier, B. I. Schneider, R. F. Boisvert, C. W. Clark, B. R. Miller, B. V. Saunders, H. S. Cohl, and M. A. McClain, eds.
  • [24] Bernard Dwork and François Loeser. Hypergeometric series. Japan. J. Math. (N.S.), 19(1):81–129, 1993.
  • [25] Bernard Dwork and François Loeser. Hypergeometric series and functions as periods of exponential modules. In Barsotti Symposium in Algebraic Geometry (Abano Terme, 1991), volume 15 of Perspect. Math., pages 153–174. Academic Press, San Diego, CA, 1994.
  • [26] Martin Eichler. Quaternäre quadratische Formen und die Riemannsche Vermutung für die Kongruenzzetafunktion. Arch. Math., 5:355–366, 1954.
  • [27] Martin Eichler. Eine Verallgemeinerung der Abelschen Integrale. Math. Z., 67:267–298, 1957.
  • [28] Noam D. Elkies. Shimura curve computations. In Algorithmic number theory (Portland, OR, 1998), volume 1423 of Lecture Notes in Comput. Sci., pages 1–47. Springer, Berlin, 1998.
  • [29] Noam D. Elkies. The Klein quartic in number theory. In The eightfold way, volume 35 of Math. Sci. Res. Inst. Publ., pages 51–101. Cambridge Univ. Press, Cambridge, 1999.
  • [30] Ron Evans and John Greene. Clausen’s theorem and hypergeometric functions over finite fields. Finite Fields Appl., 15(1):97–109, 2009.
  • [31] Sharon Frechette, Ken Ono, and Matthew Papanikolas. Gaussian hypergeometric functions and traces of Hecke operators. Int. Math. Res. Not., (60):3233–3262, 2004.
  • [32] Sharon Frechette, Holly Swisher, and Fang-Ting Tu. A cubic transformation formula for Appell–Lauricella hypergeometric functions over finite fields. Res. Number Theory, 4(2):4:27, 2018.
  • [33] Javier Fresán, Claude Sabbah, and Jeng-Daw Yu. Hodge theory of Kloosterman connections. Duke Math. J., 171(8):1649–1747, 2022.
  • [34] Lei Fu. \ell-adic GKZ hypergeometric sheaves and exponential sums. Adv. Math., 298:51–88, 2016.
  • [35] Lei Fu, Wan Daqing, and Hao Zhang. The pp-adic Gelfand-Kapranov-Zelevinsky complex. arXiv:1804.05297, to appear, Mathematische Annalen.
  • [36] Jenny Fuselier, Ling Long, Ravi Ramakrishna, Holly Swisher, and Fang-Ting Tu. Hypergeometric functions over finite fields. Mem. Amer. Math. Soc., 280(1382), 2022.
  • [37] Jenny G. Fuselier. Hypergeometric functions over 𝔽p\mathbb{F}_{p} and relations to elliptic curves and modular forms. Proc. Amer. Math. Soc., 138(1):109–123, 2010.
  • [38] Ofer Gabber and François Loeser. Faisceaux pervers ll-adiques sur un tore. Duke Math. J., 83(3):501–606, 1996.
  • [39] Izrail M. Gelfand, Mikhail M. Kapranov, and Andrey V. Zelevinsky. Generalized Euler integrals and AA-hypergeometric functions. Adv. Math., 84(2):255–271, 1990.
  • [40] Matthew Greenberg and John Voight. Computing systems of Hecke eigenvalues associated to Hilbert modular forms. Math. Comp., 80(274):1071–1092, 2011.
  • [41] John Greene. Hypergeometric functions over finite fields. Trans. Amer. Math. Soc., 301(1):77–101, 1987.
  • [42] Ki-ichiro Hashimoto. Explicit form of quaternion modular embeddings. Osaka J. Math., 32(3):533–546, 1995.
  • [43] Jerome W. Hoffman, Wen-Ching Winnie Li, Ling Long, and Fang-Ting Tu. Traces of Hecke Operators via Hypergeometric Character Sums, 2024; arXiv: 2408.02918.
  • [44] Rolf-Peter Holzapfel. Geometry and arithmetic around Euler partial differential equations, volume 11 of Mathematics and its Applications (East European Series). D. Reidel Publishing Co., Dordrecht, 1986.
  • [45] Jun-ichi Igusa. Arithmetic variety of moduli for genus two. Ann. of Math. (2), 72:612–649, 1960.
  • [46] Yasutaka Ihara. Hecke polynomials as congruence ζ\zeta functions in elliptic modular case. Ann. of Math. (2), 85:267–295, 1967.
  • [47] Hervé Jacquet and Robert P. Langlands. Automorphic forms on GL(2){\rm GL}(2). Lecture Notes in Mathematics, Vol. 114. Springer-Verlag, Berlin-New York, 1970.
  • [48] Bruce W. Jordan. On the Diophantine arithmetic of Shimura curves, thesis, Harvard University, 1981.
  • [49] Nicholas M. Katz. The regularity theorem in algebraic geometry. In Actes du Congrès International des Mathématiciens (Nice, 1970), Tome 1, pages 437–443. 1971.
  • [50] Nicholas M. Katz. Algebraic solutions of differential equations (pp-curvature and the Hodge filtration). Invent. Math., 18:1–118, 1972.
  • [51] Nicholas M. Katz. An overview of Deligne’s work on Hilbert’s twenty-first problem. In Mathematical developments arising from Hilbert problems (Proc. Sympos. Pure Math., Northern Illinois Univ., De Kalb, Ill., 1974), volume Vol. XXVIII of Proc. Sympos. Pure Math., pages 537–557. Amer. Math. Soc., Providence, RI, 1976.
  • [52] Nicholas M. Katz. Exponential Sums and Differential Equations, volume 124 of Annals of Mathematics Studies. Princeton University Press, Princeton, NJ, 1990.
  • [53] Nicholas M. Katz. Rigid local systems, volume 139 of Annals of Mathematics Studies. Princeton University Press, Princeton, NJ, 1996.
  • [54] Nicholas M. Katz. Another look at the Dwork family. In Algebra, arithmetic, and geometry: in honor of Yu. I. Manin. Vol. II, volume 270 of Progr. Math., pages 89–126. Birkhäuser Boston, Boston, MA, 2009.
  • [55] Michael Klug, Michael Musty, Sam Schiavone, and John Voight. Numerical calculation of three-point branched covers of the projective line. LMS J. Comput. Math., 17(1):379–430, 2014.
  • [56] Maxim Kontsevich and Don Zagier. Periods. In Mathematics unlimited—2001 and beyond, pages 771–808. Springer, Berlin, 2001.
  • [57] Michio Kuga. Kuga varieties, volume 9 of CTM. Classical Topics in Mathematics. Higher Education Press, Beijing, 2018. Fiber varieties over a symmetric space whose fibers are Abelian varieties, Appendices including a letter by André Weil and an essay by I. Satake.
  • [58] Michio Kuga and Goro Shimura. On the zeta function of a fibre variety whose fibres are abelian varieties. Ann. of Math. (2), 82:478–539, 1965.
  • [59] Robert P. Langlands. Modular forms and \ell-adic representations. In Modular functions of one variable, II (Proc. Internat. Summer School, Univ. Antwerp, Antwerp, 1972), Lecture Notes in Math., Vol. 349, pages 361–500. Springer, Berlin, 1973.
  • [60] Robert P. Langlands. On the zeta functions of some simple Shimura varieties. Canadian J. Math., 31(6):1121–1216, 1979.
  • [61] Catherine Lennon. Gaussian hypergeometric evaluations of traces of Frobenius for elliptic curves. Proc. Amer. Math. Soc., 139(6):1931–1938, 2011.
  • [62] Catherine Lennon. Trace formulas for Hecke operators, Gaussian hypergeometric functions, and the modularity of a threefold. J. Number Theory, 131(12):2320–2351, 2011.
  • [63] Wen-Ching W. Li, Ling Long, and Fang-Ting Tu. A Whipple F67{}_{7}F_{6} formula revisited. Matematica, 1(2):480–530, 2022.
  • [64] Yi-Hsuan Lin and Yifan Yang. Quaternionic loci in Siegel’s modular threefold. Math. Z., 295(1-2):775–819, 2020.
  • [65] Qing Liu. Modèles minimaux des courbes de genre deux. J. Reine Angew. Math., 453:137–164, 1994.
  • [66] François Loeser and Claude Sabbah. Caractérisation des 𝒟{\mathcal{D}}-modules hypergéométriques irréductibles sur le tore. C. R. Acad. Sci. Paris Sér. I Math., 312(10):735–738, 1991.
  • [67] François Loeser and Claude Sabbah. Équations aux différences finies et déterminants d’intégrales de fonctions multiformes. Comment. Math. Helv., 66(3):458–503, 1991.
  • [68] François Loeser and Claude Sabbah. Caractérisation des 𝒟{\mathcal{D}}-modules hypergéométriques irréductibles sur le tore. II. C. R. Acad. Sci. Paris Sér. I Math., 315(12):1263–1264, 1992.
  • [69] Ju. I. Manin. Parabolic points and zeta functions of modular curves. Izv. Akad. Nauk SSSR Ser. Mat., pages 19–66, 1972.
  • [70] Ju. I. Manin. Periods of cusp forms, and pp-adic Hecke series. Mat. Sb. (N.S.), pages 378–401, 503, 1973.
  • [71] James S. Milne. Introduction to Shimura varieties. In Harmonic analysis, the trace formula, and Shimura varieties, volume 4 of Clay Math. Proc., pages 265–378. Amer. Math. Soc., Providence, RI, 2005.
  • [72] Masami Ohta. On ll-adic representations attached to automorphic forms. Japan. J. Math. (N.S.), 8(1):1–47, 1982.
  • [73] Masami Ohta. On the zeta function of an abelian scheme over the Shimura curve. Japan. J. Math. (N.S.), 9(1):1–25, 1983.
  • [74] Masami Ohta. On the zeta function of an abelian scheme over the Shimura curve. II. In Galois groups and their representations (Nagoya, 1981), volume 2 of Adv. Stud. Pure Math., pages 37–54. North-Holland, Amsterdam, 1983.
  • [75] Maria Petkova and Hironori Shiga. A new interpretation of the Shimura curve with discriminant 6 in terms of Picard modular forms. Arch. Math. (Basel), 96(4):335–348, 2011.
  • [76] David P. Roberts and Fernando Rodriguez Villegas. Hypergeometric motives. Notices Amer. Math. Soc., 69(6):914–929, 2022.
  • [77] Victor Rotger. Modular Shimura varieties and forgetful maps. Trans. Amer. Math. Soc., 356(4):1535–1550, 2004.
  • [78] Anthony J. Scholl. The ll-adic representations attached to a certain noncongruence subgroup. J. Reine Angew. Math., 392:1–15, 1988.
  • [79] René Schoof. Counting points on elliptic curves over finite fields. volume 7, pages 219–254. 1995. Les Dix-huitièmes Journées Arithmétiques (Bordeaux, 1993).
  • [80] Hermann A. Schwarz. Ueber diejenigen Fälle, in welchen die Gaussische hypergeometrische Reihe eine algebraische Function ihres vierten Elementes darstellt. J. Reine Angew. Math., 75:292–335, 1873.
  • [81] Goro Shimura. Correspondances modulaires et les fonctions ζ\zeta de courbes algébriques. J. Math. Soc. Japan, 10:1–28, 1958.
  • [82] Goro Shimura. Sur les intégrales attachées aux formes automorphes. J. Math. Soc. Japan, 11:291–311, 1959.
  • [83] Goro Shimura. On the zeta-functions of the algebraic curves uniformized by certain automorphic functions. J. Math. Soc. Japan, 13:275–331, 1961.
  • [84] Goro Shimura. An \ell-adic method in the theory of automorphic forms. Collected Papers, II:237–272, 1968.
  • [85] Goro Shimura. Introduction to the arithmetic theory of automorphic functions, volume 11 of Publications of the Mathematical Society of Japan. Princeton University Press, Princeton, NJ, 1994. Reprint of the 1971 original, Kanô Memorial Lectures, 1.
  • [86] Tetsuji Shioda. On rational points of the generic elliptic curve with level NN structure over the field of modular functions of level NN. J. Math. Soc. Japan, 25:144–157, 1973.
  • [87] Lucy Joan Slater. Generalized hypergeometric functions. Cambridge University Press, Cambridge, 1966.
  • [88] Peter Stiller. Special values of Dirichlet series, monodromy, and the periods of automorphic forms. Mem. Amer. Math. Soc., 49(299):iv+116, 1984.
  • [89] Kisao Takeuchi. Arithmetic triangle groups. J. Math. Soc. Japan, 29(1):91–106, 1977.
  • [90] Kisao Takeuchi. Commensurability classes of arithmetic triangle groups. J. Fac. Sci. Univ. Tokyo Sect. IA Math., 24(1):201–212, 1977.
  • [91] John Voight. Quaternion algebras, volume 288 of Graduate Texts in Mathematics. Springer, Cham, [2021] ©2021.
  • [92] Yifan Yang. Schwarzian differential equations and Hecke eigenforms on Shimura curves. Compos. Math., 149(1):1–31, 2013.
  • [93] Don Zagier. The arithmetic and topology of differential equations. In European Congress of Mathematics, pages 717–776. Eur. Math. Soc., Zürich, 2018.