This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Arithmetic Groups and the Lehmer Conjecture

Lam Pham L.P.: Department of Mathematics, Brandeis University, 415 South Street, 02453 Waltham, United States of America & The Einstein Institute of Mathematics, Edmond J. Safra Campus, Givat Ram, The Hebrew University of Jerusalem Jerusalem, 91904, Israel [email protected]  and  François Thilmany F.T.: Department of mathematics, KU Leuven, Celestijnenlaan 200B, 3001 Leuven, Belgium [email protected]
Abstract.

We generalize a result of Sury [Sur92] and prove that uniform discreteness of cocompact lattices in higher rank semisimple Lie groups (first conjectured by Margulis [Mar91]) is equivalent to a weak form of Lehmer’s conjecture. We include a short survey of related results and conjectures.

L.P. acknowledges the support of the Zuckerman STEM Leadership Program (as Zuckerman Postdoctoral Scholar) and of the ERC (under HomDyn 833423).
F.T. acknowledges the support of the FNRS (under CR FC 4057) and of the KU Leuven (under PDM 19/145).

1. Introduction and main result

1.1. Margulis’ arithmeticity theorem and conjecture

Let 𝐆\mathbf{G} be a connected semisimple \mathbb{R}-group with rank(𝐆)2\operatorname{rank}_{\mathbb{R}}(\mathbf{G})\geq 2. Then, G=𝐆()G=\mathbf{G}(\mathbb{R}) is a semisimple Lie group and Margulis’ celebrated arithmeticity theorem [Mar75] states that every irreducible lattice ΓG\Gamma\subset G is arithmetic (see §3 for precise definitions and statements).

The starting point of this paper is the following consequence of arithmeticity for irreducible non-cocompact lattices [Mar91, IX, (4.21-A)]:

Theorem (Margulis).

Assume that 𝐆\mathbf{G} has no \mathbb{R}-anisotropic factor. There is a neighbourhood UGU\subset G of the identity such that for any irreducible non-cocompact lattice ΓG\Gamma\subset G, the intersection UΓU\cap\Gamma consists of unipotent elements.

Margulis [Mar91, IX, (4.21-B)] then conjectured that an analogous statement would hold for cocompact lattices. In fact, Margulis indicates that this conjecture would follow from a weaker form of Lehmer’s conjecture, which we recall below (see namely Conjecture (C)).

Conjecture (A) (Margulis).

Let 𝐆\mathbf{G} be a connected semisimple \mathbb{R}-group. Suppose rank(𝐆)2\operatorname{rank}_{\mathbb{R}}(\mathbf{G})\geq 2. Then there exists a neighborhood U𝐆()U\subset\mathbf{G}(\mathbb{R}) of the identity such that for any irreducible cocompact lattice Γ𝐆()\Gamma\subset\mathbf{G}(\mathbb{R}), the intersection UΓU\cap\Gamma consists of elements of finite order.

For the purpose of this paper, it will be useful to work with the following statement.

Margulis’ conjecture for a family 𝒯\mathscr{T} of semisimple \mathbb{R}-groups.

For each 𝐆𝒯\mathbf{G}\in\mathscr{T}, there exists a neighborhood U𝐆()U\subset\mathbf{G}(\mathbb{R}) of the identity such that for any irreducible cocompact arithmetic lattice Γ\Gamma in 𝐆()\mathbf{G}(\mathbb{R}), the intersection UΓU\cap\Gamma consists of elements of finite order.

Note that in view of the arithmeticity theorem, ‘Margulis’ conjecture for higher rank groups’ is in this sense simply Conjecture (A).

Remark.

An even stronger statement than Conjecture (A) holds for pp-adic groups. Namely, if kk is a non-archimedean local field of characteristic zero and 𝐆\mathbf{G} is an algebraic kk-group (of dimension >0>0), then there is an open neighborhood U𝐆(k)U\subset\mathbf{G}(k) of the identity such that each nontrivial element hUh\in U generates a non-discrete subgroup [Mar91, IX, (3.5)]. In particular, every lattice in 𝐆(k)\mathbf{G}(k) must intersect UU trivially.

In contrast, in the real case one cannot omit elements of finite order from the statement of Conjecture (A). Indeed, in general one can construct a sequence (Γm)m(\Gamma_{m})_{m\in\mathbb{N}} of cocompact arithmetic lattices in 𝐆()\mathbf{G}(\mathbb{R}) and torsion elements γmΓm\gamma_{m}\in\Gamma_{m} tending to 11. In the case of 𝐒𝐋2()\operatorname{\mathbf{SL}}_{2}(\mathbb{R}) and 𝐒𝐋2()\operatorname{\mathbf{SL}}_{2}(\mathbb{C}), the reader can consult [MR03, §12.5] for a thorough discussion on the existence of elements of arbitrary order in cocompact lattices.

1.2. Lehmer’s conjecture

Let P[X]P\in\mathbb{C}[X] be a monic polynomial of degree dd with roots α1,,αd\alpha_{1},\,\ldots,\,\alpha_{d}\in\mathbb{C}. The Mahler measure of PP is

(P)=i=1dmax{1,|αi|}.\operatorname{\mathcal{M}}(P)=\prod_{i=1}^{d}\max\{1,|\alpha_{i}|\}.

In the following, let α1,,αs(P)\alpha_{1},\dots,\alpha_{s(P)} be an enumeration of the roots of PP in \mathbb{C} which have absolute value strictly greater than 1, repeated according to their multiplicity, so that s(P)s(P) denotes their count (with multiplicity) and we may rewrite (P)=i=1s(P)|αi|.\operatorname{\mathcal{M}}(P)=\prod_{i=1}^{s(P)}|\alpha_{i}|. If α¯\alpha\in\overline{\mathbb{Q}} is an algebraic integer, the Mahler measure (α)\operatorname{\mathcal{M}}(\alpha) of α\alpha will be defined as the Mahler measure (Pα)\operatorname{\mathcal{M}}(P_{\alpha}) of its minimal polynomial PαP_{\alpha} over \mathbb{Z}. The Mahler measure on algebraic integers is obviously invariant under the action of Ga(¯/)\operatorname{Ga\ell}(\overline{\mathbb{Q}}/\mathbb{Q}).

The Mahler measure is multiplicative. By virtue of Kronecker’s theorem, for PP a monic, irreducible polynomial with integer coefficients, we have

(P)=1P(X)=X,orPis a cyclotomic polynomial.\operatorname{\mathcal{M}}(P)=1\quad\Leftrightarrow\quad P(X)=X,\ \text{or}\ P\ \text{is a cyclotomic polynomial.}

In 1933, Lehmer [Leh33] asked whether one could find irreducible polynomials with integer coefficients whose Mahler measure gets arbitrarily close to 1 (but is not 1). It is conjectured that this is not possible:

Conjecture (B) (Lehmer(1)(1)(1)Lehmer always insisted that he had not formulated his problem as a conjecture, although we will customarily refer to it as such.).

There exists ε>0\varepsilon>0 such that for any (irreducible) monic polynomial PP with integer coefficients, either

(P)=1or(P)>1+ε.\operatorname{\mathcal{M}}(P)=1\quad\text{or}\quad\operatorname{\mathcal{M}}(P)>1+\varepsilon.

In fact, Lehmer’s polynomial

PLehmer=X10+X9X7X6X5X4X3+X+1,P_{\mathrm{Lehmer}}=X^{10}+X^{9}-X^{7}-X^{6}-X^{5}-X^{4}-X^{3}+X+1,

for which (PLehmer)=1.17628\operatorname{\mathcal{M}}(P_{\mathrm{Lehmer}})=1.17628..., is suspected to attain the smallest Mahler measure greater than 1. Partial results towards Lehmer’s conjecture are known; we list some of them below.

We will be concerned with the following weaker version of Lehmer’s conjecture.

Conjecture (C) (weak Lehmer).

For each ss\in\mathbb{N}, there exists ε(s)>0\varepsilon(s)>0 such that for any (irreducible) monic polynomial PP with integer coefficients and s(P)ss(P)\leq s, either

(P)=1or(P)>1+ε(s).\operatorname{\mathcal{M}}(P)=1\quad\textrm{or}\quad\operatorname{\mathcal{M}}(P)>1+\varepsilon(s).

For a given ss, we will call the statement in Conjecture (C) Lehmer’s conjecture at level ss. In this way, Conjecture (C) could be described as “Lehmer’s conjecture at all levels”, and Lehmer’s conjecture (B) as “Lehmer’s conjecture at all levels uniformly”.

1.3. Main result

Fix an absolutely (almost) simple isotropic \mathbb{R}-group 𝐅\mathbf{F} and consider, for each integer s1s\geq 1, the family of semisimple \mathbb{R}-groups

𝒯𝐅(s)={i=1r𝐅×i=1tRes/(𝐅)|r,t,r+2ts}.\mathscr{T}_{\mathbf{F}}^{(s)}=\left\{\prod_{i=1}^{r}\mathbf{F}\times\prod_{i=1}^{t}\operatorname{Res}_{\mathbb{C}/\mathbb{R}}(\mathbf{F})\ \middle|\ r,t\in\mathbb{N},\;r+2t\leq s\right\}.

The following theorem is the main result of this paper.

Theorem.

Let s1s\geq 1. Then Margulis’ conjecture for any of the families 𝒯𝐅(s)\mathscr{T}_{\mathbf{F}}^{(s)} defined above(2)(2)(2)As the statement in Margulis’ conjecture is insensitive to isogenies (see namely the first paragraph of §3.2), in the theorem, one could of course replace each element of the family 𝒯𝐅(s)\mathscr{T}_{\mathbf{F}}^{(s)} by another semisimple group isogenous to it. implies Lehmer’s conjecture at level ss. In consequence, Margulis’ conjecture (A) is equivalent to the weak version (C) of Lehmer’s conjecture.

In view of the theorem, let us emphasize that Margulis’ conjecture for products of simple groups of any fixed \mathbb{R}-type and their extensions to \mathbb{C} is already sufficient to imply the weak Lehmer conjecture, which in turn implies Margulis’ conjecture for all semisimple groups (see §3). For the sake of clarity, we will first carry the argument out in detail for type 𝖠n\mathsf{A}_{n} split in §4. To complete the proof of the theorem, we then give a comprehensive treatment of all other \mathbb{R}-types in §5. It is noteworthy that the argument works for every \mathbb{R}-type in a similar way.

It would be interesting to know whether Margulis’ conjecture has implications for the full Lehmer conjecture (B), beyond Conjecture (C). Connections between the full Lehmer conjecture and linear groups have already been brought to light, for example in the work of Breuillard [Bre07] and of Breuillard–Varju [BV20].

1.4. Notation and conventions

Throughout the paper, we adopt the conventions of Bourbaki. In particular, ={0,1,2,}\mathbb{N}=\{0,1,2,\dots\} and we denote =\{0}\mathbb{N}^{*}=\mathbb{N}\backslash\{0\}.

  • If P[x]P\in\mathbb{C}[x], s(P)s(P) denotes the number of roots of PP in \mathbb{C} which have absolute value >1>1, counted with multiplicity.

  • (P)\operatorname{\mathcal{M}}(P) denotes the Mahler measure of P[x]P\in\mathbb{C}[x].

  • If α¯\alpha\in\overline{\mathbb{Q}} is an algebraic integer, PαP_{\alpha} denotes its minimal polynomial over \mathbb{Z}.

  • 𝒪K\mathcal{O}_{K} denotes the ring of integers of a number field KK.

  • Mn(A)\mathrm{M}_{n}(A) denotes the set of n×nn\times n matrices with entries in an algebraic structure AA, endowed with whichever structure is inherited from AA.

Acknowledgements

The authors would like to thank the anonymous referee for his helpful comments, which improved the clarity of the paper and led to a more thorough treatment of the exceptional groups. The authors also thank E. Breuillard and G. Margulis for interesting conversations.

2. A short history

In this section, we provide a very short (and incomplete) survey of known results and references about the arithmetic and geometry of the famous Lehmer problem. For a more extensive treatment, we refer the reader to the surveys of Smyth [Smy08, Smy15], and Ghate and Hironaka [GH01].

2.1. Some known results about the Lehmer conjecture

Let α¯\alpha\in\overline{\mathbb{Q}} be an algebraic integer which is not an integer nor a root of unity, and let d2d\geq 2 denote the degree of PαP_{\alpha}. Then, the following lower bound for M(α)M(\alpha) is known:

(2.1.1) (α)>1+14(loglogdlogd)3.\operatorname{\mathcal{M}}(\alpha)>1+\frac{1}{4}\left(\frac{\log\log d}{\log d}\right)^{3}.

This was proved by Voutier [Vou96, Theorem], improving the bound of Dobrowolski [Dob79, Theorem 1] (who obtained a factor of 1/12001/1200 instead of 1/41/4). Another bound due to Laurent(3)(3)(3)In a private communication, Michel Laurent informed us that this bound was not published and that Schinzel [Sch73] proved the sharp inequality (α)(1+52)d/2\operatorname{\mathcal{M}}(\alpha)\geq\big{(}\tfrac{1+\sqrt{5}}{2}\big{)}^{d/2}, for every α\alpha which is a totally real algebraic integer of degree d2d\geq 2. (1983), emphasizing the number of real roots, is given by Margulis [Mar91, p. 322]: if PP is a non-cyclotomic polynomial of degree d2d\geq 2 with rr real roots, then

(α)cr2/dlog(1+dr),\operatorname{\mathcal{M}}(\alpha)\geq c^{r^{2}/d\log(1+\frac{d}{r})},

where c>0c>0 is an absolute constant.

Recall that a polynomial P[X]P\in\mathbb{Z}[X] is called palindromic if

P(X)=Xdeg(P)P(X1).P(X)=X^{\deg(P)}P(X^{-1}).

Every palindromic polynomial PP of odd degree is divisible by X+1X+1, hence irreducible palindromic polynomials of degree >1>1 must have even degree.

Smyth [Smy71] proved that the polynomial PSmyth(X)=X3X1P_{\mathrm{Smyth}}(X)=X^{3}-X-1 had the smallest Mahler measure among non-palindromic polynomials.

Theorem ([Smy71]).

Let α¯\alpha\in\overline{\mathbb{Q}}. If (Pα)<(PSmyth)\operatorname{\mathcal{M}}(P_{\alpha})<\operatorname{\mathcal{M}}(P_{\mathrm{Smyth}}), then PαP_{\alpha} is palindromic.

This effectively reduces Conjectures (B) and (C) to palindromic polynomials.

Although we will not use it, we note in passing another instance for which the Lehmer conjecture is known to hold. Borwein, Dobrowolski, and Mossinghoff [BDM07] proved that if PP is a polynomial of degree dd without cyclotomic factors, all of whose coefficients are odd integers, then

log(P)log54(11d+1).\log\operatorname{\mathcal{M}}(P)\geq\frac{\log 5}{4}\left(1-\frac{1}{d+1}\right).

2.2. The case of Salem numbers

Recall that a Salem number is an algebraic integer uu\in\mathbb{R} which is >1>1 and all of whose Galois conjugates in \mathbb{C} have absolute value 1\leq 1, with at least one of absolute value =1=1. Hence, any Salem number α\alpha satisfies s(Pα)=1s(P_{\alpha})=1 where PαP_{\alpha} is the minimal polynomial of α\alpha, and it is clear that PαP_{\alpha} is palindromic. Conversely, any irreducible palindromic polynomial P[x]P\in\mathbb{Z}[x] with s(P)=1s(P)=1 and degP4\deg P\geq 4 is the minimal polynomial of a Salem number. The following conjecture is thus equivalent to Lehmer’s conjecture at level s=1s=1 (see the paragraph after Conjecture (C)).

Conjecture (D) (Salem, arithmetic version).

There exists ε>0\varepsilon>0 such that every Salem number α\alpha satisfies α>1+ε\alpha>1+\varepsilon.

Salem numbers constitute an important family of algebraic numbers. For example, the polynomial PLehmerP_{\text{Lehmer}} with the smallest known Mahler measure (see §1.2) turns out to be the minimal polynomial of a Salem number.

A beautiful connection with discrete subgroups of Lie groups was established by Sury [Sur92]: he proved that Conjecture (D) was equivalent to the following conjecture.

Conjecture (E) (Salem, geometric version).

There exists a neighborhood U𝐒𝐋2()U\subset\operatorname{\mathbf{SL}}_{2}(\mathbb{R}) of the identity such that for any torsion-free cocompact arithmetic lattice Γ𝐒𝐋2()\Gamma\subset\operatorname{\mathbf{SL}}_{2}(\mathbb{R}), we have ΓU={e}\Gamma\cap U=\{e\}.

Remark.

Conjecture (E) is simply Margulis’ conjecture for 𝐒𝐋2\operatorname{\mathbf{SL}}_{2}, hence Theorem 1.3 for s=1s=1 includes Sury’s result as a special case. In fact, Theorem 1.3 for s=1s=1 shows that Margulis’ conjecture (A) for any one of the isotropic absolutely (almost) simple \mathbb{R}-groups (e.g. 𝐒𝐋n\operatorname{\mathbf{SL}}_{n}, 𝐒𝐎q\operatorname{\mathbf{SO}}_{q}, etc.) implies Conjecture (D).

2.3. Lengths of shortest geodesics in arithmetic hyperbolic orbifolds

Extending the picture to Kleinian groups, Neumann and Reid [NR92] formulated the following conjecture.

Conjecture (F) (Short geodesic conjecture).

There is a positive universal lower bound for the lengths of closed geodesics in arithmetic hyperbolic 22- and 33-orbifolds.

The short geodesic conjecture for hyperbolic 2-orbifolds is equivalent to Conjecture (E), hence to (D). On the other hand, the short geodesic conjecture for hyperbolic 3-orbifolds is equivalent to the following complex Salem conjecture. Call an algebraic integer α\alpha\in\mathbb{C} a complex Salem number if α\alpha is not real, α\alpha and its complex conjugate α¯\overline{\alpha} are the only two Galois-conjugates of α\alpha in \mathbb{C} of absolute value >1>1, and α\alpha has at least one conjugate of absolute value =1=1.

Conjecture (G) (Complex Salem).

There exists ε>0\varepsilon>0 such that every complex Salem number α\alpha satisfies |α|>1+ε|\alpha|>1+\varepsilon.

In fact, the short geodesic conjecture for hyperbolic 33-orbifolds implies the conjecture for 22-orbifolds. This can be seen arithmetically, as the complex Salem conjecture implies the traditional Salem conjecture. Indeed, if α\alpha is a Salem number, then Pα(x2)P_{\alpha}(-x^{2}) is the minimal polynomial of a complex Salem number with the same Mahler measure as α\alpha.

For a detailed introduction to Conjecture (F), we refer the reader to the book of Maclachlan and Reid [MR03].

Remark.

Similarly as for Conjecture (D), the proof of Theorem 1.3 (for s=2s=2, r=0r=0) shows that Margulis’ conjecture for the restriction to \mathbb{R} of any simple \mathbb{C}-group implies Conjecture (G).

Salem numbers can also be used to obtain a lower bound for the length of closed geodesics in noncompact arithmetic hyperbolic orbifolds of even dimension nn. This was done by Emery, Ratcliffe and Tschantz [ERT19]. More precisely, for any integer n2n\geq 2, let n\mathcal{H}^{n} denote hyperbolic nn-space, and define

βn=min{logααis a Salem number withdegPαn}.\beta_{n}=\min\{\log\alpha\mid\alpha~{}\text{is a Salem number with}~{}\deg P_{\alpha}\leq n\}.

For any even dimension nn, if ΓIsom(n)\Gamma\subset\operatorname{Isom}(\mathcal{H}^{n}) is a non-uniform arithmetic lattice, then the length of any closed geodesic in n/Γ\mathcal{H}^{n}/\Gamma is at least βn\beta_{n} [ERT19, Corollary 1.3]. It follows that Conjecture (D) is equivalent to the existence of a uniform lower bound on the length of closed geodesics in non-compact arithmetic hyperbolic orbifolds of even dimension.

2.4. Homotopy type of locally symmetric spaces

Let us conclude this section with a consequence of Conjecture (A) in the context of locally symmetric spaces. Let SS be a symmetric space (e.g., S=G/KS=G/K where GG is a semisimple Lie group, KGK\subset G a maximal compact subgroup). Following Gelander [Gel04], an SS-manifold is a complete Riemannian manifold locally isometric to SS, i.e., a manifold of the form Γ\S\Gamma\backslash S, where ΓIsom(S)\Gamma\subset\operatorname{Isom}(S) is a discrete torsion-free subgroup. It is irreducible if Γ\Gamma is an irreducible lattice. For d,vd,v\in\mathbb{N}, a (d,v)(d,v)-simplicial complex is a simplicial complex with at most vv vertices, all of degree at most dd. An interesting consequence of Conjecture (A) is the following conjecture of Gelander.

Conjecture (H) ([Gel04]).

For any symmetric space SS of noncompact type, there are constants α(S)\alpha(S), d(S)d(S), such that any irreducible SS-manifold MM (assumed to be arithmetic if dim(S)=3\dim(S)=3) is homotopically equivalent to a (d(S),α(S)vo(M))(d(S),\alpha(S)\operatorname{vo\ell}(M))-simplicial complex.

Conjecture (H) was recently proved by Fraczyk [Fra21, Theorem 1.16] for arithmetic 3-manifolds.

3. Margulis’ arithmeticity theorem

In this section, we briefly review Margulis’ arithmeticity results [Mar91, Chapter IX] in a form that will be used to show that Conjecture (C) implies Conjecture (A). Let us first recall the definitions.

Let 𝐆\mathbf{G} be a connected semisimple \mathbb{R}-group and let iI𝐆i\prod_{i\in I}\mathbf{G}_{i} be a decomposition of 𝐆\mathbf{G} as an almost direct product of almost \mathbb{R}-simple \mathbb{R}-subgroups. For any subset JIJ\subset I, we write 𝐆J=iJ𝐆i\mathbf{G}_{J}=\prod_{i\in J}\mathbf{G}_{i}. We denote by 𝐆is\mathbf{G}^{\operatorname{is}} (resp. 𝐆anis\mathbf{G}^{\mathrm{anis}}) the subgroup of 𝐆\mathbf{G} which is the almost direct product of the \mathbb{R}-isotropic (resp. \mathbb{R}-anisotropic) factors of 𝐆\mathbf{G}. We also let G=𝐆()G=\mathbf{G}(\mathbb{R}), GJ=𝐆J()G_{J}=\mathbf{G}_{J}(\mathbb{R}) and Gis=𝐆is()G^{\operatorname{is}}=\mathbf{G}^{\operatorname{is}}(\mathbb{R}).

A lattice ΓG\Gamma\subset G is said to be irreducible if for any non-empty proper subset JIJ\subset I, the index [Γ:(ΓGJ)(ΓGIJ)][\Gamma:(\Gamma\cap G_{J})\cdot(\Gamma\cap G_{I\setminus J})] is infinite.

An irreducible lattice ΓG\Gamma\subset G such that GisΓG^{\operatorname{is}}\cdot\Gamma is dense in GG is called arithmetic if there exist a connected non-commutative almost \mathbb{Q}-simple \mathbb{Q}-group 𝐇\mathbf{H} (endowed with some \mathbb{Z}-structure), and an \mathbb{R}-epimorphism τ:𝐇𝐆\tau:\mathbf{H}\to\mathbf{G} such that:

  1. (i)

    the Lie group (kerτ)()(\ker\tau)(\mathbb{R}) is compact;

  2. (ii)

    the subgroups τ(𝐇())\tau(\mathbf{H}(\mathbb{Z})) and Γ\Gamma are commensurable.

3.1. The arithmeticity theorem

We can now state Margulis’ celebrated arithmeticity theorem.

Theorem ([Mar91, IX, (1.16)]).

Let 𝐆\mathbf{G} be a connected semisimple \mathbb{R}-group and Γ\Gamma an irreducible lattice in 𝐆()\mathbf{G}(\mathbb{R}), with 𝐆is()Γ\mathbf{G}^{\operatorname{is}}(\mathbb{R})\cdot\Gamma dense in 𝐆()\mathbf{G}(\mathbb{R}). Suppose that rank𝐆2\operatorname{rank}_{\mathbb{R}}\mathbf{G}\geq 2. Then the lattice Γ\Gamma is arithmetic.

In the remainder of this section, we will assume that rank(𝐆)2\operatorname{rank}_{\mathbb{R}}(\mathbf{G})\geq 2, that GG has no compact factors (i.e. 𝐆is=𝐆\mathbf{G}^{\operatorname{is}}=\mathbf{G}) and has trivial center. Let ΓG\Gamma\subset G be an irreducible lattice. Then the following facts are among the key steps of the proof of the arithmeticity theorem. We refer the reader to [Zim84, §6.1] or [Ben08, §11.5] for proofs and details.

  1. (3.1.1)

    The trace field K=(trAdΓ)K=\mathbb{Q}(\operatorname{tr}\operatorname{Ad}\Gamma) of Γ\Gamma, the field generated by the set {tr(Ad(γ))|γΓ}\{\operatorname{tr}(\operatorname{Ad}(\gamma))\,|\,\gamma\in\Gamma\}, is a number field. This follows from Margulis’ superrigidity theorem, together with the fact that Γ\Gamma is finitely generated.

  2. (3.1.2)

    Since Γ\Gamma is Zariski-dense in 𝐆()\mathbf{G}(\mathbb{R}), 𝐆\mathbf{G} can be defined over KK. That is, there is a KK-group 𝐆\mathbf{G}^{\prime} and a place v0v_{0} of KK such that Kv0=K_{v_{0}}=\mathbb{R}, 𝐆𝐆\mathbf{G}\cong\mathbf{G}^{\prime} as \mathbb{R}-groups, and the image of Γ\Gamma under this isomorphism lies in 𝐆(K)\mathbf{G}^{\prime}(K). In the following, we will identify 𝐆\mathbf{G} and 𝐆\mathbf{G}^{\prime}, and simply assume that Γ𝐆(K)\Gamma\subset\mathbf{G}(K).

  3. (3.1.3)

    There is a semisimple \mathbb{Q}-group 𝐇\mathbf{H} (endowed with a \mathbb{Z}-structure) with trivial center, an \mathbb{R}-epimorphism τ:𝐇𝐆\tau:\mathbf{H}\to\mathbf{G} with (kerτ)()(\ker\tau)(\mathbb{R}) compact, and a homomorphism ι:Γ𝐇()\iota:\Gamma\to\mathbf{H}(\mathbb{Q}) such that τι\tau\circ\iota is the identity, ι(Γ)\iota(\Gamma) is Zariski dense in 𝐇\mathbf{H} and ι(Γ)\iota(\Gamma) is commensurable with 𝐇()\mathbf{H}(\mathbb{Z}). The group 𝐇\mathbf{H} can be constructed as the restriction of scalars ResK/(𝐆)\operatorname{Res}_{K/\mathbb{Q}}(\mathbf{G}) of 𝐆\mathbf{G} from KK to \mathbb{Q}.

3.2. The weak Lehmer conjecture implies Margulis’ conjecture

We start by indicating how Conjecture (C) implies Conjecture (A). Note that as long as the lattice Γ\Gamma is arithmetic, the argument given below works equally well for groups of rank one. It shows in fact that Conjecture (C) implies Margulis’ conjecture for all semisimple \mathbb{R}-groups (in the sense defined below the statement of Conjecture (A)). Let thus 𝐆\mathbf{G} be a semisimple group and Γ\Gamma an irreducible arithmetic lattice in 𝐆()\mathbf{G}(\mathbb{R}).

Without loss of generality, we may assume that 𝐆\mathbf{G} has trivial center and is without anisotropic factors. Indeed, if 𝐆\mathbf{G} has center 𝐂\mathbf{C}, and UU^{\prime} is a neighborhood of (𝐆/𝐂)()(\mathbf{G}/\mathbf{C})(\mathbb{R}) as in (A), then the preimage UU of UU^{\prime} under the canonical map π:𝐆()(𝐆/𝐂)()\pi:\mathbf{G}(\mathbb{R})\to(\mathbf{G}/\mathbf{C})(\mathbb{R}) has the required property: if Γ\Gamma is an irreducible cocompact lattice in 𝐆()\mathbf{G}(\mathbb{R}), then π(Γ)\pi(\Gamma) is an irreducible cocompact lattice in (𝐆/𝐂)()(\mathbf{G}/\mathbf{C})(\mathbb{R}); hence if γU\gamma\in U, π(γ)\pi(\gamma) must have finite order, and since 𝐂()\mathbf{C}(\mathbb{R}) is finite, so does γ\gamma. Similarly, if 𝐆=𝐆is×𝐆anis\mathbf{G}=\mathbf{G}^{\operatorname{is}}\times\mathbf{G}^{\mathrm{anis}} and UU^{\prime} is a neighborhood of 𝐆is\mathbf{G}^{\operatorname{is}} as in (A), then the preimage UU of UU^{\prime} under the canonical map π:𝐆()𝐆is()\pi:\mathbf{G}(\mathbb{R})\to\mathbf{G}^{\operatorname{is}}(\mathbb{R}) has the required property: if Γ\Gamma is an irreducible cocompact lattice in 𝐆()\mathbf{G}(\mathbb{R}), then π(Γ)\pi(\Gamma) is an irreducible cocompact lattice in 𝐆is()\mathbf{G}^{\operatorname{is}}(\mathbb{R}); hence if γU\gamma\in U, π(γ)\pi(\gamma) has finite order mm, and γm𝐆anis()Γ\gamma^{m}\in\mathbf{G}^{\mathrm{anis}}(\mathbb{R})\cap\Gamma. As 𝐆anis()\mathbf{G}^{\mathrm{anis}}(\mathbb{R}) is compact and Γ\Gamma is discrete, the latter is a finite group and γ\gamma has finite order.

Let 𝐇\mathbf{H} be the group obtained in (3.1.3), so that we have the following diagram.

Γ{\Gamma}𝐇(){\mathbf{H}(\mathbb{R})}𝐆(){\mathbf{G}(\mathbb{R})}GL(Lie(𝐇())){\mathrm{GL}(\operatorname{Lie}(\mathbf{H}(\mathbb{R})))}GL(Lie(𝐆())){\mathrm{GL}(\operatorname{Lie}(\mathbf{G}(\mathbb{R})))}ι{\iota}τ{\tau}Ad{\operatorname{Ad}}Ad{\operatorname{Ad}}

Recall also from (3.1.3) that τ\tau has compact kernel and that ι(Γ)\iota(\Gamma) is commensurable with 𝐇()\mathbf{H}(\mathbb{Z}).

Since the adjoint representation 𝐇Ad𝐇\mathbf{H}\to\operatorname{Ad}\mathbf{H} is defined over \mathbb{Q}, we can find a finite-index subgroup Λ\Lambda of 𝐇()\mathbf{H}(\mathbb{Z}) for which Ad(Λ)Ad(𝐇)()\operatorname{Ad}(\Lambda)\subset\operatorname{Ad}(\mathbf{H})(\mathbb{Z}) (see for example [Mar91, I, (3.1.1)]); in particular, Ad(Λ)\operatorname{Ad}(\Lambda) preserves a lattice in Lie(𝐇())\operatorname{Lie}(\mathbf{H}(\mathbb{R})). Since Λ\Lambda and ι(Γ)\iota(\Gamma) are commensurable, Ad(ι(Γ))\operatorname{Ad}(\iota(\Gamma)) also stabilizes a lattice in Lie(𝐇())\operatorname{Lie}(\mathbf{H}(\mathbb{R})) [Mar91, IX, (4.19)]. Hence the characteristic polynomials of the elements of Ad(ι(Γ))\operatorname{Ad}(\iota(\Gamma)) have integer coefficients.

Let us write 𝐇()=F×K\mathbf{H}(\mathbb{R})=F\times K as a direct product where KK is compact and FF is without compact factors. The morphism τ\tau then induces an isogeny F𝐆()F\to\mathbf{G}(\mathbb{R}) and dτ\mathrm{d}\tau restricts to an isomorphism Lie(F)Lie(𝐆())\operatorname{Lie}(F)\to\operatorname{Lie}(\mathbf{G}(\mathbb{R})). Let x𝐇()x\in\mathbf{H}(\mathbb{R}) and write Ad𝐇()(x)=AdF(y)AdK(z)\operatorname{Ad}_{\mathbf{H}(\mathbb{R})}(x)=\operatorname{Ad}_{F}(y)\oplus\operatorname{Ad}_{K}(z) for some yFy\in F, zKz\in K. If we denote PxP_{x}, PyP_{y}, PzP_{z} the respective characteristic polynomials of Ad𝐇()(x)\operatorname{Ad}_{\mathbf{H}(\mathbb{R})}(x), AdF(y)\operatorname{Ad}_{F}(y), AdK(z)\operatorname{Ad}_{K}(z), we have that Px=PyPzP_{x}=P_{y}\cdot P_{z}. Because KK is compact, all roots of PzP_{z} in \mathbb{C} must have absolute value 1; in consequence, (Pz)=1\operatorname{\mathcal{M}}(P_{z})=1, hence (Px)=(Py)\operatorname{\mathcal{M}}(P_{x})=\operatorname{\mathcal{M}}(P_{y}). Moreover, PxP_{x} satisfies s(Px)=s(Py)dimF=dim𝐆s(P_{x})=s(P_{y})\leq\dim F=\dim\mathbf{G}.

Now pick γΓ\gamma\in\Gamma and apply the last paragraph to x=ι(γ)x=\iota(\gamma). Since τ(ι(γ))=γ\tau(\iota(\gamma))=\gamma and dτAd(ι(γ))=Ad(γ)dτ\mathrm{d}\tau\circ\operatorname{Ad}(\iota(\gamma))=\operatorname{Ad}(\gamma)\circ\mathrm{d}\tau, we obtain that the characteristic polynomial PγP_{\gamma} of Ad𝐆()(γ)\operatorname{Ad}_{\mathbf{G}(\mathbb{R})}(\gamma) equals PyP_{y}. We have in turn

(Pγ)=(Pι(γ))ands(Pγ)=s(Pι(γ)).\operatorname{\mathcal{M}}(P_{\gamma})=\operatorname{\mathcal{M}}(P_{\iota(\gamma)})\quad\text{and}\quad s(P_{\gamma})=s(P_{\iota(\gamma)}).

Let f:𝐆()[1,[f:\mathbf{G}(\mathbb{R})\to[1,\infty[ be defined by f(g)=(Pg)f(g)=\operatorname{\mathcal{M}}(P_{g}), where as before PgP_{g} denotes the characteristic polynomial of Ad𝐆()(g)\operatorname{Ad}_{\mathbf{G}(\mathbb{R})}(g); note that ff is a continuous function. If the weak version of Lehmer’s conjecture holds at level s=dim𝐆s=\dim\mathbf{G}, we can find ε>0\varepsilon>0 such that any polynomial PP with integer coefficients and s(P)dim𝐆s(P)\leq\dim\mathbf{G} satisfies either (P)=1\operatorname{\mathcal{M}}(P)=1 or (P)>1+ε\operatorname{\mathcal{M}}(P)>1+\varepsilon. This applies to Pι(γ)P_{\iota(\gamma)} for any γΓ\gamma\in\Gamma: as we observed, Pι(γ)P_{\iota(\gamma)} has integer coefficients and s(Pι(γ))dim𝐆s(P_{\iota(\gamma)})\leq\dim\mathbf{G}. Thus, by the above, the open neighborhood U=f1([1,ε[)U=f^{-1}([1,\varepsilon[) of 11 in 𝐆()\mathbf{G}(\mathbb{R}) is such that UΓU\cap\Gamma consists of elements γ\gamma for which (Pγ)=(Pι(γ))=1\operatorname{\mathcal{M}}(P_{\gamma})=\operatorname{\mathcal{M}}(P_{\iota(\gamma)})=1. This means that for γU\gamma\in U, PγP_{\gamma} is a product of cyclotomic polynomials. Since Γ\Gamma is cocompact, Ad(γ)\operatorname{Ad}(\gamma) is semisimple and thus Ad(γ)\operatorname{Ad}(\gamma), hence also γ\gamma, have finite order.

4. Proof of the main theorem for SLn\mathrm{SL}_{n}

We now proceed to prove Theorem 1.3 (in particular, that Conjecture (A) implies Conjecture (C)). To this end, we will assume Conjecture (C) fails and construct a sequence of cocompact lattices in suitable groups of the family 𝒯𝐅(s)\mathscr{T}_{\mathbf{F}}^{(s)} violating Margulis’ conjecture for this family. As it is more transparent, we first give the full argument for 𝐅=𝐒𝐋n\mathbf{F}=\operatorname{\mathbf{SL}}_{n}, i.e. for the family

𝒯𝐒𝐋n(s)={i=1r𝐒𝐋n×i=1tRes/(𝐒𝐋n)|r,t,r+2ts}.\mathscr{T}_{\operatorname{\mathbf{SL}}_{n}}^{(s)}=\left\{\prod_{i=1}^{r}\operatorname{\mathbf{SL}}_{n}\times\prod_{i=1}^{t}\operatorname{Res}_{\mathbb{C}/\mathbb{R}}(\operatorname{\mathbf{SL}}_{n})\ \middle|\ r,t\in\mathbb{N},\;r+2t\leq s\right\}.

This is already sufficient to establish the equivalence between Conjectures (A) and (C). In the last section (§5), we then indicate the corresponding changes for 𝐅\mathbf{F} an arbitrary absolutely (almost) simple isotropic \mathbb{R}-group, completing the proof of Theorem 1.3.

4.1. Reduction to palindromic polynomials with control on the archimedean places

Given P[x]P\in\mathbb{Z}[x], let us denote as before α1,αs(P)\alpha_{1},\dots\alpha_{s(P)} the roots of PP in \mathbb{C} of absolute value >1>1, and label them in such a way α1,,αr(P)\alpha_{1},\dots,\alpha_{r(P)} lie in \mathbb{R} and αr(P)+1,,αs(P)\alpha_{r(P)+1},\dots,\alpha_{s(P)} do not. In addition, let us order the latter roots so that α¯r(P)+i=α(r(P)+s(P))/2+i\overline{\alpha}_{r(P)+i}=\alpha_{(r(P)+s(P))/2+i} for 1i(s(P)r(P))/21\leq i\leq(s(P)-r(P))/2. For each ss^{\prime}\in\mathbb{N}, we will consider the set of polynomials

𝒫s={P[X]|Pis monic, irreducible, palindromic, and s(P)s},\mathscr{P}_{\leq s^{\prime}}=\big{\{}P\in\mathbb{Z}[X]\,|\,P\ \text{is monic, irreducible, palindromic, and }s(P)\leq s^{\prime}\big{\}},

and for rssr\leq s\leq s^{\prime}, its subsets

𝒫s,r={P𝒫ss(P)=s and r(P)=r}.\mathscr{P}_{s,r}=\big{\{}P\in\mathscr{P}_{\leq s^{\prime}}\mid s(P)=s\textrm{ and }r(P)=r\big{\}}.

By construction, 𝒫s\mathscr{P}_{\leq s^{\prime}} is the disjoint union s=0sr=0s𝒫s,r\bigcup_{s=0}^{s^{\prime}}\bigcup_{r=0}^{s}\mathscr{P}_{s,r}.

Suppose that Conjecture (C) does not hold. That is, there are ss^{\prime}\in\mathbb{N} and a sequence (Pm)m(P_{m})_{m\in\mathbb{N}} of monic, irreducible polynomials with integer coefficients such that s(Pm)ss(P_{m})\leq s^{\prime} and (Pm)1\operatorname{\mathcal{M}}(P_{m})\to 1 while (Pm)>1\operatorname{\mathcal{M}}(P_{m})>1. By virtue of Smyth’s theorem (§2.1), we may assume that each PmP_{m} is palindromic, i.e. that Pm𝒫sP_{m}\in\mathscr{P}_{\leq s^{\prime}}. Moreover, up to extracting an appropriate subsequence, we may assume that Pm𝒫s,rP_{m}\in\mathscr{P}_{s,r} for some fixed integers rssr\leq s\leq s^{\prime}.(4)(4)(4)If ss^{\prime} were to be the smallest integer for which Lehmer’s conjecture at level ss^{\prime} fails, then obviously s=ss=s^{\prime}. Of course, s>0s>0. In view of the bound (2.1.1), it must be that the sequence (degPm)m(\deg P_{m})_{m} is unbounded. We may thus additionally assume that degPm>2s\deg P_{m}>2s. This implies the following important feature: PmP_{m} must have a root of absolute value 11. Indeed, PmP_{m} has ss roots of absolute value >1>1, and because PmP_{m} is palindromic, as many roots of absolute value <1<1.

The discussion above shows that the negation of Lehmer’s conjecture at level ss^{\prime} (cf. the paragraph following Conjecture (C)) amounts to the following statement.

Negation of Lehmer’s conjecture at level ss^{\prime}.

There are fixed integers r,sr\in\mathbb{N},\,s\in\mathbb{N}^{*} with rssr\leq s\leq s^{\prime} such that:

  1. (Ls,r)

    for any ε>0\varepsilon>0, there is a polynomial P𝒫s,rP\in\mathscr{P}_{s,r} with at least one root in \mathbb{C} of absolute value 11, for which 1<(P)<1+ε1<\operatorname{\mathcal{M}}(P)<1+\varepsilon.

Using statement (Ls,r) as the main ingredient, for each such pair (s,r)(s,r), we will construct in the semisimple \mathbb{R}-group

(\star) 𝐆s,r=i=1r𝐒𝐋n×i=1(sr)/2Res/(𝐒𝐋n)(n2),\mathbf{G}_{s,r}=\prod_{i=1}^{r}\operatorname{\mathbf{SL}}_{n}\times\prod_{i=1}^{(s-r)/2}\operatorname{Res}_{\mathbb{C}/\mathbb{R}}(\operatorname{\mathbf{SL}}_{n})\qquad(n\geq 2),

a sequence of cocompact lattices Γm<𝐆s,r()\Gamma_{m}<\mathbf{G}_{s,r}(\mathbb{R}) and a sequence of elements γmΓm\gamma_{m}\in\Gamma_{m} of infinite order, such that γme𝐆s,r()\gamma_{m}\to e\in\mathbf{G}_{s,r}(\mathbb{R}) as mm\to\infty. This shall provide a counterexample to Margulis’ conjecture for 𝐆s,r\mathbf{G}_{s,r}, thus proving the first part of theorem 1.3. The equivalence of Margulis’ conjecture (A) and the weak version (C) of Lehmer’s conjecture then follows immediately by combining this with §3.2.

4.2. The number fields KK and LL

Let rsr\leq s be as above. Pick a polynomial P𝒫s,rP\in\mathscr{P}_{s,r}, set 2d=degP2d=\deg P, and let L=(α)L=\mathbb{Q}(\alpha) denote the number field generated over \mathbb{Q} by a root α\alpha of PP. Since PP is palindromic, α1\alpha^{-1} is also a root of PP. In particular, α1\alpha^{-1} is integral over \mathbb{Z}, and the assignment τ:αα1\tau:\alpha\mapsto\alpha^{-1} defines a non-trivial automorphism of LL (which restricts to an automorphism of its ring of integers 𝒪L\mathcal{O}_{L}). Let K=(α+α1)K=\mathbb{Q}(\alpha+{\alpha}^{-1}) denote the subfield of LL generated over \mathbb{Q} by α+α1\alpha+{\alpha}^{-1}. Since KK is the fixed field of τ\tau, LL is a quadratic extension of KK whose Galois group is {id,τ}\{\operatorname{id},\tau\}. Note that the minimal polynomial of α\alpha over KK is X2(α+α1)X+1X^{2}-(\alpha+{\alpha}^{-1})X+1.

For 1is1\leq i\leq s, let σi\sigma_{i} denote the embedding LL\to\mathbb{C} defined by σi(α)=αi\sigma_{i}(\alpha)=\alpha_{i}. By definition, LL has rr real embeddings σ1,,σr\sigma_{1},\,\dots,\,\sigma_{r} and srs-r complex embeddings σr+1,,σs\sigma_{r+1},\,\dots,\,\sigma_{s} for which the image of α\alpha has absolute value >1>1. LL also has ds{d}-s pairs of conjugate complex embeddings σs+1,σ¯s+1,,σd,σ¯d\sigma_{s+1},\,\overline{\sigma}_{s+1},\,\dots,\,\sigma_{d},\,\overline{\sigma}_{d} for which α\alpha maps to an element of absolute value 11. Observe that precomposition with τ\tau sends {σ1,,σr}\{\sigma_{1},\dots,\sigma_{r}\} and {σr+1,,σs}\{\sigma_{r+1},\dots,\sigma_{s}\} respectively to the set of real and complex embeddings of LL for which the image of α\alpha has absolute value <1<1. Similarly, σiτ=σ¯i\sigma_{i}\circ\tau=\overline{\sigma}_{i} for s+1ids+1\leq i\leq d. By construction, the embeddings σi\sigma_{i} and σiτ\sigma_{i}\circ\tau for 1id1\leq i\leq d agree on KK. This shows that {σ1,,σd}\{\sigma_{1},\dots,\sigma_{d}\} is the complete set of embeddings of KK into \mathbb{C} (we omit the restriction to KK from the notation). The image σi(α+α1)\sigma_{i}(\alpha+{\alpha}^{-1}) of the generator of KK belongs to \mathbb{R} if and only if σi(α)\sigma_{i}(\alpha)\in\mathbb{R} or |σi(α)|=1|\sigma_{i}(\alpha)|=1. In consequence, restricted to KK, the embeddings σ1,,σr\sigma_{1},\dots,\sigma_{r} and σs+1,,σd\sigma_{s+1},\dots,\sigma_{d} are real, σr+1,,σs\sigma_{r+1},\dots,\sigma_{s} are complex (coming in conjugate pairs), and KK has signature (rs+d,(sr)/2)(r-s+d,(s-r)/2).

4.3. The KK-group 𝐆\mathbf{G}

Let h:Ln×LnLh:L^{n}\times L^{n}\to L be the τ\tau-hermitian form given by

h(x,y)=x1τ(y1)++xnτ(yn)x,yLn,h(x,y)=x_{1}\tau(y_{1})+\dots+x_{n}\tau(y_{n})\qquad x,\,y\in L^{n},

and let 𝐆=𝐒𝐔h\mathbf{G}=\operatorname{\mathbf{SU}}_{h} be the special unitary group associated to the form hh. 𝐆\mathbf{G} is a linear algebraic KK-group, whose group of KK-points is (isomorphic to) the group of hh-unitary matrices in Mn(L)\mathrm{M}_{n}(L) of determinant 11.(5)(5)(5)In fact, since the equations defining 𝐆\mathbf{G} can be taken with coefficients in 𝒪K\mathcal{O}_{K}, 𝐆\mathbf{G} can also be viewed as an 𝒪K\mathcal{O}_{K}-group scheme. The KK\otimes_{\mathbb{Q}}\mathbb{R}-points of 𝐆\mathbf{G} can be computed easily by studying the behavior of the extension L/KL/K and of the form hh under the different embeddings {σ1,,σd}\{\sigma_{1},\dots,\sigma_{d}\} of KK.

In the present setting, the extension L/KL/K splits at the places σ1,,σr\sigma_{1},\dots,\sigma_{r} (since these extend to real places of LL) and also at the places σr+1,,σs\sigma_{r+1},\dots,\sigma_{s} (since these are complex places of KK), but not at σs+1,,σd\sigma_{s+1},\dots,\sigma_{d} (since there we have LKKσiL\otimes_{K}K_{\sigma_{i}}\cong\mathbb{C}). At these last places, the hermitian form hh becomes the standard hermitian form on n×n\mathbb{C}^{n}\times\mathbb{C}^{n} after identifying the completion of L/KL/K with /\mathbb{C}/\mathbb{R} via σi\sigma_{i} (s+1ids+1\leq i\leq d). Altogether, we have

𝐆(K)i=1r𝐒𝐋n()×i=1(sr)/2𝐒𝐋n()×i=1ds𝐒𝐔n(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\operatorname{\mathbf{SL}}_{n}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\operatorname{\mathbf{SL}}_{n}(\mathbb{C})\times\prod_{i=1}^{d-s}\operatorname{\mathbf{SU}}_{n}(\mathbb{R}),

where 𝐒𝐔n()\operatorname{\mathbf{SU}}_{n}(\mathbb{R}) denotes the standard anisotropic special unitary group over \mathbb{R}.

4.4. The lattice Γ\Gamma and the element γ\gamma

Let Γ\Gamma be the group of hh-unitary n×nn\times n matrices of determinant 11 with entries in 𝒪L\mathcal{O}_{L}.(6)(6)(6)With this definition, Γ\Gamma is (commensurable to) the 𝒪K\mathcal{O}_{K}-points of 𝐆\mathbf{G} when it is viewed as an 𝒪K\mathcal{O}_{K}-group scheme as above. A classical theorem of Borel and Harish–Chandra [BHC62] states that Γ\Gamma is a lattice in 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) when embedded diagonally using the inequivalent archimedean places of KK among σ1,,σd\sigma_{1},\dots,\sigma_{d}. (Apply [BHC62, Theorem 7.8] to ResK/(𝐆)\operatorname{Res}_{K/\mathbb{Q}}(\mathbf{G}) and observe that ResK/(𝐆)()𝐆(K)\operatorname{Res}_{K/\mathbb{Q}}(\mathbf{G})(\mathbb{R})\cong\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}); under this isomorphism, ResK/(𝐆)()\operatorname{Res}_{K/\mathbb{Q}}(\mathbf{G})(\mathbb{Z}) is commensurable to Γ\Gamma.) By virtue of the strong approximation theorem (for number fields), Γ\Gamma is an irreducible lattice. Moreover, if PP has at least one root in \mathbb{C} of absolute value 11 (i.e. if s<ds<d), we claim that Γ\Gamma is a cocompact lattice in 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}). Indeed, for any embedding σ:K\sigma:K\to\mathbb{C} which extends to LL in such a way |σ(α)|=1|\sigma(\alpha)|=1, the group 𝐆(Kσ)𝐒𝐔n()\mathbf{G}(K_{\sigma})\cong\operatorname{\mathbf{SU}}_{n}(\mathbb{R}) is compact, and hence 𝐆\mathbf{G} must be KK-anisotropic (as it is anisotropic over KσK_{\sigma}). The claim then follows from Godement’s criterion [BHC62, Theorem 11.8].

Let 𝐆s,r\mathbf{G}_{s,r} be the \mathbb{R}-group defined in  4.1(\star4.1). The canonical surjection π:𝐆(K)𝐆s,r()\pi:\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\to\mathbf{G}_{s,r}(\mathbb{R}) has compact kernel i=1ds𝐒𝐔n()\prod_{i=1}^{d-s}\operatorname{\mathbf{SU}}_{n}(\mathbb{R}); this implies that the image π(Γ)\pi(\Gamma) of Γ\Gamma under π\pi is an irreducible, cocompact (provided s<ds<d), lattice in 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}).

We set γ\gamma to be the diagonal hh-unitary n×nn\times n-matrix diag(α,α1,1,,1)\operatorname{diag}(\alpha,\alpha^{-1},1,\dots,1) in Γ=𝐆(𝒪K)\Gamma=\mathbf{G}(\mathcal{O}_{K}). Viewed as an element of 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}), the non-trivial block of π(γ)\pi(\gamma) is

((α1α11),,(αrαr1),(αr+1αr+11),,(α(s+r)/2α(s+r)/21)),\left(\left(\begin{sma\ell\ell matrix}\alpha_{1}&\\ &\alpha_{1}^{-1}\end{sma\ell\ell matrix}\right),\dots,\left(\begin{sma\ell\ell matrix}\alpha_{r}&\\ &\alpha_{r}^{-1}\end{sma\ell\ell matrix}\right),\left(\begin{sma\ell\ell matrix}\alpha_{r+1}&\\ &\alpha_{r+1}^{-1}\end{sma\ell\ell matrix}\right),\dots,\left(\begin{sma\ell\ell matrix}\alpha_{(s+r)/2}&\\ &\alpha_{(s+r)/2}^{-1}\end{sma\ell\ell matrix}\right)\right),

where we labelled the roots αi\alpha_{i} of PP in \mathbb{C} as in §4.1.

4.5. The sequence (γm)m(\gamma^{\prime}_{m})_{m} and Margulis’ conjecture

Assume (Ls,r) holds, set t=(sr)/2t=(s-r)/2 and pick a sequence (Pm)m(P_{m})_{m\in\mathbb{N}^{*}} of polynomials in 𝒫s,r\mathscr{P}_{s,r} with at least one root of absolute value 1, satisfying

1<(Pm)<exp(ηm,t)withηm,t=12mt+1.1<\operatorname{\mathcal{M}}(P_{m})<\exp(\eta_{m,t})\quad\text{with}\quad\eta_{m,t}=\frac{1}{2m^{t+1}}.

For each mm\in\mathbb{N}^{*}, let us denote

αm,Lm,Km,hm,𝐆m,Γm,γm,andπm,\alpha_{m},\quad L_{m},\quad K_{m},\quad h_{m},\quad\mathbf{G}_{m},\quad\Gamma_{m},\quad\gamma_{m},\quad\text{and}\quad\pi_{m},

all the objects stemming from the construction in §§4.24.4 applied to the polynomial PmP_{m}. As above, the non-trivial block of πm(γm)𝐆s,r()\pi_{m}(\gamma_{m})\in\mathbf{G}_{s,r}(\mathbb{R}) is

((αm,1αm,11),,(αm,rαm,r1),(αm,r+1αm,r+11),,(αm,r+tαm,r+t1)),\left(\left(\begin{sma\ell\ell matrix}\alpha_{m,1}&\\ &\alpha_{m,1}^{-1}\end{sma\ell\ell matrix}\right),\dots,\left(\begin{sma\ell\ell matrix}\alpha_{m,r}&\\ &\alpha_{m,r}^{-1}\end{sma\ell\ell matrix}\right),\left(\begin{sma\ell\ell matrix}\alpha_{m,r+1}&\\ &\alpha_{m,r+1}^{-1}\end{sma\ell\ell matrix}\right),\dots,\left(\begin{sma\ell\ell matrix}\alpha_{m,r+t}&\\ &\alpha_{m,r+t}^{-1}\end{sma\ell\ell matrix}\right)\right),

where αm,i\alpha_{m,i} are roots of PmP_{m} in \mathbb{C} labelled according to §4.1.

Let UmU_{m} denote the inversion-invariant neighborhood of 11 in \mathbb{C} given by

Um={z|1mlog|z|1m and 2πmarg(z)2πm}.U_{m}=\left\{z\in\mathbb{C}\,\middle|\,-\frac{1}{m}\leq\log|z|\leq\frac{1}{m}\textrm{ and }-\frac{2\pi}{m}\leq\arg(z)\leq\frac{2\pi}{m}\right\}.

By construction, we have 1<|αm,i|(Pm)<exp(ηm,t)1<|\alpha_{m,i}|\leq\operatorname{\mathcal{M}}(P_{m})<\exp(\eta_{m,t}) for 1ir+t1\leq i\leq r+t. Unfortunately, it may very well be that αm,iUm\alpha_{m,i}\notin U_{m}. To remedy this, we will use Dirichlet’s simultaneous approximation theorem (see e.g. [Sch80, II, §1, Theorem 1A]). Let us recall it under the form we will use.

Lemma (Dirichlet).

Let t,mt,m\in\mathbb{N}^{*}. For any x1,,xt/x_{1},\dots,x_{t}\in\mathbb{R}/\mathbb{Z}, there exists an integer 0<cmt0<c\leq m^{t} for which cxicx_{i} lies in [1/m,1/m]+[-1/m,1/m]+\mathbb{Z} for all 1it1\leq i\leq t.

Applying the approximation theorem to the arguments of αm,r+12,,αm,r+t2\alpha_{m,r+1}^{2},\dots,\alpha_{m,r+t}^{2} (scaled by 1/2π1/2\pi) yields an integer cmc_{m} with 0<cmmt0<c_{m}\leq m^{t}, such that

αm,i2cmUmfor1ir+t.\alpha_{m,i}^{2c_{m}}\in U_{m}\quad\textrm{for}\quad 1\leq i\leq r+t.

Indeed, for r+1ir+tr+1\leq i\leq r+t, by construction of cmc_{m}, we have 2π/marg(αm,i2cm)2π/m-2\pi/m\leq\arg(\alpha_{m,i}^{2c_{m}})\leq 2\pi/m; when 1ir1\leq i\leq r, it is clear that arg(αm,i2)=0\arg(\alpha_{m,i}^{2})=0. In both cases,

1<|αm,i2cm||αm,i|2mt<exp(2mtηm,t)=exp(1/m).1<|\alpha_{m,i}^{2c_{m}}|\leq|\alpha_{m,i}|^{2m^{t}}<\exp(2m^{t}\cdot\eta_{m,t})=\exp(1/m).

Finally, let γm\gamma^{\prime}_{m} denote the element πm(γm2cm)πm(Γm)𝐆s,r()\pi_{m}(\gamma_{m}^{2c_{m}})\in\pi_{m}(\Gamma_{m})\leq\mathbf{G}_{s,r}(\mathbb{R}). Because {Umm}\{U_{m}\mid m\in\mathbb{N}^{*}\} forms a basis of neighborhoods of 11 in \mathbb{C}, the discussion above (together with the continuity of the regular representation Lσi×𝐆𝐋2(Kσi)L_{\sigma_{i}}^{\times}\to\operatorname{\mathbf{GL}}_{2}(K_{\sigma_{i}})) shows that γm1𝐆s,r()\gamma^{\prime}_{m}\to 1\in\mathbf{G}_{s,r}(\mathbb{R}) as mm\to\infty. In addition, πm(Γm)\pi_{m}(\Gamma_{m}) is an irreducible, cocompact lattice in 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) by §4.4, and, as |αm,1|>1|\alpha_{m,1}|>1, γm\gamma^{\prime}_{m} has infinite order. This contradicts Conjecture (A) for the group 𝐆s,r\mathbf{G}_{s,r} and concludes the proof of theorem 1.3 for 𝐅=𝐒𝐋n\mathbf{F}=\operatorname{\mathbf{SL}}_{n}.

5. The argument for other simple groups

In this section, we indicate the modifications to run the argument for an isotropic \mathbb{R}-group 𝐅\mathbf{F} which is either classical, or adjoint of exceptional type 𝖦2\mathsf{G}_{2}, 𝖥4\mathsf{F}_{4}, 𝖤6\mathsf{E}_{6}, 𝖤7\mathsf{E}_{7} or 𝖤8\mathsf{E}_{8}. As any absolutely (almost) simple isotropic \mathbb{R}-group is isogenous to one of these (see e.g. [Tit66b]), this is sufficient to conclude the proof of Theorem 1.3.

For brevity, we omit the parts of the argument which are analogous (if not identical) to their counterparts in §§4.14.5. The leitmotiv is the construction of a KK-form 𝐆\mathbf{G} of 𝐅\mathbf{F} with the appropriate archimedean shape, in which the norm torus kerNL/KResL/K(𝐆𝐋1)\ker\mathrm{N}_{L/K}\leq\operatorname{Res}_{L/K}(\operatorname{\mathbf{GL}}_{1}) embeds over 𝒪K\mathcal{O}_{K}.

We mostly keep the notation of §§4.14.5. Given a root α\alpha of some P𝒫s,rP\in\mathscr{P}_{s,r}, the fields LL and KK are constructed identically as in §4.2. Let LL^{\prime} denote the quadratic étale KK-algebra K[X]/(X2+1)K[X]/(X^{2}+1), and let τ\tau^{\prime} denote the non-trivial automorphism of LL^{\prime} fixing KK. Whenever 1-1 is not a square in KK (e.g. when KK is a real field, which is the case if we pick PP according to (Ls,r)), LL^{\prime} is a totally complex number field; otherwise, LL^{\prime} is the product of two copies of KK. We will denote by σi\sigma^{\prime}_{i} one of the two τ\tau^{\prime}-conjugate extensions to LL^{\prime} of the embedding σi:K\sigma_{i}:K\to\mathbb{C} (1id1\leq i\leq d).

In addition, we will make use of three quaternion algebras, defined over KK by the following symbols:

D+=(αα1)2,(αα1)2K,D=(αα1)2,(αα1)2K,D=1,1K,D^{+}=\langle(\alpha-{\alpha}^{-1})^{2},(\alpha-{\alpha}^{-1})^{2}\rangle_{K},\quad D^{-}=\langle-(\alpha-{\alpha}^{-1})^{2},-(\alpha-{\alpha}^{-1})^{2}\rangle_{K},\quad D^{\prime}\>=\langle-1,-1\rangle_{K},

and whose conjugation involutions we denote by τ+\tau^{+}, τ\tau^{-}, and τ\tau^{\prime}, respectively. (This last abuse of notation is excused by the fact the restriction of the conjugation involution of DD^{\prime} to any image of LL^{\prime} corresponds to τ\tau^{\prime}.) Note that (αα1)2(\alpha-{\alpha}^{-1})^{2} does indeed belong to KK, and that σi((αα1)2)\sigma_{i}((\alpha-{\alpha}^{-1})^{2}) is positive for 1ir1\leq i\leq r, and negative for s+1ids+1\leq i\leq d. Thus, the quaternion algebra D+D^{+} (resp. DD^{-}, DD^{\prime}) splits over KσiK_{\sigma_{i}} if and only if 1is1\leq i\leq s (resp. r+1idr+1\leq i\leq d, r+1isr+1\leq i\leq s). Moreover, as L=K(αα1)L=K(\alpha-{\alpha}^{-1}), LL embeds in D+D^{+}, or equivalently D+D^{+} splits over LL. Similarly, LL^{\prime} embeds in DD^{\prime}.

Once constructed, we systematically endow the KK-group 𝐆\mathbf{G} with the 𝒪K\mathcal{O}_{K}-structure obtained by writing the canonical equations defining 𝐆\mathbf{G} over 𝒪K\mathcal{O}_{K} using the basis {1,α}\{1,\alpha\} of LL over KK (and its extension to an 𝒪K\mathcal{O}_{K}-structure on D+D^{+}, or the canonical bases of LL^{\prime}, D+D^{+}, DD^{-}, DD^{\prime}, depending on the need). Unless indicated otherwise, we then set Γ=𝐆(𝒪K)\Gamma=\mathbf{G}(\mathcal{O}_{K}) to be the group of solutions in 𝒪K\mathcal{O}_{K} of these equations. The reader will easily verify that this integral structure has the claimed properties.

For any field extension MM of KK and any KK-algebra AA, we set AM=AKMA_{M}=A\otimes_{K}M. We also set

𝐆s,r=i=1r𝐅×i=1(sr)/2Res/(𝐅)𝒯𝐅(s),\mathbf{G}_{s,r}=\prod_{i=1}^{r}\mathbf{F}\times\prod_{i=1}^{(s-r)/2}\operatorname{Res}_{\mathbb{C}/\mathbb{R}}(\mathbf{F})\in\mathscr{T}_{\mathbf{F}}^{(s)},

and as in §4.4, π\pi will be the projection 𝐆𝐆s,r\mathbf{G}\to\mathbf{G}_{s,r} with compact kernel to be constructed below.

We now give the modifications by the \mathbb{R}-type of 𝐅\mathbf{F}, following the descriptions provided by Tits [Tit66b, Table II] (except perhaps for groups of type 𝖤\mathsf{E}). The reader may also find [KMRT98] a useful reference.

5.1. Inner form of type 𝖠\mathsf{A}.

𝐅\mathbf{F} is of the form 𝐒𝐋n,D0\operatorname{\mathbf{SL}}_{n,D_{0}} for some division algebra D0D_{0} over \mathbb{R} and nn\in\mathbb{N}^{*}. As the split case has already been covered, we may as well assume that D0D_{0}\neq\mathbb{R}, hence D0D_{0} is simply Hamilton’s quaternion algebra. Note that in fact n2n\geq 2 since 𝐅\mathbf{F} is isotropic.

Consider the involution ττ\tau^{-}\otimes\tau on the quaternion LL-algebra DL=DKLD^{-}_{L}=D^{-}\otimes_{K}L. As it coincides with τ\tau when restricted to the center LL of DLD^{-}_{L}, this involution is of the second kind. Let h:(DL)n×(DL)nDLh:(D^{-}_{L})^{n}\times(D^{-}_{L})^{n}\to D^{-}_{L} be the ττ\tau^{-}\otimes\tau-hermitian form given by

h(x,y)=x1(ττ)(y1)++xn(ττ)(yn)x,y(DL)n,h(x,y)=x_{1}(\tau^{-}\otimes\tau)(y_{1})+\dots+x_{n}(\tau^{-}\otimes\tau)(y_{n})\qquad x,\,y\in(D^{-}_{L})^{n},

and let 𝐆\mathbf{G} be the special unitary group 𝐒𝐔h,DL\operatorname{\mathbf{SU}}_{h,D^{-}_{L}} associated to hh.

The local structure of GG is as follows. Of course, 𝐆\mathbf{G} splits at the complex places σr+1,,σs\sigma_{r+1},\dots,\sigma_{s} of KK. For s+1ids+1\leq i\leq d, the quaternion algebra DLD^{-}_{L} splits over LσiL_{\sigma_{i}}\cong\mathbb{C}. Nonetheless, 𝐆\mathbf{G} remains an outer form (of 𝐒𝐋2n\operatorname{\mathbf{SL}}_{2n}) over KσiK_{\sigma_{i}}, since ττ\tau^{-}\otimes\tau induces the non-trivial automorphism of the field extension Lσi/Kσi/L_{\sigma_{i}}/K_{\sigma_{i}}\cong\mathbb{C}/\mathbb{R}. In view of the coefficients of hh, we deduce that 𝐆\mathbf{G} becomes isomorphic to 𝐒𝐔2n\operatorname{\mathbf{SU}}_{2n} (the usual anisotropic special unitary \mathbb{R}-group) over KσiK_{\sigma_{i}}\cong\mathbb{R} for s+1ids+1\leq i\leq d. Lastly, for 1ir1\leq i\leq r, we have that LL is contained in KσiK_{\sigma_{i}}. This means that

DLKKσiDK(LKKσi)DK(KσiKσi)DKσiDKσi,D^{-}_{L}\otimes_{K}K_{\sigma_{i}}\cong D^{-}\otimes_{K}(L\otimes_{K}K_{\sigma_{i}})\cong D^{-}\otimes_{K}(K_{\sigma_{i}}\oplus K_{\sigma_{i}})\cong D^{-}_{K_{\sigma_{i}}}\oplus D^{-}_{K_{\sigma_{i}}},

with ττ\tau^{-}\otimes\tau inducing a flip of the two summands. Thus 𝐆𝐒𝐋n,DKσi\mathbf{G}\cong\operatorname{\mathbf{SL}}_{n,D^{-}_{K_{\sigma_{i}}}} over KσiK_{\sigma_{i}}, that is 𝐆\mathbf{G} becomes an inner form (of 𝐒𝐋2n\operatorname{\mathbf{SL}}_{2n}) over KσiK_{\sigma_{i}}. We had already observed that DKσiD^{-}_{K_{\sigma_{i}}} does not split for 1ir1\leq i\leq r, hence DKσiD0D^{-}_{K_{\sigma_{i}}}\cong D_{0} after identifying KσiK_{\sigma_{i}} with \mathbb{R}; in turn, 𝐆𝐒𝐋n,D0\mathbf{G}\cong\operatorname{\mathbf{SL}}_{n,D_{0}} over KσiK_{\sigma_{i}}. Altogether, we obtain

𝐆(K)i=1r𝐒𝐋n,D0()×i=1(sr)/2𝐒𝐋2n()×i=1ds𝐒𝐔2n(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\operatorname{\mathbf{SL}}_{n,D_{0}}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\operatorname{\mathbf{SL}}_{2n}(\mathbb{C})\times\prod_{i=1}^{d-s}\operatorname{\mathbf{SU}}_{2n}(\mathbb{R}),

so that 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

The element γΓ\gamma\in\Gamma is taken to be diag(α,α1,1,,1)\operatorname{diag}(\alpha,\alpha^{-1},1,\dots,1), seen as an element of 𝐆(𝒪K)𝐒𝐋n(DL)\mathbf{G}(\mathcal{O}_{K})\subset\operatorname{\mathbf{SL}}_{n}(D^{-}_{L}). One has indeed ατ(α)=NL/K(α)=1\alpha\cdot\tau(\alpha)=\mathrm{N}_{L/K}(\alpha)=1, so that γ\gamma preserves hh.

5.2. Outer form of type 𝖠\mathsf{A}.

𝐅\mathbf{F} is of the form 𝐒𝐔h0\operatorname{\mathbf{SU}}_{h_{0}} for some hermitian form h0:n×nh_{0}:\mathbb{C}^{n}\times\mathbb{C}^{n}\to\mathbb{C} of indefinite signature, say (p,np)(p,n-p) with pnpp\geq n-p.

The symmetric KK-bilinear form bb on L×LL\times L associated to the quadratic form NL/K\mathrm{N}_{L/K} on LL allows us to construct a τ\tau^{\prime}-hermitian form hbh_{b} on LKLL\otimes_{K}L^{\prime} by setting

hb(11,22)=b(1,2)1τ(2)1,2L,1,2L.h_{b}(\ell_{1}\otimes\ell^{\prime}_{1},\ell_{2}\otimes\ell^{\prime}_{2})=b(\ell_{1},\ell_{2})\ell^{\prime}_{1}\cdot\tau^{\prime}(\ell^{\prime}_{2})\qquad\ell_{1},\,\ell_{2}\in L,\;\ell^{\prime}_{1},\,\ell^{\prime}_{2}\in L^{\prime}.

Let VV denote the vector LL^{\prime}-space (LKL)npL(2pn)(L\otimes_{K}L^{\prime})^{n-p}\oplus L^{\prime(2p-n)}, and let h:V×VLh:V\times V\to L^{\prime} be the τ\tau^{\prime}-hermitian form given as the orthogonal sum

h=hb(np)1(2pn),h=h_{b}^{\oplus(n-p)}\oplus\langle 1\rangle^{\oplus(2p-n)},

where 1\langle 1\rangle denotes the τ\tau^{\prime}-hermitian form (x,y)xτ(y)(x,y)\mapsto x\tau^{\prime}(y). Let 𝐆\mathbf{G} be the special unitary group 𝐒𝐔h\operatorname{\mathbf{SU}}_{h} associated to hh.

The local structure of 𝐆\mathbf{G} is as follows. Of course, 𝐆\mathbf{G} splits at the complex places σr+1,,σs\sigma_{r+1},\dots,\sigma_{s} of KK. At the other places, one has by construction Lσi/Kσi/L^{\prime}_{\sigma^{\prime}_{i}}/K_{\sigma_{i}}\cong\mathbb{C}/\mathbb{R} with τ\tau^{\prime} inducing complex conjugation; thus 𝐆\mathbf{G} remains an outer form (of 𝐒𝐋n\operatorname{\mathbf{SL}}_{n}) there. In addition, the quadratic form NL/K\mathrm{N}_{L/K} (hence also the hermitian form hbh_{b}) has local signature (1,1)(1,1) or (2,0)(2,0) over KσiK_{\sigma_{i}} for respectively 1ir1\leq i\leq r or s+1ids+1\leq i\leq d. Thus hh has local signature respectively (p,np)(p,n-p) or (n,0)(n,0) over KσiK_{\sigma_{i}}. Altogether, we have that

𝐆(K)i=1r𝐒𝐔h0()×i=1(sr)/2𝐒𝐋n()×i=1ds𝐒𝐔n(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\operatorname{\mathbf{SU}}_{h_{0}}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\operatorname{\mathbf{SL}}_{n}(\mathbb{C})\times\prod_{i=1}^{d-s}\operatorname{\mathbf{SU}}_{n}(\mathbb{R}),

so that 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

Since NL/K(α)=1\mathrm{N}_{L/K}(\alpha)=1, multiplication by α\alpha induces an isometry of the quadratic KK-space (L,b)(L,b), and hence a unitary operator uαu_{\alpha} of the hermitian space (LKL,hb)(L\otimes_{K}L^{\prime},h_{b}). Let then γ\gamma be the element of 𝐆(K)\mathbf{G}(K) which acts as uαu_{\alpha} on the first component of VV, and as the identity on the other components. Note that γ𝐆(𝒪K)\gamma\in\mathbf{G}(\mathcal{O}_{K}) by choice of the 𝒪K\mathcal{O}_{K}-structure.

5.3. Type 𝖡\mathsf{B}.

𝐅\mathbf{F} is of the form 𝐒𝐎q0\operatorname{\mathbf{SO}}_{q_{0}} for some quadratic form q0q_{0} on n\mathbb{R}^{n} (nn odd) of indefinite signature, say (p,np)(p,n-p) with pnpp\geq n-p.

Let VV be the KK-module LnpK2pnL^{n-p}\oplus K^{2p-n}, and endow VV with the quadratic form qq over KK given as the orthogonal sum

q=NL/K(np)1(2pn),q=\mathrm{N}_{L/K}^{\oplus(n-p)}\oplus\langle 1\rangle^{\oplus(2p-n)},

where 1\langle 1\rangle denotes the quadratic form xx2x\mapsto x^{2} on KK. Let 𝐆\mathbf{G} be the special orthogonal group 𝐒𝐎q\operatorname{\mathbf{SO}}_{q} associated to qq.

𝐆\mathbf{G} splits at the complex places σr+1,,σs\sigma_{r+1},\dots,\sigma_{s} of KK. For s+1ids+1\leq i\leq d, the quadratic KσiK_{\sigma_{i}}-form NLσi/Kσi\mathrm{N}_{L_{\sigma_{i}}/K_{\sigma_{i}}} is positive definite (since Lσi/Kσi/L_{\sigma_{i}}/K_{\sigma_{i}}\cong\mathbb{C}/\mathbb{R}), whereas for 1ir1\leq i\leq r, it has signature (1,1)(1,1). Thus qq has local signature respectively (n,0)(n,0) and (p,np)(p,n-p). Writing 𝐒𝐎n\operatorname{\mathbf{SO}}_{n} for the usual anisotropic special orthogonal \mathbb{R}-group, this means

𝐆(K)i=1r𝐒𝐎q0()×i=1(sr)/2𝐒𝐎n()×i=1ds𝐒𝐎n(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\operatorname{\mathbf{SO}}_{q_{0}}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\operatorname{\mathbf{SO}}_{n}(\mathbb{C})\times\prod_{i=1}^{d-s}\operatorname{\mathbf{SO}}_{n}(\mathbb{R}),

so that 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

Since NL/K(α)=1\mathrm{N}_{L/K}(\alpha)=1, multiplication by α\alpha induces an isometry uαu_{\alpha} of the quadratic KK-space (L,b)(L,b). We take γ\gamma to be the element of 𝐆(K)\mathbf{G}(K) which acts as uαu_{\alpha} on the first component of VV, and as the identity on the other components. Note again that γ𝐆(𝒪K)\gamma\in\mathbf{G}(\mathcal{O}_{K}) by choice of the 𝒪K\mathcal{O}_{K}-structure.

5.4. Type 𝖢\mathsf{C}, non-split.

𝐅\mathbf{F} is of the form 𝐒𝐔h0,D0\operatorname{\mathbf{SU}}_{h_{0},D_{0}}, where D0D_{0} is a quaternion \mathbb{R}-algebra and h0h_{0} is a hermitian form (with respect to quaternion conjugation) on D0nD_{0}^{n}, of signature say (p,np)(p,n-p) with pnpp\geq n-p. Either D0D_{0} is Hamilton’s quaternion algebra, or D0D_{0} splits, in which case quaternion conjugation corresponds to adjugation and 𝐒𝐔h0,D0\operatorname{\mathbf{SU}}_{h_{0},D_{0}} splits to become 𝐒𝐩2n\operatorname{\mathbf{Sp}}_{2n} regardless of the signature of h0h_{0}. We start with the case where D0D_{0} does not split, and thus h0h_{0} is indefinite (because 𝐅\mathbf{F} is isotropic).

The symmetric KK-bilinear form bb associated to the quadratic form NL/K\mathrm{N}_{L/K} on LL allows us to construct a τ\tau^{\prime}-hermitian form hbh_{b} on LKDL\otimes_{K}D^{\prime} by setting

hb(1d1,2d2)=b(1,2)d1τ(d2)1,2L,d1,d2D.h_{b}(\ell_{1}\otimes d^{\prime}_{1},\ell_{2}\otimes d^{\prime}_{2})=b(\ell_{1},\ell_{2})d^{\prime}_{1}\cdot\tau^{\prime}(d^{\prime}_{2})\qquad\ell_{1},\,\ell_{2}\in L,\;d^{\prime}_{1},\,d^{\prime}_{2}\in D^{\prime}.

Let VV be the (left) DD^{\prime}-module (LKD)npD2pn(L\otimes_{K}D^{\prime})^{n-p}\oplus{D^{\prime}}^{2p-n}, and endow VV with the τ\tau^{\prime}-hermitian form hh given as the orthogonal sum

h=hb(np)1(2pn).h=h_{b}^{\oplus(n-p)}\oplus\langle 1\rangle^{\oplus(2p-n)}.

Let 𝐆\mathbf{G} be the special unitary group 𝐒𝐔h,D\operatorname{\mathbf{SU}}_{h,D^{\prime}} associated to hh.

Of course, 𝐆\mathbf{G} splits at the complex places σr+1,,σs\sigma_{r+1},\dots,\sigma_{s}. On the other hand, DKσiD0D^{\prime}_{K_{\sigma_{i}}}\cong D_{0} when 1ir1\leq i\leq r or s+1ids+1\leq i\leq d, by construction. From the local signatures of NL/K\mathrm{N}_{L/K}, one computes that hh has local signatures (p,np)(p,n-p) or (n,0)(n,0) for 1ir1\leq i\leq r or s+1ids+1\leq i\leq d respectively. In other words, writing 𝐒𝐩n,0\operatorname{\mathbf{Sp}}_{n,0} for the anisotropic \mathbb{R}-form of 𝐒𝐩2n\operatorname{\mathbf{Sp}}_{2n},

𝐆(K)i=1r𝐒𝐔h0()×i=1(sr)/2𝐒𝐩2n()×i=1ds𝐒𝐩n,0(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\operatorname{\mathbf{SU}}_{h_{0}}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\operatorname{\mathbf{Sp}}_{2n}(\mathbb{C})\times\prod_{i=1}^{d-s}\operatorname{\mathbf{Sp}}_{n,0}(\mathbb{R}),

so that 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

Since NL/K(α)=1\mathrm{N}_{L/K}(\alpha)=1, multiplication by α\alpha induces an isometry of the quadratic KK-space (L,b)(L,b), and hence a unitary operator uαu_{\alpha} of the hermitian space (LKD,hb)(L\otimes_{K}D^{\prime},h_{b}). Let then γ𝐆(𝒪K)\gamma\in\mathbf{G}(\mathcal{O}_{K}) act as uαu_{\alpha} on the first component of VV, and as the identity on the other components.

5.5. Type 𝖢\mathsf{C}, split.

Now if D0D_{0} is split, we consider instead the τ+\tau^{+}-hermitian form h=1nh=\langle 1\rangle^{\oplus n} on (D+)n(D^{+})^{n}. Since D+D^{+} splits over KσiK_{\sigma_{i}} for 1is1\leq i\leq s, so does the KK-group 𝐆=𝐒𝐔h,D+\mathbf{G}=\operatorname{\mathbf{SU}}_{h,D^{+}}. On the other hand, D+D^{+} does not split at the remaining places, and the signature of hh indicates that 𝐆(K)\mathbf{G}(K\otimes\mathbb{R}) projects appropriately onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}). It remains to observe that α\alpha can be seen as an element of D+D^{+} through the canonical embedding of LL, and viewed as such, the element γ=diag(α,1,,1)𝐆𝐋n(D+)\gamma=\operatorname{diag}(\alpha,1,\dots,1)\in\operatorname{\mathbf{GL}}_{n}(D^{+}) actually belongs to 𝐆(𝒪K)\mathbf{G}(\mathcal{O}_{K}) because τ+\tau^{+} coincides with τ\tau on LL.

5.6. Type 𝖣\mathsf{D}.

𝐅\mathbf{F} is either of the form 𝐒𝐎q0\operatorname{\mathbf{SO}}_{q_{0}} for some quadratic form q0q_{0} on n\mathbb{R}^{n} (nn even) of indefinite signature, or of the form 𝐒𝐔h0,D0\operatorname{\mathbf{SU}}_{h_{0},D_{0}} for h0h_{0} the standard hermitian form on D0nD_{0}^{n} (n2n\geq 2) with respect to an involution of orthogonal type, where D0D_{0} is Hamilton’s quaternion algebra. The first case is treated exactly like type 𝖡\mathsf{B}; we thus focus on the second case.

Let ρ\rho denote the involution of orthogonal type on DD^{-} given by

ρ(x1+x2𝒊+x3𝒋+x4𝒌)=x1+x2𝒊+x3𝒋x4𝒌x1,x2,x3,x4K\rho(x_{1}+x_{2}\bm{i}+x_{3}\bm{j}+x_{4}\bm{k})=x_{1}+x_{2}\bm{i}+x_{3}\bm{j}-x_{4}\bm{k}\qquad x_{1},x_{2},x_{3},x_{4}\in K

in the canonical basis {1,𝒊,𝒋,𝒌}\{1,\bm{i},\bm{j},\bm{k}\} of DD^{-} over KK. As previously, the symmetric KK-bilinear form bb associated to NL/K\mathrm{N}_{L/K} allows us to construct a ρ\rho-hermitian form hbh_{b} on LKDL\otimes_{K}D^{-}. Let VV be the (left) DD^{-}-module (LKD)(D)n2(L\otimes_{K}D^{-})\oplus(D^{-})^{n-2}, and endow VV with the ρ\rho-hermitian form given as the orthogonal sum

h=hb1(n2).h=h_{b}\oplus\langle 1\rangle^{\oplus(n-2)}.

Let 𝐆\mathbf{G} be the special unitary group 𝐒𝐔h,D\operatorname{\mathbf{SU}}_{h,D^{-}}.

By construction, DD0D^{-}\cong D_{0} over KσiK_{\sigma_{i}} for 1ir1\leq i\leq r, and hh is easily seen to be equivalent to the standard form h0h_{0} over D0D_{0}. For s+1ids+1\leq i\leq d however, DD^{-} splits over KσiK_{\sigma_{i}}, and ρ\rho becomes the transposition involution on M2(Kσi)\mathrm{M}_{2}(K_{\sigma_{i}}). Moreover, at these places the form bb is positive definite; so hh is again equivalent to the standard form 1n\langle 1\rangle^{\oplus n}, showing that over KσiK_{\sigma_{i}}, 𝐆𝐒𝐎2n\mathbf{G}\cong\operatorname{\mathbf{SO}}_{2n} is anisotropic (s+1ids+1\leq i\leq d). Altogether,

𝐆(K)i=1r𝐒𝐔h0,D0()×i=1(sr)/2𝐒𝐎2n()×i=1ds𝐒𝐎2n(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\operatorname{\mathbf{SU}}_{h_{0},D_{0}}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\operatorname{\mathbf{SO}}_{2n}(\mathbb{C})\times\prod_{i=1}^{d-s}\operatorname{\mathbf{SO}}_{2n}(\mathbb{R}),

so that 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

Again, multiplication by α\alpha induces a unitary operator uαu_{\alpha} on the hermitian space (LKD,hb)(L\otimes_{K}D^{-},h_{b}), and we take γ𝐆(𝒪K)\gamma\in\mathbf{G}(\mathcal{O}_{K}) to act as uαu_{\alpha} on the first component of VV, and as the identity on the other components.

5.7. Type 𝖦2\mathsf{G}_{2}.

𝐅\mathbf{F} is of the form 𝐀𝐮𝐭O0\operatorname{\mathbf{Aut}}O_{0}, where O0O_{0} is the split octonion \mathbb{R}-algebra. (Note that regardless of the base field, the only isotropic form of 𝖦2\mathsf{G}_{2} is the split one.)

Let O+O^{+} be the octonion KK-algebra obtained by applying the Cayley–Dickson construction to D+D^{+} with the parameter 1-1. Recall that this means O+=D+D+O^{+}=D^{+}\oplus D^{+}\bm{\ell}, endowed with the multiplication rule

(a+b)(c+d)=(acτ+(d)b)+(da+bτ+(c))a,b,c,dD+.(a+b\bm{\ell})(c+d\bm{\ell})=(ac-\tau^{+}(d)b)+(da+b\tau^{+}(c))\bm{\ell}\qquad a,b,c,d\in D^{+}.

(Recall that a split octonion algebra may be obtained by the Cayley-Dickson construction applied to a split quaternion algebra with any parameter.)

Let 𝐆\mathbf{G} be the KK-group 𝐀𝐮𝐭O+\operatorname{\mathbf{Aut}}O^{+}.

For 1is1\leq i\leq s, the algebra O+O^{+}, hence also 𝐆\mathbf{G}, splits over KσiK_{\sigma_{i}} because D+D^{+} does, whereas 𝐆\mathbf{G} is anisotropic over KσiK_{\sigma_{i}} for s+1ids+1\leq i\leq d, because at these places O+O^{+} is a (nonassociative) division algebra since its norm form (which is the double of the norm form of D+D^{+}) is positive definite. Altogether, writing 𝐅anis\mathbf{F}^{\mathrm{anis}} for the anisotropic \mathbb{R}-group of type 𝖦2\mathsf{G}_{2},

𝐆(K)i=1r𝐅()×i=1(sr)/2𝐅()×i=1ds𝐅anis(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\mathbf{F}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\mathbf{F}(\mathbb{C})\times\prod_{i=1}^{d-s}\mathbf{F}^{\mathrm{anis}}(\mathbb{R}),

so that 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

One checks that given xD+x\in D^{+} with τ+(x)x=1\tau^{+}(x)x=1, the assignment a+ba+(xb)a+b\bm{\ell}\mapsto a+(xb)\bm{\ell} defines an automorphism uxu_{x} of O+O^{+} (see [SV00, §2.1]). As LL embeds in D+D^{+} (with τ+\tau^{+} restricting to τ\tau), α\alpha can be seen as an element of D+D^{+} of norm 1. The automorphism γ=uα\gamma=u_{\alpha} of O+O^{+} then belongs to 𝐆(𝒪K)\mathbf{G}(\mathcal{O}_{K}) (if as before, 𝐆\mathbf{G} is endowed with the 𝒪K\mathcal{O}_{K}-structure induced by an 𝒪K\mathcal{O}_{K}-structure on O+O^{+} extending the basis {1,α}\{1,\alpha\} of LD+L\subset D^{+}).

5.8. Preliminaries for type 𝖥4\mathsf{F}_{4}

Recall that given a field MM, an octonion algebra OO over MM, and parameters c=(c1,c2,c3)(M×)3c=(c_{1},c_{2},c_{3})\in(M^{\times})^{3}, the set

𝒜M(O,c)={(x1y3c11c3y¯2c21c1y¯3x2y1y2c31c2y¯1x3)|x1,x2,x3M,y1,y2,y3O}\mathcal{A}_{M}(O,c)=\left\{\begin{pmatrix}x_{1}&y_{3}&c_{1}^{-1}c_{3}\overline{y}_{2}\\ c_{2}^{-1}c_{1}\overline{y}_{3}&x_{2}&y_{1}\\ y_{2}&c_{3}^{-1}c_{2}\overline{y}_{1}&x_{3}\end{pmatrix}\;\middle|\;x_{1},x_{2},x_{3}\in M,y_{1},y_{2},y_{3}\in O\right\}

of (c1,c2,c3)(c_{1},c_{2},c_{3})-hermitian 3×33\times 3 matrices with entries in OO (endowed with the Jordan product) forms an exceptional Jordan algebra called an Albert algebra. When the underlying octonion algebra is split, any choices of c1,c2,c3c_{1},c_{2},c_{3} yield isomorphic Albert algebras, which are accordingly called split.

Over \mathbb{R}, there are three isomorphism classes of Albert algebras, represented by: the split Albert algebra of (1,1,1)(1,1,1)-hermitian 3×33\times 3 matrices with entries in the split octonion \mathbb{R}-algebra, the algebra of (1,1,1)(1,1,1)-hermitian 3×33\times 3 matrices with entries in Cayley’s octonion algebra (which we call the definite Albert \mathbb{R}-algebra), and the algebra of (1,1,1)(1,1,-1)-hermitian 3×33\times 3 matrices with entries in Cayley’s octonion algebra (which we call the indefinite Albert \mathbb{R}-algebra).

5.9. Type 𝖥4\mathsf{F}_{4}, split

𝐅\mathbf{F} is of the form 𝐀𝐮𝐭A0\operatorname{\mathbf{Aut}}A_{0}, where A0A_{0} is the split Albert \mathbb{R}-algebra.

Let O+O^{+} be the octonion KK-algebra obtained by applying the Cayley–Dickson construction to D+D^{+} with the parameter 1-1, and let A+=𝒜K(O+,(1,1,1))A^{+}=\mathcal{A}_{K}\big{(}O^{+},(1,1,1)\big{)}. Let 𝐆\mathbf{G} be the KK-group 𝐀𝐮𝐭A+\operatorname{\mathbf{Aut}}A^{+}; it is a simple group of type 𝖥4\mathsf{F}_{4} (see namely [SV00, Ch. 5–7] for this fact and others concerning the structure of Albert algebras).

As mentioned in §5.7, O+O^{+} splits over KσiK_{\sigma_{i}} for 1is1\leq i\leq s, whereas it remains a division algebra over KσiK_{\sigma_{i}} for s+1ids+1\leq i\leq d. Thus A+A^{+} hence 𝐆\mathbf{G} splits over KσiK_{\sigma_{i}} for 1is1\leq i\leq s, while for s+1ids+1\leq i\leq d, A+A^{+} is isomorphic over KσiK_{\sigma_{i}} to the definite Albert \mathbb{R}-algebra, hence 𝐆\mathbf{G} is isomorphic over KσiK_{\sigma_{i}} to the anisotropic simple \mathbb{R}-group 𝐅anis\mathbf{F}^{\mathrm{anis}} of type 𝖥4\mathsf{F}_{4}. Altogether,

𝐆(K)i=1r𝐅()×i=1(sr)/2𝐅()×i=1ds𝐅anis(),\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R})\cong\prod_{i=1}^{r}\mathbf{F}(\mathbb{R})\times\prod_{i=1}^{(s-r)/2}\mathbf{F}(\mathbb{C})\times\prod_{i=1}^{d-s}\mathbf{F}^{\mathrm{anis}}(\mathbb{R}),

so that 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

Given xD+x\in D^{+} of norm 1, the automorphism uxu_{x} of O+O^{+} from §5.7 extends canonically to an automorphism of A+A^{+}, still to be denoted uxu_{x}. By choice of the 𝒪K\mathcal{O}_{K}-structure on LL, and in turn on D+D^{+}, O+O^{+} and A+A^{+}, the automorphism γ=uα\gamma=u_{\alpha} of A+A^{+} belongs to 𝐆(𝒪K)\mathbf{G}(\mathcal{O}_{K}).

5.10. Type 𝖥4\mathsf{F}_{4}, non-split

𝐅\mathbf{F} is of the form 𝐀𝐮𝐭A0\operatorname{\mathbf{Aut}}A_{0}, where this time A0A_{0} is the indefinite Albert \mathbb{R}-algebra.

Let OO^{\prime} be the octonion KK-algebra obtained by applying the Cayley–Dickson construction to DD^{\prime} with the parameter 1-1, and let A=𝒜K(O,(1,1,(αα1)2)A^{\prime}=\mathcal{A}_{K}(O^{\prime},(1,1,-(\alpha-{\alpha}^{-1})^{2}). Let 𝐆\mathbf{G} be the KK-group 𝐀𝐮𝐭A\operatorname{\mathbf{Aut}}A^{\prime}.

The octonion algebra OO^{\prime} is isomorphic to Cayley’s octonion algebra over KσiK_{\sigma_{i}} for all 1id1\leq i\leq d. The signs of the parameters defining AA^{\prime} vary under the different embeddings of KK in such a way that for 1is1\leq i\leq s, AA^{\prime} is isomorphic over KσiK_{\sigma_{i}} to A0A_{0}, hence also 𝐆\mathbf{G} to 𝐅\mathbf{F}. For s+1ids+1\leq i\leq d, AA^{\prime} is isomorphic over KσiK_{\sigma_{i}} to the definite Albert \mathbb{R}-algebra, hence 𝐆\mathbf{G} is isomorphic over KσiK_{\sigma_{i}} to the anisotropic \mathbb{R}-form 𝐅anis\mathbf{F}^{\mathrm{anis}}, as required.

Conjugation(7)(7)(7)Here, we only use that OO^{\prime} is a KK-(bi)module, which splits as a direct sum of KK and the KK-submodule of totally imaginary octonions. Conjugation by a matrix with arbitrary entries in OO^{\prime} makes no sense since OO^{\prime} is not an associative algebra. by the matrix ((α+α1)/2 0 (αα1)2/2 0 1 0 1/2 0 (α+α1)/2)𝐆𝐋3(K)\left(\begin{sma\ell\ell matrix}{(\alpha+{\alpha}^{-1})}/{2}&\mbox{ }0\mbox{ }&{(\alpha-{\alpha}^{-1})^{2}}/{2}\\ \mbox{ }0\mbox{ }&1&\mbox{ }0\mbox{ }\\ {1}/{2}&\mbox{ }0\mbox{ }&{(\alpha+{\alpha}^{-1})}/{2}\end{sma\ell\ell matrix}\right)\in\operatorname{\mathbf{GL}}_{3}(K) defines a linear map uα𝐆𝐋(A)u_{\alpha}\in\operatorname{\mathbf{GL}}(A^{\prime}). Using the fact that the above matrix preserves the diagonal quadratic form 1,1,(αα1)2\langle 1,1,-(\alpha-{\alpha}^{-1})^{2}\rangle, it is readily seen that uαu_{\alpha} is in fact an automorphism of the Albert algebra AA^{\prime}. What is perhaps less obvious is that γ=uα\gamma=u_{\alpha} belongs to an arithmetic lattice in 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}). Let Ω\Omega be the 𝒪K\mathcal{O}_{K}-submodule of M3(O)\mathrm{M}_{3}(O^{\prime}) spanned by the canonical basis. Conjugation by the matrix (1 0 (α+α1)/2 0 1 0  0  0 1/2)𝐆𝐋3(K)\left(\begin{sma\ell\ell matrix}1&\mbox{ }0\mbox{ }&{(\alpha+{\alpha}^{-1})}/{2}\\ \mbox{ }0\mbox{ }&1&\mbox{ }0\mbox{ }\\ \mbox{ }0\mbox{ }&\mbox{ }0\mbox{ }&{1}/{2}\end{sma\ell\ell matrix}\right)\in\operatorname{\mathbf{GL}}_{3}(K) transforms Ω\Omega into Ω\Omega^{\prime}, which has the property that the 𝒪K\mathcal{O}_{K}-submodule AΩA^{\prime}\cap\Omega^{\prime} of AA^{\prime} is preserved by γ\gamma. In other words, γ\gamma belongs to Γ=Stab𝐆(K)(AΩ)\Gamma=\operatorname{Stab}_{\mathbf{G}(K)}(A^{\prime}\cap\Omega^{\prime}), which is a lattice when embedded diagonally in 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}). This can be seen easily as follows. The quadratic form 1,(αα1)2\langle 1,-(\alpha-{\alpha}^{-1})^{2}\rangle is nothing but the norm form of L/KL/K written in the KK-basis {1,αα1}\{1,\alpha-{\alpha}^{-1}\} of LL. The matrix (1(α+α1)/2 0 1/2)\left(\begin{sma\ell\ell matrix}1&{(\alpha+{\alpha}^{-1})}/{2}\\ \mbox{ }0\mbox{ }&{1}/{2}\end{sma\ell\ell matrix}\right) changes basis back to the standard basis {1,α}\{1,\alpha\}. The non-trivial block of γ\gamma in this new basis then rewrites ( 0 11α+α1)𝐆𝐋2(𝒪K),\left(\begin{sma\ell\ell matrix}\mbox{ }0\mbox{ }&-1\\ 1&\alpha+{\alpha}^{-1}\end{sma\ell\ell matrix}\right)\in\operatorname{\mathbf{GL}}_{2}(\mathcal{O}_{K}), which is just the matrix of multiplication by α\alpha on LL.(8)(8)(8)If there were such things as hermitian forms over octonions, one would directly extend the norm form of L/KL/K to OO^{\prime} instead of working with the form 1,(αα1)2\langle 1,-(\alpha-{\alpha}^{-1})^{2}\rangle. This would simplify the notation and avoid the base-change computations. This also shows that γ\gamma is not torsion, and that γ2cm1\gamma^{2c_{m}}\to 1 whenever α2cm1\alpha^{2c_{m}}\to 1 (as needed in the argument of §4.5).

5.11. Preliminaries for type 𝖤\mathsf{E}

In order to treat groups of type 𝖤\mathsf{E}, we briefly recall Tits’ construction of the exceptional simple Lie algebras. Let BB and CC be composition algebras over a field MM of characteristic 2,3\neq 2,3; this means that B,CB,C are chosen among MM itself, quadratic étale algebras, quaternion algebras, or octonion algebras over MM. Let JJ be the Jordan algebra 𝒜M(B,(c1,c2,c3))\mathcal{A}_{M}(B,(c_{1},c_{2},c_{3})), for some c1,c2,c3Mc_{1},c_{2},c_{3}\in M (defined analogously to §5.8). The set

𝔏(C,J)=Der(C)(CJ)Der(J),\mathfrak{L}(C,J)=\operatorname{Der}(C)\oplus(C^{\circ}\otimes J^{\circ})\oplus\operatorname{Der}(J),

where CC^{\circ} (resp. JJ^{\circ}) denotes the kernel of the trace of CC (resp. JJ), can be endowed with a Lie bracket which turns it into a semisimple Lie MM-algebra whose absolute type is given by the Freudenthal-Tits magic square:

dimMC dimMB 12481𝖠1𝖠2𝖢3𝖥42𝖠2𝖠2×𝖠2𝖠5𝖤64𝖢3𝖠5𝖣6𝖤78𝖥4𝖤6𝖤7𝖤8\begin{array}[]{|c|c|c|c|c|}\hline\cr\hbox{\diagbox[]{{\shortstack[l]{$\dim_{M}C$}}}{{\shortstack[r]{$\dim_{M}B$}}}{}}&1&2&4&8\\ \hline\cr 1&\mathsf{A}_{1}&\mathsf{A}_{2}&\mathsf{C}_{3}&\mathsf{F}_{4}\\ \hline\cr 2&\mathsf{A}_{2}&\mathsf{A}_{2}\times\mathsf{A}_{2}&\mathsf{A}_{5}&\mathsf{E}_{6}\\ \hline\cr 4&\mathsf{C}_{3}&\mathsf{A}_{5}&\mathsf{D}_{6}&\mathsf{E}_{7}\\ \hline\cr 8&\mathsf{F}_{4}&\mathsf{E}_{6}&\mathsf{E}_{7}&\mathsf{E}_{8}\\ \hline\cr\end{array}

This construction and the magic square were discovered independently by Freudenthal and Tits. Quite remarkably, when extended appropriately to include 𝖦2\mathsf{G}_{2}, the construction gives Lie algebras of all the exceptional types in a unified way. For a more detailed description, we refer the reader to the original articles of Freudenthal [[Fre54a][Fre63]] and Tits [Tit66a], as well as to the lecture notes of Jacobson [Jac71] and the Book of Involutions [KMRT98, Ch. IX].

It turns out that over \mathbb{R}, Tits’ construction produces all possible forms of simple real Lie algebras of type 𝖤\mathsf{E} or 𝖥\mathsf{F}, as CC and JJ run through all possible combinations. We refer to [Jac71, pp. 119–121] for a summary of which input (C,J)(C,J) to use for each \mathbb{R}-form. We will use this in the remaining cases (types 𝖤6\mathsf{E}_{6}, 𝖤7\mathsf{E}_{7}, 𝖤8\mathsf{E}_{8}) to describe the group 𝐅\mathbf{F} and construct an appropriate form of 𝐅\mathbf{F} over 𝒪K\mathcal{O}_{K}.(9)(9)(9)For 𝖤6\mathsf{E}_{6} (and perhaps for 𝖤7\mathsf{E}_{7}), one could have used a description more resemblant to that of 𝖦2\mathsf{G}_{2} and 𝖥4\mathsf{F}_{4}. However, the smallest nontrivial irreducible representation of a group of type 𝖤8\mathsf{E}_{8} is the adjoint representation, hence one essentially cannot avoid looking at a group of type 𝖤8\mathsf{E}_{8} as the automorphism groups of its Lie algebra. In fact, the case 𝖥4\mathsf{F}_{4} treated above is also encompassed by this approach (take C=C=\mathbb{R} and BB an octonion \mathbb{R}-algebra); but we decided to treat 𝖥4\mathsf{F}_{4} beforehand because it will be used for type 𝖤\mathsf{E}, in the following way.

As is easily seen from the definition of the Lie bracket, any automorphism φ\varphi of CC (resp. JJ) induces an automorphism φ~\widetilde{\varphi} of 𝔏(C,J)\mathfrak{L}(C,J) by acting via conjugation on the Lie subalgebra Der(C)\operatorname{Der}(C) (resp. Der(J)\operatorname{Der}(J)), acting canonically on the first (resp. second) tensor component of CJC^{\circ}\otimes J^{\circ}, and acting trivially on Der(J)\operatorname{Der}(J) (resp. Der(C)\operatorname{Der}(C)). This assignment yields morphisms of algebraic groups 𝐀𝐮𝐭C𝐀𝐮𝐭𝔏(C,J)\operatorname{\mathbf{Aut}}C\to\operatorname{\mathbf{Aut}}\mathfrak{L}(C,J), resp. 𝐀𝐮𝐭J𝐀𝐮𝐭𝔏(C,J)\operatorname{\mathbf{Aut}}J\to\operatorname{\mathbf{Aut}}\mathfrak{L}(C,J), whose images commute, and which for the last row (resp. column) of the magic square correspond to the familiar embeddings 𝖦2\mathsf{G}_{2} (resp. 𝖥4\mathsf{F}_{4}) 𝖥4,𝖤6,𝖤7,𝖤8\hookrightarrow\mathsf{F}_{4},\mathsf{E}_{6},\mathsf{E}_{7},\mathsf{E}_{8}.

5.12. Type 𝖤\mathsf{E}

𝐅\mathbf{F} is of the form 𝐀𝐮𝐭𝔏(C0,A0)\operatorname{\mathbf{Aut}}\mathfrak{L}(C_{0},A_{0}), where C0C_{0} is a quadratic étale, a quaternion or an octonion \mathbb{R}-algebra (depending on the absolute rank of 𝐅\mathbf{F}), and A0A_{0} is an Albert \mathbb{R}-algebra. There are respectively four, three and two possibilities for the isotropic \mathbb{R}-group 𝐅\mathbf{F}. We list them below (labelled by the signature of their Killing form), and fix inputs (C0,A0)(C_{0},A_{0}) when there is more than one choice (obviously, a similar construction works with any of the choices). We begin with 𝖤6\mathsf{E}_{6}:

  1. (𝖤66\mathsf{E}_{6}^{6})

    C0C_{0}\cong\mathbb{R}\oplus\mathbb{R} and A0A_{0} is split (as in §5.9). In this case, 𝐅\mathbf{F} is split.

  2. (𝖤626\mathsf{E}_{6}^{-26})

    C0C_{0}\cong\mathbb{R}\oplus\mathbb{R} and A0A_{0} is (up to isomorphism) any of the two non-split Albert \mathbb{R}-algebras; we pick A0A_{0} to be indefinite (as in §5.10). In this case, 𝐅\mathbf{F} is the non-split inner \mathbb{R}-form of type 𝖤6\mathsf{E}_{6}.

The remaining \mathbb{R}-forms of 𝖤6\mathsf{E}_{6} are outer forms. The two isotropic ones are given by:

  1. (𝖤62\mathsf{E}_{6}^{2})

    C0C_{0}\cong\mathbb{C} and A0A_{0} split.

  2. (𝖤614\mathsf{E}_{6}^{-14})

    C0C_{0}\cong\mathbb{C} and A0A_{0} is indefinite (as in §5.10).

Now for 𝖤7\mathsf{E}_{7}:

  1. (𝖤77\mathsf{E}_{7}^{7})

    C0C_{0} and A0A_{0} are split. In this case, 𝐅\mathbf{F} is split.

  2. (𝖤725\mathsf{E}_{7}^{-25})

    C0C_{0} is split and A0A_{0} is any of the two non-split Albert \mathbb{R}-algebras; we pick A0A_{0} indefinite (as in §5.10).

  3. (𝖤75\mathsf{E}_{7}^{-5})

    C0C_{0} is Hamilton’s quaternion algebra and A0A_{0} is split or indefinite; we pick A0A_{0} split.

And lastly for 𝖤8\mathsf{E}_{8}:

  1. (𝖤88\mathsf{E}_{8}^{8})

    C0C_{0} and A0A_{0} are split, or alternatively C0C_{0} is Cayley’s octonion algebra and A0A_{0} is indefinite; we pick C0C_{0} and A0A_{0} both split. In this case, 𝐅\mathbf{F} is split.

  2. (𝖤824\mathsf{E}_{8}^{-24})

    C0C_{0} is Cayley’s octonion algebra and A0A_{0} is split, or alternatively C0C_{0} is split and A0A_{0} is any of the two non-split Albert \mathbb{R}-algebras; we pick the former.

Note that for each rank, exactly one combination for (C0,A0)(C_{0},A_{0}) is disregarded, namely the one where C0C_{0} is not split and A0A_{0} is definite. This combination gives the (unique) anisotropic \mathbb{R}-form of type 𝖤\mathsf{E} of the corresponding rank.

For each one of these possibilities, we define the KK-algebras CC and AA as indicated in the table below. The KK-algebras O+O^{+}, A+A^{+} are those defined in §5.9, and OO^{\prime}, AA^{\prime} are defined in §5.10.

\mathbb{R}-type 𝖤66\mathsf{E}_{6}^{6} 𝖤626\mathsf{E}_{6}^{-26} 𝖤62\mathsf{E}_{6}^{2} 𝖤614\mathsf{E}_{6}^{14} 𝖤77\mathsf{E}_{7}^{7} 𝖤725\mathsf{E}_{7}^{-25} 𝖤75\mathsf{E}_{7}^{-5} 𝖤88\mathsf{E}_{8}^{8} 𝖤824\mathsf{E}_{8}^{-24}
CC LL LL LL^{\prime} LL^{\prime} D+D^{+} D+D^{+} DD^{\prime} O+O^{+} OO^{\prime}
AA A+A^{+} AA^{\prime} A+A^{+} AA^{\prime} A+A^{+} AA^{\prime} A+A^{+} A+A^{+} A+A^{+}

We then let 𝐆\mathbf{G} be the KK-group 𝐀𝐮𝐭𝔏(C,A)\operatorname{\mathbf{Aut}}\mathfrak{L}(C,A) afforded by Tits’ construction.

In view of the behavior of the chosen algebras CC and AA at the archimedean places of KK (described in §5.9 or §5.10), 𝐆\mathbf{G} is isomorphic to 𝐅\mathbf{F} over KσiK_{\sigma_{i}} for i=1,,si=1,\dots,s, whereas it remains anisotropic over KσiK_{\sigma_{i}} for i=s+1,,di=s+1,\dots,d. Thus 𝐆(K)\mathbf{G}(K\otimes_{\mathbb{Q}}\mathbb{R}) projects onto 𝐆s,r()\mathbf{G}_{s,r}(\mathbb{R}) with compact kernel.

Let uαu_{\alpha} be the automorphism of AA constructed from α\alpha in §5.9 (if A=A+A=A^{+}) or in §5.10 (if A=AA=A^{\prime}). The induced automorphism γ=u~α\gamma=\widetilde{u}_{\alpha} of 𝔏(C,A)\mathfrak{L}(C,A) is then the required element of 𝐆(𝒪K)\mathbf{G}(\mathcal{O}_{K}), where we use for 𝔏(C,A)\mathfrak{L}(C,A) the 𝒪K\mathcal{O}_{K}-structure canonically induced by that of CC and AA (given in §5.9 or §5.10). Indeed, γ\gamma is not torsion since the morphism ~:𝐀𝐮𝐭A𝐀𝐮𝐭𝔏(C,A)\widetilde{\phantom{u}}:\operatorname{\mathbf{Aut}}A\to\operatorname{\mathbf{Aut}}\mathfrak{L}(C,A) used to construct it is injective, and γ2cm1\gamma^{2c_{m}}\to 1 whenever α2cm1\alpha^{2c_{m}}\to 1 because (the extension of) this morphism (to any completion of KK) is continuous.

References

  • [BDM07] Peter Borwein, Edward Dobrowolski, and Michael J. Mossinghoff. Lehmer’s problem for polynomials with odd coefficients. Ann. of Math. (2), 166(2):347–366, 2007.
  • [Ben08] Yves Benoist. Réseaux des groupes de lie. Notes de Cours, 2008.
  • [BHC62] Armand Borel and Harish-Chandra. Arithmetic subgroups of algebraic groups. Ann. of Math. (2), 75:485–535, 1962.
  • [Bre07] Emmanuel Breuillard. On uniform exponential growth for solvable groups. Pure Appl. Math. Q., 3(4):949–667, 2007.
  • [BV20] Emmanuel Breuillard and Péter P. Varjú. Entropy of Bernoulli convolutions and uniform exponential growth for linear groups. J. Anal. Math., 140(2):443–481, 2020.
  • [Dob79] Edward Dobrowolski. On a question of Lehmer and the number of irreducible factors of a polynomial. Acta Arith., 34(4):391–401, 1979.
  • [ERT19] Vincent Emery, John G. Ratcliffe, and Steven T. Tschantz. Salem numbers and arithmetic hyperbolic groups. Trans. Amer. Math. Soc., 372(1):329–355, 2019.
  • [Fra21] Mikołaj Fraczyk. Strong limit multiplicity for arithmetic hyperbolic surfaces and 3-manifolds. Invent. Math., 224(3):917–985, 2021.
  • [Fre54a] Hans Freudenthal. Beziehungen der E7{E_{7}} und E8{E_{8}} zur Oktavenebene. I. Nederl. Akad. Wet., Proc., Ser. A, 57:218–230, 1954.
  • [Fre54b] Hans Freudenthal. Beziehungen der E7E_{7} und E8E_{8} zur Oktavenebene. II. Nederl. Akad. Wet., Proc., Ser. A, 57:363–368, 1954.
  • [Fre55] Hans Freudenthal. Beziehungen der E7E_{7} und E8E_{8} zur Oktavenebene. III, IV. Nederl. Akad. Wet., Proc., Ser. A, 58:151–157, 277–285, 1955.
  • [Fre59] Hans Freudenthal. Beziehungen der 𝔈7{\mathfrak{E}}_{7} und 𝔈8{\mathfrak{E}}_{8} zur Oktavenebene. V-IX. Nederl. Akad. Wet., Proc., Ser. A, 62:165–179, 180–191, 192–201, 447–465, 466–474, 1959.
  • [Fre63] Hans Freudenthal. Beziehungen der 𝔈7{\mathfrak{E}}_{7} und 𝔈8{\mathfrak{E}}_{8} zur Oktavenebene. X, XL. Nederl. Akad. Wet., Proc., Ser. A, 66:457–471, 472–487, 1963.
  • [Gel04] Tsachik Gelander. Homotopy type and volume of locally symmetric manifolds. Duke Math. J., 124(3):459–515, 2004.
  • [GH01] Eknath Ghate and Eriko Hironaka. The arithmetic and geometry of Salem numbers. Bull. Amer. Math. Soc. (N.S.), 38(3):293–314, 2001.
  • [Jac71] Nathan Jacobson. Exceptional Lie algebras. Lecture Notes in Pure and Applied Mathematics. New York: Marcel Dekker, Inc., 1971.
  • [KMRT98] Max-Albert Knus, Alexander Merkurjev, Markus Rost, and Jean-Pierre Tignol. The book of involutions. With a preface by J. Tits., volume 44. Providence, RI: American Mathematical Society, 1998.
  • [Leh33] Derrick H. Lehmer. Factorization of certain cyclotomic functions. Ann. Math. (2), 34:461–479, 1933.
  • [Mar75] Gregory A. Margulis. Discrete groups of motions of manifolds of nonpositive curvature. In Proceedings of the International Congress of Mathematicians (Vancouver, B.C., 1974), Vol. 2, pages 21–34. Canad. Math. Congress, Montreal, Que., 1975.
  • [Mar91] Gregory A. Margulis. Discrete subgroups of semisimple Lie groups, volume 17 of Ergebnisse der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)]. Springer-Verlag, Berlin, 1991.
  • [MR03] Colin Maclachlan and Alan W. Reid. The arithmetic of hyperbolic 3-manifolds, volume 219 of Graduate Texts in Mathematics. Springer-Verlag, New York, 2003.
  • [NR92] Walter D. Neumann and Alan W. Reid. Arithmetic of hyperbolic manifolds. In Topology ’90 (Columbus, OH, 1990), volume 1 of Ohio State Univ. Math. Res. Inst. Publ., pages 273–310. de Gruyter, Berlin, 1992.
  • [Sch73] Andrzej Schinzel. On the product of the conjugates outside the unit circle of an algebraic number. Acta Arith., 24:385–399, 1973.
  • [Sch80] Wolfgang M. Schmidt. Diophantine approximation, volume 785 of Lecture Notes in Mathematics. Springer, Berlin, 1980.
  • [Smy71] Chris Smyth. On the product of the conjugates outside the unit circle of an algebraic integer. Bull. London Math. Soc., 3:169–175, 1971.
  • [Smy08] Chris Smyth. The Mahler measure of algebraic numbers: a survey. In Number theory and polynomials, volume 352 of London Math. Soc. Lecture Note Ser., pages 322–349. Cambridge Univ. Press, Cambridge, 2008.
  • [Smy15] Chris Smyth. Seventy years of Salem numbers. Bull. Lond. Math. Soc., 47(3):379–395, 2015.
  • [Sur92] B. Sury. Arithmetic groups and Salem numbers. Manuscripta Math., 75(1):97–102, 1992.
  • [SV00] Tonny A. Springer and Ferdinand D. Veldkamp. Octonions, Jordan algebras and exceptional groups. Revised English version of the original German notes. Berlin: Springer, revised english version of the original german notes edition, 2000.
  • [Tit66a] Jacques Tits. Algèbres alternatives, algèbres de Jordan et algèbres de Lie exceptionnelles. I: Construction. Nederl. Akad. Wet., Proc., Ser. A, 69:223–237, 1966.
  • [Tit66b] Jacques Tits. Classification of algebraic semisimple groups. In Algebraic Groups and Discontinuous Subgroups (Proc. Sympos. Pure Math., Boulder, Colo., 1965), pages 33–62. Amer. Math. Soc., Providence, R.I., 1966.
  • [Vou96] Paul Voutier. An effective lower bound for the height of algebraic numbers. Acta Arith., 74(1):81–95, 1996.
  • [Zim84] Robert J. Zimmer. Ergodic theory and semisimple groups, volume 81 of Monographs in Mathematics. Birkhäuser Verlag, Basel, 1984.