This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Convexity and Aigner’s Conjectures

Greg McShane Institut Fourier 100 rue des maths, BP 74, 38402 St Martin d’Hères cedex, France mcshane at ujf-grenoble.fr
Abstract.

Markov numbers are integers that appear in triples which are solutions of a Diophantine equation, the so-called Markov cubic

x2+y2+z23xyz=0.x^{2}+y^{2}+z^{2}-3xyz=0.

A classical topic in number theory, these numbers are related to many areas of mathematics such as combinatorics, hyperbolic geometry, approximation theory and cluster algebras.

One can associate to each a positive rational number a Markov number in a natural way. We give a new unified proof of certain conjectures from Martin Aigner’s book, Markov’s Theorem and 100 Years of the Uniqueness Conjecture. Our proof relies on a relationship between Markov numbers and the lengths of closed simple geodesics on the punctured torus discovered by H. Cohn.

1. Introduction

The Markov numbers are the positive integer solutions of the Diophantine equation

x2+y2+z23xyz=0x^{2}+y^{2}+z^{2}-3xyz=0

Markov in the late 1800s showed that all these solutions could be given the structure of a binary tree. It became customary (and useful) to index the Markov numbers by the rationals in the interval [0,1][0,1] which stand at the same place in the Stern–Brocot binary tree (see Figures 2 and 3). Frobenius’ conjecture asserts that each Markov number appears at most once in this tree. If this conjecture is true, then the obvious order of Markov numbers gives rise to a new strict order on the rationals.

1.1. Aigner’s Monotonicity Conjectures

Aigner [1] proposed three conjectures to better understand this order.

  1. (1)

    (The fixed numerator conjecture) Let p,qp,q and ii be positive integers such that i>0,p<qi>0,p<q and gcd(p,q)=1,gcd(p,q+i)=1gcd(p,q)=1,gcd(p,q+i)=1 then

    mp/q<mp/(q+i).m_{p/q}<m_{p/(q+i)}.
  2. (2)

    (The fixed denominator conjecture) Let p,qp,q and ii be positive integers such that i>0,p<qi>0,p<q and gcd(p,q)=1,gcd(p+i,q)=1gcd(p,q)=1,gcd(p+i,q)=1 then

    mp/q<m(p+i)/q.m_{p/q}<m_{(p+i)/q}.
  3. (3)

    (The fixed sum conjecture) Let p,qp,q and ii be positive integers such that i>0,p<qi>0,p<q and gcd(p,q)=1,gcd(pi,q+i)=1gcd(p,q)=1,gcd(p-i,q+i)=1 then

    mp/q<m(pi)/(q+i).m_{p/q}<m_{(p-i)/(q+i)}.

The fixed numerator conjecture was proved in [8] using ideas from cluster algebra theory and snake graphs. Their proof relies on a connection between Christoffel paths and Markov numbers (see [2] for a discussion) which provides a formulation of Markov numbers as continuant polynomials. The proof is rather technical and introduces several interesting tools for studying paths in the Cayley graph of the monoid 2\mathbb{N}^{2} with the usual generators. In [9], the authors prove the fixed numerator conjecture by using ”transformations” on paths in the Cayley graph. We will show how these results can be deduced by considerations of convexity established in [7]. Our proof is represented visually in Figure 4.

1.2. Geometry of the Markov numbers

The question of counting for Markov triples, that is for R>0R>0 estimating the size of the set of Markov triples (x,y,z)(x,y,z) for which the sup norm is less than RR, was first investigated by Gurwood in his thesis. He established an asymptotic formula using a correspondence between Markov and Farey trees. Somewhat later Zagier [10] obtained an improved error term. At the time of writing the best result is due to a Rivin and the author [7]:

M(R)=C(logR)2+O(logRloglogR),M(R)=C(\log R)^{2}+O(\log R\log\log R),

for a numerical constant which has a geometric interpretation.

The method introduced in [7] differs from that of Gurwood and Zagier in that it relies on ideas of H. Cohn to interpret the question in terms of geometry of hyperbolic surfaces. In [4] Cohn shows that the Markov triple (1,1,1)(1,1,1) is, up to multiplication by 3, the image under the character map χ\chi of the modular torus. Recall that the modular torus is the quotient of the upper half \mathbb{H} plane by Γ\Gamma, the commutator subgroup of PSL(2,)\text{PSL}(2,\mathbb{Z}), acting by Mobius transformations. Secondly, he shows that the permutations and the Vieta flips used to construct Markov’s binary tree are induced by automorphisms of the fundamental group of the torus and concludes that Aut()\text{Aut}(\mathbb{Z}*\mathbb{Z}) acts transitively on Markov triples. It is well known that if ρ:PSL(2,)\rho:\mathbb{Z}*\mathbb{Z}\rightarrow\text{PSL}(2,\mathbb{R}) is a discrete faithful representation and γ\gamma a closed curve homotopic on the surface X=/ρ()X=\mathbb{H}/\rho(\mathbb{Z}*\mathbb{Z}), then X(γ)\ell_{X}(\gamma), the length of the closed geodesic homotopic to γ\gamma on the surface XX, can be computed from the trace using the relation

2coshX(γ)/2=|tr ρ(γ)|.2\cosh\ell_{X}(\gamma)/2=|\text{tr\,}\rho(\gamma)|.

We note that the absolute value of the trace of an element of PSL(2,)\text{PSL}(2,\mathbb{R}) is well defined. Now identifying \mathbb{Z}*\mathbb{Z} with the fundamental group of the modular torus Cohn shows that if xx is a Markov number then there exists a simple loop γ\gamma such that

x=13|tr ρ(γ)|.x=\frac{1}{3}|\text{tr\,}\rho(\gamma)|.

so there is an explicit relation between Markov numbers and lengths of simple geodesics on the modular torus. In [7] the simple closed curves are embedded as a subset of primitive elements in the integer lattice 22\mathbb{Z}^{2}\subset\mathbb{R}^{2} and the length function γX(γ)\gamma\mapsto\ell_{X}(\gamma) is shown, using a simple geometric argument, to extend to a proper convex function :2+\ell:\mathbb{R}^{2}\mapsto\mathbb{R}_{+}. Thus the counting problem for Markov numbers can be settled by counting integer points in the set

1([0,R])=R1([0,1])2\ell^{-1}([0,R])=R\ell^{-1}([0,1])\subset\mathbb{R}^{2}

It is easy to visualize this set using a very short computer program see for example Figure 1.

In summary, given a fraction p/qp/q one obtains a primitive lattice point (q,p)2(q,p)\in\mathbb{Z}^{2} and so a closed simple geodesic γp/qX\gamma_{p/q}\subset X such that

Mp/q=23cosh(X(γp/q))2)=23cosh((q,p)s),M_{p/q}=\frac{2}{3}\cosh\left(\frac{\ell_{X}(\gamma_{p/q}))}{2}\right)=\frac{2}{3}\cosh(\|(q,p)\|_{s}),

where .s\|.\|_{s} is the norm induced by the function 12:2+\frac{1}{2}\ell:\mathbb{R}^{2}\mapsto\mathbb{R}_{+}.

Refer to caption
Figure 1. A level set SS for the the length function \ell.

1.3. Restatement of Conjectures

Thus the fixed numerator conjecture is equivalent to : Let p,qp,q and ii be positive integers such that i>0,p<qi>0,p<q and pp is coprime with both q,q+iq,q+i then

(q,p)s<(q+i,p)s.\|(q,p)\|_{s}<\|(q+i,p)\|_{s}.

It is natural now to ask oneself whether we need to restrict to integers. In fact, using quite simple geometric arguments one can prove a result, in a non discrete setting, which yields Aigner’s conjectures as a corollary:

Theorem 1.1.

Let p,qp,q be real non negative numbers and i>0i>0 then

(1) (q,p)s<(q+i,p)s\|(q,p)\|_{s}<\|(q+i,p)\|_{s}

and

(2) (q,p)s<(q,p+i)s\|(q,p)\|_{s}<\|(q,p+i)\|_{s}

If in addition p<qp<q then

(3) (q,p)s<(q+i,pi)s\|(q,p)\|_{s}<\|(q+i,p-i)\|_{s}

1.4. Organisation, Remarks

We give a brief account of the geometry and topology of the once punctured torus in Sections 2 and 3. This is followed in Section 4 by the proof of the convexity of the length function \ell which appeared in [7]. We conclude with the proof of Theorem 1.1 in Section 5.

One could of course state many more theorems using these ideas but our purpose here is simply to show why Aigner’s conjectures are manifestations of the convexity of the length function.

Finally we note that it is unlikely that purely geometric methods would yield a proof of Frobenius’ conjecture as a natural variation of it is false [6].

1.5. Thanks

I thank Vlad Segesciu and Louis Funar for many useful conversations over the years concerning this subject. I am grateful too to Xu Binbin for comments on the preliminary versions of the text.

2. Hyperbolic structures and representations

The rank two free group \mathbb{Z}*\mathbb{Z} arises as the fundamental group of exactly two non-homeomorphic orientable surfaces: the three-holed sphere Σ0,3\Sigma_{0,3}, the one-holed torus Σ1,1\Sigma_{1,1}, Furthermore Σ1,1\Sigma_{1,1} enjoys the remarkable property that every automorphism of its fundamental group is geometric ie induced by a homeomorphism. Equivalently, every homotopy-equivalence Σ1,1Σ1,1\Sigma_{1,1}\rightarrow\Sigma_{1,1} is homotopic to a homeomorphism. Much time and energy has been devoted to the geometric and topological properties of the one holed torus see for example [3].

2.1. Fricke theory

It has been known since the time of Fricke that, as a consequence of Riemann’s Uniformization Theorem, the moduli space of hyperbolic structures on the three holed sphere and the one holed torus can be identified with semi algebraic subsets of 3\mathbb{R}^{3}. More precisely, one obtains a hyperbolic structure XX on Σ\Sigma from a discrete faithful representation ρ\rho of π1(Σ)\pi_{1}(\Sigma) into isom()\mathrm{isom}(\mathbb{H}) such that /ρ(π1(Σ))\mathbb{H}/\rho(\pi_{1}(\Sigma)) is homeomorphic to Σ\Sigma. One can lift any such ρ\rho from a representation into isom()+\mathrm{isom}(\mathbb{H})^{+}, which is isomorphic to PSL(2,)\text{P}\mathrm{SL}(2,\mathbb{R}), to a representation

ρ~:π1(Σ)SL(2,).\tilde{\rho}:\pi_{1}(\Sigma)\rightarrow\text{SL}(2,\mathbb{R}).

Doing this allows one to consider the trace tr ρ~(γ)\text{tr\,}\tilde{\rho}(\gamma)\in\mathbb{R} for any element γ\gamma of the free group so that many problems can be reduced to questions of linear algebra. In particular one defines a character map

χ:X(tr ρ~(α),tr ρ~(β),tr ρ~(αβ)),\chi:X\mapsto(\text{tr\,}\tilde{\rho}(\alpha),\text{tr\,}\tilde{\rho}(\beta),\text{tr\,}\tilde{\rho}(\alpha\beta)),

which provides an embedding of the moduli space of hyperbolic structures on Σ\Sigma. Goldman [5] studied the dynamical system defined by the action of the group of outer automorphisms of π1(Σ)\pi_{1}(\Sigma)\simeq\mathbb{Z}*\mathbb{Z} on the moduli space. If ϕ\phi is an automorphism then it acts on the representations

ρ~ρ~ϕ1\tilde{\rho}\mapsto\tilde{\rho}\circ\phi^{-1}

and evidently this induces an action on the points in the image of the embedding above. It turns out that, by applying Cayley-Hamilton theorem to SL(2,)\mathrm{SL}(2,\mathbb{R}), it is easy to show that this action is the restriction of polynomial diffeomorphisms of 3\mathbb{R}^{3}.

2.2. Markov triples

An important aspect of the theory of the action of the group of outer automorphisms is that it admits an invariant function. In the case of the pair of orientable surfaces this is

κ:(x,y,z)x2+y2+z2xyz.\kappa:(x,y,z)\mapsto x^{2}+y^{2}+z^{2}-xyz.

The level set κ1(0)\kappa^{-1}(0) has five connected components - the singleton (0,0,0)(0,0,0) and four codimension one manifolds homeomorphic to 2\mathbb{R}^{2}. The non trivial components can be identified with the Teichmueller space of a once punctured torus. It is well known that there are integer points in κ1(0)+3\kappa^{-1}(0)\cap\mathbb{R}_{+}^{3}, for example (3,3,3)(3,3,3), and that they form a single orbit for the action of the outer automorphisms. From such a point one obtains a Markov triple, that is a solution in positive integers of the Markov cubic

x2+y2+z23xyz=0x^{2}+y^{2}+z^{2}-3xyz=0

simply by dividing through by 33. This map is natural in that it is a conjugation of automorphisms of the level set κ1(0)\kappa^{-1}(0) with the automorphisms of the Markov cubic. So, since we will not be interested in arithemetic properties of these solutions, we will also refer in what follows to elements of κ1(0)3\kappa^{-1}(0)\cap\mathbb{N}^{3} as Markov numbers. An integer which appears in a Markov triple is called a Markov number.

2.2.1. Reduction theory

The fact that the Markov triples form a single orbit is a corollary of a classical result of Markov in reduction theory which we now explain briefly. We start by giving the Markov triples an ordering, in the obvious way, using the sup norm on 3\mathbb{R}^{3} and proceed to show that any solution can be obtained from (1,1,1)(1,1,1) by repeatedly applying generators of the group of automorphisms of the cubic. This automorphism group can be shown to be generated by:

  1. (1)

    sign change automorphisms (x,y,z)(x,y,z)(x,y,z)\mapsto(x,-y,-z).

  2. (2)

    coordinate permutations eg (x,y,z)(y,x,z)(x,y,z)\mapsto(y,x,z).

  3. (3)

    a Vieta flip (x,y,z)(x,y,3xyz)(x,y,z)\mapsto(x,y,3xy-z).

Since Markov triples are solutions in positive integers we will use just the Markov morphisms that is the group generated by permutations and the Vieta flip. Given a Markov triple (x,y,z)(1,1,1)(x,y,z)\neq(1,1,1) we may apply a permutation so that xy<zx\leq y<z which one can try to “reduce” using the Vieta flip, that is replacing the it by the triple (x,y,3xyz)(x,y,3xy-z). We obtain a smaller solution provided 3xy<2z3xy<2z and this inequality holds for every Markov triple except (1,1,1)(1,1,1). The reduction process gives rise to the structure of a rooted binary tree, the Markov tree, on the Markov triples with (1,1,1)(1,1,1) as the root triple.

3. Automorphisms of \mathbb{Z}*\mathbb{Z}

Let \mathbb{Z}*\mathbb{Z} denote the free group on 2 generators which we will denote α\alpha and β\beta. We think of α\alpha and β\beta as simple loops meeting in a single point in a holed torus.

3.1. Primitive elements

An element γ\gamma\in\mathbb{Z}*\mathbb{Z} is called primitive, if there exists an automorphism ϕ\phi, such that γ=ϕ(α)\gamma=\phi(\alpha). If ϕ(δ)=β\phi(\delta)=\beta, then γ\gamma and δ\delta are called associated primitives.

Following [7] let ϕ:2\phi:\mathbb{Z}*\mathbb{Z}\rightarrow\mathbb{Z}^{2} be the canonical abelianizing homomorphism. The kernel of ϕ\phi is a characteristic subgroup, that is it is Aut()\text{Aut}(\mathbb{Z}*\mathbb{Z})-invariant, so there is a homorphism from Aut()\text{Aut}(\mathbb{Z}*\mathbb{Z}) to the automorphisms of 2\mathbb{Z}^{2} namely GL(2,)\mathrm{GL}(2,\mathbb{Z}). In fact this homomorphism is surjective and faithful. Moreover,

  1. (1)

    If γ\gamma is primitive then ϕ(γ)\phi(\gamma) is a primitive element of 2\mathbb{Z}^{2}.

  2. (2)

    Aut()\text{Aut}(\mathbb{Z}*\mathbb{Z}) acts transitively on primitive elements of \mathbb{Z}*\mathbb{Z}

  3. (3)

    GL(2,)\mathrm{GL}(2,\mathbb{Z}) acts transitively on primitive elements of 2\mathbb{Z}^{2}.

3.2. Topological realizations of automorphisms of \mathbb{Z}*\mathbb{Z}

It is useful, and this is essentially Cohn’s point of view [4], to identify \mathbb{Z}*\mathbb{Z} with the fundamental group of the holed torus and think of Out()\mathrm{Out}(\mathbb{Z}*\mathbb{Z}), the outer automorphism group of \mathbb{Z}*\mathbb{Z}, as being the mapping class group of the holed torus. The outer automorphism group is generated by the so-called Nielsen transformations, which either permute the basis α,β\alpha,\beta, or transform it into αβ,β\alpha\beta,\beta, or αβ1,β\alpha\beta^{-1},\beta. Both these latter transformations can be realized topologically as Dehn twists on the holed torus. We identify 2\mathbb{Z}^{2} with the first homology of the holed torus Σ\Sigma and will call the cover Σ~Σ\tilde{\Sigma}\rightarrow\Sigma corresponding to kerϕ\ker\phi the homology cover or maximal abelian cover of Σ\Sigma.

4. Convexity of the length function

Let γ\gamma\in\mathbb{Z}*\mathbb{Z} be some closed curve and X(γ)\ell_{X}(\gamma) the minimum over the lengths wrt the hyperbolic structure XX of all closed curves in the homotopy class of γ\gamma. In fact the minimum is attained and is lthe length of the unique (oriented) closed geodesic in the homotopy class.

Theorem 4.1.

The length function is convex on H1(X,)H_{1}(X,\mathbb{Q}) and so extends continuously (as a convex function) to H1(X,)H_{1}(X,\mathbb{R}).

Since the proof is short and instructive we shall reproduce it now.

Let XX be a holed torus equipped with a hyperbolic structure. If hh is a primitive homology class (that is, not a multiple of another class), then the shortest multicurve representing a non-trivial homology class hh is a simple closed geodesic and a multiple of such a geodesic otherwise. Further, the shortest multicurve representing hh is unique.

Begin by defining a valuation \ell on the first homology with integral coefficients, where (h)\ell(h) is just the length of the shortest multicurve representing h. The valuation of the trivial homology class is defined to be 0. By the previous paragraph \ell statisfies

(nh)=n(h),n.\ell(nh)=n\ell(h),\forall n\in\mathbb{N}.

Moreover if h,gh,g are not commensurable then \ell satisfies the strict triangle inequality

(h+g)<(h)+(g).\ell(h+g)<\ell(h)+\ell(g).

This is because the union of the shortest multicurves representing hh and gg is not embedded, that is there is a transverse intersection somewhere. Hence, the shortest multicurve representing h+gh+g is strictly shorter than the union of the minimisers for h,gh,g.

5. Proof of Theorem 1.1.

Refer to caption
Figure 2. Triples in a subtree of Markov’s tree.
Refer to caption
Figure 3. Corresponding rationals in the Farey tree.

The correspondence between rationals and Markov numbers is determined by the values at 0/1,1/2,1/10/1,1/2,1/1 and, following [8] (see Figures 2 and 3), these are by convention (1,5,2)(1,5,2). Now, passing from these fractions to primitive lattice points we have the values of the norm at (1,0),(2,1),(1,1)(1,0),(2,1),(1,1) We can easily find the value at (0,1)(0,1) using Vieta flipping and we see that it is 11 ie the same value as for (1,0)(1,0). In fact one can go a step further and show that the three vectors (1,0),(0,1),(1,1)(1,0),(0,1),(-1,1) correspond to the fundamental triple of Markov numbers (1,1,1)(1,1,1). The unit ball of the associated stable norm looks just like the convex set depicted in Figure 1.

Now let p,qp,q be non negative real numbers as in the statement of Theorem 1.1. For any ii the point (q+i,p)(q+i,p) lies on the line y=py=p. Since :2+\ell:\mathbb{R}^{2}\mapsto\mathbb{R}_{+} is a proper convex function this line meets the set

S={(x,y)2,(x,y)=(q,p)}S=\{(x,y)\in\mathbb{R}^{2},\,\ell(x,y)=\ell(q,p)\}

in exactly two points one of which is (q,p)(q,p) and the other has a negative xx coordinate. It follows that for i>0i>0 the point (q+i,p)(q+i,p) is outside of the norm ball

B={(x,y)2,(x,y)(q,p)}B=\{(x,y)\in\mathbb{R}^{2},\,\ell(x,y)\leq\ell(q,p)\}

and the first part of the theorem follows.

A similar argument using a vertical line instead of a horizontal one yields a proof of the second part of the Theorem 1.1.

Finally for the third part one must consider the line (q,p)+i(1,1)(q,p)+i(-1,1). It is easy to check that it meets SS in (q,p)(q,p) and (p,q)(p,q) and so for i>0i>0 we have the required inequality.

Refer to caption
Figure 4. A portion of the set BB with the point (q,p)(q,p) on its boundary SS and the three lines (horizontal, vertical and diagonal) that appear in our proof of the conjectures.

References

  • [1] M. Aigner, Markov’s theorem and 100 years of the uniqueness conjecture. A mathematical journey from irrational numbers to perfect matchings. Springer, Cham, 2013.
  • [2] J. Berstel, A.Lauve, C. Reutenauer, F. Saliola. Combinatorics on Words: Christoffel Words and Repetitions in Words, CRM Monograph Series Volume: 27; 2008
  • [3] Buser, Peter, and K -D. Semmler. The geometry and spectrum of the one holed torus. Commentarii Mathematici Helvetici 63.1 (1988): 259-274.
  • [4] Harvey Cohn Approach to Markov’s Minimal Forms Through Modular Functions Annals of Mathematics Second Series, Vol. 61, No. 1 1955
  • [5] William M Goldman The modular group action on real SL(2)–characters of a one-holed torus Geom. Topol. Volume 7, Number 1 (2003), 443-486.
  • [6] Greg McShane, Hugo Parlier, Multiplicities of simple closed geodesics and hypersurfaces in Teichmüller space, Geom. Topol. Volume 12, Number 4 (2008), 1883-1919.
  • [7] Greg McShane, Igor Rivin A norm on homology of surfaces and counting simple geodesics International Mathematics Research Notices, Volume 1995, Issue 2, 1995
  • [8] M. Rabideaua, R. Schiffler, Continued fractions and orderings on the Markov numbers, Advances in Mathematics Vol 370, 2020.
  • [9] C Lagisquet and E. Pelantová and S. Tavenas and L. Vuillon, On the Markov numbers: fixed numerator, denominator, and sum conjectures. https://arxiv.org/abs/2010.10335
  • [10] Zagier, Don B. (1982). On the Number of Markov Numbers Below a Given Bound. Mathematics of Computation. 160 (160): 709–723