This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Two-variable polynomials with dynamical Mahler measure zero

Annie Carter Matilde Lalín Michelle Manes Alison Beth Miller  and  Lucia Mocz Annie Carter: Department of Mathematics, University of California San Diego, 9500 Gilman Drive # 0112, La Jolla, CA 92093-0112, USA [email protected] Matilde Lalín: Département de mathématiques et de statistique, Université de Montréal. CP 6128, succ. Centre-ville. Montreal, QC H3C 3J7, Canada [email protected] Michelle Manes: Department of Mathematics, University of Hawaii, 2565 McCarthy Mall, Honolulu, HI 96822, USA [email protected] Alison Beth Miller: Mathematical Reviews, 416 Fourth St., Ann Arbor, MI 48103, USA [email protected] Lucia Mocz: Department of Mathematics, University of Chicago, 5734 S. University Avenue, Room 108 Chicago, IL, 60637, USA [email protected]
Abstract.

We discuss several aspects of the dynamical Mahler measure for multivariate polynomials. We prove a weak dynamical version of Boyd–Lawton formula and we characterize the polynomials with integer coefficients having dynamical Mahler measure zero both for the case of one variable (Kronecker’s lemma) and for the case of two variables, under the assumption that the dynamical version of Lehmer’s question is true.

Key words and phrases:
Mahler measure, dynamical Mahler measure, polynomial, preperiodic points, equidistribution
2020 Mathematics Subject Classification:
Primary 11R06; Secondary 11G50, 37P15, 37P30

1. Introduction

This paper considers a generalized version of Mahler measure attached to a discrete dynamical system. In [ST12, PST05], the authors consider a single-variable generalized Mahler measure, which we extend here to multivariate polynomials. We begin with some background on both Mahler measure and discrete dynamical systems as well as the connections between them.

1.1. Mahler measure

The (logarithmic) Mahler measure of a non-zero rational function P(x1,xn)×P\in\mathbb{C}(x_{1},\dots x_{n})^{\times} is defined by

m(P)=1(2πi)n𝕋nlog|P(z1,,zn)|dz1z1dznzn,\mathrm{m}(P)=\frac{1}{(2\pi i)^{n}}\int_{\mathbb{T}^{n}}\log|P(z_{1},\cdots,z_{n})|\frac{dz_{1}}{z_{1}}\cdots\frac{dz_{n}}{z_{n}},

where 𝕋n={(z1,,zn)n:|z1|==|zn|=1}\mathbb{T}^{n}=\{(z_{1},\dots,z_{n})\in\mathbb{C}^{n}\,:\,|z_{1}|=\cdots=|z_{n}|=1\} is the unit torus.

In the particular case of a single variable polynomial, Jensen’s formula implies that if

P(x)=ai(xαi),P(x)=a\prod_{i}(x-\alpha_{i}),

then

(1.1) m(P)=log|a|+|αi|>1log|αi|.\mathrm{m}(P)=\log|a|+\sum_{|\alpha_{i}|>1}\log|\alpha_{i}|.

The single variable polynomial version was first introduced by Lehmer [Leh33] as a useful tool in a method for producing large prime numbers by generalizing Mersenne’s sequences.

It is natural to ask the question about the range of possible values of Mahler measure. For P[x]P\in\mathbb{Z}[x], Kronecker’s lemma [Kro57] implies that m(P)=0\mathrm{m}(P)=0 if and only if PP is (up to a sign) monic and a product of cyclotomic polynomials and a monomial.

From the context of finding large prime numbers, Lehmer posed the following question:

Given ε>0\varepsilon>0, can we find a polynomial P[x]P\in\mathbb{Z}[x] such that 0<m(P)<ε0<\mathrm{m}(P)<\varepsilon?

Lehmer showed that

m(x10+x9x7x6x5x4x3+x+1)=0.162357612\mathrm{m}(x^{10}+x^{9}-x^{7}-x^{6}-x^{5}-x^{4}-x^{3}+x+1)=0.162357612\dots

and declared that this was the polynomial with the smallest positive measure that he was able to find. To this day, Lehmer’s question remains unanswered and Lehmer’s polynomial remains the one with the smallest known positive Mahler measure. Some key progress on this question can be found, for example, in the works of Breusch [Bre51] and Smyth [Smy71], who proved that the Mahler measures of non-reciprocal polynomials have a lower bound; Dobrowolski [Dob79], who gave the first lower bound that approaches zero when the degree goes to infinity; and the recent proof of Dimitrov [Dim] of the Schinzel–Zassenhaus conjecture.

Mahler measure for multivariate polynomials was first considered by Mahler [Mah62] in connection to heights and their applications in transcendence theory. Later Smyth [Smy81, Boy81b] obtained the first exact formulas involving Dirichlet LL-functions and the Riemann zeta function, such as

m(1+x+y)=\displaystyle\mathrm{m}(1+x+y)= 334πL(χ3,2),\displaystyle\frac{3\sqrt{3}}{4\pi}L(\chi_{-3},2),
m(1+x+y+z)=\displaystyle\mathrm{m}(1+x+y+z)= 72π2ζ(3).\displaystyle\frac{7}{2\pi^{2}}\zeta(3).

More recently, various formulas have been proven or conjectured involving LL-functions associated to more complicated arithmetic-geometric objects such as elliptic curves. For example, the following formula was conjectured by Deninger [Den97] and Boyd [Boy98] and proven by Rogers and Zudilin [RZ14]

m(x+1x+y+1y+1)=154π2L(E15,2),\mathrm{m}\left(x+\frac{1}{x}+y+\frac{1}{y}+1\right)=\frac{15}{4\pi^{2}}L(E_{15},2),

where E15E_{15} is an elliptic curve of conductor 15.

We finish the discussion on classical Mahler measure by recalling a result due to Boyd [Boy81b] and Lawton [Law83]. If P(x1,,xn)×P\in\mathbb{C}(x_{1},\dots,x_{n})^{\times}, then

(1.2) limq(𝐤)m(P(x,xk2,,xkn))=m(P(x1,,xn)),\lim_{q({\bf k})\rightarrow\infty}\mathrm{m}(P(x,x^{k_{2}},\dots,x^{k_{n}}))=\mathrm{m}(P(x_{1},\dots,x_{n})),

where

q(𝐤)=min{H(𝐬):𝐬=(s2,,sn)n1,𝐬(0,,0), and j=2nsjkj=0}q({\bf k})=\min\left\{H({\bf s}):{\bf s}=(s_{2},\dots,s_{n})\in\mathbb{Z}^{n-1},{\bf s}\not=(0,\dots,0),\mbox{ and }\sum_{j=2}^{n}s_{j}k_{j}=0\right\}

where H(𝐬)=max{|sj|:2jn}H({\bf s})=\max\{|s_{j}|:2\leq j\leq n\}. Intituively, the above limit is taken while k2,,knk_{2},\dots,k_{n} go to infinity independently from each other.

This result expresses the Mahler measure of a multivariate polynomial in terms of the Mahler measure of a single variable polynomial. It sheds light into the Lehmer conjecture for multivariate polynomials, since the existence of any such polynomial with small Mahler measure would immediately imply the existence of infinitely many single variable polynomials with small Mahler measure.

1.2. Arithmetic dynamics

In classical (holomorphic) dynamics, one studies topological and analytic properties of orbits of points under iteration of self-maps of \mathbb{C} or, more generally, of a complex manifold. In the younger field of arithmetic dynamics, the self-maps act on number theoretic objects (for example, algebraic varieties) and the objects and maps are defined over fields of number theoretic interest (number fields, function fields, finite fields, non-archimedean fields, etc.).

We fix the following notation: Let XX be a set, possibly with additional structure. For f:XXf:X\to X and LAut(X)L\in\operatorname{Aut}(X), we write

(1.3) fn=fffn-fold composition, and fL:=L1fL.f^{n}=\underbrace{f\circ f\circ\cdots\circ f}_{\text{$n$-fold composition}},\quad\text{ and }\quad f^{L}:=L^{-1}\circ f\circ L.

This conjugation is a natural dynamical equivalence relation in that it respects iteration; that is, (fL)n=(fn)L(f^{L})^{n}=(f^{n})^{L}. In the sequel, we are concerned with polynomial maps f:f:\mathbb{C}\to\mathbb{C}, so we consider equivalence up to conjugation by affine maps L(z)=az+b[z]L(z)=az+b\in\mathbb{C}[z] with a0a\neq 0.

Let αX\alpha\in X. The forward orbit of α\alpha under ff is the set 𝒪f(α)={fn(α):n0}\mathcal{O}_{f}(\alpha)=\{f^{n}(\alpha)\,:\,n\geq 0\}. Questions in arithmetic dynamics are often motivated by an analogy between arithmetic geometry and dynamical systems in which, for example, rational and integral points on varieties correspond to rational and integral points in orbits, and torsion points on abelian varieties correspond to preperiodic points (points with finite orbit). See [Sil07] for more comprehensive background and motivation and [BIJ+19] for a survey of recent progress in the field of arithmetic dynamics.

Two complementary sets play an important role in holomorphic dynamics: The Fatou set of ff is the maximal open set on which the family of iterates {fn:n1}\{f^{n}\,:\,n\geq 1\} is equicontinuous (considered as maps on 1()\mathbb{P}^{1}(\mathbb{C})); its complement is the Julia set of ff, which we denote 𝒥f\mathcal{J}_{f}. Informally, the Julia set is where the interesting dynamics happens. For a polynomial f:f:\mathbb{C}\to\mathbb{C}, we may define the Julia set a bit more concretely.

Let f[z]f\in\mathbb{C}[z]. The filled Julia set of ff is

𝒦f={z:fn(z)↛ as n}.\mathcal{K}_{f}=\{z\in\mathbb{C}\,:\,f^{n}(z)\not\rightarrow\infty\mbox{ as }n\rightarrow\infty\}.

The Julia set of ff is the boundary of the filled Julia set. That is, 𝒥f=𝒦f\mathcal{J}_{f}=\partial\mathcal{K}_{f}. It follows from these definitions that for a polynomial f[z]f\in\mathbb{C}[z], both 𝒦f\mathcal{K}_{f} and 𝒥f\mathcal{J}_{f} are compact. In dynamical Mahler measure, the Julia set 𝒥f\mathcal{J}_{f} for a general polynomial ff will play the role of the unit torus 𝕋\mathbb{T}.

Two families of polynomials play an important role in this work: The pure power maps f(z)=zdf(z)=z^{d} and the Chebyshev polynomials. Both of these families have special dynamical properties that arise because they are related to endomorphisms of algebraic groups.

Consider the multiplicative group 𝔾m\mathbb{G}_{m} where for a field KK, the KK-valued points are 𝔾m(K)=K\mathbb{G}_{m}(K)=K^{*}. The endomorphism ring of 𝔾m\mathbb{G}_{m} is \mathbb{Z}:

\displaystyle\mathbb{Z} End(𝔾m)\displaystyle\to\operatorname{End}(\mathbb{G}_{m})
d\displaystyle d zd.\displaystyle\mapsto z^{d}.

Iteration of pure power maps is particularly easy to understand:

f(z)=zd means fn(z)=zdn.f(z)=z^{d}\quad\text{ means }\quad f^{n}(z)=z^{d^{n}}.

For d2d\geq 2 and KK\subseteq\mathbb{C} we have three cases: If |α|>1|\alpha|>1, then |αdn||\alpha^{d^{n}}|\to\infty with nn\to\infty. If |α|<1|\alpha|<1, then |αdn|0|\alpha^{d^{n}}|\to 0 with nn\to\infty. If |α|=1|\alpha|=1, then |αdn|=1|\alpha^{d^{n}}|=1 for all nn. So for pure power maps, we can understand the Julia sets completely: 𝒦f\mathcal{K}_{f} is the unit disc, and 𝒥f\mathcal{J}_{f} is the unit circle.

To give another example, since the automorphism of 𝔾m\mathbb{G}_{m} given by zz1z\mapsto z^{-1} commutes with the power map zzdz\mapsto z^{d}, the power map descends to an endomorphism of 𝔸1\mathbb{A}^{1}, which we denote TdT_{d}, the dthd^{\operatorname{th}} Chebyshev polynomial.111Classically the Chebyshev polynomials are normalized so that T~d(cost)=cos(dt)\widetilde{T}_{d}(\cos t)=\cos(dt). The two normalizations satisfy T~d(w)=12Td(2w)\widetilde{T}_{d}(w)=\frac{1}{2}T_{d}(2w).

𝔾mzzd𝔾m𝔾m/{z=z1}zzd𝔾m/{z=z1}zz+z1zz+z1𝔸1wTd(w)𝔸1\begin{CD}\mathbb{G}_{m}@>{{z\mapsto z^{d}}}>{}>\mathbb{G}_{m}\\ @V{}V{}V@V{}V{}V\\ \mathbb{G}_{m}/\{z=z^{-1}\}@>{{z\mapsto z^{d}}}>{}>\mathbb{G}_{m}/\{z=z^{-1}\}\\ @V{}V{{z\mapsto z+z^{-1}}}V@V{}V{{z\mapsto z+z^{-1}}}V\\ \mathbb{A}^{1}@>{{w\mapsto T_{d}(w)}}>{}>\mathbb{A}^{1}\\ \end{CD}

Taking as a definition the fact that Td[w]T_{d}\in\mathbb{Z}[w] satisfies

(1.4) Td(z+z1)=zd+zd,T_{d}(z+z^{-1})=z^{d}+z^{-d},

one may prove existence and uniqueness of the Chebyshev polynomials along with a recursive formula:

(1.5) Td(w)={2d=0,wd=1,wTd1(w)Td2(w)d2.T_{d}(w)=\begin{cases}2&d=0,\\ w&d=1,\\ wT_{d-1}(w)-T_{d-2}(w)&d\geq 2.\end{cases}

The following rule for composition of Chebyshev polynomials follows directly from the definition in (1.4):

TdTe(w)=Tde(w)=TeTd(w),T_{d}\circ T_{e}(w)=T_{de}(w)=T_{e}\circ T_{d}(w),

and it gives a simple form of iteration

(1.6) Tdn(w)=Tdn(w).T_{d}^{n}(w)=T_{d^{n}}(w).

For d2d\geq 2, we have two cases for Td:T_{d}:\mathbb{C}\rightarrow\mathbb{C}: If α[2,2]\alpha\in[-2,2], then Td(α)[2,2]T_{d}(\alpha)\in[-2,2]. If α[2,2]\alpha\in\mathbb{C}\setminus[-2,2], then |Tdn(α)|=|Tdn(α)||T_{d}^{n}(\alpha)|=|T_{d^{n}}(\alpha)|\to\infty with nn\to\infty. Thus we understand the Julia set in this case as well: 𝒦Td=𝒥Td=[2,2]\mathcal{K}_{T_{d}}=\mathcal{J}_{T_{d}}=[-2,2].

See [Sil07, Chapter 6] for more on the dynamics of pure power maps, Chebyshev polynomials, and other maps arising from algebraic groups, including proofs of some of the statements above.

1.3. Dynamical Mahler measure and statement of results

An important tool in the study of arithmetic dynamics is the canonical height of a point in 1\mathbb{P}^{1} relative to a morphism ff, defined by Call and Silverman [CS93] and modeled after Tate’s construction of the canonical (Néron-Tate) height on abelian varieties. The standard (Weil) height of a point β=ab\beta=\frac{a}{b}\in\mathbb{Q} (written in lowest terms) can be computed as h(β)=max{|a|,|b|}h(\beta)=\max\{|a|,|b|\}, and, observing that aa and bb are (up to sign) the coefficients of the minimal polynomial of α\alpha over \mathbb{Z}, this definition extends easily to algebraic numbers α¯\alpha\in\bar{\mathbb{Q}}. If f[z]f\in\mathbb{Q}[z] is a polynomial and α¯1\alpha\in\mathbb{P}^{1}_{\bar{\mathbb{Q}}}, we then define:

h^f(α)=limndnh(fn(α)),\hat{h}_{f}(\alpha)=\lim_{n\to\infty}d^{-n}h(f^{n}(\alpha)),

where fnf^{n} represents the nn-fold composition of ff and d=deg(f)d=\deg(f). In analogy with the canonical height on abelian varieties, we find that this limit exists and that

  • h^f(f(α))=(degf)h^f(α)\hat{h}_{f}(f(\alpha))=(\deg f)\hat{h}_{f}(\alpha), and

  • h^f(α)=0\hat{h}_{f}(\alpha)=0 if and only if α\alpha is a preperiodic point for ff (that is, fm(α)=fn(α)f^{m}(\alpha)=f^{n}(\alpha) for some natural numbers m>nm>n).

Let α¯\alpha\in\bar{\mathbb{Q}} with minimal polynomial P[z]P\in\mathbb{Z}[z]. The theory of heights and Mahler measure are connected via the formula

(1.7) [(α):]h(α)=m(P).[\mathbb{Q}(\alpha):\mathbb{Q}]h(\alpha)=\mathrm{m}(P).

Let 𝒦\mathcal{K}\subset\mathbb{C} be compact. For any Borel probability measure ν\nu on 𝒦\mathcal{K} the energy of ν\nu is given by [Ran95, Definition 3.2.1]:

I(ν)=𝒦𝒦log|zw|dν(z)𝑑ν(w).I(\nu)=\int_{\mathcal{K}}\int_{\mathcal{K}}\log|z-w|d\nu(z)d\nu(w).

An equilibrium measure on 𝒦\mathcal{K} is a measure μ\mu satisfying

I(μ)=supνI(ν),I(\mu)=\sup_{\nu}I(\nu),

where the supremum is taken over all Borel probability measures. An equilibrium measure always exists and is often unique [Ran95, Theorem 3.2.2].

Let

f(z)=adzd++a0[z],f(z)=a_{d}z^{d}+\cdots+a_{0}\in\mathbb{C}[z],

where ad0a_{d}\not=0 and d2d\geq 2.

Brolin [Bro65], Lyubich [Lyu83], and Freire, Lopes, and Mañe [FLM83] construct a unique equilibrium measure μf\mu_{f}, supported on the Julia set 𝒥f\mathcal{J}_{f}, which is invariant under ff. That is, fμf=μff_{*}\mu_{f}=\mu_{f}, where the push-forward fμf_{*}\mu is defined by fμ(B)=μ(f1(B))f_{*}\mu(B)=\mu(f^{-1}(B)) for any Borel set BB. We also have

(1.8) I(μf)=1d1log|ad|,I(\mu_{f})=-\frac{1}{d-1}\log|a_{d}|,

and in particular, I(μf)=0I(\mu_{f})=0 when ff is monic.

Suppose that f[z]f\in\mathbb{Z}[z] is monic, and that β=ab\beta=\frac{a}{b}\in\mathbb{Q} (written in lowest terms). Then one can show that

h^f(β)=𝒥flog|bza|dμf(z).\hat{h}_{f}(\beta)=\int_{\mathcal{J}_{f}}\log|bz-a|d\mu_{f}(z).

More generally, if α\alpha is an algebraic integer with monic minimal polynomial P(z)P(z), then

(1.9) [(α):]h^f(α)=𝒥flog|P(z)|dμf(z)=:mf(P).[\mathbb{Q}(\alpha):\mathbb{Q}]\hat{h}_{f}(\alpha)=\int_{\mathcal{J}_{f}}\log|P(z)|d\mu_{f}(z)=:\mathrm{m}_{f}(P).

Notice that we are associating the canonical height for the rational map ff to a generalized Mahler measure mf\mathrm{m}_{f} for polynomials P[x]P\in\mathbb{Z}[x]. Taking f(z)=z2f(z)=z^{2} recovers the familiar Mahler measure from equation (1.7).

Piñeiro, Szpiro, and Tucker [PST05] have shown that, like heights, this generalized Mahler measure may be computed via an adelic formula:

(1.10) [K(α):]h^f(α)=places v of K𝒥f,vlog|P(z)|vdμf,v(z),[K(\alpha):\mathbb{Q}]\hat{h}_{f}(\alpha)=\sum_{\text{places $v$ of }K}\int_{\mathcal{J}_{f,v}}\log|P(z)|_{v}d\mu_{f,v}(z),

where KK is a number field, α1(K)\alpha\in\mathbb{P}^{1}(K), PP is a minimal polynomial for α\alpha over KK, and 𝒥f,v\mathcal{J}_{f,v} denotes the Julia set of f:1(v)1(v)f:\mathbb{P}^{1}(\mathbb{C}_{v})\to\mathbb{P}^{1}(\mathbb{C}_{v}). For finite vv, 𝒥f,vlog|P(z)|vdμf,v(z)=0\int_{\mathcal{J}_{f,v}}\log|P(z)|_{v}d\mu_{f,v}(z)=0 unless ff has bad reduction at vv or all the coefficients of PP have nonzero vv-adic valuation. Thus, equation (1.9) follows from (1.10) by specializing to K=K=\mathbb{Q} and α\alpha algebraic integer.

The contributions at the finite places in equation (1.10) were shown to have the appropriate integral forms by Favre and Rivera-Letelier [FRL06, Proposition 1.3]. Thus, equation (1.9) can also be extended to the case where ff is not necessarily monic if the integral is replaced by a sum of integrals at different places.

Szpiro and Tucker [ST12] used diophantine approximation to show that this generalized Mahler measure of a polynomial PP at a place vv can be computed by averaging log|P|v\log|P|_{v} over the periodic points of ff.

It is not easy to give precise examples of mf(P)\mathrm{m}_{f}(P). In the classical case when f(z)=zdf(z)=z^{d} with d2d\geq 2, then mf(P)=m(P)\mathrm{m}_{f}(P)=\mathrm{m}(P) for P(x)×P\in\mathbb{C}(x)^{\times}. For TdT_{d} the dthd^{\text{th}} Chebyshev polynomial with d2d\geq 2, normalized so that Td(z+z1)=zd+zdT_{d}(z+z^{-1})=z^{d}+z^{-d}, one can show that mTd(P)=m(P(z+z1))\mathrm{m}_{T_{d}}(P)=\mathrm{m}\left(P\circ(z+z^{-1})\right). Because of the underlying group structure discussed above, these examples are straightforward to compute (see [CLMM]). For general polynomials, exact computation is much more difficult, and in fact these two families are the exceptions to many theorems stated in the sequel.

One can compute approximations to dynamical Mahler measure in some cases. For example, Ingram [Ing13] gave an asymptotic formula for mf(x)\mathrm{m}_{f}(x) for f(z)=z2+cf(z)=z^{2}+c as cc\rightarrow\infty

mf(x)=\displaystyle\mathrm{m}_{f}(x)= log|c|2+18log(1+2c+1c2+1c3)+O(1c7)\displaystyle\frac{\log|c|}{2}+\frac{1}{8}\log\left(1+\frac{2}{c}+\frac{1}{c^{2}}+\frac{1}{c^{3}}\right)+O\left(\frac{1}{c^{7}}\right)
=\displaystyle= log|c|2+14c18c2+524c3516c4+1740c52948c6+O(1c7).\displaystyle\frac{\log|c|}{2}+\frac{1}{4c}-\frac{1}{8c^{2}}+\frac{5}{24c^{3}}-\frac{5}{16c^{4}}+\frac{17}{40c^{5}}-\frac{29}{48c^{6}}+O\left(\frac{1}{c^{7}}\right).

Similarly for f(z)=(zc)2f(z)=(z-c)^{2}, one can find

mf(x)=log|c|12c14c2512c358c41720c52924c6+O(1c7).\mathrm{m}_{f}(x)=\log|c|-\frac{1}{2c}-\frac{1}{4c^{2}}-\frac{5}{12c^{3}}-\frac{5}{8c^{4}}-\frac{17}{20c^{5}}-\frac{29}{24c^{6}}+O\left(\frac{1}{c^{7}}\right).

It is natural to ask about higher-dimensional (multivariate) forms of this generalized Mahler measure.

Definition 1.1.

Let f[z]f\in\mathbb{Z}[z] monic of degree d2d\geq 2. The ff-dynamical Mahler measure of P(x1,xn)×P\in\mathbb{C}(x_{1},\dots x_{n})^{\times} is given by

(1.11) mf(P)=𝒥f𝒥flog|P(z1,,zn)|dμf(z1)𝑑μf(zn).\mathrm{m}_{f}(P)=\int_{\mathcal{J}_{f}}\dots\int_{\mathcal{J}_{f}}\log|P(z_{1},\cdots,z_{n})|d\mu_{f}(z_{1})\cdots d\mu_{f}(z_{n}).

We will later see that the above integral always converges and that in fact, mf(P)0\mathrm{m}_{f}(P)\geq 0 if P[x1,,xn]P\in\mathbb{Z}[x_{1},\dots,x_{n}] is nonzero.

It is an interesting to ask whether multivariable dynamical Mahler measure can also be interpreted as a height. We do not answer the question in this paper, but we suspect that the answer is yes. Zhang [Zha95], in the more general setting of a variety XX with a map Φ:XX\Phi:X\to X and a line bundle \mathcal{L} such that Φ=d\Phi^{*}\mathcal{L}=\mathcal{L}^{d}, defined a canonical height hΦ,(Y)h_{\Phi,\mathcal{L}}(Y) of an arbitrary subvariety YXY\subset X. Chambert-Loir and Thuillier [CLT09] showed that in the case where X=nX=\mathbb{P}^{n}, Φ([x0,xn])=[x02,,xn2]\Phi([x_{0},\dotsc x_{n}])=[x_{0}^{2},\dotsc,x_{n}^{2}] and =𝒪(1)\mathcal{L}=\mathcal{O}(1), that, for a homogeneous polynomial P[x0,,xn]P\in\mathbb{Z}[x_{0},\dotsc,x_{n}] and YY the hypersurface {P(x0,x1,,xn)=0}\{P(x_{0},x_{1},\dotsc,x_{n})=0\}, hΦ,(Y)=m(P)h_{\Phi,\mathcal{L}}(Y)=\mathrm{m}(P) is the usual multivariable Mahler measure of PP, which is also equal to the multivariable Mahler measure of the inhomogeneous polynomial P(1,x1,,xn)P(1,x_{1},\dotsc,x_{n}).

We conjecture that the multivariable dynamical Mahler measure can be recovered in a similar way, using instead the map Φ=f××f:(1)n(1)n\Phi=f\times\cdots\times f:(\mathbb{P}^{1})^{n}\to(\mathbb{P}^{1})^{n}, which acts as ff on each of the nn components.

In this note, we prove the following results.

Lemma 1.2 (Dynamical Kronecker’s Lemma).

Let f[z]f\in\mathbb{Z}[z] be monic of degree d2d\geq 2 and let P[x]P\in\mathbb{Z}[x]. If mf(P)=0\mathrm{m}_{f}(P)=0, then P(x)=±i(xαi)P(x)=\pm\prod_{i}(x-\alpha_{i}) where each αi\alpha_{i} is a preperiodic point of ff.

Proposition 1.3 (Weak Dynamical Boyd–Lawton).

Let f[z]f\in\mathbb{Z}[z] monic of degree d2d\geq 2 and let P[x,y]P\in\mathbb{C}[x,y]. Then

lim supnmf(P(x,fn(x)))mf(P(x,y)).\limsup_{n\to\infty}\mathrm{m}_{f}(P(x,f^{n}(x)))\leq\mathrm{m}_{f}(P(x,y)).

The above statement is expected to be true with an equality. This would correspond to a dynamical version of the Boyd–Lawton limit (1.2).

For the main result, we will assume the following conjecture.

Conjecture 1.4 (Dynamical Lehmer’s conjecture).

[Sil07, Conjecture 3.25] There is some δ=δf>0\delta=\delta_{f}>0 such that any single variable polynomial P[x]P\in\mathbb{Z}[x] with mf(P)>0\mathrm{m}_{f}(P)>0 satisfies mf(P)>δ\mathrm{m}_{f}(P)>\delta.

Theorem 1.5.

Assume the Dynamical Lehmer’s conjecture.

Let f[z]f\in\mathbb{Z}[z] be a monic polynomial of degree d2d\geq 2 which is not conjugate to zdz^{d} or to ±Td(z)\pm T_{d}(z), where Td(z)T_{d}(z) is the dd-th Chebyshev polynomial. Then any polynomial P[x,y]P\in\mathbb{Z}[x,y] which is irreducible in [x,y]\mathbb{Z}[x,y] (but not necessarily irreducible in [x,y]\mathbb{C}[x,y]) with mf(P)=0\mathrm{m}_{f}(P)=0, and which contains both variables xx and yy, divides a product of complex polynomials of the following form:

f~n(x)L(f~m(y)),\tilde{f}^{n}(x)-L(\tilde{f}^{m}(y)),

where m,n0m,n\geq 0 are integers, L[z]L\in\mathbb{C}[z] is a linear polynomial commuting with an iterate of ff, and f~[z]\tilde{f}\in\mathbb{C}[z] is a nonlinear polynomial of minimal degree commuting with an iterate of ff (with possibly different choices of LL, f~\tilde{f}, nn, and mm for each factor).

As a partial converse, suppose there exists a product of complex polynomials FjF_{j} such that

  1. (1)

    each FjF_{j} has the form f~n(x)L(f~m(y))\tilde{f}^{n}(x)-L(\tilde{f}^{m}(y)), where LL and f~\tilde{f} are as above (with possibly different choices of LL, f~\tilde{f}, nn, and mm for each jj);

  2. (2)

    Fj[x,y]\prod F_{j}\in\mathbb{Z}[x,y]; and

  3. (3)

    PP divides Fj\prod F_{j} in [x,y]\mathbb{Z}[x,y].

Then mf(P)=0\mathrm{m}_{f}(P)=0.

Note that the second half of Theorem 1.5 is only a partial converse. We can guarantee that Fj[x,y]\prod F_{j}\in\mathbb{Q}[x,y] by enlarging the set {Fj}\{F_{j}\} to contain all the Galois conjugates of all its members. However, as we do not know whether the polynomials FjF_{j} have algebraic integer coefficients, we cannot guarantee Fj[x,y]\prod F_{j}\in\mathbb{Z}[x,y]. We conjecture however that it is always the case that the FjF_{j} have algebraic integer coefficients, in which case we could strengthen this converse.

We remark that Theorem 1.5 complements a statement for classical Mahler measure. Indeed, it is shown in [EW99, Theorem 3.10] that for any primitive polynomial P[x1±,,xn±]P\in\mathbb{Z}[x_{1}^{\pm},\dots,x_{n}^{\pm}], m(P)\mathrm{m}(P) is zero if and only if PP is a monomial times a product of cyclotomic polynomials evaluated on monomials. This case is fundamentally different from what is considered in Theorem 1.5, as the classical Mahler measure contains more structure given by the multiplicative nature of the function f(z)=zdf(z)=z^{d}.

It is also interesting to compare our results with [Zha95, Conjecture 2.5], which says that a subvariety whose dynamical canonical height is 0 must be preperiodic. If Zhang’s height agrees with the dynamical Mahler measure, as we expect to be the case, then Lemma 1.2 and Theorem 1.5 can be seen as special cases of Zhang’s Conjecture 2.5 for the dynamical maps f:11f:\mathbb{P}^{1}\to\mathbb{P}^{1} and f×f:1×11×1f\times f:\mathbb{P}^{1}\times\mathbb{P}^{1}\to\mathbb{P}^{1}\times\mathbb{P}^{1}, respectively. (Note that Zhang’s conjecture is known not to hold in general: see [GTZ11] for a counterexample. However, it may be possible to use the arguments in [GNY19] to obtain a proof for the case of f×f:1×11×1f\times f:\mathbb{P}^{1}\times\mathbb{P}^{1}\to\mathbb{P}^{1}\times\mathbb{P}^{1}.)

This article is organized as follows. Section 2 covers some background on complex dynamics. Section 3 is devoted to showing that the dynamical Mahler measure is well-defined in general, and non-negative for polynomials with integral coefficients, while Section 4 discusses the dynamical version of Kronecker’s Lemma 1.2. The Weak Dynamical Boyd–Lawton Proposition 1.3 is discussed and proven in Section 5. It serves as a preparation for the proof of our main result, Theorem 1.5, covered in Sections 6 and 7. Finally, in Section 8, we give a precise description of the polynomials LL and f~\tilde{f} that appear in the statement of this theorem.

Acknowledgements

We are grateful to Patrick Ingram for proposing that we study the dynamical Mahler measure of multivariate polynomials and for many early discussions, and to the anonymous referees for their careful reading of the article and several helpful suggestions. This project was initiated as part of the BIRS workshop “Women in Numbers 5”, held virtually in 2020. We thank the workshop organizers, Alina Bucur, Wei Ho, and Renate Scheidler for their leadership and encouragement that extended for the whole duration of this project. This work has been partially supported by the Natural Sciences and Engineering Research Council of Canada (Discovery Grant 355412-2013 to ML), the Fonds de recherche du Québec - Nature et technologies (Projets de recherche en équipe 256442 and 300951 to ML), the Simons Foundation (grant number 359721 to MM), and the National Science Foundation (grant DMS-1844206 supporting AC and grant DMS-1902772 to LM). This material is based upon work supported by and while the third author served at the National Science Foundation. Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.

2. Some background on complex dynamics

In this section we recall some results from complex dynamics that will be useful for the rest of this article. The main reference for this is the book of Ransford [Ran95].

Let ν\nu be a finite Borel measure on \mathbb{C} with compact support 𝒦\mathcal{K}. One can define the potential function [Ran95, Definition 3.1.1]

pν:[,)p_{\nu}:\mathbb{C}\rightarrow[-\infty,\infty)

by

(2.1) pν(z):=𝒦log|zw|dν(w).p_{\nu}(z):=\int_{\mathcal{K}}\log|z-w|d\nu(w).

The potential pνp_{\nu} is subharmonic in \mathbb{C} by [Ran95, Theorem 3.1.2].

Thus we can write the energy as

I(ν):=𝒦𝒦log|zw|dν(z)𝑑ν(w)=𝒦pν(z)𝑑ν(z).I(\nu):=\int_{\mathcal{K}}\int_{\mathcal{K}}\log|z-w|d\nu(z)d\nu(w)=\int_{\mathcal{K}}p_{\nu}(z)d\nu(z).

Then Frostman’s Theorem [Ran95, Theorem 3.3.4] implies that if μ\mu is the equilibrium measure of 𝒦\mathcal{K}, then

  1. (1)

    pμI(μ)p_{\mu}\geq I(\mu) on \mathbb{C};

  2. (2)

    pμ=I(μ)p_{\mu}=I(\mu) on 𝒦\mathcal{K}\setminus\mathcal{E}, where 𝒦\mathcal{E}\subseteq\partial{\mathcal{K}}.

By (1.8), I(μf)I(\mu_{f})\not=-\infty and pμfp_{\mu_{f}} is a finite number.

Let 𝒟\mathcal{D} a proper sub-domain of the Riemann sphere. A Green’s function [Ran95, Definition 4.4.1] for 𝒟\mathcal{D} is a map g𝒟:𝒟×𝒟(,]g_{\mathcal{D}}:\mathcal{D}\times\mathcal{D}\rightarrow(-\infty,\infty] such that for each w𝒟w\in\mathcal{D}:

  1. (1)

    g𝒟(,w)g_{\mathcal{D}}(\cdot,w) is harmonic on 𝒟{w}\mathcal{D}\setminus\{w\}, and bounded outside each neighbourhood of ww;

  2. (2)

    g𝒟(w,w)=g_{\mathcal{D}}(w,w)=\infty, and as zwz\rightarrow w,

    g𝒟(z,w)={log|z|+O(1)w=,log|zw|+O(1)w;g_{\mathcal{D}}(z,w)=\begin{cases}\log|z|+O(1)&w=\infty,\\ -\log|z-w|+O(1)&w\not=\infty;\end{cases}
  3. (3)

    g𝒟(z,w)0g_{\mathcal{D}}(z,w)\rightarrow 0 as zζz\rightarrow\zeta for ζ𝒟\zeta\in\partial\mathcal{D} outside of a Borel polar subset of 𝒟\partial\mathcal{D}.

A set \mathcal{E}\subset\mathbb{C} is called polar if I(ν)=I(\nu)=-\infty for every finite Borel measure ν0\nu\not=0 for which suppν\mathrm{supp}\,\nu is a compact subset of \mathcal{E} [Ran95, Definition 3.2.2]. Otherwise \mathcal{E} is called non-polar. If 𝒟\mathcal{D} is a domain of the Riemann sphere such that 𝒟\partial\mathcal{D} is non-polar, then, there is a unique Green’s function g𝒟g_{\mathcal{D}} for 𝒟\mathcal{D} [Ran95, Theorem 4.4.2].

In particular, the proof of [Ran95, Theorem 4.4.2] gives us explicitly that, if 𝒟\partial\mathcal{D} is non-polar and if μ\mu is the equilibrium measure on 𝒟\mathcal{D}, then

(2.2) g𝒟(z,)=pμ(z)I(μ)g_{\mathcal{D}}(z,\infty)=p_{\mu}(z)-I(\mu)

for z𝒟{}z\in\mathcal{D}\setminus\{\infty\}.

If 𝒦f\mathcal{K}_{f} is the filled Julia set, we remark that [Ran95, Corollary 6.5.4] implies that the Green’s function of its complement is given by

g1()𝒦f(z,)=limn1deg(f)nlog+|fn(z)|,g_{\mathbb{P}^{1}(\mathbb{C})\setminus\mathcal{K}_{f}}(z,\infty)=\lim_{n\rightarrow\infty}\frac{1}{\deg(f)^{n}}\log^{+}|f^{n}(z)|,

where log+|α|=log|α|\log^{+}|\alpha|=\log|\alpha| when |α|>1|\alpha|>1 and 0 otherwise.

From now on, we will write gf(z,w)g_{f}(z,w) for the Green’s function of the complement of the filled Julia set.

We can summarize the results of the discussion above in the following two propositions:

Proposition 2.1.

If f[z]f\in\mathbb{C}[z] monic of degree d2d\geq 2, then

(2.3) gf(z,)=pμf(z)g_{f}(z,\infty)=p_{\mu_{f}}(z)

for z𝒦fz\notin\mathcal{K}_{f}.

Proof.

By [Ran95, Theorem 6.5.1], the Julia set 𝒥f=𝒦f\mathcal{J}_{f}=\partial\mathcal{K}_{f} is non-polar, and so equation (2.2) applies. Since ff is monic, by (1.8), we have I(μf)=0I(\mu_{f})=0. ∎

Proposition 2.2.

If f[z]f\in\mathbb{C}[z] is monic of degree d2d\geq 2, then pμf(z)0p_{\mu_{f}}(z)\geq 0 for all zz\in\mathbb{C}, and the equality pμf(z)=0p_{\mu_{f}}(z)=0 holds exactly when z𝒦fz\in\mathcal{K}_{f}.

Proof.

If ff is monic, we have I(μf)=0I(\mu_{f})=0 by (1.8) (see also [Ran95, Theorem 6.5.1]), and since pμfI(μf)p_{\mu_{f}}\geq I(\mu_{f}), we conclude that pμf0.p_{\mu_{f}}\geq 0. By Frostman’s Theorem [Ran95, Theorem 3.3.4], we have pμf(z)=I(μf)p_{\mu_{f}}(z)=I(\mu_{f}) for zz in the interior of 𝒦f\mathcal{K}_{f}, and since 1()𝒦f\mathbb{P}^{1}(\mathbb{C})\setminus\mathcal{K}_{f} is a regular domain [Ran95, Corollary 6.5.5], we have pμf(z)=I(μf)p_{\mu_{f}}(z)=I(\mu_{f}) on 𝒥f\mathcal{J}_{f} also [Ran95, Theorem 4.2.4].

Conversely, since the Julia set 𝒥f=𝒦f\mathcal{J}_{f}=\partial\mathcal{K}_{f} is non-polar, we have gf(z,)>0g_{f}(z,\infty)>0 for z𝒦fz\not\in\mathcal{K}_{f} [Ran95, Theorem 4.4.3]. By equation (2.3), we conclude that pμf(z)=0p_{\mu_{f}}(z)=0 if and only if z𝒦fz\in\mathcal{K}_{f}. ∎

3. Convergence and positivity of the dynamical Mahler measure

We will prove that the dynamical Mahler measure defined by (1.11) is well-defined. Moreover, it is non-negative when P[x1,,xn]P\in\mathbb{Z}[x_{1},\dots,x_{n}] is nonzero. We will follow a structure of a proof that is similar to that of [EW99, Lemma 3.7].

We start first with the following result, which can be considered an analogue to Jensen’s formula in the dynamical universe.

Lemma 3.1.

Let f[z]f\in\mathbb{Z}[z] monic of degree d2d\geq 2. If P(x)=ai(xαi)[x]P(x)=a\prod_{i}(x-\alpha_{i})\in\mathbb{C}[x], then

mf(P)=log|a|+ipμf(αi).\mathrm{m}_{f}(P)=\log|a|+\sum_{i}p_{\mu_{f}}(\alpha_{i}).
Proof.

By definition, we have

mf(P)=𝒥flog|ai(zαi)|dμf(z)=𝒥flog|a|dμf(z)+i𝒥flog|zαi|dμf(z),\mathrm{m}_{f}(P)=\int_{\mathcal{J}_{f}}\log\big{|}a\prod_{i}(z-\alpha_{i})\big{|}d\mu_{f}(z)=\int_{\mathcal{J}_{f}}\log|a|d\mu_{f}(z)+\sum_{i}\int_{\mathcal{J}_{f}}\log|z-\alpha_{i}|d\mu_{f}(z),

and the result follows from equation (2.1). ∎

Now we proceed to prove the main result of this section.

Proposition 3.2.

Let f[z]f\in\mathbb{Z}[z] monic of degree d2d\geq 2 and let P(x1,,xn)×P\in\mathbb{C}(x_{1},\dots,x_{n})^{\times}. Then the integral in (1.11) defining the ff-dynamical Mahler measure converges. Moreover, if P[x1,,xn]P\in\mathbb{Z}[x_{1},\dots,x_{n}] is nonzero, then mf(P)0\mathrm{m}_{f}(P)\geq 0.

Proof.

First assume that P[x1,,xn]P\in\mathbb{C}[x_{1},\dots,x_{n}] is nonzero. Let C=supz𝒥f|z|C=\sup_{z\in\mathcal{J}_{f}}|z|, and write

P(x1,,xn)=ai1,,inx1i1xnin.P(x_{1},\dots,x_{n})=\sum a_{i_{1},\dots,i_{n}}x_{1}^{i_{1}}\cdots x_{n}^{i_{n}}.

Then for (z1,,zn)𝒥fn(z_{1},\dots,z_{n})\in\mathcal{J}_{f}^{n}, we have

|P(z1,,zn)||ai1,,in|Ci1++in=:LC(P).|P(z_{1},\dots,z_{n})|\leq\sum|a_{i_{1},\dots,i_{n}}|C^{i_{1}+\cdots+i_{n}}=:L_{C}(P).

From this, we see that mf(P)log(LC(P))<\mathrm{m}_{f}(P)\leq\log(L_{C}(P))<\infty.

For the rest of the statement, we proceed by induction. The case n=1n=1 is a consequence of what we have just discussed and Lemma 3.1, which implies that mf(P)>\mathrm{m}_{f}(P)>-\infty when P[x]P\in\mathbb{C}[x] is nonzero and mf(P)0\mathrm{m}_{f}(P)\geq 0 when P[x]P\in\mathbb{Z}[x] is nonzero. Assume that the result is proven for all polynomials in n1n-1 variables. We write

P(x1,,xn)\displaystyle P(x_{1},\dots,x_{n}) =ad(x2,,xn)x1d++a0(x2,,xn)\displaystyle=a_{d}(x_{2},\dots,x_{n})x_{1}^{d}+\cdots+a_{0}(x_{2},\dots,x_{n})
=ad(x2,,xn)j=1d(x1gj(x2,,xn)),\displaystyle=a_{d}(x_{2},\dots,x_{n})\prod_{j=1}^{d}(x_{1}-g_{j}(x_{2},\dots,x_{n})),

for certain algebraic functions g1,,gdg_{1},\dots,g_{d}. Then we have that

𝒥f𝒥f\displaystyle\int_{\mathcal{J}_{f}}\dots\int_{\mathcal{J}_{f}} log|P(z1,,zn)|dμf(z1)dμf(zn)\displaystyle\log|P(z_{1},\dots,z_{n})|d\mu_{f}(z_{1})\cdots d\mu_{f}(z_{n})
=mf(ad)+j=1d𝒥f𝒥flog|z1gj(z2,,zn)|dμf(z1)𝑑μf(zn)\displaystyle=\mathrm{m}_{f}(a_{d})+\sum_{j=1}^{d}\int_{\mathcal{J}_{f}}\dots\int_{\mathcal{J}_{f}}\log|z_{1}-g_{j}(z_{2},\dots,z_{n})|d\mu_{f}(z_{1})\cdots d\mu_{f}(z_{n})
=mf(ad)+j=1d𝒥f𝒥fpμf(gj(z2,,zn))𝑑μf(z2)𝑑μf(zn)\displaystyle=\mathrm{m}_{f}(a_{d})+\sum_{j=1}^{d}\int_{\mathcal{J}_{f}}\dots\int_{\mathcal{J}_{f}}p_{\mu_{f}}(g_{j}(z_{2},\dots,z_{n}))d\mu_{f}(z_{2})\cdots d\mu_{f}(z_{n})
=mf(ad)+𝒥f𝒥fj=1dpμf(gj(z2,,zn))dμf(z2)dμf(zn).\displaystyle=\mathrm{m}_{f}(a_{d})+\int_{\mathcal{J}_{f}}\dots\int_{\mathcal{J}_{f}}\sum_{j=1}^{d}p_{\mu_{f}}(g_{j}(z_{2},\dots,z_{n}))d\mu_{f}(z_{2})\cdots d\mu_{f}(z_{n}).

Here our final integrand is upper semicontinuous, since pμfp_{\mu_{f}} is upper semicontinuous and the multiset of values {gi(z2,,zn)}\{g_{i}(z_{2},\dotsc,z_{n})\} depends continuously on (z2,,zn)(z_{2},\dotsc,z_{n}) (even though the individual gig_{i} are not themselves continuous at the branch locus). By Proposition 2.2, pμf0p_{\mu_{f}}\geq 0. By induction, mf(ad)0\mathrm{m}_{f}(a_{d})\geq 0. Therefore the above integral is non-negative and the first paragraph shows that the entire right-hand side is bounded above. Therefore the integral defining the Mahler measure exists.

Now assume that P(x1,,xn)×P\in\mathbb{C}(x_{1},\dots,x_{n})^{\times}. We write P=FGP=\frac{F}{G}, with F,G[x1,,xn]F,G\in\mathbb{C}[x_{1},\dots,x_{n}] nonzero and apply the above to conclude that mf(F),mf(G)\mathrm{m}_{f}(F),\mathrm{m}_{f}(G) are well-defined. Then we have mf(P)=mf(F)mf(G)\mathrm{m}_{f}(P)=\mathrm{m}_{f}(F)-\mathrm{m}_{f}(G) and mf(P)\mathrm{m}_{f}(P) is well-defined too. This concludes the proof of the statement. ∎

4. Dynamical Kronecker’s Lemma

The goal of this section is to characterize the single variable polynomials with integral coefficients PP having mf(P)=0\mathrm{m}_{f}(P)=0, which is the dynamical version of Kronecker’s Lemma:

Lemma 4.1 (Kronecker, [Kro57]).

Let P(x)=i(xαi)[x]P(x)=\prod_{i}(x-\alpha_{i})\in\mathbb{Z}[x]. If all of the roots of PP satisfy |αi|1|\alpha_{i}|\leq 1, then the αi\alpha_{i} are zero or roots of unity.

In fact, more is true. Equation (1.1) implies the following statement.

Corollary 4.2.

Let P[x]P\in\mathbb{Z}[x]. If m(P)=0\mathrm{m}(P)=0, then the roots of PP are either zero or roots of unity. Conversely, if PP is primitive and its roots either zero or roots of unity, then m(P)=0\mathrm{m}(P)=0.

An immediate consequence of Lemma 3.1 is the following.

Lemma 4.3.

Let f[z]f\in\mathbb{Z}[z] be monic of degree d2d\geq 2 and P(x)=ai(xαi)[x]P(x)=a\prod_{i}(x-\alpha_{i})\in\mathbb{Z}[x]. Then we have mf(P)=0\mathrm{m}_{f}(P)=0 if and only if |a|=1|a|=1 and all roots αi\alpha_{i} lie in 𝒦f\mathcal{K}_{f}.

Proof.

Because aa\in\mathbb{Z}, we have log|a|0\log|a|\geq 0, and by Proposition 2.2, all the other summands in the dynamical Jensen formula from Lemma 3.1 are also nonnegative. Therefore, mf(P)=0\mathrm{m}_{f}(P)=0 if and only if log|a|=0\log|a|=0 and pμf(αi)=0p_{\mu_{f}}(\alpha_{i})=0 for all ii. The first condition log|a|=0\log|a|=0 holds exactly when a=±1a=\pm 1. The second condition is an immediate consequence of the second statement in Proposition 2.2. ∎

We are now ready to prove the Dynamical Kronecker’s Lemma (Lemma 1.2).

Proof of Lemma 1.2.

Since Lemma 4.3 implies that P(x)P(x) is (up to a sign) monic, we can write

P(x)=±i(xαi).P(x)=\pm\prod_{i}(x-\alpha_{i}).

Consider the polynomials

Pn(x)=i=1d(xfn(αi)).P_{n}(x)=\prod_{i=1}^{d}(x-f^{n}(\alpha_{i})).

The coefficients of PnP_{n} are symmetric functions in the algebraic integers fn(αi)f^{n}(\alpha_{i}) and thus symmetric functions of the αi\alpha_{i}, so they are elements of \mathbb{Z} (all the conjugates of each αi\alpha_{i} are present as roots of PP, since the coefficients are rational). Since mf(P)=0\mathrm{m}_{f}(P)=0, Lemma 4.3 implies that αi𝒦f\alpha_{i}\in\mathcal{K}_{f}, and the same is true for fn(αi)f^{n}(\alpha_{i}). Since 𝒦f\mathcal{K}_{f} is compact, the fn(αi)f^{n}(\alpha_{i}) are uniformly bounded, and the same is true for the coefficients of PnP_{n}. Thus, the set {Pn}n\{P_{n}\}_{n\in\mathbb{N}} must be finite. In other words, there are n1n2n_{1}\not=n_{2} for which

Pn1=Pn2.P_{n_{1}}=P_{n_{2}}.

That means,

{fn1(α1),,fn1(αd)}={fn2(α1),,fn2(αd)}.\{f^{n_{1}}(\alpha_{1}),\dots,f^{n_{1}}(\alpha_{d})\}=\{f^{n_{2}}(\alpha_{1}),\dots,f^{n_{2}}(\alpha_{d})\}.

Thus, there is a permutation σ𝕊d\sigma\in\mathbb{S}_{d} such that

fn1(αi)=fn2(ασ(i)).f^{n_{1}}(\alpha_{i})=f^{n_{2}}(\alpha_{\sigma(i)}).

If σ\sigma has order kk, we get,

fkn1(αi)=fkn2(αi),f^{kn_{1}}(\alpha_{i})=f^{kn_{2}}(\alpha_{i}),

which shows that each αi\alpha_{i} is preperiodic. ∎

Remark 4.4.

A more general approach to dynamical Mahler measure, for example as suggested in [FRL06], may allow mf\mathrm{m}_{f} to be defined for ff a non-monic polynomial or indeed a rational function. The following argument shows that Lemma 1.2 continues to hold in this case: Noting that log|PQ|=log|P|+log|Q|\log|PQ|=\log|P|+\log|Q|, we see from equation (1.9) that mf(P)\mathrm{m}_{f}(P) vanishes with the ff-canonical height of the roots of PP. Over a global field, the canonical height h^f\hat{h}_{f} of a rational function vanishes precisely at preperiodic points for ff (see, for example, [Sil07, Theorem 3.22]).

5. Weak Dynamical Boyd–Lawton Theorem

A result of Boyd [Boy81b] and Lawton [Law83] given by equation (1.2) allows one to compute, in the classical setting, the higher dimensional Mahler measure by a series of approximations by one-dimensional Mahler measures. Here we exhibit the dynamical version of a weaker result for two variables, Proposition 1.3. Our result is analogous to Lemma 2 in [Boy81a].

First we need to establish the following lemma, which is analogous to Lemma 1 in [Boy81a].

Lemma 5.1.

If F:𝒥f2F:\mathcal{J}_{f}^{2}\to\mathbb{R} is continuous, then

limn𝒥fF(z,fn(z))𝑑μf(z)=𝒥f𝒥fF(z1,z2)𝑑μf(z1)𝑑μf(z2).\lim_{n\to\infty}\int_{\mathcal{J}_{f}}F(z,f^{n}(z))d\mu_{f}(z)=\int_{\mathcal{J}_{f}}\int_{\mathcal{J}_{f}}F(z_{1},z_{2})d\mu_{f}(z_{1})d\mu_{f}(z_{2}).
Proof.

By making the change of variables z2=fn(z)z_{2}=f^{n}(z), we have

(5.1) 𝒥fF(z,fn(z))𝑑μf(z)=z2𝒥fdnz1fn(z2)F(z1,z2)dμf(z2),\int_{\mathcal{J}_{f}}F(z,f^{n}(z))d\mu_{f}(z)=\int_{z_{2}\in\mathcal{J}_{f}}d^{-n}\sum_{z_{1}\in f^{-n}(z_{2})}F(z_{1},z_{2})d\mu_{f}(z_{2}),

where d=deg(f)d=\deg(f). We use this to expand our left hand side, interchanging the limit and integral by the bounded convergence theorem.

limn𝒥fF(z,fn(z))𝑑μf(z)=limnz2𝒥fdnz1fn(z2)F(z1,z2)dμf(z2)=z2𝒥f(limndnz1fn(z2)F(z1,z2))𝑑μf(z2)\begin{split}\lim_{n\to\infty}\int_{\mathcal{J}_{f}}F(z,f^{n}(z))d\mu_{f}(z)&=\lim_{n\to\infty}\int_{z_{2}\in\mathcal{J}_{f}}d^{-n}\sum_{z_{1}\in f^{-n}(z_{2})}F(z_{1},z_{2})d\mu_{f}(z_{2})\\ &=\int_{z_{2}\in\mathcal{J}_{f}}\left(\lim_{n\to\infty}d^{-n}\sum_{z_{1}\in f^{-n}(z_{2})}F(z_{1},z_{2})\right)d\mu_{f}(z_{2})\end{split}

By [Ran95, Theorem 6.5.8], we have that the limit inside the parentheses is equal to z1𝒥fF(z1,z2)𝑑μf(z1)\int_{z_{1}\in\mathcal{J}_{f}}F(z_{1},z_{2})d\mu_{f}(z_{1}), as desired. ∎

We are now ready to prove the Weak Dynamical Boyd–Lawton Theorem.

Proof of Proposition 1.3.

We follow Lemma 2 in [Boy81a]. First notice that for δ>0\delta>0, log(|P(z1,z2)|+δ)\log\left(|P(z_{1},z_{2})|+\delta\right) is a continuous function on 𝒥f\mathcal{J}_{f}. Therefore, by Lemma 5.1,

limn𝒥flog(|P(z,fn(z))|+δ)𝑑μf(z)=𝒥f𝒥flog(|P(z1,z2)|+δ)𝑑μf(z1)𝑑μf(z2).\lim_{n\to\infty}\int_{\mathcal{J}_{f}}\log(|P(z,f^{n}(z))|+\delta)d\mu_{f}(z)=\int_{\mathcal{J}_{f}}\int_{\mathcal{J}_{f}}\log(|P(z_{1},z_{2})|+\delta)d\mu_{f}(z_{1})d\mu_{f}(z_{2}).

Since mf(P(z,fn(z))𝒥flog(|P(z,fn(z))|+δ)dμf(z)\mathrm{m}_{f}(P(z,f^{n}(z))\leq\int_{\mathcal{J}_{f}}\log(|P(z,f^{n}(z))|+\delta)d\mu_{f}(z), we have

lim supnmf(P(z,fn(z)))𝒥f𝒥flog(|P(z1,z2)|+δ)𝑑μf(z1)𝑑μf(z2),\limsup_{n\rightarrow\infty}\mathrm{m}_{f}(P(z,f^{n}(z)))\leq\int_{\mathcal{J}_{f}}\int_{\mathcal{J}_{f}}\log(|P(z_{1},z_{2})|+\delta)d\mu_{f}(z_{1})d\mu_{f}(z_{2}),

and this is true for all δ>0\delta>0. Letting δ0\delta\rightarrow 0 and using monotone convergence gives the result. ∎

6. A family of polynomials having dynamical Mahler measure zero

In Section 4 we discussed necessary and sufficient conditions for a single variable polynomial with integer coefficients to have dynamical Mahler measure zero. In this section we will show that the dynamical Mahler measure of P(x,y)=xyP(x,y)=x-y is zero independently of ff as long as ff is monic, and we will also use this fact to construct a more general class of polynomials whose dynamical Mahler measure is zero. This is the beginning of the proof of Theorem 1.5, which will be finished in Section 7.

Recall that the multivariate dynamical Mahler measure

mf(P)=𝒥f𝒥flog|z1z2|dμf(z1)𝑑μf(z2)\mathrm{m}_{f}(P)=\int_{\mathcal{J}_{f}}\int_{\mathcal{J}_{f}}\log|z_{1}-z_{2}|d\mu_{f}(z_{1})d\mu_{f}(z_{2})

is precisely the energy I(μf)I(\mu_{f}) of the equilibrium measure. By (1.8), when ff is monic, I(μf)=0I(\mu_{f})=0, and therefore mf(P)=0\mathrm{m}_{f}(P)=0.

Notice that the strong Boyd–Lawton Conjecture, which claims that mf(P(x,y))=limnmf(P(x,fn(x))\mathrm{m}_{f}(P(x,y))=\lim_{n\rightarrow\infty}\mathrm{m}_{f}(P(x,f^{n}(x)), applies in this case. Indeed, for n>0n>0, the polynomial Pn(x)=xfn(x)P_{n}(x)=x-f^{n}(x) has roots equal to the nn-periodic points of ff, so

mf(Pn)=fn(z)=zgf(z,),\mathrm{m}_{f}(P_{n})=\sum_{f^{n}(z)=z}g_{f}(z,\infty),

where gf(z,)g_{f}(z,\infty) denotes the Green’s function of the filled Julia set 𝒦f\mathcal{K}_{f}. However if zz is periodic, then g(z,)=limmdmlog+|fm(z)|g(z,\infty)=\lim_{m\to\infty}d^{-m}\log^{+}|f^{m}(z)| is 0 since the log term is bounded. Hence limnmf(Pn)=0\lim_{n\to\infty}\mathrm{m}_{f}(P_{n})=0, and this coincides with mf(P)=0\mathrm{m}_{f}(P)=0, which is what one would predict from the strong Boyd–Lawton Conjecture.

Now we intend to prove a more general result.

Lemma 6.1.

Let LL be a linear polynomial in [z]\mathbb{C}[z] commuting with an iterate of ff, and f~\tilde{f} be a nonlinear polynomial in [z]\mathbb{C}[z] commuting with an iterate of ff. Then for any nonnegative integers n,mn,m,

mf(f~n(x)L(f~m(y)))=0.\mathrm{m}_{f}\Big{(}\tilde{f}^{n}(x)-L(\tilde{f}^{m}(y))\Big{)}=0.

To prove Lemma 6.1, we first need a lemma on dynamical systems.

Lemma 6.2.

Let g[z]g\in\mathbb{C}[z] be any polynomial commuting with a power of ff (including the case of gg linear). Then gg sends the Julia set 𝒥f\mathcal{J}_{f} to itself. Furthermore, gg preserves the invariant measure μf\mu_{f}. Explicitly, this means that for any test function ϕ\phi, 𝒥fϕ(z)𝑑μf=𝒥fϕ(g(z))𝑑μf\int_{\mathcal{J}_{f}}\phi(z)d\mu_{f}=\int_{\mathcal{J}_{f}}\phi(g(z))d\mu_{f}.

Proof.

Since ff and fnf^{n} have the same Julia set and invariant measure, we can replace ff with fnf^{n} to reduce to the case where gg commutes with ff.

Since gg commutes with ff, the measure gμfg_{*}\mu_{f} on \mathbb{C} defined by gμf(X)=μf(g1(X))g_{*}\mu_{f}(X)=\mu_{f}(g^{-1}(X)) is also invariant under ff and has total measure 11. By uniqueness of the invariant measure, we have gμf=μfg_{*}\mu_{f}=\mu_{f} as desired. Since 𝒥f\mathcal{J}_{f} is the support of μf\mu_{f}, it too is preserved by gg. ∎

Proof of Lemma 6.1.

We apply the invariance of measure in Lemma 6.2 to both variables in this integral, and obtain

𝒥f𝒥flog|f~n(z1)L(f~m(z2))|dμf(z1)𝑑μf(z2)=𝒥f𝒥flog|z1z2|dμf(z1)𝑑μf(z2),\int_{\mathcal{J}_{f}}\int_{\mathcal{J}_{f}}\log|\tilde{f}^{n}(z_{1})-L(\tilde{f}^{m}(z_{2}))|d\mu_{f}(z_{1})d\mu_{f}(z_{2})=\int_{\mathcal{J}_{f}}\int_{\mathcal{J}_{f}}\log|z_{1}-z_{2}|d\mu_{f}(z_{1})d\mu_{f}(z_{2}),

which we know to be 0. ∎

Remark 6.3.

We will see in Section 8, Proposition 8.7 that for f[z]f\in\mathbb{Z}[z] both LL and f~\tilde{f} have algebraic coefficients.

As a corollary, we obtain a result proving part of Theorem 1.5.

Corollary 6.4.

Let f[z]f\in\mathbb{Z}[z] be monic of degree d2d\geq 2, and let P[x,y]P\in\mathbb{Z}[x,y] be nonzero. Suppose there exists a product of complex polynomials FjF_{j} such that

  1. (1)

    each FjF_{j} has the form f~n(x)L(f~m(y))\tilde{f}^{n}(x)-L(\tilde{f}^{m}(y)), where LL is a linear polynomial commuting with an iterate of ff and f~\tilde{f} is a nonlinear polynomial of minimal degree commuting with an iterate of ff (with possibly different choices of LL, f~\tilde{f}, nn, and mm for each jj);

  2. (2)

    Fj[x,y]\prod F_{j}\in\mathbb{Z}[x,y]; and

  3. (3)

    PP divides Fj\prod F_{j} in [x,y]\mathbb{Z}[x,y].

Then mf(P)=0\mathrm{m}_{f}(P)=0.

Proof.

We have

jFj(x,y)=P(x,y)Q(x,y)\prod_{j}F_{j}(x,y)=P(x,y)Q(x,y)

for some Q[x,y]Q\in\mathbb{Z}[x,y]. Taking Mahler measures of both sides, we get:

0=mf(P)+mf(Q)0=\mathrm{m}_{f}(P)+\mathrm{m}_{f}(Q)

and both summands are nonnegative (since PP and QQ have integer coefficients), so we must have that mf(P)=mf(Q)=0\mathrm{m}_{f}(P)=\mathrm{m}_{f}(Q)=0. ∎

7. Two-variable Dynamical Kronecker

Our goal in this section is to finish the proof of our main result, Theorem 1.5. More precisely, in Corollary 6.4 we proved that certain polynomials have dynamical Mahler measure zero. It is now time to prove that those are the only polynomials that satisfy this condition. In other words we will prove the following result.

Theorem 7.1.

Assume the Dynamical Lehmer’s Conjecture 1.4.

Let f[z]f\in\mathbb{Z}[z] be a monic polynomial of degree d2d\geq 2 which is not conjugate to zdz^{d} or to ±Td(z)\pm T_{d}(z), where Td(z)T_{d}(z) is the dd-th Chebyshev polynomial. Let P[x,y]P\in\mathbb{Z}[x,y] be such that mf(P)=0\mathrm{m}_{f}(P)=0, PP is irreducible in [x,y]\mathbb{Z}[x,y], and PP uses both variables. Then every irreducible factor of PP in [x,y]\mathbb{C}[x,y] divides a polynomial of the following form:

f~n(x)L(f~m(y)),\tilde{f}^{n}(x)-L(\tilde{f}^{m}(y)),

where m,n0m,n\geq 0 are integers, LL is a linear polynomial commuting with an iterate of ff, and f~\tilde{f} is a nonlinear polynomial of minimal degree commuting with an iterate of ff.

In order to prove Theorem 7.1 we will need to assume the Dynamical Lehmer’s Conjecture 1.4. In addition, we will use Proposition 1.3. Finally, we will also need the following unlikely intersections result due to Ghioca, Nguyen and Ye [GNY19].

Proposition 7.2.

[GNY19, Theorem 1.5] Let f[z]f\in\mathbb{C}[z] be a polynomial of degree d2d\geq 2 which is not conjugate to zdz^{d} or to ±Td(z)\pm T_{d}(z), and let Φ:1×11×1\Phi:\mathbb{P}^{1}\times\mathbb{P}^{1}\to\mathbb{P}^{1}\times\mathbb{P}^{1} be defined by Φ(x,y)=(f(x),f(y)).\Phi(x,y)=(f(x),f(y)). Let C1×1C\subset\mathbb{P}^{1}\times\mathbb{P}^{1} be an irreducible curve defined over \mathbb{C} which projects dominantly onto both coordinates. Then CC contains infinitely many preperiodic points under the action of Φ\Phi if and only if CC is an irreducible component of the locus of an equation of the form f~n(x)=L(f~m(y))\tilde{f}^{n}(x)=L(\tilde{f}^{m}(y)), where m,n0m,n\geq 0 are integers, LL is a linear polynomial commuting with an iterate of ff, and f~\tilde{f} is a nonlinear polynomial of minimal degree commuting with an iterate of ff.

The following definition will be key to our proof:

Definition 7.3.

The triple (f,P,α)(f,P,\alpha) with f[z]f\in\mathbb{C}[z], P[x,y]P\in\mathbb{C}[x,y], and α\alpha\in\mathbb{C} satisfies the bounded orders property (BOP) if the set of nonnegative integers

{ordαP(x,fn(x))}\{\mathrm{ord}_{\alpha}P(x,f^{n}(x))\}

is bounded. (In the above sequence we ignore any terms where P(x,fn(x))P(x,f^{n}(x)) is the zero polynomial, which can happen for only finitely many nn.)

We say that (f,P)(f,P) satisfies the preperiodic bounded orders property (preperiodic BOP) if (f,P,α)(f,P,\alpha) has BOP for every α\alpha\in\mathbb{C} such that α\alpha is a preperiodic point of ff.

The following general reduction properties follow directly from the definition. We omit their proof.

Lemma 7.4.

Let f[z]f\in\mathbb{C}[z] and P[x,y]P\in\mathbb{C}[x,y].

  1. (1)

    (Translation) Let Pα(x,y)=P(x+α,y+α)P_{-\alpha}(x,y)=P(x+\alpha,y+\alpha) and fα(z)=f(z+α)αf_{-\alpha}(z)=f(z+\alpha)-\alpha. If (fα,Pα,0)(f_{-\alpha},P_{-\alpha},0) has BOP, then (f,P,α)(f,P,\alpha) has BOP.

  2. (2)

    (Iteration) For 0i<k0\leq i<k let P(id×fi)P\circ(\mathrm{id}\times f^{i}) be given by P(id×fi)(x,y)=P(x,fi(y))P\circ(\mathrm{id}\times f^{i})(x,y)=P(x,f^{i}(y)). If (fk,P(id×fi),α)(f^{k},P\circ(\mathrm{id}\times f^{i}),\alpha) has BOP for 0i<k0\leq i<k, then (f,P,α)(f,P,\alpha) has BOP.

  3. (3)

    (Divisibility) If (f,P,α)(f,P,\alpha) has BOP and QPQ\mid P, then (f,Q,α)(f,Q,\alpha) has BOP.

Now we consider some less immediate reduction properties.

Lemma 7.5 (Composition/Pushforward reduction).

Suppose that f[z]f\in\mathbb{C}[z] and P[x,y]P\in\mathbb{C}[x,y].

  1. (4)

    (Composition) Let fP=P(f×f)f^{*}P=P\circ(f\times f) be the polynomial given by fP=P(f(x),f(y))f^{*}P=P(f(x),f(y)). If (f,P,f(α))(f,P,f(\alpha)) has BOP, then (f,fP,α)(f,f^{*}P,\alpha) has BOP.

  2. (5)

    (Pushforward) For P(x,y)P(x,y) irreducible in [x,y]\mathbb{C}[x,y], we define fP(x,y)f_{*}P(x,y) to be an irreducible polynomial in [x,y]\mathbb{C}[x,y] which vanishes on the curve {(f(x),f(y)):P(x,y)=0}\{(f(x),f(y))\,:\,P(x,y)=0\} (fP(x,y)f_{*}P(x,y) is defined up to multiplication by a non-zero scalar222The fact that the pushforward is only defined up to multiplication by a scalar is not significant, since we only use this construction in the context of order of vanishing.). More generally, we can extend this definition multiplicatively: for any P(x,y)[x,y]P(x,y)\in\mathbb{C}[x,y] with irreducible factorization P=jFjP=\prod_{j}F_{j}, define fP=jfFjf_{*}P=\prod_{j}f_{*}F_{j}.

    Then for any PP in [x,y]\mathbb{C}[x,y], if (f,fP,f(α))(f,f_{*}P,f(\alpha)) has BOP, (f,P,α)(f,P,\alpha), also has BOP.

Proof.

(4) Composition: For Q=fPQ=f^{*}P and Qn(z)=Q(z,fn(z))Q_{n}(z)=Q(z,f^{n}(z)),

ordz=αQn(z)\displaystyle\mathrm{ord}_{z=\alpha}Q_{n}(z) =ordz=αP(f(z),fn+1(z))\displaystyle=\mathrm{ord}_{z=\alpha}P(f(z),f^{n+1}(z))
=ordz=αPn(f(z))\displaystyle=\mathrm{ord}_{z=\alpha}P_{n}(f(z))
=ordz=α(f(z)f(α))ordz=f(α)Pn(z).\displaystyle=\mathrm{ord}_{z=\alpha}(f(z)-f(\alpha))\cdot\mathrm{ord}_{z^{\prime}=f(\alpha)}P_{n}(z^{\prime}).

Hence the sequence ordz=αQn(z)\mathrm{ord}_{z=\alpha}Q_{n}(z) is bounded if ordz=f(α)Pn(z)\mathrm{ord}_{z^{\prime}=f(\alpha)}P_{n}(z^{\prime}) is.

(5) Pushforward: Assume (f,fP,f(α))(f,f_{*}P,f(\alpha)) has BOP. Since BOP is preserved by taking products, we can without loss of generality assume that PP is irreducible. By the composition property, (f,ffP,α)(f,f^{*}f_{*}P,\alpha) also has BOP. However, by construction, the two-variable polynomial ffPf^{*}f_{*}P vanishes at any (x,y)(x,y) with P(x,y)=0P(x,y)=0, and since PP is irreducible, we must have PffPP\mid f^{*}f_{*}P by the Nullstellensatz. Hence (f,P,α)(f,P,\alpha) has BOP by divisibility. ∎

The motivation for the definition of BOP is the following result.

Proposition 7.6.

Theorem 7.1 holds under the extra hypothesis that (f,P)(f,P) satisfies preperiodic BOP.

Proof.

Suppose that P[x,y]P\in\mathbb{Z}[x,y] is irreducible and mf(P)=0\mathrm{m}_{f}(P)=0. We consider the family of polynomials Pn(x)=P(x,fn(x))P_{n}(x)=P(x,f^{n}(x)).

By Weak Dynamical Boyd–Lawton (Proposition 1.3), we have that lim supnmf(Pn(x))=0\limsup_{n\to\infty}\mathrm{m}_{f}(P_{n}(x))=0. We now apply the Dynamical Lehmer’s Conjecture 1.4 to obtain that mf(Pn(x))=0\mathrm{m}_{f}(P_{n}(x))=0 for n0n\gg 0. Then if α\alpha is any root of Pn(x)P_{n}(x), then, by the single variable Kronecker’s Lemma 1.2, α\alpha is preperiodic for ff.

We now argue that the set S(P)={α:Pn(α)=0 for some n}S(P)=\{\alpha\,:\,P_{n}(\alpha)=0\text{ for some }n\} is infinite. Indeed,

deg(Pn)=αS(P)ordα(Pn(z))\deg(P_{n})=\sum_{\alpha\in S(P)}\mathrm{ord}_{\alpha}(P_{n}(z))

goes to \infty as nn\to\infty, but preperiodic BOP for (f,P)(f,P) says that the individual summands are bounded, so the sum must be infinite. Notice that S(P)S(P) is the union of sets S(Q)S(Q), as QQ ranges through the irreducible factors of PP in [x,y]\mathbb{C}[x,y], and at least one of these sets must be infinite. However, because PP is irreducible in [x,y]\mathbb{Z}[x,y], the irreducible factors of PP form a set of Galois conjugates, and so the sets S(Q)S(Q) all have the same cardinality, and are all infinite. Hence any irreducible factor QQ of PP has the property that the set S(Q)={α:Qn(α)=0 for some n}S(Q)=\{\alpha\,:\,Q_{n}(\alpha)=0\text{ for some }n\} is infinite.

We are now in a position to apply Proposition 7.2 to QQ. Indeed, if α\alpha is a root of some QnQ_{n}, then the curve Q(x,y)=0Q(x,y)=0 passes through the point (α,fn(α))(\alpha,f^{n}(\alpha)), whose coordinates are both preperiodic for ff. Also, the dominance condition in Proposition 7.2 follows from the fact that PP uses both variables xx and yy.

Thus, we conclude that each complex factor Q(x,y)Q(x,y) of P(x,y)P(x,y) divides a polynomial of the form f~n(x)L(f~m(y))\tilde{f}^{n}(x)-L(\tilde{f}^{m}(y)). ∎

It now remains to show the following result.

Proposition 7.7.

If f[z]f\in\mathbb{C}[z] of degree d2d\geq 2 and P[x,y]P\in\mathbb{C}[x,y] is such that every irreducible factor involves both variables, then (f,P)(f,P) satisfies preperiodic BOP.

Proof.

We must show that if ff and PP satisfy the above conditions, then for any preperiodic point α\alpha of ff, the triple (f,P,α)(f,P,\alpha) satisfies BOP. Our strategy will be to show this in the case where α\alpha is a fixed point of ff, and so we first show that the general case can be reduced to this case.

Assume that we have proved that all such triples (g,Q,β)(g,Q,\beta) with β\beta a fixed point of gg have BOP. Take a triple (f,P,α)(f,P,\alpha) with α\alpha preperiodic for ff. Then there exists an NN such that f2N(α)=fN(α)f^{2N}(\alpha)=f^{N}(\alpha), so that fN(α)f^{N}(\alpha) is a fixed point for fNf^{N}. By hypothesis, (fN,fN(P(id×fi)),fN(α))(f^{N},f^{N}_{\ast}(P\circ(\mathrm{id}\times f^{i})),f^{N}(\alpha)) has BOP for each ii (note that fN(P(id×fi))f^{N}_{\ast}(P\circ(\mathrm{id}\times f^{i})) does not vanish on any horizontal or vertical line because PP does not, so all its irreducible factors involve both variables). By pushforward, (fN,P(id×fi),α)(f^{N},P\circ(\mathrm{id}\times f^{i}),\alpha) has BOP for each ii, and by iteration (f,P,α)(f,P,\alpha) does also.

We have thus reduced the argument to showing that (f,P,α)(f,P,\alpha) has BOP whenever α\alpha is a fixed point for ff. By translation, we can further assume that α=0\alpha=0. Additionally, if f(0)f^{\prime}(0) is a root of unity of order k>1k>1, then for g=fkg=f^{k}, we have that g(0)=1g^{\prime}(0)=1. Using the iteration reduction to replace ff by g=fkg=f^{k} as necessary, we can assume that if f(0)f^{\prime}(0) is a root of unity, then f(0)=1f^{\prime}(0)=1.

So suppose that f[z]f\in\mathbb{C}[z] is a monic polynomial such that f(0)=0f(0)=0 and f(0)f^{\prime}(0) is either 11 or not a root of unity, and P[x,y]P\in\mathbb{C}[x,y] is such that every irreducible factor involves both variables. We show that the sequence {ord0P(z,fn(z))}\{\mathrm{ord}_{0}P(z,f^{n}(z))\} is bounded.

We can factor P(x,y)=h(x)j(ygj(x))P(x,y)=h(x)\prod_{j}(y-g_{j}(x)), where hh is a nonzero polynomial in xx and gjg_{j} are non-constant Puiseux series in xx (formal power series with possibly fractional exponents, and also possibly a finite number of negative exponents, which are obtained by dividing out by hh). Then

ord0P(z,fn(z))=ord0(h(z))+jord0(fn(z)gj(z)).\mathrm{ord}_{0}P(z,f^{n}(z))=\mathrm{ord}_{0}(h(z))+\sum_{j}\mathrm{ord}_{0}(f^{n}(z)-g_{j}(z)).

Since ord0(h(z))\mathrm{ord}_{0}(h(z)) is independent of nn, it suffices to show that each summand ord0(fn(z)gj(z))\mathrm{ord}_{0}(f^{n}(z)-g_{j}(z)) is bounded. Note that if gj(z)g_{j}(z) has a term with a negative or non-integer exponent, since fn(z)f^{n}(z) is a polynomial, this bounds ord0(fn(z)gj(z))\mathrm{ord}_{0}(f^{n}(z)-g_{j}(z)), so we can restrict to the case where gj[[z]]g_{j}\in\mathbb{C}[[z]].

We first deal with the case of f(0)=0f^{\prime}(0)=0 separately. In this case, ord0(fn(z))\mathrm{ord}_{0}(f^{n}(z))\to\infty as nn\to\infty, so ord0(fn(z)g(z))=ord0(g(z))\mathrm{ord}_{0}(f^{n}(z)-g(z))=\mathrm{ord}_{0}(g(z)) for sufficiently large nn and the sequence is bounded.

Hence we are reduced to the case f(0)0f^{\prime}(0)\neq 0, and either f(0)f^{\prime}(0) is not a root of unity or f(0)=1f^{\prime}(0)=1. We will use the following notation: if F[[z]]F\in\mathbb{C}[[z]] is a power series, then [zi]F(z)[z^{i}]F(z) denotes the coefficient of ziz^{i} in F(z)F(z). We have that ord0(fn(z)g(z))\mathrm{ord}_{0}(f^{n}(z)-g(z)) is the smallest ii such that [zi]fn(z)[zi]g(z)[z^{i}]f^{n}(z)\not=[z^{i}]g(z). The claim states that the sequence of such ii is bounded as nn varies, which will follow from the following sublemma:

Lemma 7.8.

Let f(z)f(z) be a polynomial of degree d>1d>1 with f(0)=0f(0)=0 and f(0)0f^{\prime}(0)\neq 0, and either f(0)f^{\prime}(0) is not a root of unity, or f(0)=1f^{\prime}(0)=1. There exists some ii such that the set of coefficients {[zi]fn(z)}n1\{[z^{i}]f^{n}(z)\}_{n\geq 1} contains any number at most once.

Proof of Lemma 7.8.

Since f(0)=0f(0)=0, and f(0)0f^{\prime}(0)\neq 0, we can write f(z)=c1z+higher order termsf(z)=c_{1}z+\text{higher order terms}. If c1c_{1} is not a root of unity, then

fn(z)=c1nz+higher order termsf^{n}(z)=c_{1}^{n}z+\text{higher order terms}

and i=1i=1 works.

On the other hand, if c1=1c_{1}=1, we can write f(z)=z+ci0zi0+higher order termsf(z)=z+c_{i_{0}}z^{i_{0}}+\text{higher order terms}, where ci0c_{i_{0}} is the first nonzero coefficient after the linear one. We can calculate

fn(z)=z+nci0zi0+higher order terms,f^{n}(z)=z+nc_{i_{0}}z^{i_{0}}+\text{higher order terms},

so the value i=i0i=i_{0} works. ∎

Now, for the value of ii given by the above lemma, there can be at most one nn such that [zi]fn(z)=[zi]g(z)[z^{i}]f^{n}(z)=[z^{i}]g(z), and for all other nn we have ord0(fn(z)g(z))i\mathrm{ord}_{0}(f^{n}(z)-g(z))\leq i. Hence the values of ord0(fn(z)g(z))\mathrm{ord}_{0}(f^{n}(z)-g(z)) are bounded. This finishes the proof in the case that α\alpha is a fixed point of ff, and so by our reductions, also the proof of Proposition 7.7. ∎

Combining Propositions 7.6 and 7.7 gives Theorem 7.1. Finally, Theorem 1.5 is a consequence of combining Theorem 7.1 and Corollary 6.4.

8. A detailed study of LL and f~\tilde{f}

For a fixed nonlinear, monic polynomial f[z]f\in\mathbb{Z}[z] of degree dd which is not conjugate to a power function or a Chebyshev polynomial, Theorem 1.5 characterizes the irreducible polynomials P[x,y]P\in\mathbb{Z}[x,y] with mf(P)=0\mathrm{m}_{f}(P)=0 in terms of linear polynomials L(z)L(z) and nonlinear polynomials f~(z)\tilde{f}(z) of minimal degree commuting with an iterate of ff. In this section we describe the polynomials LL and f~\tilde{f} which arise in this manner. Having integral coefficients plays no special role here, so we will work throughout with polynomials over \mathbb{C}.

For a general given polynomial f(z)=i=0daizif(z)=\sum_{i=0}^{d}a_{i}z^{i}, we define β=ad1/(dad)\beta=a_{d-1}/(da_{d}). In our case ff is monic, so β=ad1/d\beta=a_{d-1}/d. The constant β\beta is invariant under replacing ff by one of its iterates. Conjugating ff by zzβz\mapsto z-\beta gives a polynomial fβf_{\beta} of degree dd whose degree-(d1)(d-1) coefficient is 0. Define j(f)j(f) to be the greatest common divisor of those i1i-1 for which ai0a_{i}\neq 0. The key result is the following, due to Boyce:

Theorem 8.1.

[Boy72, Theorem 4] If g(z)g(z) is a degree-mm polynomial that commutes with f(z)f(z), then the degree-mm polynomials commuting with f(z)f(z) are exactly those of the form ug(z)+(u1)βug(z)+(u-1)\beta for uu a j(fβ)j(f_{\beta})-th root of unity.

This will allow us to completely understand the choices for both LL and f~\tilde{f}, modulo the calculation of j(fβn)j(f^{n}_{\beta}), as soon as we have identified suitable seed polynomials gg. In the linear case, we may take g(z)=zg(z)=z. The following result, a form of Ritt’s theorem [Rit23] as it appears in Boyce [Boy72], tells us how to proceed in the nonlinear case:

Theorem 8.2.

[Jul22, Rit23] If two polynomials ff and gg commute under composition, then up to conjugation with the same linear polynomial, either both are power functions, both are Chebyshev polynomials, or both are roots of unity times iterates of the same polynomial (in which case the roots of unity themselves will commute with the latter polynomial).

Since we are assuming that f(z)f(z) is not conjugate to either zdz^{d} or ±Td(z)\pm T_{d}(z), in the nonlinear case we may then take g(z)g(z) to be a polynomial of minimal degree, some iterate of which is equal to f(z)f(z) or to f(z)f(z) times a root of unity that commutes with gg. Note that such a polynomial will commute with all iterates of ff, and will have minimal degree among all nonlinear polynomials which commute with any iterate of ff.

We now wish to determine, for a fixed monic polynomial f(z)f(z), what values are taken by the quantity j(fβn)j(f^{n}_{\beta}). It will simplify the notation to index the coefficients of fβnf^{n}_{\beta} in order of decreasing degree, so we denote by cn,ic_{n,i} the degree-(dni)(d^{n}-i) coefficient of fβnf^{n}_{\beta}; note that cn,0=1c_{n,0}=1 and cn,1=0c_{n,1}=0 for all nn. Let rr denote the greatest common divisor of those indices ii for which c1,i0c_{1,i}\neq 0. As gcd(d1,di1)=gcd(d1,i)\gcd(d-1,d-i-1)=\gcd(d-1,i) for any ii, we have j(fβ)=gcd(d1,r)j(f_{\beta})=\gcd(d-1,r).

Although j(fβn)j(f^{n}_{\beta}) is defined in terms of the full set of coefficients of fβnf^{n}_{\beta}, it turns out that it is in fact completely determined by its largest-degree coefficients cn,0,,cn,dc_{n,0},\ldots,c_{n,d}. We elaborate on this below.

Lemma 8.3.

For i=0,,di=0,\ldots,d, the degree-(dni)(d^{n}-i) coefficient cn,ic_{n,i} of fβn(z)f^{n}_{\beta}(z) is equal to the degree-(dni)(d^{n}-i) coefficient of (fβ(z))dn1(f_{\beta}(z))^{d^{n-1}}. In particular,

cn,i=i1++idn1=ic1,i1c1,idn1.c_{n,i}=\sum_{i_{1}+\cdots+i_{d^{n-1}}=i}c_{1,i_{1}}\cdots c_{1,i_{d^{n-1}}}.
Proof.

Notice that when computing fβn(z)f^{n}_{\beta}(z) as fβ(fβn1(z))f_{\beta}(f^{n-1}_{\beta}(z)) for n2n\geq 2, the contribution to the first d+1d+1 coefficients comes exclusively from the term (fβn1(z))d(f^{n-1}_{\beta}(z))^{d}, since fβf_{\beta} has no degree-(d1)(d-1) term, and the highest coefficient of (fβn1(z))d2(f^{n-1}_{\beta}(z))^{d-2} has degree dn1(d2)=dn2dn1<dndd^{n-1}(d-2)=d^{n}-2d^{n-1}<d^{n}-d.

Further, the first d+1d+1 coefficients of (fβn1(z))d(f^{n-1}_{\beta}(z))^{d} are determined by the first d+1d+1 coefficients of fβn1(z)f^{n-1}_{\beta}(z), since any contribution of the coefficient of degree dn1d1d^{n-1}-d-1 or below will have at most degree dn1(d1)+dn1d1=dnd1d^{n-1}(d-1)+d^{n-1}-d-1=d^{n}-d-1.

Thus the first d+1d+1 coefficients of fβn(z)f^{n}_{\beta}(z) are determined by the first d+1d+1 coefficients of fβn1(z)f^{n-1}_{\beta}(z), from the term of the form (fβn1(z))n(f^{n-1}_{\beta}(z))^{n}. Proceeding by induction, we conclude that the first d+1d+1 coefficients of fβn(z)f^{n}_{\beta}(z) are determined by the first d+1d+1 coefficients of fβ(z)f_{\beta}(z) (i.e. all the coefficients of fβ(z)f_{\beta}(z)), from a term of the form (fβ(z))dn1(f_{\beta}(z))^{d^{n-1}}.

In other words, the first d+1d+1 coefficients of fβn(z)f^{n}_{\beta}(z) are given by the first d+1d+1 coefficients of (fβ(z))dn1(f_{\beta}(z))^{d^{n-1}}. ∎

Lemma 8.4.

Let r=gcd(i:c1,i0)r=\gcd(i\,:\,c_{1,i}\neq 0), as above. Then cn,i=0c_{n,i}=0 unless idn(modr)i\equiv d^{n}\pmod{r}.

Proof.

We have c1,i=0c_{1,i}=0 unless id(modr)i\equiv d\pmod{r} by construction. Suppose towards induction that the statement holds for some fixed nn. If one multiplies kr+dkr+d monomials of degree equivalent to dnd^{n} modulo rr, the product has degree equivalent to (kr+d)(dn)dn+1(modr)(kr+d)(d^{n})\equiv d^{n+1}\pmod{r}. The expression (fβn(z))kr+d(f^{n}_{\beta}(z))^{kr+d} is thus a sum of monomials whose degrees are equivalent to dn+1d^{n+1} modulo rr, and it follows that fβn+1(z)=fβ(fβn(z))f^{n+1}_{\beta}(z)=f_{\beta}(f^{n}_{\beta}(z)) is also. ∎

Lemma 8.5.

Enumerate the indices i0<<imi_{0}<\ldots<i_{m} for which c1,ij0c_{1,i_{j}}\neq 0; note that i0=0i_{0}=0, and since ff is not conjugate to a power function, there is at least one nonzero coefficient c1,ic_{1,i} with i>0i>0. Let

I={ik:gcd(i0,,ik)gcd(i0,,ik1)};I=\{i_{k}\,:\,\gcd(i_{0},\ldots,i_{k})\neq\gcd(i_{0},\ldots,i_{k-1})\};

this set always contains at least i0i_{0} and i1i_{1}. Then cn,i0c_{n,i}\neq 0 for all nn and for all iIi\in I.

Proof.

For k=0,,mk=0,\ldots,m, let rk=gcd(i0,,ik)r_{k}=\gcd(i_{0},\ldots,i_{k}), so that I={ik:rkrk1}I=\{i_{k}\,:\,r_{k}\neq r_{k-1}\}.

As i1i_{1} is the smallest positive index ii such that c1,i0c_{1,i}\neq 0, it is also the smallest positive index such that cn,i0c_{n,i}\neq 0, with cn,i1=dn1c1,i1c_{n,i_{1}}=d^{n-1}c_{1,i_{1}} in this case.

Suppose that ikIi_{k}\in I. It follows that cn,i=0c_{n,i}=0 for i<iki<i_{k} with i0(modrk1)i\not\equiv 0\pmod{r_{k-1}}. The coefficient cn,ikc_{n,i_{k}} is the sum of terms of the form

sc1,s,\prod_{s}c_{1,s},

where the product runs over the entries of a dn1d^{n-1}-tuple of indices summing to iki_{k}. Since rkrk1r_{k}\neq r_{k-1}, the integer rk1r_{k-1} does not divide iki_{k}, so if s<iks<i_{k} for each index ss, then at least one index has s0(modrk1)s\not\equiv 0\pmod{r_{k-1}}, and the corresponding coefficient c1,sc_{1,s} is thus zero. So the only nonzero terms have one index ss equal to iki_{k} and the others equal to 0; in other words,

cn,ik=dn1c1,ik0.c_{n,i_{k}}=d^{n-1}c_{1,i_{k}}\neq 0.\qed
Corollary 8.6.

We have j(fβn)=gcd(dn1,r)j(f^{n}_{\beta})=\gcd(d^{n}-1,r).

Proof.

It follows from Lemma 8.4 that j(fβn)gcd(dn1,r)j(f^{n}_{\beta})\geq\gcd(d^{n}-1,r). From Lemma 8.5, we know that the degree-(dni)(d^{n}-i) coefficient of fβn(z)f^{n}_{\beta}(z) is nonzero for each iIi\in I, and as the greatest common divisor of the elements of II is rr, it follows that j(fβn)gcd(dn1,r)j(f^{n}_{\beta})\leq\gcd(d^{n}-1,r) also. ∎

Proposition 8.7.

Suppose that r=gcd(i:c1,i0)r=\gcd(i\,:\,c_{1,i}\neq 0) has prime factorization piei\prod p_{i}^{e_{i}}. Let

r=pidpiei.r^{\prime}=\prod_{p_{i}\nmid d}p_{i}^{e_{i}}.

Then there are exactly rr^{\prime} choices for the linear polynomial L(z)L(z) and for the nonlinear polynomial f~(z)\tilde{f}(z) appearing in the statement of Theorem 1.5: the linear polynomials commuting with some iterate of ff are exactly those of the form L(z)=uz+(u1)βL(z)=uz+(u-1)\beta for uu an (r)(r^{\prime})-th root of unity, and the nonlinear polynomials of minimal degree commuting with some iterate of ff are exactly those of the form uf~0(z)+(u1)βu\tilde{f}_{0}(z)+(u-1)\beta, where f~0(z)\tilde{f}_{0}(z) is a polynomial of minimal degree, some iterate of which is equal to ff, and uu is an (r)(r^{\prime})-th root of unity.

Proof.

We have shown that j(fβn)=gcd(dn1,r)j(f^{n}_{\beta})=\gcd(d^{n}-1,r). If pidp_{i}\mid d, then pidn1p_{i}\nmid d^{n}-1, so also j(fβn)=gcd(dn1,r)j(f^{n}_{\beta})=\gcd(d^{n}-1,r^{\prime}); in particular, j(fβn)rj(f^{n}_{\beta})\mid r^{\prime}. As gcd(d,r)=1\gcd(d,r^{\prime})=1, dd is a unit in /r\mathbb{Z}/r^{\prime}\mathbb{Z}, so there exists an nn such that dn1(modr)d^{n}\equiv 1\pmod{r^{\prime}}, and in this case we have j(fβn)=rj(f^{n}_{\beta})=r^{\prime}. The result is now an immediate consequence of Theorem 8.1. ∎

Example 8.8.

Consider the polynomial f(z)=fβ(z)=z14z2f(z)=f_{\beta}(z)=z^{14}-z^{2}. Then d=14d=14, β=0\beta=0, r=12r=12, r=3r^{\prime}=3, and

dn\displaystyle d^{n} {2(mod12)if n=1,4(mod12)if n is even,8(mod12)otherwise,\displaystyle\equiv\begin{cases}2\pmod{12}&\text{if $n=1$},\\ 4\pmod{12}&\text{if $n$ is even},\\ 8\pmod{12}&\text{otherwise},\end{cases}
{2(mod3)if n is odd,1(mod3)if n is even.\displaystyle\equiv\begin{cases}2\pmod{3}&\text{if $n$ is odd},\\ 1\pmod{3}&\text{if $n$ is even}.\end{cases}

Thus

j(fβn)=gcd(dn1,3)={1if n is odd,3if n is even.j(f^{n}_{\beta})=\gcd(d^{n}-1,3)=\begin{cases}1&\text{if $n$ is odd},\\ 3&\text{if $n$ is even}.\end{cases}

Thus the linear polynomials commuting with some iterate of ff are exactly those of the form L(z)=uzL(z)=uz for uu a third root of unity. As dd is not a power of any smaller integer, ff is not an iterate of any polynomial of smaller degree, so the nonlinear polynomials of minimal degree commuting with an iterate of ff are exactly those of the form f~(z)=uf(z)\tilde{f}(z)=uf(z) for uu a third root of unity.

References

  • [BIJ+19] Robert Benedetto, Patrick Ingram, Rafe Jones, Michelle Manes, Joseph H. Silverman, and Thomas J. Tucker, Current trends and open problems in arithmetic dynamics, Bull. Amer. Math. Soc. (N.S.) 56 (2019), no. 4, 611–685. MR 4007163
  • [Boy72] William M. Boyce, On polynomials which commute with a given polynomial, Proc. Amer. Math. Soc. 33 (1972), 229–234. MR 291138
  • [Boy81a] David W. Boyd, Kronecker’s theorem and Lehmer’s problem for polynomials in several variables, J. Number Theory 13 (1981), no. 1, 116–121. MR 602452
  • [Boy81b] by same author, Speculations concerning the range of Mahler’s measure, Canad. Math. Bull. 24 (1981), no. 4, 453–469. MR 644535
  • [Boy98] by same author, Mahler’s measure and special values of LL-functions, Experiment. Math. 7 (1998), no. 1, 37–82. MR 1618282
  • [Bre51] Robert Breusch, On the distribution of the roots of a polynomial with integral coefficients, Proc. Amer. Math. Soc. 2 (1951), 939–941. MR 45246
  • [Bro65] Hans Brolin, Invariant sets under iteration of rational functions, Ark. Mat. 6 (1965), 103–144 (1965). MR 194595
  • [CLMM] A. Carter, M. Lalín, M. Manes, and A. B. Miller, On the dynamical Mahler measure, in preparation.
  • [CLT09] Antoine Chambert-Loir and Amaury Thuillier, Mesures de Mahler et équidistribution logarithmique, Ann. Inst. Fourier (Grenoble) 59 (2009), no. 3, 977–1014. MR 2543659
  • [CS93] Gregory S. Call and Joseph H. Silverman, Canonical heights on varieties with morphisms, Compositio Math. 89 (1993), no. 2, 163–205. MR 1255693
  • [Den97] Christopher Deninger, Deligne periods of mixed motives, KK-theory and the entropy of certain 𝐙n{\bf Z}^{n}-actions, J. Amer. Math. Soc. 10 (1997), no. 2, 259–281. MR 1415320
  • [Dim] Vesselin Dimitrov, A proof of the Schinzel-Zassenhaus conjecture on polynomials, arXiv:1912.12545.
  • [Dob79] E. Dobrowolski, On a question of Lehmer and the number of irreducible factors of a polynomial, Acta Arith. 34 (1979), no. 4, 391–401. MR 543210
  • [EW99] Graham Everest and Thomas Ward, Heights of polynomials and entropy in algebraic dynamics, Universitext, Springer-Verlag London, Ltd., London, 1999. MR 1700272
  • [FLM83] Alexandre Freire, Artur Lopes, and Ricardo Mañé, An invariant measure for rational maps, Bol. Soc. Brasil. Mat. 14 (1983), no. 1, 45–62. MR 736568
  • [FRL06] Charles Favre and Juan Rivera-Letelier, Équidistribution quantitative des points de petite hauteur sur la droite projective, Math. Ann. 335 (2006), no. 2, 311–361. MR 2221116
  • [GNY19] D. Ghioca, K. D. Nguyen, and H. Ye, The dynamical Manin-Mumford conjecture and the dynamical Bogomolov conjecture for split rational maps, J. Eur. Math. Soc. (JEMS) 21 (2019), no. 5, 1571–1594. MR 3941498
  • [GTZ11] Dragos Ghioca, Thomas J. Tucker, and Shouwu Zhang, Towards a dynamical Manin-Mumford conjecture, Int. Math. Res. Not. IMRN (2011), no. 22, 5109–5122. MR 2854724
  • [Ing13] Patrick Ingram, Variation of the canonical height for a family of polynomials, J. Reine Angew. Math. 685 (2013), 73–97. MR 3181564
  • [Jul22] Gaston Julia, Mémoire sur la permutabilité des fractions rationnelles, Ann. Sci. École Norm. Sup. (3) 39 (1922), 131–215. MR 1509242
  • [Kro57] L. Kronecker, Zwei Sätze über Gleichungen mit ganzzahligen Coefficienten, J. Reine Angew. Math. 53 (1857), 173–175. MR 1578994
  • [Law83] Wayne M. Lawton, A problem of Boyd concerning geometric means of polynomials, J. Number Theory 16 (1983), no. 3, 356–362. MR 707608
  • [Leh33] D. H. Lehmer, Factorization of certain cyclotomic functions, Ann. of Math. (2) 34 (1933), no. 3, 461–479. MR 1503118
  • [Lyu83] M. Ju. Lyubich, Entropy properties of rational endomorphisms of the Riemann sphere, Ergodic Theory Dynam. Systems 3 (1983), no. 3, 351–385. MR 741393
  • [Mah62] K. Mahler, On some inequalities for polynomials in several variables, J. London Math. Soc. 37 (1962), 341–344. MR 0138593
  • [PST05] Jorge Pineiro, Lucien Szpiro, and Thomas J. Tucker, Mahler measure for dynamical systems on 1{\mathbb{P}}^{1} and intersection theory on a singular arithmetic surface, Geometric methods in algebra and number theory, Progr. Math., vol. 235, Birkhäuser Boston, Boston, MA, 2005, pp. 219–250. MR 2166086
  • [Ran95] Thomas Ransford, Potential theory in the complex plane, London Mathematical Society Student Texts, vol. 28, Cambridge University Press, Cambridge, 1995. MR 1334766
  • [Rit23] J. F. Ritt, Permutable rational functions, Trans. Amer. Math. Soc. 25 (1923), no. 3, 399–448. MR 1501252
  • [RZ14] Mathew Rogers and Wadim Zudilin, On the Mahler measure of 1+X+1/X+Y+1/Y1+X+1/X+Y+1/Y, Int. Math. Res. Not. IMRN (2014), no. 9, 2305–2326. MR 3207368
  • [Sil07] Joseph H. Silverman, The arithmetic of dynamical systems, Graduate Texts in Mathematics, vol. 241, Springer, New York, 2007. MR 2316407
  • [Smy71] C. J. Smyth, On the product of the conjugates outside the unit circle of an algebraic integer, Bull. London Math. Soc. 3 (1971), 169–175. MR 289451
  • [Smy81] by same author, On measures of polynomials in several variables, Bull. Austral. Math. Soc. 23 (1981), no. 1, 49–63. MR 615132
  • [ST12] L. Szpiro and T. J. Tucker, Equidistribution and generalized Mahler measures, Number theory, analysis and geometry, Springer, New York, 2012, pp. 609–638. MR 2867934
  • [Zha95] Shouwu Zhang, Small points and adelic metrics, J. Algebraic Geom. 4 (1995), no. 2, 281–300. MR 1311351