This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Reciprocal-log approximation and planar PDE solvers

Yuji Nakatsukasa and Lloyd N. Trefethen [email protected] and [email protected], Mathematical Institute, University of Oxford, Oxford, OX2 6GG, UK.
Abstract

This article is about both approximation theory and the numerical solution of partial differential equations (PDEs). First we introduce the notion of reciprocal-log or log-lightning approximation of analytic functions with branch point singularities at points {zk}\{z_{k}\} by functions of the form g(z)=kck/(log(zzk)sk)g(z)=\sum_{k}c_{k}/(\log(z-z_{k})-s_{k}), which have NN poles potentially distributed along a Riemann surface. We prove that the errors of best reciprocal-log approximations decrease exponentially with respect to NN and that exponential or near-exponential convergence (i.e., at a rate O(exp(CN/logN))O(\exp(-CN/\log N))) also holds for near-best approximations with preassigned singularities constructed by linear least-squares fitting on the boundary. We then apply these results to derive a “log-lightning method” for numerical solution of Laplace and related PDEs in two-dimensional domains with corner singularities. The convergence is near-exponential, in contrast to the root-exponential convergence for the original lightning methods based on rational functions.

keywords:
reciprocal-log approximation, rational approximation, lightning solver
AMS:
65N35, 41A30, 65E05

1 Introduction

In this paper we introduce a new problem in approximation theory. On the interval [,0][-\infty,0\kern 0.5pt], it is known that functions F(s)F(s) like ease^{as} (Rea>0\hbox{\rm\kern 0.3ptRe\kern 1.0pt}a>0) with essential singularities at s=s=-\infty can be approximated by rational functions r(s)r(s) with exponential convergence as a function of nn, the degree of the rational function. Our starting point in section 2 is the change of variables z=esz=e^{s} from s[,0]s\in[-\infty,0\kern 0.5pt] to z[0,1]z\in[\kern 0.5pt0,1]. This gives us exponentially convergent approximations on [0,1][\kern 0.5pt0,1] of functions f(z)f(z) like zaz^{a} (a>0a>0) with branch point singularities at z=0z=0 by functions of the form g(z)=r(log(z))g(z)=r(\log(z)). In the generic case where the poles {sk}\{s_{k}\} of rr are distinct, the approximation can be written in the partial fractions form

g(z)=c0+k=1ncklog(z)sk.g(z)=c_{0}+\sum_{k=1}^{n}{c_{k}\over\log(z)-s_{k}}. (1)

We show that approximations of this kind can be computed by linear least-squares fitting, and we note the equioscillatory characterization of the error in the case of minimax (best \infty-norm) approximations. With both best and near-best approximations, one encounters the startling property that the oscillations typically cluster double-exponentially near the singularity, readily coming as close (in theory) as a distance of, say, 10100010^{-1000}. This is in contrast to the well-known case of rational approximation, where the clustering is just exponential [22].

Section 3 generalizes the discussion to domains in the complex zz-plane with m1m\geq 1 singularities, typically at corners. Here we consider approximations of the generic form

g(z)=j=1mk=1njcjklog(zzj)sjk+p0(z),g(z)=\sum_{j=1}^{m}\kern 2.0pt\sum_{k=1}^{n_{j}}\kern 1.0pt{c_{jk}\over\log(z-z_{j})-s_{jk}}+p_{0}(z), (2)

where p0p_{0} is a polynomial of degree n0n_{0} (possibly 0), with total number of degrees of freedom N=1+njN=1+\sum n_{j}. The least-squares method extends to this case too and again gives exponentially or near-exponentially convergent approximations. (By near-exponential, we mean at a rate O(exp(CN/logN))O(\exp(-CN/\log N)) with C>0C>0.) This and other claims are first explained heuristically and illustrated numerically. Then they are established rigorously by a succession of theorems in section 4, deriving accuracy estimates from Hermite integrals, and section 5, showing how a problem with several singularities can be decomposed into single-singularity problems by Cauchy integrals over open arcs.

As is typical with computations based on redundant expansions, numerical instability resulting from the use of ill-conditioned matrices may be a problem; we stabilize our computations by the Arnoldi orthogonalization technique presented in [1]. Section 6 considers additional fine points associated with the “10100010^{-1000}\kern 1.0pt” effect mentioned above, proposing in particular that it may be a good idea to constrain the singular part of a reciprocal-log approximation to approach zero more rapidly at each singularity.

Up to this point, the paper is all approximation theory and algorithms. The motivation, however, is numerical solution of PDEs by a “log-lightning” analogue of the lightning solvers for Laplace, Helmholtz, and biharmonic equations presented in [2], [6], and [7]. A proof of concept is briefly presented in section 7 to establish that, as intended, these approximations can be used to derive exponentially convergent new methods for solving PDEs in planar domains with corner singularities.

In a word, this is a paper about approximation by functions with logarithmic branch points rather than just pole singularities. What starts out as a seemingly arbitrary rearrangement of familiar rational approximation theory turns out to have potentially significant consequences for scientific computing.

A publication of Driscoll and Fornberg in 2001 proposed the introduction of a logarithmic term to approximate functions with jumps, with coefficients computed in the manner of quadratic Hermite–Padé approximation [4]. Their technique is different from what is being proposed here, but the use of logarithms to resolve singularities gives a family resemblance.

2 Approximation on [𝟎,𝟏][\kern 0.5pt0,1]

A classic problem in approximation theory is the rational minimax approximation of F(s)=esF(s)=e^{s} on the interval [,0][-\infty,0\kern 0.5pt]. To be precise, for each nn, let us consider rational functions r(s)r(s) of degree nn, meaning that r(s)r(s) can be written as p(s)/q(s)p(s)/q(s) for degree nn polynomials pp and qq. In the generic case where the poles are distinct and finite, rr can be written in partial fractions form as

r(s)=c0+k=1ncksskr(s)=c_{0}+\sum_{k=1}^{n}{c_{k}\over s-s_{k}} (3)

with coefficients {ck}\{c_{k}\} and poles {sk}\{s_{k}\}. Of course, some rational functions have confluent poles (i.e., poles of order >1>1) and cannot be written in this form, and we shall sometimes make use of such approximations.

Cody, Meinardus, and Varga showed in 1969 [3] that approximations r(s)r(s) to F(s)=esF(s)=e^{s} exist with exponential convergence, meaning Fr=O(Cn)\|F-r\|=O(C^{-n}) as nn\to\infty for some C>1C>1, where \|\cdot\| is the \infty-norm on [,0][-\infty,0\kern 0.5pt]. The optimal constant is C=1/H=9.28903,C=1/H=9.28903\dots, where HH is a number related to elliptic integrals known as Halphen’s constant; for a discussion with references, see chapter 25 of [19]. Exponential convergence of best rational approximations at the same asymptotic rate applies to F(s)=easF(s)=e^{as} for any aa with Rea>0\hbox{\rm\kern 0.3ptRe\kern 1.0pt}a>0 and to certain other functions F(s)F(s) related to ease^{as} by rational transformations [16]. Moreover, convergence at this rate occurs not just on [,0][-\infty,0\kern 0.5pt] but throughout the complex plane [16].

By the change of variables z=esz=e^{s} and the definitions f(z)=F(s)f(z)=F(s) and g(z)=r(s)g(z)=r(s), we transplant this theory to a new setting. Now the domain is z[0,1]z\in[\kern 0.5pt0,1], and ff is a function such as zaz^{a} that may have a branch point singularity at z=0z=0. What is new is that (3) now takes the form

g(z)=c0+k=1ncklog(z)sk,g(z)=c_{0}+\sum_{k=1}^{n}{c_{k}\over\log(z)-s_{k}}, (4)

where log\log denotes the standard branch of the logarithm. Thus, by the transplantation of existing theory, we know that a wide class of functions on [0,1][\kern 0.5pt0,1] with branch point singularities at z=0z=0 can be approximated by functions of the form (4) with exponential convergence at the rate O((9.28903)n)O((9.28903\dots)^{-n}).

More important for scientific computing are approximations that are not best but readily computable, still with exponential convergence though not at the optimal rate. One way to derive such approximations is by expressing F(s)F(s) as an integral along a Hankel contour winding around (,0](-\infty,0\kern 0.5pt] in the ss-plane. If the integral is approximated by a truncation of the transplanted equispaced trapezoidal rule on the real axis, one obtains rational approximations (3) to F(s)F(s), hence reciprocal-log approximations (4) to f(z)f(z), which converge at rates O(Cn)O(C^{-n}) with values of CC on the order of 3 or 4 depending on details of contours and parameters. The leader in developing these methods has been Weideman [24, 28]; for a review, see section 15 of [23].

The approach we shall make use of is a variation on this theme motivated by the lightning PDE solvers of [2, 6, 7] and section 7.  Rather than deriving coefficients from the trapezoidal rule, we will obtain them by solving a linear least-squares problem. Typically this will be posed on a discrete subset of [0,1][\kern 0.5pt0,1], with exponential clustering of sample points at z=0z=0. Experiments show that an effective choice of singularities {sk}\{s_{k}\} for such approximations lies on a parabolic contour in the style of [24, 28],

sk=n4(1+iθk)2,θk=π+2π(k12)/n,1kn.s_{k}={n\over 4}(1+i\theta_{k})^{2},\quad\theta_{k}=-\pi+2\pi(k-\textstyle{1\over 2})/n,\quad 1\leq k\leq n. (5)

For simplicity, however, we will also sometimes use a configuration in which all the points sks_{k} are taken to be confluent at a single value s0s_{0} of scale O(n)O(n); we normally take s0=n/2s_{0}=n/2. In this case (4) changes to the confluent form

g(z)=p(1log(z)s0),g(z)=p\kern-1.0pt\left({1\over\log(z)-s_{0}}\right), (6)

where pp is a polynomial of degree nn. As we shall prove in section 4, these methods give exponential convergence as nn\to\infty.

We illustrate these ideas with a MATLAB code segment whose fifth line realizes the distribution (5):

          
          
                M = 1000;
                z = logspace(-24,0,M)’;
                f = sqrt(z);
                n = 10;
                kk = 1:n; tk = -pi + 2*pi*(kk-.5)/n; sk = (n/4)*(1+1i*tk).^2;
                A = [z.^0 1./(log(z)-sk)];
                c = A\f;
                g = A*c;
          

The maximum error on [0,1][\kern 0.5pt0,1] is 1.5×1041.5\times 10^{-4}, and this number decreases exponentially as nn\to\infty, as shown in the first image of Figure 1. There is a marked contrast with the root-exponential behavior of the lightning rational approximation of [6]. The second image shows the singularities {sk}\{s_{k}\} in the case n=32n=32. Note that although the geometry in the ss-plane is just an arc, the corresponding points zk=exp(sk)z_{k}=\exp(s_{k}) in the zz domain lie along a spiral on the Riemann surface of g(z)g(z) associated with its definition in terms of the function log(z)\log(z), clustering exponentially at both ends at z=0z=0. With n=32n=32, the last value in the upper half ss-plane is s3266.1+48.7is_{32}\approx-66.1+48.7i. The corresponding value z32z_{32} has modulus 1.0×10311.0\times 10^{-31} and argument 15.5π15.5\pi, putting it 8 sheets above the main branch of the Riemann surface of g(z)g(z).

Refer to caption
Fig. 1: Exponential convergence of approximations (4)(\ref{partfracr})(5)(\ref{hankelpts}) (top row) and (6)(\ref{partfracrc}) (bottom row) to z\sqrt{z} computed by least-squares fitting on an exponentially clustered grid in [0,1][\kern 0.5pt0,1]. (These data are obtained with Arnoldi stabilization [1]; the numbers obtained without stabilization are shown by the small dots.) A strong contrast is apparent with the root-exponential convergence of lightning rational approximations [6] for the same problem. The right column shows the points {sk}\{s_{k}\} or s0s_{0} in the ss-plane for n=32n=32. Since the imaginary parts Imsk\hbox{\rm\kern 0.3ptIm\kern 1.0pt}s_{k} extend far beyond [π,π][-\pi,\pi] in the first case, the values exp(sk)\exp(s_{k}), which are the poles of g(z)g(z), lie on many different sheets of a Riemann surface.

The second row of Figure 1 shows corresponding results for the confluent-pole approximation (6) with s0=n/2s_{0}=n/2. These poles are further from optimal, and the exponential convergence is about 15% slower. Certain fine points associated with confluent computations are discussed in section 6. To minimize the effects very near the singularity discussed there and make the comparison of Figure 1 a fair one, the least-squares fitting for this figure was done on a grid of 20002000 points logarithmically spaced in [10100,1][10^{-100},1].

Refer to caption
Fig. 2: Error curves for reciprocal-log approximations (4)(\ref{partfracr}) of z\sqrt{z} on [0,1][\kern 0.5pt0,1] with n=2,4,8,16n=2,4,8,16 with singularities sks_{k} given by (5)(\ref{hankelpts}). In the left column, coefficients {ck}\{c_{k}\} are computed by least-squares fitting on an exponential clustered grid from of 10001000 points from 102010^{-20} to 11. In the right column, the fits are then improved to LL^{\infty} optimal for the given {sk}\{s_{k}\} by a Lawson iteration (iteratively reweighted least-squares) [9]. This improves the maximal error by about a factor of 22 at the cost of eliminating the enhanced accuracy for z0z\approx 0. Each pair shares a common vertical scale, with each curve showing n+1n+1 extrema of alternating sign.
Refer to caption
Fig. 3: Error curves for true minimax reciprocal-log approximations of z\sqrt{z} on [0,1][\kern 0.5pt0,1] with n=1,2,4,8n=1,2,4,8 on [0,1][\kern 0.5pt0,1] (left) and [1020,1][10^{-20},1] (right). Note that these values of nn are half those of Figure 2, reflecting the roughly squared accuracy of minimax approximations. Again each pair shares a common vertical scale, and now there are 2n+22n+2 equioscillating extrema.

Figure 2 shows error curves g(z)zg(z)-\sqrt{z}, z[1020,1]z\in[10^{-20},1], for approximations (4). On the left these are the approximations computed by least-squares fitting with the singularities {sk}\{s_{k}\} of (5), as in the first row of Figure (1). Note that the maximal error decreases exponentially with nn and that the error is much smaller for z0z\approx 0 than for z1z\approx 1, because the clustered grid puts more weight there. The right column shows error curves for approximations with the same {sk}\{s_{k}\} whose coefficients have been modified to achieve LL^{\infty} optimality by 20 steps of a linear Lawson iteration, also known as iteratively reweighted least-squares [9]. These equioscillatory error curves are elegant, but we doubt whether the “optimal” approximations are superior in practice. Their maximal error is lower by only about a factor of 2, at the cost of greatly worsening the accuracy for z0z\approx 0. See chapter 16 and Myth 3 of Appendix A in [19].

The poles of our approximations g(z)g(z) lie exponentially close to the origin in the zz-plane, at least when the formula (5) is used. Exponential clustering of poles is familiar in rational approximation theory [22]. Another phenomenon appears in Figure 2 that is unfamiliar, however, and that is doubly-exponential clustering of oscillating extrema. To see this, consider the leftmost minimum of each error curve in the right column of the figure. With n=2,n=2, 44, and 88, these minima lie at approximately z=101.5z=10^{-1.5}, 104.510^{-4.5}, and (as shown by computations on a longer interval) 102510^{-25}. A rough model of such behavior would be zmin=exp(1.7n)z_{\min{}}=\exp(-1.7^{\kern 0.3ptn}), which suggests values for n=16n=16 and 32 on the order of 10200010^{-2000} and 1010,000,00010^{-10{,}000{,}000}. In the figure, we see nothing like this, since the computations are carried out on [1020,1][10^{-20},1], and it would be impossible to resolve much beyond [10300,1][10^{-300},1] in standard IEEE floating point arithmetic. However, this probably does not matter, as the approximants on [0,1][\kern 0.5pt0,1] will not differ much from those associated with a truncated interval such as [1020,1][10^{-20},1]. We discuss these matters further in section 6.

All these approximants are linear, based on poles sks_{k} fixed a priori. What about true minimax approximations, with {sk}\{s_{k}\} chosen optimally to minimize the error? For the current setting of [0,1][\kern 0.5pt0,1], everything is known about such approximations since they are transplants of minimax rational approximants on [,0][-\infty,0\kern 0.5pt] [19, chap. 24]. As illustrated in Figure 3, which was computed by the method outlined on pp. 217–219 of [19], the errors will be about the squares of what we have seen before, and now there are 2n+22n+2 equioscillatory extreme points. We shall not discuss such approximations further, because they become impractical once one moves to planar domains.

3 Approximation on a planar domain

We now turn to domains in the complex plane. In the basic case one has a closed Jordan region EE with piecewise analytic boundary Γ\Gamma and a function ff continuous in EE and analytic in the interior. We suppose that ff is analytic on Γ\Gamma except for branch point singularities at a finite set of points {zj}\{z_{j}\}, 1jm1\leq j\leq m, which may for example be corners. As mentioned in the introduction, we consider approximations to ff of the generic form

g(z)=j=1mk=1njcjklog(zzj)sjk+p0(z),g(z)=\sum_{j=1}^{m}\kern 2.0pt\sum_{k=1}^{n_{j}}\kern 1.0pt{c_{jk}\over\log(z-z_{j})-s_{jk}}+p_{0}(z), (7)

where p0p_{0} is a degree n0n_{0} polynomial, with N=1+njN=1+\sum n_{j}. Each function log(zzj)\log(z-z_{j}) denotes a fixed choice of a branch of the logarithm that is continuous in EE. Since standard hardware puts the branch of the logarithm on the negative real axis, it is often convenient in a computer code to work with rotated terms of the form log(eiθj(zzj))\log(e^{i\theta_{j}}(z-z_{j})) for certain real numbers θj\theta_{j}. We also employ the confluent variant

g(z)=j=1mpj(1log(zzj)sj)+p0(z),g(z)=\sum_{j=1}^{m}\kern 2.0ptp_{j}\kern-2.0pt\left({1\over\log(z-z_{j})-s_{j}}\right)+p_{0}(z), (8)

where each pjp_{j} is a polynomial of degree njn_{j}. In both (7) and (8), we often take n0=0n_{0}=0, so that p0p_{0} reduces to a constant.

The essential point, to be proved in the next two sections, is that exponential or near-exponential convergence is guaranteed so long as the numbers njn_{j} (for j1j\geq 1) increase in proportion to NN as NN\to\infty, in the absence of rounding errors of course. In this section we will explore just one example defined on the domain shown in Figure 5, a two-thirds bite of the unit disk. The function is

f(z)=zlog(12z)(1z/ω)1/2(1z/ω¯)3/2f(z)=z\log(-\textstyle{1\over 2}z)\cdot(1-z/\omega)^{1/2}\cdot(1-z/\bar{\omega})^{3/2} (9)

with ω=eπi/3\omega=e^{\pi i/3}, with branch point singularities at each of the corners. The boundary Γ\Gamma is discretized by 500 points along each segment exponentially clustered down to distances from the corners of about 101410^{-14}.

Refer to caption
Fig. 4: Convergence of three approximants to the function ff of (9)(\ref{3corner}) on the domain EE shown in Figure 5. Down to accuracy of 10810^{-8} or so, one sees root-exponential convergence of the AAA and lightning approximations and exponential convergence of the log-lightning approximations. Beyond this point, rounding errors muddy the water but the log-lightning approximations are by far the most accurate.

As indicated in Figure 4, we computed three approximations to ff for values of NN up to 150150. One curve shows the accuracy of AAA rational approximations of degree N1N-1. (AAA approximations are rational approximations that are computed very quickly by an algorithm described in [12], implemented in the aaa command of Chebfun; we calculated them with the “cleanup” option turned off.) The AAA approximations are close to optimal when the degree is not too large, and this curve shows root-exponential convergence down to an error on the order of 10810^{-8}. Another curve shows lightning rational approximations [6] of the form

r(z)=j=13k=1njcjkzsjk+p0(z)r(z)=\sum_{j=1}^{3}\kern 2.0pt\sum_{k=1}^{n_{j}}\kern 1.0pt{c_{jk}\over z-s_{jk}}+p_{0}(z) (10)

with nj=N1\sum n_{j}=N-1, where p0p_{0} is a polynomial of degree n0n_{0}; we take n0=N/41n_{0}=N/4-1 and nj=N/4n_{j}=N/4 for j=1,2,3j=1,2,3. (The choice n0=0n_{0}=0, which is often a good one for (7) and (8), tends to slow down convergence significantly for (10).) The poles were fixed at distances djk=exp(4(knj))d_{jk}=\exp(4(\sqrt{\vphantom{n_{j}}k}-\sqrt{\vphantom{k}n_{j}}\kern 1.0pt)), 1knj1\leq k\leq n_{j} from the corners zjz_{j} [6, eq. (3.2)]. Note that the convergence is again root-exponential, at a rate five or six times slower than with AAA with its adaptively determined poles. Clearly AAA has some advantages, but it has drawbacks of sometimes greater sensitivity to rounding errors and occasional placement of poles inside a domain where they are not wanted. More fundamentally, as we shall discuss in section 7, lightning methods generalize immediately to approximation of the real part for solving Laplace problems, whereas AAA does not.

Refer to caption
Fig. 5: Phase portraits of the function ff of (9)(\ref{3corner}) and three approximants with N100N\approx 100 degrees of freedom. All the approximations give many digits of accuracy in EE. Outside EE, the branch cuts of ff are approximated by strings of poles for lightning and AAA approximants, and by true branch cuts for the log-lightning approximant. The circus tent effect in the lightning plot reflects the polynomial term p0p_{0} in (10)(\ref{lightning}).

The final curve in Figure 4 shows log-lightning approximations (8) with sj=nj/3s_{j}=n_{j}/3, now with n0=0n_{0}=0 and nj=(N1)/3n_{j}=(N-1)/3 for j=1,2,3j=1,2,3. The convergence appears exponential to around 10910^{-9}, after which it slows for reasons at least partly related to rounding errors.

Figure 5 displays phase portraits [27] for ff and its three approximations with N=100N=100. The rational approximations simulate branch cuts by strings of poles, whereas the reciprocal log approximation incorporates true branch cuts. For ff itself, the shapes of the branch cuts are arbitrary. If we had coded the function differently, their directions could have been altered, for example, but this would have had no effect on the values of ff on Γ\Gamma and therefore on the three approximations and the other three images. For the lightning and log-lightning approximations, the forms of the branch cuts are fixed by our computation: they have been set to be straight lines bisecting the exterior angles, the natural default choice in the absence of other information. For the AAA approximation, the branch cuts are presumably more nearly optimal, curved in a fashion analyzed by Stahl for the case of Padé approximation [15].

4 Convergence theorems: one singularity

In this section we consider the reciprocal-log approximation of a function f(z)f(z) with a branch point singularity at z=0z=0. By arguments adapted from those of section 2 of [6], we show that if ff can be analytically continued along any contour in the complex plane avoiding z=0z=0, then exponentially convergent approximations exist with the singularities {sk}\{s_{k}\} extending a distance O(n)O(n) into the right half-plane (Thm. 2). If the analyticity assumption holds just in a bounded neighborhood of 0, then near-exponentially convergent approximations exist with {sk}\{s_{k}\} of bounded positive real part (Thm. 6).

To make these ideas precise the theorems of this and the next section use the following definition, which covers the familiar singularities of type za(logz)bz^{a}(\log z)^{b} [10, 26]. The first condition, (11), essentially asserts that ff can be analytically continued along arbitrary contours in a punctured neighborhood of each singularity zkz_{k}. But all we really need is continuation along curves that spiral in logarithmically to zkz_{k}, so that after the s=log(zzk)s=\log(z-z_{k}) change of variables they lie in a VV-shaped wedge in the ss-plane, and that is why (11) takes the form it does. The second condition, (12), asserts that ff is Hölder continuous at zkz_{k}, so that with s=log(zzk)s=\log(z-z_{k}) we get exponential decay as Res\hbox{\rm\kern 0.3ptRe\kern 1.0pt}s\to-\infty. A function like (zzk)alog(zzk)(z-z_{k})^{a}\log(z-z_{k}), however, will grow slowly as zz winds around zkz_{k}, and that is why (12) includes the factor involving arg(zzk)\hbox{\rm\kern 0.3ptarg\kern 1.0pt}(z-z_{k}).

Definition 1.

Let EE be a closed set in the complex plane. A function ff is analytic on EE with branch point singularities at z1,,zmz_{1},\dots,z_{m} on the boundary if ff is analytic in the interior of EE and can be analytically continued to a neighborhood of each boundary point of EE that is not among the zkz_{k}; and if moreover there is a neighborhood of each zkz_{k} within which ff can be analytically continued to a multivalued function f~\tilde{f} along any curve that avoids zkz_{k} and satisfies

|arg(zzk)||zzk|τ|\hbox{\rm\kern 0.3ptarg\kern 1.0pt}(z-z_{k})|\leq|z-z_{k}|^{-\tau} (11)

for some constant τ>0\tau>0, with f~\tilde{f} satisfying

|f~(z)f~(zk)|c|zzk|a(1+|arg(zzk)|b)|\tilde{f}(z)-\tilde{f}(z_{k})|\leq c\kern 1.0pt|z-z_{k}|^{a}(1+|\hbox{\rm\kern 0.3ptarg\kern 1.0pt}(z-z_{k})|^{b}) (12)

for some constants a,b,c>0a,b,c>0.

For our first theorem, it is enough to use confluent singularities and consider approximations of the form (6) with s0=σns_{0}=\sigma n,

g(z)=p(1log(z)σn),g(z)=p\kern-1.0pt\left({1\over\log(z)-\sigma n}\right), (13)

where pp is a polynomial of degree nn and σ>0\sigma>0 is a constant. This result is sharp in the sense that faster than exponential convergence is in general not possible, as we know from the equivalence of the problem of reciprocal-log approximation of zaz^{a} on [0,1][\kern 0.5pt0,1] to that of rational approximation of ease^{as} on [,0][-\infty,0\kern 0.5pt]. The condition (14) looks almost like a repeat of (12), but (12) is a local condition, only needing to apply for zz near zkz_{k}, whereas (14) applies even as zz\to\infty. With s=logzs=\log z, it is designed to ensure that F(s)F(s) is under control in the right half of the ss-plane as well as the left.

Theorem 2.

Let EE be a compact set in 𝐂\(,0){\bf C}\kern 1.0pt\backslash\kern 1.0pt(-\infty,0\kern 0.5pt) with 0 on the boundary; more generally, EE might be a multilevel surface wrapping around 0 a finite number of times. Let ff be an analytic function on EE with a branch point singularity at z=0z=0 in the sense of Definition 1, and assume further that ff can be analytically continued to a multivalued function f~\tilde{f} along all curves in the zz-plane avoiding z=0z=0, satisfying in addition to (12)(\ref{bpdef})

|f~(z)|C(1+|z|a)(1+|argz|b)|\tilde{f}(z)|\leq C\kern 0.3pt(1+|z|^{a^{\prime}})(1+|\hbox{\rm\kern 0.3ptarg\kern 1.0pt}z|^{b^{\prime}}) (14)

for some constants a,b,c>0a^{\prime},b^{\prime},c^{\prime}>0. Then for all sufficiently small σ>0\sigma>0, there exist functions of the form (13)(\ref{e1}) satisfying

fgexp(Cn)n\|\kern 0.7ptf-g\|\leq\exp(-Cn)\quad\forall n (15)

for some C>0C>0, where \|\cdot\| is the supremum norm on EE.

By the change of variables s=logzs=\log z and subtraction of f(0)f(0), Theorem 1 is equivalent to the following theorem on rational approximation. As just mentioned, (16) constrains F(s)F(s) in the left half ss-plane, and (17) in the right. The function es+se2se^{s}+s\kern 0.3pte^{2s}, for example, satisfies the conditions with a<1a<1, a>2a^{\prime}>2, and b,b>1b,b^{\prime}>1.

Theorem 3.

Let EE be a subset of 𝐂{\bf C} with bounded ReE\hbox{\rm\kern 0.3ptRe\kern 1.0pt}E and |ImE||\hbox{\rm\kern 0.3ptIm\kern 1.0pt}E|. Let FF be an entire function of ss satisfying for all sufficiently small Res\hbox{\rm\kern 0.3ptRe\kern 1.0pt}s

|F(s)|ceaRes(1+|Ims|b)|F(s)|\leq c\kern 1.0pte^{a\kern 0.7pt\hbox{\scriptsize\rm\kern 0.3ptRe\kern 1.0pt}s}(1+|\hbox{\rm\kern 0.3ptIm\kern 1.0pt}s|^{b}) (16)

and for all ss

|F(s)|c(1+eaRes)(1+|Ims|b)|F(s)|\leq c^{\prime}(1+e^{a^{\prime}\kern 0.7pt\hbox{\scriptsize\rm\kern 0.3ptRe\kern 1.0pt}s})(1+|\hbox{\rm\kern 0.3ptIm\kern 1.0pt}s|^{b^{\prime}}) (17)

for some constants a,b,c,a,b,c>0a,b,c,a^{\prime},b^{\prime},c^{\prime}>0. Then for all sufficiently small σ>0\sigma>0, there exist type (n1,n)(n-1,n) rational functions rr of the form

r(s)=p(1sσn),r(s)=p\kern-1.0pt\left({1\over s-\sigma n}\right), (18)

where pp is a polynomial of degree nn, with

Frexp(Cn)n\|F-r\|\leq\exp(-Cn)\quad\forall n (19)

for some C>0C>0, where \|\cdot\| is the supremum norm on EE.

Thus our job is to prove Theorem 3. The proof will be based on the Hermite integral formula for rational interpolation [25, Thm. 2, Chap. 8]. Neither the poles {sk}\{s_{k}\} nor the interpolation points {αk}\{\alpha_{k}\} in the statement below need to be distinct, and an interpolation point of multiplicity ν>1\nu>1 is interpreted, as usual, to mean interpolation at that point of f,f,,f(ν1)f,f^{\prime},\dots,f^{(\nu-1)}. By a Hankel contour, we mean a continuous curve winding counterclockwise around (,0](-\infty,0\kern 0.5pt] from \infty to \infty.

Lemma 4.

(Hermite integral formula) Let FF be an analytic function of ss satisfying (16)(\ref{h3}) on and inside a Hankel contour Γ\Gamma in the ss-plane. Let interpolation points α1,,αn[,0]\alpha_{1},\dots,\alpha_{n}\in[-\infty,0\kern 0.5pt] and poles s1,,sns_{1},\dots,s_{n} anywhere in the ss-plane be given; neither {αj}\{\alpha_{j}\} nor {sj}\{s_{j}\} need to be distinct. Let rr be the unique type (n1,n)(n-1,n) rational function with poles at {sj}\{s_{j}\} that interpolates FF at {αj}\{\alpha_{j}\}. Then for any ss enclosed by Γ\Gamma,

F(s)r(s)=12πiΓϕ(s)ϕ(t)F(t)ts𝑑t,F(s)-r(s)={1\over 2\pi i}\int_{\Gamma}{\phi(s)\over\phi(t)}{F(t)\over t-s}\kern 1.0ptdt, (20)

where

ϕ(s)=j=1nsαjssj.\phi(s)=\prod_{j=1}^{n}{s-\alpha_{j}\over s-s_{j}}. (21)

In proving Theorem 3, we will suppose E=(,0]E=(-\infty,0\kern 0.5pt] without loss of generality; the estimates change only by constant factors for other domains since the arguments involve a geometry that scales with nn as nn\to\infty. We will use this choice of interpolation points in (,0](-\infty,0\kern 0.5pt]:

αj=nσ(1+tj1tj)2,tj=eiπj/(n+1),1jn.\alpha_{j}=n\kern 1.0pt\sigma\kern-1.0pt\left({1+t_{j}\over 1-t_{j}}\right)^{2},\quad t_{j}=e^{i\pi j/(n+1)},\quad 1\leq j\leq n. (22)

These are derived from potential theory [11, 25], where poles and interpolation points are interpreted as point charges of opposite signs, and a minimal-energy charge configuration gives asymptotically optimal approximations as nn\to\infty. The function

s=nσ(1+t1t)2s=n\kern 0.7pt\sigma\kern-1.0pt\left({1+t\over 1-t}\right)^{2} (23)

maps the unit disk in the tt-plane to the ss-plane slit along (,0](-\infty,0\kern 0.5pt], with t=0t=0 mapping to s=nσs=n\kern 0.5pt\sigma. Thus the equilibrium distribution of interpolation points on the unit circle corresponding to poles at t=0t=0, namely the uniform distribution, maps to the distribution determined by (23) in the ss-plane slit along (,0](-\infty,0\kern 0.5pt] with poles at s=nσs=n\kern 0.5pt\sigma, as in the function (18). (The {αj}\{\alpha_{j}\} are called Fejér–Walsh points [17].) The following lemma, illustrated in Figure 6, was provided to us by Peter Baddoo.

Lemma 5.

With αj\alpha_{j} defined by (22)(\ref{interppts}), sj=nσs_{j}=n\kern 0.7pt\sigma, and ss related to tt by (23)(\ref{potent}), the function ϕ(s)\phi(s) of (21)(\ref{e7}) can be written

ϕ(s)=tn1tn+1(n+1)(t1t)=tn+tn+2++tnn+1.\phi(s)={t^{-n-1}-t^{n+1}\over(n+1)(t^{-1}-t)}={t^{-n}+t^{-n+2}+\cdots+t^{n}\over n+1}. (24)

For s(,0]s\in(-\infty,0\kern 0.5pt],

|ϕ(s)|1.|\phi(s)|\leq 1. (25)

For s𝐂\{(,0]σ}s\in{\bf C}\kern 1.0pt\backslash\{(-\infty,0\kern 0.5pt]\cup\sigma\},

|ϕ(ns)|1/neu(s)as n,|\phi(n\kern 0.3pts)|^{1/n}\to e^{u(s)}\quad\hbox{as }n\to\infty, (26)

where

u(s)=|log|t||>0,u(s)=|\log|t|\kern 2.0pt|>0, (27)

with uniform convergence on compact subsets of 𝐂\{(,0]σ}{\bf C}\kern 1.0pt\backslash\{(-\infty,0\kern 0.5pt]\cup\sigma\}.

Proof.

The formulas (25)–(27) follow readily from (24), so (24) is what must be established. First we note that each complex number snσs\neq n\kern 0.7pt\sigma corresponds under (23) to two numbers tt and t1t^{-1}, and the right-hand side of (24) gives the same value for both, so (24) does indeed define a function of ss. This is a meromorphic function with nn zeros at the points αj\alpha_{j} and a pole of order nn, it can be verified, at s=nσ,s=n\kern 0.7pt\sigma, corresponding to t=0t=0. But at s=s=\infty, corresponding to t=1t=1, this function is analytic. Thus it is a degree nn rational function with the same zeros and poles as ϕ(s)\phi(s) as defined by (21), and since it takes the value 11 at s=s=\infty, it must be the same function. ∎

Proof of Theorem 3. As mentioned above, we take E=(,0]E=(-\infty,0\kern 0.5pt] without loss of generality, and we choose Γ\Gamma for Lemma 4 as a V-shape passing through nσn\kern 0.5pt\sigma at a fixed acute angle, as sketched in Figure 7. (Any acute angle will suffice.) We divide Γ\Gamma into a “head” in the right half-plane, a “tail” with Retn\hbox{\rm\kern 0.3ptRe\kern 1.0pt}t\leq-n, and a “middle” with nRet0-n\leq\hbox{\rm\kern 0.3ptRe\kern 1.0pt}t\leq 0, and show that their contributions IheadI_{\scriptsize\rm head}, ItailI_{\scriptsize\rm tail}, and ImiddleI_{\scriptsize\rm middle} to (20) are each exponentially small. (A quantity depending on nn is exponentially small if it is O(exp(Cn))O(\exp(-Cn)) as nn\to\infty for some C>0C>0 and exponentially large if its reciprocal is exponentially small.) Note that the denominator tst-s of (20) is bounded below in absolute value, |ϕ(s)||\phi(s)| is bounded by 11 by (25), and 2π2\pi is just a constant. Thus it is enough to show that each of the three integrals

|F(t)||ϕ(t)|𝑑t\int{|F(t)|\over|\phi(t)|}\kern 1.0ptdt (28)

is exponentially small. In the tail, F(t)F(t) is exponentially small and decreases exponentially as Ret\hbox{\rm\kern 0.3ptRe\kern 1.0pt}t\to-\infty by (16), whereas 1/|ϕ(t)|1/|\phi(t)| is bounded by Lemma 5; thus ItailI_{\scriptsize\rm tail} is exponentially small. In the middle, F(t)F(t) grows at most algebraically with nn by (16), whereas 1/|ϕ(t)|1/|\phi(t)| is exponentially small by Lemma 5; thus ImiddleI_{\scriptsize\rm middle} is exponentially small. In the head, F(t)F(t) is exponentially large, but by (17), at most O(exp(Aσn))O(\exp(A\kern 0.7pt\sigma n)). On the other hand, by Lemma 5 again, 1/|ϕ(t)|1/|\phi(t)| is exponentially small. In particular, |ϕ(t)|>exp(dn)|\phi(t)|>\exp(d\kern 0.3ptn) uniformly in this region for some d>0d>0 that is independent of σ\sigma. It follows that if σ<d/A\sigma<d/A, then IheadI_{\scriptsize\rm head} too is exponentially small.

Refer to caption
Fig. 6: Level curves n1log|ϕ(ns)|=0,0.1,,1n^{-1}\log|\phi(n\kern 0.5pts)|=0,0.1,\dots,1 for n=100n=100 to illustrate Lemma 5. As nn increases, the contours converge to radial lines, with the level 0 contour (red) narrowing toward (,0](-\infty,0\kern 0.5pt]. Thus |ϕ(ns)|1/n|\phi(n\kern 0.5pts)|^{1/n} approaches a value >1{>}\kern 1.0pt1 at each point s𝐂\{(,0]σ}s\in{\bf C}\kern 1.0pt\backslash\{(-\infty,0\kern 0.5pt]\cup\sigma\} and is bounded by 11 for s(,0]s\in(-\infty,0\kern 0.5pt].

Refer to caption

Fig. 7: Hankel contour Γ\Gamma for the proof of Theorem 3 with E=(,0]E=(-\infty,0\kern 0.5pt], with all poles at s=nσs=n\kern 0.5pt\sigma. In the tail, F(t)F(t) in (20)(\ref{herm}) is exponentially small. In the middle, F(t)F(t) is at most algebraically large and 1/ϕ(t)1/\phi(t) is exponentially small. In the head, F(t)F(t) is exponentially large, but its exponential growth rate is limited by σ\sigma, so if σ\sigma is small enough, F(t)/ϕ(t)F(t)/\phi(t) is still exponentially small.

We now turn to the second theorem about approximation near z=0z=0, assuming analyticity only in a bounded domain. As before, our prototypical example for discussion without loss of generality is E=[0,1]E=[\kern 0.5pt0,1]. For the argument we again use a V-shaped contour Γ\Gamma for the Hermite integral, as shown in Figure 8, but now, Γ\Gamma must be fixed rather than growing with nn. This necessitates a change in the choice of poles. If the poles are all at the apex, then the same argument as before ensures exponentially small contributions to the error from IheadI_{\scriptsize\rm head} and ImiddleI_{\scriptsize\rm middle}, but no longer so small (just root-exponential) from ItailI_{\scriptsize\rm tail}. Instead it is necessary to distribute poles along Γ\Gamma, and a successful strategy is to distribute them along segments extending a distance σL=σρn\sigma L=\sigma\rho\kern 1.0ptn from the apex on the two sides of Γ\Gamma, for any constant ρ>0\rho>0.

Specifically, one way to distribute poles is by solving a potential theory problem in the infinite VV domain. As suggested by the contour lines u=0.1,0.2,,0.9u=0.1,0.2,\dots,0.9 in Figure 8, let uu be the unique harmonic function in the infinite slit wedge taking values u=0u=0 on (,0](-\infty,0\kern 0.5pt] and u=1u=1 on the segments of length σL\sigma L just mentioned, with a homogeneous Neumann condition on the remainder of the boundary. This function uu can be calculated by means of a conformal map of the lower half of the slit wedge onto a rectangle K<Rew<K-K<\hbox{\rm\kern 0.3ptRe\kern 1.0pt}w<K, 0<Imw<K0<\hbox{\rm\kern 0.3ptIm\kern 1.0pt}w<K^{\prime} in the complex ww-plane, with the Dirichlet boundary components mapping to the vertical sides and the Neumann components to the top and bottom; the ratio K/2KK^{\prime}/2K is known as the conformal module of the rectangle. If the interior angle of the wedge is π/μ\pi/\mu for some μ>1\mu>1, the map has the explicit representation

s(w)=σσ(b+y1+2by)μ,y=sn(w|M2),s(w)=\sigma-\sigma\left({b+y\over 1+2b-y}\right)^{\mu},\quad y=\hbox{sn}(w\kern 1.0pt|\kern 1.0ptM^{-2}), (29)

where sn is the Jacobi elliptic sine function and

M=1+2b,b=Lμ+Lμ+L2μ.M=1+2b,\quad b=L^{\mu}+\sqrt{L^{\mu}+L^{2\mu}}. (30)

The numbers KK and KK^{\prime} are the complete elliptic integrals of the first kind with parameters M2M^{-2} and 1M21-M^{-2}, respectively [14]. Our Fejér–Walsh choice of poles and interpolation points for the theorem will be

sk=s(wk)wk=K+i(k12)K/n,1kns_{k}=s(w_{k})\quad w_{k}=K+i(k-\textstyle{1\over 2})K^{\prime}/n,\quad 1\leq k\leq n (31)

and

αk=s(w~k)w~k=K+i(k12)K/n,1kn.\alpha_{k}=s(\tilde{w}_{k})\quad\tilde{w}_{k}=-K+i(k-\textstyle{1\over 2})K^{\prime}/n,\quad 1\leq k\leq n. (32)

Refer to caption

Fig. 8: Hankel contour Γ\Gamma for the proof of Theorem 6 with E=[0,1]E=[\kern 0.5pt0,1], together with level curves u=0.1,,0.9u=0.1,\dots,0.9 for the associated potential theory problem obtained from the conformal map (29)(\ref{conf1})(30)(\ref{conf2}). Poles are distributed according to (31)(\ref{polechoice}) along segments of length σL=O(n)\sigma L=O(n) on each side of Γ\Gamma. In the tail, F(t)F(t) in (20)(\ref{herm}) is exponentially small. In the head, ϕ(s)/ϕ(t)\phi(s)/\phi(t) is near-exponentially small.
Theorem 6.

Let EE be a simply-connected compact set in 𝐂\(,0){\bf C}\kern 1.0pt\backslash\kern 1.0pt(-\infty,0\kern 0.5pt); more generally, EE might be a multilevel surface wrapping around 0 a finite number of times. Let ff be an analytic function in EE with a branch point singularity at z=0z=0 in the sense of Definition 1. Then there exist functions

g(z)=c0+k=1ncklog(z)skg(z)=c_{0}+\sum_{k=1}^{n}{c_{k}\over\log(z)-s_{k}} (33)

satisfying

fgexp(Cn/logn)n\|\kern 0.7ptf-g\|\leq\exp(-Cn/\log n)\quad\forall n (34)

for some C>0C>0, where \|\cdot\| is the supremum norm on EE. If E=[0,1]E=[\kern 0.5pt0,1], a suitable choice of {sk}\{s_{k}\} is (31)(\ref{polechoice}) for any μ\mu with tan(π/2μ)<τ\tan(\pi/2\mu)<\tau and sufficiently small σ\sigma.

Proof.

As with Theorem 2, the proof is carried out via the Hermite integral formula for the equivalent problem of rational approximation of a function F(s)F(s) on the domain log(E)\log(E) in the s=logzs=\log z variable; again we first remove f(0)f(0) from the problem by setting c0=f(0)c_{0}=f(0). If EE is a compact set in 𝐂\(,0){\bf C}\kern 1.0pt\backslash\kern 1.0pt(-\infty,0\kern 0.5pt), or a multilevel surface wrapping around 0 a finite number of times, then log(E)\log(E) is a closed subset of 𝐂{}{\bf C}\cup\{\infty\} with upper-bounded ReE\hbox{\rm\kern 0.3ptRe\kern 1.0pt}E and |ImE||\hbox{\rm\kern 0.3ptIm\kern 1.0pt}E|. We take E=[0,1]E=[\kern 0.5pt0,1] and log(E)=[,0]\log(E)=[-\infty,0\kern 0.5pt] for simplicity, as sketched in Figure 8; the more general case is treated by an adjustment of the contour Γ\Gamma and the conformal map. By the assumption (11), FF can be extended to a single-valued analytic function throughout a V-shaped region in the ss-plane, as in the figure, for any inner angle π/μ\pi/\mu with tan(π/2μ)<τ\tan(\pi/2\mu)<\tau and sufficiently small σ\sigma. As in the proof of Theorem 2, we now estimate the Hermite integral (20) over the “head” region of Γ\Gamma, consisting of the two segments of length σL=σρn\sigma L=\sigma\rho\kern 1.0ptn touching the apex, and the remaining “tail.” The contribution ItailI_{\scriptsize\rm tail} is exponentially small because of (12), which in the ss variable becomes (16). The contribution from IheadI_{\scriptsize\rm head} is near-exponentially small, which we establish as follows. The factor F(t)/(ts)F(t)/(t-s) in (20) is bounded as a consequence of (16) and the contour Γ\Gamma being fixed, so we need to show that ϕ(s)/ϕ(t)\phi(s)/\phi(t) is near-exponentially small. From potential theory related to Theorem 19 of chapter 9 of [25], it is known that this selection of {sk}\{s_{k}\} and {αk}\{\alpha_{k}\} as Fejér–Walsh points ensures |ϕ(s)/ϕ(t)|exp(n(K/K))|\phi(s)/\phi(t)|\approx\exp(-n(K/K^{\prime})) as nn\to\infty. The result follows from the estimate K/K(4μ/π)lognK^{\prime}/K\sim(4\mu/\pi)\log n as nn\to\infty, which can be derived from [14, eq. (19.9.5)]. ∎

5 Convergence theorem: multiple singularities

Theorems 2 and 6 are local assertions, establishing exponential and near-exponential resolution of isolated singularities. It remains to show that global approximations can be constructed by adding together these local pieces. We do this in the style of Theorem 6, requiring analyticity just in a neighborhood of EE. The argument is adapted from the discussion around Theorem 2.3 in [6] and illustrated schematically in Figure 9.

Theorem 7.

Let EE be a simply-connected compact set in 𝐂{\bf C} and let ff be an analytic function in EE with branch point singularities in the sense of Definition 1 at boundary points z1,,zmz_{1},\dots,z_{m}; more generally, EE might be a multilevel surface wrapping around each branch point a finite number of times. Then there exist functions

g(z)=c0+j=1mk=1njcjklog(zzj)sjkg(z)=c_{0}+\sum_{j=1}^{m}\kern 2.0pt\sum_{k=1}^{n_{j}}\kern 1.0pt{c_{jk}\over\log(z-z_{j})-s_{jk}} (35)

satisfying

fgexp(Cn/logn)n\|\kern 0.7ptf-g\|\leq\exp(-Cn/\log n)\quad\forall n (36)

for some C>0C>0, where \|\cdot\| is the supremum norm on EE.

Proof.

The function ff can be represented on EE as a Cauchy integral

f(z)=12πiΓf(t)tz𝑑t,f(z)={1\over 2\pi i}\int_{\Gamma}{f(t)\over t-z}\kern 1.0ptdt, (37)

where Γ\Gamma is any fixed contour that lies outside EE and within the region of analyticity of ff, except that Γ\Gamma touches EE at each of the points {zk}\{z_{k}\}. We split up Γ\Gamma into mm pieces Γk\Gamma_{k}, with Γk\Gamma_{k} touching just zkz_{k}, giving

f(z)=f1(z)++fm(z),f(z)=f_{1}(z)+\cdots+f_{m}(z), (38)

where fkf_{k} is the Cauchy integral evaluated over the arc Γk\Gamma_{k},

fk(z)=12πiΓkf(t)tz𝑑t.f_{k}(z)={1\over 2\pi i}\int_{\Gamma_{k}}{f(t)\over t-z}\kern 1.0ptdt. (39)

For the mathematics of Cauchy integrals over open arcs, see Chapter 14 of [8].

Refer to caption

Fig. 9: Illustrations for the proof of Theorem 7. A function ff on EE is decomposed into mm pieces, with fkf_{k} defined by a Cauchy integral of ff over an arc Γk\Gamma_{k} touching the branch point zkz_{k}; Γk\Gamma_{k} is a pair of logarithmic spirals on a Riemann surface to which ff is extended by analytic continution. The right image shows the configuration near a branch point after a change of variables s=log(eiα(zzk))s=\log(e^{i\alpha}(z-z_{k})). The arc defining fkf_{k} becomes a disjoint pair of rays log(Γk)\log(\Gamma_{k}) in the ss-plane, and poles {sk}\{s_{k}\} of the rational approximation will be placed along segments of length O(n)O(n) on a V-shaped contour lying between log(Γk)\log(\Gamma_{k}) and (,0](-\infty,0\kern 0.5pt].

We are done if we can show that each function fkf_{k} can be approximated on EE with near-exponential accuracy by a function of the form (33), with log(z)\log(z) replaced by log(zzk)\log(z-z_{k}) (that is, a continuous branch of log(zzk)\log(z-z_{k}) in EE). This will be ensured by Theorem 6 if fkf_{k} satisfies the conditions (11) and (12) in our definition of a branch point singularity. At this stage, the choice of Γ\Gamma becomes crucial. A choice of Γ\Gamma as a closed, non-intersecting contour in the zz-plane will not do; the corresponding functions fkf_{k} will not be defined along curves winding around zkz_{k} as required by (11). Instead, Γ\Gamma must be taken to be a spiral on a Riemann surface for ff whose projection on the zz-plane is a self-intersecting contour with logarithmic spiral behavior at each zkz_{k}, as sketched in Figure 9. The integrals defining each fkf_{k} now make sense since each zkz_{k} is a branch point around which ff can be analytically continued. Condition (12) holds since it holds for ff by assumption and fkf_{k} differs from ff by a function that is analytic near zkz_{k}, namely the sum of the contributions fjf_{j} with jkj\neq k. ∎

6 Behavior near singularities

As mentioned in the penultimate paragraph of section 2 and illustrated in Figure 3, reciprocal-log approximations have surprising properties near the singularities. Whereas error curves for rational approximations vary over a scale exponentially close to the singularity, for reciprocal-log approximations this becomes doubly-exponential. Fortunately, so far as we are aware, these effects need not cause difficulties in using these approximations, and in particular, it appears that they do not necessitate the use of extended-precision arithmetic. We shall now explain our understanding of these matters with reference to Figure 10, which presents four approximations of z\sqrt{z} for z[0,1]z\in[\kern 0.5pt0,1]. In each case the approximation is computed over [1020,1][10^{-20},1] and then the absolute value of the error is plotted over [1080,1][10^{-80},1]. For comparison, fine dots show the corresponding results for computations over [1080,1][10^{-80},1].

Refer to caption
Fig. 10: Four approximations of z\sqrt{z} computed on [1020,1][10^{-20},1] and then plotted over [1080,1][10^{-80},1] (the absolute value of the error). The fine dots show corresponding results for an approximation computed over [1080,1][10^{-80},1]. See the text for discussion.

Image (a) shows the best (minimax) reciprocal-log approximation with n=4n=4. We see here that the best approximation over [1020,1][10^{-20},1] is by no means optimal over [1080,1][10^{-80},1]. The error is 10 times larger there (6.2e-4 vs. 5.0e-5), and this ratio will grow rapidly with increasing nn. One might think this implies that reciprocal-log approximations must be ineffective at approximating all the way up to the singularity, but the small dots in the figure, corresponding to the best approximation over [1080,1][10^{-80},1], contradict this expectation, showing an error of just 7.6e-5. As can be seen in Figure 3, the error in best approximation over all of [0,1][\kern 0.5pt0,1] is also not much larger, just 8.7e-5.

Image (b) shows the error for reciprocal-log approximation (4) with poles positioned on a Hankel contour according to (5), now with n=8n=8. Again the error in the left-hand part of [1080,1][10^{-80},1] is larger than it needs to be, but this may be less important for these approximations since the error is dominated by larger values of zz.

Image (c) shows a very different situation for reciprocal-log approximation (6) with confluent poles at s0=n/2s_{0}=n/2. Now the approximation over [1020,1][10^{-20},1] gives an error over [1080,1][10^{-80},1] that is orders of magnitude too large.

It is at this point that we wish to pause and ask, what might be the practical implications of such behavior? It certainly seems disturbing if an approximation is much less accurate on [1080,1][10^{-80},1] than on [1020,1][10^{-20},1]. There are three rather disparate observations to be made, which combine to an encouraging picture.

First we note that in IEEE floating point arithmetic, whereas one can represent numbers near 0 smaller than 1030010^{-300}, the resolution near other points zkz_{k} is only on the order of 16 digits. Thus even if we wanted to compute approximations by least-squares fitting on a grid in zk+(0,1020]z_{k}+(0,10^{-20}\kern 0.8pt], we could not do so. This might seem worrying, if one takes Figure 10 to suggest that such domains will be required.

The second observation is that as a purely practical matter, they should not be required. On a planar domain EE such as that of Figure 5, for example, if a function ff is approximated to a satisfactory precision everywhere except at distances <1020<10^{-20} from the vertices {zk}\{z_{k}\}, what difference does it make? What is wrong with an approximation that is mathematically inaccurate in theory, but only at points where it will never be evaluated?

And yet the use of such approximations would be troubling. This brings us to the third observation, which is the basis of image (d) of Figure 10: it may be possible to make an approximation accurate on zk+(0,1020]z_{k}+(0,10^{-20}\kern 0.8pt] without doing any arithmetic there. The trick is to forget least-squares gridding in zk+(0,1020]z_{k}+(0,10^{-20}\kern 0.8pt] and instead impose additional conditions at zkz_{k}, forcing the singular part of the approximation to decay there at a rate O(1/(log|zzk|)J)O(1/(\log|z-z_{k}|)^{J}) for some power J>1J>1. This is natural since in floating point arithmetic, all of zk+(0,1020]z_{k}+(0,10^{-20}\kern 0.8pt] rounds to zkz_{k} anyway. Image (d) corresponds to an approximation of this kind with J=2J=2: we approximate in the nn-dimensional space spanned by 1/(log(z)n/2)j1/(\log(z)-n/2)^{j} with 2jn+22\leq j\leq n+2. The work and the number of degrees of freedom are unchanged, and now least-squares fitting on [1020,1][10^{-20},1] gives an approximation that is accurate on all of [0,1][\kern 0.5pt0,1]. We call this a pinned approximation, since certain terms of the approximation are constrained to be zero at the singularity. In this example J=2J=2 is effective, but for larger nn a choice closer to J=n/2J=n/2 will be called for. To estimate this number, one can use analysis such as that of (22) or (32) to monitor how many interpolation points might be expected to fall closer to zkz_{k} than about machine precision. For the problem of image (d) with n=8n=8, (22) gives distances exp(αj)0.8,0.6,0.3,0.06,0.003,105,1013,1056\exp(\alpha_{j})\approx 0.8,0.6,0.3,0.06,0.003,10^{-5},10^{-13},10^{-56}, and our pinned approximation might be thought of as effectively replacing the last two of these numbers by zero. There are many mathematical questions here and it would be interesting to investigate them.

7 Log-lightning PDE solvers

The motivation for this work has been the development of numerical methods for the solution of partial differential equations on planar domains with corners, beginning with the Laplace equation with Dirichlet boundary conditions. For these problems we do not know an analytic function on EE, just its real part on the boundary. This is no difficulty for linear least-squares fitting, however, where all that matters is having an efficient approximation space. The idea of solving Laplace problems in this fashion was presented in [6], where root-exponential convergence of approximations based on rational functions is established theoretically and experimentally. Here we move to reciprocal-log approximations, and the theorems of sections 4 and 5 assert that the convergence should improve to exponential or near-exponential.

Refer to caption
Fig. 11: Convergence of lightning and log-lightning solutions to the NA Digest problem of the Laplace equation on an L-shaped region [18]. The difference between root-exponential and exponential convergence is evident.

Figure 11 presents a pair of curves confirming this expectation. The solutions computed here are for the problem posed on NA Digest in December 2019, defined by the L-shaped region with vertices 0,2,2+i,1+i,1+2i,2i0,2,2+i,1+i,1+2i,2i and boundary data u(z)=Re(z2)u(z)=\hbox{\rm\kern 0.3ptRe\kern 1.0pt}(z^{2}) [18]. As usual, NN denotes the total number of degrees of freedom, which in this plot is a number of the form 14n+114n+1 with n=2,4,6,.n=2,4,6,\dots. For each nn, the approximating function consists of the real part of a polynomial of degree nn together with nn singularities {sk}\{s_{k}\} at each of the six corners as defined by (5), though with a factor n/3n/3 instead of n/4n/4 in front. This gives 7(n+1)7(n+1) complex degrees of freedom, hence 14n+214n+2 real ones, and the number reduces to 14n+114n+1 since the imaginary part of the constant term of the polynomial does not affect the fit.

This experiment is a long way from the software that has been developed for lightning approximation [20]. The boundary has been resolved simply by 500 exponentially clustered points on each side, whereas adaptive software can make the gridding both more efficient and more careful. More importantly, each corner has been allocated the same number nn of singularities, whereas adaptive software will put more singularities at vertices with stronger singularities. So although the log-lightning convergence curve of Figure 11 is promising, the reliable fast way to solve Laplace problems at this date is still by means of the laplace code at [20]. This code also addresses a number of variations such as Neumann boundary conditions, discontinuous data, and piecewise smooth curved boundaries.

The number of degrees of freedom NN is not a direct measure of computer time. For reasons of linear algebra, the work actually increases in proportion to N2N^{2} or N3N^{3}, depending on whether the grid is fixed or refined with NN, which tends to make the difference between the lightning and log-lightning computations greater than it may appear in Figure 11. A consideration pushing in the other direction is that computer evaluation of a complex logarithm logz\log z is typically slower than that of a reciprocal 1/z1/z by a factor such as 33 or 44.

It was shown in [7] that the original lightning Laplace solver can be generalized to a lightning Helmholtz solver by replacing poles by Hankel functions. Presumably there is a Helmholtz generalization of the log-lightning method too, but we have not investigated this.

8 Conclusion

Exponential or near-exponential convergence of approximations to branch point singularities is a new phenomenon. Reciprocal-log approximations are interesting in theory and may have consequences in practice if good software can be developed.

Philosophical questions are attached to this kind of approximation. It is an old idea that polynomials and rational functions have a special status since “the only operations that can really be carried out numerically are the four elementary operations of addition, subtraction, multiplication and division.”111This quote comes from the PhD thesis of Hilbert’s student Kirchberger in 1902 [19, pp. 196–197]. This point of view has contributed to the dominance of polynomials and rational functions in approximation theory, with other classes of approximating functions seeming less fundamental. We are not ourselves immune to the impression that there is something contrived about approximations of the forms (1) and (2). But the philosophical distinction that may have seemed sharp before the arrival of computers seems less sharp today.

A fascinating aspect of reciprocal-log approximations is their exploitation of the behavior of a function on a Riemann surface. As one of the smallest consequences of this feature, these approximations have no difficulty at all in treating domains with slits, and the prospects for further adventures with multivalued functions seem enticing.

Acknowledgments

The research for this article was carried out in regular communication with André Weideman, whose many suggestions we are happy to acknowledge. Another key contribution was Lemma 5, from Peter Baddoo. In addition we are grateful for helpful comments from Marco Fasondini, Abi Gopal, Alex Townsend, and Elias Wegert.

References

  • [1] P. D. Brubeck, Y. Nakatsukasa, and L. N. Trefethen, Vandermonde with Arnoldi, SIAM Rev., to appear.
  • [2] P. D. Brubeck and L. N. Trefethen, Lightning Stokes solver, manuscript in preparation.
  • [3] W. J. Cody, G. Meinardus, and R. S. Varga, Chebyshev rational approximations to exe^{-x} in [0,+)[\kern 0.5pt0,+\infty) and applications to heat-conduction problems, J. Approx. Th., 2 (1969), pp. 50–65.
  • [4] T. A. Driscoll and B. Fornberg, A Padé-based algorithm for overcoming the Gibbs phenomenon, Numer. Algs., 26 (2001), pp. 77–92.
  • [5] D. Gaier, Lectures on Complex Approximation, Birkhäuser, Basel, 1987.
  • [6] A. Gopal and L. N. Trefethen, Solving Laplace problems with corner singularities via rational functions, SIAM J. Numer. Anal., 57 (2019), pp. 2074–2094.
  • [7] A. Gopal and L. N. Trefethen, New Laplace and Helmholtz solvers, Proc. Nat. Acad. Sci., 116 (2019), p. 10223.
  • [8] P. Henrici, Applied and Computational Complex Analysis, Vol. 3, Wiley, New York, 1986.
  • [9] C. L. Lawson, Contributions to the Theory of Linear Least Maximum Approximations, PhD thesis, UCLA, 1961.
  • [10] R. S. Lehman, Development of the mapping function at an analytic corner, Pacific J. Math., 7 (1957), pp. 1437–1449.
  • [11] E. Levin and E. Saff, Potential theoretic tools in polynomial and rational approximation, in Harmonic Analysis and Rational Approximation, Springer, 2006, pp. 71–94.
  • [12] Y. Nakatsukasa, O. Sète, and L. N. Trefethen, The AAA algorithm for rational approximation, SIAM J. Sci. Comput., 40 (2018), A1494–A1522.
  • [13] D. J. Newman, Rational approximation to |x||x|, Mich. Math. J., 11 (1964), pp 11–14.
  • [14] F. W. Olver, et al., NIST Handbook of Mathematical Functions, Cambridge University Press, 2010.
  • [15] H. Stahl, The convergence of Padé approximants to functions with branch points, J. Approx. Th., 91 (1997), pp. 139–204.
  • [16] H. Stahl and T. Schmelzer, An extension of the ‘1/9’ problem, J. Comp. Appl. Math., 233 (2009), pp. 821–834.
  • [17] G. Starke, Fejér–Walsh points for rational functions and their use in the ADI iterative method, J. Comp. Appl. Math., 46 (1993), pp. 129–141.
  • [18] L. N. Trefethen, 8-digit Laplace solutions on polygons?, Posting on NA Digest at http://www.netlib.org/na-digest-html, December 3, 2018.
  • [19] L. N. Trefethen, Approximation Theory and Approximation Practice, Extended Edition, SIAM, 2019.
  • [20] L. N. Trefethen, Lightning Laplace software, https://people.maths.ox.ac.uk/trefethen/ lightning, 2020.
  • [21] L. N. Trefethen, Numerical conformal mapping with rational functions, Comp. Meth. Funct. Thy., 2020.
  • [22] L. N. Trefethen, Y. Nakatsukasa, and J. A. C. Weideman, Exponential node clustering at singularities for rational approximation, quadrature, and PDEs, submitted to Numer. Math.
  • [23] L. N. Trefethen and J. A. C. Weideman, The exponentially convergent trapezoidal rule, SIAM Rev., 56 (2014), pp. 385–458.
  • [24] L. N. Trefethen, J. A. C. Weideman, and T. Schmelzer, Talbot quadratures and rational approximations, BIT Numer. Math., 46 (2006), pp. 653–670.
  • [25] J. L. Walsh, Interpolation and Approximation by Rational Functions in the Complex Domain, Amer. Math. Soc., 1935.
  • [26] W. Wasow, Asymptotic development of the solution of Dirichlet’s problem at analytic corners, Duke Math. J., 24 (1957), pp. 47–56.
  • [27] E. Wegert, Visual Complex Functions: An Introduction with Phase Portraits, Birkhäuser, 2012.
  • [28] J. A. C. Weideman and L. N. Trefethen, Parabolic and hyperbolic contours for computing the Bromwich integral, Math. Comput., 76 (2007), pp. 1341–1356.