Implementation of high-order,
discontinuous Galerkin time stepping
for fractional diffusion problems

William McLean

Abstract

The discontinuous Galerkin (dG) method provides a robust and flexible technique for the time integration of fractional diffusion problems. However, a practical implementation uses coefficients defined by integrals that are not easily evaluated. We describe specialised quadrature techniques that efficiently maintain the overall accuracy of the dG method. In addition, we observe in numerical experiments that known superconvergence properties of dG time stepping for classical diffusion problems carry over in a modified form to the fractional-order setting.

1 Introduction

The discontinuous Galerkin (dG) method provides an effective numerical procedure for the time integration of diffusion problems. In the mid-1980s, Eriksson, Johnson and Thomée [2] provided the first detailed error analysis, which has been subsequently extended and refined by numerous authors [[]and references therein]MakridakisNochetto2006,SchmutzWihler2019,SchotzauSchwab2001. The dG method has also proved effective for time stepping of fractional diffusion problems [9, 12] of the form

\partial_{t}u+\partial_{t}^{1-\alpha}Au=f(t)\quad\text{for $0<t\leq T$, with $u(0)=u_{0}$.}

(1)

Here, $A$ is a linear, second-order, elliptic partial differential operator over a spatial domain $\Omega$ , subject to a homogeneous Dirichlet boundary condition $u=0$ on $\partial\Omega$ . (Our notation suppresses the dependence of $u$ and $f$ on the spatial variables.) The fractional diffusion exponent is assumed to satisfy $0<\alpha<1$ (the sub-diffusive case), and the fractional time derivative is understood in the Riemann–Liouville sense: for $t>0$ and $\mu>0$ ,

\partial_{t}^{\mu}v=\frac{\partial}{\partial t}\int_{0}^{t}\omega_{\mu}(t-s)v(s)\,ds\quad\text{where}\quad\omega_{\mu}(t)=\frac{t^{\mu-1}}{\Gamma(\mu)}.

The partial integro-differential equation (1) arises in a variety of physical models [3, 11] of diffusing particles whose behaviour is described by a continuous-time random walk for which the waiting-time distribution is a power law that decays like $1/t^{1+\alpha}$ . The expected waiting time is therefore infinite, and the mean-square displacement is proportional to $t^{\alpha}$ . Standard Brownian motion is recovered in the limit as $\alpha\to 1$ , when (1) reduces to the classical diffusion equation.

Our main concern in the present work is with the practical implementation of dG time stepping for (1), and in particular with the accurate evaluation of certain coefficients $H^{n,n-\ell}_{ij}$ used during the $n$ th step. Section 2 introduces the dG method for the fractional ODE case of (1), in which the operator $A$ is replaced by a scalar $\lambda>0$ . We will see in the simplest, lowest-order scheme, when the dG solution is piecewise-constant in time, that

H^{n,0}_{11}=\int_{t_{n-1}}^{t_{n}}\frac{d}{dt}\biggl{(}\int_{t_{n-1}}^{t}\omega_{\alpha}(t-s)\,ds\biggr{)}\,dt

and

H^{n,n-\ell}_{11}=\int_{t_{n-1}}^{t_{n}}\frac{d}{dt}\biggl{(}\int_{t_{\ell-1}}^{t_{\ell}}\omega_{\alpha}(t-s)\,ds\biggr{)}\,dt\quad\text{for $1\leq\ell\leq n-1$,}

where $0=t_{0}<t_{1}<t_{2}<\cdots$ are the discrete time levels. We easily verify that $H^{n,0}_{11}=\omega_{\alpha+1}(k_{n})=k_{n}^{\alpha}/\Gamma(\alpha+1)$ , for a step-size $k_{n}=t_{n}-t_{n-1}$ , and

	$\displaystyle H^{n,n-\ell}_{11}$	$\displaystyle=\omega_{\alpha+1}(t_{n}-t_{\ell-1})-\omega_{\alpha+1}(t_{n}-t_{\ell})$		(2)
		$\displaystyle\qquad{}-\omega_{\alpha+1}(t_{n-1}-t_{\ell-1})+\omega_{\alpha+1}(t_{n-1}-t_{\ell}),$		(2)

but for higher-order schemes the coefficients become progressively more complicated. Although the $H^{n,n-\ell}_{ij}$ can always be evaluated via repeated integration by parts, the resulting expressions are likely to suffer from roundoff when evaluated in floating-point arithmetic if $n-\ell$ is large. Consider just the lowest order case (2) with uniform time steps $t_{n}=nk$ , so that

H^{n,n-\ell}_{11}=k^{\alpha}\bigl{[}\omega_{\alpha+1}(n-\ell+1)-2\omega_{\alpha+1}(n-\ell)+\omega_{\alpha+1}(n-\ell-1)\bigr{]}.

Since the factor in square brackets is a second-difference of $\omega_{\alpha+1}$ , its magnitude decays like $(n-\ell)^{\alpha-2}$ as $n-\ell$ increases, but the individual terms grow like $(n-\ell)^{\alpha}$ .

We are therefore led to evaluate the coefficients $H^{n,n-\ell}_{ij}$ via quadratures with positive weights. No special techniques are needed for $\ell\leq n-2$ , but when $\ell=n$ or $n-1$ we must deal with weakly singular integrands. In Section 3, we show how certain substitutions reduce the problem to dealing with integrands that are either smooth, or are products of smooth functions and standard Jacobi weight functions. Similar substitutions, known as Duffy transformations [1], have long been used to compute singular integrals arising in the boundary element method.

Section 4 introduces a spatial discretisation for the fractional PDE (1) and describes the structure of the linear system that must be solved at each time step. In Section 5, we specialise the expressions for the coefficients by choosing Legendre polynomials as the shape functions employed in the dG time stepping.

Section 6 describes a post-processing technique that, when applied to the dG solution $U$ , produces a more accurate approximate solution $\widehat{U}$ , known as the reconstruction [6] of $U$ . If $U$ is a piecewise polynomial of degree at most $r-1$ , then $\widehat{U}$ is a piecewise polynomial of degree at most $r$ . For a classical diffusion problem, both $U$ and $\widehat{U}$ are known to be quasi-optimal, that is, accurate of order $k^{r}$ and $k^{r+1}$ , respectively. Thus, it is natural to ask what happens in the fractional-order case, and we investigate this question in numerical experiments reported in Section 7.

2 A fractional ODE

Our central concern is present already in the zero-dimensional case when we replace the elliptic operator $A$ with a scalar $\lambda\geq 0$ , so that the solution $u(t)$ is a real-valued function satisfying the fractional ODE

u^{\prime}+\lambda\partial_{t}^{1-\alpha}u=f(t)\quad\text{for $0<t\leq T$, with $u(0)=u_{0}$.}

(3)

For the time discretisation, we introduce a grid

0=t_{0}<t_{1}<t_{2}<\cdots<t_{N}=T,

and form the vector $\boldsymbol{t}=(t_{0},t_{1},\ldots,t_{N})$ . Let $k_{n}=t_{n}-t_{n-1}$ denote the length of the $n$ th (open) subinterval $I_{n}=(t_{n-1},t_{n})$ . We form the disjoint union

I=I_{1}\cup I_{2}\cup\cdots\cup I_{N},

and for any function $v:I\to\mathbb{R}$ write

v^{n}_{+}=\lim_{\epsilon\downarrow 0}v(t+\epsilon),\qquad v^{n}_{-}=\lim_{\epsilon\downarrow 0}v(t-\epsilon),\qquad\llbracket v\rrbracket^{n}=v^{n}_{+}-v^{n}_{-},

provided the one-sided limits exist.

Given a vector $\boldsymbol{r}=(r_{1},r_{2},\ldots,r_{N})$ of integers $r_{n}\geq 0$ , the trial space $\mathcal{X}=\mathcal{X}(\boldsymbol{t},\boldsymbol{r})$ consists of the functions $X:I\to\mathbb{R}$ such that $X|_{I_{n}}\in\mathbb{P}_{r_{n}-1}$ for $1\leq n\leq N$ . Here, $\mathbb{P}_{m}$ denotes the space of polynomials of degree at most $m\geq 0$ , with real coefficients. The dG solution $U\in\mathcal{X}$ of (3) is then defined by [9, 12]

\llbracket U\rrbracket^{n-1}X^{n-1}_{+}+\int_{I_{n}}(U^{\prime}+\lambda\partial_{t}^{1-\alpha}U)X\,dt=\int_{I_{n}}fX\,dt

(4)

for $X\in\mathbb{P}_{r_{n}-1}$ and $1\leq n\leq N$ , where, in the case $n=1$ , we set $U^{0}_{-}=u_{0}$ so that $\llbracket U\rrbracket^{0}=U^{0}_{+}-U^{0}_{-}=U^{0}_{+}-u_{0}$ . (The monograph of Thomée [15, Chapter 12] is a standard reference providing a general introduction to dG time stepping for classical diffusion problems.)

To compute $U$ , we choose for each $n$ a basis $\psi_{n1}$ , $\psi_{n2}$ , …, $\psi_{nr_{n}}$ for $\mathbb{P}_{r_{n}-1}$ and write

U(t)=\sum_{j=1}^{r_{n}}U^{nj}\psi_{nj}(t)\quad\text{for $t\in I_{n}$.}

(5)

When $X=\psi_{ni}$ , we find that

U^{n-1}_{+}X^{n-1}_{+}+\int_{I_{n}}U^{\prime}X\,dt=\sum_{j=1}^{r_{n}}G^{n}_{ij}U^{nj}\quad\text{and}\quad U^{n-1}_{-}X^{n-1}_{+}=\sum_{j=1}^{r_{n-1}}K^{n,n-1}_{ij}U^{n-1,j},

with coefficients given by

G^{n}_{ij}=\psi_{nj}(t_{n-1})\psi_{ni}(t_{n-1})+\int_{I_{n}}\psi_{nj}^{\prime}\psi_{ni}\,dt

and

K^{n,n-1}_{ij}=\psi_{n-1,j}(t_{n-1})\psi_{ni}(t_{n-1}).

Owing to the convolutional structure of the fractional derivative, it is convenient to introduce the notation

\bar{\ell}=n-\ell

and define, if $t\in I_{n}$ ,

\rho^{n\bar{\ell}}_{j}(t)=\rho^{n,n-\ell}_{j}(t)=\int_{I_{\ell}}\omega_{\alpha}(t-s)\psi_{\ell j}(s)\,ds\quad\text{for $1\leq\ell\leq n-1$,}

with

\rho^{n\bar{n}}_{j}(t)=\rho^{n0}_{j}(t)=\int_{t_{n-1}}^{t}\omega_{\alpha}(t-s)\psi_{nj}(s)\,ds.

We find that

\partial_{t}^{1-\alpha}U=\sum_{\ell=1}^{n}\sum_{j=1}^{r_{\ell}}U^{\ell j}(\rho^{n\bar{\ell}}_{j})^{\prime}(t)\quad\text{for $t\in I_{n}$,}

and thus

\int_{I_{n}}(\partial_{t}^{1-\alpha}U)X\,dt=\sum_{\ell=1}^{n}\sum_{j=1}^{r_{\ell}}H^{n\bar{\ell}}_{ij}U^{\ell j}\quad\text{where}\quad H^{n\bar{\ell}}_{ij}=H^{n,n-\ell}_{ij}=\int_{I_{n}}(\rho^{n\ell}_{j})^{\prime}\psi_{ni}\,dt.

Hence, putting

F^{ni}=\int_{I_{n}}f\psi_{ni}\,dt,

the dG method (4) requires

\sum_{j=1}^{r_{n}}\bigl{(}G^{n}_{ij}+\lambda H^{n0}_{ij}\bigr{)}U^{nj}=F^{ni}-\sum_{\ell=1}^{n-1}\sum_{j=1}^{r_{\ell}}\lambda H^{n,n-\ell}_{ij}U^{\ell j}\\ +\begin{cases}\psi_{1i}(0)u_{0},&n=1,\\ \sum_{j=1}^{r_{n-1}}K^{n,n-1}_{ij}U^{n-1,j},&2\leq n\leq N.\end{cases}

(6)

At the $n$ th time step, this $r_{n}\times r_{n}$ linear system must be solved to determine $U^{n1}$ , $U^{n2}$ ,…, $U^{nr_{n}}$ and hence $U(t)$ for $t\in I_{n}$ .

Remark 1.

If we send $\alpha\to 1$ , so that the fractional ODE in (3) reduces to the classical ODE $u^{\prime}+\lambda u=f(t)$ , then $H^{n\bar{\ell}}_{ij}=0$ for $1\leq\bar{\ell}\leq n-1$ . Indeed, since $\omega_{1}(t)=1$ , we see that $\rho^{n\bar{\ell}}_{j}(t)=\int_{I_{\ell}}\psi_{\ell j}(s)\,ds$ is constant and so $(\rho^{n\bar{\ell}}_{j})^{\prime}(t)=0$ for $t\in I_{n}$ . Moreover, $(\rho^{n0}_{j})^{\prime}(t)=\psi_{nj}(t)$ so $H^{n0}_{ij}=\int_{I_{n}}\psi_{nj}\psi_{ni}\,dt$ .

Remark 2.

Later we will show certain symmetry properties of $H^{n0}_{ij}$ using the identity

\int_{a}^{b}\biggl{(}\frac{\partial}{\partial t}\int_{a}^{t}\omega_{\alpha}(t-s)u(s)\,ds\biggr{)}v(t)\,dt\\ =-\int_{a}^{b}u(s)\biggl{(}\frac{\partial}{\partial s}\int_{s}^{b}\omega_{\alpha}(t-s)v(t)\,dt\biggr{)}\,ds.

(7)

In fact, the substitution $x=t-s$ gives

	$\displaystyle\frac{\partial}{\partial t}\int_{a}^{t}\omega_{\alpha}(t-s)u(s)\,ds$	$\displaystyle=\frac{\partial}{\partial t}\int_{0}^{t-a}\omega_{\alpha}(x)u(t-x)\,dx$
		$\displaystyle=\omega_{\alpha}(t-a)u(a)+\int_{0}^{t-a}\omega_{\alpha}(x)u^{\prime}(t-x)\,dx$
		$\displaystyle=\omega_{\alpha}(t-a)u(t_{n-1})+\int_{a}^{t}\omega_{\alpha}(t-s)u^{\prime}(s)\,ds,$

and (7) follows after reversing the order of integration and then integrating by parts. Similarly, for $\ell\leq n-1$ ,

\int_{a}^{b}\biggl{(}\frac{\partial}{\partial t}\int_{a}^{b}\omega_{\alpha}(t-s)u(s)\,ds\biggr{)}v(t)\,dt=-\int_{a}^{b}u(s)\biggl{(}\frac{\partial}{\partial s}\int_{a}^{b}\omega_{\alpha}(t-s)v(t)\,dt\biggr{)}\,ds.

(8)

3 Evaluation of the coefficients

To compute $G^{n}_{ij}$ , $H^{n\ell}_{ij}$ and $K^{n,n-1}_{ij}$ it is convenient to map each closed subinterval $\bar{I}_{n}=[t_{n-1},t_{n}]$ to the reference element $[-1,1]$ . We therefore define the affine function $\mathsf{t}_{n}:[-1,1]\to\bar{I}_{n}$ by

\mathsf{t}_{n}(\tau)=\tfrac{1}{2}\bigl{[}(1-\tau)t_{n-1}+(1+\tau)t_{n}\bigr{]}\quad\text{for $-1\leq\tau\leq 1$,}

and let

\Psi_{nj}(\tau)=\psi_{nj}(t)\quad\text{for $t=\mathsf{t}_{n}(\tau)$ and $-1\leq\tau\leq 1$.}

In this way,

G^{n}_{ij}=\Psi_{nj}(-1)\Psi_{ni}(-1)+\int_{-1}^{1}\Psi_{nj}^{\prime}(\tau)\Psi_{ni}(\tau)\,d\tau

(9)

and

K^{n,n-1}_{ij}=\Psi_{n-1,j}(+1)\Psi_{ni}(-1).

(10)

Both of these coefficients are readily computed; the remainder of this section is devoted to $H^{n\bar{\ell}}_{ij}$ . The formulae in the next lemma allow us to compute $H^{n0}_{ij}$ to machine precision via Gauss–Legendre and Gauss–Jacobi quadrature.

Lemma 3.

If we define the polynomial

\Phi^{n}_{ij}(y)=\frac{1}{2}\int_{-1}^{1}\Psi_{nj}\bigl{(}\tfrac{1}{2}(1-y)(1+z)-1\bigr{)}\Psi^{\prime}_{ni}\bigl{(}1-\tfrac{1}{2}(1-y)(1-z)\bigr{)}\,dz,

then

	$\displaystyle H^{n0}_{ij}=\frac{(k_{n}/2)^{\alpha}}{\Gamma(\alpha)}$	$\displaystyle\biggl{(}\Psi_{ni}(1)\int_{-1}^{1}(1-\sigma)^{\alpha}\Psi_{nj}(\sigma)\,d\sigma$
		$\displaystyle\qquad{}-\int_{-1}^{1}(1+y)^{\alpha-1}(1-y)\Phi^{n}_{ij}(y)\,dy\biggr{)}.$

Proof:

Since $\rho^{n0}_{j}(t_{n-1})=0$ , integration by parts gives

	$\displaystyle H^{n0}_{ij}$	$\displaystyle=\rho^{n0}_{j}(t_{n})\psi_{ni}(t_{n})-\int_{I_{n}}\rho^{n0}_{j}(t)\psi^{\prime}_{ni}(t)\,dt$
		$\displaystyle=\rho^{n0}_{j}(t_{n})\Psi_{ni}(1)-\int_{-1}^{1}\rho^{n0}_{j}\bigl{(}\mathsf{t}_{n}(\tau)\bigr{)}\Psi_{ni}^{\prime}(\tau)\,d\tau,$

and since $\mathsf{t}_{n}(\tau)-\mathsf{t}_{n}(\sigma)=(\tau-\sigma)k_{n}/2$ , the substitution $s=\mathsf{t}_{n}(\sigma)$ yields

	$\displaystyle\rho^{n0}_{j}\bigl{(}\mathsf{t}_{n}(\tau)\bigr{)}$	$\displaystyle=\frac{k_{n}}{2}\int_{-1}^{\tau}\omega_{\alpha}\bigl{(}\mathsf{t}_{n}(\tau)-\mathsf{t}_{n}(\sigma)\bigr{)}\Psi_{nj}(\sigma)\,d\sigma$
		$\displaystyle=\frac{(k_{n}/2)^{\alpha}}{\Gamma(\alpha)}\int_{-1}^{\tau}(\tau-\sigma)^{\alpha-1}\Psi_{nj}(\sigma)\,d\sigma.$

Thus,

H^{n0}_{ij}=\frac{(k_{n}/2)^{\alpha}}{\Gamma(\alpha)}\biggl{(}\Psi_{ni}(1)\int_{-1}^{1}(1-\sigma)^{\alpha-1}\Psi_{nj}(\sigma)\,d\sigma-B^{n}_{ij}\biggr{)},

where

B^{n}_{ij}=\int_{-1}^{1}\int_{-1}^{\tau}(\tau-\sigma)^{\alpha-1}\Psi_{nj}(\sigma)\,d\sigma\,\Psi_{ni}^{\prime}(\tau)\,d\tau.

We make the substitution $1+y=\tau-\sigma$ , which results in a fixed singularity at $y=-1$ , and then reverse the order of integration:

	$\displaystyle B^{n}_{ij}$	$\displaystyle=\int_{-1}^{1}\int_{-1}^{\tau}(1+y)^{\alpha-1}\Psi_{nj}(\tau-y-1)\,dy\,\Psi_{ni}^{\prime}(\tau)\,d\tau$
		$\displaystyle=\int_{-1}^{1}(1+y)^{\alpha-1}\int_{y}^{1}\Psi_{nj}(\tau-y-1)\Psi_{ni}^{\prime}(\tau)\,d\tau\,dy.$

The substitution $\tau=\tfrac{1}{2}\bigl{[}(1-z)y+(1+z)\bigr{]}$ then yields

\int_{y}^{1}\Psi_{nj}(\tau-y-1)\Psi_{ni}^{\prime}(\tau)\,d\tau\,dy=(1-y)\Phi^{n}_{ij}(y),

implying the desired formula for $H^{n0}_{ij}$ . $\spadesuit$

To deal with $H^{n,n-\ell}_{ij}$ for $\ell\leq n-1$ , we introduce the notation

t_{n-1/2}=\mathsf{t}_{n}(0)=\tfrac{1}{2}(t_{n-1}+t_{n})\quad\text{and}\quad D_{n\bar{\ell}}=D_{n,n-\ell}=t_{n-1/2}-t_{\ell-1/2},

with

\Delta_{n\bar{\ell}}(\tau,\sigma)=\Delta_{n,n-\ell}(\tau,\sigma)=\frac{\tau k_{n}-\sigma k_{\ell}}{2D_{n\bar{\ell}}},

so that

\mathsf{t}_{n}(\tau)-\mathsf{t}_{\ell}(\sigma)=D_{n\bar{\ell}}\bigl{(}1+\Delta_{n\bar{\ell}}(\tau,\sigma)\bigr{)}.

Lemma 4.

If $1\leq\ell\leq n-1$ , then

H^{n\bar{\ell}}_{ij}=\frac{D_{n\bar{\ell}}^{\alpha-1}}{\Gamma(\alpha)}\,\frac{k_{\ell}}{2}\bigl{(}\Psi_{ni}(1)\mathcal{A}^{n\bar{\ell}}_{j}-\Psi_{ni}(-1)\mathcal{B}^{n\bar{\ell}}_{j}-\mathcal{C}^{n\bar{\ell}}_{ij}\bigr{)},

where

	$\displaystyle\mathcal{A}^{n\bar{\ell}}_{j}$	$\displaystyle=\int_{-1}^{1}\bigl{(}1+\Delta_{n\bar{\ell}}(1,\sigma)\bigr{)}^{\alpha-1}\Psi_{\ell j}(\sigma)\,d\sigma,$
	$\displaystyle\mathcal{B}^{n\bar{\ell}}_{j}$	$\displaystyle=\int_{-1}^{1}\bigl{(}1+\Delta_{n\bar{\ell}}(-1,\sigma)\bigr{)}^{\alpha-1}\Psi_{\ell j}(\sigma)\,d\sigma,$
	$\displaystyle\mathcal{C}^{n\bar{\ell}}_{ij}$	$\displaystyle=\int_{-1}^{1}\Psi_{ni}^{\prime}(\tau)\int_{-1}^{1}\bigl{(}1+\Delta_{n\bar{\ell}}(\tau,\sigma)\bigr{)}^{\alpha-1}\Psi_{\ell j}(\sigma)\,d\sigma\,d\tau.$

Proof:

Integrating by parts, we find that

	$\displaystyle H^{n\bar{\ell}}_{ij}$	$\displaystyle=\rho^{n\bar{\ell}}_{j}(t_{n})\psi_{ni}(t_{n})-\rho^{n\bar{\ell}}_{j}(t_{n-1})\psi_{ni}(t_{n-1})-\int_{I_{n}}\rho^{n\bar{\ell}}_{j}(t)\psi_{ni}^{\prime}(t)\,dt$
		$\displaystyle=\rho^{n\bar{\ell}}_{j}(t_{n})\Psi_{ni}(1)-\rho^{n\bar{\ell}}_{j}(t_{n-1})\Psi_{ni}(-1)-\int_{-1}^{1}\rho^{n\bar{\ell}}_{j}\bigl{(}\mathsf{t}_{n}(\tau)\bigr{)}\Psi^{\prime}_{ni}(\tau)\,d\tau.$

The substitution $s=\mathsf{t}_{\ell}(\sigma)$ gives

\rho^{n\bar{\ell}}_{j}\bigl{(}\mathsf{t}_{n}(\tau)\bigr{)}=\frac{D_{n\bar{\ell}}^{\alpha-1}}{\Gamma(\alpha)}\,\frac{k_{\ell}}{2}\int_{-1}^{1}\bigl{(}1+\Delta_{n\bar{\ell}}(\tau,\sigma)\bigr{)}^{\alpha-1}\Psi_{\ell j}(\sigma)\,d\sigma,

and the formula for $H^{n\bar{\ell}}_{ij}$ follows at once. $\spadesuit$

Notice that

1+\Delta_{n\bar{\ell}}(1,\sigma)=\frac{2(t_{n}-t_{\ell})+(1-\sigma)k_{\ell}}{k_{n}+2(t_{n-1}-t_{l})+k_{\ell}}>0\quad\text{for $1\leq\ell\leq n-1$,}

so the integrand of $\mathcal{A}^{n\bar{\ell}}_{ij}$ is always smooth. However,

1+\Delta_{n\bar{\ell}}(-1,\sigma)=\frac{2(t_{n-1}-t_{\ell})+(1-\sigma)k_{\ell}}{k_{n}+2(t_{n-1}-t_{l})+k_{\ell}},

so the integrands of $\mathcal{B}^{n\bar{\ell}}_{j}$ and $\mathcal{C}^{n\bar{\ell}}_{j}$ are weakly singular if $\bar{\ell}=1$ (i.e., if $\ell=n-1$ ). The next lemma provides alternative expressions that are amenable to Gauss–Jacobi and Gauss–Legendre quadrature.

Lemma 5.

Let $\rho_{n}=k_{n}/k_{n-1}$ . Then,

	$\displaystyle\mathcal{A}^{n1}_{j}$	$\displaystyle=(1+\rho_{n})^{1-\alpha}\int_{-1}^{1}(2\rho_{n}+1-\sigma)^{\alpha-1}\Psi_{n-1,j}(\sigma)\,d\sigma,$
	$\displaystyle\mathcal{B}^{n1}_{j}$	$\displaystyle=(1+\rho_{n})^{1-\alpha}\int_{-1}^{1}(1-\sigma)^{\alpha-1}\Psi_{n-1,j}(\sigma)\,d\sigma$

and

\mathcal{C}^{n1}_{ij}=(1+\rho_{n})^{1-\alpha}\biggl{(}\\ \int_{-1}^{1}(1+\tau)^{\alpha}\Psi^{\prime}_{ni}(\tau)\int_{0}^{1}(\rho_{n}+z)^{\alpha-1}\Psi_{n-1,j}\bigl{(}1-z(1+\tau)\bigr{)}\,dz\,d\tau\\ +\int_{-1}^{1}(1-\sigma)^{\alpha}\Psi_{n-1,j}(\sigma)\int_{0}^{1}(\rho_{n}z+1)^{\alpha-1}\Psi_{ni}^{\prime}\bigl{(}z(1-\sigma)-1\bigr{)}\,dz\,d\sigma\biggr{)}.

Proof:

Since $1+\Delta_{n1}(-1,\sigma)=k_{n-1}(1-\sigma)/(k_{n}+k_{n-1})$ , the formula for $\mathcal{B}^{n1}_{ij}$ follows at once. To deal with $\mathcal{C}^{n,1}_{ij}$ we begin by mapping $[-1,1]^{2}$ onto $[0,2]^{2}$ with the substitution $(\tau,\sigma)=(x-1,1-y)$ . In this way, the singularity at $(\tau,\sigma)=(-1,1)$ moves to $(x,y)=(0,0)$ , and

\mathcal{C}^{n1}_{ij}=\int_{0}^{2}\int_{0}^{2}\bigl{(}1+\Delta_{n1}(x-1,1-y)\bigr{)}^{\alpha-1}\Psi_{n-1,j}(1-y)\Psi_{ni}^{\prime}(x-1)\,dx\,dy

with

1+\Delta_{n1}(x-1,1-y)=\frac{xk_{n}+yk_{n-1}}{k_{n}+k_{n-1}}.

By splitting the integration domain $[0,2]^{2}$ into the triangular halves where $x>y$ and $x<y$ , we obtain

	$\displaystyle\mathcal{C}^{n1}_{ij}$	$\displaystyle=\int_{0}^{2}\Psi_{ni}^{\prime}(x-1)\int_{0}^{x}\biggl{(}\frac{xk_{n}+yk_{n-1}}{k_{n}+k_{n-1}}\biggr{)}^{\alpha-1}\Psi_{n-1,j}(1-y)\,dy\,dx$
		$\displaystyle\qquad{}+\int_{0}^{2}\Psi_{n-1,j}(1-y)\int_{0}^{y}\biggl{(}\frac{xk_{n}+yk_{n-1}}{k_{n}+k_{n-1}}\biggr{)}^{\alpha-1}\Psi_{ni}^{\prime}(x-1)\,dx\,dy.$

The substitution $y=zx$ tranforms the inner integral in the first term to

x^{\alpha}\int_{0}^{1}\biggl{(}\frac{k_{n}+zk_{n-1}}{k_{n}+k_{n-1}}\biggr{)}^{\alpha-1}\Psi_{n-1,j}(1-zx)\,dz,

and the substitution $x=zy$ transforms that in the second to

y^{\alpha}\int_{0}^{1}\biggl{(}\frac{zk_{n}+k_{n-1}}{k_{n}+k_{n-1}}\biggr{)}^{\alpha-1}\Psi_{ni}^{\prime}(zy-1)\,dz.

Thus,

	$\displaystyle\mathcal{C}^{n1}_{ij}$	$\displaystyle=\int_{0}^{2}x^{\alpha}\Psi^{\prime}_{ni}(x-1)\int_{0}^{1}\biggl{(}\frac{k_{n}+zk_{n-1}}{k_{n}+k_{n-1}}\biggr{)}^{\alpha-1}\Psi_{n-1,j}(1-zx)\,dz\,dx$
		$\displaystyle\qquad{}+\int_{0}^{2}y^{\alpha}\Psi_{n-1,j}(1-y)\int_{0}^{1}\biggl{(}\frac{zk_{n}+k_{n-1}}{k_{n}+k_{n-1}}\biggr{)}^{\alpha-1}\Psi_{ni}^{\prime}(zy-1)\,dz\,dy.$

Now make the substitutions $x=1+\tau$ and $y=1-\sigma$ . $\spadesuit$

We also have the following alternative representation.

Lemma 6.

If $1\leq\ell\leq n-2$ , then

H^{n\bar{\ell}}_{ij}=-\frac{1-\alpha}{\Gamma(\alpha)}\,\frac{k_{n}k_{\ell}}{4}\,D_{n\bar{\ell}}^{\alpha-2}\int_{-1}^{1}\Psi_{ni}(\tau)\int_{-1}^{1}\bigl{(}1+\Delta_{n\ell}(\tau,\sigma)\bigr{)}^{\alpha-2}\Psi_{\ell j}(\sigma)\,d\sigma\,d\tau.

Proof:

When $\ell\leq n-2$ ,

(\rho^{n\bar{\ell}}_{j})^{\prime}(t)=\int_{I_{\ell}}\omega_{\alpha-1}(t-s)\psi_{\ell j}(s)\,ds\quad\text{for $t>t_{\ell}$,}

and so

H^{n\bar{\ell}}_{ij}=\int_{I_{n}}\psi_{ni}(t)\int_{I_{\ell}}\omega_{\alpha-1}(t-s)\psi_{\ell j}(s)\,ds.

(11)

The result now follows via the substitutions $t=\mathsf{t}_{n}(\tau)$ and $s=\mathsf{t}_{\ell}(\sigma)$ , noting that $\Gamma(\alpha)=(\alpha-1)\Gamma(\alpha-1)$ . $\spadesuit$

Remark 7.

If the time levels are uniformly spaced, and if the reference basis functions are the same for each subinterval, say

k_{\ell}=k,\quad r_{\ell}=r\quad\text{and}\quad\Psi_{\ell j}=\Psi_{j}\quad\text{for $1\leq\ell\leq n$ and $1\leq j\leq r$,}

then

D_{n\bar{\ell}}=\bar{\ell}k\quad\text{and}\quad\Delta_{n\bar{\ell}}(\tau,\sigma)=\frac{\tau-\sigma}{2\bar{\ell}},

so the formulae of Lemma 4 show that $H^{n\bar{\ell}}_{ij}$ depends on $n$ and $\ell$ only through the difference $\bar{\ell}=n-\ell$ ; for further details, see Example 12 below.

4 Spatial discretisation

The initial-boundary value problem (1) is known to be well-posed [5, 8, 10]. Let $\langle u,v\rangle=\int_{\Omega}uv$ denote the usual inner product in $L^{2}(\Omega)$ , and let $a(u,v)$ denote the bilinear form associated with $A$ via the first Green identity. For example, if $A=-\nabla^{2}$ then $a(u,v)=\int_{\Omega}\nabla u\cdot\nabla v$ . In this way, the weak solution $u:(0,T]\to H^{1}_{0}(\Omega)$ satisfies

\langle\partial_{t}u,v\rangle+a(\partial_{t}^{1-\alpha}u,v)=\langle f(t),v\rangle\quad\text{for $v\in H^{1}_{0}(\Omega)$ and $0<t\leq T$.}

We choose a finite dimensional subspace $V_{n}\subseteq H^{1}_{0}(\Omega)$ for $0\leq n\leq N$ , and form the vector $\boldsymbol{V}=(V_{1},\ldots,V_{N})$ . For example, $V_{n}$ might be a (conforming) finite element space constructed using a triangulation of $\Omega$ . Our trial space $\mathcal{X}=\mathcal{X}(\boldsymbol{t},\boldsymbol{r},\boldsymbol{V})$ then consists of the functions $X:I\to H^{1}_{0}(\Omega)$ such that $X|_{I_{n}}\in\mathbb{P}_{r_{n}-1}(I_{n};V_{n})$ , that is, the restriction $X|_{I_{n}}$ is a polynomial in $t$ of degree at most $r_{n}-1$ , with coefficients from $V_{n}$ . Generalising (4), the dG solution $U\in\mathcal{X}$ of (1) satisfies

\bigl{\langle}\llbracket U\rrbracket^{n-1},X^{n-1}_{+}\bigr{\rangle}+\int_{I_{n}}\langle\partial_{t}U,X\rangle\,dt+\int_{I_{n}}a(\partial_{t}^{1-\alpha}U,X)\,dt=\int_{I_{n}}\langle f(t),X\rangle\,dt

(12)

for $X\in\mathbb{P}_{r_{n}-1}(I_{n};V_{n})$ and $1\leq n\leq N$ , with $U^{0}_{-}=U_{0}$ for a suitable $U_{0}\in V_{0}$ such that $U_{0}\approx u_{0}$ .

We choose a basis $\{\phi_{np}\}_{p=1}^{P_{n}}$ for $V_{n}$ . In the expansion (5), the coefficient $U^{nj}$ is now a function in $V_{n}$ , so there exist real numbers $U^{nj}_{q}$ such that

U^{nj}(x)=\sum_{q=1}^{P_{n}}U^{nj}_{q}\phi_{nq}(x)\quad\text{for $x\in\Omega$;}

for example, $U^{nj}_{q}=U^{nj}(x_{nq})$ if $x_{nq}$ is the $q$ th free node of a finite element mesh and if $\phi_{nq}$ is the corresponding nodal basis function. Similarly, for the discrete initial data, there are real numbers $U_{0q}$ such that

U_{0}(x)=\sum_{q=1}^{P_{0}}U_{0q}\phi_{0q}(x)\quad\text{for $x\in\Omega$.}

Choosing $X(x,t)=\psi_{ni}(t)\phi_{nq}(x)$ in (12), we find that the equations (6) for time stepping the scalar problem generalise to

\sum_{j=1}^{r_{n}}\sum_{q=1}^{P_{n}}\bigl{(}G^{n}_{ij}M^{nn}_{pq}+H^{n0}_{ij}A^{nn}_{pq}\bigr{)}U^{nj}_{q}=F^{ni}_{p}-\sum_{\ell=1}^{n-1}\sum_{j=1}^{r_{\ell}}\sum_{q=1}^{P_{\ell}}H^{n,n-\ell}_{ij}A^{n\ell}_{pq}U^{\ell j}_{q}\\ +\begin{cases}\psi_{1i}(0)\sum_{q=1}^{P_{0}}M^{10}_{pq}U_{0q},&n=1,\\[6.0pt] \sum_{j=1}^{r_{n-1}}\sum_{q=1}^{P_{n-1}}K^{n,n-1}_{ij}M^{n,n-1}_{pq}U^{n-1,j}_{q},&2\leq n\leq N,\end{cases}

(13)

where

M^{n\ell}_{pq}=\langle\phi_{\ell q},\phi_{np}\rangle,\qquad A^{n\ell}_{pq}=a(\phi_{\ell q},\phi_{np}),\qquad F^{ni}_{p}=\int_{I_{n}}\langle f(t),\phi_{np}\rangle\psi_{ni}(t)\,dt.

By introducing the $P_{n}\times P_{\ell}$ mass matrix $\boldsymbol{M}^{n\ell}=[M^{n\ell}_{pq}]$ and stiffness matrix $\boldsymbol{A}^{n\ell}=[A^{n\ell}_{pq}]$ , and forming the column vectors

\boldsymbol{U}^{nj}=\begin{bmatrix}U^{nj}_{1}\\ U^{nj}_{2}\\ \vdots\\ U^{nj}_{P_{n}}\end{bmatrix},\qquad\boldsymbol{F}^{ni}=\begin{bmatrix}F^{ni}_{1}\\ F^{ni}_{2}\\ \vdots\\ F^{ni}_{P_{n}}\end{bmatrix},\qquad\boldsymbol{U}_{0}=\begin{bmatrix}U_{01}\\ U_{02}\\ \vdots\\ U_{0P_{0}}\end{bmatrix},

we can rewrite the equations (13) as

\sum_{j=1}^{r_{n}}\bigl{(}G^{n}_{ij}\boldsymbol{M}^{nn}+H^{n0}_{ij}\boldsymbol{A}^{nn}\bigr{)}\boldsymbol{U}^{nj}=\boldsymbol{F}^{ni}-\sum_{\ell=1}^{n-1}\sum_{j=1}^{r_{\ell}}H^{n,n-\ell}_{ij}\boldsymbol{A}^{n\ell}\boldsymbol{U}^{\ell j}\\ +\begin{cases}\psi_{1i}(0)\boldsymbol{M}^{10}\boldsymbol{U}_{0},&n=1,\\[6.0pt] \sum_{j=1}^{r_{n-1}}K^{n,n-1}_{ij}\boldsymbol{M}^{n,n-1}\boldsymbol{U}^{n-1,j},&2\leq n\leq N.\end{cases}

(14)

To write (14) even more compactly, define the $r_{n}\times r_{n}$ matrix $\boldsymbol{G}^{n}=[G^{n}_{ij}]$ and the $r_{n}\times r_{\ell}$ matrix $\boldsymbol{H}^{n\bar{\ell}}=[H^{n\bar{\ell}}_{ij}]$ , together with the (block) column vectors

\boldsymbol{U}^{n}=\begin{bmatrix}\boldsymbol{U}^{n1}\\ \boldsymbol{U}^{n2}\\ \vdots\\ \boldsymbol{U}^{nr_{n}}\end{bmatrix}\quad\text{and}\quad\boldsymbol{F}^{n}=\begin{bmatrix}\boldsymbol{F}^{n1}\\ \boldsymbol{F}^{n2}\\ \vdots\\ \boldsymbol{F}^{nr_{n}}\end{bmatrix}.

We also form the $r_{n}\times r_{n-1}$ matrix $\boldsymbol{K}^{n,n-1}=[K^{n,n-1}_{ij}]$ and the column vector

\boldsymbol{\psi^{0}_{+}}=\begin{bmatrix}\psi_{11}(0)\\ \psi_{12}(0)\\ \vdots\\ \psi_{1r_{n}}(0)\\ \end{bmatrix}.

Utilising the Kronecker product, the linear system (14) takes the form

\bigl{(}\boldsymbol{G}^{n}\otimes\boldsymbol{M}^{nn}+\boldsymbol{H}^{n0}\otimes\boldsymbol{A}^{nn}\bigr{)}\boldsymbol{U}^{n}=\boldsymbol{F}^{n}-\sum_{\ell=1}^{n-1}\bigl{(}\boldsymbol{H}^{n,n-\ell}\otimes\boldsymbol{A}^{n\ell}\bigr{)}\boldsymbol{U}^{\ell}\\ +\begin{cases}\bigl{(}\boldsymbol{\psi}^{0}_{+}\otimes\boldsymbol{M}^{10}\bigr{)}\boldsymbol{U}_{0},&n=1,\\[6.0pt] \bigl{(}\boldsymbol{K}^{n,n-1}\otimes\boldsymbol{M}^{n,n-1}\bigr{)}\boldsymbol{U}^{n-1,j},&2\leq n\leq N.\end{cases}

(15)

5 Legendre polynomials

Let $P_{0}$ , $P_{1}$ , $P_{2}$ , …denote the Legendre polynomials with the standard normalisation $P_{j}(1)=1$ for all $j\geq 0$ . By choosing

\Psi_{nj}(\tau)=P_{j-1}(\tau),

(16)

we obtain a convenient and well-conditioned basis for $\mathbb{P}_{r_{n}-1}$ with the properties

\int_{-1}^{1}\Psi_{nj}(\tau)\Psi_{ni}(\tau)\,d\tau=\frac{2\delta_{ij}}{2j-1}\quad\text{and}\quad\Psi_{nj}(-\tau)=(-1)^{j-1}\Psi_{nj}(\tau)

for $i$ , $j\in\{1,2,\ldots,r_{n}\}$ .

Lemma 8.

With the choice (16) of basis functions,

\Psi_{nj}(1)=1\quad\text{and}\quad\Psi_{nj}(-1)=(-1)^{j-1},

(17)

and the coefficients (9) and (10) are given by

G^{n}_{ij}=\begin{cases}(-1)^{i+j},&\text{if $i\geq j$,}\\ 1,&\text{if $i<j$,}\end{cases}

and

K^{n,n-1}_{ij}=(-1)^{i-1}.

Proof:

The properties (17) follow from $P_{j}(1)=1$ and $P_{j}(-1)=(-1)^{j}$ . Hence, the formula for $K^{n,n-1}_{ij}$ follows from (10), and by (9),

G^{n}_{ij}=(-1)^{i+j}+E_{ij}\quad\text{where}\quad E_{ij}=\int_{-1}^{1}P_{j-1}^{\prime}(\tau)P_{i-1}(\tau)\,d\tau.

If $j\leq i$ , then $E_{ij}=0$ because $P_{j-1}^{\prime}$ is orthogonal to $P_{i-1}$ . Otherwise, if $j>i$ , then $P_{j-1}$ is orthogonal to $P_{i-1}^{\prime}$ so integration by parts gives

E_{ij}=\bigl{[}P_{j-1}(x)P_{i-1}(x)\bigr{]}_{-1}^{1}-\int_{-1}^{1}P_{j-1}(x)P_{i-1}^{\prime}(x)\,dx=1-(-1)^{i+j}

and hence $G^{n}_{ij}=1$ . $\spadesuit$

Example 9.

If $r_{n}=4$ and $r_{n-1}=3$ , then the matrices $\boldsymbol{G}^{n}=[G^{n}_{ij}]$ and $\boldsymbol{K}^{n,n-1}=[K^{n,n-1}_{ij}]$ are

\boldsymbol{G}^{n}=\left[\begin{array}[]{rrrr}1&1&1&1\\ -1&1&1&1\\ 1&-1&1&1\\ -1&1&-1&\phantom{-}1\end{array}\right]\quad\text{and}\quad\boldsymbol{K}^{n,n-1}=\left[\begin{array}[]{rrr}1&1&1\\ -1&-1&-1\\ 1&1&1\\ -1&-1&-1\end{array}\right].

We have no analogous, simple formula for the remaining coefficients $H^{n\ell}_{ij}$ . However, when $\bar{\ell}=0$ ( $\ell=n$ ) the following parity property holds.

Lemma 10.

With the choice (16) of basis functions,

H^{n0}_{ji}=(-1)^{i+j}H^{n0}_{ij}.

(18)

Proof:

Using (7), we find that

H^{n0}_{ji}=\int_{-1}^{1}(BP_{i-1})^{\prime}(\tau)P_{j-1}(\tau)\,d\tau=-\int_{-1}^{1}P_{i-1}(\sigma)(B^{*}P_{j-1})^{\prime}(\sigma)\,d\sigma,

where

(Bv)(\tau)=\int_{-1}^{\tau}\omega_{\alpha}(\tau-\sigma)v(\sigma)\,d\sigma\quad\text{and}\quad(B^{*}v)(\sigma)=\int_{\sigma}^{1}\omega_{\alpha}(\tau-\sigma)v(\tau)\,d\tau.

Let $(RV)(\tau)=V(-\tau)$ . A short calculation shows that $RB^{*}=BR$ , so

	$\displaystyle(B^{*}P_{j-1})^{\prime}(-\sigma)$	$\displaystyle=-\frac{d}{d\sigma}\bigl{[}(B^{}P_{j-1})(-\sigma)\bigr{]}=-\frac{d}{d\sigma}(RB^{}P_{j-1})^{\prime}(\sigma)$
		$\displaystyle=-(BRP_{j-1})^{\prime}(\sigma)=(-1)^{j}(BP_{j-1})^{\prime}(\sigma),$

and therefore, using the substitution $\sigma=-x$ ,

	$\displaystyle H^{n0}_{ji}$	$\displaystyle=(-1)^{j+1}\int_{-1}^{1}P_{i-1}(-x)(BP_{j-1})^{\prime}(x)\,dx$
		$\displaystyle=(-1)^{i+j}\int_{-1}^{1}(BP_{j-1})^{\prime}(x)P_{i-1}(x)\,dx=(-1)^{i+j}H^{n0}_{ij},$

as claimed. $\spadesuit$

Remark 11.

In the limit as $\alpha\to 1$ , we see from Remark 1 that

H^{n0}_{ij}\to\int_{I_{n}}\psi_{nj}(t)\psi_{ni}(t)\,dt=\frac{k_{n}}{2}\int_{-1}^{1}\Psi_{j}(\tau)\Psi_{i}(\tau)\,d\tau=\frac{k_{n}\delta_{ij}}{2j-1}.

Example 12.

Consider the uniform case $k_{n}=k$ , $r_{n}=r$ and $\Psi_{nj}=\Psi_{j}$ for $1\leq n\leq N$ (as in Remark 7), with $\Psi_{j}(\tau)=P_{j-1}(\tau)$ as above. We then have

H^{n\bar{\ell}}_{ij}=k^{\alpha}H^{\bar{\ell}}_{ij}\quad\text{for $1\leq\ell\leq n\leq N$ and $i$, $j\in\{1,2,\ldots,r\}$,}

where, by Lemma 3,

	$\displaystyle H^{0}_{ij}=\frac{1}{2^{\alpha}\Gamma(\alpha)}$	$\displaystyle\biggl{(}\int_{-1}^{1}(1-\sigma)^{\alpha}P_{j-1}(\sigma)\,d\sigma$		(19)
		$\displaystyle\qquad{}-\int_{-1}^{1}(1+y)^{\alpha-1}(1-y)\Phi_{ij}(y)\,dy\biggr{)},$		(19)

with

\Phi_{ij}(y)=\frac{1}{2}\int_{-1}^{1}P_{j-1}\bigl{(}\tfrac{1}{2}(1-y)(1+z)-1\bigr{)}P^{\prime}_{i-1}\bigl{(}1-\tfrac{1}{2}(1-y)(1-z)\bigr{)}\,dz,

(20)

and by Lemma 4,

H^{\bar{\ell}}_{ij}=\frac{\bar{\ell}^{\alpha-1}}{2\Gamma(\alpha)}\bigl{(}\mathcal{A}^{\bar{\ell}}_{j}+(-1)^{i}\mathcal{B}^{\bar{\ell}}_{j}-\mathcal{C}^{\bar{\ell}}_{ij}\bigr{)}\quad\text{for $\ell\geq 1$,}

with, letting $\Delta_{\bar{\ell}}(\tau)=\tau/(2\bar{\ell})$ ,

	$\displaystyle\mathcal{A}^{\bar{\ell}}_{j}$	$\displaystyle=\int_{-1}^{1}\bigl{(}1+\Delta_{\bar{\ell}}(1-\sigma)\bigr{)}^{\alpha-1}P_{j-1}(\sigma)\,d\sigma,$
	$\displaystyle\mathcal{B}^{\bar{\ell}}_{j}$	$\displaystyle=\int_{-1}^{1}\bigl{(}1-\Delta_{\bar{\ell}}(1+\sigma)\bigr{)}^{\alpha-1}P_{j-1}(\sigma)\,d\sigma,$
	$\displaystyle\mathcal{C}^{\bar{\ell}}_{ij}$	$\displaystyle=\int_{-1}^{1}P_{i-1}^{\prime}(\tau)\int_{-1}^{1}\bigl{(}1+\Delta_{\bar{\ell}}(\tau-\sigma)\bigr{)}^{\alpha-1}P_{j-1}(\sigma)\,d\sigma\,d\tau.$

Moreover, Lemma 5 provides alternative expressions when $\bar{\ell}=1$ :

	$\displaystyle\mathcal{A}^{1}_{ij}$	$\displaystyle=2^{1-\alpha}\int_{-1}^{1}(3-\sigma)^{\alpha-1}P_{j-1}(\sigma)\,d\sigma,$
	$\displaystyle\mathcal{B}^{1}_{ij}$	$\displaystyle=2^{1-\alpha}\int_{-1}^{1}(1-\sigma)^{\alpha-1}P_{j-1}(\sigma)\,d\sigma$

and

	$\displaystyle\mathcal{C}^{1}_{ij}$	$\displaystyle=2^{1-\alpha}\biggl{(}\int_{-1}^{1}(1+\tau)^{\alpha}P_{i-1}^{\prime}(\tau)\int_{0}^{1}(1+z)^{\alpha-1}P_{j-1}\bigl{(}1-z(1+\tau)\bigr{)}\,dz\,d\tau$
		$\displaystyle{}+\int_{-1}^{1}(1-\sigma)^{\alpha}P_{j-1}(\sigma)\int_{0}^{1}(z+1)^{\alpha-1}P_{i-1}^{\prime}\bigl{(}z(1-\sigma)-1\bigr{)}\,dz\,d\sigma\biggr{)}.$

Likewise, Lemma 6 provides an alternative expression for $\bar{\ell}\geq 2$ :

H^{\bar{\ell}}_{ij}=-\frac{1-\alpha}{4\Gamma(\alpha)}\,\bar{\ell}^{\alpha-2}\int_{-1}^{1}P_{i-1}(\tau)\int_{-1}^{1}\bigl{(}1+\Delta_{\bar{\ell}}(\tau-\sigma)\bigr{)}^{\alpha-2}P_{j-1}(\sigma)\,d\sigma\,d\tau.

(21)

Finally, by arguing as in the proof of Lemma 10, we can show that

H^{\bar{\ell}}_{ji}=(-1)^{i+j}H^{\bar{\ell}}_{ij}\quad\text{for all $\bar{\ell}\geq 0$.}

(22)

6 Reconstruction

Throughout this section, we continue to use the Legendre basis (16). Some insight into the dG method can be had by considering the trivial case of (1) when $A=0$ , that is, $\partial_{t}u=f(t)$ for $0<t\leq T$ , with $u(0)=u_{0}$ . The dG scheme (12) then reduces to

\bigl{\langle}\llbracket U\rrbracket^{n},X^{n}_{+}\bigr{\rangle}+\int_{I_{n}}\langle\partial_{t}U,X\rangle\,dt=\int_{I_{n}}\langle\partial_{t}u,X\rangle\,dt

(23)

for $X\in\mathbb{P}_{r_{n}-1}(I_{n};V_{n})$ and $1\leq n\leq N$ , with $U^{0}_{-}=U_{0}$ . To state our next result, let $\mathcal{P}_{n}$ denote the orthoprojector from $L_{2}(\Omega)$ onto $V_{n}$ , and define

\mathcal{Q}_{n\ell}=\mathcal{P}_{n}\mathcal{P}_{n-1}\cdots\mathcal{P}_{\ell+1}.

Lemma 13.

If $A=0$ and $U_{0}=\mathcal{P}_{0}u_{0}$ , then for $1\leq n\leq N$ the dG solution $U\in\mathcal{X}$ satisfies

U^{n}_{-}=\mathcal{P}_{n}u(t_{n})+\sum_{\ell=0}^{n-1}\mathcal{Q}_{n\ell}(\mathcal{P}_{\ell}-I)u(t_{\ell})

(24)

and

\int_{I_{n}}\langle U-u,\partial_{t}X\rangle\,dt=0\quad\text{for all $X\in\mathbb{P}_{n}(I_{n};V)$.}

(25)

Proof:

Integrating by parts in (23), we find that

\langle U^{n}_{-}-u(t_{n}),X^{n}_{-}\rangle=\langle U^{n-1}_{-}-u(t_{n-1}),X^{n}_{+}\rangle+\int_{I_{n}}\langle U-u,\partial_{t}X\rangle\,dt.

Given $v\in V_{n}$ , by choosing the constant function $X(t)=v$ for $t\in I_{n}$ we deduce that $\langle U^{n}_{-}-u(t_{n}),v\rangle=\langle U^{n-1}_{-}-u(t_{n-1}),v\rangle$ and so (25) is satisfied. Moreover,

\mathcal{P}_{n}\bigl{(}U^{n}_{-}-u(t_{n})\bigr{)}=\mathcal{P}_{n}\bigl{(}U^{n-1}_{-}-u(t_{n-1})\bigr{)},

and, by the choice of initial condition, we see that (24) is satisfied for $n=1$ :

	$\displaystyle U^{1}_{-}-\mathcal{P}_{1}u(t_{1})$	$\displaystyle=\mathcal{P}_{1}\bigl{(}U^{1}_{-}-u(t_{1})\bigr{)}=\mathcal{P}_{1}(I-\mathcal{P}_{0}+\mathcal{P}_{0})\bigl{(}U^{0}_{-}-u(t_{0})\bigr{)}$
		$\displaystyle=\mathcal{P}_{1}(\mathcal{P}_{0}-I)u(t_{0})+\mathcal{P}_{1}(U_{0}-\mathcal{P}_{0}u_{0})=\mathcal{Q}_{11}(\mathcal{P}_{0}-I)u(t_{0}).$

Letting $n\geq 2$ , we make the induction hypothesis

U^{n-1}_{-}=\mathcal{P}_{n-1}u(t_{n-1})+\sum_{\ell=0}^{n-1}\mathcal{Q}_{n-1,\ell}(\mathcal{P}_{\ell}-I)u(t_{\ell}),

and observe that

	$\displaystyle U^{n}_{-}-\mathcal{P}_{n}u(t_{n})$	$\displaystyle=\mathcal{P}_{n}\bigl{(}U^{n}_{-}-u(t_{n})\bigr{)}=\mathcal{P}_{n}(I-\mathcal{P}_{n-1}+\mathcal{P}_{n-1})\bigl{(}U^{n-1}_{-}-u(t_{n-1})\bigr{)}$
		$\displaystyle=\mathcal{P}_{n}(\mathcal{P}_{n-1}-I)u(t_{n-1})+\mathcal{P}_{n}\sum_{\ell=0}^{n-1}\mathcal{Q}_{n-1,\ell}(\mathcal{P}_{\ell}-I)u(t_{\ell}),$

which gives the desired formula (24). $\spadesuit$

For the remainder of this section, we will assume that the subspaces $V_{n}$ are nested, as follows:

V_{0}\supseteq V_{1}\supseteq V_{2}\supseteq\cdots\supseteq V_{N}.

(26)

It follows that $\mathcal{P}_{\ell+1}(\mathcal{P}_{\ell}-I)=0$ for $0\leq\ell\leq N-1$ and so

U^{n}_{-}=\mathcal{P}_{n}u(t_{n}).

(27)

The following explicit representation for $U$ holds.

Lemma 14.

If $A=0$ , $U_{0}=\mathcal{P}_{0}u_{0}$ and the subspaces satisfy (26), then

U(t)=\sum_{j=1}^{r_{n}-1}a_{nj}\psi_{nj}(t)+\tilde{a}_{n}\psi_{nr_{n}}(t)\quad\text{for $t\in I_{n}$,}

(28)

where

a_{nj}=\frac{2j-1}{k_{n}}\int_{I_{n}}\mathcal{P}_{n}u(t)\psi_{nj}(t)\,dt

are the local Fourier–Legendre coefficients of $\mathcal{P}_{n}u$ , but

\tilde{a}_{n}=\mathcal{P}_{n}u(t_{n})-\sum_{j=1}^{r_{n}-1}a_{nj}.

Proof:

By definition, $U|_{I_{n}}\in\mathbb{P}_{r_{n}-1}(I_{n};V_{n})$ so there exist coefficients $a_{nj}$ and $\tilde{a}_{n}$ in $V_{n}$ such that $U$ has the desired expansion. The formula for $a_{nj}$ follows at once from the orthogonality property of the $\psi_{nj}$ (see Remark 11). The formula for $\tilde{a}_{n}$ follows from (27) because $\psi_{nj}(t_{n})=P_{j-1}(1)=1$ for all $j$ . $\spadesuit$

We have a Peano kernel $\mathsf{G}_{r}$ for the Fourier–Legendre expansion of degree $r$ ,

f(\tau)=\sum_{j=1}^{r+1}b_{j}\Psi_{j}(\tau)+\int_{-1}^{1}\mathsf{G}_{r}(\tau,\sigma)f^{(r+1)}(\sigma)\,d\sigma\quad\text{for $-1\leq\tau\leq 1$,}

assuming $f:[-1,1]\to\mathbb{R}$ is $C^{r+1}$ , and also a Peano kernel $\mathsf{M}_{j}(\tau)$ for the $j$ th coefficient:

b_{j}=\frac{2j-1}{2}\int_{-1}^{1}f(\tau)\Psi_{j}(\tau)\,d\tau=\int_{-1}^{1}\mathsf{M}_{j}(\tau)f^{(j-1)}(\tau)\,d\tau.

Thus, if $t=\mathsf{t}_{n}(\tau)$ and $s=\mathsf{t}_{n}(\sigma)$ , and if we define the local Peano kernels

\mathsf{g}_{nr}(t,s)=(k_{n}/2)^{r}\mathsf{G}_{r}(\tau,\sigma)\quad\text{and}\quad\mathsf{m}_{nj}(t)=(k_{n}/2)^{j-2}\mathsf{M}_{j}(\tau),

then

\mathcal{P}_{n}u(t)=\sum_{j=1}^{r_{n}+1}a_{nj}\psi_{nj}(t)+\int_{I_{n}}\mathsf{g}_{r_{n}}(t,s)\mathcal{P}_{n}u^{(r_{n}+1)}(s)\,ds\quad\text{for $t\in I_{n}$,}

(29)

and

a_{nj}=\int_{I_{n}}\mathsf{m}_{nj}(s)\mathcal{P}_{n}u^{(j-1)}(s)\,ds.

It follows that $a_{nj}=O(k_{n}^{j-1})$ provided $u$ is $C^{j-1}$ on $\bar{I}_{n}$ .

Theorem 15.

Assume that $A=0$ , $U_{0}=\mathcal{P}u_{0}$ and the subspaces satisfy (26). If $u:\bar{I}_{n}\to L^{2}(\Omega)$ is $C^{r_{n}+1}$ , then $a_{n,r_{n}+1}=O(k_{n}^{r_{n}})$ and

\mathcal{P}_{n}u(t)-U(t)=a_{n,r_{n}+1}\bigl{[}\psi_{n,r_{n}+1}(t)-\psi_{n,r_{n}}(t)\bigr{]}+O(k_{n}^{r_{n}+1})\quad\text{for $t\in I_{n}$.}

(30)

Proof:

Subtracting (28) from (29), we have

\mathcal{P}_{n}u(t)-U(t)=(a_{n,r_{n}}-\tilde{a}_{n})\psi_{nr_{n}}(t)+a_{n,r_{n}+1}\psi_{n,r_{n}+1}(t)+O(k_{n}^{r_{n}+1})

for $t\in I_{n}$ . Since $U^{n}_{-}=\mathcal{P}_{n}u(t_{n})$ and $\psi_{n,r_{n}}(t_{n})=\psi_{n,r_{n}+1}(t_{n})=1$ , taking the limit as $t\to t_{n}$ yields $a_{n,r_{n}}-\tilde{a}_{n}=-a_{n,r_{n}+1}+O(k_{n}^{r_{n}+1})$ . $\spadesuit$

Corollary 16.

$\mathcal{P}_{n}\llbracket U\rrbracket^{n-1}=2(-1)^{r_{n}+1}a_{n,r_{n}+1}+O(k_{n}^{r_{n}+1})$ .

Proof:

As $t\to t_{n-1}^{+}$ , the left-hand side of (30) tends to

	$\displaystyle\mathcal{P}_{n}u(t_{n-1})-U^{n-1}_{+}$	$\displaystyle=\mathcal{P}_{n}(I-\mathcal{P}_{n-1}+\mathcal{P}_{n-1})U^{n-1}_{-}-U^{n-1}_{+}=\mathcal{P}_{n}U^{n-1}_{-}-U^{n-1}_{+}$
		$\displaystyle=-\mathcal{P}_{n}(U^{n-1}_{+}-U^{n-1}_{-})=-\mathcal{P}_{n}\llbracket U\rrbracket^{n-1},$

and on the right-hand side, $\psi_{n,r_{n}+1}(t)-\psi_{n,r_{n}}(t)$ tends to $P_{r_{n}}(-1)-P_{r_{n}-1}(-1)=(-1)^{r_{n}}-(-1)^{r_{n}-1}=2(-1)^{r_{n}}$ . $\spadesuit$

Refer to caption — Figure 1: The polynomials $P_{r}(\tau)-P_{r-1}(\tau)$ .

In light of Theorem 15, we consider the polynomials

\psi_{n,r_{n}+1}(t)-\psi_{n,r_{n}}(t)=\Psi_{r_{n}+1}(\tau)-\Psi_{r_{n}}(\tau)=P_{r_{n}}(\tau)-P_{r_{n}-1}(\tau).

As illustrated in Figure 1, there are $r+1$ points

-1=\tau_{r0}<\tau_{r1}<\cdots<\tau_{rr}=1

such that

(P_{r}-P_{r-1})(\tau_{rj})=0\quad\text{for $1\leq j\leq r$.}

In fact, the $r$ zeros $\tau_{r1}$ , $\tau_{r2}$ , …, $\tau_{rr}$ are the points of a right-Radau quadrature rule [4, Chapter 9] on the interval $[-1,1]$ . We put

t^{*}_{nj}=\mathsf{t}_{n}(\tau_{r_{n}j})\quad\text{for $0\leq j\leq r_{n}$,}

(31)

so that $t^{*}_{n-1}=t^{*}_{n0}<t_{n1}<\cdots<t^{*}_{nr_{n}}=t_{n}$ and

\psi_{n,r_{n}+1}(t^{*}_{nj})-\psi_{n,r_{n}}(t^{*}_{nj})=0\quad\text{for $1\leq j\leq r_{n}$.}

From Theorem 15, we see that $\mathcal{P}_{n}u(t)-U(t)=O(k_{n}^{r_{n}})$ for general $t\in I_{n}$ , but $\mathcal{P}_{n}u(t^{*}_{nj})-U(t^{*}_{nj})=O(k_{n}^{r_{n}+1})$ for $1\leq j\leq r_{n}$ . Let $\widehat{\mathcal{X}}$ denote the space obtained from $\mathcal{X}$ by increasing the maximum allowed polynomial degree over the subinterval $I_{n}$ from $r_{n}$ to $\hat{r}_{n}=r_{n}+1$ , for $1\leq n\leq N$ . The reconstruction $\widehat{U}\in\widehat{\mathcal{X}}$ of $U\in\mathcal{X}$ is then defined by requiring that

\widehat{U}(t^{*}_{nj})=U(t^{*}_{nj})\quad\text{for $1\leq j\leq r_{n}-1$,}

and that the one-sided limits at the end points are

\widehat{U}^{n-1}_{+}=\mathcal{P}_{n}U^{n-1}_{-}\quad\text{and}\quad\widehat{U}^{n}_{-}=U^{n}_{-}.

Since $\widehat{U}|_{I_{n}}$ is a polynomial of degree at most $\hat{r}_{n}-1=r_{n}$ , it is uniquely determined by these $r_{n}+1$ interpolation conditions. Notice also that $\widehat{U}$ is continuous at $t_{n-1}$ if $V_{n-1}=V_{n}$ because $\mathcal{P}_{n}U^{n-1}_{-}=U^{n-1}_{-}$ .

Makridakis and Nochetto [6] introduced the reconstruction in their analysis of a posteriori error bounds for parabolic PDEs. Since the polynomial $(U-\widehat{U})|_{I_{n}}$ has degree at most $r_{n}$ and vanishes at $t^{*}_{nj}$ for $1\leq n\leq r_{n}$ , it must be a multiple of $\psi_{n,r_{n}+1}-\psi_{nr_{n}}$ . In fact, by taking limits as $t\to t_{n-1}^{+}$ , we see that

U(t)-\widehat{U}(t)=\frac{1}{2}(-1)^{r_{n}}\mathcal{P}_{n}\llbracket U\rrbracket^{n-1}\bigl{[}\psi_{n,r_{n}+1}(t)-\psi_{n,r_{n}}(t)\bigr{]}\quad\text{for $t\in I_{n}$.}

(32)

At the same time, by Theorem 15 and Corollary 16,

U(t)-\mathcal{P}_{n}u(t)=\frac{1}{2}(-1)^{r_{n}}\mathcal{P}_{n}\llbracket U\rrbracket^{n-1}\bigl{[}\psi_{n,r_{n}+1}(t)-\psi_{n,r_{n}}(t)\bigr{]}+O(k_{n}^{r_{n}+1})\quad\text{for $t\in I_{n}$,}

(33)

implying that $\widehat{U}-\mathcal{P}_{n}u$ is $O(k_{n}^{r_{n}+1})$ on $I_{n}$ . One of our principal aims in the next section is to investigate numerically the error in the dG solution $U$ and its reconstruction $\widehat{U}$ in non-trival cases where $A\neq 0$ . We can hope that something like (33) still holds, because the time derivative in the term $\partial_{t}^{1-\alpha}Au$ is of lower order than in $\partial_{t}u$ . Notice that (5) and (32) imply

\widehat{U}(t)=\sum_{j=1}^{\hat{r}_{n}}\widehat{U}^{nj}\psi_{nj}(t)\quad\text{for $t\in I_{n}$,}

where

\widehat{U}^{nj}=\begin{cases}U^{nj},&1\leq j\leq r_{n}-1,\\ U^{nr_{n}}+\tfrac{1}{2}(-1)^{r_{n}}\mathcal{P}_{n}\llbracket U\rrbracket^{n-1},&j=r_{n},\\ \tfrac{1}{2}(-1)^{r_{n}+1}\mathcal{P}_{n}\llbracket U\rrbracket^{n-1},&j=r_{n}+1=\hat{r}_{n}.\end{cases}

7 Numerical experiments

A Julia package [7] provides functions to evaluate the coefficients $G^{n}_{ij}$ , $K^{n,n-1}_{ij}$ and $H^{n\bar{\ell}}_{ij}$ based on the results of Sections 3 and 5. This package also includes (in the examples directory) the scripts used for the examples below.

7.1 The matrix ${\boldsymbol{H}^{\bar{\ell}}}$

Let $\alpha=3/4$ , and consider for simplicity the case when $k_{n}=k$ and $r_{n}=r$ are constant for all $n$ , so that the formulae of Example 12 apply. To get a sense of how the matrix entries $H^{\bar{\ell}}_{ij}$ behave, we computed

\boldsymbol{H}^{0}=\left[\begin{array}[]{rrrr}1.08807&0.15544&0.07065&0.04239\\ -0.15544&0.49458&0.09326&0.04834\\ 0.07065&-0.09326&0.33839&0.06893\\ -0.04239&0.04834&-0.06893&0.26319\end{array}\right]

and

\boldsymbol{H}^{1}=\left[\begin{array}[]{rrrr}-0.34623&-0.13428&-0.06884&-0.04219\\ 0.13428&0.08414&0.05405&0.03690\\ -0.06884&-0.05405&-0.04050&-0.03048\\ 0.04219&0.03690&0.03048&0.02472\end{array}\right],

which illustrate the property (22). The factor $\bigl{(}1+\Delta_{\bar{\ell}}(\tau-\sigma)\bigr{)}^{\alpha-2}$ in (21) becomes very smooth as $\bar{\ell}$ increases, with the result that $H^{\bar{\ell}}_{ij}$ decays rapidly to zero as $i+j$ increases. Even for $\bar{\ell}=2$ , we have

\boldsymbol{H}^{2}=10^{-1}\times\left[\begin{array}[]{rrrr}-0.91483&-0.10220&-0.01261&-0.00164\\ 0.10220&0.02027&0.00355&0.00059\\ -0.01261&-0.00355&-0.00080&-0.00016\\ 0.00164&0.00059&0.00016&0.00004\end{array}\right],

and Figure 2 shows this behaviour for larger values of $\bar{\ell}$ , with entries in the lower right corner of the matrix reaching the order of the machine epsilon ( $2^{-52}\approx 2.22\times 10^{-16}$ ) once $\bar{\ell}$ is of order $100$ .

The value of $H^{0}_{ij}$ can be computed to machine precision using Gauss quadrature with $M_{\sigma}=\lceil j/2\rceil$ and $M_{y}=\lceil(i+j)/2\rceil-1$ points for the integrals with respect to $\sigma$ and $y$ in (19), and using $M_{z}=\lceil(i+j)/2\rceil-1$ points for the integral with respect to $z$ in (20). When $\ell\geq 1$ , let $H^{\bar{\ell}}_{ij}(M)$ denote the value of $H^{\bar{\ell}}_{ij}$ computed by applying $M$ -point Gauss rules to (21), that is, $M^{2}$ points for the double integral. For a given absolute tolerance $\mathtt{atol}$ , let $M^{\bar{\ell}}_{r}(\mathtt{atol})$ denote the smallest $M$ for which

\bigl{|}H^{\bar{\ell}}_{ij}(M)-H^{\bar{\ell}}_{ij}(12)\bigr{|}<\mathtt{atol}\quad\text{for all $i$, $j\in\{1,2,\ldots,r\}$.}

Table 1 lists some values of $M^{\bar{\ell}}_{r}(\mathtt{atol})$ for $\mathtt{atol}=10^{-14}$ . Unsurprisingly, fewer quadrature points are needed as $\bar{\ell}$ increases.

Table 1: Numbers of Gauss points

M^{\bar{\ell}}_{r}(\mathtt{atol})

required for

\texttt{atol}=10^{-14}

, when

\alpha=3/4

$r$	$\bar{\ell}=1$	$\bar{\ell}=2$	$\bar{\ell}=10$	$\bar{\ell}=100$	$\bar{\ell}=1000$
1	9	9	5	3	2
2	9	9	5	3	2
3	9	9	5	4	3
4	10	10	6	4	3
5	10	10	6	5	4
6	11	11	7	5	4

7.2 A fractional ODE

We consider the initial-value problem (3) in the case

\alpha=1/2,\quad\lambda=1/2,\quad f(t)=\cos\pi t,\quad u_{0}=1,\quad T=2,

(34)

for which the solution is

u(t)=u_{0}E_{1/2}(-\lambda\sqrt{t})+\int_{0}^{t}E_{1/2}(-\lambda\sqrt{t-s})f(s)\,ds,

where $E_{\alpha}(x)=\sum_{n=0}^{\infty}t^{n}/\Gamma(1+n\alpha)$ denotes the Mittag–Leffler function. The substitution $s=(1-y^{2})t$ yields a smooth integrand, allowing $u(t)$ to be computed accurately via Gauss quadrature on the unit interval $[0,1]$ . Note that $E_{1/2}(-x)=\operatorname{erfcx}(x)=e^{x^{2}}\operatorname{erfc(x)}$ is just the scaled complementary error function.

Figure 3 shows $u$ , together with the dG solution $U$ using piecewise quadratics ( $r=3$ ) and only $N=3$ subintervals. In Figure 4 we plot the absolute errors,

\widehat{E}(t)=|\widehat{U}(t)-u(t)|\quad\text{and}\quad E^{n}_{j}=\begin{cases}|U(t^{*}_{n0}+0)-u(t^{*}_{n0})|,&j=0,\\ |U(t^{*}_{nj})-u(t^{*}_{nj})|,&1\leq j\leq r-1,\\ |U(t^{*}_{nr}-0)-u(t^{*}_{nr})|,&j=r,\end{cases}

(35)

again using piecewise quadratics but now with $N=5$ subintervals of uniform size $k_{n}=k=T/N$ . Two features are immediately apparent. First, the accuracy is poor near $t=0$ , reflecting the singular behaviour of the solution: for $m\geq 1$ , the $m$ th derivative $u^{(m)}(t)$ blows up like $t^{-(m-1/2)}$ as $t\to 0$ . Second, on intervals $I_{n}$ away from $0$ , the error is notably smaller at the right-Radau points ( $t^{*}_{nj}$ for $1\leq j\leq 3$ ) than at the left endpoint ( $t^{*}_{n0}=t_{n-1}$ ).

Table 2: Maximum weighted errors (36) at the points

t^{*}_{nj}

using piecewise quadratics (

r=3

) on a uniform grid.

$N$	$E^{\max}_{0}$		$E^{\max}_{1}$		$E^{\max}_{2}$		$E^{\max}_{3}$
8	8.0e-03		8.8e-05		1.3e-04		1.0e-04
16	1.2e-03	2.69	1.4e-05	2.62	1.4e-05	3.15	9.3e-06	3.46
32	1.7e-04	2.87	1.5e-06	3.25	1.3e-06	3.40	8.2e-07	3.50
64	2.2e-05	2.94	1.4e-07	3.42	1.2e-07	3.47	7.2e-08	3.51
128	2.8e-06	2.97	1.3e-08	3.47	1.1e-08	3.49	6.3e-09	3.51
256	3.6e-07	2.98	1.1e-09	3.49	9.5e-10	3.50	5.5e-10	3.51

In Table 2, we show how the quantities

E^{\max}_{j}=\max_{1\leq n\leq N}(t^{*}_{nj})^{r-\alpha}E^{n}_{j}

(36)

behave as $N$ grows. These results, together with similar computations using other choices of $\alpha$ and $r\geq 2$ , lead us to conjecture that, in general, using a constant time step $k$ ,

E^{n}_{0}\leq C(t^{*}_{n0})^{\alpha-r}k^{r}\quad\text{for $2\leq n\leq N$,}

whereas

E^{n}_{j}\leq C(t^{*}_{nj})^{\alpha-r}k^{r+\alpha}\quad\text{for $1\leq n\leq N$ and $1\leq j\leq r$,}

and that, consequently,

|\widehat{U}(t)-u(t)|\leq Ct^{\alpha-r}k^{r+\alpha}\quad\text{for~{}$t_{1}\leq t\leq T$.}

However, using piecewise-constants ( $r=1$ ) we do not observe any superconvergence, with both $E^{\max}_{0}$ and $E^{\max}_{1}$ behaving like $Ct_{n}^{1-\alpha}k$ , albeit with a noticably smaller constant in the case of $E^{\max}_{1}$ .

To suppress the growth in the error as $t$ approaches $0$ , we can use a graded mesh of the form

t_{n}=(n/N)^{q}T\quad\text{for $0\leq n\leq N$,}

(37)

with a suitable grading exponent $q\geq 1$ . Table 3 shows the maximum error in the reconstruction, i.e., $\max_{0\leq t\leq T}|\widehat{U}(t)-u(t)|$ , together with the associated convergence rates, for four choices of $q$ and using $T=1$ as the final time. These errors appear to be of order $k^{\min(3.5,q\alpha)}$ where $k=\max_{1\leq n\leq N}k_{n}\leq CN^{-1}$ . We conjecture that, in general,

|\widehat{U}(t)-u(t)|\leq Ck^{\min(r+\alpha,q\alpha)}\quad\text{for $0\leq t\leq T$, provided $r\geq 2$.}

(38)

Table 3: Maximum error in the reconstruction

\widehat{U}(t)

for

0\leq t\leq T=1

, using piecewise quadratics (

r=3

) for four choices of the mesh grading exponent

q

; see (37).

$N$	$q=1$		$q=3$		$q=5$		$q=6$
8	1.1e-02		1.4e-03		4.1e-04		7.1e-04
16	6.0e-03	0.84	3.8e-04	1.89	5.2e-05	2.95	9.1e-05	2.97
32	4.3e-03	0.50	1.3e-04	1.50	7.9e-06	2.72	1.0e-05	3.19
64	3.0e-03	0.50	4.7e-05	1.50	1.4e-06	2.51	9.8e-07	3.35
128	2.1e-03	0.50	1.7e-05	1.50	2.5e-07	2.50	9.2e-08	3.41
256	1.5e-03	0.50	5.9e-06	1.50	4.3e-08	2.50	8.4e-09	3.45

7.3 A fractional PDE

Consider the elliptic operator $A=-\partial^{2}/\partial x^{2}$ for the 1D spatial domain $\Omega=(0,L)$ . To construct a reference solution, we exploit fact that the Laplace transform of $u$ ,

\tilde{u}(x,z)=\int_{0}^{\infty}e^{-zt}u(x,t)\,dt,

satisfies the two-point boundary-value problem

\omega^{2}\tilde{u}-\tilde{u}_{xx}=g(x,z)\quad\text{for $0<x<L$,}\quad\text{with $\tilde{u}(0,z)=0=\tilde{u}(L,z)$,}

where

\omega=z^{\alpha/2}\quad\text{and}\quad g(x,z)=z^{\alpha-1}[u_{0}(x)+\tilde{f}(x,z)].

The variation-of-parameters formula leads to the integral representation

\tilde{u}(x,z)=\frac{\sinh\omega(L-x)}{\omega\sinh\omega L}\int_{0}^{x}g(x,z)\sinh\omega\xi\,d\xi\\ +\frac{\sinh\omega x}{\omega\sinh\omega L}\int_{x}^{L}g(x,z)\sinh\omega\xi\,d\xi,

and the Laplace inversion formula then gives

u(x,t)=\frac{1}{2\pi i}\int_{\Gamma}e^{zt}\tilde{u}(x,z)\,dz,

(39)

for a contour $\Gamma$ homotopic to the imaginary axis and passing to the right of all singularities of the integrand.

We choose as data the functions

u_{0}(x)=C_{0}x(L-x)\quad\text{and}\quad f(x,t)=C_{f}te^{-t},

(40)

for constants $C_{0}$ and $C_{f}$ , and find that

	$\displaystyle\tilde{u}(x,z)$	$\displaystyle=\frac{C_{0}}{z}\,\frac{\rho_{1}(x)\sinh\omega(L-x)+\rho_{1}(L-x)\sinh\omega x}{\sinh\omega L}$
		$\displaystyle\qquad{}+\frac{C_{f}}{z(z+1)^{2}}\frac{\rho_{2}(x)\sinh\omega(L-x)+\rho_{2}(L-x)\sinh\omega x}{\sinh\omega L},$

where

\rho_{1}(x)=\bigl{(}\omega x(L-x)-2\omega^{-1}\bigr{)}\cosh\omega x+(2x-L)\sinh\omega x+2\omega^{-1}

and

\rho_{2}(x)=\cosh\omega x-1.

To evaluate the contour integral (39) we apply an optimised equal-weight quadrature rule that arises after deforming $\Gamma$ into the left branch of an hyperbola [16]. Figure 5 shows the reference solution over the time interval $[0,2]$ in the case

\alpha=0.6,\qquad L=2,\qquad C_{0}=1,\qquad C_{f}=2.

(41)

In Figure 6, we plot the $L_{2}$ -norms of the jumps, $\|\llbracket U\rrbracket^{n-1}\|$ , together with the errors in $U(t)$ and its reconstruction $\widehat{U}(t)$ . The dG method used piecewise-quadratics ( $r=3$ ), first with a uniform mesh of $N=12$ subintervals (top), and then with a non-uniform mesh of $N=40$ subintervals (bottom). In both cases, the spatial discretisation used (continuous) piecewise cubics on a uniform grid with $20$ subintervals. Since $u_{0}$ is a quadratic polynomial in this instance, we simply put $U_{0}=u_{0}$ . Consistent with our conjecture (33), we observe that

\sup_{t_{n-1}<t<t_{n}}\|U(t)-u(t)\|\approx\bigl{\|}\llbracket U\rrbracket^{n-1}\bigr{\|}.

Motivated by our conjecture (38), the second mesh was graded for $0\leq t_{n}\leq 1$ by taking $q=(r+\alpha)/\alpha$ , $N=34$ and $T=1$ in the formula (37), followed by a uniform mesh on the other half $[1,2]$ of the time interval. We see that the mesh grading is effective at resolving the solution for $t$ near zero, albeit with a substantial increase in the overal computational cost.

Acknowledgements

This project was supported by a UNSW Faculty Research Grant (PS47152/IR001/MATH).

References

[1] Michael G. Duffy “Quadrature over a pyramid or cube of integrands with a singularity at a vertex” In SIAM J. Numer. Anal. 19, 1982, pp. 1260–1262 DOI: 10.1137/0719090
[2] Kenneth Eriksson, Claes Johnson and Vidar Thomée “Time discretization of parabolic problems by the discontinuous Galerkin method” In ESAIM: M2AN 19, 1985, pp. 611–643 DOI: 10.1051/m2an/1985190406111
[3] J. Klafter and I. M. Sokolov “First Steps in Random Walks” Oxford University Press, 2011
[4] Vladimir Ivanovich Krylov “Approximate Calculation of Integrals”, ACM Monographs New York: Macmillan, 1962
[5] Kim-Ngan Le, William McLean and Martin Stynes “Existence, uniqueness and regularity of the solution of the time-fractional Fokker–Planck equation with general forcing” In Commun. Pure Appl. Anal. 18, 2019, pp. 2765–2787 DOI: 10.3934/cpaa.2019124
[6] Charalambos Makridakis and Richardo H. Nochetto “A posteriori error analysis for higher order dissipative methods for evolution problems” In Numer. Math. 104, 2006, pp. 489–514 DOI: 10.1007/s00211-006-0013-6
[7] William McLean “FractionalTimeDG: Generate coefficient arrays needed for discontinuous Galerkin time-stepping of fractional diffusion problems” Github, https://github.com/billmclean/FractionalTimeDG.jl, 2020
[8] William McLean “Regularity of solutions to a time-fractional diffusion equation” In ANZIAM J. 52, 2010, pp. 123–138 DOI: 10.1017/S1446181111000617
[9] William McLean and Kassem Mustapha “Convergence analysis of a discontinuous Galerkin method for a sub-diffusion equation” In Numer. Algor. 52, 2009, pp. 69–88 DOI: 10.1007/s11075-008-9258-8
[10] William McLean, Kassem Mustapha, Raed Ali and Omar Knio “Well-posedness of time-fractional advection-diffusion-reaction equations” In Fract. Calc. Appl. Anal. 22, 2019, pp. 918–944 DOI: 10.1515/fca-2019-0050
[11] Ralf Metzler and Joseph Klafter “The random walk’s guide to anomalous diffusion: a fractional dynamics approach” In Physics Reports 339, 2000, pp. 1–77 DOI: 10.1016/S0370-1573(00)00070-3
[12] Kassem Mustapha “Time-stepping discontinuous Galerkin methods for fractional diffusion problems” In Numer. Math. 130, 2015, pp. 497–516 DOI: 10.1007/s00211-014-0669-2
[13] Lars Schmutz and Thomas P. Wihler “The variable-order discontinuous Galerkin time stepping scheme for parabolic evolution problems is uniformly $L^{\infty}$ -stable” In SIAM J. Numer. Anal. 57, 2019, pp. 293–319 DOI: 10.1137/17M1158835
[14] Dominik Schötzau and Christoph Schwab “Time discretization of parabolic problems by the hp-version of the discontinuous Galerkin finite element method” In SIAM J. Numer. Anal. 38, 2001, pp. 837–875 DOI: 10.1137/S0036142999352394
[15] Vidar Thomée “Galerkin Finite Element Methods for Parabolic Problems” Springer, 2006
[16] J. A. C. Weideman and L. N. Trefethen “Parabolic and hyperbolic contours for computing the Bromwich integral” In Math. Comp. 76, 2007, pp. 1341–1356 DOI: 10.1090/S0025-5718-07-01945-X

Implementation of high-order, discontinuous Galerkin time stepping for fractional diffusion problems

Abstract

1 Introduction

2 A fractional ODE

Remark 1.

Remark 2.

3 Evaluation of the coefficients

Lemma 3.

Proof:

Lemma 4.

Proof:

Lemma 5.

Proof:

Lemma 6.

Proof:

Remark 7.

4 Spatial discretisation

5 Legendre polynomials

Lemma 8.

Proof:

Example 9.

Lemma 10.

Proof:

Remark 11.

Example 12.

6 Reconstruction

Lemma 13.

Proof:

Lemma 14.

Proof:

Theorem 15.

Proof:

Corollary 16.

Proof:

7 Numerical experiments

7.1 The matrix 𝑯ℓ¯{\boldsymbol{H}^{\bar{\ell}}}

7.2 A fractional ODE

7.3 A fractional PDE

Acknowledgements

References

Author address

Implementation of high-order,
discontinuous Galerkin time stepping
for fractional diffusion problems

7.1 The matrix ${\boldsymbol{H}^{\bar{\ell}}}$