[datatype=bibtex] \map[overwrite=true] \step[fieldsource=fjournal] \step[fieldset=journal, origfieldval]
Positivity preservation of implicit discretizations of the advection equation
Abstract
We analyze, from the viewpoint of positivity preservation, certain discretizations of a fundamental partial differential equation, the one-dimensional advection equation with periodic boundary condition. The full discretization is obtained by coupling a finite difference spatial semi-discretization (the second- and some higher-order centered difference schemes, or the Fourier spectral collocation method) with an arbitrary -method in time (including the forward and backward Euler methods, and a second-order method by choosing suitably). The full discretization generates a two-parameter family of circulant matrices , where each matrix entry is a rational function in and . Here, denotes the CFL number, being proportional to the ratio between the temporal and spatial discretization step sizes. The entrywise non-negativity of the matrix —which is equivalent to the positivity preservation of the fully discrete scheme—is investigated via discrete Fourier analysis and also by solving some low-order parametric linear recursions. We find that positivity preservation of the fully discrete system is impossible if the number of spatial grid points is even. However, it turns out that positivity preservation of the fully discrete system is recovered for odd values of provided that and are chosen suitably. These results are interesting since the systems of ordinary differential equations obtained via the spatial semi-discretizations studied are not positivity preserving.
1 Background and motivation
In this work, we investigate the positivity of some discretizations of the advection equation with periodic boundary condition
(1) |
where is the unknown function, is a given differentiable initial function, and is a constant. The exact solution of the Cauchy problem (1), given by (where denotes the fractional part), is positivity preserving; i.e.
(2) |
Positivity is often important in this context, since may represent a concentration or density that cannot be negative.
Remark 1.
Herein the term positivity is always meant in the weak sense; i.e. it means non-negativity.
Finite difference spatial semi-discretization of (1) on a uniform grid with mesh spacing and yields a system of ordinary differential equations
(3) |
where , is a circulant matrix [2, Section 5.16], and is the number of grid points (the points and are identified due to the periodic boundary condition). If one uses an upwind spatial discretization
(4) |
then the exact solution of (3) is also positivity preserving. Moreover, a corresponding full discretization will be positivity preserving too, under an appropriate time step size restriction if, for example, the forward (explicit) Euler method or any strong stability preserving method [5] is used in time; see, e.g. [3].
Positivity-preserving methods for transport equations are typically based on low-order upwind-biased spatial discretizations like that above, or involve nonlinear limiters (or both). Here we instead consider the positivity of linear higher-order centered discretizations. A second-order scheme is obtained with the centered difference discretization
(5) |
However, this spatial semi-discretization is not positivity preserving, since the matrix has at least one negative off-diagonal entry [7, Chapter I, Theorem 7.2]. This implies that any consistent full discretization based on (5) must fail to preserve positivity under sufficiently small step sizes . Indeed, a full discretization based on the scheme (5) and forward Euler in time is not positivity preserving for any step size .
On the other hand, interestingly, using (5) with backward (implicit) Euler time integration, one observes positivity preservation under large enough time step sizes provided that the parity of the number of spatial grid points is odd. To investigate the differences between the behavior of the forward and backward Euler methods, we will study the -method [6, Chapter IV.3] as time discretization applied to (3)
(6) |
For , the -family includes both Euler methods as limiting cases: the forward Euler method for , the backward Euler method for , and the only second-order -method for ; any -method with is implicit.
Remark 2.
As is customary in the context of space-time discretizations of partial differential equations, superscripts of in (6) (and later in this work) are not exponents but denote time discretization steps.
Remark 3.
Similarly to (2), the semi-discrete system (3) and the fully discrete system (6) are said to be positivity preserving, if for any componentwise non-negative vector of initial condition
-
•
, the solution of (3) stays componentwise non-negative
-
•
, the solution of (6) stays componentwise non-negative ,
respectively.
The motivation for this work is not to develop new positivity preserving methods, but to study the positivity of some of fundamental discretizations such as the second-order centered difference method (5) and the -method (6). As we will see, the combination of these methods does not preserve positivity in general, nor in the limit of small time step size, so it is not typically recommended in practice. Nevertheless, this study may both shed light on the behavior of more complicated methods used in practice and provide tools that can be used to study the positivity of those methods. In the later sections of the paper, we combine higher-order methods in space with the -method in time, as a next step in this direction.
1.1 Structure of the paper and notation
In Section 2, we first characterize
positivity preservation of full discretizations of (1) resulting from
finite difference spatial and one-step time discretizations. Then, in Section 2.1, we study positivity of the second-order centered differences in space
combined with the -method in
time, using discrete Fourier
analysis. We also point out some connections with structured non-negative inverse eigenvalue problems. In Section 3, we study this particular full discretization in more detail.
In Section 3.1, by setting up and solving certain parametric linear recursions, we derive explicit, non-trigonometric formulae for the entries of the full discretization matrix . Then, in Section 3.2, we use these formulae to provide precise results on the non-negativity of in terms of roots of some sparse polynomials.
In Section 4, we discuss some observations and results regarding
higher-order spatial discretizations, including high-order centered differences
(in Section 4.1) and spectral collocation methods (in Section 4.2).
We summarize our findings in Section 5.
Throughout the paper, the set of positive integers is denoted by , the complex imaginary unit is , the identity matrix is , and to emphasize the dimensions of a matrix, we will sometimes write, for example, . The symbol means that for every entry of the matrix . The positive integer is the number of spatial grid points within the interval , and the matrices and are the matrices corresponding to the spatial and the full discretizations, respectively. The three key parameters in our investigations will be , and (see (6) and (12)).
The computations in this work have been carried out by using Wolfram Mathematica version 11.
2 Discrete Fourier analysis
From here on, we consider the problem (1) on the domain with periodic boundary condition . Finite difference discretization in space with step size leads to (3) with for . The circulant matrix has the eigendecomposition
(7) |
where the (unitary) matrix of normalized eigenvectors has entries
(8) |
and is the diagonal matrix of eigenvalues , which depends on the particular finite difference method chosen. Here are evenly spaced angles
(9) |
such that are the roots of unity. Applying a one-step time discretization with step size and stability function to (3) leads to the iteration
(10) |
where
(11) |
and
(12) |
is the CFL number. Then it is easily seen that
For one-step methods, is a rational function, and products and inverses of circulant matrices are also circulant [2, Fact 5.16.7], so is also a real, circulant matrix. Thus it is defined completely by the entries of its first row, which are given by
(13) |
2.1 Second-order centered discretization in space, -method in time
In what follows we assume . Consider the case of a 3-point centered difference approximation in space (having order 2):
so that is a circulant matrix with entries on the central three diagonals:
(14) |
that is,
(15a) | ||||
(15b) |
It is known that the eigenvalues of are
(16) |
Now we consider the -method [6, Chapter IV.3] in time, whose stability function is
(17) |
so with the second-order centered difference in space we get from (13) for that the entries of the full discretization matrix are
(18) |
Note that the angles are symmetric about the -axis if is odd, and also if is even and is odd. If both and are even, then the angles are symmetric about the origin. Therefore, we have that
for all and for any value of . Moreover the factors and in (18) keep this symmetry. Thus, the imaginary part of (18) vanishes, yielding for all , as expected. So for we get
We will also make use of the identities
(19) |
This leads to the following expressions for the first, second and last entries of the first row of :
These entries will have a special role in the forthcoming analysis. We distinguish two cases.
- Case 1: is even.
-
By using similar symmetry arguments as before, we conclude that for even the entries of matrix are given by
Considering the above expression for we have
Thus the discretization using second-order centered differences in space and the -method in time cannot preserve positivity when is even, regardless of the values of and . We can arrive at the same conclusion by observing that for any we have , so that one of these (non-zero) entries must always be negative.
- Case 2: is odd.
-
Let us now consider the case of odd . Then for . Writing
and taking with and fixed, we find that
We see that
while
Thus for fixed and , the matrix is non-negative if is large enough.
We now show that also holds for with fixed and for large enough. Clearly, we only need to verify the non-negativity of entry for large enough. In fact, for and for any we have . To see this, consider a summand with in :
Its partial derivative with respect to is
and , hence the function
is positive at , strictly decreases, and its limit when is , completing the proof of the claim.
Summarizing the above, we have proved the following for any .
Theorem 1.
Consider the advection equation (1) with periodic boundary condition discretized using
-order centered differences in space and the -method in time with spatial grid
points and .
The full discretization takes the form (10),
where
(i) if is even, then has at least one negative entry;
(ii) if is odd and , then for large enough all
entries of are non-negative.
A refinement of Theorem 1 for odd values of will be given at the end of Section 3; see Theorem 2. As for the interval appearing in Theorem 1, see also Figures 3–4.
Remark 4.
In the formulae leading to Theorem 1 we used a trigonometric representation of the matrix entries . Here we highlight a related approach to studying the non-negativity of by relying only on the eigenvalues () of . According to (11), (16) and (17), we have
The main question in the context of non-negative inverse eigenvalue problems is to find (necessary or sufficient) conditions for a set to be the spectrum of some non-negative matrix. One such condition is the following. It is known [1, Chapter 4] that if is the spectrum of an non-negative matrix, then
(20) |
For example, for and , (20) with and yields the lower bounds
(21) |
where the approximate values of are given below:
As we see, the necessary condition (20)—valid for any non-negative matrix—already implies that there are positive lower bounds on , although these bounds are not optimal.
It is possible to sharpen the lower bounds in (21) by making use of some more specific results. We know in addition that the matrix is circulant, which leads us to the realm of structured non-negative inverse eigenvalue problems. For example, the spectra of non-negative circulant matrices have been characterized (with a necessary and sufficient condition) in [9, Theorem 10]. From this theorem we get (still for and ) the lower bound
As we will see, the precise lower bound for this matrix—according to our Theorem 2 with and —is
Remark 5.
It is not restrictive to assume in (1). If we assumed instead, then the results of Theorems 1 and 2 would remain valid (together with Figures 3–4, for example), with all the arguments in their proofs being essentially the same. For example, as we will see in Section 3, the non-negativity of (the first row of) matrix is governed by the elements and for and odd—this would change to elements and for and odd.
3 Second-order centered discretization in space and -method in time—algebraic characterization of the entries of the full discretization matrix
The results of Section 2 are based on the eigendecomposition of the full discretization matrix . In this section, instead of using trigonometric functions, we give an algebraic description of the matrix entires by exploiting the relation in (11) with defined in (14). Explicitly, this means
(22) |
but the dependence of on its parameters will often be suppressed.
It is trivial that for we have , hence cannot hold for any . The case has been discussed in Section 2.1. Thus, throughout the rest of this section, we can assume that
(23) |
3.1 Explicit description of the matrix entries for odd values of
To illustrate the structure of , we present its first row (as a vector, and with the common denominator of the entries in front of it) for the smallest values of .
Example 1.
Each element of is a rational function in the variables and . From (22) it is clear that
(24) |
where and are certain bivariate polynomials in and , and
(25) |
Remark 6.
The subscripts of thus refer to the position of the polynomial within the first row of , and the size of , respectively.
The key to describing algebraically is the observation that the polynomials and satisfy certain low-order linear recursions with constant coefficients. As already indicated by Section 2.1, the leftmost entry () behaves differently than the rest ().
Remark 7.
Mathematica’s FindLinearRecurrence command proved to be an efficient tool for discovering these linear recursions.
First, let us introduce some new variables. On the one hand, as suggested by Example 1, it seems convenient to set
Then, due to the sign assumptions, . On the other hand, as we will soon see, the polynomial
will appear as a (factor of a) characteristic polynomial, and its roots are
(26) |
This motivates us to introduce yet another variable, which will further simplify our exposition. We set
(27) |
It is seen that the transformation
is a bijection. Moreover, the following (inverse) relations
and
(28) |
are easily verified. We can now start describing the entries of the first row of .
Remark 8.
Although the expressions and will become in general rational functions in the variable , we still call them polynomials (referring to their structure in the original variables and ).
The polynomials . By carrying out some determinant expansions, we see that the determinants (25) obey the second-order parametric recursion
(29) |
with initial conditions
(cf. Example 1). After solving this recursion, we obtain
which, in terms of the variable , becomes
(30) |
The polynomials . They satisfy the recursion
that is, with coefficients being the same as in (29), but with initial conditions
(cf. Example 1). By solving this recursion, we derive that
(31) |
where the numerator is
(32) |
Remark 9.
Here, the subscript stands for leftmost. This polynomial will play a special role in the next section.
The polynomials . They satisfy a third-order recursion in the variable ,
(33) |
with initial conditions
The characteristic polynomial of recursion (33) is
hence the characteristic roots are as in (26), and . Based on this, one easily obtains the explicit solution as
(34) |
The polynomials . They satisfy the same third-order recursion in the variable as (33),
but with initial conditions
The explicit solution of this recursion is
(35) |
Remark 10.
We note that, for any fixed , the polynomials satisfy the same third-order recursion (33) in the variable , with triplets of initial conditions depending on . However, we cannot use this approach to proceed, since setting up the initial conditions would require, among others, the knowledge of the polynomials (for ), (for ), (for ), and so on.
The polynomials (, ). They satisfy the following second-order recursion in the variable when is fixed (hence having only finitely many terms for a particular ):
For the initial conditions of this final recursion, we use the general forms of and in (34) and (35) to get for any and that
(36) |
where the polynomials are defined as
(37) |
As a special case, we set
in other words we have
(38) |
where the subscript stands for rightmost.
Remark 11.
As a by-product, we have obtained the following set of identities by comparing the trigonometric and algebraic representations presented so far. They are also interesting from a structural point of view: although the number of terms in the trigonometric sums increases as gets larger, the polynomials in are sparse polynomials (also known as lacunary polynomials or fewnomials)—the number of terms does not increase as the polynomial degree increases.
Corollary 1.
3.2 Non-negativity of the matrix entries for odd values of
In this section we present a detailed description of the non-negativity properties of the matrix , thanks to the explicit forms for the entries obtained in Section 3.1. Throughout this section we still assume (23).
By taking into account (24), (30), (31), (32), (36), (37), and the fact that now (see (27)), the following corollary is evident.
Corollary 2.
For a given pair
and for any

The following lemma proves some of the observations about the polynomials suggested by Figure 1 for even and odd indices .
Lemma 1.
Let us fix arbitrarily. Then
for any , ;
for any , .
Proof.
For the even indices, we have
while for the odd indices,
∎
By combining Corollary 2 and Lemma 1, we have obtained the following result, expressing the fact that the non-negativity of is determined only by the polynomials appearing in the numerators of its top left and top right entries.
Corollary 3.
For a given pair
and
The non-negativity of has therefore been reduced to studying the simultaneous non-negativity of two parametric polynomials, and , over the -interval . The content of Lemmas 2 and 3 is illustrated by Figure 2.
Lemma 2 (about the sign of ).
Let us fix arbitrarily, and recall that by definition . Then there is a unique such that .
Let
(39) |
Then for , and for .
Moreover, , , and
(40) |
Proof.
For fixed , the continuous function is strictly increasing, and , hence there is a unique root. This root is strictly increasing in , because the function is strictly decreasing for fixed . Finally notice that is equivalent to , and for any one has
From this we easily get (40) and also the limit of (). ∎
Remark 12.
The asymptotic series (as ) of both bounds in (40) has the form
Lemma 3 (about the sign of ).
Let us fix arbitrarily, and recall that by definition
.
(i) Suppose that . Then, for any , .
(ii) Suppose now that . Then there is a unique such that .
Let
(41) |
Then for , and for .
Moreover, on the one hand, for fixed , the function is strictly decreasing, and .
On the other hand, for fixed , the function is strictly increasing, and we have the one-sided limits and .
Proof.
We notice that the expression is linear in , so by setting
we easily get for any that
(42) |
where the symbol denotes either , or , or on both sides of the equivalence. It is seen that for fixed we have the one-sided limits
(43) |
Now we show that the function
(44) |
The partial derivative
is positive, if . But
so the positivity of will follow if we show that is strictly decreasing. Indeed,
where
hence it is enough to verify . And this is true, since , and
Now, as (44) has been checked, it is obvious that continuity, (42), (43) and (44) imply statement (i) of the lemma, and, at the same time, regarding statement (ii) of the lemma, the existence of a unique root , the positivity of on , and the negativity of on .
We finally discuss the monotonicity and limit properties of the root . For fixed , the function is strictly increasing, since
This implies that, for any fixed , the function is strictly decreasing. Moreover, for fixed , we see from the definition that , so, due to (42) with “equality”, one has for fixed that solves ; in other words, . To show the validity of the last sentence of the lemma, we fix , and simply take into account again (42) with “equality”, (43) and (44). ∎

In order to return to the original variables from the variable —based on (28), (39) and (41)—we define
(45) |
and similarly,
(46) |
The value is introduced here for convenience so as to make our descriptions shorter.
A reformulation of Corollary 3 in terms of the variables is given below.
Corollary 4.
For any and we have
and
In particular,
Proof.
Some growth rates, monotonicity and limit properties of and —defined in (45)–(46)—are collected below; see also Figures 3 and 4.
Corollary 5.
(i) For any and , we have , and
(48) |
The asymptotic series for these lower and upper bounds have the form
being approximately
In particular, .
(ii) For fixed , (and for ). Finally, for fixed , we have the limit
(49) |
and, for fixed , the one-sided limits
(50) |
Proof.
(i) The monotonicity of in for fixed follows from the monotonicity of
in Lemma 2 together with (47), and inequality (48) is
just (40) under the transformation (47).
(ii) We similarly obtain the monotonicity of in for fixed , and the limit (49) from Lemma 3 via (47), by also noting that
As for the limit in (50), we know from Lemma 3 that (from below), and , hence when .


The following result explains why the “left half” of Figure 3 is “empty” (cf. Corollary 4)—the result is non-trivial, since for fixed , (cf. Figure 4).
Lemma 4.
For any there is a unique such that
This also satisfies
Moreover, the sequence is strictly increasing in , and . In particular, for any and we have
Proof.
Let us fix . Due to (45)–(46), is finite but is infinite for any , so cannot hold. For , by also using (47), we have
Here is independent of , and is fixed, so by definitions (39) and (41) this means that must be chosen in a way such that . By using the notation introduced in the proof of Lemma 3, this is equivalent to . Hence, if holds for some , then . Now, by Lemma 2, we have , and is strictly increasing in its first argument (see (44)), so the sequence is also strictly increasing.
The same monotonicity argument shows that holds for some if and only if (see (42) and the characterization of in Lemma 3).
For , one explicitly computes that
(51) |
so . Therefore, we have for , also implying that for any and , we have . ∎
The following theorem summarizes the results of Section 3. In the theorem, we assume , and .
Theorem 2 (About the full discretization matrix corresponding to the -order centered discretization in space and -method in time).
-
Fix arbitrarily. Then can never hold, i.e. for any and there is at least one strictly negative entry of the matrix .
-
Let . Then
-
Fix arbitrarily. Then there are finitely many values of for which there exists with . For any such value of , the set of admissible values of has the form , with suitable constants (the possible case means that there is no upper but only a lower bound on ); see also Corollary 5.
-
Let . Then for each there is a constant such that
In addition, for any , , and the two-sided estimates in (48) with hold.
Proof.
The case has already been discussed at the beginning of Section 3. In general, for , we know from Corollary 4 that, for any , the set of values for which holds has the form .
The range is covered by Lemma 4.
For , (51) shows that for one has . But for any we know (see Corollary 5) that
hence cannot hold for any .
For fixed , becomes finite for all sufficiently large (see (46)). But according to Corollary 5, is decreasing in for , and , so the inequality can hold only for finitely many values of .
Finally, for , and we can use Corollary 5 (i) with . ∎
Remark 13.
The “lower left corner point” of each shaded region in Figure 3 corresponds to a pair for which . This means that here the leftmost and the rightmost entries of the first row of simultaneously vanish (and the other entries are non-negative). For , this happens for and ; the corresponding matrix is
(52) |
4 Other spatial discretizations
The spatial semi-discretization considered in Sections 2.1–3 is not positivity preserving. The same will hold true for each spatial semi-discretization to be investigated in Sections 4.1–4.2 below: it is well-known [7, Chapter I, Theorem 7.2] that a linear constant-coefficient system of ordinary differential equations (3) is positivity preserving if and only if the matrix has no negative off-diagonal entries. The violation of this last condition is clear for the matrix in (14), for all matrices in Section 4.1, and also for the ones in Section 4.2 (due to ).
However, as we will see, it is again possible to obtain a positivity-preserving full discretization scheme when the spatial discretizations covered in this section (higher-order centered spatial discretizations and Fourier spectral collocation methods) are suitably combined with the -method.
Remark 14.
For (that is, when the explicit Euler time discretization is applied), the matrix in (11) becomes , hence cannot hold for any due to the negative off-diagonal entries of . Therefore, in what follows, we can assume .
4.1 Higher-order centered discretizations in space, -method in time
The coefficients of the centered differences can be found, e.g., in [4], from which the corresponding circulant matrices describing the spatial discretization can be constructed. Here, we examine the first few cases.
- •
-
•
When the stencil width is 5 (implying and -order accuracy), the entries on the central diagonals are .
-
•
When the stencil width is 7 (implying and -order accuracy), the entries on the central diagonals are .
In all above central diagonals, the middle corresponds to the main diagonal. We can obtain an eigendecomposition (7) for the matrix , with eigenvectors given by (8) and eigenvalues , where
(53) |
As before, is defined in (9), and is a vector consisting of the last coefficients of the central diagonals of , with denoting the stencil width. For instance, the vector is equal to , , and for stencil widths , , and , respectively.
After the matrix has been chosen, we couple this spatial discretization with the -method as time discretization, and the full discretization matrix is obtained (see (11) and (17)). As seen in Section 2.1, the matrix is a real, circulant matrix so it can be characterized by the entries of its first row, which take the form
(54) |
Our computations suggest that the non-negativity properties of the matrix family again depend on the parity of .
- Case 1: is even.
-
By using symbolic calculations, we have found that, for the -order scheme, cannot hold for any and when . Similarly, for the -order scheme, we checked (again symbolically) that does not hold for any and when . Therefore, positivity preservation is impossible in these cases.
The following proposition extends the above observations for the -order scheme when is a general even number—although only for sufficiently large values of .
Proposition 1.
Consider the iterative formula (10) applied to the advection equation (1) with periodic boundary condition. Let the matrix result from the -order centered discretization in space with spatial grid points and the -method in time. Also, let be the CFL number defined in (12). If is even, then there exists such that the matrix has at least one negative entry for any .
Proof.
We show that for any and , where is a constant depending on and .
Let in (54), then by using (19) and we have
(55) where in the last equality we used , , and the symmetry of angles () about the -axis when is even (explicitly, the identities and for positive integers , and ). Define the function
Then, we can express (4.1) by summing only over indices for which (and separating the case when is divisible by 4), yielding
(56) We will also use the identity
First, observe that and are positive for each index in (56). Now for the -order centered spatial discretization, easy calculations show that ,
so they are all positive as well. Let us fix arbitrarily, and notice that for each we can find such that for . Let , then for . Therefore, for any and .
∎
Remark 15.
It is easily seen that the previous proof boils down to the fact that for the -order centered spatial discretization we have for , where
We know from (53) that for the -order scheme
so we again have for . Therefore, the analogue of Proposition 1 is true for the -order scheme as well (c.f. the proof of Proposition 2).
Remark 16.
Based on the observations above Proposition 1, we conjecture the following: given an arbitrary finite difference centered discretization in space coupled with the -method, then for even , and for all values and —that is, can be chosen in general.
- Case 2: is odd.


4.2 Fourier spectral collocation in space, -method in time
Here, we consider the spectral method that results from extending the finite difference stencil to include the whole spatial grid. In this section, we assume . The resulting spatial semi-discretization can again be written in the form (3) where the matrix takes the form [10, 8]
for even , and
for odd . As in Section 2, the spectral collocation matrices have an eigendecomposition given by (7), with eigenvectors (8), but now the entries of the diagonal matrix are
(57) |
where has been defined in (9).
When this matrix is coupled with the -method as time discretization, the full discretization matrix in (11) is obtained. The entries of the first row of the circulant matrix are again given by (13), and now they take the form
(58) |
Similarly to the Sections 2 and 4.1, we distinguish between even and odd sizes of the discretization matrices .
- Case 1: is even.
-
From (58) we have (also using (19)) that
(59) The following proposition proves that (59) is negative for sufficiently large CFL number .
Proposition 2.
Consider the iterative formula (10) applied to the advection equation (1) with periodic boundary condition. Let the matrix result from a given even spactral collocation method in space (with spatial grid points) and the -method in time. Let also be the CFL number defined in (12). If is even, then there exists such that the matrix has at least one negative entry for any .
Proof.
The proof is similar to that of Proposition 1: we show that for all and , where depends on and .
As previously, we conjecture that for all values of and . We have been able to verify this for , as follows. The expression is a rational function in and , whose denominator is positive. By introducing a new variable and dividing by , we can write the numerator as a univariate polynomial . For example,
We have confirmed with symbolic calculations that for all in the cases .
Remark 17.
When generating the matrix for the symbolic calculations for larger values of for the actual full discretization, it is of course computationally more efficient to use the formula in (11) instead of (because in the latter form one would need to evaluate the inverse of a non-sparse matrix).
- Case 2: is odd.

5 Conclusions
In this work, we have studied the positivity preservation of certain fundamental discretizations of the advection equation with periodic boundary condition (1). Our detailed investigations in Sections 2–3 were devoted to the full discretization obtained by coupling the second-order centered differences in space with the -method in time. Rather than using SSP theory [5], we have employed a direct approach, first based on discrete Fourier analysis and then on a polynomial representation of the entries of the full discretization matrix. The characterization of the matrix entries, along with the related trigonometric identities presented in Corollary 1, may be of independent interest. In Section 4, we considered higher-order centered differences or Fourier spectral collocation in space, and again the -method in time.
For all full discretizations constructed this way, we have found similar behavior. If the number of spatial grid points is even, no method is positivity preserving, while if is odd, some methods may be positivity preserving. Positivity is generally enhanced by taking larger values of the CFL number , larger values of the time-discretization parameter , or smaller values of the spatial grid points . These tendencies, and more specific results, are described in Theorem 2, and can be seen in Figures 3, 5, 6, and 7. Our positive results about the full discretizations are perhaps unexpected, since neither of the underlying spatial semi-discretizations preserves positivity.
Although some of the spatial discretizations considered above have high order, the -method as time discretization typically has order only 1 (order 2 occurs only for ). Therefore, we emphasize that our goal in this work is not to provide efficient discretizations but rather to understand the behavior of these simple building blocks, as a means of gaining insight and understanding the positivity of more complicated discretizations that may not be amenable to a thorough analysis.
There are several possible future directions for research building on this work. Other finite difference spatial discretizations could be studied using similar techniques, and higher-order one-step time discretizations could easily be incorporated via (11). Similarly, finite difference discretizations of other linear partial differential equations could be analyzed with the same techniques. Further areas for extension might include other boundary conditions or multidimensional problems.
References
- [1] Abraham Berman and Robert J. Plemmons “Nonnegative matrices in the mathematical sciences” Revised reprint of the 1979 original 9, Classics in Applied Mathematics Society for IndustrialApplied Mathematics (SIAM), Philadelphia, PA, 1994, pp. xx+340 DOI: 10.1137/1.9781611971262
- [2] Dennis S. Bernstein “Matrix mathematics” Theory, facts, and formulas Princeton University Press, Princeton, NJ, 2009, pp. xlii+1139 DOI: 10.1515/9781400833344
- [3] Imre Fekete, David I. Ketcheson and Lajos Lóczi “Positivity for convective semi-discretizations” In Journal of Scientific Computing 74.1, 2018, pp. 244–266 DOI: 10.1007/s10915-017-0432-9
- [4] Bengt Fornberg “Generation of finite difference formulas on arbitrarily spaced grids” In Mathematics of Computation 51.184, 1988, pp. 699–706 DOI: 10.2307/2008770
- [5] Sigal Gottlieb, David Ketcheson and Chi-Wang Shu “Strong stability preserving Runge-Kutta and multistep time discretizations” World Scientific Publishing Co. Pte. Ltd., Hackensack, NJ, 2011, pp. xii+176 DOI: 10.1142/7498
- [6] E. Hairer and G. Wanner “Solving ordinary differential equations. II” Stiff and differential-algebraic problems, Second revised edition 14, Springer Series in Computational Mathematics Springer-Verlag, Berlin, 2010, pp. xvi+614 DOI: 10.1007/978-3-642-05221-7
- [7] Willem Hundsdorfer and Jan Verwer “Numerical solution of time-dependent advection-diffusion-reaction equations” 33, Springer Series in Computational Mathematics Springer-Verlag, Berlin, 2003, pp. x+471 DOI: 10.1007/978-3-662-09017-6
- [8] Roger Peyret “Spectral methods for incompressible viscous flow” 148, Applied Mathematical Sciences Springer-Verlag, New York, 2002, pp. xii+432 DOI: 10.1007/978-1-4757-6557-1
- [9] Oscar Rojo and Ricardo L. Soto “Existence and construction of nonnegative matrices with complex spectrum” In Linear Algebra and its Applications 368, 2003, pp. 53–69 DOI: 10.1016/S0024-3795(02)00650-X
- [10] Lloyd N. Trefethen “Spectral methods in MATLAB” 10, Software, Environments, and Tools Society for IndustrialApplied Mathematics (SIAM), Philadelphia, PA, 2000, pp. xviii+165 DOI: 10.1137/1.9780898719598