Inverse properties of a class of pentadiagonal matrices related to higher order difference operators
Bakytzhan Kurmanbek
Nazarbayev University, Department of Mathematics, 53 Kabanbay Batyr Ave, Nur-Sultan 010000, Kazakhstan
Yogi Erlangga
Zayed University, Department of Mathematics, Abu Dhabi Campus, P.O. Box 144534, United Arab Emirates
Yerlan Amanbek
Nazarbayev University, Department of Mathematics, 53 Kabanbay Batyr Ave, Nur-Sultan 010000, Kazakhstan
Corresponding author
Abstract
This paper analyzes the convergence of fixed-point iterations of the form and the properties of the inverse of the related pentadiagonal matrices, associated with the fourth-order nonlinear beam equation. This nonlinear problem is discretized using the finite difference method with the clamped-free and clamped-clamped boundary conditions in the one dimension. Explicit formulas for the inverse of the matrices and norms of the inverse are derived. In iterative process, the direct computation of inverse matrix allows to achieve an efficiency. Numerical results were provided.
Many applications give arise to mathematical problems that involve numerical computations with pentadiagonal matrices, which require their inversion (see [1] and references therein). Even though inversion of a nonsingular pentadiagonal matrix can be done efficiently by a numerical linear algebra software, explicit inverse formulas are useful, for example, in a computer algebra software.
Early results on inverses of banded matrices can be traced as far back as to the work of [2, 3, 4] for general band matrices. Results for band Toeplitz matrices are given in [5], with explicit inverse formulas for tridiagonal matrices in [6, 7] and pentadiagonal matrices in [8, 9, 10, 1, 11, 12, 13]. In addition, properties including determinants of such matrices related to finite difference operators have been investigated, e.g. in [14, 15, 16].
In this study, we focus on the specific pentadiagonal matrices arising in a fixed-point iteration for numerically solving the fourth-order nonlinear beam equation:
This nonlinear equation finds applications in mechanical and civil engineering, which models, e.g., a cantilever beam subjected to swelling pressure on one side. In the above equation, the right-hand side term is the swelling pressure, which in this form is proposed by Grob [17], based on empirical studies (see, e.g., [18] and the references therein), is the length of the beam, and represents the mechanical property of the beam, which are assumed to be constant.
Scaling the domain to unity using the dimensionless variable and setting yields
(1)
where . We shall use this formulation throughout. For (1) two types of boundary conditions are employed:
1.
Clamped-Free (CF) condition:
(2)
2.
Clamped-Clamped (CC) condition:
(3)
Since , obviously, can not be a solution, even though it satisfies the boundary conditions.
The solution of (1) with the boundary conditions (2) is concave up and an increasing function, which can be deduced from a mixed formulation of (2):
(4)
From the first part of (4), with in , , and increases in . The condition requires that in , which furthermore, together with the condition , implies that and decreases. From the second part of (4), we have ; thus, is an increasing function in . Since , , which implies and increases. This characterization also holds in the finite-difference setting based on the second-order scheme we use in this paper (c.f. Section 4).
Numerical methods based on finite element methods for (1) have been proposed and studied, e.g., in [19, 20], where focus is given on the accurate approximation of the solution. This paper approaches the problem from a different angle, with emphasis put on the convergence of the iteration method of the form
where , and the properties of the related iteration matrices involved. Using the second-order finite difference approach, these matrices are pentadiagonal and near Toeplitz.
In this paper, we present explicit formulas for inverses of the specific pentadiagonal matrices and their bounds of norms, which are necessary in the convergence analysis of the fixed-point iteration. As the inverse can be formed explicitly, we are able to construct an exact norm of some of those matrices. The convergence rate for the clamped-free and clamped-clamped problems were derived and then numerical examples were presented for different parameters.
The paper is organized as follows. Section 2 is devoted to the convergence and the inverse of the iteration matrix for problem with the clamp-free condition. Similar discussion for the clamp-clamp condition is given in Section 3. Numerical results are presented in Section 4, followed by some concluding remarks in Section 5.
2 The case with clamped-free boundary conditions
We consider equidistant grid points on the closed interval , with the distance (grid size) , at which the solution of (1) is approximated by a finite difference scheme. Each grid point is indexed by , where and correspond to the boundary points. Throughout the paper, we shall consider for to be a meaningful approximation to the differential operator , even though may not be of practical interest.
For the interior nodes, , the fourth-order derivative is approximated by the second-order finite difference scheme:
where and . For , we just impose the boundary condition . For , corresponds to a fictitious point outside the computational domain, which is eliminated using the central scheme approximation to the boundary condition . Similar approaches are used for and , with the boundary conditions be approximated by appropriate second-order finite difference schemes.
The resultant system of nonlinear equations is
(5)
where , with , and
(13)
Here, is a nonsymmetric, nondiagonally dominant pentadiagonal matrix.
Our first result on is that it is nonsingular. In fact, we have the following theorem of the explicit inverse of matrix
The proof is done by the direct computation. Let be matrix such that . We want to show that the product
In other words, is the identity matrix .
(i)
The case and .
In this case, , , , , , while the others are . Therefore,
(14)
If , then , , , , and , yielding .
For , we consider several cases.
(a)
; Then , , , , , yielding .
(b)
; Then , , , , , yielding ;
(c)
; Then , , , , , yielding .
(d)
; Then , , , , , yielding .
(ii)
The case .
For , , , and ; Thus, .
For , we have , , and ; Thus, .
(iii)
The case , with .
For , we have , , , ; Thus, .
For , then , , , and . We have .
(iv)
For the case , similar computations using (14) complete the proof.
∎
From now on, we shall use to denote the -entry of , the inverse of ; thus, .
The following corollary is a consequence of Theorem 2.1.
Corollary 2.2.
The inverse of is a positive matrix; i.e., , implying .
Proof.
By Theorem 2.1 it follows that is positive. Notice that, for , . Consequently, entries determined by the other 2 parts of Theorem 2.1 are also positive.
∎
The above positivity result is important in the context of the fixed-point iteration we devise to solve the nonlinear system (5). Consider the iteration
(15)
Since (Corollary 2.2), the recipe (15) generates a sequence of positive vectors , if started with . As the solution of this type of boundary-value problem is a nonnegative function (c.f., Section 1; see also later for the finite-difference equation case), if the above iteration converges, it converges to a positive solution.
Let . Our starting point for the convergence analysis is the relation, with ,
where , such that the vector . Since is a sequence of positive vectors, is also a positive vector, and consequently the diagonal entries of are strictly less than 1. Thus, , and
(16)
We define to be
(17)
Convergence guarantee of the fixed point iteration (15) requires , which in turn, for given and chosen , requires that
(18)
Lemma 2.3.
For the inverse of in Theorem 2.1, the following holds true:
Proof.
From Theorem 2.1 it follows that and , thus, one can notice that , for .
∎
For the inequality , with given by (13), the following inequalities hold, with and the rows of :
Proof.
We have proved the inequalities for , which comes from the -th row of . Now suppose that they hold also for . Associated with is the inequality from the -th row of , which gives
by assumption. Next, note that
. Thus, , by assumption.
∎
Theorem 2.6.
The solution of the finite difference system (5) is a nonnegative vector , with increasing .
Proof.
On the nodes , approximation to the differential term leads to
which is of the same structure as the rows of . By Lemma 2.5,
Therefore,
With (from the boundary condition and (from using central finite differencing on ), from the most right inequality, we get . Also, ; thus . In general, we have , .
∎
3 The case with clamped-clamped boundary conditions
In this section, we consider the case with the boundary conditions (3). Conditions at are treated in the same way as at , leading to (5), but now with and given by
(29)
However, to simplify our notation, we shall consider the case where and in the subsequent analysis; in this case, .
In constrast to (13), the matrix (29) is centrosymmetric and near Toeplitz. Furthermore, it admits the rank-2 decomposition as follows:
(30)
where is an tridiagonal symmetric Toeplitz matrix, and
(35)
is a symmetric M-matrix, with positive inverse given explicitly by (see, e.g., [10])
(36)
is symmetric positive definite because (and ) is symmetric positive (semi) definite. The inverse of can be computed by applying the Sherman-Morrison formula on (30):
(37)
Because is symmetric positive definite, the middle term on the right-hand side is also symmetric positive definite. Rewriting (30) as
One can verify that change signs. Thus is not an M-matrix.
For ,
for .
Theorem 3.1.
The inverse of given by (29) is a positive matrix. Furthermore, let , , and . The entries of are
•
, for
•
, otherwise.
Proof.
Let , with . Denote by , the product of and the -th column of . For ,
Using , for , we have, with ,
where
and
Using similar calculation for , we get
hence,
Consider the -th row of :
We have
where
Notice that , . With and ,
∎
By Theorem 3.1, starting from , the fixed-point iteration (15) is guaranteed to generate a sequence of positive vectors.
In the sequel, we present two ways of constructing an estimate for norms of the inverse of . The first approach is based on the factorization in (37). The result is presented in the next theorem.
Theorem 3.2.
For ,
Proof.
Note that , due to symmetry. Thus, we shall consider only . Using (36),
The maximum of the rowsum is then attained for . Thus,
(68)
with equality holding when is odd.
We now estimate the 1-norm of . Let , and consider . For a fixed , can be viewed as a linear function of . can then be viewed as the rectangular rules that approximate the area made by the function and the -axis. In this case, treating ,
where and .
Since the matrix is persymmetric, we just need to consider . Then,
Now,
Thus, for ,
since for .
Combining with , we get the desired result. Furthermore, using Hölder’s inequality, .
∎
The second approach uses the knowledge of the entries of in Theorem 3.1. Tedious calculation results in exact norms in some cases, and hence much stronger estimates than the previous estimates.
Theorem 3.3.
For ,
If is odd, then the equality holds for .
Proof.
We shall first consider the case . In this case, by using ,
For ,
because of the centrosymmetry of . Calculating each sum using the formula for the entries , we get
where and
Also,
where
Assuming that , the maximum of the rowsum is obtained from the condition . In this regard, we have
where
The only acceptable solution of the above equation is . The other solutions are rejected: and . One can verify that maximizes the rowsum.
Let be odd. With ,
If is even, then is not a row of the matrix ; the maximum of the rowsum will then be attained at or . Either case satisfies
Symmetry of leads to . Using Hölder’s inequality, the above inequality holds also for .
∎
Table 1 shows the computed norms of the inverse and compares them with the estimate given by Theorem 3.3. For odd and the norms are exact. For even , Theorem 3.3 gives an estimate that leads to a small gap. This gap relative to the estimate becomes negligible with an increase in . To support this statement, the reader is referred to Fig. 1 and Fig. 2 in log scales. The numerical tests are performed for all even from to . The relative error is computed as , where from Theorem 3.3. As shown in Fig. 2 (left), the relative error decreases as increases for or . On the other hand, according to the numerical observation the difference between and the upper bound become constant relative to the norm as increases, see Fig. 2 (right).
Table 1: Computed and the estimates, for the clamped-clamped case.
Figure 1: The upper bound and actual norm or (left) and (right) in log scale
(a) or (left)
(b) (right)
Figure 2: The relative errors in log scale
For , the factor . So, alternatively, if satisfies the condition in the above theorem, we can have a simpler bound: . This factor approaches from above as . Since the latter is slightly less than the former, for a fixed , one can expect a slight improvement of convergence by increasing .
4 Numerical Results
In this section, we present numerical results from solving (1) with (2) or (3) using the fixed point method(15). We compare the observed convergence with the theoretical bound given by (17) and Theorem 2.4 (for the clamped-free case) or Theorem 3.3 (for the clamped-clamped case).The fixed point method (15) is declared to have reached a convergence if , where . Solutions at convergence are shown in Figure 3 for the clamped-free and clamped-clamped case, with .
Figure 3: Solution at convergence with , : clamped-free (left), clamped-clamped (right)
For both cases, the actual convergence rates are lower than the estimate (Tables 2–5), with increasing gaps between the two as increases. As is exact, except for , (due to the explicit inverse of ), this suggests that the gap in the convergence rate is mainly due to the estimate . The numerical experiments suggest that the simple fixed-point method (15) can be used for a wider range of than suggested by the theoretical results. For instance, with and , we have . The method still however converges to the solution at the maximum rate of .
Table 2: Observed maximum convergence rate for clamped-free case, with . In brackets are the theoretical rate based on Theorem 2.4.
0.010
[0.016]
0.010
[0.016]
0.010
[0.017]
1
0.074
[0.125]
0.074
[0.125]
0.074
[0.125]
8
0.400
[1.000]
0.400
[1.000]
0.402
[1.000]
Table 3: Observed maximum convergence rate for clamped-free case, with . In brackets are the theoretical rate based on Theorem 2.4.
0.010
[0.016]
0.010
[0.016]
0.010
[0.017]
1
0.074
[0.125]
0.074
[0.125]
0.074
[0.125]
8
0.400
[1.000]
0.400
[1.000]
0.402
[1.000]
Table 4: Observed maximum convergence rate for the clamped-clamped case, with . In brackets are the theoretical rate based on Theorem 3.3.
0.0003
[0.0033]
0.0003
[0.00033]
0.0003
[0.00033]
1
0.0020
[0.0026]
0.0020
[0.0026]
0.0020
[0.0026]
8
0.0158
[0.0209]
0.0159
[0.0209]
0.0161
[0.0209]
32
0.0615
[0.0836]
0.0619
[0.0836]
0.0627
[0.0836]
128
0.2223
[0.3344]
0.2237
[0.3344]
0.2262
[0.3344]
Table 5: Observed maximum convergence rate for the clamped-clamped case, with . In brackets are the theoretical rate based on Theorem 3.3
0.0002
[0.00033]
0.0002
[0.00033]
0.0003
[0.00033]
1
0.0020
[0.0026]
0.0020
[0.0026]
0.0020
[0.0026]
8
0.0157
[0.0208]
0.0159
[0.0208]
0.0160
[0.0208]
32
0.0614
[0.0834]
0.0618
[0.0834]
0.0625
[0.0834]
128
0.2218
[0.3336]
0.2232
[0.3336]
0.2257
[0.3336]
5 Conclusion
The explicit inverse formula for pentadiagonal matrices arising in the fourth-order nonlinear beam boundary value problem were constructed. The explicit formula helped computing some norms of their inverse, used to estimate the convergence of a fixed-point iteration for solving the nonlinear system of equations. Further research on the convergence upper bounds is necessary to extend our knowledge of the convergence rate in the fixed point method.
Acknowledgment
BK and YA wishes to acknowledge the research grant, No AP08052762, from the Ministry of Education and Science of the Republic of Kazakhstan and the Nazarbayev University Faculty Development Competitive Research Grant (NUFDCRG), Grant No 110119FD4502.
References
[1]
Chaojie Wang, Hongyi Li, and Di Zhao.
An explicit formula for the inverse of a pentadiagonal toeplitz
matrix.
Journal of Computational and Applied Mathematics, 278:12–18,
2015.
[2]
EL Allgower.
Exact inverses of certain band matrices.
Numerische Mathematik, 21(4):279–284, 1973.
[3]
Wayne W Barrett and Philip J Feinsilver.
Inverses of banded matrices.
Linear Algebra and its Applications, 41:111–130, 1981.
[4]
Lars Rehnqvist.
Inversion of certain symmetric band matrices.
BIT Numerical Mathematics, 12(1):90–98, 1972.
[5]
Murray Dow.
Explicit inverses of toeplitz and associated matrices.
ANZIAM Journal, 44:185–215, 2002.
[6]
Wayne W Barrett.
A theorem on inverse of tridiagonal matrices.
Linear Algebra and Its Applications, 27:211–217, 1979.
[7]
Jiteng Jia, Tomohiro Sogabe, and Moawwad El-Mikkawy.
Inversion of k-tridiagonal matrices with toeplitz structure.
Computers & Mathematics with Applications, 65(1):116–125,
2013.
[8]
P Rózsa.
On the inverse of band matrices.
Integral equations and operator theory, 10(1):82–95, 1987.
[9]
Xi-Le Zhao and Ting-Zhu Huang.
On the inverse of a general pentadiagonal matrix.
Applied Mathematics and Computation, 202(2):639–646, 2008.
[10]
F Diele and L Lopez.
The use of the factorization of five-diagonal matrices by tridiagonal
toeplitz matrices.
Applied mathematics letters, 11(3):61–69, 1998.
[11]
Xiao-Guang Lv and Ting-Zhu Huang.
A note on inversion of toeplitz matrices.
Applied Mathematics Letters, 20(12):1189–1193, 2007.
[12]
Bakytzhan Kurmanbek, Yogi Erlangga, and Yerlan Amanbek.
Inverse properties of a class of seven-diagonal (near) toeplitz
matrices.
arXiv preprint arXiv:2103.09868, 2021.
[13]
Mohamed Elouafi.
Explicit inversion of band toeplitz matrices by discrete fourier
transform.
Linear and Multilinear Algebra, 66(9):1767–1782, 2018.
[14]
Yerlan Amanbek, Zhibin Du, Yogi Erlangga, Carlos M. da Fonseca, Bakytzhan
Kurmanbek, and António Pereira.
Explicit determinantal formula for a class of banded matrices.
Open Mathematics, 18(1):1227–1229, 2020.
[15]
Bakytzhan Kurmanbek, Yerlan Amanbek, and Yogi Erlangga.
A proof of andjelić-fonseca conjectures on the determinant of
some toeplitz matrices and their generalization.
Linear and Multilinear Algebra, pages 1–8, 2020.
[16]
Yaroslav Shitov.
The determinants of certain (0, 1) toeplitz matrices.
Linear Algebra and its Applications, 2021.
[17]
H Grob.
Schwelldruck im belchentunnel.
In Proc. Int. Symp. für Untertagebau, Luzern, pages
99–119, 1972.
[18]
PA Von Wolffersdorff and S Fritzsche.
Laboratory swell tests on overconsolidated clay and diagenetic
solidified clay rocks.
Proc. Geotechnical Measurements and Modelling, Karlsruhe, AA
Balkema Pub, pages 407–412, 2003.
[19]
Piotr Skrzypacz, Daulet Nurakhmetov, and Dongming Wei.
Generalized stiffness and effective mass coefficients for power-law
euler–bernoulli beams.
Acta Mechanica Sinica, pages 1–16, 2019.
[20]
Dongming Wei, Yu Liu, Dichuan Zhang, Match Wai Lun Ko, and Jong R Kim.
Numerical analysis for retaining walls subjected to swelling
pressure.
In Proceedings of 2016 International Conference on Architecture,
Structure and Civil Engineering (ICASCE’16), London (UK) Mar, pages 26–27,
2016.