2cm2cm2.54cm2cm
The standard form and convergence theory of the relaxation Kaczmarz-Tanabe method for solving linear systems
Abstract
The Kaczmarz method is a popular iterative method for solving consistent, overdetermined linear system such as medical imaging in computerized tomography. The Kaczmarz’s iteration repeatedly scans all equations in order, which leads to lower computational efficiency especially in solving a large scale problem. The standard form of Kaczmarz-Tanabe’s iteration proposed recently effectively overcomes the computational redundancy problem of the Kaczmarz method. In this paper, we introduce relaxation parameters into the Kaczmarz-Tanabe method based on the relaxation Kaczmarz method, and consider the standard form and convergence of this combination. Moreover, we analyze and prove the sufficient conditions for convergence of the relaxation Kaczmarz-Tanabe method, i.e., . Numerical experiments show the convergence characteristics of the relaxation Kaczmarz-Tanabe method corresponding to these parameters.
Key words. Kaczmarz method; Relaxation Kaczmarz method; Relaxation Kaczmarz-Tanabe method; Computerized tomography
AMS subject classification. 65F10; 65F08; 65N22; 65J20
1 Introduction
In medical imaging tomography (see, i.e., [1, 2, 3]) such as computerized tomography (CT), magnetic resonance imaging (MRI) and electrical impedance tomography (EIT), people are often required to solve the following linear system of equations [4, 5, 6],
(1.1) |
where are also called projection matrix and measurement vector, respectively. We suppose that (1.1) is over-determined and consistent, and is a true solution. When (1.1) has many solutions, is used to denote the minimum least-squares solution.
The Kaczmarz method [7] is one of the most popular methods to solve (1.1) in computerized tomography. Let , Kaczmarz’s iteration is written as
(1.2) |
where , and denote the inner product of and the square norm of in , respectively.
The Kaczmarz method was originally discovered by Kaczmarz in 1937, and it was rediscovered by Gordon, Bender and Herman [8] to deal with X-ray photography as an algebraic reconstruction technique (ART) in 1970. The convergence of the Kaczmarz method was established by Tanabe in 1971. Recently, Kang and Zhou gave the convergence rate results of the Kaczmarz method [9]. In order to improve computing speed, Eggermont [10], Censor [11], et al. considered the problem of block calculation and adopted approximate and executable iterative forms of the block Kaczmarz method to solve this problem. The strictly block Kaczmarz method was given by Rebrova and Needell [12]. In addition, the simultaneous iterative reconstructive technique (SIRT) as another important branch is also developed rapidly, and there appeared many known methods) such as Cimmino method[13], Landweber method [14], CAV method [15], DROP method [16], and SART method[17], etc.
With the development of hardware technology and computing technology, the idea of stochastic theory is continuously integrated into the non-randomized methods. In this context, people introduced and investigated the randomized Kaczmarz method [18, 19, 20, 21, 22] and greedy randomized Kaczmarz methods [23, 24].
From the perspective of SIRT methods, Kang [25] considered the matrix-vector form of the Kaczmarz method, i.e., the Kaczmarz-Tanabe’s iteration which is named by Popa [26], and the iteration reads
(1.3) |
where denotes identity matrix of whatever size appropriate to the context, and
Compared with the classic form of the Kaczmarz-Tanabe iteration, the iteration (1.3) has been greatly improved. However, each column of is the product of matrix and column vector from . Notice this defect of , Kang [27] considered the decomposition of and gave the standard form of the Kaczmarz-Tanabe’s iteration, i.e.,
(1.4) |
where is defined as [27, Corollary 2.7]. The iterative scheme (1.4) has no essential difference from the iterative formula of the SIRT methods.
In this paper, we combine the relaxation Kaczmarz method and the Kaczmarz-Tanabe method, and introduce relaxation parameters into Kaczmarz-Tanabe method and propose the relaxation Kaczmarz-Tanabe method. Just as the convergence of the relaxation method is related to the relaxation parameters, the convergence of the relaxation Kaczmarz-Tanabe is also related to the relaxation parameters. Hence we further consider the convergence conditions (about the relaxation parameters) and convergence results of the relaxation Kaczmarz-Tanabe method.
The rest of this work is organized as follows. In Section 2, we consider the relaxation Kaczmarz method and derive the sufficient conditions for its convergence. In Section 3, we consider the standard form of the relaxation Kaczmarz-Tanabe method, and discuss their properties such as decomposition of , convergence condition, convergence, and so on. Section 4 is about the algorithm flow to compute appearing in (3.19) and (3.24). In Section LABEL:section.numerical.tests, we verify the effectiveness of the Kaczmarz-Tanabe method with given relaxation parameters by several classical numerical examples.
2 The relaxation Kaczmarz method and its convergence
The relaxation Kaczmarz’s iteration is written as
(2.1) |
where and are relaxation parameters. Let
(2.2) |
then it follows from (2.1) that
(2.3) |
Denote
(2.4) |
then the following lemma gives a sufficient condition for the nonexpansion of linear operator .
Lemma 2.1.
Let be defined as in (2.4), then, for any ,
hold and the equality hold iif . Moreover, for any there also holds when .
Proof. For any , we have
(2.5) |
If and , then , consequently, . Conversely, if , then , and it follows from (2.5) that . Moreover, there also holds when . ∎
Lemma 2.2.
Proof. We prove the result by mathematical induction. First, we know that . Second, we suppose that for any given there holds , then . In fact, for any ,
This proves . To sum up, for any .∎
Lemma 2.2 shows that, for any relaxation parameter , the iterative error of the relaxation Kaczmarz method belong to . The following theorem gives the convergence result of the relaxation Kaczmarz method for solving a consistent linear system when (where ).
Theorem 2.3.
For a consistent linear system , let be generated by (2.1) and relaxation parameter . Then the sequence converges to the solution when .
3 The standard form of the relaxation Kaczmarz-Tanabe method
We start with the first equation of (1.1) and perform the iteration (2.3) in turn until the -th equation, and denote the -th iteration with , then get
(3.1) |
For , let
(3.2) | |||
(3.3) |
If for each , then . In addition, we agree that , where denote identity matrix of whatever size appropriate to the context.
Let
(3.4) |
Thus we get from (3), (3.2), (3.3) and (3.4) that
Consequently, it follows by repeating this iteration that
(3.5) |
Taking , then we get the relaxation Kaczmarz-Tanabe’s iteration
(3.6) |
where .
Lemma 3.1.
Let and be defined as in (3.3), then there holds
(3.7) |
By Lemma 3.1, we can get the alternative form of (3.6), i.e.,
For the further discussion to (3.6), we extend the conceptions of sequential projection matrix, sequential projection matrix set and sequential compatible to (see [27, Definition 2.1 & Definition 2.2]).
Definition 3.2.
We call a sequential projection matrix with parameters on , and denote the sequential projection matrix set with , i.e.,
(3.8) |
Definition 3.3.
For any , if there exist such that
(3.9) |
then we call and sequentially compatible. In general, for any , if there exist such that
(3.10) |
then we call and forward sequential compatible, and call compatible vector of on .
Theorem 3.4.
Suppose has no zero row, and is defined by (3.8), then and are forward sequential compatible.
Proof. Similar to [27, Theorem 2.6], we will complete the proof by mathematical induction. We take the subscript of as an ordered array. Obviously, , that is, (3.10) holds for and . In fact, for any , (3.10) obviously holds for because is the agreement. Consequently, As the first step of mathematical induction, we prove (3.10) holds for . Actually,
So (3.9) holds for , where
Second, we suppose (3.10) holds for any given subjected to and , i.e., there exists such that
Then, we prove (3.10) holds for . Because of , then
(3.11) |
From the hypothesis, there exist and such that
Then, it follows from (3.11) that
Denote
This proves (3.10) holds for .
To sum up the above, the conclusion is proved for all with respect to . Namely, and generated by relaxation Kaczmarz’s iteration are forward sequential compatible. ∎
In order to obtain the standard form (i.e., matrix-vector form) of the relaxation Kaczmarz-Tanabe’s iteration, we first import the following decomposition theorem about .
Theorem 3.5.
Suppose has no zero rows, and let be defined by (3.4), then there exists a unit upper triangular matrix such that .
Proof. According to and Theorem 3.4, the corollary can be proved by taking , where is determined by the forward sequential compatible of and . ∎
Theorem 3.5 is a general result, but it shows that there exists a structure-reserved decomposition in relaxation Kaczmarz-Tanabe’s iteration when a relaxation parameter is introduced into Kaczmarz’s iteration. However, the decomposition is not unique when is not row full rank.
By Theorem 3.5, then we get the standard form of the relaxation Kaczmarz-Tanabe’s iteration from (3.6) that
(3.12) |
In order to get the special form of in Kaczmarz-Tanabe’s iteration, Kang introduced the concept of index set in [27, Definition 2.9], i.e.,
Definition 3.6.
The index set is defined as follows
where are positive integers satisfied . is integer between and , and . For any there satisfy
By Definition 3.6, we can give the following expression of for any and .
Lemma 3.7.
Suppose that has no zero row, is the sequential projection matrix of and . For any , denote
(3.13) |
Then,
(3.14) |
holds. That is, the compatible vector of on is .
Proof. When and , we have
By the agreement ,
(3.15) |
Thus (3.14) holds by taking according to (3.13). Because (3.14) holds for any , we then get the compatible vector of on . ∎
Theorem 3.8.
Proof. For any ,
(3.20) |
By Lemma 3.7 and (3.19), we then get
(3.21) |
Moreover, when , (3.21) obviously holds. Therefore, for any , there holds , which means . ∎
In the context, we especially call the iteration (3.12) relaxation Kaczmarz-Tanabe’s iteration when is taken according to (3.19).
Let be a matrix obtained by multiplying the -th row of the identity matrix by and adding it to the -th row, i.e., the diagonal elements of are all , the - element is , and all other elements are . Consequently, we have the following decomposition theorem of .
Theorem 3.9.
Proof. For any , from (3), we have
(3.23) |
From (3), the coefficient of is actually the -element of when , i.e.,
Denote , in order to show , we only need to prove , i.e.,
where and are the th and th columns of the identity matrix in , respectively. Owing to when and when , when it then follows from the notation of that
This proves
for any and . Additionally, there also holds
for any . Consequently, the conclusion is proved. ∎
Theorem 3.9 actually provides a computational method of for the relaxation Kaczmarz-Tanabe’s algorithm. Moreover, there also holds the following decomposition about .
Lemma 3.10.
Let be defined like in Theorem 3.8, then there exists decomposition of
(3.24) |
where is an upper triangular matrix with diagonal elements .
Proof. The proof of the lemma is trivial and omitted here. ∎
3.1 The convergence of the relaxation Kaczmarz-Tanabe method
Next, we consider the convergence of the relaxation Kaczmarz-Tanabe method and give a sufficient condition of convergence for the relaxation Kaczmarz-Tanabe’s iteration. Notice that each relaxation Kaczmarz-Tanabe’s iteration is a composite of relaxation Kaczmarz’s iterations, consequently, it is natural to transfer the convergence condition of the relaxation Kaczmarz method to the relaxation Kaczmarz-Tanabe method.
Lemma 3.11.
For a linear system , let be defined by (3.3), and we suppose for each . Then, for any , there holds , and the equality holds iif .
Proof. When , from , therefore there holds
(3.25) |
If , from Lemma 2.1, there exists at least an index such that . We suppose that is the smallest index in that , then there holds
These prove the conclusions. ∎
Lemma 3.11 actually gives the condition to ensure the nonexpansion of on . Let be generated by (3.12) and , then
it follows from Lemma 3.1 that
(3.26) |
By Lemma 3.11, thus we have the following convergence of the relaxation Kaczmarz-Tanabe method.
Theorem 3.12.
Proof. Let , by Lemma 3.11, we have . Because the sequence generated by the relaxation Kaczmarz-Tanabe’s iteration is a subsequence of the sequence generated by the relaxation Kaczmarz’s iteration, there holds from Lemma 2.2 that . Consequently, by (3.26), we have
which shows as , i.e., .∎
Moreover, we have the following convergence rate of the relaxation Kaczmarz-Tanabe method for solving error-free linear system.
Corollary 3.13.
Under the conditions of Theorem 3.12, there also hold
(3.27) |
and
(3.28) |
where is the singular value of .
4 The related algorithms
The implementation of the relaxation Kaczmarz-Tanabe method are divided into two processes. The first process is to compute for given , and the second process is to execute the Kaczmarz-Tanabe’s iteration repeatedly. In Algorithm 4, we list the flowchart to compute . After computing , the implementation of the relaxation Kaczmarz-Tanabe’s iteration is trivial, and we list the process in Algorithm 4.
Algorithm 1 The calculation of matrix
Algorithm 2 The relaxation Kaczmarz-Tanabe method solver
5 Conclusion
In this paper, we consider the relaxation Kaczmarz-Tanabe method by introducing the relaxation parameter idea into Kaczmarz-Tanabe method. The development of this work is based on the research to the Kaczmarz-Tanabe method. We first make some exploration on the selections of the relaxation parameter for the relaxation Kaczmarz-Tanabe method, then analyze the related theories of relaxation Kaczmarz-Tanabe’s iteration such as the the decompositions of and .
In numerical tests, we experiment several classic examples to verify these theoretical results of the selections of relaxation parameters for the relaxation Kaczmarz-Tanabe method. Although we prove that is the sufficient condition for the convergence of the relaxation Kaczmarz-Tanabe method, the operation of adjusting parameters one by one is tedious and unnecessary. Consequently, we carry out many experiments with constant and random parameters to test the effect of parameter selection, moreover the experimental results of random parameters setting show the effectiveness of the specific case. However the selection of optimal parameters still needs further research, and in the current situation may still be a compromise for the relaxation Kaczmarz-Tanabe method.
Reference
References
- [1] R. S. Ledley and W. R. Ayers. Computerized medical imaging and graphics evolves from computerized tomography. Comput. Med. Imag. Grap., 12(1):v–xviii, 1988.
- [2] G. T. Herman. Fundamentals of computerized tomography. Academic Press, 2010.
- [3] F. Natterer. The mathematics of computerized tomography. Medical Physics, 32, 1986.
- [4] B. Jin and Y. Xu. Adaptive reconstruction for electrical impedance tomography with a piecewise constant conductivity. Inverse Problems, 36:014003, 2020.
- [5] G. S. Alberti, H. Ammari, B. Jin, J. K. Seo, and W. Zhang. The linearized inverse problem in multifrequency electrical impedance tomography. SIAM J. Imaging Sci., 9:1525–51, 2016.
- [6] Y. T. Chow, K. Ito, and J. Zou. A direct sampling method for electrical impedance tomography. Inverse Problems, 30:095003, 2014.
- [7] S. Kaczmarz. Angenäherte auflösung von systemen linearer gleichungen. Bull. Int. Acad. Pol. Sci. Lett. A, 35:355–357, 1937.
- [8] R. Gordon, R. Bender, and G. T. Herman. Algebraic reconstrction techniques (ART) for three dimensional electron microscopy and X-ray photography. J. Theoret. Biol., 29(3):471–481, 1970.
- [9] C. Kang and H. Zhou. The extension of convergence rates of Kaczmarz type methods. J. Comput. Appl. Math., 382:113099, 2021.
- [10] P. P. B. Eggermont, G. T. Herman, and A. Lent. Iterative algorithms for large partitioned linear systems, with applications to image reconstruction. Linear Algebra Appl., 40(OCT):37–67, 1981.
- [11] Y. Censor, P. P. B. Eggermont, and D. Gordon. Strong underrelaxation in Kaczmarz’s method for inconsistent systems. Numer. Math., 41(1):83–92, 1983.
- [12] E. Rebrova and D. Needell. On block gaussian sketching for the kaczmarz method. Numer. Algorithms, 86(1):443–473, 2021.
- [13] G. Cimmino. Calcolo approssimato per le soluzioni dei sistemi di equazioni lineari. La Ricerca Scientifica,Series II, 9:326–333, 1938.
- [14] L. Landweber. An iteration formula for Fredholm integral equations of the first kind. Am. J. Math., 73(3):615–624, 1951.
- [15] Y. Censor, G. Dan, and R. Gordon. Component averaging: An efficient iterative parallel algorithm for large and sparse unstructured problems. Parallel Comput., 27(6):777–808, 2001.
- [16] Y. Censor, T. Elfving, G. T. Herman, and T. Nikazad. On diagonally relaxed orthogonal projection methods. SIAM J. Sci. Comput., 30(1):473–504, 2007.
- [17] P. C. Hansen and J. S. Jorgensen. AIR Tools II: Algebraic iterative reconstruction method, improved implementation. Numer. Algorithms, 79(1):107–137, 2018.
- [18] T. Strohmer and R. Vershynin. A randomized Kaczmarz algorithm with exponential convergence. J. Fourier Anal. Appl, 15:262–278, 2009.
- [19] Y. L. Jiao, B. T. Jin, and X. L. Lu. Preasymptotic convergence of randomized Kaczmarz method. Inverse Problems, 33, 2017.
- [20] D. Needell. Randomized Kaczmarz solver for noisy linear systems. Behav. Inf. Technol., 50(2):395–403, 2010.
- [21] D. Needell and J. A. Tropp. Paved with good intentions: Analysis of a randomized block Kaczmarz method. Linear Algebra Appl., 441(1):199–221, 2014.
- [22] D. Needell, R. Zhao, and A. Zouzias. Randomized block Kaczmarz method with projection for solving least squares. Linear Algebra Appl., 484:322–343, 2015.
- [23] Z. Z. Bai and W. T. Wu. On convergence rate of the randomized Kaczmarz method. Linear Algebra Appl., 553:252–269, 2018.
- [24] Z. Z. Bai and W. T. Wu. On greedy randomized Kaczmarz method for solving large sparse linear systems. SIAM J. Sci. Comput., 40:A592–A606, 2018.
- [25] C. Kang. Convergence rates of the Kaczmarz-Tanabe method for linear system. J. Comput. Appl. Math., 394:113577, 2021.
- [26] C. Popa. Convergence rates for Kaczmarz-type algorithms. Numer. Algorithms, 79:1–17, 2018.
- [27] C. Kang. The standard forms and convergence theory of the Kaczmarz-Tanabe type methods for solving linear systems. Journal of Computational and Applied Mathematics, page 115333, 2023.