Lower Bound on the Optimal Access Bandwidth of ()-MDS Array Code with Degraded Read Friendly
Ting-Yi Wu1,
Yunghsiang S. Han21,
Zhengrui Li31,
Bo Bai1,
Gong Zhang1,
Liang Chen1,
Xiang Wu1
1Huawei Technologies Co., Ltd.
2Dongguan University of Technology
3University of Science and Technology of China
I Preliminary
Notation:
•
denotes the field of size .
•
denotes empty set.
•
and .
•
.
•
.
•
.
•
.
•
•
We let the parity-check matrix of the ()-MDS array be
(1)
where and these upper identities make the degraded read friendly.
To satisfy MDS property, must be invertible for any ,
which is equivalent to that must be invertible [1] for any ,
For any node , it stores two symbols as a vector .
To repair the th node, we have to find two check equations from the row space of in order to retrieve .
Let be a repair matrix of size , denotes two check equations from the row space of such that
(2)
and
(3)
where for all .
If can be used to recover , we have to make sure that is invertible and recover as
(4)
To be able to repair each node, we have to design the corresponding repair matrix for each .
Since some repair matrices can be used to recover multiple nodes, i.e., and are both invertible for some , we are not restricted to find different repair matrices for recovering all nodes.
Let be the number of repair matrices, , and be a partition of ,
a repair process of size is formed by repair matrices, , where can be used to recover is .
The formal definition of a repiar process of size is given below.
Definition 1.
A repair process of repair matrices is defined as
(5)
such that , , , form a partition of and is invertible for .
We then explain the process of recovering by a , where .
Let
Since , implies due to .
Let be a repair matrix of , we have an invertible matrix such that
(24)
Since and are invertible, we conclude that is also invertible.
From (24), we have for all . Since both and are invertible if , is invertible if .
Therefore, from (9), we have .
∎
Example.
is okay, but
is impossible, where and for all .
Lemma 5.
Given a , then is of rank for all .
Note.
This lemma can be applied to .
Proof:
Since and is invertible if ,
(25)
(26)
(27)
Therefore, for all .
∎
Lemma 6.
Given a , if and are both in (or ), and
are both in (or ) for some , , and , then is of rank .
Note.
This lemma can be applied to .
Proof:
Since is invertible and if is of full rank,
(28)
Combining (28) with Lemma 5, we conclude that
is of rank for all .
∎
Lemma 7.
If for some , , and invertible matrix , then both and can be used to repair nodes in and .
Proof:
Since is invertible, is invertible if and only if is invertible.
With the facts that is invertible when and is invertible when , we can conclude that both and are invertible when .
Therefore, both and can be used to repair nodes in .
From (9), , for some , denotes that , so .
Also, , for some , denotes that , so .
Furthermore, , for some and , denotes that , so .
Also, , for some and , denotes that , so .
We then prove that if for some and , then .
Assuming , so .
Hence we can conclude that from (9).
By contradiction, , for some and , implies .
Following the similar way, we can also prove that if for some and , then .
Combining above results and (9), we can conclude that for some , , and for all . Therefore, .
∎
Theorem 1.
For any such that and are both in (or ), and
are both in (or ) for some , and , where and ,
then the repair process
(29)
has the same total repair bandwidth as has.
Proof:
By Lemmas 5 and 6, we can have an invertible transform matrix such that .
Then according to Lemma 7, can repair nodes in and . Therefore,
(30)
(31)
(32)
(33)
(34)
∎
Lemma 8.
Given a , if for some , then
(35)
(36)
where .
Proof:
Due to Lemma 4 and , we can not have for all .
By Definition 2,
(37)
(38)
(39)
(40)
∎
We then present the properties of repair matrices of any .
Lemma 9.
Given a ,
if for some , then is not invertible, where .
Proof:
Since , we have
(41)
which induces
(42)
By pigeonhole principle, there must be , such that and both (or ).
WLOG, we assume and then we have
(43)
(44)
(45)
Since is not invertible and is invertible, we can conclude that is not invertible.
∎
We first prove that for some if .
Let for some , then we have
(50)
Since is not invertible, then
is not invertible as well,
which implies that the determinant of is .
Therefore, implies .
We next prove that for some ,
if and such that .
Let and , we have
(51)
(52)
Since is invertible, then
(53)
for some .
Lastly, we prove that for some if for some .
Since
(54)
and ,
for some .
Therefore, we can conclude that
,
if and such that .
A similar proof can be made for (3).
∎
Theorem 2.
Given a and let , if for some , then either for all or for all .
Proof:
From Lemma 8 and ,
we can find at least different indexes from , , , and , such that . Hence, .
By pigeonhole principle, there must be two of are either in or .
WLOG, we assume .
By Lemma 10, we can construct a repair matrix
(55)
where and for all .
Suppose there is a such that , then
(56)
for some .
Since , we have and .
Therefore, , which contradicts to the assumption of .
Therefore, we can prove that, for all , .
A similar proof can be made for for all when we assume .
∎
III Lower bound on
Let be the set of the all effective111From Theorem 1, we know that can be reduced to if there exists different and different such that (or ) and (or ).
For those repair processes of size satisfy the above constraint are said to be not effective and excluded from to avoid duplicate computation.
repair processes of size , our goal is to minimizes the total repair bandwidth over for a given , i.e.,
(57)
Let
(58)
as stated in Lemmas 3 and 8,
the total repair bandwidth of is
(59)
(60)
(61)
Therefore,
(62)
which means that minimizing over is equivalent to maximizing over .
For any repair process , we define as
(63)
From Theorem 2, we know that either for all or for all . Hence we can categorize all repair matrices into types.
Since we can swap any and , , without effecting the total repair bandwidth,
WLOG, we assume that there is a such that
(64)
and
(65)
Since we can also swap any and in without effecting the total repair bandwidth of ,
WLOG, we can assume that any is less than any for all . Let for all , then can be uniquely decided by as
(66)
(67)
(68)
(69)
(70)
(71)
where .
By summarizing Theorems 1 and 2,
we can have the following constraints on .
•
Constraint 1-L:
There exists no distinct and distinct such that ;
•
Constraint 1-R:
There exists no distinct and distinct such that ;
We then turn our attention to the block matrices which satisfy above constraints without considering the repair process .
Given , , and such that , , and , we define a set of block matrices as
(72)
where .
For every , similar with the definitions of and , we define the and as
(73)
(74)
It should be noted that, as discussed at the beginning of this section, we can construct a for any repair process such that and ; on the contrary, there is no guarantee that we can construct a such that for any .
Therefore, we can conclude that
(75)
where
(76)
We then turn our attention to rather than .
Theorem 3.
.
Proof:
Given an , we consider an such that
(77)
where , , and .
Since there must be at least invertible matrices in the th blockwise row of any , we have for any .
Since for all , the total repair bandwidth of any is lower bounded as
(78)
(79)
(80)
for any .
Therefore,
(81)
(82)
(83)
(84)
(85)
(86)
(87)
∎
Lemma 11.
Proof:
Given an
(88)
where , , and . We then have
(89)
(90)
(91)
(92)
(93)
(94)
(95)
∎
Lemma 12.
Given an such that and .
Let
(96)
where , , , , and .
Then,
(97)
Proof:
According to (72), the constraints (1-L and 2-L) on the first rows of are different from the constraints (1-R and 2-R) on the last rows of .
Therefore, the design of the first row is irrelevant to the design of the remaining rows and vice versa.
Also, for any , we can replace the matrices in with the matrices in and the matrices in with the matrices in without effecting the total repair bandwidth of ,
hence we have
(98)
where and .
WLOG, we assume .
We first prove that, for any , we can find an such that .
If is with , we can construct by replacing , for all , with matrices in without violating any constraint. Then
(99)
(100)
Therefore, we only need to consider those while minimizing the total repair bandwidth.
Let be with , as discussed before, we can have for all to minimize to be .
We then consider the first two rows of .
All those for all and
for all can be in without violating Constraint 1-L. Hence, we have
(101)
where its submatrix
(102)
must satisfy Constraint 1-L.
Let , if , then the submatrix in (102) must violate Constraint 1-L according to the pigeonhole principle, which induces .
Therefore,
(103)
(104)
(105)
where (105) can be minimized by maximizing when .
Since by (102) and , we can minimize by assigning and as did in (88).
∎
can be achieved by the in (88) while .
For those ,
where , we can achieve
(108)
by
(109)
and .
Therefore, we conclude that
(110)
(111)
(114)
(115)
∎
We then look into when .
From Constraints 1-L, 1-R, 2-L, and 2-R
we notice that, among all , we can minimize the total bandwidth by separately minimizing bandwidth of the first rows in and the bandwidth of the remaining rows in , i.e.,
(116)
(117)
(119)
Also as discussed in (62), we can turn the minimization problem in to a maximization program. Therefore, we first focus on the following problem,
Next, for those remaining rows of , we can have a similar lemma below.
Lemma 15.
Given and , for any , we can always construct a such that
(143)
Proof:
The proof is similar to the proof of Lemma 14, hence we omit it here.
∎
Combining Theorems 3, 4 and Lemmas 14, 15, we can lower bound for all in the following theorem.
Theorem 5.
Let
(144)
and
(145)
then for all .
Proof:
As discussed in (98), we only need to consider the case of .
Lemma 11 and Theorem 4 have proven for the case of . We next consider the case of .
For the case of and , any can be replaced by some such that due to Lemma
14.
Therefore, when and .
The remaining case is of and .
Since , this case is equivalent to the case of .
For the case of ,
any can be replaced by some such that
, , . Therefore, we can conclude that
(146)
when .
We then prove that (146) can be further lower bounded by .
As discussed in the beginning of this section, we swap rows or columns without effecting the repairing bandwidth of . Hence, WLOG, we can assume and for any .
By following the same trick in proving Lemma 12, we can have
The derived lower bound, for , is plotted in Figure 1.
The values at and are achieved by the codes provided by the department in Israel,
which implies that the derived bound is so tight that it can be achieved.
It should be noted that for all , except when and .
Figure 1: The lower bound from to .
References
[1] John R. Silvester, “Determinants of Block Matrices,”
The Mathematical Gazette, vol. 84, no. 501, pp. 460–467, Nov. 2000.