Repairing Reed-Solomon Codes
with Side Information

Thi Xinh Dinh^†∗, Ba Thong Le^∗, Son Hoang Dau^†, Serdar Boztas^†, Stanislav Kruglik^‡, Han Mao Kiah^‡,
Emanuele Viterbo**, Tuvi Etzion^#, and Yeow Meng Chee^#
^†RMIT University, ^‡Nanyang Technological University, ^∗Tay Nguyen University,
^∗∗Monash University, ^#National University of Singapore

Abstract

We generalize the problem of recovering a lost/erased symbol in a Reed-Solomon code to the scenario in which some side information about the lost symbol is known. The side information is represented as a set $S$ of linearly independent combinations of the sub-symbols of the lost symbol. When $S=\varnothing$ , this reduces to the standard problem of repairing a single codeword symbol. When $S$ is a set of sub-symbols of the erased one, this becomes the repair problem with partially lost/erased symbol. We first establish that the minimum repair bandwidth depends on $|S|$ and not the content of $S$ and construct a lower bound on the repair bandwidth of a linear repair scheme with side information $S$ . We then consider the well-known subspace-polynomial repair schemes and show that their repair bandwidths can be optimized by choosing the right subspaces. Finally, we demonstrate several parameter regimes where the optimal bandwidths can be achieved for full-length Reed-Solomon codes.

I Introduction

To prevent data loss and increase data availability in distributed storage systems, a file is usually split into $k$ data chunks and transformed (encoded) into $n>k$ coded chunks using an erasure code, and then stored across $n$ different storage nodes. If the code is MDS [1], such a system can withstand any $n-k$ failures because the entire file can be recovered from any $k$ chunks. When only one node fails, which is usually the most typical case (see, e.g. [2]), a repair/replacement node must download enough data from other helper nodes to recover its lost chunk. In the repair-bandwidth problem [3, 4], one seeks to minimize the repair bandwidth, i.e. the amount of downloaded data required for a successful recovery of the lost chunk. A low-bandwidth repair scheme can also be used for degraded read, in which requests for a chunk stored at an unavailable node can be served by other available nodes [5]. This important problem has been extensively studied in the literature (see, e.g. [6] and the references therein).

In this work we generalize the setting of the repair-bandwidth problem to accommodate side information (see Fig. 1 for a toy example). In information theory, the concept of side information has been investigated in numerous contexts, including source coding [7], channel coding [8], list decoding [9], index coding [10, 11], and private information retrieval [12]. In the context of the repair problem, side information refers to the additional information that the repair node knows about the lost chunk while recovering it. The side information could arise due to a partial loss of data, which means that part of the chunk is still accessible and serves as side information, or due to partial information gained from the previous communications or from other sources. The question of interest is that given the side information, what is the lowest repair bandwidth we can achieve. We refer to this as the repair-bandwidth with side information problem.

Refer to caption — Figure 1: An illustration of repair schemes that recover $\bm{a}=(a_{1},a_{2})$ with and without side information. The side information $a_{1}+a_{2}$ leads to a reduction of 1 bit in the repair bandwidth. The repair node first obtains $a_{2}\leftarrow(a_{2}+b_{1})-b_{1}$ , and then $a_{1}\leftarrow(a_{1}+a_{2})-a_{2}$ .

In the scope of this work, we focus on Reed-Solomon codes, which is currently the most widely used families of erasure codes in distributed storage systems (see [13]). The repair bandwidth as well as the closely related metric called I/O cost and sub-packetization size have been investigated in a number of recent works for different families of Reed-Solomon codes [14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 13, 33, 34, 35, 36, 37]. For an $[n,k]_{\mathbb{F}_{q^{\ell}}}$ Reed-Solomon code over the finite field $\mathbb{F}_{q^{\ell}}$ , a coded chunk, which is an element in $\mathbb{F}_{q^{\ell}}$ (called a symbol), is repaired from a number of $\mathbb{F}_{q}$ -elements (called sub-symbols) extracted from other coded chunks. Each symbol $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}$ consists of $\ell$ sub-symbols in $\mathbb{F}_{q}$ , i.e. $\bm{\gamma}=(\gamma_{1},\ldots,\gamma_{\ell})\in\mathbb{F}_{q}^{\ell}$ . The repair bandwidth is measured in the number of extracted sub-symbols, while the side information of a symbol $\bm{\gamma}$ can be represented as a set $S$ of $\mathbb{F}_{q}$ -linearly independent combinations of its sub-symbols $\gamma_{1},\ldots,\gamma_{\ell}$ . In summary, our contributions are given below.

•

We show that the minimum repair bandwidth for a codeword symbol of a Reed-Solomon code given the side information $S$ depends on it size $|S|$ , and independent of the specific choice of its elements.
•

We obtain a lower bound on the repair bandwidth of a linear repair scheme for a failed node with side information.
•

For subspace-polynomial repair schemes for $[n,k]_{\mathbb{F}_{q^{\ell}}}$ Reed-Solomon codes with $n-k\geq q^{m}$ , $m<\ell$ ([15, 16, 17, 31]), we prove that special subspaces can be chosen to minimize the repair bandwidth among all such schemes.
•

A subspace that minimizes the repair bandwidth among all subspace-polynomial repair schemes can be found by solving an optimization problem on subspace intersections, which is of its own interest. We solve the problem in a few parameter regimes, leaving others for future research.

The paper is organized as follows. Section II provides required notations and definitions. Section III is devoted for the description and solutions of the repair with side information problem in different cases. We conclude the paper in Section IV.

II Preliminaries

II-A Definitions and Notations

Let $q$ be a prime power, $\mathbb{F}_{q}$ be the finite field of $q$ elements and $\mathbb{F}_{q^{\ell}}$ be the extension field of degree $\ell$ of $\mathbb{F}_{q}$ . We use $[n]$ to denote the set $\{1,2,\ldots,n\}$ , $a\mid b$ to denote that $a$ divides $b$ , and $(a,b)$ for $\gcd(a,b)$ , for $a,b\in\mathbb{Z}$ . For a set $U$ , let $U^{*}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}U\setminus\{0\}$ , and $\bm{\gamma}U\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\{\bm{\gamma}u\colon u\in U\}$ . We use $\mathsf{span}_{\mathbb{F}_{q}}(U)$ to denote the $\mathbb{F}_{q}$ -subspace of $\mathbb{F}_{q^{\ell}}$ spanned by a subset $U\subseteq\mathbb{F}_{q^{\ell}}$ . We use $\dim_{\mathbb{F}_{q}}(\cdot)$ and $\mathsf{rank}_{\mathbb{F}_{q}}(\cdot)$ (or $\dim(\cdot)$ and $\mathsf{rank}(\cdot)$ for short) for the dimension of a subspace and the rank of a set of vectors over $\mathbb{F}_{q}$ . The (field) trace of an element ${\bm{b}}\in\mathbb{F}_{q^{\ell}}$ over $\mathbb{F}_{q}$ is $\mathsf{Tr}_{{\mathbb{F}}_{q^{\ell}}/{\mathbb{F}}_{q}}({\bm{b}})=\sum_{i=0}^{\ell-1}{\bm{b}}^{q^{i}}$ . We also write $\mathsf{Tr}(\cdot)$ instead of $\mathsf{Tr}_{{\mathbb{F}}_{q^{\ell}}/{\mathbb{F}}_{q}}(\cdot)$ for simplicity.

Let ${\mathcal{C}}$ be a linear $[n,k]$ code over $\mathbb{F}_{q^{\ell}}$ . Then ${\mathcal{C}}$ is an $k$ -dimensional $\mathbb{F}_{q^{\ell}}$ -subspace of $\mathbb{F}_{q^{\ell}}^{n}$ . A codeword of ${\mathcal{C}}$ is an element $\vec{\bm{c}}=({\bm{c}}_{1},{\bm{c}}_{2},\ldots,{\bm{c}}_{n})\in{\mathcal{C}}$ and its codeword symbols are the components ${\bm{c}}_{j}$ , $j\in[n]$ . The dual code of a code ${\mathcal{C}}$ is the orthogonal complement ${\mathcal{C}}^{\perp}$ of ${\mathcal{C}}$ , ${\mathcal{C}}^{\perp}=\{\vec{\bm{g}}\in\mathbb{F}_{q^{\ell}}^{n}:\langle\vec{\bm{c}},\vec{\bm{g}}\rangle=0,\forall\vec{\bm{c}}\in{\mathcal{C}}\}$ , where $\langle\vec{\bm{c}},\vec{\bm{g}}\rangle$ is the scalar product of $\vec{\bm{c}}$ and $\vec{\bm{g}}$ . The code ${\mathcal{C}}^{\perp}$ is an $\mathbb{F}_{q^{\ell}}$ -subspace of $\mathbb{F}_{q^{\ell}}^{n}$ with dimension $n-k$ . The elements of ${\mathcal{C}}^{\perp}$ are called dual codewords. The number $n-k$ is called the redundancy of the code ${\mathcal{C}}$ .

Definition 1.

Let $A=\{\bm{\alpha}_{j}\}_{j=1}^{n}$ be a subset of size $n$ in $\mathbb{F}_{q^{\ell}}$ . A Reed-Solomon code $\text{RS}(A,k)\subseteq\mathbb{F}_{q^{\ell}}^{n}$ of dimension $k$ with evaluation points $A$ is defined as

\text{RS}(A,k)=\big{\{}\big{(}f(\bm{\alpha}_{1}),\ldots,f(\bm{\alpha}_{n})\big{)}\colon f\in\mathbb{F}_{q^{\ell}}[x],\ \deg(f)<k\big{\}},\vspace{-5pt}

where $\mathbb{F}_{q^{\ell}}[x]$ is the ring of polynomials over $\mathbb{F}_{q^{\ell}}$ . We also use the notation RS $(n,k)$ , ignoring the evaluation points.

A generalized Reed-Solomon code, $\text{GRS}(A,k,\vec{\bm{\lambda}})$ , where $\vec{\bm{\lambda}}=(\bm{\lambda}_{1},\ldots,\bm{\lambda}_{n})\in\mathbb{F}_{q^{\ell}}^{n}$ , is the set of codewords $\big{(}\bm{\lambda}_{1}g(\bm{\alpha}_{1}),\ldots,\bm{\lambda}_{n}g(\bm{\alpha}_{n})\big{)}$ , where $\bm{\lambda}_{j}\neq 0$ for all $j\in[n]$ , $g\in\mathbb{F}_{q^{\ell}}[x],\ \deg(g)<n-k$ . The dual code of a Reed-Solomon code $\text{RS}(A,k)$ is a generalized Reed-Solomon code $\text{GRS}(A,n-k,\vec{\bm{\lambda}})$ , for some multiplier vector $\vec{\bm{\lambda}}$ ([1, Chap. 10]). We sometimes use the notation GRS $(n,k)$ , ignoring $A$ and $\vec{\bm{\lambda}}$ .

Let $f(x)$ be a polynomial corresponding to a codeword of the Reed-Solomon code ${\mathcal{C}}=\text{RS}(A,k)$ , and $g(x)$ be a polynomial of degree at most $n-k-1$ , which corresponds to a codeword of the dual code ${\mathcal{C}}^{\perp}$ . Then $\sum_{j=1}^{n}g(\bm{\alpha}_{j})\big{(}\bm{\lambda}_{j}f(\bm{\alpha}_{j})\big{)}=0,$ and we call the polynomial $g(x)$ a check polynomial for ${\mathcal{C}}$ .

II-B Trace Repair Method

Let RS $(n,k)$ be a Reed-Solomon code over $\mathbb{F}_{q^{\ell}}$ with evaluation points $A$ , $\vec{\bm{c}}$ a codeword corresponding to polynomial $f(x)$ , $f\in\mathbb{F}_{q^{\ell}}[x],\ \deg(f)<k$ , and ${\bm{c}}_{j^{*}}=f(\bm{\alpha}^{*})$ is a codeword symbol/node of $\vec{\bm{c}}$ , where $\bm{\alpha}^{*}=\bm{\alpha}_{j^{*}}\in A$ . A (linear) trace repair scheme for $f(\bm{\alpha}^{*})$ corresponds to a set of $\ell$ check polynomials $\left\{g_{i}(x)\right\}_{i\in[\ell]}$ , $g_{i}\in\mathbb{F}_{q^{\ell}}[x],\ \deg(g_{i})<n-k$ , that satisfies the Full-Rank Condition: $\mathsf{rank}_{\mathbb{F}_{q}}\{g_{i}(\bm{\alpha}^{*})\}_{i\in[\ell]}=\ell$ . The repair bandwidth of such a repair scheme (in $\mathbb{F}_{q}$ -symbols) is ${\sf{bw}}=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\mathsf{rank}_{\mathbb{F}_{q}}(\{g_{i}(\bm{\alpha})\}_{i\in[\ell]})$ . To repair all $n$ components of $\vec{\bm{c}}$ , we need $n$ such repair schemes (possibly with repetition). See, e.g. [31], for a detailed explanation of why the above scheme works with an example.

III Recovering an Erased Symbol with Side Information

III-A The Problem Description

Let ${\mathcal{C}}$ be an RS $(n,k)$ code over $\mathbb{F}_{q^{\ell}}$ with evaluation points $A\subseteq\mathbb{F}_{q^{\ell}}$ . Suppose that the codeword symbol $f(\bm{\alpha}^{*})$ is erased and needs to be recovered, given a set of $\mathbb{F}_{q}$ -linearly independent combinations of its sub-symbols (elements of $\mathbb{F}_{q}$ ) as side information. Note that for each vector of coefficients $\vec{{\bm{a}}}=(a_{1},\ldots,a_{\ell})\in\mathbb{F}_{q}^{\ell}$ , there exists a $\bm{\beta}\in\mathbb{F}_{q^{\ell}}$ such that the equality $\sum_{i=1}^{\ell}a_{i}\xi_{i}=\mathsf{Tr}(\bm{\beta}\bm{\xi})$ holds for all $\bm{\xi}\in\mathbb{F}_{q^{\ell}}$ . Therefore, we can represent the side information as a set $S=\{\bm{\beta}_{i}\}_{i=1}^{s}$ , where $s=|S|$ , and assume that $S$ is $\mathbb{F}_{q}$ -linearly independent. We assume that the replacement/recovery node already knows $s$ traces $\{\mathsf{Tr}(\bm{\beta}_{i}f(\bm{\alpha}^{*}))\}_{i\in[s]}$ , where $\{\bm{\beta}_{i}\}_{i\in[s]}\subseteq\mathbb{F}_{q^{\ell}}$ is $\mathbb{F}_{q}$ -linearly independent. Equivalently, we can represent the side information as a subspace ${\mathcal{S}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\mathsf{span}_{\mathbb{F}_{q}}(S)$ . We call $S$ the side information set and ${\mathcal{S}}$ the side information subspace. Note that $S$ is a basis of ${\mathcal{S}}$ and $s=\dim({\mathcal{S}})$ .

According to the trace-repair method, it needs to reconstruct some $\ell-s$ traces of $f(\bm{\alpha}^{*})$ , namely, $\{\mathsf{Tr}(\bm{\beta}_{i}f(\bm{\alpha}^{*}))\}_{i\in[s+1,\ell]}$ , referred to as target traces, where $\{\bm{\beta}_{i}\}_{i\in[\ell]}$ is an $\mathbb{F}_{q}$ -basis of $\mathbb{F}_{q^{\ell}}$ . We refer to $T\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\{\bm{\beta}_{i}\}_{i\in[s+1,\ell]}$ as the target set and ${\mathcal{T}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\mathsf{span}_{\mathbb{F}_{q}}(T)$ the target subspace with respect to the side information set $S$ (or the side information subspace ${\mathcal{S}}$ ). We capture this discussion in Proposition 1. Its proof is similar to [16, Thm. 4] and can be found in Appendix V-A.

Proposition 1.

Let $S\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\{\bm{\beta}_{i}\}_{i\in[s]}$ be a linearly independent set and $f(\bm{\alpha}^{*})$ be a symbol of Reed-Solomon code RS $(n,k)$ over $\mathbb{F}_{q^{\ell}}$ , $n\leq q^{\ell}$ . A linear repair scheme for $f(\bm{\alpha}^{*})$ with side information $S$ corresponds to a collection of $\ell-s$ polynomials $\{g_{i}(x)\}_{i\in[s+1,\ell]}\subset\mathbb{F}_{q^{\ell}}[x]$ , where $\deg(g_{i})<n-k$ , $T\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\{g_{i}(\bm{\alpha}^{*})\}_{i\in[s+1,\ell]}$ and $S\cup T$ are linearly independent.

III-B Optimal Repair Bandwidths Only Depend on the Side Information Set Size

We demonstrate below that the optimal repair bandwidth for recovering an erasure depends on $s=|S|$ but not on the specific choice of $S$ . We first need an auxiliary lemma.

Lemma 1.

Given two $\mathbb{F}_{q}$ -subspaces ${\mathcal{S}}$ and ${\mathcal{T}}^{\prime}$ of $\mathbb{F}_{q^{\ell}}$ of dimensions $s$ and $t=\ell-s$ , respectively. Then there exists an element $\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}$ such that ${\mathcal{S}}\oplus{\mathcal{T}}=\mathbb{F}_{q^{\ell}}$ , where ${\mathcal{T}}=\bm{\delta}{\mathcal{T}}^{\prime}$ . Equivalently, given two $\mathbb{F}_{q}$ -linearly independent subsets $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ and $T^{\prime}=\{\bm{\beta}^{\prime}_{i}\}_{i\in[s+1,\ell]}$ of $\mathbb{F}_{q^{\ell}}$ , where $s\in[\ell]$ , there exists $\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}$ such that $S\cup T$ , where $T=\bm{\delta}T^{\prime}$ , forms an $\mathbb{F}_{q}$ -basis of $\mathbb{F}_{q^{\ell}}$ .

Proof.

For each $\bm{\gamma}\in{\mathcal{T}}^{\prime}\setminus\{0\}$ , as $\{\bm{\delta}\bm{\gamma}\colon\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}\}=\mathbb{F}_{q^{\ell}}^{*}$ , there are exactly $q^{s}-1$ such $\bm{\delta}$ so that $\bm{\delta}\bm{\gamma}\in{\mathcal{S}}\setminus\{0\}$ (as $|{\mathcal{S}}|=q^{s}$ ). Let $U\triangleq\{\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}\colon\exists\bm{\gamma}\in{\mathcal{T}}^{\prime}\setminus\{0\}\text{ such that }\bm{\delta}\bm{\gamma}\in S\}$ , then $U=\bigcup_{\bm{\gamma}\in{\mathcal{T}}^{\prime*}}\{\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}\colon\bm{\delta}\bm{\gamma}\in{\mathcal{S}}\}.$ We have

\begin{split}|U|&\leq\sum_{\bm{\gamma}\in{\mathcal{T}}^{\prime*}}|\{\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}\colon\bm{\delta}\bm{\gamma}\in{\mathcal{S}}\}|\\ &=\sum_{\bm{\gamma}\in{\mathcal{T}}^{\prime*}}(q^{s}-1)=(q^{\ell-s}-1)(q^{s}-1)\\ &=(q^{\ell}-1)-(q^{\ell-s}+q^{s}-2)<q^{\ell}-1,\end{split}

for $s\in[\ell]$ . Hence, there exists a $\bm{\delta}\notin U$ , $\bm{\delta}\neq 0$ , satisfying that for every $\bm{\gamma}\in{\mathcal{T}}^{\prime*},\bm{\delta}\bm{\gamma}\notin{\mathcal{S}}$ . Thus, ${\mathcal{S}}\cap\bm{\delta}{\mathcal{T}}^{\prime}=\{0\}$ or $S\cup\bm{\delta}T^{\prime}$ forms a basis of $\mathbb{F}_{q^{\ell}}$ as desired. ∎

Proposition 2.

The minimum repair bandwidth for a codeword symbol of an RS $(n,k)$ over $\mathbb{F}_{q^{\ell}}$ given the side information $S$ depends on $|S|$ but not on the specific choice of its elements.

Proof.

Let $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ and $S^{\prime}=\{\bm{\beta}^{\prime}_{i}\}_{i\in[s]}$ be two different sets of side information for repairing the same codeword symbol $f(\bm{\alpha}^{*})$ . It suffices to show that for every repair scheme for $f(\bm{\alpha}^{*})$ with side information $S^{\prime}$ , there exists a repair scheme for $f(\bm{\alpha}^{*})$ with side information $S$ achieving the same repair bandwidth. To this end, let $\{g_{i}(x)\}_{i\in[s+1,\ell]}\subset\mathbb{F}_{q^{\ell}}[x]$ corresponds to the repair scheme with side information $S^{\prime}$ , i.e. $g_{i}(\bm{\alpha}^{*})=\bm{\beta}^{\prime}_{i}$ , $i\in[s+1,\ell]$ and $\{\bm{\beta}^{\prime}_{i}\}_{i\in[\ell]}$ form an $\mathbb{F}_{q}$ -basis of $\mathbb{F}_{q^{\ell}}$ . According to Lemma 1, there exists $\bm{\delta}\in\mathbb{F}_{q^{\ell}}$ such that $\{\bm{\beta}_{1},\ldots,\bm{\beta}_{s},\bm{\delta}\bm{\beta}^{\prime}_{s+1},\ldots,\bm{\delta}\bm{\beta}^{\prime}_{\ell}\}$ is linearly independent. Therefore, the polynomials $h_{i}(x)\triangleq\bm{\delta}g_{i}(x)$ for all $i\in[s+1,\ell]$ form a repair scheme for $f(\bm{\alpha}^{*})$ with side information $S$ and with the same repair bandwidth as $g_{i}(x)$ ’s. ∎

III-C A Lower Bound on the Bandwidth with Side Information

We provide a lower bound for the repair bandwidth with side information for one erasure in a Reed-Solomon code. The lower bound is similar to those in [15, 16, 17, 31], replacing $q^{\ell}$ by $q^{\ell-s}$ at some places. When $s=0$ , it reduces to the existing bound (no side information). Its proof can be found in Appendix V-B.

Proposition 3.

Any linear repair scheme with side information size $s$ for Reed-Solomon code RS $(A,k)$ over $\mathbb{F}_{q^{\ell}}$ requires a repair bandwidth of at least

t\lfloor b_{\mathsf{AVE}}\rfloor+(n-1-t)\lceil b_{\mathsf{AVE}}\rceil

sub-symbols over $\mathbb{F}_{q}$ , where $n=|A|\leq q^{\ell}$ , $r=n-k$ , $b_{\mathsf{AVE}}$ and $t$ are defined as $b_{\mathsf{AVE}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\log_{q}(\frac{n-1}{T})$ , $T\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\frac{(r-1)(q^{\ell-s}-1)+n-1}{q^{\ell-s}}$ , and $t\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}n-1$ if $b_{\mathsf{AVE}}\in{\mathbb{Z}}$ , $t\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\big{\lfloor}\frac{T-(n-1)q^{-\lceil b_{\mathsf{AVE}}\rceil}}{q^{-\lfloor b_{\mathsf{AVE}}\rfloor}-q^{-\lceil b_{\mathsf{AVE}}\rceil}}\big{\rfloor}$ otherwise.

In some special cases, the lower bound in Proposition 3 can be explicitly computed.

Corollary 1.

Consider a full-length RS $(n=q^{\ell},k)$ code over $\mathbb{F}_{q^{\ell}}$ with $n-k=q^{m}$ for some $1\leq m<\ell$ . Assume that $(\ell-s)\mid\ell$ and $\ell\geq m(\ell-s)$ . Then every linear repair scheme with side information set size $s$ requires a repair bandwidth of at least $(q^{\ell}-1)(\ell-s)-\frac{(q^{\ell-s}-1)(q^{m}-1)}{q-1}$ sub-symbols in $\mathbb{F}_{q}$ .

Proof.

With $n=q^{\ell}$ and $n-k=q^{m}$ , we have

T=\dfrac{(q^{m}-1)(q^{\ell-s}-1)+q^{\ell}-1}{q^{\ell-s}}\vspace{-5pt}

and

b_{\mathsf{AVE}}=\log_{q}\dfrac{q^{\ell}-1}{T}=\log_{q}\bigg{(}\dfrac{q^{\ell}-1}{(q^{m}-1)(q^{\ell-s}-1)+q^{\ell}-1}q^{\ell-s}\bigg{)}.

Next, we show that $\ell-s-1<b_{\mathsf{AVE}}<\ell-s$ . Indeed, the second inequality is obvious because $q^{\ell}-1<(q^{m}-1)(q^{\ell-s}-1)+q^{\ell}-1$ . For the first inequality, we need to show that

\dfrac{q^{\ell}-1}{(q^{m}-1)(q^{\ell-s}-1)+q^{\ell}-1}>\dfrac{1}{q},

which is equivalent to

(q^{\ell}-1)(q-1)>(q^{m}-1)(q^{\ell-s}-1)\Longleftrightarrow\dfrac{q^{\ell}-1}{q^{m}-1}>\dfrac{q^{\ell-s}-1}{q-1},\vspace{-5pt}

which is true because

\dfrac{q^{\ell}-1}{q^{m}-1}\geq\dfrac{q^{m(\ell-s)}-1}{q^{m}-1}=\sum_{i=0}^{\ell-s-1}q^{mi}\geq\sum_{i=0}^{\ell-s-1}q^{i}=\dfrac{q^{\ell-s}-1}{q-1},

noting that either the first or the second inequality must be strict: if $m=1$ (so that the second inequality becomes equality) then the first inequality is strict since $q^{\ell}>q^{\ell-s}=q^{m(\ell-s)}$ . Thus, $b_{\mathsf{AVE}}\notin{\mathbb{Z}}$ and $\lfloor b_{\mathsf{AVE}}\rfloor=\ell-s-1$ and $\lceil b_{\mathsf{AVE}}\rceil=\ell-s$ . Plugging this in the formula for $t$ in Proposition 3 we obtain

t=\left\lfloor\frac{T-(q^{\ell}-1)q^{-\lceil b_{\mathsf{AVE}}\rceil}}{q^{-\lfloor b_{\mathsf{AVE}}\rfloor}-q^{-\lceil b_{\mathsf{AVE}}\rceil}}\right\rfloor=\dfrac{(q^{m}-1)(q^{\ell-s}-1)}{q-1}.

Finally, using Proposition 3 we obtain a lower bound of

	$\displaystyle t(\ell-s-1)+(q^{\ell}-1-t)(\ell-s)=(q^{\ell}-1)(\ell-s)-t$
	$\displaystyle=(q^{\ell}-1)(\ell-s)-(q^{\ell-s}-1)(q^{m}-1)/(q-1)$

sub-symbols over $\mathbb{F}_{q}$ on the repair bandwidth as claimed. ∎

III-D Optimal Subspace-Polynomial-Based Repair Schemes

In this section we investigate the repair bandwidth incurred by the subspace-polynomial repair scheme introduced in [17, 31], which generalizes the trace-polynomial-based scheme in [15, 16], under the new assumption of side information. We show that in contrast to the standard repair problem (with no side information), the repair bandwidth of such a scheme depends on the specific choice of the subspace. In particular, we transform the problem of finding subspace-polynomial repair schemes with minimum bandwidths possible into another one on subspace intersection, which on its own is an intriguing problem.

Before presenting Theorem 1, we note that given side information set $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ , to construct a subspace-polynomial repair scheme, one first needs to find a target set $T=\{\bm{\beta}_{i}\}_{i\in[s+1,\ell]}$ such that $S\cup T$ forms an $\mathbb{F}_{q}$ -basis of $\mathbb{F}_{q^{\ell}}$ (see Proposition 1 and the discussion preceding it). Next, given that $r=n-k\geq q^{m}$ , for some $m<\ell$ , one picks an $m$ -dimensional subspace ${\mathcal{W}}$ of $\mathbb{F}_{q^{\ell}}$ , and form the $\ell-s$ check polynomials $g_{i}(x)\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\frac{L_{\mathcal{W}}\big{(}\bm{\beta}_{i}(x-\bm{\alpha}^{*})\big{)}}{x-\bm{\alpha}^{*}}$ , $i\in[s+1,\ell]$ . Note that $L_{\mathcal{W}}(x)\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\prod_{\bm{\omega}\in{\mathcal{W}}}(x-\bm{\omega})$ is the subspace polynomial which is a linearly map constructed over ${\mathcal{W}}$ and $\ker(L_{\mathcal{W}})={\mathcal{W}}$ . The check polynomials are then used in the trace repair scheme.

Lemma 2.

Consider an RS $(n,k)$ with evaluation points $A\subseteq\mathbb{F}_{q^{\ell}}$ satisfying $n-k\geq q^{m}$ , $m<\ell$ . Consider also a repair scheme with side information of size $s$ , that consists of $\ell-s$ polynomials $\left\{g_{i}(x)\right\}_{i\in[s+1,\ell]}$ , where $g_{i}(x)\triangleq L_{\mathcal{W}}\big{(}\bm{\beta}_{i}(x-\bm{\alpha}^{*})\big{)}/(x-\bm{\alpha}^{*})$ and $T\triangleq\{\bm{\beta}_{i}\}_{i\in[s+1,\ell]}$ is a target set. This scheme has bandwidth (with $|A|-1$ helper nodes)

(|A|-1)(\ell-s)-\textstyle\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\dim\big{(}(\bm{\alpha}-\bm{\alpha}^{*}){\mathcal{T}}\cap{\mathcal{W}}\big{)}

sub-symbols in $\mathbb{F}_{q}$ , where ${\mathcal{T}}\triangleq\mathsf{span}_{\mathbb{F}_{q}}(T)$ .

Proof.

The node storing $f(\bm{\alpha})$ computes $\ell-s$ traces $\mathsf{Tr}\Big{(}\frac{L_{\mathcal{W}}\big{(}\bm{\beta}_{i}(\bm{\alpha}-\bm{\alpha}^{*})\big{)}}{\bm{\alpha}-\bm{\alpha}^{*}}f(\bm{\alpha})\Big{)}$ , $i\in[s+1,\ell]$ . However, due to the linearity of trace, it only needs to send $\mathsf{rank}_{\mathbb{F}_{q}}\big{(}\left\{L_{\mathcal{W}}\big{(}\bm{\beta}_{i}(\bm{\alpha}-\bm{\alpha}^{*})\big{)}\right\}_{i\in[s,\ell+1]}\big{)}$ traces. To compute this rank, let ${\mathcal{U}}\triangleq(\bm{\alpha}-\bm{\alpha}^{*}){\mathcal{T}}=\mathsf{span}_{\mathbb{F}_{q}}\big{(}(\bm{\alpha}-\bm{\alpha}^{*})T\big{)}$ and $\tau\colon{\mathcal{U}}\to\mathbb{F}_{q^{\ell}}$ defined as $\tau(u)=L_{\mathcal{W}}(u)$ for every $u\in{\mathcal{U}}$ . Then $\dim({\mathcal{U}})=\dim({\mathcal{T}})=\ell-s$ and $\ker(\tau)={\mathcal{U}}\cap\ker(L_{\mathcal{W}})={\mathcal{U}}\cap{\mathcal{W}}$ . Using the rank-nullity theorem, we obtain

\begin{split}&\mathsf{rank}_{\mathbb{F}_{q}}\big{(}\left\{L_{\mathcal{W}}\big{(}\bm{\beta}_{i}(\bm{\alpha}-\bm{\alpha}^{*})\big{)}\right\}_{i\in[s,\ell+1]}\big{)}\\ &=\dim\big{(}{\sf{im}}(\tau)\big{)}=\dim({\mathcal{U}})-\dim\big{(}\ker(\tau)\big{)}\\ &=(\ell-s)-\dim\big{(}(\bm{\alpha}-\bm{\alpha}^{*}){\mathcal{T}}\cap{\mathcal{W}}\big{)}.\end{split}

Summing this over all $\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}$ completes the lemma. ∎

Given the side information subspace ${\mathcal{S}}$ , when constructing a subspace-polynomial repair scheme, we have the freedom in selecting relevant target subspace ${\mathcal{T}}$ and ${\mathcal{W}}$ . Hence, one can optimize the bandwidth over such ${\mathcal{T}}$ and ${\mathcal{W}}$ . This is in stark contrast to the case of standard repair without side information, in which any subspace ${\mathcal{W}}$ would lead to the same repair bandwidth [17, 31]. Theorem 1 formalizes this fact.

Theorem 1.

Consider an RS $(A,k)$ , $A\subseteq\mathbb{F}_{q^{\ell}}$ , $|A|=n$ , and $n-k\geq q^{m}$ , $m<\ell$ . Then the minimum bandwidth that a subspace-polynomial repair scheme for $f(\bm{\alpha}^{*})$ ( $\bm{\alpha}^{*}\in A$ ) can achieve, given the side information subspace ${\mathcal{S}}$ , $\dim({\mathcal{S}})=s$ , is

(|A|-1)(\ell-s)-\textstyle\max_{{\mathcal{T}},{\mathcal{W}}}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\dim\big{(}(\bm{\alpha}-\bm{\alpha}^{*}){\mathcal{T}}\cap{\mathcal{W}}\big{)},

Proof.

Follows directly from Proposition 1 and Lemma 2. ∎

Note that Theorem 1 converts the repair bandwidth with side information problem restricted to subspace-polynomial repair scheme to a pure subspace-intersection problem stated below.

(Subspace-Intersection Problem) Given $\bm{\alpha}^{*}\in A\subseteq\mathbb{F}_{q^{\ell}}$ and an $s$ -dimensional subspace ${\mathcal{S}}$ of $\mathbb{F}_{q^{\ell}}$ , find ${\mathcal{T}}$ and ${\mathcal{W}}$ that maximizes the sum $\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\dim\big{(}(\bm{\alpha}-\bm{\alpha}^{*}){\mathcal{T}}\cap{\mathcal{W}}\big{)}$ among all $(\ell-s)$ -dimensional subspaces ${\mathcal{T}}$ and $m$ -dimensional subspaces ${\mathcal{W}}$ satisfying ${\mathcal{S}}\oplus{\mathcal{T}}=\mathbb{F}_{q^{\ell}}$ .

The subspace-intersection problem can be tricky to solve, especially for general $A$ . Therefore, we limit ourselves to the more tractable case when $A\equiv\mathbb{F}_{q^{\ell}}$ , for which optimal repair bandwidths were known for the standard repair setting (without side information) when $q^{\ell}-k\geq q^{m}$ [17, 31]. We also assume that $(\ell-s)\mid\ell$ . With these assumptions, in Corollary 2, we can replace ${\mathcal{T}}$ by $\mathbb{F}_{q^{\ell-s}}$ in the optimization problem. Note that while $\mathbb{F}_{q^{\ell-s}}$ may not be a valid ${\mathcal{T}}$ (i.e $\mathbb{F}_{q^{\ell-s}}\oplus{\mathcal{S}}\neq\mathbb{F}_{q^{\ell}}$ ), by Lemma 1, at least one of its cosets is. Finally, although this corollary provides an upper bound instead of an exact formulation for the bandwidth, later on, using the lower bound in Proposition 3, we can establish optimal repair bandwidths for subspace polynomial schemes in some parameter regimes.

Corollary 2.

Consider a full-length RS $(n=q^{\ell},k)$ with evaluation points $A=\mathbb{F}_{q^{\ell}}$ , where $n-k\geq q^{m}$ , $m<\ell$ . Let $S$ be a side information set with $s=|S|$ and $(\ell-s)\mid\ell$ . Then there exists a subspace-polynomial repair scheme for $f(\bm{\alpha}^{*})$ ( $\bm{\alpha}^{*}\in A$ ), given the side information set $S$ , with repair bandwidth

(q^{\ell}-1)(\ell-s)-\max_{{\mathcal{W}}}\textstyle\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim\big{(}\bm{\gamma}\mathbb{F}_{q^{\ell-s}}\cap{\mathcal{W}}\big{)},

(1)

where the $\max$ is taken over all $m$ -dimensional $\mathbb{F}_{q}$ -subspaces ${\mathcal{W}}$ of $\mathbb{F}_{q^{\ell}}$ .

Proof.

Let ${\mathcal{S}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\mathsf{span}_{\mathbb{F}_{q}}(S)$ be the side information subspace. By Theorem 1, the minimum bandwidth achieved by a subspace-polynomial repair scheme with side information subspace ${\mathcal{S}}$ is

(q^{\ell}-1)(\ell-s)-\max_{{\mathcal{T}},{\mathcal{W}}}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\dim\big{(}(\bm{\alpha}-\bm{\alpha}^{*}){\mathcal{T}}\cap{\mathcal{W}}\big{)},

(2)

where the $\max$ is taken over all $\mathbb{F}_{q}$ -subspaces ${\mathcal{T}}$ and ${\mathcal{W}}$ of $\mathbb{F}_{q^{\ell}}$ with $\dim({\mathcal{T}})=\ell-s$ , ${\mathcal{S}}\oplus{\mathcal{T}}=\mathbb{F}_{q^{\ell}}$ , and $\dim({\mathcal{W}})=m$ . To prove the existence of a repair scheme with bandwidth given by (1), we show that a (multiplicative) coset of $\mathbb{F}_{q^{\ell-s}}$ can be a (valid) target subspace. Indeed, according to Lemma 1, there exists $\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}$ such that ${\mathcal{S}}\oplus\bm{\delta}\mathbb{F}_{q^{\ell-s}}=\mathbb{F}_{q^{\ell}}$ , i.e. $\bm{\delta}\mathbb{F}_{q^{\ell-s}}$ is a valid target subspace w.r.t. the side information subspace ${\mathcal{S}}$ . Hence, setting ${\mathcal{T}}=\bm{\delta}\mathbb{F}_{q^{\ell-s}}$ in (2) and using the assumption that $A=\mathbb{F}_{q^{\ell}}$ , there exists a subspace-polynomial repair scheme given the side information subspace ${\mathcal{S}}$ that achieves the bandwidth

(q^{\ell}-1)(\ell-s)-\max_{{\mathcal{W}}}\sum_{\bm{\alpha}\in\mathbb{F}_{q^{\ell}}\setminus\{\bm{\alpha}^{*}\}}\dim\big{(}(\bm{\alpha}-\bm{\alpha}^{*})\bm{\delta}\mathbb{F}_{q^{\ell-s}}\cap{\mathcal{W}}\big{)},

which is the same as (1) when replacing $(\bm{\alpha}-\bm{\alpha}^{*})\bm{\delta}$ by $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ , noting that $\big{\{}(\bm{\alpha}-\bm{\alpha}^{*})\bm{\delta}\colon\bm{\alpha}\in\mathbb{F}_{q^{\ell}}\setminus\{\bm{\alpha}^{*}\}\big{\}}\equiv\mathbb{F}_{q^{\ell}}^{*}$ . ∎

Using Corollary 2, assuming a full-length code with $(\ell-s)\mid\ell$ , we can now construct a few concrete subspace-polynomial repair schemes that achieve optimal repair bandwidths among all linear schemes. To construct the first repair scheme achieving optimal repair bandwidth, we first prove an auxiliary lemma.

Lemma 3.

For every $a\mid\ell$ , $b\mid\ell$ , and $(a,b)=1$ , and for every $\bm{\gamma},\bm{\delta}\in\mathbb{F}_{q^{\ell}}^{*}$ , it holds that $\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap\bm{\delta}\mathbb{F}_{q^{b}})\in\{0,1\}$ .

Proof.

Note that $\mathbb{F}_{q^{a}}^{*}=\big{\{}\bm{\xi}^{\frac{q^{\ell}-1}{q^{a}-1}}\big{\}}_{i=0}^{q^{a}-2}$ and $\mathbb{F}_{q^{b}}^{*}=\big{\{}\bm{\xi}^{\frac{q^{\ell}-1}{q^{b}-1}}\big{\}}_{i=0}^{q^{b}-2}$ , where $\bm{\xi}$ is a primitive element of $\mathbb{F}_{q^{\ell}}$ . To show that $\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap\bm{\delta}\mathbb{F}_{q^{b}})\in\{0,1\}$ , it suffices to show that for any ${\bm{u}},{\bm{v}}\in\bm{\gamma}\mathbb{F}_{q^{a}}^{*}\cap\bm{\delta}\mathbb{F}_{q^{b}}^{*}$ , it holds that ${\bm{u}}/{\bm{v}}\in\mathbb{F}_{q}$ . Indeed, for such ${\bm{u}}$ and ${\bm{v}}$ , there exist $x$ , $y$ , $z$ , and $w$ such that

\begin{split}{\bm{u}}&=\bm{\gamma}\big{(}\bm{\xi}^{\frac{q^{\ell}-1}{q^{a}-1}}\big{)}^{x}=\bm{\delta}\big{(}\bm{\xi}^{\frac{q^{\ell}-1}{q^{b}-1}}\big{)}^{z},\ {\bm{v}}=\bm{\gamma}\big{(}\bm{\xi}^{\frac{q^{\ell}-1}{q^{a}-1}}\big{)}^{y}=\bm{\delta}\big{(}\bm{\xi}^{\frac{q^{\ell}-1}{q^{b}-1}}\big{)}^{w},\end{split}

which implies that

\frac{{\bm{u}}}{{\bm{v}}}=\big{(}\bm{\xi}^{\frac{q^{\ell}-1}{q^{a}-1}}\big{)}^{x-y}=\big{(}\bm{\xi}^{\frac{q^{\ell}-1}{q^{b}-1}}\big{)}^{z-w}\in\mathbb{F}_{q^{a}}\cap\mathbb{F}_{q^{b}}={\mathbb{F}}_{q^{(a,b)}}=\mathbb{F}_{q}.

The proof follows. ∎

The following theorem indicates the existence of optimal repair schemes for a full-length Reed-Solomon codes with side information size $s$ , where $(\ell-s)|\ell$ . We prove that the existing subspace repair scheme can be constructed from a subfield $\mathbb{F}_{q^{m}}$ of $\mathbb{F}_{q^{\ell}}$ , where $n-k\geq q^{m}$ , $\ell>m\geq 1$ , $m|\ell$ , and $(\ell-s,m)=1$ . However, for any coset of $\mathbb{F}_{q^{m}}$ , the proof is still right.

Theorem 2.

Consider a full-length Reed-Solomon codes RS $(n=q^{\ell},k)$ over $\mathbb{F}_{q^{\ell}}$ with $n-k\geq q^{m}$ for some $\ell>m\geq 1$ . If $(\ell-s)\mid\ell$ , $m\mid\ell$ , and $(\ell-s,m)=1$ , then there exists a linear repair scheme with side information of size $s$ that uses the repair bandwidth of $(q^{\ell}-1)(\ell-s)-(q^{\ell-s}-1)(q^{m}-1)/(q-1)$ sub-symbols in $\mathbb{F}_{q}$ . The scheme is optimal when $n-k=q^{m}$ .

Proof.

Set ${\mathcal{W}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\mathbb{F}_{q^{m}}$ , and let $T\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\{\bm{\beta}_{i}\}_{i=s+1}^{\ell}$ be a basis of ${\mathcal{T}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\mathbb{F}_{q^{\ell-s}}$ . We now consider the subspace repair scheme constructed from ${\mathcal{W}}$ . The statement that the achieved bandwidth is optimal among all linear repair schemes is obvious due to Corollary 1, noting that the assumptions $(\ell-s)\mid\ell$ , $m\mid\ell$ , and $(\ell-s,m)=1$ imply $\ell\geq(\ell-s)m$ . It remains to show that the stated scheme has the stated bandwidth. Indeed, from Lemma 2, it is sufficient to prove that $\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim\big{(}\bm{\gamma}{\mathcal{T}}\cap{\mathcal{W}}\big{)}=\frac{(q^{\ell-s}-1)(q^{m}-1)}{q-1}$ . By Lemma 3, we note that to get the sum $\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim(\bm{\gamma}{\mathcal{T}}\cap\mathbb{F}_{q^{m}})$ , we compute the number of elements $\bm{\gamma}$ so that $\dim(\bm{\gamma}{\mathcal{T}}\cap\mathbb{F}_{q^{m}})=1$ . As the set of $1$ -dimensional intersections $\bm{\gamma}{\mathcal{T}}\cap{\mathcal{W}}$ is a partition of ${\mathcal{W}}$ into $1$ -dimensional subspaces, there are $\frac{|{\mathcal{W}}^{*}|}{q-1}$ such subspaces. Moreover, each of them is repeated $q^{\ell-s}-1$ times, since $\bm{\gamma}^{\prime}{\mathcal{T}}=\bm{\gamma}{\mathcal{T}}$ for all $\bm{\gamma}^{\prime}\in\bm{\gamma}{\mathcal{T}}$ . Thus, the number of elements $\bm{\gamma}$ with $\dim(\bm{\gamma}{\mathcal{T}}\cap\mathbb{F}_{q^{m}})=1$ is $(q^{\ell-s}-1)(q^{m}-1)/(q-1)$ , which completes the proof. ∎

Now we consider a greedy construction that generates $m$ -dimensional subspaces ${\mathcal{W}}$ , $m<\ell$ , that generate subspace-polynomial repair schemes with minimal repair bandwidths. Assume that $a\mid\ell$ . The aim is to construct a subspace ${\mathcal{W}}$ of $\mathbb{F}_{q^{\ell}}$ satisfying $\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}})\in\{0,1\}$ for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ .

Lemma 4.

Assume that $a\mid\ell$ and that $q^{\ell}>\binom{q^{m-1}}{2}\Big{(}\frac{q^{a}-1}{q-1}\Big{)}^{2}+1$ . Then there exists an $m$ -dimensional subspace ${\mathcal{W}}$ satisfying that $\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}})\in\{0,1\}$ for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ .

Proof.

We will construct in a greedy manner a set $\{\bm{w}_{1},\dots,\bm{w}_{m}\}\subset\mathbb{F}_{q^{\ell}}^{*}$ that satisfies two properties given below.

•

(P1) $\{\bm{w}_{1},\dots,\bm{w}_{m}\}$ is $\mathbb{F}_{q}$ -linearly independent, and
•

(P2) the subspace ${\mathcal{W}}_{m}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\mathsf{span}_{\mathbb{F}_{q}}\big{(}\{\bm{w}_{1},\dots,\bm{w}_{m}\}\big{)}$ satisfies that $\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}}_{m})\in\{0,1\}$ for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$

The first element $\bm{w}_{1}$ can be picked arbitrarily in $\mathbb{F}_{q^{\ell}}^{*}$ because ${\mathcal{W}}_{1}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\mathsf{span}_{\mathbb{F}_{q}}(\{w_{1}\})$ satisfies (P1) and (P2) obviously. Assume that we have already had a set $\{\bm{w}_{1},\dots,\bm{w}_{m-1}\}$ that satisfies (P1) and (P2). We now show that we can find $\bm{w}_{m}$ so that $\{\bm{w}_{1},\dots,\bm{w}_{m}\}$ satisfies (P1) and (P2) given that $a\mid\ell$ and $q^{\ell}>\binom{q^{m-1}}{2}\Big{(}\frac{q^{a}-1}{q-1}\Big{)}^{2}+1$ . Consider the set

B_{m-1}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\{\bm{\alpha}_{1}{\bm{u}}+\bm{\alpha}_{2}{\bm{v}}\colon{\bm{u}},{\bm{v}}\in{\mathcal{W}}_{m-1},{\bm{u}}\neq{\bm{v}},\bm{\alpha}_{1},\bm{\alpha}_{2}\in\mathbb{F}_{q^{a}}\}.\vspace{-5pt}

Claim 1: $|B_{m-1}|\leq\binom{q^{m-1}}{2}\Big{(}\frac{q^{a}-1}{q-1}\Big{)}^{2}+1<|\mathbb{F}_{q^{\ell}}|$ .

Claim 2: Any $\bm{w}_{m}\in\mathbb{F}_{q^{\ell}}\setminus B_{m-1}$ satisfies (P1)-(P2).

Proof of Claim 1.

Note that $\bm{\alpha}_{1}=\bm{\alpha}_{2}=0$ gives $0\in B_{m-1}$ . Since $(\tau\bm{\alpha}){\bm{u}}=\bm{\alpha}(\tau{\bm{u}})$ and $\tau{\bm{u}}\in{\mathcal{W}}_{m-1}$ for $\tau\in\mathbb{F}_{q}$ and ${\bm{u}}\in{\mathcal{W}}_{m-1}$ , to count the elements of $B_{m-1}$ corresponding to $\bm{\alpha}_{1}\neq 0$ and $\bm{\alpha}_{2}\neq 0$ , we only need to consider $(q^{a}-1)/(q-1)$ values for each $\bm{\alpha}_{1}$ and $\bm{\alpha}_{2}$ . Moreover, we can ignore the case $\bm{\alpha}_{1}\neq 0$ and $\bm{\alpha}_{2}=0$ or vice versa as the resulting elements are already counted for $\bm{\alpha}_{1}\neq 0$ and $\bm{\alpha}_{2}\neq 0$ when setting either ${\bm{v}}=0$ or ${\bm{u}}=0$ , respectively. Thus, other than $0$ , $B_{m-1}$ has at most $\big{(}\frac{q^{a}-1}{q-1}\big{)}^{2}\binom{q^{m-1}}{2}$ other elements, where the binomial factor counts the number of distinct pairs ${\bm{u}},{\bm{v}}\in{\mathcal{W}}_{m-1}$ . Thus, $|B_{m-1}|\leq\big{(}\frac{q^{a}-1}{q-1}\big{)}^{2}\binom{q^{m-1}}{2}+1$ elements. ∎

Proof of Claim 2.

Since ${\mathcal{W}}_{m-1}\subseteq B_{m-1}$ , $\bm{w}_{m}\notin{\mathcal{W}}_{m-1}$ , which implies (P1). Assume, for the sake of contradiction, that $\dim({\mathcal{W}}_{m}\cap\bm{\gamma}\mathbb{F}_{q^{a}})\geq 2$ for some $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ . Then there exist ${\bm{u}},{\bm{v}}\in{\mathcal{W}}_{m-1}$ , ${\bm{u}}\neq{\bm{v}}$ so that either a) $\{\bm{w}_{m}+{\bm{u}},\bm{w}_{m}+{\bm{v}}\}\subset\bm{\gamma}\mathbb{F}_{q^{a}}$ and $\mathsf{rank}_{\mathbb{F}_{q}}(\{\bm{w}_{m}+{\bm{u}},\bm{w}_{m}+{\bm{v}}\})=2$ , or b) $\{\bm{w}_{m}+{\bm{u}},{\bm{v}}\}\subset\bm{\gamma}\mathbb{F}_{q^{a}}$ and $\mathsf{rank}_{\mathbb{F}_{q}}(\{\bm{w}_{m}+{\bm{u}},{\bm{v}}\})=2$ . If a) occurs, then there exist ${\bm{x}},{\bm{y}}\in\mathbb{F}_{q^{a}}$ , ${\bm{x}}\neq 0$ , ${\bm{y}}\neq 0$ , ${\bm{x}}\neq{\bm{y}}$ , so that $\bm{w}_{m}+{\bm{u}}=\bm{\gamma}{\bm{x}}$ and $\bm{w}_{m}+{\bm{v}}=\bm{\gamma}{\bm{y}}$ , which implies that $\bm{w}_{m}=\frac{{\bm{y}}}{{\bm{x}}-{\bm{y}}}{\bm{u}}+\frac{{\bm{x}}}{{\bm{y}}-{\bm{x}}}{\bm{v}}\in B_{m-1}$ , which contradicts our assumption. The case b) can be treated similarly. ∎

The proof of Lemma 4 follows from these two claims. Indeed, by Claim 1, there exists at least one element in $\mathbb{F}_{q^{\ell}}\setminus B_{m-1}$ , which is the desired $\bm{w}_{m}$ according to Claim 2. ∎

Theorem 3.

Consider a full-length Reed-Solomon codes RS $(n=q^{\ell},k)$ over $\mathbb{F}_{q^{\ell}}$ with $n-k\geq q^{m}$ for some $\ell>m\geq 1$ . If $(\ell-s)\mid\ell$ and $q^{\ell}>\binom{q^{m-1}}{2}\Big{(}\frac{q^{\ell-s}-1}{q-1}\Big{)}^{2}+1$ , then there exists a linear repair scheme with side information of size $s$ that uses the repair bandwidth of $(q^{\ell}-1)(\ell-s)-(q^{\ell-s}-1)(q^{m}-1)/(q-1)$ sub-symbols in $\mathbb{F}_{q}$ . The scheme is optimal when $n-k=q^{m}$ .

Proof.

By Lemma 4, there exists an $m$ -dimensional subspace ${\mathcal{W}}$ satisfying $\dim(\bm{\gamma}\mathbb{F}_{q^{\ell-s}}\cap{\mathcal{W}})\in\{0,1\}$ , for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ . The rest of the proof proceeds similarly to that of Theorem 2. ∎

III-E Bandwidth Reductions Given Side Information

To illustrate the repair bandwidth reduction in the presence of side information, we consider as an example the parameter regime assumed in Theorem 2. Case 1: $s=\ell-1$ and $m=\ell/d$ for some constant $d\geq 2$ . Theorem 2 gives a repair bandwidth with side information ${\sf{bw}_{SI}}=(q^{\ell}-1)-(q^{\ell/d}-1)$ . The optimal repair bandwidth with no side information is ${\sf{bw}}=(q^{\ell-1}-1)(1-1/d)\ell$ (see [17, 31]). Clearly, $\lim_{\ell\to\infty}{\sf{bw}_{SI}}/{\sf{bw}}=0$ . Case 2: $s=c\ell/(c-1)$ , i.e. $\ell-s=\ell/c$ , and $m=\ell/d$ , for some constants $c,d\geq 2$ . Then ${\sf{bw}_{SI}}=(q^{\ell}-1)\ell/c-(q^{\ell/c}-1)(q^{\ell/d}-1)/(q-1)$ , whereas ${\sf{bw}}=(q^{\ell}-1)(d-1)\ell/d$ . Clearly, ${\sf{bw}}-{\sf{bw}_{SI}}\geq(q^{\ell/c}-1)(q^{\ell/d}-1)/(q-1)\to\infty$ as $\ell\to\infty$ .

IV Conclusions

We proposed the problem of repairing a single erasure of Reed-Solomon codes with side information, which generalizes the standard repair problem, and established a lower bound on the repair bandwidth of a linear repair scheme. The problem of constructing optimal subspace-polynomial repair schemes can be reduced to a subspace intersection problem, which is interesting in its own right. We settled this problem for a few parameter regimes, leaving the general case open for future research.

Acknowledgement

The work of Son Hoang Dau was supported by the Australia Research Council DECRA Grant DE180100768 and DP Grant DP200100731. The work of Han Mao Kiah was supported by the Ministry of Education, Singapore, under its MOE AcRF Tier 2 Award under Grant MOE-T2EP20121-0007 and MOE AcRF Tier 1 Award under Grant RG19/23. The work of Stanislav Kruglik was supported by the Ministry of Education, Singapore, under its MOE AcRF Tier 2 Award under Grant MOE-T2EP20121-0007.

References

[1] F. J. MacWilliams and N. J. A. Sloane, The Theory of Error-Correcting Codes. Amsterdam: North-Holland, 1977.
[2] K. V. Rashmi, N. B. Shah, D. Gu, H. Kuang, D. Borthakur, and K. Ramchandran, “A solution to the network challenges of data recovery in erasure-coded distributed storage systems: A study on the Facebook warehouse cluster,” in Proc. USENIX Conf. Hot Topics Storage File Syst. (HotStorage), 2013, pp. 8–8.
[3] A. Dimakis, P. Godfrey, Y. Wu, M. Wainwright, and K. Ramchandran, “Network coding for distributed storage systems,” IEEE Trans. Inform. Theory, vol. 56, no. 9, pp. 4539–4551, 2010.
[4] A. Dimakis, K. Ramchandran, Y. Wu, and C. Suh, “A survey on network codes for distributed storage,” Proc. IEEE, vol. 99, no. 3, pp. 476–489, 2011.
[5] O. Khan, R. Burns, J. Plank, W. Pierce, and C. Huang, “Rethinking erasure codes for cloud file systems: Minimizing i/o for recovery and degraded reads,” in Proc. 13th USENIX Conf. File Storage Technol. (FAST), 2012.
[6] S. B. Balaji, M. N. Krishnan, M. Vajha, V. Ramkumar, B. Sasidharan, and P. V. Kumar, “Erasure coding for distributed storage: an overview,” Science China Information Sciences, vol. 61, no. 10, 2018.
[7] A. Wyner, “On source coding with side information at the decoder,” IEEE Trans. Inform. Theory, vol. 21, no. 3, pp. 294–300, 1975.
[8] G. Keshet, Y. Steinberg, and N. Merhav, Channel Coding in the Presence of Side Information, ser. Foundations and Trends in Communication and Information Theory, 2008.
[9] V. Guruswami, “List decoding with side information,” in Proc. IEEE Annual Conference on Computational Complexity, 2003, pp. 300–309.
[10] Y. Birk and T. Kol, “Coding on demand by an informed source (ISCOD) for efficient broadcast of different supplemental data to caching clients,” IEEE Trans. Inform. Theory, vol. 52, no. 6, pp. 2825–2830, 2006.
[11] Z. Bar-Yossef, Y. Birk, T. S. Jayram, and T. Kol, “Index coding with side information,” IEEE Trans. Inform. Theory, vol. 57, no. 3, pp. 1479–1494, 2011.
[12] S. Kadhe, B. Garcia, A. Heidarzadeh, S. E. Rouayheb, and A. Sprintson, “Private information retrieval with side information,” IEEE Trans. Inform. Theory, vol. 66, no. 4, pp. 2032–2043, 2020.
[13] T. X. Dinh, L. Y. Nhi Nguyen, L. J. Mohan, S. Boztas, T. T. Luong, and S. H. Dau, “Practical considerations in repairing Reed-Solomon codes,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2022, pp. 2607–2612.
[14] K. Shanmugam, D. S. Papailiopoulos, A. G. Dimakis, and G. Caire, “A repair framework for scalar MDS codes,” IEEE J. Selected Areas Comm. (JSAC), vol. 32, no. 5, pp. 998–1007, 2014.
[15] V. Guruswami and M. Wootters, “Repairing Reed-Solomon codes,” in Proc. Annu. Symp. Theory Comput. (STOC), 2016.
[16] ——, “Repairing Reed-Solomon codes,” IEEE Trans. Inform. Theory, vol. 63, no. 9, pp. 5684–5698, 2017.
[17] S. H. Dau and O. Milenkovic, “Optimal repair schemes for some families of Reed-Solomon codes,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2017, pp. 346–350.
[18] S. H. Dau, I. Duursma, H. M. Kiah, and O. Milenkovic, “Repairing Reed-Solomon codes with two erasures,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2017, pp. 351–355.
[19] ——, “Repairing Reed-Solomon codes with multiple erasures,” IEEE Trans. Inform. Theory, vol. 54, no. 10, pp. 6567–6582, 2018.
[20] M. Ye and A. Barg, “Explicit constructions of high-rate MDS array codes with optimal repair bandwidth,” IEEE Trans. Inform. Theory, vol. 63, no. 4, pp. 2001–2014, 2017.
[21] W. Li, Z. Wang, and H. Jafarkhani, “A tradeoff between the sub-packetization size and the repair bandwidth for Reed-Solomon code,” in Proc. 55th Annual Allerton Conf. Comm. Control Comput. (Allerton), 2017, pp. 942–949.
[22] ——, “On the sub-packetization size and the repair bandwidth of Reed-Solomon codes,” IEEE Trans. Inform. Theory, vol. 65, no. 9, pp. 5484–5502, 2019.
[23] ——, “Repairing Reed-Solomon Codes Over $GF(2^{\ell})$ ,” IEEE Comm. Lett., vol. 24, no. 1, pp. 34–37, 2020.
[24] A. Chowdhury and A. Vardy, “Improved schemes for asymptotically optimal repair of MDS codes,” in Proc. 55th Annual Allerton Conf. Comm Control Comput. (Allerton), 2017.
[25] ——, “Improved schemes for asymptotically optimal repair of MDS codes,” IEEE Trans. Inform. Theory, vol. 67, no. 8, pp. 5051–5068, 2021.
[26] I. Tamo, M. Ye, and A. Barg, “Optimal repair of Reed-Solomon codes: Achieving the cut-set bound,” in Proc. 58th Annual IEEE Symp. Foundations Computer Sci. (FOCS), 2017.
[27] ——, “The repair problem for Reed-Solomon codes: Optimal repair of single and multiple erasures with almost optimal node size,” IEEE Trans. Inform. Theory, vol. 65, no. 5, pp. 2673–2695, 2018.
[28] S. H. Dau and E. Viterbo, “Repair schemes with optimal I/O costs for full-length Reed-Solomon codes with two parities,” in Proc. IEEE Inform. Theory Workshop (ITW), 2018, pp. 590–594.
[29] S. H. Dau, I. Duursma, and H. Chu, “On the I/O costs of some repair schemes for full-length Reed-Solomon codes,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2018, pp. 1700–1704.
[30] I. Duursma and S. H. Dau, “Low bandwidth repair of the RS(10,4) Reed-Solomon code,” in Proc. Inform. Theory Applicat. Workshop (ITA), 2017.
[31] S. H. Dau, T. X. Dinh, H. M. Kiah, T. T. Luong, and O. Milenkovic, “Repairing Reed-Solomon codes via subspace polynomials,” IEEE Trans. Inform. Theory, vol. 67, no. 10, pp. 6395–6407, 2021.
[32] W. Li, H. Dau, Z. Wang, H. Jafarkhani, and E. Viterbo, “On the I/O costs in repairing short-length Reed-Solomon codes,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2019, pp. 1087–1091.
[33] T. X. Dinh, S. Boztas, S. H. Dau, and E. Viterbo, “Designing compact repair groups for Reed-Solomon codes,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2023, pp. 2027–2032.
[34] J. Xu, Y. Zhang, K. Wang, and Z. Zhang, “Cooperative repair of Reed-Solomon codes via linearized permutation polynomials,” IEEE Trans. Inform. Theory, pp. 1–11, 2023.
[35] A. Berman, S. Buzaglo, A. Dor, Y. Shany, and I. Tamo, “Repairing Reed–Solomon codes evaluated on subspaces,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2021, pp. 867–871.
[36] R. Con and I. Tamo, “Nonlinear repair of Reed-Solomon codes,” IEEE Trans. Inform. Theory, vol. 68, no. 8, pp. 5165–5177, 2022.
[37] R. Con, N. Shutty, I. Tamo, and M. Wootters, “Repairing Reed-Solomon codes over prime fields via exponential sums,” in Proc. IEEE Int. Symp. Inform. Theory (ISIT), 2023, pp. 1330–1335.
[38] A. W. Marshall, I. Olkin, and B. C. Arnold, Inequalities: Theory of Majorization and Its Applications, ser. Springer Series in Statistics, 2011.

V Appendix

V-A Proof of Proposition 1

The first part of this appendix is devoted for the discussion on the definition and the existence of a linear repair scheme for a failed node with side information of size $s$ of Reed-Solomon code RS $(n,k)$ . Similar to an (exact) linear repair scheme for a failed node with standard repair, a linear repair scheme for a node with side information of size $s$ is described by elements $\bm{\gamma}$ ’s used in each trace along with a linear algorithm.

We first propose the definition of a linear repair scheme with side information of size $s$ which is modeled after the definition of a linear repair scheme with the standard repair in [15].

Definition 2.

A linear repair scheme with side information $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ for a symbol $f(\bm{\alpha}^{*})$ of Reed-Solomon code RS $(n,k)$ with evaluation point set $A$ , $|A|=n$ , over the coding field $\mathbb{F}_{q^{\ell}}$ and the base field $\mathbb{F}_{q}$ consists of
$\bullet$ a set $Q_{\bm{\alpha}}\subset\mathbb{F}_{q^{\ell}}$ , for each $\bm{\alpha}\neq\bm{\alpha}^{*}$ , and
$\bullet$ $\ell-s$ coefficients $\eta_{i}\in\mathbb{F}_{q},i\in[s+1,\ell]$ , where $\eta_{i}$ ’s are $\mathbb{F}_{q}$ -linear coefficients of the queries $\cup_{\bm{\alpha}\in A\setminus\{0\}}\{\mathsf{Tr}(\bm{\gamma}f(\bm{\alpha})):\bm{\gamma}\in Q_{\bm{\alpha}}\}$ so that there is a linear reconstruction algorithm that computes $f(\bm{\alpha}^{*})=\sum_{i\in[\ell]}\eta_{i}\bm{\nu}_{i}$ , where $\eta_{i}=\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}$ , $i\in[s]$ , which are already known from the side information $S$ , and $\{\bm{\nu}_{1},\dots,\bm{\nu}_{\ell}\}$ is an $\mathbb{F}_{q}$ -basis of $\mathbb{F}_{q^{\ell}}$ . The repair bandwidth $b$ is the total number of sub-symbols in $\mathbb{F}_{q}$ returned by each node $\bm{\alpha}$ , i.e., $b=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\mid Q_{\bm{\alpha}}\mid$ .

Lemma 5.

Suppose there is a linear repair scheme for repairing $f(\bm{\alpha}^{*})$ of RS $(A,k)$ with side information $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ given by a set $\{Q_{\bm{\alpha}}\}_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}$ and a linear algorithm as in Definition 2. Then, there is an $\mathbb{F}_{q}$ -linearly independent set $B=\{\bm{\beta}_{i}\}_{i\in[s+1,\ell]}$ so that there are elements $\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}$ satisfying $\bm{\beta}_{i}f(\bm{\alpha}^{*})=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})$ , for all $f\in\mathbb{F}_{q^{\ell}}[x]$ of degree less than $k$ , where $\{\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}\}_{\bm{\beta}_{i}\in B}\subseteq\mathsf{span}_{\mathbb{F}_{q}}(Q_{\bm{\alpha}})$ .

Proof.

Suppose $B=\{\bm{\beta}_{s+1},\dots,\bm{\beta}_{\ell}\}$ is an $\mathbb{F}_{q}$ -linearly independent set of $\mathbb{F}_{q^{\ell}}$ so that $\{\bm{\beta}_{1},\dots,\bm{\beta}_{\ell}\}$ is a basis of $\mathbb{F}_{q^{\ell}}$ and $\{\bm{\nu}_{1},\dots,\bm{\nu}_{\ell}\}$ is its trace dual basis. According to Definition 2, the linear repair algorithm computes coefficients $\eta_{i}\in\mathbb{F}_{q}$ so that $f(\bm{\alpha}^{*})=\sum_{i\in[\ell]}\eta_{i}\bm{\nu}_{i}$ , where $\eta_{i}=\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}$ for $i\in[s]$ , and $\eta_{i}$ ’s, $i\in[s+1,\ell]$ , are $\mathbb{F}_{q}$ -linear functions of the queries in $\big{\{}\mathsf{Tr}\big{(}\bm{\gamma}f(\bm{\alpha})\big{)}:\bm{\gamma}\in\cup_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}Q_{\bm{\alpha}}\big{\}}$ , i.e., for $i\in[s+1,\ell]$ ,

\begin{split}\eta_{i}&=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\sum_{\bm{\gamma}\in Q_{\bm{\alpha}}}\omega_{\bm{\alpha},\bm{\gamma}}\mathsf{Tr}\big{(}\bm{\gamma}f(\bm{\alpha})\big{)}\\ &=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\mathsf{Tr}\big{(}\big{(}\sum_{\bm{\gamma}\in Q_{\bm{\alpha}}}\omega_{\bm{\alpha},\bm{\gamma}}\bm{\gamma}\big{)}f(\bm{\alpha})\big{)}\\ &=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\mathsf{Tr}\big{(}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})\big{)}\\ &=\mathsf{Tr}\big{(}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})\big{)},\end{split}

for $\omega_{\bm{\alpha},\bm{\gamma}}\in\mathbb{F}_{q}$ , and $\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\sum_{\bm{\gamma}\in Q_{\bm{\alpha}}}\omega_{\bm{\alpha},\bm{\gamma}}\bm{\gamma}\in\mathsf{span}_{\mathbb{F}_{q}}(Q_{\bm{\alpha}})$ . Furthermore, $\mathsf{Tr}\big{(}\beta_{i}f(\bm{\alpha}^{*})\big{)}=\mathsf{Tr}\big{(}\bm{\beta}_{i}\sum_{j\in[\ell]}\eta_{j}\bm{\nu}_{j}\big{)}=\sum_{j\in[\ell]}\eta_{j}\mathsf{Tr}(\bm{\beta}_{i}\bm{\gamma}_{j})=\eta_{i}$ , $i\in[s+1,\ell]$ . Then, for $i\in[s+1,\ell]$ ,

\mathsf{Tr}\big{(}\beta_{i}f(\bm{\alpha}^{*})\big{)}=\mathsf{Tr}\big{(}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})\big{)}.

(3)

The Equation 3 holds for all polynomials $f\in\mathbb{F}_{q^{\ell}}[x]$ , $\deg(f)<k$ , then for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ and for all $i\in[s+1,\ell]$ it still holds, i.e., $\mathsf{Tr}\big{(}\bm{\gamma}\beta_{j}f(\bm{\alpha}^{*})\big{)}=\mathsf{Tr}\big{(}\bm{\gamma}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha}^{*})\big{)}$ , which derives to $\beta_{i}f(\bm{\alpha}^{*})=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})$ . This completes the proof. ∎

Lemma 5 ensures for the existence of a linear algorithm to repair a failed node $f(\bm{\alpha}^{*})$ once there exists a linear repair scheme by ensuring the existence of the set $\{\bm{\beta}_{i}\}_{i\in[s+1,\ell]}$ and the elements $\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}$ , for $i\in[s+1,\ell]$ and $\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}$ . We now propose a linear algorithm to repair $f(\bm{\alpha}^{*})$ of RS $(A,k)$ with side information $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ , which is modeled after the linear algorithm to repair RS $(A,k)$ in [15].

Algorithm 1.

Linear repair with side information $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ . Input: A set $A$ of evaluation points, a point $\bm{\alpha}^{*}\in A$ of the failed node $f(\bm{\alpha}^{*})$ , the $s$ traces $\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}$ corresponding to the side information $S=\{\bm{\beta}_{i}\}_{i\in[s]}$ , the access to linear queries of the form $\mathsf{Tr}(\bm{\gamma}f(\bm{\alpha}))$ , for all $\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}$ .
Output: the value $f(\bm{\alpha}^{*})$ .
Steps:

1.

Choose a linearly independent set $B=\{\beta_{i}\}_{i\in[s+1,\ell]}$ .
2.

Choose elements $\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}\in\mathbb{F}_{q^{\ell}}$ for each pair of $\bm{\beta}_{i}\in B$ and $\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}$ so that $\beta_{i}f(\bm{\alpha}^{*})=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})$ .
3.

for $\bm{\beta}_{i}\in B$ do
4.

Choose an arbitrary spanning set $Q_{\bm{\alpha}}$ for the set $\{\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}\}_{i\in[s+1,\ell]}$ and get the queries $\mathsf{Tr}\big{(}\bm{\gamma}f(\bm{\alpha})\big{)}$ , $\bm{\gamma}\in Q_{\bm{\alpha}}$ .
5.

Compute $\mathsf{Tr}\big{(}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})\big{)}$ for each $\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}$ through the traces $\mathsf{Tr}(\bm{\gamma}f(\bm{\alpha}))$ ’s.
6.

Compute $\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}=\mathsf{Tr}\big{(}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\bm{\mu}_{\bm{\beta}_{i},\bm{\alpha}}f(\bm{\alpha})\big{)},$ $i\in[s+1,\ell]$ , by taking the trace of both sides of the equation in Step 2.
7.

end
8.

Compute $f(\bm{\alpha}^{*})$ from $\{\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}\}_{i\in[\ell]}$ :

$f(\bm{\alpha}^{*})=\sum_{i\in[\ell]}\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}\bm{\nu}_{i},$

where $\{\bm{\nu}_{1},\dots,\bm{\nu}_{\ell}\}$ is the dual basis of $\{\bm{\beta}_{1},\dots,\bm{\beta}_{\ell}\}$ .

The following proof of Proposition 1 indicates that a linear repair scheme for a node with side information size $s$ of a code RS $(n,k)$ is equivalent to a set of $\ell-s$ polynomials of degree less than $n-k$ .

Proof of Proposition 1.

Supposing that $\{\bm{\nu}_{i}\}_{i=1}^{\ell}$ is the trace-dual basis of $\{\bm{\beta}_{i}\}_{i=1}^{\ell}$ , where $\{b_{i}\}_{i\in[s]}=S$ and $\{b_{i}\}_{i=s+1}^{\ell}=T$ . Supposing that $f(\bm{\alpha}^{*})=\sum_{i=1}^{\ell}\eta_{i}\bm{\nu}_{i}.$ According Lemma 5, the work of defining $f(\bm{\alpha}^{*})$ with side information $S$ is now the work of defining $\ell-s$ coefficients $\eta_{i}$ , $i\in[s+1,\ell]$ , and $\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}=\eta_{i}$ . This means that to define $\eta_{i}$ , $i\in[s+1,\ell]$ , it is enough to find $\mathsf{Tr}\big{(}\bm{\beta}_{i}f(\bm{\alpha}^{*})\big{)}$ , or $\mathsf{Tr}\big{(}g_{i}(\bm{\alpha}^{*})f(\bm{\alpha}^{*})\big{)}$ , $i\in[s+1,\ell]$ . Each polynomial $g_{i}(x)$ , $i\in[s+1,\ell]$ , of degree less than $n-k$ corresponds to a dual codeword of the Reed-Solomon codes RS $(A,k)$ , which returns $\sum_{j=1}^{n}g_{i}(\bm{\alpha}_{j})\lambda_{j}f(\bm{\alpha}_{j})=0$ , where $\lambda=(\lambda_{1},\dots,\lambda_{n})\in\mathbb{F}_{q^{\ell}}^{n}$ . Then, $g_{i}(\bm{\alpha}^{*})\lambda^{*}f(\bm{\alpha}^{*})=-\sum_{\bm{\alpha}_{j}\neq\bm{\alpha}^{*}}g_{i}(\bm{\alpha}_{j})\lambda_{j}f(\bm{\alpha}_{j})$ , which is equivalent to $g_{i}(\bm{\alpha}^{*})f(\bm{\alpha}^{*})=-\sum_{\bm{\alpha}_{j}\neq\bm{\alpha}^{*}}g_{i}(\bm{\alpha}_{j})\frac{\lambda_{j}}{\lambda^{*}}f(\bm{\alpha}_{j})$ . Applying the trace function on two sides of this equation we get $\mathsf{Tr}\big{(}g_{i}(\bm{\alpha}^{*})f(\bm{\alpha}^{*})\big{)}=-\sum_{\bm{\alpha}_{j}\neq\bm{\alpha}^{*}}\mathsf{Tr}\big{(}g_{i}(\bm{\alpha}_{j})\frac{\lambda_{j}}{\lambda^{*}}f(\bm{\alpha}_{j})\big{)}$ . In conclusion, each $\eta_{i}$ , $i\in[s+1,\ell]$ , can be computed through the traces $\mathsf{Tr}\big{(}g_{i}(\bm{\alpha}_{j})\frac{\lambda_{j}}{\lambda^{*}}f(\bm{\alpha}_{j})\big{)},\bm{\alpha}_{j}\neq\bm{\alpha}^{*}$ , which can totally be defined through the polynomials $g_{i}(x)$ , $i\in[s+1,\ell]$ . ∎

V-B Proof of Proposition 3

Proof of Proposition 3.

According to Proposition 1, a linear repair scheme with side information size $s$ for a failed node $f(\bm{\alpha}^{*})$ corresponds to a set of $\ell-s$ polynomials. Supposing that the repair scheme is the polynomial set $\{g_{i}(x)\}_{i\in[s+1,\ell]}$ , where $\mathsf{rank}_{\mathbb{F}_{q}}\big{(}\{g_{s+1}(\bm{\alpha}^{*}),\dots,g_{\ell}(\bm{\alpha}^{*})\}\big{)}=\ell-s$ and $\mathsf{rank}_{\mathbb{F}_{q}}\big{(}\{g_{s+1}(\bm{\alpha}),\dots,g_{\ell}(\bm{\alpha})\}\big{)}=b_{\bm{\alpha}}$ , for all $\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}$ . Therefore, the repair bandwidth of the repair scheme is $b=\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}b_{\bm{\alpha}}$ . For each $\bm{\alpha}\in A$ , let $S_{\bm{\alpha}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\{\vec{e}=(e_{s+1},\dots,e_{\ell})\in\mathbb{F}_{q}^{\ell-s}:\sum_{i\in[s+1,\ell]}e_{i}g_{i}(\bm{\alpha})=0\}$ . Since $\mathsf{rank}_{\mathbb{F}_{q}}\big{(}\{g_{i}(\bm{\alpha})\}_{i\in[s+1,\ell]}\big{)}=b_{\bm{\alpha}}$ , $\dim_{\mathbb{F}_{q}}(S_{\bm{\alpha}})=\ell-s-b_{\bm{\alpha}}$ . Averaging the size $|\{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}:\vec{e}\in S_{\bm{\alpha}}\}|$ over all nonzero vectors $\vec{e}\in\mathbb{F}_{q}^{\ell-s}$ , we have

\begin{split}&\mu\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\frac{1}{q^{\ell-s}-1}\sum_{\vec{e}\in\mathbb{F}_{q}^{\ell-s}\setminus\{\vec{0}\}}\mid\{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}:\vec{e}\in S_{\bm{\alpha}}\}\mid\\ &=\frac{1}{q^{\ell-s}-1}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\mid\{\vec{e}\in\mathbb{F}_{q}^{\ell-s}\setminus\{\vec{0}\}:\vec{e}\in S_{\bm{\alpha}}\}\mid\\ &=\frac{1}{q^{\ell-s}-1}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}\big{(}q^{\ell-s-b_{\bm{\alpha}}}-1\big{)}.\end{split}

Then, there exists some $\vec{e}^{*}=(e_{s+1}^{*},\dots,e_{\ell}^{*})\in\mathbb{F}_{q}^{\ell-s}\setminus\{0\}$ such that $|\{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}:\vec{e}^{*}\in S_{\bm{\alpha}}\}|\geq\mu$ . Let $g(x)\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\sum_{i\in[s+1,\ell]}e_{i}^{*}g_{i}(x)$ , $g(x)$ vanishes on at least $\mu$ elements of $A\setminus\{\bm{\alpha}^{*}\}$ . Furthermore, it follows from $\{g_{i}(\bm{\alpha}^{*})\}_{i\in[s+1,\ell]}$ is linearly independent and $\vec{e}^{*}\neq 0$ that $g(\bm{\alpha}^{*})\neq 0$ . Hence, $g(x)$ corresponds to a nonzero dual codeword of RS $(A,k)$ and has at most $r-1$ roots, where $r=n-k$ . Then, $\mu\leq r-1$ , which allows that

\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}q^{-b_{\bm{\alpha}}}\leq\frac{(r-1)(q^{\ell-s}-1)+n-1}{q^{\ell-s}}.

(4)

Put

T\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\frac{(r-1)(q^{\ell-s}-1)+n-1}{q^{\ell-s}},b_{\mathsf{AVE}}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\log\frac{n-1}{T}.

Let

b_{\min}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\min_{b_{\bm{\alpha}}\in\{0,\dots,\ell-s\}}\sum_{\bm{\alpha}\in A\setminus\{\bm{\alpha}^{*}\}}b_{\bm{\alpha}}

(5)

subject to (4).

The minimum occurs when $b_{\bm{\alpha}}$ ’s are balanced and equal to $b_{\mathsf{AVE}}$ . Supposing that $t$ is the biggest integer satisfying

b_{1}^{*}=\dots=b_{t}^{*}=\lfloor b_{\mathsf{AVE}}\rfloor,b_{t+1}^{*}=\dots=b_{n-1}^{*}=\lceil b_{\mathsf{AVE}}\rceil,

where $\sum_{i\in[n-1]}q^{-b_{i}^{*}}\leq T$ , and $(b_{1}^{*},\dots,b_{n-1}^{*})$ is an optimal solution for (5). To obtain this solution, the $``$ balancing $"$ procedure as in [31] is applied. The computation for $t$ is easily obtained. Then, we get the lower bound as desired. ∎

V-C A discussion on the subspace intersections with the lowest repair bandwidth

A condition for an $m$ -dimensional subspace ${\mathcal{W}}$ so that the repair scheme constructed from this subspace by Lemma 2 and Corollary 2 obtains the minimal repair bandwidth among all $m$ -dimensional $\mathbb{F}_{q}$ -subspaces is that the sum $\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim(\bm{\gamma}\mathbb{F}_{q^{\ell-s}}\cap{\mathcal{W}})$ achieves the maximal value among all $m$ -dimensional $\mathbb{F}_{q}$ -subspaces. One concrete consideration for obtaining the maximal sum is the case when the intersection subspaces have dimension $0$ or $1$ . More particularly, for a parameter $m$ , if there exists an $m$ -dimensional subspaces ${\mathcal{W}}_{0}$ with $\dim(\bm{\gamma}\mathbb{F}_{q^{\ell-s}}\cap{\mathcal{W}}_{0})\in\{0,1\}$ , for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ , then a sufficient condition for an arbitrary $m$ -dimensional subspace ${\mathcal{W}}$ used to construct subspace polynomial repair scheme that obtains the lowest repair bandwidth, i.e., the sum $\sum_{\bm{\alpha}\in\mathbb{F}_{q^{\ell}}\setminus\{\bm{\alpha}^{*}\}}\dim\big{(}(\bm{\alpha}-\bm{\alpha}^{*})\bm{\delta}\mathbb{F}_{q^{\ell-s}}\cap{\mathcal{W}}\big{)}$ achieves maximal value among all subspaces dimension $m$ , is also the condition that $\dim(\bm{\gamma}\mathbb{F}_{q^{\ell-s}}\cap{\mathcal{W}})\in\{0,1\}$ , for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ . Moreover, for the codes RS $(n,k)$ , where $n-k=q^{m}$ , this condition is the necessary and sufficient condition for the repair scheme constructed by ${\mathcal{W}}$ obtaining the optimal repair bandwidth. The repair schemes constructed in Theorems 2 and Theorem 3 are of this consideration. We will make the above discussion clearer in Corollary 3. Since our proof for the conclusion of subspace intersection of dimensions $0$ or $1$ is based on the majorization of two real number sequences, we first recall some basic results on this problem. For two sequences of real numbers $x=(x_{1},\dots,x_{p})$ and $x^{\prime}=(x^{\prime}_{1},\dots,x^{\prime}_{p})$ , supposing that $x_{1}\geq\dots\geq x_{p}$ and $x^{\prime}_{1}\geq\dots\geq x^{\prime}_{p}$ , we say that $x$ is majorized by $x^{\prime}$ or $x^{\prime}$ majorizes $x$ if $\sum_{i=1}^{p}x_{i}=\sum_{i=1}^{p}x^{\prime}_{i}$ and $\sum_{i=1}^{j}x_{i}\leq\sum_{i=1}^{j}x^{\prime}_{i}$ , for all $j\in[p-1]$ [38, A.1, p.8].

Lemma 6.

[38, B.1, p.156] The inequality $\sum_{i=1}^{p}\phi(x_{i})\leq\sum_{i=1}^{p}\phi(x^{\prime}_{i})$ is satisfied for all continuous convex function $\phi:\mathbb{R}\rightarrow\mathbb{R}$ if and only if $x$ is majorized by $x^{\prime}$ .

Now we have the condition to get maximal value for the sum of intersection dimensions. Supposing that $p\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}\frac{q^{\ell}-1}{q^{a}-1}$ , which is the number of disjoint cosets of $\mathbb{F}_{q^{a}}^{*}$ in $\mathbb{F}_{q^{\ell}}^{*}$ . Since each pair of cosets are completely coincided or disjoint, and for all $\bm{\gamma}^{\prime}\in\bm{\gamma}\mathbb{F}_{q^{a}}^{*},\bm{\gamma}^{\prime}\mathbb{F}_{q^{a}}^{*}=\bm{\gamma}\mathbb{F}_{q^{a}}^{*}$ , we only need to consider the sums over $p$ disjoint cosets with the value of each dimension in each sum repeated $|\mathbb{F}_{q^{a}}^{*}|=q^{a}-1$ times. We have the following proposition.

Proposition 4.

Let $\bm{\gamma}_{1}\mathbb{F}_{q^{a}}^{*},\dots,\bm{\gamma}_{p}\mathbb{F}_{q^{a}}^{*}$ are $p$ disjoint cosets and the two sequences $d\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}(d_{1},\dots,d_{p})$ , $d^{\prime}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}(d^{\prime}_{1},\dots,d^{\prime}_{p})$ are the dimensions of the intersection of subspaces $\bm{\gamma}_{i}\mathbb{F}_{q^{a}}$ of these cosets with ${\mathcal{W}}$ and ${\mathcal{W}}^{\prime}$ , respectively. Without lost of generality, we can suppose that $d_{1}\geq\dots\geq d_{p}$ and $d^{\prime}_{1}\geq\dots\geq d^{\prime}_{p}$ . Let $x\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}(x_{1},\dots,x_{p})$ and $x^{\prime}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}(x^{\prime}_{1},\dots,x^{\prime}_{p})$ , where $x_{i}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}q^{d_{i}}-1$ if $d_{i}>0$ and $x_{i}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}0$ if $d_{i}=0$ , and $x^{\prime}_{i}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}q^{d^{\prime}_{i}}-1$ if $d^{\prime}_{i}>0$ and $x^{\prime}_{i}\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}0$ if $d^{\prime}_{i}=0$ . Then, if $x$ is majorized by $x^{\prime}$ then $\sum_{i=1}^{p}d_{i}\geq\sum_{i=1}^{p}d^{\prime}_{i}$ .

Proof.

Since $x$ is majorized by $x^{\prime}$ and $d_{i}=-\phi(x_{i}),d^{\prime}_{i}=-\phi(x^{\prime}_{i})$ , where $\phi(t)\stackrel{{\scriptstyle\mbox{\tiny$\triangle$}}}{{=}}-\log_{q}(t+1)$ is a continuous convex function over $[0,+\infty)$ . The proof is completed by applying Lemma 6 for $x$ and $x^{\prime}$ and the computation of $d_{i}$ and $d^{\prime}_{i}$ through function $\phi(t)$ . ∎

Corollary 3.

Let $a|\ell$ , ${\mathcal{W}}$ and ${\mathcal{W}}^{\prime}$ are two $m$ -dimensional subspaces of $\mathbb{F}_{q^{\ell}}$ where $\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}})\in\{0,1\}$ , for all $\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}$ . Then, $\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}})\geq\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}}^{\prime})$ . Moreover, $\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}})=\frac{(q^{a}-1)(q^{m}-1)}{q-1}$ , and if there exists $\bm{\gamma}^{\prime}\in\mathbb{F}_{q^{\ell}}^{*}$ so that $\dim(\bm{\gamma}^{\prime}\mathbb{F}_{q^{a}}\cap{\mathcal{W}}^{\prime})>1$ then $\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}})>\sum_{\bm{\gamma}\in\mathbb{F}_{q^{\ell}}^{*}}\dim(\bm{\gamma}\mathbb{F}_{q^{a}}\cap{\mathcal{W}}^{\prime})$ .

Proof.

The proof is completed by applying Proposition 4 and Lemma 6 for two sequences $x$ , $x^{\prime}$ in the special case where $d_{i}\in\{0,1\}$ , for all $i\in[p]$ . When all the intersections $\bm{\gamma}\mathbb{F}_{q^{a}}^{*}\cap{\mathcal{W}}$ is of dimension $0$ or $1$ , the set of $1$ -dimensional intersections is a partition of ${\mathcal{W}}$ , which allows that the number of these subspaces is $\frac{q^{m}-1}{q-1}$ . Since each of the intersection is repeated $q^{a}-1$ times, the total of dimensions is $\frac{(q^{a}-1)(q^{m}-1)}{q-1}$ . If there exists $d^{\prime}_{j}>1$ , for some $j\in[p]$ , then the strict inequality is achieved. ∎

Repairing Reed-Solomon Codes with Side Information

Abstract

I Introduction

II Preliminaries

II-A Definitions and Notations

Definition 1.

II-B Trace Repair Method

III Recovering an Erased Symbol with Side Information

III-A The Problem Description

Proposition 1.

III-B Optimal Repair Bandwidths Only Depend on the Side Information Set Size

Lemma 1.

Proof.

Proposition 2.

Proof.

III-C A Lower Bound on the Bandwidth with Side Information

Proposition 3.

Corollary 1.

Proof.

III-D Optimal Subspace-Polynomial-Based Repair Schemes

Lemma 2.

Proof.

Theorem 1.

Proof.

Corollary 2.

Proof.

Lemma 3.

Proof.

Theorem 2.

Proof.

Lemma 4.

Proof.

Proof of Claim 1.

Proof of Claim 2.

Theorem 3.

Proof.

III-E Bandwidth Reductions Given Side Information

IV Conclusions

Acknowledgement

References

V Appendix

V-A Proof of Proposition 1

Definition 2.

Lemma 5.

Proof.

Algorithm 1.

Proof of Proposition 1.

V-B Proof of Proposition 3

Proof of Proposition 3.

V-C A discussion on the subspace intersections with the lowest repair bandwidth

Lemma 6.

Proposition 4.

Proof.

Corollary 3.

Proof.

Repairing Reed-Solomon Codes
with Side Information