This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Matrix Product Density Operators: when do they have a local parent Hamiltonian?

Chi-Fang (Anthony) Chen Institute for Quantum Information and Matter
California Institute of Technology, Pasadena, CA 91125, USA
   Kohtaro Kato Institute for Quantum Information and Matter
California Institute of Technology, Pasadena, CA 91125, USA
Center for Quantum Information and Quantum Biology,
Institute for Open and Transdisciplinary Research Initiatives,
Osaka University, Osaka 560-8531, Japan
   Fernando G.S.L. Brandão AWS Center for Quantum Computing, Pasadena, CA Institute for Quantum Information and Matter
California Institute of Technology, Pasadena, CA 91125, USA
Abstract

We study whether one can write a Matrix Product Density Operator (MPDO) as the Gibbs state of a quasi-local parent Hamiltonian. We conjecture this is the case for generic MPDO and give supporting evidences. To investigate the locality of the parent Hamiltonian, we take the approach of checking whether the quantum conditional mutual information decays exponentially. The MPDO we consider are constructed from a chain of 1-input/2-output (‘Y-shaped’) completely-positive maps, i.e. the MPDO have a local purification. We derive an upper bound on the conditional mutual information for bistochastic channels and strictly positive channels and show that it decays exponentially if the correctable algebra of the channel is trivial.

We also introduce a conjecture on a quantum data processing inequality that implies the exponential decay of the conditional mutual information for every Y-shaped channel with trivial correctable algebra. We additionally investigate a close but nonequivalent cousin: MPDO measured in a local basis. We provide sufficient conditions for the exponential decay of the conditional mutual information of the measured states and numerically confirm they are generically true for certain random MPDO.

I Introduction

Tensor networks provide useful ansatz for quantum many-body systems. In one-dimensional (1D) systems, the ground states of gapped local Hamiltonians can be efficiently approximated by Matrix Product States (MPS) Affleck1987 ; Perez-Garcia2007 ; Hastings2007 . For the converse, generic MPS (which are technically called injective MPS) always have a gapped, local, and frustration-free parent Hamiltonian whose unique ground state is the MPS Fannes1992 ; Perez-Garcia2007 . This correspondence between MPS and its parent Hamiltonian establishes a deep connection with gapped quantum systems, leading to the complete classification of 1D gapped quantum phases Schuch2011 . For higher-dimensional systems, Projected Entangled Pair States (PEPS) are a natural generalization of MPS. PEPS have been used successfully to study gapped ground states Affleck1987 ; verstraete2004renormalization . Although the structural characterization of PEPS has not been completely established, local parent Hamiltonians also exist for injective and semi-injective PEPS Molnar2018 .

Matrix Product Density Operators (MPDO) are generalizations of MPS to describe 1D mixed states. In Ref. Hastings2006 , Hastings showed that any 1D Gibbs state of a local Hamiltonian (local Gibbs state in short) can be well-approximated by an MPDO with a polynomial bond dimension. This result justifies MPDO as a successful ansatz to study Gibbs states Verstraete2004aa .

As a generic MPS is the ground state of the parent Hamiltonian, one may anticipate that a “generic” MPDO could be written as the Gibbs state of a local parent Hamiltonian. However, a fundamental obstacle to the latter question is the lack of a handy characterization of a generic MPDO. While Matrix Product Operators (MPO) are quite analogous to MPS, it is computationally hard to decide its positivity PhysRevLett.113.160503 . Therefore, to make progress, one needs to hand-pick some parameterizable family of MPDO to begin with.

Indeed, Cirac et al. analyzed MPDO at certain “renormalization fixed-points”, and showed that these fixed-point MPDO are Gibbs states of nearest-neighbor commuting Hamiltonians Cirac2017 . Unfortunately, unlike MPS, a renormalization operation transforming a given MPDO to these fixed-point MPDO has not been well-defined yet.

In this paper, we will focus on a physically motivated family: locally purifiable MPDO. These models arise from condensed matter physics, as they appear as the reduced state of MPS or a boundary state of PEPS. Technically, these models are chains of CP maps that connect to information theory. This structure is what enables much of our subsequent discussion.

Given the family of MPDO, our technical approach is to study whether the conditional mutual information (CMI) decays exponentially. The CMI I(A:C|B)ρI(A:C|B)_{\rho} is a function defined for a tripartite state ρABC\rho_{ABC} as

I(A:C|B)ρ:=S(AB)ρ+S(BC)ρS(B)ρS(ABC)ρ,I(A:C|B)_{\rho}:=S(AB)_{\rho}+S(BC)_{\rho}-S(B)_{\rho}-S(ABC)_{\rho}, (1)

where S(A)ρ=TrρAlog2ρAS(A)_{\rho}=-\mathrm{Tr}\rho_{A}\log_{2}\rho_{A} is the von Neumann entropy of the reduced state on AA. Small values of CMI (with respect to certain tri-partitions of 1D system) turn out to be the necessary and sufficient condition for being well-approximated by a local Gibbs state KB16 (see Sec. II.3 for more details).

Now, we are in a position to present the guiding question of this work. Whenever the answer is affirmative, the parent Hamiltonians of the MPDO are (quasi-)local.

Question I.1.

For which MPDO does the CMI decay exponentially? Namely, for any tri-partition ABCABC of the system where BB separates AA from CC with distance \ell and constant c>0c>0

I(A:C|B)\displaystyle I(A:C|B) =O(ec)?\displaystyle=O(e^{-c\ell})\,? (2)

In addition, we investigate the CMI of the MPDO after local measurements on the conditioning system BB. As the CMI does not always decrease under measurement, it must be studied as an independent problem. The exponential decay of the CMI after measurement implies the outcome distribution is nearly a classical Markov distribution.

Refer to caption
Figure 1: The interplay between the main question, the main results, and the open conjectures. We ask which locally purifiable MPDO has exponential decay of CMI. When the MPDO is constructed by Y-shaped CPTP maps 𝒩\mathcal{N}, we obtain results for bistochastic channels and channels with a forgetful component. For general channels, we propose a strong data-processing inequality that implies decay of CMI but only prove a weaker version. When 𝒩\mathcal{N} is only CP, we are only able to show the exponential decay of CMI for the measurement outcomes of MPDO. This figure omits the discussion on the trace norm CMI.

Main result. In this work, we explore Question I.1 in certain locally purifiable MPDO and point out missing technical conjectures that may lead to more general results.

We first study the quantum CMI of each tripartition of states generated by 1-input/2-output, “Y-shaped” in short, channels. In particular, for bistochastic Y-shaped channels, we obtain an analytical bound for the CMI with a decay rate constant (Theorem III.1). The decay rate constant is strictly smaller than one if the channel has trivial correctable algebra, as defined in operator-algebra quantum error correction theory. We then generalize the argument for a slightly larger class of channels and derive a weaker bound on the CMI with another decay rate constant (Theorem III.2). Therefore we show the exponential decay of the CMI (and a trace norm analog of CMI) for these MPDO. We also prove the exponential decay of CMI for Y-shaped channels with a forgetful component which, in particular, include every strictly positive Y-shaped channels (Proposition III.2).

For general Y-shaped channels, we show that the CMI I(A:C|B)I(A:C|B) must strictly decrease if one applies a channel with trivial correctable algebra on CC (Proposition III.3). To bridge this result to Question I.1, we proposed a conjecture in the form of a data processing inequality for CMI with an explicit decay rate (Conjecture III.1), which would imply exponential decay of CMI (Proposition III.4) for MPDO generated by such Y-shaped channels. The conjecture is known to be true for classical systems, however no result is known for quantum systems.

We further study the CMI after local measurements on the conditional system BB in the computational basis for general locally purifiable MPDO (generated by completely positive (CP) Y-shaped maps)111 For technical reasons, the proof works for CP maps without the trace-preserving (TP) constraint, whereas in the unmeasured case it has to be a channel (CPTP-map). . We prove two sufficient conditions, one is stronger than the other, that guarantee the exponential decay of the CMI of the measured MPDO (Theorem III.3). We then provide a simple polynomial algorithm to check the stronger condition (Proposition III.6), and we numerically find the condition generically holds for MPDO generated by Y-shaped channels whose Stinespring unitaries are sampled from the Haar measure.

Proof ideas. The proof for the decay of CMI for bistochastic channel (Theorem III.1) relies on decomposing the state into an uncorrelated state on ABAB and CC plus a deviation which is traceless on CC. We show the Hilbert-Schimdt norm of the deviation contracts under a bistochastic channel. In Theorem III.2, we instead consider contraction of the deviation under the trace norm, using the associated tools for trace norm. For Y-shaped channels with a forgetful component (Proposition III.2), we use relative entropy convexity to show the CMI contracts at each step. The strict decay of the CMI for general channels (Proposition III.3) follows from techniques in operator-algebra quantum error correction and properties of the Petz recovery map.

For the measured MPDO (Theorem III.3) we are conditioning on a classical system B¯\bar{B}, which implies the CMI I(A:C|B¯)I(A:C|\bar{B}) equals to the average of the mutual information for each outcome state ρAC,B¯=b\rho_{AC,\bar{B}=b}. The outcome states are constructed by sequential CP self-maps, and these are contractions in Hilbert’s projective metric. The CMI decays exponentially if the contraction ratio of every CP-map is strictly less than 1, which holds if the maps are all strictly positive (Condition 2). We also provide a stronger condition, Condition 1, which guarantees strict positivity after certain coarse-graining.

Structure of the paper. We introduce basic concepts on MPDO and a few backgrounds in Section II. In Section III, we state our results and a conjecture on the data-processing inequality. The proofs are presented in Section IV with several lemmas, whose detailed proof is left to Section V.

II Preliminary

In this section, we introduce basic notations and quantum information theoretical concepts that will be used in this paper. A quantum state (density operator) ρ\rho is a bounded operator on a finite-dimensional Hilbert space \mathcal{H} satisfying positivity ρ0\rho\geq 0 and unit trace Trρ=1\mathrm{Tr}\rho=1. We denote the set of quantum states on \mathcal{H} by 𝒮()\mathcal{S}(\mathcal{H}). Quantum systems are often denoted by capital letters A,B,C,A,B,C,... and we abuse the same notation for the associated Hilbert spaces and the bounded operator spaces. We denote the completely mixed state on a Hilbert space BB by τB\tau_{B}. The reduced state of ρ\rho associated to the system AA is denoted by ρA\rho_{A}. \|\cdot\| is the operator norm, p\|\cdot\|_{p} is the pp-Schatten norm, pq\|\cdot\|_{p-q} is the pqp-q superoperator norm, and cb\|\cdot\|_{cb} is the completely bounded 1-1 norm.

II.1 Matrix Product Density Operators

A general open boundary (uniform) MPDO ρ𝒮(n)\rho\in\mathcal{S}(\mathcal{H}_{n}) is a quantum state ρ\rho written as

ρ=s1,s1,,sn,snL|Ansn,snA1s1,s1|R×|sns1sns1|,\displaystyle\rho=\sum_{s_{1},s^{\prime}_{1},...,s_{n},s^{\prime}_{n}}\langle L|A^{s_{n},s^{\prime}_{n}}_{n}...A_{1}^{s_{1},s^{\prime}_{1}}|R\rangle\times|s_{n}...s_{1}\rangle\langle s^{\prime}_{n}...s^{\prime}_{1}|\,, (3)

where each {Aksisi}si,si\{A_{k}^{s_{i}s^{\prime}_{i}}\}_{s_{i},s^{\prime}_{i}} is a set of DD-dimensional matrices and |L,|R|L\rangle,|R\rangle are DD-dimensional vectors with a constant DD. Here, Aks1s1,|LA_{k}^{s_{1}s^{\prime}_{1}},|L\rangle and |R|R\rangle are chosen so that positivity ρ0\rho\geq 0 is satisfied for arbitrary nn. This is a non-trivial condition and in general it is computationally hard to determine whether a given MPO is positive or not PhysRevLett.113.160503 . In this paper, we will specialize to specific sub-classes of MPDO to guarantee the positivity of the state.

A1A_{1}s1|\langle s^{\prime}_{1}||s1|s_{1}\rangleA2A_{2}s2|\langle s^{\prime}_{2}||s2|s_{2}\rangleA3A_{3}s3|\langle s^{\prime}_{3}||s3|s_{3}\rangleA4A_{4}s4|\langle s^{\prime}_{4}||s4|s_{4}\rangleA5A_{5}s5|\langle s^{\prime}_{5}||s5|s_{5}\rangleL|\langle L||R|R\rangle
Figure 2: The tensor network representation for general MPDO (3). It differs from MPS that both bra and ket indices are distinguished, describing an operator instead of a vector.

Throughout this paper, we consider MPDO constructed by concatenating Completely-Positive (CP) maps, i.e., locally purifiable MPDO. Let ,𝒦\mathcal{H},\mathcal{K} be finite-dimensional Hilbert spaces and 𝒩:()(𝒦)\mathcal{N}:\mathcal{B}(\mathcal{H})\to\mathcal{B}(\mathcal{K}\otimes\mathcal{H}) be a 1-input/2-output CP-map where one of the two output systems has the same dimension as the input. We interchangeably refer such a map as a Y-shaped map. For some initial state σ\sigma and \ell\in\mathbb{N}, we obtain a (possibly unnormalized) state ρσ(𝒩)𝒮(𝒦)\rho^{\sigma}_{\ell}(\mathcal{N})\in\mathcal{S}(\mathcal{K}^{\otimes\ell}\otimes\mathcal{H}) defined by

ρσ(𝒩):=(id𝒦(1)𝒩)(id𝒦𝒩)𝒩[σ],\rho^{\sigma}_{\ell}(\mathcal{N}):=\left(\rm id_{\mathcal{K}^{\otimes(\ell-1)}}\otimes\mathcal{N}\right)\circ...\circ\left(\rm id_{\mathcal{K}}\otimes\mathcal{N}\right)\circ\mathcal{N}[\sigma], (4)

where id𝒦\rm id_{\mathcal{K}} is the identity map on 𝒦\mathcal{K}. We mostly abbreviate ρσ(𝒩)\rho_{\ell}^{\sigma}(\mathcal{N}) as ρσ\rho_{\ell}^{\sigma}. This kind of states are classified as a finite-dimensional CC^{*}-finitely correlated states Fannes1992  222Eq. (4) contains states without the consistency constraint imposed on the input state TrBρBCσ=σ\mathrm{Tr}_{B}\rho^{\sigma}_{BC}=\sigma, which is required for CC^{*}-infinite finitely correlated states.. By using the Kraus representation 𝒩(X)=iTiXTi\mathcal{N}(X)=\sum_{i}T_{i}XT_{i}^{\dagger}, one can easily verify ρσ\rho_{\ell}^{\sigma} is a MPDO.

The normalization Trρσ=1\mathrm{Tr}\rho^{\sigma}_{\ell}=1 is guaranteed when σ\sigma is a quantum state and 𝒩\mathcal{N} is a quantum channel, i.e., CP and Trace-Preserving (CPTP) map. Since 1D local Gibbs states always have exponentially decaying two-point correlation Araki1969 , we are interested in Eq. (4) with finite correlation length. The MPDO has a finite correlation length if Tr𝒦𝒩\mathrm{Tr}_{\mathcal{K}}\circ\mathcal{N} has unique maximum eigenvalue 1, and especially the correlation length is exactly zero when Tr𝒦𝒩\mathrm{Tr}_{\mathcal{K}}\circ\mathcal{N} is a constant channel (Fig. 3). The choice of σ\sigma is rather arbitrary in finite systems. For convenience we mainly choose σ\sigma as one side of the maximally entangled state, and denote the corresponding state on 𝒦\mathcal{H}\otimes\mathcal{K}^{\otimes\ell}\otimes\mathcal{H} by ρ\rho_{\ell}. We often denote AA and CC to be two systems \mathcal{H} at the end, and Bi(i=1,,)B_{i}\,(i=1,...,\ell) to be the remaining systems 𝒦\mathcal{K}. Note that we can recover arbitrary ρσ\rho^{\sigma}_{\ell} from ρ\rho_{\ell} by applying a suitable positive operator on AA, which is the other side of the maximally entangled state σAA¯\sigma_{A\bar{A}}, and then trace out AA.

Refer to caption
Refer to caption
Figure 3: (Top) The chain of Y-shaped channel 𝒩\mathcal{N}. The zigzag line represents a maximally entangled pair. When we concatenate another channel to system AB1B2B3C3AB_{1}B_{2}B_{3}C_{3}, subsystem C3C_{3} is mapped to system B4C4B_{4}C_{4}. (Bottom) Y-shaped channel with zero correlation length can be generated by unitaries or isometries and maximally entangled pairs, where tracing out any system BkB_{k} makes the left and right segment uncorrelated.

When we further perform a projective measurement on BB to Eq. (4) in the computational basis {|s1s2slB}\{|s_{1}s_{2}...s_{l}\rangle_{B}\}, the output subsystem B¯{\bar{B}} becomes entirely classical (Fig. 4). The corresponding Y-shaped map 𝒩\mathcal{N} can then be decomposed into 𝒩[]=sk|sksk|sk[]\mathcal{N}[\cdot]=\sum_{s_{k}}|s_{k}\rangle\langle s_{k}|\otimes\mathcal{M}_{s_{k}}[\cdot], where s[]\mathcal{M}_{s}[\cdot] is a CP self-map defined as

sk[ρ]\displaystyle\mathcal{M}_{s_{k}}[\rho] :=sk|Bk(𝒩[ρ])|skBk.\displaystyle:=\langle s_{k}|_{B_{k}}\left(\mathcal{N}[\rho]\right)|s_{k}\rangle_{B_{k}}\,. (5)

Note that if 𝒩\mathcal{N} is TP, then ss\sum_{s}\mathcal{M}_{s} is a CPTP-map and thus {s}s\{\mathcal{M}_{s}\}_{s} forms a quantum instrument.

We can rewrite the measured MPDO by

ρAB¯C=bp(b)|bb|B¯ρAC,b,\displaystyle\rho_{A\bar{B}C}=\sum_{b}p(b)|b\rangle\langle b|_{\bar{B}}\otimes\rho_{AC,b}, (6)

where each ρAC,b\rho_{AC,b} is an output state with a particular measurement outcome b={s1,,sl}b=\{s_{1},...,s_{l}\}

ρAC,b=Mb[σAA¯]Tr(Mb[σAA¯]):=ss1[σAA¯]Tr(ss1[σAA¯]),\rho_{AC,b}=\frac{M_{b}[\sigma_{A\bar{A}}]}{\mathrm{Tr}(M_{b}[\sigma_{A\bar{A}}])}:=\frac{\mathcal{M}_{s_{\ell}}\cdots\mathcal{M}_{s_{1}}[\sigma_{A\bar{A}}]}{\mathrm{Tr}(\mathcal{M}_{s_{\ell}}\cdots\mathcal{M}_{s_{1}}[\sigma_{A\bar{A}}])}, (7)

with probability

p(b)=Tr(ss1[σAA¯]).p(b)=\mathrm{Tr}(\mathcal{M}_{s_{\ell}}\cdots\mathcal{M}_{s_{1}}[\sigma_{A\bar{A}}]). (8)
Refer to caption
Figure 4: The chain of Y-shaped CP maps reduced to sequences of CP self-maps upon measurement outcomes on B1,,B4B_{1},\ldots,B_{4}. It mimics a classical hidden Markov chain, with the extra bit of the quantumness confined in system AA and CC.

II.2 Strong data-processing inequality constants

The quantum relative entropy D(ρσ):=Trρlog2(ρσ)D(\rho\|\sigma):=\mathrm{Tr}\rho\log_{2}(\rho-\sigma) is a distance-like measure between two quantum states ρ,σ\rho,\sigma. One crucial feature of the relative entropy is that it obeys the data-processing inequality (DPI): for any states ρ,σ\rho,\sigma and any CPTP-map \mathcal{E}, it follows that Linbald1975 ; Uhlmann:1977aa

D([ρ][σ])D(ρσ).\displaystyle D\left(\mathcal{E}[\rho]\|\mathcal{E}[\sigma]\right)\leq D(\rho\|\sigma). (9)

The above DPI implies the monotonicity of the mutual information I(A:C)ρ:=D(ρACρAρC)I(A:C)_{\rho}:=D(\rho_{AC}\|\rho_{A}\otimes\rho_{C}) and the conditional mutual information I(A:C|B)ρI(A:C|B)_{\rho} under a local CPTP-map C:CC\mathcal{E}_{C}:C\to C^{\prime}, that is,

I(A:C)C(ρAC)\displaystyle I(A:C^{\prime})_{\mathcal{E}_{C}(\rho_{AC})} I(A:C)ρAC,\displaystyle\leq I(A:C)_{\rho_{AC}}\,, (10)
I(A:C|B)C(ρABC)\displaystyle I(A:C^{\prime}|B)_{\mathcal{E}_{C}(\rho_{ABC})} I(A:C|B)ρABC.\displaystyle\leq I(A:C|B)_{\rho_{ABC}}\,. (11)

The equality holds if and only if there is a CPTP-map C:CC\mathcal{R}_{C^{\prime}}:{C^{\prime}\to C} such that [ρ]=ρ\mathcal{R}\circ\mathcal{E}[\rho]=\rho Petz1986 ; Petz1988 .

Strong DPI constants, or the contraction coefficients are multiplicative factors in the above monotonicity inequalities. For the mutual information and CMI, they are defined as

ηAC(C)\displaystyle\eta_{AC}(\mathcal{E}_{C}) :=supρACI(A:C)C[ρAC]I(A:C)ρAC,\displaystyle:=\sup_{\rho_{AC}}\frac{I(A:C^{\prime})_{\mathcal{E}_{C}[\rho_{AC}]}}{I(A:C)_{\rho_{AC}}}\,, (12)
ηABC(C)\displaystyle\eta_{ABC}(\mathcal{E}_{C}) :=supρABCI(A:C|B)C[ρABC]I(A:C|B)ρABC,\displaystyle:=\sup_{\rho_{ABC}}\frac{I(A:C^{\prime}|B)_{\mathcal{E}_{C}[\rho_{ABC}]}}{I(A:C|B)_{\rho_{ABC}}}\,, (13)

where the values are bounded as 0ηAC(C),ηABC(C)10\leq\eta_{AC}(\mathcal{E}_{C}),\eta_{ABC}(\mathcal{E}_{C})\leq 1 by monotonicity. Note that ηABC(C)\eta_{ABC}(\mathcal{E}_{C}) reduces to ηAC(C)\eta_{AC}(\mathcal{E}_{C}) by taking BB to be trivial. The strong DPI constants bound how much correlations are preserved after applying the channel on subsysem CC.

In the classical case, the strong DPI constants for different quantities are equivalent.

Theorem II.1 (Tight contractive DPI for classical mutual information (anantharam2013maximal, , Theorem 4) ).

For any probability distribution pACp_{AC} and any classical channel :CC\mathcal{E}:C\to C^{\prime},

ηAC()=ηABC()=s(),\eta_{AC}(\mathcal{E})=\eta_{ABC}(\mathcal{E})=s^{*}(\mathcal{E})\,, (14)

where s()s^{*}(\mathcal{E}) is the strong DPI constant for the relative entropy on CC

s():=supp,qpqD([qC][pC])D(pCqC).s^{*}(\mathcal{E}):=\sup_{\begin{subarray}{c}p,q\\ p\neq q\end{subarray}}\frac{D\left(\mathcal{E}\left[q_{C}\right]\right.\left\|\mathcal{E}\left[p_{C}\right]\right)}{D\left(p_{C}\right.\left\|q_{C}\right)}\,. (15)

Remarkably, ηAC()\eta_{AC}(\mathcal{E}) and ηABC()\eta_{ABC}(\mathcal{E}) are independent of A,BA,B in the classical case. The crucial point is that the classical mutual information and the CMI are functions of conditional probability distribution on CC. These functions are then written by convex combinations in classical systems, and regardless of how large auxilary classical system is involved, the Carathéodory theorem implies the auxilary system can always be reduced to dimension |C|+1|C|+1 Danninger-Uchida2001 .

Unfortunately, no analog of conditional distribution nor the Carathéodory-like cardinality bound has been found for quantum systems (see e.g., Ref. 6555753 for a discussion on this problem). Therefore the classical approach failed to work in quantum regime. For this reason, for quantum systems it remains open whether Theorem II.1 also holds, or whether the strong DPI constants are independent of the size of the auxiliary systems or not.

II.3 The conditional mutual information and Gibbs states

The CMI and Gibbs states are intimately connected. In classical systems, the Hammersley-Clifford theorem HCthm states that a (positive) probability distribution pXYZp_{XYZ} is a Gibbs distribution

pXYZ(x,y,z)=ehXY(x,y)hYZ(y,z)Z,p_{XYZ}(x,y,z)=\frac{e^{-h_{XY}(x,y)-h_{YZ}(y,z)}}{Z}\,, (16)

where ZZ is the normalization constant, if and only if I(X:Z|Y)p=0I(X:Z|Y)_{p}=0. Moreover, CMI can be written as

I(X:Z|Y)p=minq:I(X:Z|Y)q=0D(pXYZqXYZ),\displaystyle I(X:Z|Y)_{p}=\min_{q:I(X:Z|Y)_{q}=0}D(p_{XYZ}\|q_{XYZ})\,, (17)

and thus the small value of the CMI implies the state is close to a Gibbs state (16). These results are naturally extended to 1D classical spin chains.

Although Eq. (17) does not hold in general quantum systems, Ref. KB16 shows the following bound:

Theorem II.2 (Theorem 1,KB16 ).

Let ρA1An\rho_{A_{1}...A_{n}} be a state satisfying I(A1Ak1:AkAn|Ak)εI(A_{1}...A_{k-1}:A_{k}...A_{n}|A_{k})\leq\varepsilon. Then there exists a local Hamiltonian H=i=1hAi,Ai+1H=\sum_{i=1}h_{A_{i},A_{i+1}} with hAi,Ai+1h_{A_{i},A_{i+1}} only acts on AiAi+1A_{i}A_{i+1}, such that the relative entropy is controlled by

D(ρeHTreH)nε.D\left(\rho\left\|\frac{e^{-H}}{\mathrm{Tr}e^{-H}}\right.\right)\leq n\varepsilon\,. (18)

If ε=0\varepsilon=0, we recover the quantum Hammersley-Clifford theorem Hayden2004 ; brown2012quantum and the Hamiltonian HH is the sum of terms like logρAi,Ai+1-\log\rho_{A_{i},A_{i+1}}.

When MPDO has exponentially decaying CMI for appropriate tripartitions, then we first coarse-grain sufficiently many (possibly logarithmic w.r.t. 1/ε1/\varepsilon) neighbouring sites as one site and then apply Theorem II.2 to show that the MPDO is well-approximated by a local Gibbs state.

Corollary II.1.

Suppose ρB1Bm\rho_{B_{1}...B_{m}} satisfies exponential decay of CMI for every tripartition

I(B1Bn:Bn++1Bm|Bn+1Bn+)𝒪(ec).\displaystyle I\left(B_{1}\cdots B_{n}:B_{n+\ell+1}\cdots B_{m}|B_{n+1}\cdots B_{n+\ell}\right)\leq\mathcal{O}(e^{-c\ell}). (19)

Then coarse graining =Ω(1clog(m/ϵ))\ell=\Omega\left(\frac{1}{c}\log(m/\epsilon)\right) sites ensures (e.g., B1B:=A1B_{1}\cdots B_{\ell}:=A_{1} ) there exist a (quasi-)local Hamiltonian H=i=1hAi,Ai+1H=\sum_{i=1}h_{A_{i},A_{i+1}} such that

D(ρA1Am||eHTr[eH])ϵ.\displaystyle D\left(\rho_{A_{1}...A_{m}}||\frac{e^{-H}}{\mathrm{Tr}[e^{-H}]}\right)\leq\epsilon. (20)

Note that relative entropy estimate converts to fidelity by

D(ρ||σ)2log(F(ρ,σ)).\displaystyle D(\rho||\sigma)\geq-2\log(F(\rho,\sigma)). (21)

II.4 The correctable algebra and recovery channels

To characterize what information is left undisturbed by a channel, we use the theory of operator-algebra quantum-error correction Beny2007 ; Beny2007a . For a given channel :AA\mathcal{E}:A\to A^{\prime}, we define the correctable algebra 𝒜()(A)\mathcal{A}(\mathcal{E})\subset\mathcal{B}(\mathcal{H}_{A}) as

𝒜():=Alg{OA|[OA,EaEb]=0,a,b}.\displaystyle\mathcal{A}(\mathcal{E}):={\rm Alg}\left\{O_{A}\left|\left[O_{A},E_{a}^{\dagger}E_{b}\right]=0\,,\;\;\forall a,b\right.\right\}. (22)

𝒜()\mathcal{A}(\mathcal{E}) is a finite-dimensional CC^{*}-algebra containing all observables whose information is perfectly preserved under \mathcal{E}. One can always perfectly recover the information associated to an observable in the correctable algebra: there exists a CPTP-map :AA\mathcal{R}:A^{\prime}\to A called recovery map such that

(OA)=OA,OA𝒜().\displaystyle\mathcal{E}^{\dagger}\circ\mathcal{R}^{\dagger}(O_{A})=O_{A}\,,\quad\forall O_{A}\in\mathcal{A}(\mathcal{E})\,. (23)

This implies that for any OA𝒜()O_{A}\in\mathcal{A}(\mathcal{E}), there always is a corresponding observable (OA)\mathcal{R}^{\dagger}(O_{A}) on AA^{\prime} such that Tr(OAρA)=Tr((OA)(ρA))\mathrm{Tr}(O_{A}\rho_{A})=\mathrm{Tr}\left(\mathcal{R}^{\dagger}(O_{A})\mathcal{E}(\rho_{A})\right) for all ρA\rho_{A}. The condition in Eq. (22) is a generalization of the well-known Knill-Laflamme condition PhysRevLett.84.2525 in the standard quantum error correction theory, in which 𝒜()=(code)\mathcal{A}(\mathcal{E})=\mathcal{B}(\mathcal{H}_{code}) for a protected code subspace code\mathcal{H}_{code}.

A common candidate for the recovery channel is the Petz map:

Definition 1.

The Petz recovery map 𝒫σ,\mathcal{P}_{\sigma,\mathcal{E}} with the reference state σ\sigma and the target channel \mathcal{E} is defined as

𝒫σ,[]:=σ1/2[(σ)1/2()(σ)1/2]σ1/2.\mathcal{P}_{\sigma,\mathcal{E}}[\cdot]:=\sigma^{1/2}\mathcal{E}^{\dagger}\left[\mathcal{E}(\sigma)^{-1/2}(\cdot)\mathcal{E}(\sigma)^{-1/2}\right]\sigma^{1/2}. (24)

As it only features moderately in this paper, we save the vast literature on universal recovery channel elsewhere (see e.g., Junge_2018 ), and include the relevant ideas on invariant subspaces in Appendix A. This yields another characterization of the correctable algebra handy for our purposes.

Proposition II.1.

The correctable algebra of a channel \mathcal{E} equals to the fixed-point algebra of the Petz-recovery channel composed with channel 𝒫τ,\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E} (and the dual 𝒫τ,\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau,\mathcal{E}}^{\dagger}, due to being self-adjoint).

S𝒫τ,=S𝒫τ,=𝒜()S_{\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau,\mathcal{E}}^{\dagger}}=S_{\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E}}=\mathcal{A}(\mathcal{E}) (25)

III Main results

Here we state our main results in a concrete way.

III.1 Decay of CMI for bistochastic Y-shaped channels

Our first main result is showing that any state constructed from a bistochastic 333Usually, bistochastic refers to a self-map channel in the literature. Here we have higher output dimension than input, but we abuse the terminology in this paper. Y-shaped channel (CPTP-map) is exponentially close to a product state on ABAB and CC. This provides an upper bound of the CMI (and also the trace norm CMI in Sec. III.1.1). See Sec. IV.1 for the proof.

Theorem III.1.

For a bistochastic (unital) Y-shaped channel 𝒩[τC1]=τB1τC2\mathcal{N}[\tau_{C_{1}}]=\tau_{B_{1}}\otimes\tau_{C_{2}}, consider a family of states defined by

ρAB1BC:=(idid𝒦(1)𝒩)(idid𝒦𝒩)𝒩[σAA¯],\rho_{AB_{1}\cdots B_{\ell}C_{\ell}}:=\left(\rm id_{\mathcal{H}}\otimes\rm id_{\mathcal{K}}^{\otimes(\ell-1)}\otimes\mathcal{N}\right)\circ...\circ\left(\rm id_{\mathcal{H}}\otimes\rm id_{\mathcal{K}}\otimes\mathcal{N}\right)\circ\mathcal{N}[\sigma_{A\bar{A}}], (26)

where σAA¯\sigma_{A\bar{A}} is the maximally entangled state. Then, it holds that

ρAB1BCρAB1BτC1=O(η).\lVert\rho_{AB_{1}\cdots B_{\ell}C_{\ell}}-\rho_{AB_{1}\cdots B_{\ell}}\otimes\tau_{C}\rVert_{1}=O(\eta^{\ell}). (27)

This implies the CMI is bounded as

I(A:C|B1B)ρl\displaystyle I(A:C|B_{1}\cdots B_{\ell})_{\rho_{l}} =O(η).\displaystyle=O(\ell\eta^{\ell}). (28)

The contraction ratio η\eta is given as

η:=lim supρCτCD(𝒩[ρC]τB1C)D(ρCτC),\eta:=\limsup_{\rho_{C}\rightarrow\tau_{C}}\frac{D(\mathcal{N}[\rho_{C}]\|\tau_{B_{1}C})}{D(\rho_{C}\|\tau_{C})}, (29)

where η<1\eta<1 if and only if 𝒩\mathcal{N} has trivial correctable algebra. Furthermore, η\eta coincides with the second eigenvalue of channel 𝒵:=𝒫τ,𝒩𝒩\mathcal{Z}:=\mathcal{P}_{\tau,\,\mathcal{N}}\circ\mathcal{N}, where 𝒫τ,𝒩\mathcal{P}_{\tau,\,\mathcal{N}} is the Petz recovery map 10.1093/qmath/39.1.97 as in Eq.(24) with the completely mixed state τ\tau as reference.

Remark. Here we only consider a particular tripartition ABCABC to show the decay of the CMI. However, under additional moderate assumptions, the exponential decay of CMI of this tripartition implies the CMI is also small for other tripartitions like A:=AB1Bn,B:=Bn+1Bn+m,C:=Bn+m+1BCA^{\prime}:=AB_{1}...B_{n},B^{\prime}:=B_{n+1}...B_{n+m},C^{\prime}:=B_{n+m+1}...B_{\ell}C_{\ell}. To see this, we use the chain rule of CMI: I(A:C|B)ρ=I(A:C|B1B)ρ+I(A:Bn+m+1B|B1Bn+m)ρ+I(B1Bn:Bn+m+1BC|Bn+1Bn+m)ρI(A^{\prime}:C^{\prime}|B^{\prime})_{\rho}=I(A:C_{\ell}|B_{1}...B_{\ell})_{\rho}+I(A:B_{n+m+1}...B_{\ell}|B_{1}...B_{n+m})_{\rho}+I(B_{1}...B_{n}:B_{n+m+1}...B_{\ell}C_{\ell}|B_{n+1}...B_{n+m})_{\rho}. The second term is upper bounded by I(A:Cn+m|B1Bn+m)ρI(A:C_{n+m}|B_{1}...B_{n+m})_{\rho} by the monotonicity of the CMI, and thus the first and the second terms obey the exponential decay as mm grows. Bounding the third term may require an additional assumption. One way to bound the last term is by explicit calculation. For instance, it is 0 for the bistochastic case since ρBC=τBC\rho_{BC}=\tau_{BC}. Another way to bound it is assuming the zero correlation length condition (bottom, Fig. 3), under which the state can be generated by the inverse channel 𝒩~:CBA{\tilde{\mathcal{N}}}:C\to BA as well as 𝒩\mathcal{N}. This guarantees I(B1Bn:Bn+m+1BC|Bn+1Bn+m)ρI(B_{1}...B_{n}:B_{n+m+1}...B_{\ell}C_{\ell}|B_{n+1}...B_{n+m})_{\rho} is upper bounded by I(An:Cn+m|Bn+1Bn+m)ρI(A_{n}:C_{n+m}|B_{n+1}...B_{n+m})_{\rho}, which is the CMI for ρm\rho_{m} up to the translation shift.

In this paper, we pay less attention to the case of periodic MPDO where output CC of channel got fed into the input AA, as this gluing step might be tricky. In the bistochastic case this is tractable (Appendix B) and the state becomes even simpler as the maximally mixed state plus an exponentially decaying global operator. There, the parent Hamiltonian would be trivial in the thermodynamic limit.

A crucial property of bistochastic Y-shaped channels is they act as a contraction onto certain operator subspace. This implies Eq. (27) at large \ell, and then the CMI bound follows from the continuity of the entropy. An example of bistochastic Y-shaped channel is given as follows.

Example: A bistochastic YY-shaped channel p:(2)(22)\mathcal{E}_{p}:\mathcal{B}({\mathbb{C}}^{2})\to\mathcal{B}({\mathbb{C}}^{2}\otimes{\mathbb{C}}^{2}) defined by Kraus operators

K1=[1p21p2001p21p200],K2=[p2p200p2p200],K3=[001p21p2001p21p2],K4=[00p2p200p2p2],\displaystyle K_{1}=\left[\begin{array}[]{cc}\frac{\sqrt{1-p}}{2}&\frac{\sqrt{1-p}}{2}\\ 0&0\\ \frac{\sqrt{1-p}}{2}&-\frac{\sqrt{1-p}}{2}\\ 0&0\\ \end{array}\right],K_{2}=\left[\begin{array}[]{cc}\frac{\sqrt{p}}{2}&\frac{\sqrt{p}}{2}\\ 0&0\\ -\frac{\sqrt{p}}{2}&\frac{\sqrt{p}}{2}\\ 0&0\\ \end{array}\right],K_{3}=\left[\begin{array}[]{cc}0&0\\ \frac{\sqrt{1-p}}{2}&\frac{\sqrt{1-p}}{2}\\ 0&0\\ -\frac{\sqrt{1-p}}{2}&\frac{\sqrt{1-p}}{2}\\ \end{array}\right],K_{4}=\left[\begin{array}[]{cc}0&0\\ \frac{\sqrt{p}}{2}&\frac{\sqrt{p}}{2}\\ 0&0\\ \frac{\sqrt{p}}{2}&-\frac{\sqrt{p}}{2}\\ \end{array}\right], (46)

has exponentially decaying CMI for p(0,1/2)(1/2,1)p\in(0,1/2)\cup(1/2,1). Otherwise, I(A:C|B)ρ=0I(A:C|B)_{\rho_{\ell}}=0 for p=1/2p=1/2 and I(A:C|B)ρ=1I(A:C|B)_{\rho_{\ell}}=1 for p=0,1p=0,1.

While being bistochastic is crucial to provide a decay rate defined by relative entropy in Theorem III.1, we can work with slightly general channels with a likely suboptimal decay rate. See Sec. IV.2 for the proof.

Theorem III.2.

For a channel such that 𝒩[νC1]=σB1νC2\mathcal{N}[\nu_{C_{1}}]=\sigma_{B_{1}}\otimes\nu_{C_{2}} for some σ\sigma and ν\nu, and the maximally entangled input state σAA¯\sigma_{A\bar{A}}, the resulting chain

ρAB1BC:=(idid𝒦(1)𝒩)(idid𝒦𝒩)𝒩[σAA¯]\rho_{AB_{1}\cdots B_{\ell}C_{\ell}}:=\left(\rm id_{\mathcal{H}}\otimes\rm id_{\mathcal{K}}^{\otimes(\ell-1)}\otimes\mathcal{N}\right)\circ...\circ\left(\rm id_{\mathcal{H}}\otimes\rm id_{\mathcal{K}}\otimes\mathcal{N}\right)\circ\mathcal{N}[\sigma_{A\bar{A}}] (47)

satisfies

ρAB1BCρAB1BνC1=O(η).\lVert\rho_{AB_{1}\cdots B_{\ell}C_{\ell}}-\rho_{AB_{1}\cdots B_{\ell}}\otimes\nu_{C}\rVert_{1}=O(\eta^{\ell}). (48)

This implies the CMI is bounded as

I(A:C|B1B)ρ\displaystyle I(A:C|B_{1}\cdots B_{\ell})_{\rho_{\ell}} =O(η).\displaystyle=O(\ell\eta^{\ell}). (49)

The contraction ratio η\eta is given as

η:=min(1,16dCη1,C),\eta:=\min(1,16d_{C}\eta_{1,C}), (50)

where η1,C\eta_{1,C} is the trace norm contraction ratio defined on system C only

η1,C:=supρC,ρCC[ρC]C[ρC]1ρCρC1.\eta_{1,C}:=\sup_{\rho_{C},\rho^{\prime}_{C}}\frac{\lVert\mathcal{E}_{C}[\rho_{C}]-\mathcal{E}_{C}[\rho^{\prime}_{C}]\lVert_{1}}{\lVert\rho_{C}-\rho^{\prime}_{C}\lVert_{1}}. (51)

The decay of CMI for more general tripartition follows from the same argument as the bistochastic case (Theorem III.1 ). The distinction, though, is that our bound for the contraction ratio suffers from extra factor of dimension dCd_{C} due to conversion to the completely bounded norm (Lemma V.3), so the bound is meaningful only for the noisy regime η1,C<(16dc)1\eta_{1,C}<(16d_{c})^{-1}.

III.1.1 The trace norm CMI

In some cases, the trace norm (or the trace distance) has more applicable tools than the relative entropy. We now introduce a trace-norm variant of CMI. We expect it to help understanding the qualitative behavior of the CMI.

Definition 2 (The Trace norm CMI).
I1(A:C|B):=ρABCρAρBC1ρABρAρB1I_{1}(A:C|B):=\lVert\rho_{ABC}-\rho_{A}\otimes\rho_{BC}\rVert_{1}-\lVert\rho_{AB}-\rho_{A}\otimes\rho_{B}\rVert_{1} (52)

The definition is motivated by the form of CMI as I(A:C|B)=I(A:BC)I(A:B)I(A:C|B)=I(A:BC)-I(A:B) and replace each mutual information by the trace distance (without the normalization for the simplicity). The mutual information and the trace distance bound each other (in a non-linear way) by the quantum Pinsker inequality (wilde2011classical, , Theorem  11.9.1) and the continuity of entropy, and thus we expect this trace-norm variant of CMI partially captures common characteristics of the states as the CMI. Indeed, in the special cases analyzed in the following two theorems (Theorem III.1, Theorem III.2), we arrive at nearly same bound on decay rate for both CMI and the trace norm CMI, up to fixed overheads and logarithmic correction.

Proposition III.1.

The trace norm CMI is bounded as

I1(A:C|B1B)ρ=O(η)I_{1}(A:C|B_{1}\cdots B_{\ell})_{\rho_{\ell}}=O(\eta^{\ell}) (53)

for both states defined in Theorem III.1 and Theorem III.2, with η\eta defined respectively.

Unfortunately, we do not know to what extent the properties of CMI carry over, such as the connection to Markov states and recoverability. We leave further analysis on this function to future works.

III.2 Decay of CMI for Y-shaped channels with a forgetful component

Although we have shown the upper bound of CMI for bistochastic Y-shaped channels, they are only a small portion of the space of channels. Here, we show another suggestive result towards Question I.1.

Proposition III.2.

Consider a Y-shaped channel 𝒩:CB1C\mathcal{N}:C\rightarrow B_{1}C such that there is a constant 0η<10\leq\eta<1, a forgetful channel FF and another channel 𝒩\mathcal{N}^{\prime} such that

𝒩=(1η)F+η𝒩,\mathcal{N}=(1-\eta)F+\eta\mathcal{N}^{\prime}\,, (54)

where F[ρ]=σF[\rho]=\sigma maps any state to the same state. Then for any tripartite state ρABC\rho_{A^{\prime}B^{\prime}C}, the CMI contracts after applying 𝒩\mathcal{N},

I(A:C|BB1)(ρ)ηI(A:C|B)ρ,I(A^{\prime}:C|B^{\prime}B_{1})_{\mathcal{E}(\rho)}\leq\eta I(A^{\prime}:C|B^{\prime})_{\rho}\,, (55)
I1(A:C|BB1)(ρ)ηI1(A:C|B)ρ.I_{1}(A^{\prime}:C|B^{\prime}B_{1})_{\mathcal{E}(\rho)}\leq\eta I_{1}(A^{\prime}:C|B^{\prime})_{\rho}\,. (56)

Hence the resulting chain has exponentially decaying CMI and trace norm CMI for any tripartition

I(AB1Bn:Bn+m+1BC|Bn+1Bn+m)ρ\displaystyle I(AB_{1}\ldots B_{n}:B_{n+m+1}\ldots B_{\ell}C_{\ell}|B_{n+1}\cdots B_{n+m})_{\rho_{\ell}} =O(ηm),\displaystyle=O(\eta^{m})\,, (57)
I1(AB1Bn:Bn+m+1BC|Bn+1Bn+m)ρ\displaystyle I_{1}(AB_{1}\ldots B_{n}:B_{n+m+1}\ldots B_{\ell}C_{\ell}|B_{n+1}\cdots B_{n+m})_{\rho_{\ell}} =O(ηm).\displaystyle=O(\eta^{m})\,. (58)

Remarkably, each application of 𝒩\mathcal{N} contracts the CMI by η\eta, from simple application of convexity of relatively entropy and monotonicity of CMI (proof in Sec. V.2.3). Intuitively the contraction ratio is η\eta because the forgetful channel necessarily removes any correlation with system ABAB. The step wise decay implies the decay of CMI for any tripartition with a long conditioned system BB, i.e. the Y-shaped channel is applied many times.

Note that any strictly positive Y-shaped channel 𝒩(ρ)>0\mathcal{N}(\rho)>0 always has a decomposition (54) with FF being the completely depolarizing channel. Moreover, for any Y-shaped channel 𝒩\mathcal{N}^{\prime} and any 0η<10\leq\eta<1, we can always construct a perturbed channel 𝒩\mathcal{N} in Eq. (54) satisfying 𝒩𝒩cb2(1η)\|\mathcal{N}-\mathcal{N}^{\prime}\|_{cb}\leq 2(1-\eta).

In the sense that any channel can be perturbed to have a forgetful component, Proposition III.2 is true for most of quantum channels. We should note that the appropriate choice of ‘generic’ MPDO certainly depends on the constraint of the problem at hand, for example if we restrict the number of Kraus operators to be small, then strict positivity would not hold generically.

A channel with forgetful component necessarily has trivial correctable algebra, but not vice versa. We can see this fact through an explicit example.

Example: The classical channel given by the following Kraus operators has trivial correctable algebra, but has no forgetful component. Input states |11|,|22|,|33||1\rangle\langle 1|,|2\rangle\langle 2|,|3\rangle\langle 3| are mapped to |22|+|33|2,|11|+|33|2,|11|+|22|2\frac{|2\rangle\langle 2|+|3\rangle\langle 3|}{2},\frac{|1\rangle\langle 1|+|3\rangle\langle 3|}{2},\frac{|1\rangle\langle 1|+|2\rangle\langle 2|}{2}, whose intersection is empty and thus cannot have a shared forgetful component.

K1=12[000100100],K2=12[010000010],K3=12[001001000].\displaystyle K_{1}=\frac{1}{\sqrt{2}}\left[\begin{array}[]{ccc}0&0&0\\ 1&0&0\\ 1&0&0\\ \end{array}\right],K_{2}=\frac{1}{\sqrt{2}}\left[\begin{array}[]{ccc}0&1&0\\ 0&0&0\\ 0&1&0\\ \end{array}\right],K_{3}=\frac{1}{\sqrt{2}}\left[\begin{array}[]{ccc}0&0&1\\ 0&0&1\\ 0&0&0\\ \end{array}\right]. (68)

III.3 A completely contractive DPI of CMI that implies exponential decay of CMI

We have shown the channels we considered in Sec. III.1, III.2 incur the exponential decay of CMI. For channels with a forgetful component (Proposition III.2), the exponential decay simply comes from a contraction occurring at each application of channel, and it is tempting to ask whether this is the general case. This motivates the following discussion on the data processing inequality.

First, we show that a single application of any channel with trivial correctable algebra always induces strict decay of CMI.

Proposition III.3.

Let :CC\mathcal{E}:C\to C^{\prime} be a CPTP map which has trivial correctable algebra. Consider a tripartite system ABCA\otimes B\otimes C. Then, for any state ρABC\rho_{ABC} with I(A:C|B)ρ>0I(A:C|B)_{\rho}>0, we have

I(A:C|B)(ρ)<I(A:C|B)ρ.\displaystyle I(A:C^{\prime}|B)_{\mathcal{E}(\rho)}<I(A:C|B)_{\rho}\,. (69)

The proof is given in Sec. V.2.1. By regarding \mathcal{E} as Y-shaped channel 𝒩Ci1BiCi\mathcal{N}_{C_{i-1}\to B_{i}C_{i}}, we see that I(A:C|B)ρk<I(A:C|B)ρk1I(A:C|B)_{\rho_{k}}<I(A:C|B)_{\rho_{k-1}} holds for each kk (we abuse notation BCBC for two different lengths). Note that having trivial correctable algebra provides only a sufficient condition to have a strict decay of the CMI. There is a Y-shaped channel with non-trivial correctable algebra which obeys Eq. (69) (see Ref. KB20 for a complementary result).

Unfortunately, recursively applying Eq. (69) may not imply the exponential decay of CMI of ρ\rho_{\ell} as the decay might get slower as system BB gets larger. We therefore propose the following conjecture to explicitly include a contraction ratio as a strong DPI constant.

Conjecture III.1 (Completely Contractive DPI for the CMI).

For any channel :CC\mathcal{E}:C\rightarrow C^{\prime} with trivial correctable algebra 𝒜()=I\mathcal{A}(\mathcal{E})=\mathbb{C}I, there exists a constant η<1\eta<1 such that for any tripartite system ABCABC and any state ρABC\rho_{ABC}, it holds that

I(A:C|B)(ρ)ηI(A:C|B)ρ.I(A:C^{\prime}|B)_{\mathcal{E}(\rho)}\leq\eta I(A:C|B)_{\rho}. (70)

Invoking standard monotonicity inequalities(Sec. V.2.2), the contraction accumulates and thus Conjecture III.1 implies the desired exponential decay of CMI:

Proposition III.4.

If Conjecture III.1 is true, then for any ρ(𝒩)\rho_{\ell}(\mathcal{N}) constructed by Y-shaped channel 𝒩\mathcal{N} with trivial correctable algebra,

I(A:C|B)ρ=𝒪(e(lnη)m)I(A^{\prime}:C^{\prime}|B^{\prime})_{\rho}=\mathcal{O}(e^{-(\ln\eta)m}) (71)

for each tripartition ABC=ABCA^{\prime}B^{\prime}C^{\prime}=ABC where BB^{\prime} separates AA^{\prime} from CC^{\prime} with distance mm.

Conjecture III.1 holds for the channels with a forgetful component (Proposition III.2), where ηABC()\eta_{ABC}(\mathcal{E}) does not approach 1 as ABAB grows. In the classical case generally an unbounded auxiliary system can be reduced to have bounded dimension only depending on CC, and for the strong DPI constants the auxiliary system can be ignored ηABC=ηAC=ηC\eta_{ABC}=\eta_{AC}=\eta_{C} (Theorem II.1). In the quantum case we do not know whether ABAB can be reduced to a bounded dimension. Hence, the conjecture is open in general quantum systems and even showing its simpler variant for the mutual information (setting BB trivial) would be a breakthrough. Very recently the case when AA is classical and BB is trivial was proven with a contraction ratio analogous to the classical case hirche2020contraction .

A similar DPI can be proposed for the trace norm CMI.

Conjecture III.2 (completely contractive DPI for the trace norm CMI).

For any channel :CC\mathcal{E}:C\rightarrow C^{\prime} with local contraction ratio

η1,C:=supρC,ρCC[ρC]C[ρC]1ρCρC1<1\eta_{1,C}:=\sup_{\rho_{C},\rho^{\prime}_{C}}\frac{\lVert\mathcal{E}_{C}[\rho_{C}]-\mathcal{E}_{C}[\rho^{\prime}_{C}]\lVert_{1}}{\lVert\rho_{C}-\rho^{\prime}_{C}\lVert_{1}}<1 (72)

There exists a global constant η<1\eta<1 such that for any tripartite system ABCABC and any state ρABC\rho_{ABC}, it holds that

I1(A:C|B)(ρ)ηI1(A:C|B)ρ.I_{1}(A:C^{\prime}|B)_{\mathcal{E}(\rho)}\leq\eta I_{1}(A:C|B)_{\rho}. (73)

We do not know if extra constants or factor of dimension dCd_{C} should be present between η1,C\eta_{1,C} and η\eta like the crude bound in Theorem III.2. The trace norm seems more tractable than mutual information in the bipartite case.

Proposition III.5 (Contraction of the trace norm mutual information).

For any channel 𝒩:CC\mathcal{N}:C\rightarrow C^{\prime}, it holds that

𝒩[ρBCρBρC]14η1,CdCρBCρBρC1\displaystyle\lVert\mathcal{N}[\rho_{BC}-\rho_{B}\otimes\rho_{C}]\rVert_{1}\leq 4\eta_{1,C}d_{C}\lVert\rho_{BC}-\rho_{B}\otimes\rho_{C}\rVert_{1} (74)

for any state ρBC\rho_{BC}.

Here the contraction ratio is controlled by a bounded factor, where the dimension factor comes from bounding the completely bounded superoperator 111-1 norm (Lemma V.3).

III.4 Sufficient conditions for exponential decay of classically-conditioned mutual information

In this section we consider the CMI of MPDO after performing the local measurement on the computational basis of BB. Here we do not require the original map 𝒩\mathcal{N} in Eq. (4) to be trace-preserving.

Theorem III.3 (Sufficient conditions for decay of CMI for measured MPDO).

For a B-measured MPDO as in Eq. (6), the following are implications in the order 12341\implies 2\implies 3\implies 4:

  1. 1.

    (Condition 1) Each s\mathcal{M}_{s} has pp Kraus operators {E1s,,Eps}\{E^{s}_{1},\cdots,E^{s}_{p}\} such that EpsE^{s}_{p} is invertible, and

    {(Eps)1E1s,,(Eps)1Ep1s}\left\{\left(E^{s}_{p}\right)^{-1}E^{s}_{1},\cdots,\left(E^{s}_{p}\right)^{-1}E^{s}_{p-1}\right\}

    generate full matrix algebra MD()M_{D}(\mathbb{C}) by addition and multiplication.

  2. 2.

    (Condition 2) There exists uniform coarse-graining length ξ\xi such that any sequence t:=sk+lsk\mathcal{M}^{\prime}_{t}:=\mathcal{M}_{s_{k+l}}\cdots\mathcal{M}_{s_{k}}, where t=(sk,,sk+l)t=(s_{k},...,s_{k+l}), is strictly positive map for all lξl\geq\xi. In other words,

    det(t[ρ])0,t,ρ{\rm det}(\mathcal{M}^{\prime}_{t}[\rho])\neq 0\,,\quad\forall t,\forall\rho (75)

    whenever |t|ξ|t|\geq\xi.

  3. 3.

    The mutual information of A,CA,C observing any outcome bb is bounded by a uniform decay rate c1>0c_{1}>0

    I(A:C)bc0ec1.I(A:C)_{b}\leq c_{0}e^{-c_{1}\ell}. (76)
  4. 4.

    The measured conditional mutual information decays exponentially with the length \ell of system BB

    I(A:C|B¯)c0ec1.I(A:C|\bar{B})\leq c_{0}e^{-c_{1}\ell}. (77)

Condition 2 (strict positivity of all s\mathcal{M}_{s}) is unlikely a necessary condition for the decay of CMI, but it is convenient to state. We include coarse-graining in the statement as it is sometimes needed to ensure strict positivity of all s\mathcal{M}_{s}. For example, if the original Y-shaped CP map comes from tracing out a system much smaller than the bond dimension, then it may not be strictly positive.

A similar bound for the coarse-graining length ξ\xi for iteration of a single channel was given by the quantum Wielant inequality 5550282 : for every primitive channel 𝒩\mathcal{N} (with Kraus rank pp and Hilbert space dimension DD), ξ=D2(D2p+1)\xi={D^{2}(D^{2}-p+1)} iteration guarantees full-rank output. Our Condition 1 is a multi-channel generalization: it is a sufficient condition for all possible sequences of ss2s1\mathcal{M}_{s_{\ell}}\circ\cdots\circ\mathcal{M}_{s_{2}}\circ\mathcal{M}_{s_{1}} generated by CP maps {1,2,n}\{\mathcal{M}_{1},\mathcal{M}_{2},\cdots\mathcal{M}_{n}\} to become strictly positive for coarse graining length lξ:=D2p+1l\geq\xi:=D^{2}-p+1. See Sec. IV.3 for the proof.

Although strict positivity (Condition 2) is NP-hard to check in general gaubert2014checking , we prove that Condition 1 can be verified in a polynomial time (see Sec. V.4.5 for the details).

Proposition III.6.

Condition 1 can be checked in time polynomial in the bond dimension DD and the number of Kraus operators pp.

We numerically verify Condition 1 (generating full algebra) for Y-shaped channels generated by Haar-random Stinespting unitary (Fig. 3, bottom). The unitaries are the same random sample U(D2)U(D^{2}), and we have checked for computationally tractable local Hilbert space dimensions D8.D\leq 8. 444githubcode is the repository of the jupyter code. The code checks condition 1 for arbitary MPDO given the Kraus operators.. Both conditions imply Eq. (76), which further implies our original goal, Eq. (77).

IV Proofs of main theorems

Here we present proofs of the main results by putting the key lemmas together, whose proofs we postpone in Sec. V.

IV.1 Theorem III.1: decay of CMI for bistochastic channel

The bistochastic assumption greatly simplifies the structure of state. The state can be decomposed into a uncorrelated state ρABτC\rho_{AB}\otimes\tau_{C} plus a deviation GABCG_{ABC} that is traceless on system CC. At each application of channel in the generation of ρ\rho_{\ell}, there are two key implications of being bistochastic: 1) the uncorrelated state ρABτC\rho_{AB}\otimes\tau_{C} is mapped to an uncorrelated state where system CC remains the maximally mixed state τC\tau_{C}, 2) the deviation GABCG_{ABC} contracts w.r.t the normalized Hilbert-Schimdt norm. The rest of the proofs are obtained by standard conversion between norms. The proofs of the lemmas are shown independently in Sec. V.1.

Proof.

Let us take an operator basis containing identity which is orthogonal in the Hibert-Schmidt inner product. Then any operator on CC can be decomposed into the maximally mixed state τC\tau_{C} and the traceless part KCK_{C}. After applying bistochastic Y-shaped channel 𝒩:CiBi+1Ci+1\mathcal{N}:C_{i}\to B_{i+1}C_{i+1}, the traceless part is mapped to

𝒩[KC]\displaystyle\mathcal{N}[K_{C}] =OBτC+jOBjKCj\displaystyle=O_{B}\otimes\tau_{C}+\sum_{j}O^{j}_{B}\otimes K^{j}_{C}

where we abused the notation O,OjO,O^{j} to denote general operators and K,KjK,K^{j} to denote traceless operators. In the following we will use the curly brackets {OBKC}:=jOBjKBj\{O_{B}\otimes K_{C}\}:=\sum_{j}O_{B}^{j}\otimes K^{j}_{B} as a shorthand notation for this type of linear combinations. Iteratively decomposing system C into the traceless and maximally mixed components after applying 𝒩\mathcal{N}, we can write down the structure of the state ρAB1BC\rho_{AB_{1}\cdots B_{\ell}C_{\ell}} explicitly. Starting with the maximally entangled input state σAA¯=τAτA¯+{KAKA¯}\sigma_{A\bar{A}}=\tau_{A}\otimes\tau_{\bar{A}}+\{K_{A}\otimes K_{\bar{A}}\}, we obtain that

ρAB1C1\displaystyle\rho_{AB_{1}C_{1}} =𝒩[τAA¯+{KAKA¯}]=τAB1τC1+OAB1τC1+{OAB1KC1}\displaystyle=\mathcal{N}[\tau_{A\bar{A}}+\{K_{A}\otimes K_{\bar{A}}\}]=\tau_{AB_{1}}\otimes\tau_{C_{1}}+O_{AB_{1}}\otimes\tau_{C_{1}}+\{O_{AB_{1}}\otimes K_{C_{1}}\}
=ρAB1τC1+{OAB1KC1},\displaystyle=\rho_{AB_{1}}\otimes\tau_{C_{1}}+\{O_{AB_{1}}\otimes K_{C_{1}}\}, (78)
ρAB1B2C2\displaystyle\rho_{AB_{1}B_{2}C_{2}} =𝒩[ρAB1C1]=ρB1τB2τC2+OAB1OB2τC2+{OAB1OB2KCl}\displaystyle=\mathcal{N}[\rho_{AB_{1}C_{1}}]=\rho_{B_{1}}\otimes\tau_{B_{2}}\otimes\tau_{C_{2}}+O_{AB_{1}}\otimes O_{B_{2}}\otimes\tau_{C_{2}}+\{O_{AB_{1}}\otimes O_{B_{2}}\otimes K_{C_{l}}\}
=ρAB1B2τC2+{OAB1B2KC2},\displaystyle=\rho_{AB_{1}B_{2}}\otimes\tau_{C_{2}}+\{O_{AB_{1}B_{2}}\otimes K_{C_{2}}\}, (79)
ρAB1BC\displaystyle\rho_{AB_{1}\cdots B_{\ell}C_{\ell}} =τAB1τC\displaystyle=\tau_{AB_{1}}\otimes\cdots\tau_{C_{\ell}} (80)
+OAB1τB2τC\displaystyle+O_{AB_{1}}\otimes\tau_{B_{2}}\cdots\tau_{C_{\ell}} (81)
+\displaystyle+\cdots (82)
+{OAB1OB2KC}\displaystyle+\{O_{AB_{1}}\otimes O_{B_{2}}\cdots\otimes K_{C_{\ell}}\} (83)
=ρABτC+{OABKC},\displaystyle=\rho_{AB}\otimes\tau_{C_{\ell}}+\{O_{AB}\otimes K_{C_{\ell}}\}\,, (84)

where we simply denote system B1BB_{1}\ldots B_{\ell} by BB. The second term {OABKC}\{O_{AB}\otimes K_{C_{\ell}}\} in Eq. (84) is exponentially suppressed in the normalized Hilbert-Schmidt norm. To see this, we use the following lemma for bistochastic channels.

Lemma IV.1 (DPI for normalized HS-norm).

For bistochastic channel 𝒩C[τC]=τC\mathcal{N}_{C}[\tau_{C}]=\tau_{C^{\prime}}, the normalized Hilbert-Schimdt norm of operators KECK_{EC} traceless on CC contracts. More precisely, if TrC(KEC)=0Tr_{C}(K_{EC})=0 then

Tr(dEC𝒩C[K]EC2)ηTr(dECKEC2)\mathrm{Tr}(d_{EC^{\prime}}\mathcal{N}_{C}[K]^{2}_{EC})\leq\eta\mathrm{Tr}(d_{EC}K^{2}_{EC}) (85)

with the coefficient η\eta defined as

η:=lim supρCτCD(𝒩[ρ]τC)D(ρCτC).\eta:=\limsup_{\rho_{C}\rightarrow\tau_{C}}\frac{D(\mathcal{N}[\rho]\|\tau_{C^{\prime}})}{D(\rho_{C}\|\tau_{C})}. (86)

Let K\mathbb{P}_{K} be the orthogonal projection onto the traceless operator subspace on CC, GABC:={OABKC}G_{ABC}:=\{O_{AB}\otimes K_{C}\} and GAA¯:={KAKA¯}G_{A\bar{A}}:=\{K_{A}\otimes K_{\bar{A}}\}. Then, we obtain an expression

GABC=K𝒩K𝒩K𝒩[GAA¯].\displaystyle G_{ABC}=\mathbb{P}_{K}\circ\mathcal{N}\circ\mathbb{P}_{K}\circ\mathcal{N}...\mathbb{P}_{K}\circ\mathcal{N}[G_{A\bar{A}}]\,. (87)

By Lemma IV.1, the normalized HS norm of GABCG_{ABC} contracts under K𝒩\mathbb{P}_{K}\circ\mathcal{N} since the projection does not increase the HS norm. We thus obtain

dABCTr(GABC)2\displaystyle d_{ABC}\mathrm{Tr}(G_{ABC})^{2} =dAdB1BldCTr(K𝒩K𝒩𝒩(GAA¯))2\displaystyle=d_{A}d_{B_{1}...B_{l}}d_{C}\mathrm{Tr}(\mathbb{P}_{K}\circ\mathcal{N}\circ\mathbb{P}_{K}\circ\mathcal{N}\circ...\mathcal{N}(G_{A\bar{A}}))^{2} (88)
dAdB1BldCTr(𝒩K𝒩𝒩(GAA¯))2\displaystyle\leq d_{A}d_{B_{1}...B_{l}}d_{C}\mathrm{Tr}(\mathcal{N}\circ\mathbb{P}_{K}\circ\mathcal{N}\circ...\mathcal{N}(G_{A\bar{A}}))^{2} (89)
ηdADBB1B1CTr(K𝒩𝒩(GAA¯))2\displaystyle\leq\eta\cdot d_{A}D_{B_{B_{1}...B_{\ell-1}}C}\mathrm{Tr}(\mathbb{P}_{K}\circ\mathcal{N}\circ...\mathcal{N}(G_{A\bar{A}}))^{2} (90)
\displaystyle\vdots (91)
dAdCηlTr(GAA¯)2.\displaystyle\leq d_{A}d_{C}\eta^{l}\mathrm{Tr}(G_{A\bar{A}})^{2}\,. (92)

By Cauchy-Schawrtz inequality, we can bound the trace norm of GABCG_{ABC} by the HS norm.

GABC1dABCTr(GABC)2ηdA2Tr(GAA¯)2.\lVert G_{ABC}\rVert_{1}\leq d_{ABC}\mathrm{Tr}(G_{ABC})^{2}\leq\eta^{\ell}d_{A}^{2}\mathrm{Tr}(G_{A\bar{A}})^{2}. (93)

Here we used dA=dCd_{A}=d_{C}.

Since GABC1\|G_{ABC}\|_{1} is exponentially suppressed, ρABC\rho_{ABC} is close to ρABτC\rho_{AB}\otimes\tau_{C} for large \ell. The next step is to convert to CMIs, and clearly ρABτC\rho_{AB}\otimes\tau_{C} has zero CMI. From the continuity of entropy, the difference of the CMI of two states are bounded as follows.

Lemma IV.2 (Continuity of CMI).

If ρABC=σABC+GABC\rho_{ABC}=\sigma_{ABC}+G_{ABC} and dABCTr(GABC)2ϵ1/ed_{ABC}\mathrm{Tr}(G_{ABC})^{2}\leq\epsilon\leq 1/\mathrm{e}, then

|I(A:C|B)ρI(A:C|B)σ|4log(dABC)ϵ4log(ϵ)ϵ.|I(A:C|B)_{\rho}-I(A:C|B)_{\sigma}|\leq 4\log(d_{ABC})\epsilon-4\log(\epsilon)\epsilon. (94)

By setting σABC=ρABτC\sigma_{ABC}=\rho_{AB}\otimes\tau_{C} and taking large enough \ell such that the deviation GABCG_{ABC} is small enough ηdA2Tr(GAA¯)21/e\eta^{\ell}d_{A}^{2}\mathrm{Tr}(G_{A\bar{A}})^{2}\leq 1/\mathrm{e}, we obtain the exponential decay of the CMI:

I(A:C|B)ρ\displaystyle I(A:C|B)_{\rho} 4(log(dB1)+2log(dA))ηdA2Tr(GAA¯)2\displaystyle\leq 4(\log(d_{B_{1}})\ell+2\log(d_{A}))\eta^{\ell}d_{A}^{2}\mathrm{Tr}(G_{A\bar{A}})^{2}
4(log(η)+log(dA2Tr(GAA¯)2))ηdA2Tr(GAA¯)2\displaystyle\hskip 99.58464pt-4\left(\ell\log(\eta)+\log(d^{2}_{A}\mathrm{Tr}(G_{A\bar{A}})^{2})\right)\eta^{\ell}d_{A}^{2}\mathrm{Tr}(G_{A\bar{A}})^{2} (95)
O(η).\displaystyle\leq O(\ell\eta^{\ell}). (96)

Lastly, we alternatively characterize η\eta by a spectral property:

Lemma IV.3.

For a bistochastic channel \mathcal{E}, the second largest singular value (which coincides with eigenvalue due to being self-adjoint) of the Petz-recovery channel composed with the channel 𝒫τ,\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E} is exactly the contraction ratio η\eta.

η=λ2(𝒫τ,)=lim supρτD([ρ]τ)D(ρτ).\eta=\lambda_{2}(\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E})=\limsup_{\rho\rightarrow\tau}\frac{D(\mathcal{E}[\rho]\|\tau^{\prime})}{D(\rho\|\tau)}\,. (97)

Therefore η<1\eta<1 if and only if the largest singular value (or equivalently the eigenvalue) λ1=1\lambda_{1}=1 is unique. By Proposition II.1, the unit eigenvalue subspace of λ1\lambda_{1} being one-dimensional is equivalent to the correctable algebra being trivial.

𝒜()=Iλ2(𝒫τ,)=η<1.\mathcal{A}(\mathcal{E})=\mathbb{C}I\iff\lambda_{2}(\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E})=\eta<1\,. (98)

This completes the second statement of the proof. ∎

IV.2 Theorem III.2: decay of trace norm CMI for partially invariant channel

The proof structure is analogous to the previous Section IV.1, with the key distinction being the norm and the associated tools. Under the trace norm we have results for a slightly general family of channels satisfying 𝒩[νC]=σBνC\mathcal{N}[\nu_{C}]=\sigma_{B}\otimes\nu_{C}. Though, unlike the HS norm, the trace norm suffers from tensoring with an auxiliary system by a bounded factor of dimension of system CC (Lemma V.3). The contraction ratio obtained this way is likely not the most stringent, but at least it implies strict contraction of the CMI when the channel is sufficiently noisy. The proofs for the key lemmas are shown in Sec. V.3.

Proof.

First, we decompose the state into a product state ρABνC\rho_{AB}\otimes\nu_{C} plus some deviation GABCG_{ABC} traceless on system CC. Suppose

𝒩[KC]\displaystyle\mathcal{N}[K_{C}] =ν𝒩[KC]+(idν)𝒩[KC]=OBνC+{OBKC},\displaystyle=\mathbb{P}_{\nu}\circ\mathcal{N}[K_{C}]+(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}[K_{C}]=O_{B}\otimes\nu_{C}+\{O_{B}\otimes K_{C}\}, (99)

where ν:=νCTrC[]\mathbb{P}_{\nu}:=\nu_{C}Tr_{C}[\cdot] is the forgetful channel on subsystem C. Again we used KCK_{C} to denote an operator traceless on system CC, but we should note the difference from Sec. IV.1 that the projection (idν)(\rm id-\mathbb{P}_{\nu}) is not an orthogonal projection under HS norm. Writing the maximally entangled state as σAA¯=τAνA¯+{OAKA¯}\sigma_{A\bar{A}}=\tau_{A}\otimes\nu_{\bar{A}}+\{O_{A}\otimes K_{\bar{A}}\}, we obtain that

ρAB1C1\displaystyle\rho_{AB_{1}C_{1}} =𝒩[τAνA¯+{OAKA¯}]=τAσB1νC1+OAB1νC1+{OAB1KC1}\displaystyle=\mathcal{N}[\tau_{A}\otimes\nu_{\bar{A}}+\{O_{A}\otimes K_{\bar{A}}\}]=\tau_{A}\otimes\sigma_{B_{1}}\otimes\nu_{C_{1}}+O_{AB_{1}}\otimes\nu_{C_{1}}+\{O_{AB_{1}}\otimes K_{C_{1}}\}
=ρAB1νC1+{OAB1KC1},\displaystyle=\rho_{AB_{1}}\otimes\nu_{C_{1}}+\{O_{AB_{1}}\otimes K_{C_{1}}\}, (100)
ρAB1B2C1\displaystyle\rho_{AB_{1}B_{2}C_{1}} =𝒩[ρAB1C1]\displaystyle=\mathcal{N}[\rho_{AB_{1}C_{1}}] (101)
=ρAB1σB2νC2+OAB1OB2νC2+{OAB1OB2KC2}\displaystyle=\rho_{AB_{1}}\otimes\sigma_{B_{2}}\otimes\nu_{C_{2}}+O_{AB_{1}}\otimes O_{B_{2}}\otimes\nu_{C_{2}}+\{O_{AB_{1}}\otimes O_{B_{2}}\otimes K_{C_{2}}\}
=ρAB1B2νC2+{OAB1B2KC2},\displaystyle=\rho_{AB_{1}B_{2}}\otimes\nu_{C_{2}}+\{O_{AB_{1}B_{2}}\otimes K_{C_{2}}\}, (102)
ρAB1BC\displaystyle\rho_{AB_{1}\cdots B_{\ell}C_{\ell}} =τAσB1σB2νC\displaystyle=\tau_{A}\otimes\sigma_{B_{1}}\otimes\sigma_{B_{2}}\cdots\nu_{C_{\ell}}
+OAB1σB2νC\displaystyle+O_{AB_{1}}\otimes\sigma_{B_{2}}\cdots\nu_{C_{\ell}}
+\displaystyle+\cdots
+{OAB1OB2KC}\displaystyle+\{O_{AB_{1}}\otimes O_{B_{2}}\cdots\otimes K_{C_{\ell}}\}
=ρABνC+{OABKC}\displaystyle=\rho_{AB}\otimes\nu_{C_{\ell}}+\{O_{AB}\otimes K_{C_{\ell}}\} (103)
=ρABνC+(idν)𝒩(idν)𝒩(GAA¯).\displaystyle=\rho_{AB}\otimes\nu_{C_{\ell}}+(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}...(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}(G_{A\bar{A}}). (104)

Let GABC:=(idν)𝒩(idν)𝒩(GAA¯)G_{ABC}:=(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}...\circ(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}(G_{A\bar{A}}), which is the output of iteration of the map (idν)𝒩(idν)(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}\circ(\rm id-\mathbb{P}_{\nu}), where we multiply a copy of (idν)(\rm id-\mathbb{P}_{\nu}) as its square is equal to itself. The contraction ratio can be obtained by

GABC1(idν)𝒩(idν)cbGAA¯1,\lVert G_{ABC}\rVert_{1}\leq\lVert(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}\circ(\rm id-\mathbb{P}_{\nu})\rVert_{cb}^{\ell}\lVert G_{A\bar{A}}\rVert_{1}, (105)

where the completely bounded norm for a super-operator ϕ:dd\phi:\mathcal{M}_{d}\rightarrow\mathcal{M}_{d^{\prime}} is defined by555This is 1-1 super-operator norm with arbitrarily large ancilla.

ϕcb:=supk,X(ϕidk)[X]1X1.\displaystyle\lVert\phi\rVert_{cb}:=\sup_{k,X}\frac{\lVert(\phi\otimes id_{k})[X]\rVert_{1}}{\lVert X\rVert_{1}}. (106)

Unfortunately, the completely bounded norm of the difference of channels is not less than one in general. However, we do obtain a meaningful contraction bound for η1,C\eta_{1,C} small enough.

Lemma IV.4.

The completely bounded norm of the difference between channels is bounded by

(idν)𝒩(idν)cb2𝒩𝒩νcb16dCη1,C,\lVert(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}\circ(\rm id-\mathbb{P}_{\nu})\rVert_{cb}\leq 2\lVert\mathcal{N}-\mathcal{N}\circ\mathbb{P}_{\nu}\rVert_{cb}\leq 16d_{C}\eta_{1,C}, (107)

where η1,C\eta_{1,C} is the trace norm contraction ratio

η1,C:=supρC,ρC𝒩[ρC]𝒩[ρC]1ρCρC1.\eta_{1,C}:=\sup_{\rho_{C},\rho^{\prime}_{C}}\frac{\lVert\mathcal{N}[\rho_{C}]-\mathcal{N}[\rho^{\prime}_{C}]\lVert_{1}}{\lVert\rho_{C}-\rho^{\prime}_{C}\lVert_{1}}. (108)

The constant factor 1616 is crude, and the factor of dimension dCd_{C} is due to converting 1-1 superoperator norm to the diamond norm. From this lemma Eq. (105) reduces to

GABC1(16dCη1,C)GAA¯1.\lVert G_{ABC}\rVert_{1}\leq\left(16d_{C}\eta_{1,C}\right)^{\ell}\lVert G_{A\bar{A}}\rVert_{1}. (109)

By the triangle inequality we show that ρABC\rho_{ABC} becomes close to a product state in trace norm

ρABCρABνC1O((16ηCdC)).\left\|\rho_{ABC}-\rho_{AB}\otimes\nu_{C}\right\|_{1}\leq O\left((16\eta_{C}d_{C})^{\ell}\right). (110)

We conclude the proof by calling the continuity of CMI (Lemma IV.2) again. ∎

IV.3 Thoerem III.3: decay of BB-measured CMI in MPDO

First, 343\implies 4 is immediate from that the measured CMI is equal to the expectation of mutual information between AA and CC over measurement outcomes bb in B¯\bar{B},

I(A:C|B¯)ρ=bpB¯(b)I(A:C)b,\displaystyle I(A:C|\bar{B})_{\rho}=\sum_{b}p_{\bar{B}}(b)I(A:C)_{b}, (111)

where I(A:C)bI(A:C)_{b} is the mutual information of state ρAC|B¯=bTrB¯(|bb|ρABC)\rho_{AC|\,{\bar{B}}=b}\propto\mathrm{Tr}_{\bar{B}}(|b\rangle\langle b|\rho_{ABC}), defined on the Hilbert space of system AA and CC only, as we can individually discuss each classical outcome B¯=b\bar{B}=b.

We show the remaining two indications 121\implies 2 and 232\implies 3 in the following.

IV.3.1 232\implies 3: uniformly bounding every sequence quantum operations by contraction in Hilbert’s projective metric

Proof.

We show Condition 2 implies the mutual information I(A:C)bI(A:C)_{b} decays exponentially with a uniform decay rate cc for every outcome bb. As showed in the above, this implies the decay of CMI without touching the probability distribution p(b)p(b).

Recall that we have a set of CP-maps sk:CC\mathcal{M}_{s_{k}}:C\to C for each outcome sks_{k}. These maps sk\mathcal{M}_{s_{k}}(or the coarse-grained maps t:=sk+ξsk\mathcal{M}^{\prime}_{t}:=\mathcal{M}_{s_{k+\xi}}\circ\ldots\circ\mathcal{M}_{s_{k}}) are all contractions in the Hilbert’s projective metric h(a,b)h(a,b). A self-contained introduction and supporting theorems are included in Section V.4.1. The contraction ratio associated to a CP-map \mathcal{M} is given by

η:=supa,bS+h((a),(b))h(a,b)1,\displaystyle\eta_{\mathcal{M}}:=\sup\limits_{a,b\in S_{+}}\frac{h(\mathcal{M}(a),\mathcal{M}(b))}{h(a,b)}\leq 1, (112)

where the supremum is taken over S+S_{+}, the set of unnormalized positive semi-definite operators. Condition 2 guarantees that {t}\{\mathcal{M}^{\prime}_{t}\} has a strict contraction ratio at Eq. (112). After repeatedly applying t\mathcal{M}^{\prime}_{t}, any state is mapped to a fixed state (up to rescaling) with exponentially small error. Accounting the normalization and the conversion to trace norm, we obtain the following lemma.

Lemma IV.5.

If all {1,2,,n}\{\mathcal{M}_{1},\mathcal{M}_{2},\ldots,\mathcal{M}_{n}\} are CP self-maps that map any state to a full rank state, then for arbitrary sequence of b=s,,s1{1,,n}b=s_{\ell},\ldots,s_{1}\in\{1,\ldots,n\}^{\ell} and all ρ1,ρ2\rho_{1},\rho_{2}

b[ρ1]Tr(b[ρ1])b[ρ2]Tr(b[ρ2])1=O(ec),\displaystyle\left\|\frac{\mathcal{M}_{b}[\rho_{1}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{1}])}-\frac{\mathcal{M}_{b}[\rho_{2}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{2}])}\right\|_{1}=O(e^{-c\ell}), (113)

where the exponent cc is independent of bb.

This statement implies the bipartite state ρAC,b\rho_{AC,b} being close to a product state in the trace norm.

Lemma IV.6.

Suppose the CP map b:CC\mathcal{M}_{b}:C^{\prime}\rightarrow C is contractive in the sense that for all ρ1,ρ2\rho_{1},\rho_{2},

b[ρ1]Tr(b[ρ1])b[ρ2]Tr(b[ρ2])1ϵ.\left\lVert\frac{\mathcal{M}_{b}[\rho_{1}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{1}])}-\frac{\mathcal{M}_{b}[\rho_{2}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{2}])}\right\rVert_{1}\leq\epsilon. (114)

Then the state ρAC:=b[σAC]/Tr(b[σA])\rho_{AC}:=\mathcal{M}_{b}[\sigma_{AC^{\prime}}]/\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{A}]) is close to the product state in the trace distance

2Tb:=ρAC,bρA,bρC,b14dCσC1ϵ.\displaystyle 2T_{b}:=\lVert\rho_{AC,b}-\rho_{A,b}\otimes\rho_{C,b}\rVert_{1}\leq 4d_{C^{\prime}}\lVert\sigma_{C^{\prime}}^{-1}\rVert\epsilon. (115)

Plugging σC=τC\sigma_{C^{\prime}}=\tau_{C^{\prime}}, and by the Alicki–Fannes–Winter inequality winter2015tight (see also (wilde2011classical, , Theorem 11.10.3)), we convert the O(ec)O(e^{-c\ell}) bound on the trace norm to the mutual information with some constant overhead. ∎

IV.3.2 121\implies 2: uniformly bounding coarse graining length by generalizing quantum Wielandt’s inequality to family of CP self-maps

Recall our goal is to show that there exists a coarse-graining length ξ\xi such that for any sequence t=(s,,s1)t=(s_{\ell},...,s_{1}), the coarse grained CP map t:=sls1\mathcal{M}^{\prime}_{t}:=\mathcal{M}_{s_{l}}\cdots\mathcal{M}_{s_{1}} is strictly positive for all lξl\geq\xi. First, we convert strict positivity to having full Kraus rank.

Lemma IV.7.

If span{K1,,Kp}=Mat(D,D){\rm span}\{K_{1},\cdots,K_{p}\}=Mat(D,D) then 𝒩[ρ]:=iKiρKi>0,ρ\mathcal{N}[\rho]:=\sum_{i}K_{i}\rho K_{i}^{\dagger}>0,\forall\rho.

Our strategy to a uniform bound on ξ\xi now relies on demanding for each sks_{k}, sk[]=iEiskEisk\mathcal{M}_{s_{k}}[\cdot]=\sum_{i}E_{i}^{s_{k}}\cdot E_{i}^{s_{k}\dagger} must increase the dimension of span of Kraus operators untill reaching full rank.

Lemma IV.8 (condition 1 implies the increment of span of Kraus operators).

Consider a CP self-map \mathcal{M} with p>2p>2 Kraus operators {E1,,Ep}\mathcal{M}\sim\{E_{1},\cdots,E_{p}\} such that EpE_{p} is invertible. Suppose that ((Ep)1E1,,(Ep)1Ep1)\left((E_{p})^{-1}E_{1},\cdots,(E_{p})^{-1}E_{p-1}\right) generate full matrix algebra Mat(D,D)Mat_{\mathbb{C}}(D,D) by addition and multiplication. Then for any set of Kraus operators {Tj}\{T_{j}\} containing an invertible element and whose Kraus rank is not full, the dimension of span must increase after applying {E1,,Ep}\mathcal{M}\sim\{E_{1},\cdots,E_{p}\}

dim(span{EiTj})>dim(span{Tj}).{\rm dim}({\rm span}\{E_{i}T_{j}\})>{\rm dim}({\rm span}\{T_{j}\}). (116)

We are now ready to prove 121\implies 2:

Proof.

For any sequence t=(s,,s1)t=(s_{\ell},...,s_{1}), consider the linear span of Kraus operators of the product sks2s1\mathcal{M}_{s_{k}}\cdots\mathcal{M}_{s_{2}}\mathcal{M}_{s_{1}} (1k1\leq k\leq\ell)

𝒮k:=span{EqkskEq2s2Eq1s1},\mathcal{S}_{k}:={\rm span}\{E^{s_{k}}_{q_{k}}\cdots E^{s_{2}}_{q_{2}}E^{s_{1}}_{q_{1}}\},

where {Eq1s1}q1\{E^{s_{1}}_{q_{1}}\}_{q_{1}} are the Kraus operators of the CP map s1\mathcal{M}_{s_{1}}. We have shorthanded the above as 𝒮k\mathcal{S}_{k}, while keeping in mind its dependence on the sequence sks1s_{k}\cdots s_{1}. Applying sk+1\mathcal{M}_{s_{k+1}} amounts to left-multiplying its Kraus operators, giving the new span of Kraus operators:

Sk+1={E1sk+1𝒮k,,Ep1sk+1𝒮k}.S_{k+1}=\{E^{s_{k+1}}_{1}\mathcal{S}_{k},\cdots,E^{s_{k+1}}_{p-1}\mathcal{S}_{k}\}.

By Lemma IV.8, multiplying sk+1\mathcal{M}_{s_{k+1}} must increase the dimension of span of Kraus operators. Therefore, after constant ξ:=D2p+1\ell\geq\xi:=D^{2}-p+1 steps, the product CP map ss2s1\mathcal{M}_{s_{\ell}}\cdots\mathcal{M}_{s_{2}}\mathcal{M}_{s_{1}} must reach full Kraus rank and hence becomes strictly positive (Proposition V.3). Importantly, the same coarse graining length ξ\xi works for all sequences (sξ,,s1)(s_{\xi},\cdots,s_{1}), which establishes Condition 2. Note that since ξ\xi is finite, the contraction ratios of t\mathcal{M}_{t} has a global bound being strictly less than one.

V Proofs for remaining propositions and lemmas

Here we compile the remaining proofs and lemmas, categorized by the theorem they are supporting: Theorem III.1 at Section V.1; Theorem III.2 at Section V.3; Theorem III.3 at Section V.4.

V.1 Remaining Proof for Theorem III.1

V.1.1 Proof of Lemma IV.1, DPI for 2-norm

Proof.

By definition, the contraction can be expressed by the completely bounded superoperator 2-2 norm of the channel.

Tr(dEC𝒩C[K]EC2)\displaystyle\mathrm{Tr}(d_{EC^{\prime}}\mathcal{N}_{C}[K]^{2}_{EC}) (𝒩K)222dECdECTr(dECKEC2)\displaystyle\leq\lVert\mathcal{I}\otimes(\mathcal{N}\circ\mathbb{P}_{K})\rVert_{2-2}^{2}\frac{d_{EC^{\prime}}}{d_{EC}}\mathrm{Tr}(d_{EC}K^{2}_{EC}) (117)
=(𝒩K)222dCdCTr(dECKEC2).\displaystyle=\lVert\mathcal{I}\otimes(\mathcal{N}\circ\mathbb{P}_{K})\rVert_{2-2}^{2}\frac{d_{C^{\prime}}}{d_{C}}\mathrm{Tr}(d_{EC}K^{2}_{EC}). (118)

The superoperator 222-2 norm is equal to the largest-singular-value of the map (with input restricted to be traceless on system CC) w.r.t. the 2-norm of operators. Tensoring with identity does not change the leading singular value, and hence the completely bounded 222-2 norm can be evaluated without auxiliary system as

(𝒩K)222dCdC\displaystyle\lVert\mathcal{I}\otimes(\mathcal{N}\circ\mathbb{P}_{K})\rVert_{2-2}^{2}\frac{d_{C^{\prime}}}{d_{C}} =𝒩K222dCdC\displaystyle=\lVert\mathcal{N}\circ\mathbb{P}_{K}\rVert_{2-2}^{2}\frac{d_{C^{\prime}}}{d_{C}} (119)
=supKC,Tr(KC)=0Tr(𝒩C[K]C2)Tr(KC2)dCdC\displaystyle=\sup_{K_{C},\mathrm{Tr}(K_{C})=0}\frac{\mathrm{Tr}(\mathcal{N}_{C}[K]^{2}_{C})}{\mathrm{Tr}(K^{2}_{C})}\cdot\frac{d_{C^{\prime}}}{d_{C}} (120)
=limρCτCsupD(𝒩[ρ]τC)D(ρCτC),\displaystyle=\lim_{\rho_{C}\rightarrow\tau_{C}}\sup\frac{D(\mathcal{N}[\rho]\|\tau_{C^{\prime}})}{D(\rho_{C}\|\tau_{C})}, (121)

In the last equality we convert the 2-2 norm to the perturbation of relative entropy

D(ρτ)=S(ρ)+log(d)=12Tr(dK2)+O(K3).\displaystyle D(\rho\|\tau)=-S(\rho)+\log(d)=\frac{1}{2}\mathrm{Tr}(dK^{2})+O(K^{3}). (122)

The O(K3)O(K^{3}) and higher order terms vanish in the limit because both spaces CC and CC^{\prime} have finite dimension, which guarantee the operator norm vanishes K0\lVert K\rVert\rightarrow 0 in the limit of ρCτC\rho_{C}\to\tau_{C}. ∎

V.1.2 Proof of Proposition IV.2, the continuity of CMI

The proof is based on the continuity of entropy w.r.t. the normalized H-S norm

Lemma V.1 (Continuity of entropy).
|S(ρ)S(σ)|\displaystyle|S(\rho)-S(\sigma)| ρσ1log(d)ρσ1log(ρσ1)\displaystyle\leq\lVert\rho-\sigma\rVert_{1}\log(d)-\lVert\rho-\sigma\rVert_{1}\log(\lVert\rho-\sigma\rVert_{1}) (123)
dρσ22(log(d)log(dρσ22)).\displaystyle\leq d\lVert\rho-\sigma\rVert_{2}^{2}\left(\log(d)-\log(d\lVert\rho-\sigma\rVert_{2}^{2})\right). (124)

Taking partial trace does not increase the normalized HS norm.

Proposition V.1.
dATr(GA2)dABTr(GAB2).\displaystyle d_{A}\mathrm{Tr}(G_{A}^{2})\leq d_{AB}\mathrm{Tr}(G_{AB}^{2}). (125)
Proof.

Decompose othogonally GAB=τAGB+{KAOB}G_{AB}=\tau_{A}\otimes G_{B}+\{K_{A}\otimes O_{B}\} so that

dABTr(GAB2)=dABTr(τA2GB2+(KAOB)2)dABTr(τA2GB2)=dBTr(GB2).\displaystyle d_{AB}\mathrm{Tr}(G_{AB}^{2})=d_{AB}\mathrm{Tr}(\tau_{A}^{2}\otimes G_{B}^{2}+(K_{A}\otimes O_{B})^{2})\geq d_{AB}\mathrm{Tr}(\tau_{A}^{2}\otimes G_{B}^{2})=d_{B}\mathrm{Tr}(G_{B}^{2}). (126)

We can now prove the continuity of CMI, Proposition IV.2.

Proof.

We can expand the CMI as

I(A:C|B)ρI(A:C|B)σ\displaystyle I(A:C|B)_{\rho}-I(A:C|B)_{\sigma} (127)
=(Sρ(ABC)Sσ(ABC))(Sρ(AB)Sσ(AB))(Sρ(BC)Sσ(BC))+(Sρ(B)Sσ(B)).\displaystyle=(S_{\rho}(ABC)-S_{\sigma}(ABC))-(S_{\rho}(AB)-S_{\sigma}(AB))-(S_{\rho}(BC)-S_{\sigma}(BC))+(S_{\rho}(B)-S_{\sigma}(B)). (128)

By the continuity of entropy (Lemma V.1), it holds that

|(Sρ(AB)Sσ(AB))|\displaystyle|(S_{\rho}(AB)-S_{\sigma}(AB))| dABGAB22(log(d)log(dABGAB22))\displaystyle\leq d_{AB}\lVert G_{AB}\rVert_{2}^{2}\left(\log(d)-\log(d_{AB}\lVert G_{AB}\rVert_{2}^{2})\right) (129)
dABCGABC22(log(d)log(dABCGABC22))\displaystyle\leq d_{ABC}\lVert G_{ABC}\rVert_{2}^{2}\left(\log(d)-\log(d_{ABC}\lVert G_{ABC}\rVert_{2}^{2})\right) (130)
log(dABC)ϵlog(ϵ)ϵ.\displaystyle\leq\log(d_{ABC})\epsilon-\log(\epsilon)\epsilon. (131)

In the second inequality we used Proposition V.1 and that function xlog(x)-x\log(x) is monotonically increasing at x1/ex\leq 1/e. The same argument holds for all four terms. ∎

V.1.3 Proof of Lemma IV.3, spectral characterization of the contraction ratio

Proof.

We expand the Petz-recovered map

𝒫τ,[]\displaystyle\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E}[\cdot] =τ1/2[[τ]1/2[][τ]1/2]τ1/2\displaystyle=\tau^{1/2}\mathcal{E}^{\dagger}[\mathcal{E}[\tau]^{-1/2}\mathcal{E}[\cdot]\mathcal{E}[\tau]^{-1/2}]\tau^{1/2} (132)
=dd[[]]\displaystyle=\frac{d^{\prime}}{d}\mathcal{E}^{\dagger}[\mathcal{E}[\cdot]] (133)

where 𝒩\mathcal{N} being bistochastic substantially simplifies the expression, making it self-adjoint and positive. Hence, the spectrum is positive and the second eigenvalue/singular value, expressed by removing the trace coincides with the relative entropy characterization Eq.(121)

λ2(𝒫τ,)=ddK222=lim supρCτCD([ρC]τC)D(ρCτC).\lambda_{2}(\mathcal{P}_{\tau,\mathcal{E}}\mathcal{E})=\frac{d^{\prime}}{d}\lVert\mathcal{E}\mathbb{P}_{K}\rVert^{2}_{2-2}=\limsup_{\rho_{C}\rightarrow\tau_{C}}\frac{D(\mathcal{E}[\rho_{C}]\|\tau_{C^{\prime}})}{D(\rho_{C}\|\tau_{C})}. (134)

V.2 Proofs for propositions around Conjecture III.1

V.2.1 Proof of Proposition III.3

We prove a slightly more general lemma that includes the case when the correctable algebra is not trivial, which immediately converts to Proposition III.3. Some necessary background are at Appendix A.

Lemma V.2.

For all tripartite state ρABC\rho_{ABC} and channel acting on system CC only :CD\mathcal{E}:C\rightarrow D

I(A:BC)ρ=I(A:BD)[ρ]I(A:Bγ)ρ=I(A:BC)ρ,I(A:BC)_{\rho}=I(A:BD)_{\mathcal{E}[\rho]}\iff I(A:B\gamma)_{\rho}=I(A:BC)_{\rho}, (135)

where γ:=𝒜()\gamma:=\mathcal{A}(\mathcal{E}) is the correctable algebra of \mathcal{E}, and I(A:Bγ)ρ:=I(A:BC)Eγ(ρ)I(A:B\gamma)_{\rho}:=I(A:BC)_{E_{\gamma}(\rho)} with Eγ:CγE_{\gamma}:C\to\gamma the conditional expectation (see e.g.,  TakesakiI ) onto the subalgebra γ\gamma 666Sometimes EγE_{\gamma} is called the restriction to subalgebra (see e.g. Chen_2020 ). If γ\gamma is trivial then this is taking partial trace over CC..

In words, if a channel does not decrease the mutual information of the state, then the correlation with system AA must be perfectly stored in the correctable algebra (B)γ=𝒜(BC)\mathcal{B}(\mathcal{H}_{B})\otimes\gamma=\mathcal{A}(\mathcal{I}_{B}\otimes\mathcal{E}_{C}). Operationally, the LHS implies that from reduced state Eγ(ρABC)E_{\gamma}(\rho_{ABC}) there is some recovery channel R:BγBCR:B\gamma\rightarrow BC that recovers the full state ρABC\rho_{ABC}.

Proof.

We start with the recoverability theorem(see, e.g.  (wilde2011classical, , Corollary 12.5.1))

I(A:BC)ρ=I(A:BD)[ρ]\displaystyle I(A:BC)_{\rho}=I(A:BD)_{\mathcal{E}[\rho]} D(ρABCρAρBC)=D(C[ρABC]ρAC[ρBC])\displaystyle\iff D(\rho_{ABC}\|\rho_{A}\otimes\rho_{BC})=D(\mathcal{E}_{C}[\rho_{ABC}]\|\rho_{A}\otimes\mathcal{E}_{C}[\rho_{BC}]) (136)
𝒫ρAρBC,[ρABC]=ρABC\displaystyle\implies\mathcal{P}_{\rho_{A}\otimes\rho_{BC},\mathcal{E}}\circ\mathcal{E}[\rho_{ABC}]=\rho_{ABC} (137)

where we are using the Petz map with reference state ρAρBC\rho_{A}\otimes\rho_{BC}, expanded explicitly as follows

𝒫ρAρBC,[ρ]\displaystyle\mathcal{P}_{\rho_{A}\otimes\rho_{BC},\mathcal{E}}[\rho] =ρAρBC[[ρAρBC]1ρ[ρAρBC]1]ρAρBC\displaystyle=\sqrt{\rho_{A}\otimes\rho_{BC}}\mathcal{E}^{\dagger}[\sqrt{\mathcal{E}[\rho_{A}\otimes\rho_{BC}]}^{-1}\rho\sqrt{\mathcal{E}[\rho_{A}\otimes\rho_{BC}]}^{-1}]\sqrt{\rho_{A}\otimes\rho_{BC}} (138)
=ρBC[[ρBC]1ρ[ρBC]1]ρBC=A𝒫ρBC,[ρ]\displaystyle=\sqrt{\rho_{BC}}\mathcal{E}^{\dagger}[\sqrt{\mathcal{E}[\rho_{BC}]}^{-1}\rho\sqrt{\mathcal{E}[\rho_{BC}]}^{-1}]\sqrt{\rho_{BC}}=\mathcal{I}_{A}\otimes\mathcal{P}_{\rho_{BC},\mathcal{E}}[\rho] (139)

The exact form of the Petz map does not mean so much for us here, and all we care is the recovery map 𝒫ρAρBC,\mathcal{P}_{\rho_{A}\otimes\rho_{BC},\mathcal{E}} only acts on subsystem BCBC due to factorization of the reference state ρAρBC\rho_{A}\otimes\rho_{BC}. Therefore we have the invariant subspace equation for 𝒵BC=𝒫ρBC,\mathcal{Z}_{BC}=\mathcal{P}_{\rho_{BC},\mathcal{E}}\circ\mathcal{E} and ρABC\rho_{ABC},

A𝒵BC[ρABC]=ρABC.\mathcal{I}_{A}\otimes\mathcal{Z}_{BC}[\rho_{ABC}]=\rho_{ABC}. (140)

Suppose the fixed point algebra has factors α\alpha,

S𝒵BC\displaystyle S_{\mathcal{Z}_{BC}^{\dagger}} =αB(α)Iα¯B(BC).\displaystyle=\bigoplus_{\alpha}B(\mathcal{H}_{\alpha})\otimes I_{\bar{\alpha}}\subset B(\mathcal{H}_{BC})\,. (141)

where the α\alphas are dependent on 𝒵BC\mathcal{Z}_{BC}, and α¯\bar{\alpha} labels the factors of the commutant of S𝒵S_{\mathcal{Z}^{\dagger}} and each α¯\bar{\alpha} has one-to-one correspondence with α\alpha. Then since SA𝒵BC=(A)S𝒵BCS_{\mathcal{I}_{A}\otimes\mathcal{Z}_{BC}^{\dagger}}=\mathcal{B}(\mathcal{H}_{A})\otimes S_{\mathcal{Z}_{BC}^{\dagger}}, by Theorem A.1, ρABC\rho_{ABC} must have the Markovian structure characterized by α\alphas as follows:

ρABC=αpαρAασα¯.\rho_{ABC}=\bigoplus_{\alpha}p_{\alpha}\rho_{A\alpha}\otimes\sigma_{\bar{\alpha}}\,. (142)

To know how α\alphas are embedding in system BCBC, certainly S𝒵BCS_{\mathcal{Z}_{BC}^{\dagger}} are by definition correctable, i.e. subalgebra of the correctable algebra of B\mathcal{I}_{B}\otimes\mathcal{E}. Hence by Proposition A.2, denoting the factors of the correctable algebra γ=𝒜()\gamma=\mathcal{A}(\mathcal{E}) by β\beta,

S𝒵BCβB(Bβ)Iβ¯S_{\mathcal{Z}_{BC}^{\dagger}}\subset\bigoplus_{\beta}B(\mathcal{H}_{B\beta})\otimes I_{\bar{\beta}} (143)

Taking the dual, this is saying restricting to subalgebra BγB\gamma keeps the components ρAα\rho_{A\alpha} intact, and thus does not change the mutual information between BCBC and AA,

I(A:Bγ)ρ=I(A:BC),I(A:B\gamma)_{\rho}=I(A:BC), (144)

completing the proof. ∎

We can now prove Proposition III.3, where \mathcal{E} has trivial correctable algebra. Rewriting Lemma V.2,

I(A:C|B)ρ=I(A:BC)ρI(A:B)ρ\displaystyle I(A:C|B)_{\rho}=I(A:BC)_{\rho}-I(A:B)_{\rho} >I(A:BD)[ρ]I(A:B)ρ=I(A:C|B)[ρ]\displaystyle>I(A:BD)_{\mathcal{E}[\rho]}-I(A:B)_{\rho}=I(A:C^{\prime}|B)_{\mathcal{E}[\rho]} (145)
I(A:B)ρ<I(A:BC)ρ\displaystyle\iff I(A:B)_{\rho}<I(A:BC)_{\rho} (146)
I(A:C|B)ρ>0\displaystyle\iff I(A:C|B)_{\rho}>0 (147)

,where the strict inequalities are from taking the negation of equality.

V.2.2 Proof of Proposition III.4

The proof uses standard manipulation of CMI:

I(A:C3|B1B2B3)\displaystyle I(A:C_{3}|B_{1}B_{2}B_{3}) =I(A:B1B2B3C3)I(A:B1B2B3)\displaystyle=I(A:B_{1}B_{2}B_{3}C_{3})-I(A:B_{1}B_{2}B_{3}) (148)
I(A:B1B2B3C3)I(A:B1B2)\displaystyle\leq I(A:B_{1}B_{2}B_{3}C_{3})-I(A:B_{1}B_{2}) (149)
η(I(A:B1B2C2)I(A:B1B2))\displaystyle\leq\eta(I(A:B_{1}B_{2}C_{2})-I(A:B_{1}B_{2})) (150)
=ηI(A:C2|B1B2).\displaystyle=\eta I(A:C_{2}|B_{1}B_{2}). (151)

where in the first inequality we used the monotonicity of the mutual information. The second inequality uses Conjecture III.1 for =𝒩:C2B3C3,B=B1B2,C=C2\mathcal{E}=\mathcal{N}:C_{2}\rightarrow B_{3}C_{3},B=B_{1}B_{2},C=C_{2}. For each tripartition separated by mm sites, we obtain the exponential decay of CMI by iterating this argument mm times. The proof is identical for the trace norm CMI.

V.2.3 Proof of Proposition III.2

Proof.

It suffices to show 𝒩\mathcal{N} satisfies the DPI conjecture I.1, using joint convexity of relative entropy

I(A:BC)𝒩C[ρ]\displaystyle I(A:BC^{\prime})_{\mathcal{N}_{C}[\rho]} =D(𝒩[ρABC]ρA𝒩[ρBC])\displaystyle=D\left(\mathcal{N}[\rho_{ABC}]\left\|\rho_{A}\otimes\mathcal{N}[\rho_{BC}]\right.\right)
=D((1η)F[ρABC]+η𝒩[ρABC](1η)ρAF[ρBC]+ηρA𝒩[ρBC])\displaystyle=D\left((1-\eta)F[\rho_{ABC}]+\eta\mathcal{N}^{\prime}[\rho_{ABC}]\left\|(1-\eta)\rho_{A}\otimes F[\rho_{BC}]+\eta\rho_{A}\mathcal{N}^{\prime}[\rho_{BC}]\right.\right)
(1η)D(F[ρABC]ρAF[ρBC])+ηD(𝒩[ρABC]ρA𝒩[ρBC])\displaystyle\leq(1-\eta)D(F[\rho_{ABC}]\|\rho_{A}\otimes F[\rho_{BC}])+\eta D(\mathcal{N}^{\prime}[\rho_{ABC}]\|\rho_{A}\otimes\mathcal{N}^{\prime}[\rho_{BC}])
=(1η)D(ρABσρAρBσ)+ηD(𝒩[ρABC]ρA𝒩[ρBC])\displaystyle=(1-\eta)D(\rho_{AB}\otimes\sigma\|\rho_{A}\otimes\rho_{B}\otimes\sigma)+\eta D(\mathcal{N}^{\prime}[\rho_{ABC}]\|\rho_{A}\otimes\mathcal{N}^{\prime}[\rho_{BC}])
(1η)D(ρABρAρB)+ηD(ρABCρAρBC)\displaystyle\leq(1-\eta)D(\rho_{AB}\|\rho_{A}\otimes\rho_{B})+\eta D(\rho_{ABC}\|\rho_{A}\otimes\rho_{BC}) (152)
=I(A:B)+η(I(A:BC)I(A:B)).\displaystyle=I(A:B)+\eta\left(I(A:BC)-I(A:B)\right)\,. (153)

In the first inequality we used the joint convexity of relative entropy, and in the second inequality we used the DPI under 𝒩\mathcal{N}^{\prime}. We conclude the proof by moving I(A:B)I(A:B) back to the LHS. The joint convexity holds for the trace-norm CMI as well and the lines are identical. Now we follow the lines in Sec. V.2.2 to get step-wise decay of CMI and then the decay of CMI for each tripartition. ∎

V.2.4 Proof of Proposition III.5

Proof.

We control the completely bounded superoperator 111-1 norm of B𝒩\mathcal{I}_{B}\otimes\mathcal{N} by using the following lemma:

Lemma V.3 ((paulsen_2003, , Section 3.11)).

For arbitrary map ϕ:dd\phi:\mathcal{M}_{d}\rightarrow\mathcal{M}_{d^{\prime}}, the completely bounded superoperator 111-1 norm is at most the dimension dd times the 111-1 norm.

ϕcb:=supk,X(ϕidk)[X]1X1dsupXϕ[X]1X1=dϕ11.\lVert\phi\rVert_{cb}:=\sup_{k,X}\frac{\lVert(\phi\otimes id_{k})[X]\rVert_{1}}{\lVert X\rVert_{1}}\leq d\sup_{X}\frac{\lVert\phi[X]\rVert_{1}}{\lVert X\rVert_{1}}=d\lVert\phi\rVert_{1-1}. (154)

Then the proof of Prop. III.5 simply follows as

𝒩[ρBCρBρC]1\displaystyle\lVert\mathcal{N}[\rho_{BC}-\rho_{B}\otimes\rho_{C}]\rVert_{1} =(𝒩𝒩(ρCTrC))[ρBCρBρC]1\displaystyle=\Big{\lVert}(\mathcal{N}-\mathcal{N}\circ(\rho_{C}Tr_{C}))[\rho_{BC}-\rho_{B}\otimes\rho_{C}]\Big{\rVert}_{1} (155)
4ηdρBCρBρC1,\displaystyle\leq 4\eta d\lVert\rho_{BC}-\rho_{B}\otimes\rho_{C}\rVert_{1}, (156)

where in the first line we insert a vanishing term ρCTrC[ρBC]ρBρC=0\rho_{C}Tr_{C}[\rho_{BC}]-\rho_{B}\otimes\rho_{C}=0, and in the second line we bounded the completely bounded trace norm using Lemma V.3. ∎

V.3 Remaining proof for Theorem III.2

V.3.1 Proof of Lemma IV.4

Proof.
(idν)𝒩(idν)cb\displaystyle\lVert(\rm id-\mathbb{P}_{\nu})\circ\mathcal{N}\circ(\rm id-\mathbb{P}_{\nu})\rVert_{cb} 2𝒩𝒩νcb\displaystyle\leq 2\lVert\mathcal{N}-\mathcal{N}\circ\mathbb{P}_{\nu}\rVert_{cb} (157)
2dC𝒩𝒩ν11\displaystyle\leq 2d_{C}\lVert\mathcal{N}-\mathcal{N}\circ\mathbb{P}_{\nu}\rVert_{1-1} (158)
8dCsupρS+(𝒩𝒩ν)[ρ]1\displaystyle\leq 8d_{C}\sup_{\rho\in S^{+}}\lVert(\mathcal{N}-\mathcal{N}\circ\mathbb{P}_{\nu})[\rho]\rVert_{1} (159)
8dCsupρS+𝒩[ρνρ]1\displaystyle\leq 8d_{C}\sup_{\rho\in S^{+}}\lVert\mathcal{N}[\rho-\mathbb{P}_{\nu}\rho]\rVert_{1} (160)
16dCsupρ,ρ𝒩[ρ]𝒩[ρ]1ρρ1\displaystyle\leq 16d_{C}\sup_{\rho,\rho^{\prime}}\frac{\lVert\mathcal{N}[\rho]-\mathcal{N}[\rho^{\prime}]\lVert_{1}}{\lVert\rho-\rho^{\prime}\lVert_{1}} (161)
16dCη1,C\displaystyle\leq 16d_{C}\eta_{1,C} (162)

We used idνcb2\lVert\rm id-\mathbb{P}_{\nu}\rVert_{cb}\leq 2 in the first inequality. The second inequality follows from Lemma V.3. In the third inequality, we convert the optimization over operator X1\lVert X\rVert_{1} into positive ρ\rho by a factor of 4. This chain of inequalities is largely along the lines of (James2015QuantumMC, , Theorem 45). ∎

V.3.2 Proof of Proposition III.1

We get the decay of trace norm CMI I1(A:C|B)I_{1}(A:C|B) from the continuity of the trace norm CMI.

Lemma V.4 (Continuity of trace norm CMI).

If ρABC=σABC+GABC\rho_{ABC}=\sigma_{ABC}+G_{ABC}, and the deviation is traceless on system CC, TrC(GABC)=0Tr_{C}(G_{ABC})=0, and bounded as GABC1ϵ\lVert G_{ABC}\rVert_{1}\leq\epsilon, then

|I1(A:C|B)ρI1(A:C|B)σ|2ϵ.|I_{1}(A:C|B)_{\rho}-I_{1}(A:C|B)_{\sigma}|\leq 2\epsilon. (163)
Proof.

All we need is the triangle inequality, which makes the analysis much simpler than Lemma IV.2.

|I1(A:C|B)ρI1(A:C|B)σ|\displaystyle|I_{1}(A:C|B)_{\rho}-I_{1}(A:C|B)_{\sigma}| (164)
=|σABC+GABCσAσBCσAGBC1σABσAσB1|\displaystyle=|\lVert\sigma_{ABC}+G_{ABC}-\sigma_{A}\otimes\sigma_{BC}-\sigma_{A}\otimes G_{BC}\rVert_{1}-\lVert\sigma_{AB}-\sigma_{A}\otimes\sigma_{B}\rVert_{1}| (165)
(σABCσAσBC1σABσAσB1)\displaystyle-(\lVert\sigma_{ABC}-\sigma_{A}\otimes\sigma_{BC}\rVert_{1}-\lVert\sigma_{AB}-\sigma_{A}\otimes\sigma_{B}\rVert_{1}) (166)
GABCσAGBC1\displaystyle\leq\lVert G_{ABC}-\sigma_{A}\otimes G_{BC}\rVert_{1} (167)
GABCσAGBC12GABC1.\displaystyle\leq\lVert G_{ABC}-\sigma_{A}\otimes G_{BC}\rVert_{1}\leq 2\lVert G_{ABC}\rVert_{1}. (168)

The C-traceless assumption reduced the expression that ρAB=σAB\rho_{AB}=\sigma_{AB}. ∎

To show Proposition III.1, choosing σABC=ρABτC\sigma_{ABC}=\rho_{AB}\otimes\tau_{C} for Theorem III.1, σABC=ρABνC\sigma_{ABC}=\rho_{AB}\otimes\nu_{C} for Theorem III.2 in the above we complete the proof.

V.4 Remaining proof of Theorem III.3: decay of BB-measured conditional mutual information in MPDO

In subsection V.4.1, we provide the sufficient background for applying Hilbert’s projective metric to show Condition 2 implies exponential decay of CMI; starting from subsection V.4.3 are the details for 121\implies 2.

V.4.1 The Hilbert’s projective metric and proof of Lemma IV.5

The CP-self maps sk\mathcal{M}_{s_{k}} arising from measurement are not trace-preserving, hindering it difficult to approach from typical quantum information tools. It turns out the Hilbert’s projective metric is suitable for this purpose, as it is designed to work for the set of all unnormalized states S+:={ρ()|ρ0,ρ0}S_{+}:=\{\rho\in\mathcal{B}(\mathcal{H})|\rho\geq 0,\rho\neq 0\}. While the general theory applies to convex cones, we will focus on the quantum case S+S_{+}, following partly the ideas in  doi:10.1063/1.3615729 .

Definition 3 (Hilbert’s projective metric).
a,bS,h(a,b):=ln(sup(a/b)sup(b/a))\displaystyle\forall a,b\in S,h(a,b):=\ln(\sup(a/b)\sup(b/a)) (169)
sup(a/b):=inf{λ|aλb}.\displaystyle\sup(a/b):=\inf\{\lambda\in\mathbb{R}|a\leq\lambda b\}. (170)

Note that sup(a/b)sup(b/a)\sup(a/b)\neq\sup(b/a) and the direction of inequality is important in (170). The metric is projective h(a,b)=h(αa,b)h(a,b)=h(\alpha a,b), and it become a true metric when restricted to set of density operators, i.e., quotient out scalar multiples. This definition works for any proper cone, and only implicitly depends on the actual geometry of the cone. In this metric, every positive maps :S+S+\mathcal{M}:S_{+}\rightarrow S_{+} is contracting 777In fact the unique metric for this to hold Kohlberg1982 .

Theorem V.1 (Birkhoff-Hopf contraction theorem (doi:10.1063/1.3615729, , Thoerem 4) ).

For all :S+S+\mathcal{M}:S_{+}\rightarrow S_{+}, the upper bound on contraction ratio is

η:=supa,bS+h((a),(b))h(a,b)=tanh(Δ()4),\displaystyle\eta_{\mathcal{M}}:=\sup\limits_{a,b\in S_{+}}\frac{h(\mathcal{M}(a),\mathcal{M}(b))}{h(a,b)}=\tanh(\frac{\Delta(\mathcal{M})}{4}), (171)

where Δ()\Delta(\mathcal{M}) is the projective diameter

Δ():=supa,bS+h((a),(b)).\displaystyle\Delta(\mathcal{M}):=\sup\limits_{a,b\in S_{+}}h(\mathcal{M}(a),\mathcal{M}(b)). (172)

Note that Δ()<\Delta(\mathcal{M})<\infty would imply strict contraction η<1\eta_{\mathcal{M}}<1.

We eventually convert back to the norm via the following bound.

Proposition V.2 ((doi:10.1063/1.3615729, , Eq. 38)).

For normalized density operators ρ1,ρ2\rho_{1},\rho_{2},

12ρ1ρ21tanh(h(ρ1,ρ2)4).\frac{1}{2}\lVert\rho_{1}-\rho_{2}\rVert_{1}\leq\tanh\left(\frac{h(\rho_{1},\rho_{2})}{4}\right). (173)

The above are the backgounds we need to prove the following lemma.

Lemma V.5.

If all {1,2,n}\{\mathcal{M}_{1},\mathcal{M}_{2},\cdots\mathcal{M}_{n}\} are CP maps that map any state to a full rank state, then for arbitrary sequence of b=s,,s1{1,,n}b=s_{\ell},\cdots,s_{1}\in\{1,\cdots,n\}^{\ell}, it holds that

b[ρ1]Tr(b[ρ1])b[ρ2]Tr(b[ρ2])1=O(ec),\displaystyle\left\lVert\frac{\mathcal{M}_{b}[\rho_{1}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{1}])}-\frac{\mathcal{M}_{b}[\rho_{2}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{2}])}\right\rVert_{1}=O(e^{-c\ell}), (174)

where the exponent c>0c>0 is independent of bb.

Proof.

For each s\mathcal{M}_{s}, the distance between any two input states is finite because the image is full rank h(s(a),s(b))<.h(\mathcal{M}_{s}(a),\mathcal{M}_{s}(b))<\infty. Hence the projective diameter as a supremum over compact set is also finite, and each s\mathcal{M}_{s} is strictly contracting. Maximizing over s=1,,ns=1,\cdots,n provides a global contraction ratio bound η<1\eta<1.

Δ(s)\displaystyle\Delta(\mathcal{M}_{s}) :=supa,bS+h(s(a),s(b))<\displaystyle:=\sup\limits_{a,b\in S_{+}}h(\mathcal{M}_{s}(a),\mathcal{M}_{s}(b))<\infty (175)
s,ηs\displaystyle\forall s,\ \eta_{\mathcal{M}_{s}} =tanh(Δ(s)4)η<1.\displaystyle=\tanh\left(\frac{\Delta(\mathcal{M}_{s})}{4}\right)\leq\eta<1. (176)

Then for all a,bS+a,b\in S_{+}

h(s,,s1(a),s,,s1(b))h(s1(a),s1(b))ec(1)sups(Δ(s))ec(1)=O(ec),\displaystyle h(\mathcal{M}_{s_{\ell},\cdots,s_{1}}(a),\mathcal{M}_{s_{\ell},\cdots,s_{1}}(b))\leq h(\mathcal{M}_{s_{1}}(a),\mathcal{M}_{s_{1}}(b))e^{-c(\ell-1)}\leq\sup\limits_{s}(\Delta(\mathcal{M}_{s}))\ e^{-c(\ell-1)}=O(e^{-c\ell}), (177)

where again we used the fact the projective diameter is finite. Finally we convert to the trace norm

b[ρ1]Tr(b[ρ1])b[ρ2]Tr(b[ρ2])1\displaystyle\left\lVert\frac{\mathcal{M}_{b}[\rho_{1}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{1}])}-\frac{\mathcal{M}_{b}[\rho_{2}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{2}])}\right\rVert_{1} 2tanh(h(b[ρ1],b[ρ2])4)\displaystyle\leq 2\tanh\left(\frac{h(\mathcal{M}_{b}[\rho_{1}],\mathcal{M}_{b}[\rho_{2}])}{4}\right) (178)
O(ec),\displaystyle\leq O(e^{-c\ell}), (179)

where tanh(x)x\tanh(x)\approx x for small xx. ∎

V.4.2 Proof of Lemma IV.6

Lemma V.6.

Suppose the CP map :CC\mathcal{M}:C^{\prime}\rightarrow C is contractive in the sense that

b[ρ1]Tr(b[ρ1])b[ρ2]Tr(b[ρ2])1ϵ.\lVert\frac{\mathcal{M}_{b}[\rho_{1}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{1}])}-\frac{\mathcal{M}_{b}[\rho_{2}]}{\mathrm{Tr}(\mathcal{M}_{b}[\rho_{2}])}\rVert_{1}\leq\epsilon. (180)

Then the state ρAC:=[σAC]\rho_{AC}:=\mathcal{M}[\sigma_{AC^{\prime}}] is close to the product state

2Tb:=ρAC,bρA,bρC,b14dCσC1ϵ.\displaystyle 2T_{b}:=\lVert\rho_{AC,b}-\rho_{A,b}\otimes\rho_{C,b}\rVert_{1}\leq 4d_{C^{\prime}}\lVert\sigma_{C^{\prime}}^{-1}\rVert_{\infty}\epsilon. (181)
Proof.

Bounded factors depending on the hidden system CC^{\prime} may show up here and there, but it poses no threat when the error ϵ\epsilon is exponentially small.

2Tb:\displaystyle 2T_{b}: =ρAC,bρA,bρC,b1\displaystyle=\lVert\rho_{AC,b}-\rho_{A,b}\otimes\rho_{C,b}\rVert_{1} (182)
=b[σAC]Tr(b[σC])TrC(b[σAC])Tr(b[σC])b[σC]Tr(b[σC])1\displaystyle=\left\lVert\frac{\mathcal{M}_{b}[\sigma_{AC^{\prime}}]}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}-\frac{Tr_{C}(\mathcal{M}_{b}[\sigma_{AC^{\prime}}])}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\otimes\frac{\mathcal{M}_{b}[\sigma_{C^{\prime}}]}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\right\rVert_{1} (183)
b[]Tr(b[σC])TrC(b[])Tr(b[σC])b[σC]Tr(b[σC])\displaystyle\leq\left\lVert\frac{\mathcal{M}_{b}[\cdot]}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}-\frac{Tr_{C}(\mathcal{M}_{b}[\cdot])}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\otimes\frac{\mathcal{M}_{b}[\sigma_{C^{\prime}}]}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\right\rVert_{\diamond} (184)
dATr(b[σC])supX11,Xsupp(σC)b[X]Tr(b[X])b[σC]Tr(b[σC])1\displaystyle\leq\frac{d_{A}}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\sup_{\lVert X\rVert_{1}\leq 1,X\subset{\rm supp}(\sigma_{C^{\prime}})}\left\lVert\mathcal{M}_{b}[X]-\mathrm{Tr}(\mathcal{M}_{b}[X])\frac{\mathcal{M}_{b}[\sigma_{C^{\prime}}]}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\right\rVert_{1} (185)
4dATr(b[σC])supρ1b[ρ1]Tr(b[ρ1])b[σC]Tr(b[σC])1\displaystyle\leq\frac{4d_{A}}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\sup_{\rho_{1}}\left\lVert\mathcal{M}_{b}[\rho_{1}]-\mathrm{Tr}(\mathcal{M}_{b}[\rho_{1}])\frac{\mathcal{M}_{b}[\sigma_{C^{\prime}}]}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\right\rVert_{1} (186)
4dCsupρ1supp(σC)(Tr(b[ρ1]))Tr(b[σC])ϵ\displaystyle\leq 4d_{C^{\prime}}\frac{\sup_{\rho_{1}\subset{\rm supp}(\sigma_{C^{\prime}})}(\mathrm{Tr}(\mathcal{M}_{b}[\rho_{1}]))}{\mathrm{Tr}(\mathcal{M}_{b}[\sigma_{C^{\prime}}])}\epsilon (187)
4dCσC1ϵ,\displaystyle\leq 4d_{C^{\prime}}\lVert\sigma_{C^{\prime}}^{-1}\rVert_{\infty}\epsilon, (188)

where in the second and third inequality we used that diamond norm is bounded by d times the superoperator-norm (Lemma V.3) and the extra factor of 4 comes from turning XX into density operator ρ1\rho_{1}; in the fourth inequality we used the assumption; in the last inequality we used that σC1σCIρ1\lVert\sigma_{C^{\prime}}^{-1}\rVert_{\infty}\sigma_{C^{\prime}}\geq I\geq\rho_{1}. Note that we are working on the support of σC\sigma_{C^{\prime}} so that the inverse is well-defined. ∎

232\implies 3 by setting σAC\sigma_{AC^{\prime}} to be the maximally entangled state on AA¯A\bar{A}, which yields ρAC,bρA,bρC,b14dA2𝒪(ec)\lVert\rho_{AC,b}-\rho_{A,b}\otimes\rho_{C,b}\rVert_{1}\leq 4d_{A}^{2}\mathcal{O}(e^{-c\ell}). The conversion to mutual information straightforwardly follows from the AFW inequality,

I(A:C)b2Tblog(min(dA,dC))+(1+Tb)log(1+Tb)Tblog(Tb)=O(ec1).\displaystyle I(A:C)_{b}\leq 2T_{b}\log(\min(d_{A},d_{C}))+(1+T_{b})\log(1+T_{b})-T_{b}\log(T_{b})=O(e^{-c_{1}\ell}). (189)

This immediately passes to CMI by taking expectation in Eq. (111) and thus completes the proof.

V.4.3 Proof of Lemma IV.7

Lemma V.7.

If span{K1,,Kp}=Mat(D,D){\rm span}\{K_{1},\cdots,K_{p}\}=Mat(D,D), then 𝒩[ρ]:=iKiρKi>0,ρ\mathcal{N}[\rho]:=\sum_{i}K_{i}\rho K_{i}^{\dagger}>0,\forall\rho.

Proof.

Suppose there exist |u,|v|u\rangle,|v\rangle, s.t. jv|Kj|uu|Kj|v=0.\sum_{j}\langle v|K_{j}|u\rangle\langle u|K^{\dagger}_{j}|v\rangle=0. Then we obtain

|v|Ki|u|2=0,i\displaystyle|\langle v|K_{i}|u\rangle|^{2}=0,\forall i (190)
Tr(Ki|uv|)=0,i.\displaystyle\implies\mathrm{Tr}(K_{i}|u\rangle\langle v|)=0,\forall i. (191)

This is a contradiction because {Ki}\{K_{i}\} are full dimensional. ∎

V.4.4 Proof of Lemma IV.8

The idea is rooted from a theorem of Burnside about algebra generated by matrices and the simultaneous invariant subspace.

Theorem V.2 (Burnside LOMONOSOV200445 ).

Consider m1,mp1Mat(D,D)m_{1},\cdots m_{p-1}\in Mat_{\mathbb{C}}(D,D), acting on vector space D\mathbb{C}^{D}. The following are equivalent.

  1. 1.

    The algebra generated by (m1,,mp1)(m_{1},\cdots,m_{p-1}) is the full matrix algebra Mat(D,D)Mat_{\mathbb{C}}(D,D)

  2. 2.

    For all non-trivial subspace VDV\subset\mathbb{C}^{D},

    mqVV,qV=D,m_{q}V\subset V,\ \forall q\implies V=\mathbb{C}^{D}, (192)

    i.e., the simultaneous invariant subspace of m1m_{1} and m2m_{2} is trivial or the whole space.

In our version, the vector space is Mat(D,D)Mat_{\mathbb{C}}(D,D), where Kraus operators live :

Proposition V.3.

Consider m1,,mp1Mat(D,D)m_{1},\cdots,m_{p-1}\in Mat_{\mathbb{C}}(D,D), acting on vector space Mat(D,D)Mat_{\mathbb{C}}(D,D) by left multiplication. The following are equivalent.

  1. 1.

    The algebra generated by (m1,,mp1)(m_{1},\cdots,m_{p-1}) is the full matrix algebra Mat(D,D)Mat_{\mathbb{C}}(D,D)

  2. 2.

    For all non-trivial subspace(as vector space) WMat(d,d)W\subset Mat_{\mathbb{C}}(d,d) containing an invertible element BB,

    mqWW,qW=Mat(D,D).m_{q}W\subset W,\forall q\implies W=Mat_{\mathbb{C}}(D,D).
Proof.

()(\implies) For all non-trivial subspace WMat(D,D)W\in Mat_{\mathbb{C}}(D,D), if mqWW,qm_{q}W\subset W,\forall q, then same is true under left multiplication

(m1,,mp1)WW.\displaystyle(m_{1},\cdots,m_{p-1})W\subset W. (193)

Then in particular WW contains (m1,,mp1)B(m_{1},\cdots,m_{p-1})B, thus must be the full matrix algebra W=Mat(D,D)W=Mat_{\mathbb{C}}(D,D).

()(\impliedby) For a contradiction, suppose (m1,,mp1)Mat(D,D)(m_{1},\cdots,m_{p-1})\neq Mat_{\mathbb{C}}(D,D). Then by theorem V.2, (m1,,mp1)(m_{1},\cdots,m_{p-1}) is reducible with an proper invariant subspace VDV\subsetneq\mathbb{C}^{D}. Consider the basis for which the first entries are basis vectors spanning VV, i.e. (m1,,mp1)(m_{1},\cdots,m_{p-1}) has some zeros at the left down corner:

(m1,,mp1)=[MVN0MVc].(m_{1},\cdots,m_{p-1})=\begin{bmatrix}M_{V}&N\\ 0&M_{V^{c}}\end{bmatrix}.

Consider the subspace W:=(m1,,mp1)+{λI}W^{\prime}:=(m_{1},\cdots,m_{p-1})+\{\lambda I\} by adding the identity, which is invertible.

Then the resulting subspace WW^{\prime} is an invariant subspace

mqW\displaystyle m_{q}W^{\prime} =mq(m1,,mp1)+mqW.\displaystyle=m_{q}(m_{1},\cdots,m_{p-1})+m_{q}\subset W^{\prime}. (194)

We arrive at contradiction with W=(m1,,mp1)+{λI}Mat(D,D)W^{\prime}=(m_{1},\cdots,m_{p-1})+\{\lambda I\}\neq Mat_{\mathbb{C}}(D,D). ∎

We can now prove Lemma IV.8:

Lemma V.8 (Condition 1 implies the increment of span of Kraus operators).

Consider a CP self-map \mathcal{M} with p>2p>2 Kraus operators {E1,,Ep}\{E_{1},\cdots,E_{p}\} such that EpE_{p} is invertible, and ((Ep)1E1,,(Ep)1Ep1)\left((E_{p})^{-1}E_{1},\cdots,(E_{p})^{-1}E_{p-1}\right) generate full matrix algebra Mat(D,D)Mat_{\mathbb{C}}(D,D) by addition and multiplication. Then for any set of Kraus operators {Tj}\{T_{j}\} not full rank and containing an invertible element, the dimension of span must increase after applying \mathcal{M}

dim(span{EiTj})>dim(span{Tj}).{\rm dim}({\rm span}\{E_{i}T_{j}\})>{\rm dim}({\rm span}\{T_{j}\}). (196)
Proof.

Left multiplying EiE_{i} yields the span

{E1Tj,,EpTj}.\{E_{1}T_{j},\cdots,E_{p}T_{j}\}.

Notice that dim({Tj})=dim(Ep{Tj}){\rm dim}(\{T_{j}\})={\rm dim}(E_{p}\{T_{j}\}) because EpE_{p} is invertible, so if any of E1{Tj},,Ep1{Tj}E_{1}\{T_{j}\},\cdots,E_{p-1}\{T_{j}\} is not a subspace of Ep{Tj}E_{p}\{T_{j}\}, then the dimension would increase dim(span{EiTj})>dim(span{Tj}){\rm dim}({\rm span}\{E_{i}T_{j}\})>{\rm dim}({\rm span}\{T_{j}\}); we only need to worry if

E1{Tj},Ep1{Tj}Ep{Tj}.E_{1}\{T_{j}\},\cdots E_{p-1}\{T_{j}\}\subset E_{p}\{T_{j}\}. (197)

Invert EpE_{p} and by Proposition V.3, {Tj}\{T_{j}\} must reach full rank already.

{Tj}=Mat(D,D).\{T_{j}\}=Mat_{\mathbb{C}}(D,D). (198)

V.4.5 Proof of Proposition III.6

It is simple to generate an algebra from set of matrices m1,,mp1m_{1},\ldots,m_{p-1}. Start with S=span{m1}S={\rm span}\{m_{1}\}, repeat the following steps:

  1. 1.

    Add left multiplied matrices to the set, resulting S={S,m1S,,mp1S}S^{\prime}=\{S,m_{1}S,\ldots,m_{p-1}S\}.

  2. 2.

    Find a linear basis for SS^{\prime}.888In the actual code, there always need to be a small error threshold to decide the linear independence. If dim(S)=dim(S){\rm dim}(S^{\prime})={\rm dim}(S), then terminate, and span{S}{\rm span}\{S^{\prime}\} is the algebra generated by m1,,mp1m_{1},\ldots,m_{p-1}. If dim(span{S})=D2{\rm dim}({\rm span}\{S^{\prime}\})=D^{2}, then it is the full matrix algebra.

Checking linear independence uses polynomial runtime.

VI Conclusion and discussions

Motivated by the problem of showing the existence of the local parent Hamiltonians of MPDO, we have studied the CMI of MPDO. We have shown that MPDO constructed by bistochastic Y-Shaped channels with trivial correctable algebra have exponentially decaying CMI and thus have approximately local parent Hamiltonians. We have shown a similar bound for a slightly more general class of channels under the restriction that certain decay constants are sufficiently small. We have also shown the exponential decay of CMI for Y-shaped channels with a forgetful component. We have introduced a trace norm variant of the CMI and have shown that for the above cases they obey no worse bounds than the CMI.

For more general Y-shaped channels, we have conjectured the completely contractive DPI (Conjecture III.1). We have shown that if this conjecture is true, every MPDO constructed by a Y-shaped channel with trivial correctable algebra has exponentially decaying CMI. For the measured MPDO, we have provided sufficient conditions implying the exponential decay of CMI. We have numerically confirmed (up to small bond dimension) that these conditions are generically true if the Y-shaped channel is generated by a Haar-random unitary.

Our results Theorem III.1 and Theorem III.2 only work for a restricted family of Y-shaped channels. The proof relies on the fact that the corresponding MPDO are approximately a product state. This is no longer true for general channels and the deviation from the product state do not obviously contract 999 Bistochastic channels are rather special that it contracts all pp-norm. See doi:10.1063/1.2218675 to see how this fails for non-bistochastic channels.. A possible solution to avoid the structure of many-body state is resorting to our Conjecture III.1. Note that having trivial correctable algebra is still only a sufficient condition, and a necessary and sufficient condition for exponentially decaying CMI is unclear yet.

Analysis of the CMI for the measured MPDO could be massively easier than the unmeasured case due to losing entanglement with BB. However, our results on measured MPDO are still limited and only provide sufficient conditions. In contrast to the single channel quantum Wielandt’s inequality 5550282 with sufficient and necessary conditions, Condition 1 in Theorem III.3 guarantees strict positivity for all sequences multiplicatively generated by a finite set of CP-maps {s}\{\mathcal{M}_{s}\}. Quantum Wielandt’s inequality has a classical analog in matrix theory, however this multiple-channel generalization has limited results even in the classical case (see e.g., PROTASOV2012749 for a related result).

VII Acknowledgement

We thank Jean-Francois Quint for comments on multiplicative ergodic theory. We thank Mario Berta, Marco Tomamichel, Hao-Chung Cheng for discussions about the DPI for CMI. CFC is thankful for Physics TA Relief Fellowship and the Physics TA Fellowship at Caltech. KK acknowledges funding provided by the Institute for Quantum Information and Matter, an NSF Physics Frontiers Center (NSF Grant PHY-1733907) and MEXT Quantum Leap Flagship Program (MEXT Q-LEAP) Grant Number JPMXS0120319794. FB acknowledges funding from NSF.

References

  • (1) I. Affleck, T. Kennedy, E. H. Lieb, and H. Tasaki. Rigorous results on valence-bond ground states in antiferromagnets. Phys. Rev. Lett., 59(7), 1987.
  • (2) D. Perez-Garcia, F. Verstraete, M. M. Wolf, and J. I. Cirac. Matrix product state representations. Quantum Inf. Comput., 7(5-6), 2007.
  • (3) M. B. Hastings. An area law for one-dimensional quantum systems. J. Stat. Mech.-Theory. E., 2007(08), 2007.
  • (4) M. Fannes, B. Nachtergaele, and R. F. Werner. Finitely correlated states on quantum spin chains. Commun. Math. Phys., 144(3), 1992.
  • (5) N. Schuch, D. Perez-Garcia, and J. I. Cirac. Classifying quantum phases using matrix product states and projected entangled pair states. Phys. Rev. B, 84(16), 2011.
  • (6) F. Verstraete and J. I. Cirac. Renormalization algorithms for quantum-many body systems in two and higher dimensions, arXiv:cond-mat/0407066, 2004.
  • (7) A. Molnar, Y. Ge, N. Schuch, and J. I. Cirac. A generalization of the injectivity condition for projected entangled pair states. J. Math. Phys., 59(2), 2018.
  • (8) M. B. Hastings. Solving gapped Hamiltonians locally. Phys. Rev. B, 73(8), 2006.
  • (9) F. Verstraete, J. J. García-Ripoll, and J. I. Cirac. Matrix product density operators: Simulation of finite-temperature and dissipative systems. Phys. Rev. Lett., 93(20), 2004.
  • (10) J. I. Cirac, D. Pérez-García, N. Schuch, and F. Verstraete. Matrix product density operators: Renormalization fixed points and boundary theories. Ann. Phys., 378, 2017.
  • (11) K. Kato and F. G. S. L. Brandão. Quantum approximate markov chains are thermal. Commun. Math. Phys., 370(1):117–149, 2019.
  • (12) M. Kliesch, D. Gross, and J. Eisert. Matrix-product operators and states: Np-hardness and undecidability. Phys. Rev. Lett., 113:160503, 2014.
  • (13) H. Araki. Gibbs states of a one dimensional quantum lattice. Commun. Math. Phys., 14(2), 1969.
  • (14) G. Lindblad. Completely positive maps and entropy inequalities. Commun. Math. Phys., 40(2):147–151, 1975.
  • (15) A. Uhlmann. Relative entropy and the wigner-yanase-dyson-lieb concavity in an interpolation theory. Commun. Math. Phys., 54(1):21–32, 1977.
  • (16) D. Petz. Sufficient subalgebras and the relative entropy of states of a von Neumann algebra. Commun. Math. Phys., 105(1):123–131, 1986.
  • (17) D. Petz. Sufficiency of channels over Von Neumann algebras. Q. J. Math., 39(1), 1988.
  • (18) V. Anantharam, A. Gohari, S. Kamath, and C. Nair. On hypercontractivity and a data processing inequality. 2014 IEEE International Symposium on Information Theory, 3022-3026, 2014.
  • (19) G. E. D.-Uchida. Carathéodory theorem. Springer US, Boston, MA, 2001.
  • (20) S. Beigi and A. Gohari. On dimension bounds for quantum systems. In 2013 Iran Workshop on Communication and Information Theory, 1–6, 2013.
  • (21) J.M. Hammersley and P.E. Clifford. Markov field on finite graphs and lattices. http://www.statslab.cam.ac.uk/ grg/books/hammfest/hamm-cliff.pdf, 1971.
  • (22) P. Hayden, R. Jozsa, D. Petz, and A. Winter. Structure of States Which Satisfy Strong Subadditivity of Quantum Entropy with Equality. Commun. Math. Phys., 246(2), 2004.
  • (23) W. Brown and D. Poulin. Quantum Markov Networks and Commuting Hamiltonians. arXiv:1206.0755 2012.
  • (24) C. Bény, A. Kempf, and D. W. Kribs. Generalization of quantum error correction via the Heisenberg picture. Phys. Rev. Lett., 98(10), 2007.
  • (25) C. Bény, A. Kempf, and D. W. Kribs. Quantum error correction of observables. Phys. Rev. A, 76(4), 2007.
  • (26) E. Knill, R. Laflamme, and L. Viola. Theory of quantum error correction for general noise. Phys. Rev. Lett., 84:2525–2528, 2000.
  • (27) M. Junge, R. Renner, D. Sutter, M. M. Wilde, and A. Winter. Universal recovery maps and approximate sufficiency of quantum relative entropy. Ann. Henri. Poincaré., 19(10):2955–2978, Aug 2018.
  • (28) D. Petz. sufficiency of channels over von neumann algebras. The Quarterly Journal of Mathematics, 39(1):97–108, 03 1988.
  • (29) M. M. Wilde. From classical to quantum shannon theory, 2011.
  • (30) K. Kato and F. G. S. L. Brandão. Toy model of boundary states with spurious topological entanglement entropy. Phys. Rev. Research, 2:032005, Jul 2020.
  • (31) M. Sanz, D. Pérez-García, M. M. Wolf, and J. I. Cirac. A quantum version of wielandt’s inequality. IEEE Transactions on Information Theory, 56(9):4668–4673, 2010.
  • (32) S. Gaubert and Z. Qu. Checking the strict positivity of Kraus maps is np-hard. arXiv:1402.1429, 2014.
  • (33) A. Winter. Tight uniform continuity bounds for quantum entropies: conditional entropy, relative entropy distance and energy constraints, Commun. Math. Phys. 347(1):291-313, 2016.
  • (34) M. Takesaki. Theory of operator algebras. I, volume 124 of Encyclopaedia of Mathematical Sciences. Springer-Verlag, Berlin, 2002.
  • (35) C.-F. Chen, G. Penington, and G. Salton. Entanglement wedge reconstruction using the petz map. JHEP, 2020(1), Jan 2020.
  • (36) V. Paulsen. Completely Bounded Maps and Operator Algebras. Cambridge Studies in Advanced Mathematics. Cambridge University Press, 2003.
  • (37) M. James. Quantum markov chain mixing and dissipative engineering. PhD thesis, 2015.
  • (38) D. Reeb, M. J. Kastoryano, and M. M. Wolf. Hilbert’s projective metric in quantum information theory. J. Math. Phys., 52(8):082201, 2011.
  • (39) E. Kohlberg and J. W. Pratt. The Contraction Mapping Approach to the Perron-Frobenius Theory: Why Hilbert’s Metric? Math. Oper. Res., 7(2):198–210, 1982.
  • (40) V. Lomonosov and P. Rosenthal. The simplest proof of Burnside’s theorem on matrix algebras. Linear Algebra Appl., 383:45 – 47, 2004. Special Issue in honor of Heydar Radjavi.
  • (41) D. Pérez-García, M. M. Wolf, D. Petz, and M. B. Ruskai. Contractivity of positive and trace-preserving maps under LPL_{P} norms. J. Math. Phys., 47(8):083506, 2006.
  • (42) V. Y. Protasov and A.S. Voynov. Sets of nonnegative matrices without positive products. Linear. Algebra. Appl., 437(3):749 – 765, 2012.
  • (43) M. Koashi and N. Imoto. Operations that do not disturb partially known quantum states. Phys. Rev. A, 66:022318, Aug 2002.
  • (44) G. Lindblad. A general no-cloning theorem. Lett. Math. Phys., 47(2):189–196, 1999.
  • (45) C. Hirche, C. Rouzé, and D. S. França On contraction coefficients, partial orders and approximation of capacities for quantum channels. arxiv:2011.05949, 2020
  • (46) https://github.com/Shawnger/supplementary_MPDO

Appendix A Equivalence between correctable algebra and the Petz recovered map for bistochastic channels

The structure of CPTP self-map, the invariant subspace, and the fixed-point algebra of the dual has been studied.

Theorem A.1 (Combination of Hayden2004 ; PhysRevA.66.022318 ; lindbald1999 ).

For every CPTP self-map 𝒵:S(d)S(d)\mathcal{Z}:S(\mathbb{C}^{d})\rightarrow S(\mathbb{C}^{d}), the following holds:

  1. 1.

    The invariant subspace S𝒵S_{\mathcal{Z}^{\dagger}} of 𝒵\mathcal{Z}^{\dagger} forms a subalgebra MB(d)M\subset B(\mathcal{H}_{d}), with factors α\alpha.

    S𝒵=αB(α)Iα¯S_{\mathcal{Z}^{\dagger}}=\bigoplus_{\alpha}B(\mathcal{H}_{\alpha})\otimes I_{\bar{\alpha}} (199)
  2. 2.

    The invariant subspace of S𝒵S_{\mathcal{Z}} of 𝒵\mathcal{Z} has form

    S𝒵=αpαρασα¯S_{\mathcal{Z}}=\bigoplus_{\alpha}p_{\alpha}\rho_{\alpha}\otimes\sigma_{\bar{\alpha}} (200)

    with σα¯\sigma_{\bar{\alpha}} determined by 𝒵\mathcal{Z}, and ρα,pα\rho_{\alpha},p_{\alpha} are free.

  3. 3.

    The channel 𝒵\mathcal{Z} restricted on the block diagonal entries has form (its acting on off-diagonal part is not as simple)

    𝒵=αα𝒵α¯\mathcal{Z}=\bigoplus_{\alpha}\mathcal{I}_{\alpha}\otimes\mathcal{Z}_{\bar{\alpha}} (201)

    and Zα¯Z_{\bar{\alpha}} has unique fixed point σα\sigma_{\alpha}

Though, when a channel has different input-output dimension, we alternatively consider the correctable algebra 𝒜()\mathcal{A}(\mathcal{E}). It turned out to correspond to the invariant subspace of Z=𝒫τ,Z=\mathcal{P}_{\tau,\mathcal{E}}\mathcal{E}, the original channel composed with the Petz recovery map with the maximally mixed state as the reference.

Proposition A.1 (Recap of Proposition II.1).

The correctable algebra of a channel \mathcal{E} equals to the fixed-point algebra of the Petz-recovery channel composed with channel 𝒫τ,\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E} (and the dual 𝒫τ,\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau,\mathcal{E}}^{\dagger}, due to being self-adjoint).

S𝒫τ,=S𝒫τ,=𝒜()S_{\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau,\mathcal{E}}^{\dagger}}=S_{\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E}}=\mathcal{A}(\mathcal{E}) (202)
Proof.

By (Chen_2020, , Theorem 1), the Petz map with the maximally mixed reference state is a universal subalgebra recovery map, i.e. it recovers any subalgebra that can be recovered from any channel DD.

D(ρ)|a=ρa,ρ𝒫τ,(ρ)|a=ρa,ρD\circ\mathcal{E}(\rho)|_{a}=\rho_{a},\forall\rho\implies\mathcal{P}_{\tau,\mathcal{E}}\circ\mathcal{E}(\rho)|_{a}=\rho_{a},\forall\rho (203)

The invariant subspace of 𝒫τ,\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau,\mathcal{E}}^{\dagger} contains the correctable algebra, and the converse is by definition true.

𝒜()S𝒫τ,𝒜()\mathcal{A}(\mathcal{E})\subset S_{\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau,\mathcal{E}}^{\dagger}}\subset\mathcal{A}(\mathcal{E}) (204)

We conclude the proof by that the Petz-recovered channel is self-adjoint w.r.t. to the H.S. norm

𝒫τ,\displaystyle\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau,\mathcal{E}}^{\dagger} =[[τ]1/2τ[][τ]1/2]=τ[[τ]1/2[][τ]1/2]\displaystyle=\mathcal{E}^{\dagger}\circ[\mathcal{E}[\tau]^{-1/2}\tau\mathcal{E}[\cdot]\mathcal{E}[\tau]^{-1/2}]=\tau\mathcal{E}^{\dagger}[\mathcal{E}[\tau]^{-1/2}\mathcal{E}[\cdot]\mathcal{E}[\tau]^{-1/2}] (205)
=𝒫τ,.\displaystyle=\mathcal{P}_{\tau,\mathcal{E}}\mathcal{E}. (206)

This provides an alternative proof that the correctable algebra behaves nicely when tensored with auxiliary system BB.

Proposition A.2.

Suppose the correctable algebra is

𝒜():=γ=βB(β)Iβ¯B(C)\mathcal{A}(\mathcal{E}):=\gamma=\bigoplus_{\beta}B(\mathcal{H}_{\beta})\otimes I_{\bar{\beta}}\subset B(\mathcal{H}_{C}) (207)

then

𝒜(B)=βB(Bβ)Iβ¯=B(B)γ\mathcal{A}(\mathcal{I}_{B}\otimes\mathcal{E})=\bigoplus_{\beta}B(\mathcal{H}_{B\beta})\otimes I_{\bar{\beta}}=B(\mathcal{H}_{B})\otimes\gamma (208)
Proof.

By Proposition II.1, we get

𝒜(B)=SB𝒫τBC,B=SB(𝒫τC,)\displaystyle\mathcal{A}(\mathcal{I}_{B}\otimes\mathcal{E})=S_{\mathcal{I}_{B}\otimes\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau_{BC},\mathcal{I}_{B}\otimes\mathcal{E}}^{\dagger}}=S_{\mathcal{I}_{B}\otimes(\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau_{C},\mathcal{E}}^{\dagger})} (209)

where we can use structure theorem A.1 for 𝒵:=𝒫τC,\mathcal{Z}:=\mathcal{P}_{\tau_{C},\mathcal{E}}\circ\mathcal{E}

B(𝒫τC,)\displaystyle\mathcal{I}_{B}\otimes(\mathcal{E}^{\dagger}\circ\mathcal{P}_{\tau_{C},\mathcal{E}}^{\dagger}) =Bββ𝒵,β¯\displaystyle=\mathcal{I}_{B}\otimes\bigoplus_{\beta}\mathcal{I}_{\beta}\otimes\mathcal{Z}_{,\bar{\beta}} (210)
=βBβ𝒵,β¯\displaystyle=\bigoplus_{\beta}\mathcal{I}_{B\beta}\otimes\mathcal{Z}_{,\bar{\beta}} (211)

The invariant subspace of B𝒵\mathcal{I}_{B}\otimes\mathcal{Z}^{\dagger} coincide with Eq.(207) ∎

Appendix B Structure of periodic MPDO generated by Y-shaped bistochastic channels.

Theorem B.1.

Consider the following MPDO constructed by closed loop of Y-shaped channels

ρB1B:=i,jTr(|ji|𝒩𝒩[|ij|]).\displaystyle\rho_{B_{1}\cdots B_{\ell}}:=\sum_{i,j}\mathrm{Tr}(|j\rangle\langle i|\mathcal{N}\circ\cdots\mathcal{N}[|i\rangle\langle j|]). (212)

Then the state is close to the maximally mixed state

ρB1B=τB1B+OB1B\displaystyle\rho_{B_{1}\cdots B_{\ell}}=\tau_{B_{1}\cdots B_{\ell}}+O_{B_{1}\cdots B_{\ell}} (213)

up to an exponentially small global operator

OB1B1=𝒪(η),\lVert O_{B_{1}\cdots B_{\ell}}\rVert_{1}=\mathcal{O}(\eta^{\ell}), (214)

which implies the decay of the CMI for any tripartition ABCA^{\prime}B^{\prime}C^{\prime}

I(A:C|B)=𝒪(η).I(A^{\prime}:C^{\prime}|B^{\prime})=\mathcal{O}(\ell\eta^{\ell}). (215)

Where the contraction ratio η\eta is given as

η:=lim supρCτCD(𝒩[ρC]τC)D(ρCτC).\eta:=\limsup_{\rho_{C}\rightarrow\tau_{C}}\frac{D(\mathcal{N}[\rho_{C}]\|\tau_{C^{\prime}})}{D(\rho_{C}\|\tau_{C})}\,. (216)
Proof.

We first break the loop by rewriting

ρB1B\displaystyle\rho_{B_{1}\cdots B_{\ell}} =i,jTr(|ji|𝒩𝒩[|ij|])\displaystyle=\sum_{i,j}\mathrm{Tr}(|j\rangle\langle i|\mathcal{N}\circ\cdots\mathcal{N}[|i\rangle\langle j|]) (217)
=OijTr(Oij𝒩𝒩[Oij]),\displaystyle=\sum_{O_{ij}}\mathrm{Tr}(O^{\dagger}_{ij}\mathcal{N}\circ\cdots\mathcal{N}[O_{ij}]), (218)

where OijO_{ij} are any complete basis of operators orthonormal w.r.t. the Hilbert Schimdt inner product. Then recall the decomposition of 𝒩𝒩\mathcal{N}\circ\cdots\mathcal{N} as in the open boundary case

𝒩𝒩[O]\displaystyle\mathcal{N}\circ\cdots\mathcal{N}[O] =(1K)𝒩[O]τ3τ\displaystyle=(1-\mathbb{P}_{K})\mathcal{N}[O]\otimes\tau_{3}\cdots\tau_{\ell} (219)
+(1K)𝒩K𝒩[O]τ4τ\displaystyle+(1-\mathbb{P}_{K})\mathcal{N}\mathbb{P}_{K}\mathcal{N}[O]\otimes\tau_{4}\cdots\tau_{\ell} (220)
+\displaystyle+\cdots (221)
+K𝒩K𝒩K𝒩[O].\displaystyle+\mathbb{P}_{K}\mathcal{N}\cdots\mathbb{P}_{K}\mathcal{N}\mathbb{P}_{K}\mathcal{N}[O]. (222)

Plugging into Eq. (218) and choosing OijO_{ij} to split into the maxmially mixed I/dI/\sqrt{d} and the basis for orthogonal complement {Kn}\{K_{n}\}.

OijTr(Oij𝒩𝒩[Oij])\displaystyle\sum_{O_{ij}}\mathrm{Tr}(O^{\dagger}_{ij}\mathcal{N}\circ\cdots\mathcal{N}[O_{ij}]) =nTr(Kn𝒩𝒩[Kn])\displaystyle=\sum_{n}\mathrm{Tr}(K^{\dagger}_{n}\mathcal{N}\circ\cdots\mathcal{N}[K_{n}]) (223)
+dTr(τ𝒩𝒩[τ])\displaystyle+d\mathrm{Tr}(\tau\mathcal{N}\circ\cdots\mathcal{N}[\tau]) (224)
=τB1B+nTr(Kn𝒩KK𝒩[Kn]),\displaystyle=\tau_{B_{1}\cdots B_{\ell}}+\sum_{n}\mathrm{Tr}(K^{\dagger}_{n}\mathcal{N}\mathbb{P}_{K}\circ\cdots\mathbb{P}_{K}\mathcal{N}[K_{n}]), (225)

where the sum over traceless KnK_{n} contains a single term since other terms have τ\tau_{\ell} which is orthogonal to KnK_{n}. When 𝒩\mathcal{N} has trivial correctable algebra, by Lemma IV.1 the global operator is exponentially decaying in the normalized H-S norm and hence the trace norm using Cauchy-Schwartz inequality. Finally by continutity of the CMI (Lemma  IV.2), each tripartition must have CMI exponentially small w.r.t to total length \ell. ∎