
Smart Hybrid Beamforming and Pilot Assignment for 6G Cell-Free Massive MIMO

Carles Diaz-Vilor,1 Alexei Ashikhmin,2 and Hong Yang2 1C. Diaz-Vilor is with the Univ. of California, Irvine. Email: {cdiazvil} at uci.edu 2A. Ashikhmin and H. Yang are with Nokia Bell Labs, Murray Hill. Emails: {alexei.ashikhmin, h.yang} at nokia-bell-labs.com
Abstract

We investigate cell-free massive MIMO networks in which each access point (AP) is equipped with a hybrid analog-digital transceiver, reducing complexity and cost compared to a fully digital transceiver. Asymptotic approximations for the spectral efficiency are derived for both uplink and downlink. Capitalizing on these expressions, a max-min problem is formulated that enables us to optimize (i) the analog beamformers at the APs and (ii) the pilot assignment. Simulations show that optimizing these variables substantially increases the minimum user throughput.

Index Terms:
Cell-Free, MIMO, MMSE, RZF, hybrid beamforming, large-scale, optimization, SINR

I Introduction

A prospective candidate for beyond-5G wireless networks is the cell-free massive MIMO (CF-mMIMO) topology, in which every user (UE) potentially connects to every access point (AP), taking the principles of cell cooperation to the limit; see [1, 2, 3, 4, 5] and the references therein.

In parallel, forthcoming technologies will operate at higher frequencies (i.e., mmWave or THz bands), so transceiver complexity faces a key trade-off: data rate vs. power consumption. Additionally, CF networks will cover larger areas than cellular systems, so the severity of the path loss requires the APs to be equipped with large arrays to compensate for the attenuation, demanding even more power if fully digital structures are used.

A possible solution that has attracted a lot of attention is the hybrid transceiver [6, 7], composed of two stages: (a) the analog part, in which the antennas are connected to a few RF chains by means of phase shifters, and (b) the digital part. While the former stage dramatically reduces AP complexity and power consumption, it also degrades performance. Consequently, properly designing the analog beamformer can narrow the performance gap with respect to fully digital transceivers. To the best of our knowledge, two main works construct the analog beamformer as a function of slow-fading channel parameters [8, 9]; this approach is also investigated in this paper and shown to outperform both references.

Once the analog part is designed, we investigate the uplink and downlink of two digital benchmarks: (i) minimum mean squared error (MMSE) reception and (ii) regularized zero-forcing (RZF) precoding. Asymptotic approximations of the signal-to-interference-and-noise ratio (SINR) are derived based on [10] and shown to be tight for finite-dimension systems under the above decoding/precoding. For a given hybrid structure, and capitalizing on the asymptotic approximations, another relevant problem is studied in this paper: pilot assignment, for which a greedy algorithm based on the asymptotic expressions is provided.

Finally, we derive two novel bounds on the gap between hybrid and fully digital structures, and show that these bounds depend only on the channel matrix eigenvalues.

II System Model

Consider a CF massive MIMO system composed of $M$ APs, each equipped with $N$ antennas and $L$ ($L\leq N$) RF chains, serving $K$ single-antenna users (UEs). We assume each AP is connected to a central processing unit (CPU) through high-capacity fronthaul links. Denote by $\boldsymbol{h}_{m,k}\in\mathbb{C}^{N\times 1}$ the channel between AP $m$ and UE $k$. Then

$\boldsymbol{h}_{m,k}\sim\mathcal{N}_{\mathbb{C}}(\boldsymbol{0},\boldsymbol{R}_{m,k})$,  (1)

with $\boldsymbol{R}_{m,k}$ being the spatial correlation matrix. Each AP performs hybrid beamforming with the aim of reducing the number of RF chains at the transceivers, and therefore their cost and complexity. In particular, each AP contains an analog matrix $\boldsymbol{W}_{m}\in\mathbb{C}^{N\times L}$ such that $\big(|\boldsymbol{W}_{m}|\big)_{n,l}=\frac{1}{\sqrt{N}}$, emulating phase shifters, whose entries will be designed later. As a consequence, the effective channel between AP $m$ and UE $k$ is represented by $\boldsymbol{g}_{m,k}\in\mathbb{C}^{L\times 1}$

$\boldsymbol{g}_{m,k}=\boldsymbol{W}_{m}^{*}\boldsymbol{h}_{m,k}$.  (2)

Hence, $\boldsymbol{g}_{m,k}\sim\mathcal{N}_{\mathbb{C}}(\boldsymbol{0},\boldsymbol{R}_{m,k}^{(g)})$ with $\boldsymbol{R}_{m,k}^{(g)}=\boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m}$.
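As a numerical illustration of the compression in (2), the following sketch (Python/NumPy; all dimensions and the randomly generated correlation matrix are illustrative, not from the paper) draws a channel realization and forms the effective channel and its covariance:

```python
import numpy as np

rng = np.random.default_rng(0)
N, L = 32, 8  # antennas and RF chains at one AP (illustrative values)

# Analog beamformer with unit-modulus entries, (|W_m|)_{n,l} = 1/sqrt(N)
W = np.exp(1j * rng.uniform(0, 2 * np.pi, (N, L))) / np.sqrt(N)

# A toy positive-definite spatial correlation matrix standing in for R_{m,k}
A = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
R = A @ A.conj().T / N

# Draw h ~ CN(0, R) and compress it: g = W^* h, as in (2)
h = np.linalg.cholesky(R) @ (
    (rng.standard_normal(N) + 1j * rng.standard_normal(N)) / np.sqrt(2))
g = W.conj().T @ h                # L-dimensional effective channel
R_g = W.conj().T @ R @ W          # effective covariance R^(g) = W^* R W
```

Note that the AP now processes an $L$-dimensional observation instead of an $N$-dimensional one, which is precisely the complexity reduction of the hybrid front end.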

II-A Channel Estimation Process

A portion of the total number of resource units, the latter denoted by $\tau_{c}$, is used for channel estimation. During $\tau$ ($\leq\tau_{c}$) channel uses, UE $k$ is assigned a pilot $\boldsymbol{\phi}_{k}\in\mathbb{C}^{\tau\times 1}$ with $||\boldsymbol{\phi}_{k}||^{2}=\tau$, and the pilot matrix is denoted by $\boldsymbol{\Phi}=(\boldsymbol{\phi}_{1},\dots,\boldsymbol{\phi}_{K})\in\mathbb{C}^{\tau\times K}$. Upon pilot transmission at a certain power $p^{(t)}$, the observations at the $m$th AP are

$\boldsymbol{Y}_{m}=\sqrt{p^{(t)}}(\boldsymbol{g}_{m,1},\dots,\boldsymbol{g}_{m,K})\boldsymbol{\Phi}^{\mathrm{T}}+\boldsymbol{W}_{m}^{*}\boldsymbol{Z}_{m}$,  (3)

with $\boldsymbol{Z}_{m}\sim\mathcal{N}_{\mathbb{C}}(\boldsymbol{0},\sigma^{2}\boldsymbol{I}_{N})$, where $\sigma^{2}$ is the noise power. Standard MMSE estimation yields the following estimates [11]

$\boldsymbol{\hat{g}}_{m,k}=\sqrt{p^{(t)}}\boldsymbol{R}_{m,k}^{(g)}(\boldsymbol{\phi}_{k}\otimes\boldsymbol{I}_{L})^{*}\boldsymbol{\Psi}_{m}^{-1}\mathrm{vec}(\boldsymbol{Y}_{m})$,  (4)

with

$\boldsymbol{\Psi}_{m}=p^{(t)}(\boldsymbol{\Phi}\otimes\boldsymbol{I}_{L})\boldsymbol{R}_{m}^{(g)}(\boldsymbol{\Phi}\otimes\boldsymbol{I}_{L})^{*}+\sigma^{2}\boldsymbol{I}_{\tau}\otimes\boldsymbol{W}_{m}^{*}\boldsymbol{W}_{m}$,  (5)

for $\boldsymbol{R}_{m}^{(g)}=\mathrm{diag}\{\boldsymbol{R}_{m,k}^{(g)}\ \mathrm{for}\ k=1,\dots,K\}$. It can be verified that $\boldsymbol{g}_{m,k}=\boldsymbol{\hat{g}}_{m,k}+\boldsymbol{\tilde{g}}_{m,k}$, with $\boldsymbol{\tilde{g}}_{m,k}$ denoting the estimation error, uncorrelated with the estimate. More concretely, $\boldsymbol{\hat{g}}_{m,k}\sim\mathcal{N}_{\mathbb{C}}(\boldsymbol{0},\boldsymbol{\Gamma}_{m,k}^{(g)})$ with $\boldsymbol{\Gamma}_{m,k}^{(g)}$ defined by

$\boldsymbol{\Gamma}_{m,k}^{(g)}=\mathbb{E}\{\boldsymbol{\hat{g}}_{m,k}\boldsymbol{\hat{g}}_{m,k}^{*}\}$  (6)
$=\boldsymbol{R}_{m,k}^{(g)}(\boldsymbol{\phi}_{k}\otimes\boldsymbol{I}_{L})^{*}\boldsymbol{\Psi}_{m}^{-1}(\boldsymbol{\phi}_{k}\otimes\boldsymbol{I}_{L})\boldsymbol{R}_{m,k}^{(g)}$,  (7)

and the channel estimation error following $\boldsymbol{\tilde{g}}_{m,k}\sim\mathcal{N}_{\mathbb{C}}(\boldsymbol{0},\boldsymbol{C}_{m,k}^{(g)})$ with $\boldsymbol{C}_{m,k}^{(g)}=\boldsymbol{R}_{m,k}^{(g)}-\boldsymbol{\Gamma}_{m,k}^{(g)}$.
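The Kronecker structure of (4)-(5) can be exercised directly. The following sketch (toy dimensions and randomly generated covariances, purely illustrative; with $K>\tau$, two UEs deliberately share a pilot to exhibit pilot contamination) builds $\boldsymbol{\Psi}_{m}$, simulates one observation block (3), and forms an MMSE estimate:

```python
import numpy as np

rng = np.random.default_rng(1)
N, L, K, tau = 8, 4, 3, 2   # toy sizes: antennas, RF chains, UEs, pilot length
p_t, sigma2 = 1.0, 0.1      # pilot power and noise power

# Pilot matrix Phi (tau x K) with ||phi_k||^2 = tau; UEs 0 and 2 share a pilot
Phi = np.sqrt(tau) * np.eye(tau)[:, [0, 1, 0]]

W = np.exp(1j * rng.uniform(0, 2 * np.pi, (N, L))) / np.sqrt(N)

def rand_cov(n):
    A = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    return A @ A.conj().T / n

R_g = [W.conj().T @ rand_cov(N) @ W for _ in range(K)]  # effective covariances

# Psi_m from (5), using the Kronecker structure of the pilot observations
R_blk = np.zeros((K * L, K * L), dtype=complex)
for k in range(K):
    R_blk[k*L:(k+1)*L, k*L:(k+1)*L] = R_g[k]
PhiI = np.kron(Phi, np.eye(L))
Psi = p_t * PhiI @ R_blk @ PhiI.conj().T \
    + sigma2 * np.kron(np.eye(tau), W.conj().T @ W)

# Simulate Y_m = sqrt(p) (g_1,...,g_K) Phi^T + W^* Z, as in (3)
G = np.column_stack([
    np.linalg.cholesky(R_g[k]) @
    ((rng.standard_normal(L) + 1j * rng.standard_normal(L)) / np.sqrt(2))
    for k in range(K)])
Z = (rng.standard_normal((N, tau)) + 1j * rng.standard_normal((N, tau))) \
    * np.sqrt(sigma2 / 2)
Y = np.sqrt(p_t) * G @ Phi.T + W.conj().T @ Z

def mmse_estimate(k):
    """MMSE estimate of g_{m,k} from vec(Y_m), as in (4)."""
    phiI_k = np.kron(Phi[:, [k]], np.eye(L))
    return np.sqrt(p_t) * R_g[k] @ phiI_k.conj().T \
        @ np.linalg.solve(Psi, Y.flatten('F'))

g_hat = mmse_estimate(0)
```

Because UEs 0 and 2 share $\boldsymbol{\phi}_{k}$, the estimate of one is biased toward the other's channel, which is exactly the contamination the pilot assignment of Sec. V-B mitigates.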

II-B Scalable Cell-Free

Although CF networks allow users to establish connectivity to multiple APs, scalability must be taken into account; therefore, only a subset of APs jointly serve a particular user. Hence, we denote by $\mathcal{F}_{k}$ the subset of APs involved in the decoding of the $k$th UE and by $\mathcal{U}_{m}$ the subset of UEs treated as signal by AP $m$. Thus, the binary matrix $\boldsymbol{M}=(\boldsymbol{m}_{1},\dots,\boldsymbol{m}_{K})\in\mathbb{Z}_{2}^{M\times K}$, whose entries are

$\left(\boldsymbol{M}\right)_{m,k}=\begin{cases}1&\text{if }k\in\mathcal{U}_{m}\\ 0&\text{otherwise,}\end{cases}$  (8)

accounts for scalability. Provided that each AP observes an $L$-dimensional signal after the hybrid beamforming stage, the expanded version of $\boldsymbol{M}$ is $\boldsymbol{M}^{(s)}=\boldsymbol{M}\otimes\boldsymbol{1}_{L}$, with $\boldsymbol{1}_{L}$ an $L$-dimensional vector of ones. The complementary matrix $\boldsymbol{M}^{(i)}=\boldsymbol{1}-\boldsymbol{M}^{(s)}$ accounts for the disregarded UEs per AP.

II-C Uplink & Downlink Data Transmission

After data transmission, the signal collected by the $M$ APs is $\boldsymbol{y}=(\boldsymbol{y}_{1},\dots,\boldsymbol{y}_{M})^{\mathrm{T}}\in\mathbb{C}^{ML\times 1}$ with $\boldsymbol{y}_{m}\in\mathbb{C}^{L\times 1}$

$\boldsymbol{y}=(\boldsymbol{M}^{(s)}\circ\boldsymbol{G})\boldsymbol{x}+(\boldsymbol{M}^{(i)}\circ\boldsymbol{G})\boldsymbol{x}+\boldsymbol{W}^{*}\boldsymbol{n}$,  (9)

with $\circ$ denoting the Hadamard product and $\boldsymbol{G}\in\mathbb{C}^{ML\times K}$ being the effective channel matrix whose entries are $(\boldsymbol{G})_{m,k}=\boldsymbol{g}_{m,k}\in\mathbb{C}^{L\times 1}$. The vector $\boldsymbol{x}=(\sqrt{p_{1}}s_{1},\dots,\sqrt{p_{K}}s_{K})^{\mathrm{T}}$ contains the UE transmit powers and symbols, denoted by $p_{k}$ and $s_{k}$, respectively. Finally, $\boldsymbol{W}=\mathrm{diag}\{\boldsymbol{W}_{m}\ \mathrm{for}\ m=1,\dots,M\}$ and $\boldsymbol{n}=(\boldsymbol{n}_{1},\dots,\boldsymbol{n}_{M})^{\mathrm{T}}$, where $\boldsymbol{n}_{m}\sim\mathcal{N}_{\mathbb{C}}(\boldsymbol{0},\sigma^{2}\boldsymbol{I}_{N})$.

In the downlink, the APs jointly precode the users' data. More particularly, the precoder intended for UE $k$ is denoted by $\boldsymbol{v}_{k}\in\mathbb{C}^{ML\times 1}$, and after data transmission the signal collected at UE $k$ is

$y_{k}=\sum_{i=1}^{K}\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}\sqrt{p_{i}}s_{i}+n_{k}$,  (10)

where $n_{k}\sim\mathcal{N}_{\mathbb{C}}(0,\sigma^{2})$.

III Spectral Efficiency Analysis

III-A Uplink MMSE Reception

Provided that for UE $k$ only $|\mathcal{F}_{k}|$ APs are relevant, taking the rows of $\boldsymbol{y}$ associated with $\mathcal{F}_{k}$ produces the following reduced signal model

$\boldsymbol{y}_{k}=\underbrace{\boldsymbol{M}^{(s)}_{k}\circ\boldsymbol{\hat{G}}_{k}\boldsymbol{x}}_{\text{signal}}+\underbrace{\big(\boldsymbol{M}^{(s)}_{k}\circ\boldsymbol{\tilde{G}}_{k}+\boldsymbol{M}^{(i)}_{k}\circ\boldsymbol{G}_{k}\big)\boldsymbol{x}+\boldsymbol{W}^{*}_{k}\boldsymbol{n}}_{\text{effective noise: }\boldsymbol{z}_{k}}$,  (11)

where the matrices in (11) are the reduced versions of the original matrices, containing the rows related to $\mathcal{F}_{k}$ and all columns. Moreover, $\boldsymbol{z}_{k}\sim\mathcal{N}_{\mathbb{C}}(\boldsymbol{0},\boldsymbol{\Sigma}_{k})$, with $\boldsymbol{\Sigma}_{k}=\mathrm{diag}\{\boldsymbol{\Sigma}_{k,m}\in\mathbb{C}^{L\times L}\ \mathrm{for}\ m\in\mathcal{F}_{k}\}$ a block-diagonal matrix whose diagonal blocks are

$\boldsymbol{\Sigma}_{k,m}=\sum_{i\in\mathcal{U}_{m}}\boldsymbol{C}_{m,i}^{(g)}p_{i}+\sum_{i\notin\mathcal{U}_{m}}\boldsymbol{R}_{m,i}^{(g)}p_{i}+\sigma^{2}\boldsymbol{W}_{m}^{*}\boldsymbol{W}_{m}$.  (12)

In the uplink, the combiner maximizing the SINR is the MMSE combiner, which achieves a maximum value of

$\mathrm{SINR}_{k}=\boldsymbol{\hat{g}}_{k}^{*}\bigg(\sum_{i\neq k}^{K}(\boldsymbol{m}_{k,i}^{(s)}\circ\boldsymbol{\hat{g}}_{i})(\boldsymbol{m}_{k,i}^{(s)}\circ\boldsymbol{\hat{g}}_{i})^{*}p_{i}+\boldsymbol{\Sigma}_{k}\bigg)^{-1}\boldsymbol{\hat{g}}_{k}$,  (13)

where $\boldsymbol{\hat{g}}_{k}$ and $\boldsymbol{\hat{g}}_{i}$ are the $k$th and $i$th columns of $\boldsymbol{\hat{G}}_{k}$, respectively, and a similar definition applies to $\boldsymbol{m}_{k,i}^{(s)}$. As a consequence, after accounting for the pilot overhead $\frac{\tau}{\tau_{c}}$, the ergodic spectral efficiency that the $k$th UE can achieve is

$\mathrm{SE}_{k}=\left(1-\frac{\tau}{\tau_{c}}\right)\mathbb{E}\{\log_{2}(1+\mathrm{SINR}_{k})\}$.  (14)
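The combining step in (13)-(14) reduces to one regularized linear solve per UE. The sketch below (Python/NumPy, with randomly generated channel estimates and a diagonal stand-in for $\boldsymbol{\Sigma}_{k}$ under full cooperation; the UE power $p_k$ is written explicitly) computes the MMSE SINR and the overhead-corrected SE for one realization:

```python
import numpy as np

rng = np.random.default_rng(2)
D, K = 16, 4             # stacked effective-channel dimension and UEs (toy)
p = np.full(K, 0.2)      # UE transmit powers
tau, tau_c = 8, 200      # pilot length and coherence block

# Toy channel estimates and effective-noise covariance Sigma_k
G_hat = (rng.standard_normal((D, K)) + 1j * rng.standard_normal((D, K))) / np.sqrt(2)
Sigma = 0.05 * np.eye(D, dtype=complex)

def ul_mmse_sinr(k):
    """SINR of UE k under MMSE combining, following (13)
    (full cooperation; UE power p_k made explicit)."""
    C = Sigma.copy()
    for i in range(K):
        if i != k:
            gi = G_hat[:, i:i+1]
            C += p[i] * (gi @ gi.conj().T)   # interference covariance
    gk = G_hat[:, k]
    return float(np.real(p[k] * gk.conj() @ np.linalg.solve(C, gk)))

# SE with pilot overhead, as in (14) (one channel realization, no averaging)
se = [(1 - tau / tau_c) * np.log2(1 + ul_mmse_sinr(k)) for k in range(K)]
```

Averaging `se` over many channel draws would approximate the ergodic expectation in (14).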

III-B Downlink RZF Precoding

Various precoding strategies can be used to encode the users' data; however, RZF is known to provide outstanding performance. More particularly, the subset RZF precoder, denoted by $\boldsymbol{V}=(\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{K})$, follows

$\boldsymbol{V}=(\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{K})$  (15)
$=\big[(\boldsymbol{M}^{(s)}\circ\boldsymbol{\hat{G}})(\boldsymbol{M}^{(s)}\circ\boldsymbol{\hat{G}})^{*}+\rho\boldsymbol{I}_{ML}\big]^{-1}(\boldsymbol{M}^{(s)}\circ\boldsymbol{\hat{G}})\boldsymbol{\Lambda}$,  (16)

with $\rho$ being the regularization parameter and $\boldsymbol{\Lambda}=\mathrm{diag}(\lambda_{1},\dots,\lambda_{K})$. Different formulations can be used for $\lambda_{k}$, e.g., to ensure (i) $\mathbb{E}\{||\boldsymbol{W}\boldsymbol{v}_{k}||^{2}\}\leq 1$ or (ii) $||\boldsymbol{W}\boldsymbol{v}_{k}||^{2}\leq 1$. In our case, since perfect CSI is not available, we use the former formulation. Once UE $k$ receives $y_{k}$, as defined in (10), the following spectral efficiency can be achieved:

$\mathrm{SE}_{k}=\left(1-\frac{\tau}{\tau_{c}}\right)\log_{2}(1+\mathrm{SINR}_{k})$,  (17)

with

$\mathrm{SINR}_{k}=\dfrac{|\mathbb{E}\{\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}\}|^{2}p_{k}}{\sum_{i\neq k}^{K}\mathbb{E}\{|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}|^{2}\}p_{i}+\mathrm{var}(\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k})p_{k}+\sigma^{2}}$.  (18)

IV Asymptotic Analysis

To evaluate the previous SINR expressions, we consider the asymptotic regime $MN,K\to\infty$ with finite $MN/K$ and investigate the convergence of the spectral efficiency expressions to deterministic limits. Provided that the subsets account for the non-zero entries in the random matrices, they must also grow with the network, i.e., $|\mathcal{F}_{k}|N,|\mathcal{U}_{m}|\to\infty$ $\forall k,m$. The premises for this convergence require the involved matrices to satisfy two technical conditions: (a) the inverse of the resolvent matrix in (13) and (16) must exist, ensured by $\boldsymbol{\Sigma}_{k}$ and $\rho\boldsymbol{I}_{ML}$, respectively, and (b) $\boldsymbol{\Gamma}_{k}^{(g)}=\mathrm{diag}\{m_{m,k}\cdot\boldsymbol{\Gamma}_{m,k}^{(g)}\ \mathrm{for}\ m=1,\dots,M\}$ must have uniformly bounded spectral norm, with $m_{m,k}$ the $(m,k)$ element of (8). Under these conditions, the following approximations can be made.

Theorem 1.

For $|\mathcal{F}_{k}|N,|\mathcal{U}_{m}|\to\infty$ $\forall k,m$ and UL MMSE combining, $\mathrm{SINR}_{k}\approx\overline{\mathrm{SINR}}_{k}$, with $\overline{\mathrm{SINR}}_{k}$ given in (19).

$\overline{\mathrm{SINR}}_{k}=\frac{p_{k}}{|\mathcal{F}_{k}|N}\sum_{m\in\mathcal{F}_{k}}\mathrm{tr}\big[\boldsymbol{\Gamma}_{m,k}^{(g)}\boldsymbol{T}_{m,k}\big]$,  (19)

where

$\boldsymbol{T}_{m,k}=\bigg(\frac{1}{|\mathcal{F}_{k}|N}\sum_{i=1}^{K}\frac{m_{m,k}\cdot\boldsymbol{\Gamma}_{m,i}^{(g)}}{1+e_{i}}p_{i}+\frac{1}{|\mathcal{F}_{k}|N}\boldsymbol{\Sigma}_{m,k}\bigg)^{-1}$.  (20)

The coefficients $e_{i}$ are obtained iteratively as $e_{i}=\lim_{n\to\infty}e_{i}^{(n)}$, given $e_{i}^{(0)}=|\mathcal{F}_{i}|N$ and the recursion in (21).

$e_{i}^{(n)}=p_{i}\,\mathrm{tr}\Bigg[\boldsymbol{\Gamma}_{i}^{(g)}\bigg(\sum_{j=1}^{K}\frac{\boldsymbol{\Gamma}_{j}^{(g)}p_{j}}{1+e_{j}^{(n-1)}}+\boldsymbol{\Sigma}_{i}\bigg)^{-1}\Bigg]$.  (21)
Proof.

The proof can be found in App. C. ∎
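The recursion in (21) is a standard fixed-point iteration and is cheap to run. As an illustration, the sketch below (Python/NumPy, full-cooperation toy sizes with randomly generated covariances standing in for $\boldsymbol{\Gamma}_{i}^{(g)}$ and $\boldsymbol{\Sigma}_{i}$) iterates it to convergence:

```python
import numpy as np

rng = np.random.default_rng(3)
D, K = 12, 3                 # stacked dimension and number of UEs (toy sizes)
p = np.full(K, 0.2)          # UE transmit powers

def rand_psd(n):
    A = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    return A @ A.conj().T / n

Gamma = [rand_psd(D) for _ in range(K)]   # estimate covariances Gamma_i^(g)
Sigma = 0.05 * np.eye(D)                  # effective-noise covariance

# Fixed-point iteration (21), started from e_i^(0) = D
e = np.full(K, float(D))
converged = False
for _ in range(500):
    Minv = np.linalg.inv(
        Sigma + sum(p[j] * Gamma[j] / (1 + e[j]) for j in range(K)))
    e_new = np.array([p[i] * np.real(np.trace(Gamma[i] @ Minv))
                      for i in range(K)])
    converged = np.max(np.abs(e_new - e)) < 1e-12
    e = e_new
    if converged:
        break
```

The converged $e_{i}$ then feed directly into $\boldsymbol{T}_{m,k}$ in (20), so the whole deterministic equivalent depends only on large-scale quantities.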

Theorem 2.

For $|\mathcal{F}_{k}|N,|\mathcal{U}_{m}|\to\infty$ $\forall k,m$ and DL RZF precoding, $\mathrm{SINR}_{k}\approx\overline{\mathrm{SINR}}_{k}$, with $\overline{\mathrm{SINR}}_{k}$ given in (22)

$\overline{\mathrm{SINR}}_{k}=\dfrac{\frac{\mu_{k}^{2}}{\delta_{k}}p_{k}}{\sum_{i\neq k}^{K}\frac{\theta_{k,i}}{\delta_{i}}p_{i}+\sigma^{2}}$,  (22)

where

$\mu_{k}=\frac{1}{MN}\mathrm{tr}\big[\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}\big]$,  (23)

$\delta_{k}=\frac{1}{(MN)^{2}}\mathrm{tr}\big[\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}'(\frac{\rho}{MN},\boldsymbol{W}^{*}\boldsymbol{W})\big]$,  (24)

$\theta_{k,i}=\frac{1}{(MN)^{2}}\mathrm{tr}\big[\boldsymbol{R}_{k}^{(g)}\boldsymbol{T}'(\frac{\rho}{MN},\boldsymbol{\Gamma}_{i}^{(g)})\big]+\frac{1}{MN}\dfrac{\mu_{k}^{2}\,\frac{1}{MN}\mathrm{tr}\big[\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}'(\frac{\rho}{MN},\boldsymbol{\Gamma}_{i}^{(g)})\big]}{(1+\mu_{k})^{2}}-\frac{2}{MN}\Re\bigg\{\dfrac{\mu_{k}\,\frac{1}{MN}\mathrm{tr}\big[\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}'(\frac{\rho}{MN},\boldsymbol{\Gamma}_{i}^{(g)})\big]}{1+\mu_{k}}\bigg\}$,  (25)

$\boldsymbol{T}=\bigg(\frac{1}{MN}\sum_{i=1}^{K}\frac{\boldsymbol{\Gamma}_{i}^{(g)}}{1+e_{i}}+\frac{\rho}{MN}\boldsymbol{I}_{ML}\bigg)^{-1}$.  (26)

The coefficients $e_{i}$ are obtained iteratively as $e_{i}=\lim_{n\to\infty}e_{i}^{(n)}$, given $e_{i}^{(0)}=MN$ and the recursion in (27)

$e_{k}^{(n)}=\mathrm{tr}\Bigg[\boldsymbol{\Gamma}_{k}^{(g)}\bigg(\sum_{i=1}^{K}\frac{\boldsymbol{\Gamma}_{i}^{(g)}}{1+e_{i}^{(n-1)}}+\rho\boldsymbol{I}_{ML}\bigg)^{-1}\Bigg]$.  (27)

Moreover, the matrix

$\boldsymbol{T}'(\frac{\rho}{MN},\boldsymbol{\Gamma}_{i}^{(g)})=\boldsymbol{T}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{T}+\boldsymbol{T}\,\frac{1}{M}\sum_{k=1}^{K}\frac{\boldsymbol{\Gamma}_{k}^{(g)}e_{k}'}{(1+e_{k})^{2}}\,\boldsymbol{T}$,  (28)

and the coefficients $\boldsymbol{e}'(\frac{\rho}{MN})=(e_{1}',\dots,e_{K}')$ are calculated as

$\boldsymbol{e}'(\frac{\rho}{MN})=\big(\boldsymbol{I}_{K}-\boldsymbol{J}\big)^{-1}\boldsymbol{v}(\frac{\rho}{MN})$,  (29)

with $\boldsymbol{J}\in\mathbb{C}^{K\times K}$ and $\boldsymbol{v}(\frac{\rho}{MN})\in\mathbb{C}^{K\times 1}$ defined as

$\big(\boldsymbol{J}\big)_{k,l}=\dfrac{\frac{1}{MN}\mathrm{tr}\big[\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}\boldsymbol{\Gamma}_{l}^{(g)}\boldsymbol{T}\big]}{MN(1+e_{l})^{2}}$,  (30)

and

$\big(\boldsymbol{v}(\frac{\rho}{MN})\big)_{k}=\frac{1}{MN}\mathrm{tr}\big[\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{T}\big]$.  (31)
Proof.

The proof can be found in App. D. ∎

From the continuous mapping theorem [12], it holds that $\mathrm{SE}_{k}\approx\left(1-\frac{\tau}{\tau_{c}}\right)\log_{2}(1+\overline{\mathrm{SINR}}_{k})$, with the corresponding $\overline{\mathrm{SINR}}_{k}$ provided above, as $MN$ and $K\to\infty$.

V Spectral Efficiency Optimization

Note that the asymptotic SE approximations derived in the previous section depend only on large-scale parameters; therefore, we can formulate different asymptotic optimization problems. With the aim of increasing fairness in the network, we focus on the following max-min problem:

$\max_{\boldsymbol{W},\,\boldsymbol{\Phi}}\ \min_{k}\ \overline{\mathrm{SINR}}_{k}\quad\text{s.t.}\quad\big(|\boldsymbol{W}_{m}|\big)_{n,l}=\frac{1}{\sqrt{N}}$,  (32)

where the optimization variables are (i) the analog beamforming matrix $\boldsymbol{W}$ and (ii) the pilot matrix $\boldsymbol{\Phi}$, studied separately below.

V-A Analog Beamformer Design

The design of $\boldsymbol{W}=\mathrm{diag}\{\boldsymbol{W}_{m}\ \mathrm{for}\ m=1,\dots,M\}$ is challenging given the complexity of the SINR expressions; directly solving (32) therefore poses a major challenge. However, under perfect CSI, some algebraic properties of $\boldsymbol{W}_{m}$ can be extracted and used for its design. Concretely, we first disregard the unit-modulus constraint; via the singular value decomposition, $\boldsymbol{W}$ factorizes as $\boldsymbol{W}=\boldsymbol{U}\boldsymbol{Q}$ with $\boldsymbol{U}$ semi-unitary, i.e., $\boldsymbol{U}^{*}\boldsymbol{U}=\boldsymbol{I}_{ML}$.

Proposition 1.

Under perfect-CSI UL MMSE reception, any nonsingular $\boldsymbol{Q}$ provides the maximum SINR.

Proof.

The proof can be found in App. E. ∎

According to [13], $\sum_{i\neq k}^{K}|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}|^{2}p_{i}+\sigma^{2}\approx\sum_{i\neq k}^{K}|\boldsymbol{g}_{i}^{*}\boldsymbol{v}_{k}|^{2}p_{i}+\sigma^{2}$. Under the condition that this approximation is tight, the following proposition, similar to the result obtained in [9] for another metric, can be stated.

Proposition 2.

Under perfect-CSI DL RZF precoding, the SINR is maximized when $\boldsymbol{Q}$ is semi-unitary: $\boldsymbol{Q}\boldsymbol{Q}^{*}=\boldsymbol{I}_{ML}$.

Proof.

The proof can be found in App. F. ∎

To fulfill both propositions, for UL and DL, $\boldsymbol{Q}$ can be set to $\boldsymbol{Q}=\boldsymbol{I}_{ML}$, so that $\boldsymbol{W}=\boldsymbol{U}$, meaning that the analog matrix should have orthogonal columns; the intuition is that orthogonal columns reduce interference. To the best of our knowledge, there are two ways of smartly creating $\boldsymbol{W}$, explained in [8] and [9], respectively. While the latter relies on perfectly known channels, the former fails to capture the complete spectrum of the channel covariance matrices. In this work, we propose a method that takes into account all possible eigenvectors/eigenvalues of all $\boldsymbol{R}_{m,k}$, with the aim of maximizing the minimum average UE signal power, which is shown to maximize the minimum SINR in our simulations. More particularly, $\boldsymbol{R}_{m,k}=\boldsymbol{V}_{m,k}\boldsymbol{\Lambda}_{m,k}\boldsymbol{V}_{m,k}^{*}$, with $\boldsymbol{V}_{m,k}$ having orthonormal columns and $\boldsymbol{\Lambda}_{m,k}=\mathrm{diag}(\lambda_{m,k}^{(1)},\dots,\lambda_{m,k}^{(N)})$ containing the $N$ eigenvalues of $\boldsymbol{R}_{m,k}$. Note that the average signal power for UE $k$ is given by $\sum_{m\in\mathcal{F}_{k}}\mathrm{tr}(\boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m})$. This expression is maximized whenever the columns of $\boldsymbol{W}_{m}$ match the eigenvectors of $\boldsymbol{R}_{m,k}$. However, not all UEs and their respective eigenmodes can be captured by $\boldsymbol{W}_{m}$: a selection of $L$ out of $NK$ must be made. As a consequence, we define the UE average signal power as

$S_{k}=\sum_{m\in\mathcal{F}_{k}}\sum_{n=1}^{N}\alpha_{m,k}^{(n)}\lambda_{m,k}^{(n)}$,  (33)

where $\alpha_{m,k}^{(n)}$ is a binary optimization variable scheduling eigenvectors to the columns of $\boldsymbol{W}_{m}$. Therefore, the following optimization problem with respect to $\alpha_{m,k}^{(n)}$ can be formulated:

$\max_{\alpha_{m,k}^{(n)}}\ \min_{k}\ S_{k}\quad\text{s.t.}\quad\alpha_{m,k}^{(n)}\in\{0,1\},\quad\sum_{k=1}^{K}\sum_{n=1}^{N}\alpha_{m,k}^{(n)}\leq L.$  (34)

The reverse-delete algorithm efficiently solves (34) without an exhaustive search. The surviving $\alpha_{m,k}^{(n)}$ determine which eigenvectors of which users compose the columns of $\boldsymbol{W}_{m}$. However, note that $\boldsymbol{W}_{m}$ for $m=1,\dots,M$ does not necessarily have orthogonal columns given that, most likely, eigenvectors from multiple users will be used to construct the analog matrices. As a consequence, neither Prop. 1 nor Prop. 2 is satisfied. Thus, the final unconstrained analog beamformers are obtained as $\boldsymbol{W}_{m}^{(p)}=\mathcal{P}(\boldsymbol{W}_{m})$, where $\mathcal{P}(\boldsymbol{A}_{m})$ denotes the projection of matrix $\boldsymbol{A}_{m}$ onto an orthonormal basis.
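To make the eigenmode-scheduling idea in (33)-(34) concrete, the sketch below uses a simple greedy max-min heuristic (not the reverse-delete algorithm of the paper) at a single AP: the currently weakest user repeatedly receives its strongest unused eigenmode until the $L$ columns of $\boldsymbol{W}_{m}$ are filled. All eigenvalues are randomly generated toy data.

```python
import numpy as np

rng = np.random.default_rng(4)
K, N, L = 4, 8, 3   # UEs, antennas, RF chains at one AP (toy sizes)

# Eigenvalues lambda_{k}^{(n)} of R_{m,k} at one AP, sorted in decreasing order
lam = np.sort(rng.exponential(1.0, (K, N)))[:, ::-1]

# Greedy max-min heuristic: give the weakest user its best unused eigenmode
S = np.zeros(K)                    # accumulated signal power S_k, as in (33)
next_mode = np.zeros(K, dtype=int) # next unused eigenmode index per user
chosen = []                        # (user, eigenmode) pairs -> columns of W_m
for _ in range(L):                 # L columns available, cf. constraint in (34)
    k = int(np.argmin(S))          # currently weakest user
    n = int(next_mode[k])
    S[k] += lam[k, n]
    chosen.append((k, n))
    next_mode[k] += 1
```

The selected pairs index the eigenvectors $\boldsymbol{v}_{m,k}^{(n)}$ that would populate $\boldsymbol{W}_{m}$ before the orthonormal projection $\mathcal{P}(\cdot)$.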

Still, $\boldsymbol{W}_{m}^{(p)}$ cannot be implemented with phase shifters alone, i.e., its entries do not have the required constant modulus. Therefore, for a given $\boldsymbol{W}_{m}^{(p)}$, we aim at solving the following optimization problem:

$\min_{\boldsymbol{\hat{W}}_{m}}\ ||\boldsymbol{W}_{m}^{(p)}-\boldsymbol{\hat{W}}_{m}||_{\mathrm{F}}^{2}\quad\text{s.t.}\quad|[\boldsymbol{\hat{W}}_{m}]_{n,l}|=\frac{1}{\sqrt{N}}.$  (35)

Although the optimal solution is obtained by taking the phases of the entries of $\boldsymbol{W}_{m}^{(p)}$, the orthogonality between columns achieved by $\mathcal{P}(\cdot)$ would be broken. Therefore, we modify our receiver by adding an orthogonality compensation matrix to the digital processing [9]. More concretely, for a constrained analog beamformer $\boldsymbol{\hat{W}}_{m}$, its SVD is $\boldsymbol{\hat{W}}_{m}=\boldsymbol{\hat{U}}_{m}\boldsymbol{\hat{D}}_{m}\boldsymbol{\hat{V}}_{m}^{*}$. The orthogonality compensation matrix, denoted by $\boldsymbol{F}_{m}$, is defined as

$\boldsymbol{F}_{m}=\boldsymbol{\hat{V}}_{m}\boldsymbol{\hat{D}}_{m}^{-1}\boldsymbol{\hat{V}}_{m}^{*}$.  (36)

Adding such a compensation matrix allows us to improve the design of the analog matrix by exploiting the following proposition.

Proposition 3.

Assume that instead of $\boldsymbol{W}_{m}^{(p)}$, the new analog beamformer is $\boldsymbol{W}_{m}^{(p)}\boldsymbol{A}_{m}$, with $\boldsymbol{A}_{m}\in\mathbb{C}^{L\times L}$ nonsingular. The product $\boldsymbol{W}_{m}^{(p)}\boldsymbol{A}_{m}\boldsymbol{F}_{m}$ provides the same optimality as $\boldsymbol{W}_{m}^{(p)}$, and therefore $\boldsymbol{W}_{m}^{(p)}\boldsymbol{A}_{m}$ is an optimal unconstrained analog matrix.

Proof.

The proof can be found in App. G. ∎

Using the previous proposition, the initial unconstrained beamformer $\boldsymbol{W}_{m}^{(p)}$ can be replaced by $\boldsymbol{W}_{m}^{(p)}\boldsymbol{A}_{m}$ without performance degradation as long as $\boldsymbol{A}_{m}$ is nonsingular. As a consequence, we can formulate the following optimization problem:

$\min_{\boldsymbol{\hat{W}}_{m},\boldsymbol{A}_{m}}\ ||\boldsymbol{\hat{W}}_{m}-\boldsymbol{W}_{m}^{(p)}\boldsymbol{A}_{m}||_{\mathrm{F}}^{2}\quad\text{s.t.}\quad|[\boldsymbol{\hat{W}}_{m}]_{n,l}|=\frac{1}{\sqrt{N}}.$  (37)

Thanks to the degrees of freedom added by $\boldsymbol{A}_{m}$, the constrained analog beamformer $\boldsymbol{\hat{W}}_{m}$ can be made closer to the unconstrained one $\boldsymbol{W}_{m}^{(p)}$. Using alternating minimization, we split the problem into two sub-problems: (i) find the optimal $\boldsymbol{A}_{m}$ for fixed $\boldsymbol{\hat{W}}_{m}$, and (ii) find the optimal $\boldsymbol{\hat{W}}_{m}$ for fixed $\boldsymbol{A}_{m}$. The solutions to these subproblems are

$\boldsymbol{A}_{m}=\boldsymbol{W}_{m}^{(p)\,*}\boldsymbol{\hat{W}}_{m}$,  (38)
$\boldsymbol{\hat{W}}_{m}=\frac{1}{\sqrt{N}}\exp\big(j\angle(\boldsymbol{W}_{m}^{(p)}\boldsymbol{A}_{m})\big)$.  (39)

An iterative process based on the block coordinate descent method follows until convergence is reached [14]. The resulting constrained analog matrix then yields, via (36), the compensation matrix $\boldsymbol{F}_{m}$ applied in the baseband (digital) part. As a consequence, the equivalent channel between AP $m$ and UE $k$ has an extra component:

$\boldsymbol{g}_{m,k}=\boldsymbol{F}_{m}^{*}\boldsymbol{\hat{W}}_{m}^{*}\boldsymbol{h}_{m,k}$.  (40)
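The whole pipeline (alternating updates (38)-(39) followed by the compensation (36)) can be verified numerically. The sketch below (Python/NumPy; a random semi-unitary matrix stands in for $\boldsymbol{W}_{m}^{(p)}$, and the iteration count is illustrative) confirms that the compensated beamformer $\boldsymbol{\hat{W}}_{m}\boldsymbol{F}_{m}$ recovers orthonormal columns:

```python
import numpy as np

rng = np.random.default_rng(5)
N, L = 16, 4

# Unconstrained beamformer with orthonormal columns (stand-in for W_m^(p))
W_p, _ = np.linalg.qr(rng.standard_normal((N, L))
                      + 1j * rng.standard_normal((N, L)))

# Alternating minimization of ||W_hat - W_p A||_F^2 via updates (38)-(39)
W_hat = np.exp(1j * np.angle(W_p)) / np.sqrt(N)          # initial phase projection
for _ in range(100):
    A = W_p.conj().T @ W_hat                             # (38)
    W_hat = np.exp(1j * np.angle(W_p @ A)) / np.sqrt(N)  # (39)

# Orthogonality compensation matrix (36) from the SVD of W_hat
U, d, Vh = np.linalg.svd(W_hat, full_matrices=False)
F = Vh.conj().T @ np.diag(1.0 / d) @ Vh

# The compensated effective beamformer W_hat F has orthonormal columns
eff = W_hat @ F
```

Indeed, $(\boldsymbol{\hat{W}}_{m}\boldsymbol{F}_{m})^{*}(\boldsymbol{\hat{W}}_{m}\boldsymbol{F}_{m})=\boldsymbol{\hat{V}}_{m}\boldsymbol{\hat{V}}_{m}^{*}=\boldsymbol{I}_{L}$, while every entry of $\boldsymbol{\hat{W}}_{m}$ keeps modulus $1/\sqrt{N}$, so it remains implementable with phase shifters.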

V-B Pilot Assignment Optimization

The optimal solution to (32) with respect to $\boldsymbol{\Phi}$ requires an exhaustive search over the set of possible pilot sequences. However, based on the correlation between effective channels, $\Delta_{k,i}=\mathrm{tr}(\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{\Gamma}_{i}^{(g)})$ for $k\neq i$, an initial pilot assignment $\boldsymbol{\Phi}^{(0)}$ can be made. Particularly, a set of users is assigned the same pilot if their normalized cross-correlation, i.e., $\frac{\Delta_{k,i}}{\mathrm{tr}(\boldsymbol{\Gamma}_{k}^{(g)})\,\mathrm{tr}(\boldsymbol{\Gamma}_{i}^{(g)})}$, is minimized. Afterwards, the greedy algorithm proposed in Alg. 1, combined with the asymptotic approximations, iteratively updates the UE pilot assignment in a max-min SINR sense. Additionally, by construction, Alg. 1 converges since the cost function (i) is non-decreasing and (ii) is upper bounded.

Algorithm 1 Greedy pilot assignment
Input: set of available pilots $\mathcal{S}=\{s_{1},\dots,s_{|\mathcal{S}|}\}$ and initial pilot assignment $\boldsymbol{\Phi}^{(0)}$ at iteration $j=0$.
Define the cost function $\mu^{(0)}=\min_{k}\overline{\mathrm{SINR}}_{k}(\boldsymbol{\Phi}^{(0)})$.
while $\frac{\mu^{(j+1)}-\mu^{(j)}}{\mu^{(j)}}>\epsilon$ do
     For each UE $u=1,\dots,K$ solve
     $\phi_{u}^{(j+1)}=\arg\max_{s\in\mathcal{S}}\ \min_{k}\overline{\mathrm{SINR}}_{k}(\phi_{1}^{(j+1)},\dots,\phi_{u-1}^{(j+1)},s,\phi_{u+1}^{(j)},\dots,\phi_{K}^{(j)})$  (41)
     Update the cost function $\mu^{(j+1)}=\min_{k}\overline{\mathrm{SINR}}_{k}(\boldsymbol{\Phi}^{(j+1)})$
end while
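The coordinate-wise structure of Alg. 1 can be sketched as follows (Python; the `min_sinr` function is a toy stand-in for $\min_{k}\overline{\mathrm{SINR}}_{k}$ in which only pilot-sharing UEs interfere, with randomly generated cross-gains, not the deterministic equivalents of Sec. IV):

```python
import numpy as np

rng = np.random.default_rng(6)
K, S = 6, 3                             # UEs and available pilots (toy sizes)
beta = rng.uniform(0.1, 1.0, (K, K))    # stand-in cross-gains between UEs
np.fill_diagonal(beta, 0.0)

def min_sinr(assign):
    """Toy stand-in for min_k SINR_bar_k: UEs sharing a pilot interfere."""
    sinrs = []
    for k in range(K):
        interf = sum(beta[k, i] for i in range(K)
                     if i != k and assign[i] == assign[k])
        sinrs.append(1.0 / (0.1 + interf))
    return min(sinrs)

# Greedy coordinate updates over pilots, as in Alg. 1
assign = [k % S for k in range(K)]      # initial assignment Phi^(0)
mu0 = mu = min_sinr(assign)
eps = 1e-3
while True:
    for u in range(K):                  # re-optimize UE u's pilot, cf. (41)
        assign[u] = max(range(S),
                        key=lambda s: min_sinr(assign[:u] + [s] + assign[u+1:]))
    mu_new = min_sinr(assign)
    if (mu_new - mu) / mu <= eps:       # relative-improvement stopping rule
        break
    mu = mu_new
```

Each coordinate update keeps the current pilot among the candidates, so the cost is non-decreasing; together with the finite search space, this gives the convergence argument stated above.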

VI MM\to\infty Regime

Finally, we focus on the case $M\to\infty$. For simplicity, assume $\boldsymbol{M}^{(s)}=\boldsymbol{1}$ and recall that a fully digital structure provides the best performance in terms of SE, attained when $L=N$ and $\boldsymbol{W}_{m}=\boldsymbol{I}_{N}$. Then, the following can be derived.

Proposition 4.

Define the gap as the difference in SINR between the fully digital and hybrid structures. Then, there exist lower and upper bounds on the gap, denoted by $\delta_{\mathrm{LB}}$ and $\delta_{\mathrm{UB}}$, given by

$\delta_{\mathrm{LB}}=\frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\sum_{n=L+1}^{N}\lambda_{m,k}^{(n)}$,  (42)
$\delta_{\mathrm{UB}}=\frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\bigg(\sum_{n=1}^{L}\big(\lambda_{m,k}^{(n)}-\lambda_{m,k}^{(N-L+n)}\big)+\sum_{n=L+1}^{N}\lambda_{m,k}^{(n)}\bigg)$.  (43)
Proof.

The proof can be found in App. H.

Note that if the channel correlation matrices are rank-deficient, i.e. rank(𝑹m,k)L\mathrm{rank}(\boldsymbol{R}_{m,k})\leq L, the gap can be as small as zero, in which case a hybrid structure achieves the same performance as a fully digital one.
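As a numerical illustration of Prop. 4, the bounds (42)-(43) can be evaluated directly from the eigenvalues of the per-AP covariance matrices. The sketch below assumes the eigenvalues λm,k(n)\lambda_{m,k}^{(n)} are indexed in decreasing order and follows the index convention of (42)-(43); the helper name `sinr_gap_bounds` is ours.

```python
import numpy as np

def sinr_gap_bounds(R_list, L, p_k, sigma2):
    """Lower/upper bounds on the digital-vs-hybrid SINR gap (Prop. 4).

    R_list: per-AP channel covariance matrices R_{m,k} (N x N each);
    eigenvalues lambda_{m,k}^{(n)} are taken in decreasing order.
    """
    lb = ub = 0.0
    for R in R_list:
        lam = np.sort(np.linalg.eigvalsh(R))[::-1]  # decreasing order
        N = lam.size
        lb += lam[L:].sum()                         # sum over n = L+1..N
        ub += (lam[:L] - lam[N - L:]).sum() + lam[L:].sum()
    return (p_k / sigma2) * lb, (p_k / sigma2) * ub
```

In particular, for rank-deficient covariances with rank(𝑹m,k)L\mathrm{rank}(\boldsymbol{R}_{m,k})\leq L the lower bound evaluates to zero, matching the remark above.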

VII Simulation Results

For the purpose of performance evaluation, we consider a 200×200200\times 200 m2m^{2} wrap-around universe. To generate the channel model, we assume that the APs are deployed in urban environments at a height of around 10 m, matching the 3GPP Urban Microcell model in [15, Table B.1.2.1-1] at an operating frequency of 2 GHz. The shadowing terms from a given AP to different UEs exhibit a certain correlation, given by the model in [15, Table B.1.2.2.1-4]. The total number of channel uses is τc=200\tau_{c}=200. Unless otherwise specified, in order to take into account the effects of pilot contamination, we consider τ=8\tau=8 orthogonal pilots and K=16K=16 UEs (i.e. a pilot reuse factor of two). Additionally, each AP has N=32N=32 antennas. The UE transmit power is set to 200200 mW, σ2=96\sigma^{2}=-96 dBm and ρ=104\rho=10^{-4}. Moreover, to account for scalability, the [m,k][m,k] entry of 𝑴(s)\boldsymbol{{M}}^{(s)} is 1 if dm,kRmaxd_{m,k}\leq R_{\text{max}}, with Rmax=90R_{\text{max}}=90 m and dm,kd_{m,k} the Euclidean distance between AP mm and UE kk, which ensures connectivity to multiple APs per UE. Finally, ϵ=0.001\epsilon=0.001, ensuring enough iterations until convergence is reached.

The applicability of Theorems 1 and 2 to finite-dimensional systems is first verified in Figs. 1 and 2, where the approximations are denoted by RMT in the legend. For different network setups, corresponding to M=4M=4, N=32N=32, L=16L=16 and M=12M=12, N=32N=32, L=8L=8, the approximations obtained in Th. 1 and 2, respectively, are indeed accurate for K=16K=16 and τ=8\tau=8 orthogonal pilots.

Figure 1: Exact SE vs (1τ/τc)log2(1+SINR¯k)\left(1-{\tau}/{\tau_{c}}\right)\log_{2}(1+\overline{\mathrm{SINR}}_{k}) with SINR¯k\overline{\mathrm{SINR}}_{k} given in Th. 1.
Figure 2: Exact SE vs (1τ/τc)log2(1+SINR¯k)\left(1-{\tau}/{\tau_{c}}\right)\log_{2}(1+\overline{\mathrm{SINR}}_{k}) with SINR¯k\overline{\mathrm{SINR}}_{k} given in Th. 2.

In Fig. 3, we compare the UL pilot assignment obtained by Alg. 1 (Greedy) and a random assignment (RA) for different values of NN and LL. For N=L=16N=L=16 we assume a digital structure, while for N=32N=32 and L=8L=8 the analog matrices 𝑾^m\boldsymbol{\hat{W}}_{m} are obtained as described in Section V-A. There is a visible improvement after running the greedy algorithm when the set of available pilots 𝒮\mathcal{S} is composed of orthogonal pilots. Additionally, the improvement in minimum SE is about 60% and 90% for N=L=16N=L=16 and N=32N=32, L=8L=8, respectively. Similar results are obtained in the DL.

Figure 3: SE for the Greedy and RA pilot assignment schemes.

Next, we analyze the performance of our hybrid beamforming method compared to two existing techniques, referred to as SVD [8] and SLNR [9]. We measure the 95% outage SE, a key metric in wireless systems, for both the UL and DL in Figs. 4 and 5. Clearly, our method outperforms both references in the two links, with gains in the range of 1-8% in the UL and 10-35% in the DL.

Figure 4: 95% outage UL-SE for different analog methods as a function of LL for M=12M=12 and N=32N=32.
Figure 5: 95% outage DL-SE for different analog methods as a function of LL for M=12M=12 and N=32N=32.

VIII Conclusions

This paper has investigated the use of hybrid transceivers in CF massive MIMO setups. After deriving asymptotic approximations for both UL and DL, we focused on solving two problems: (i) the analog beamformer design and (ii) the pilot assignment. The solution to the first is shown to outperform state-of-the-art techniques, while the greedy pilot assignment substantially outperforms a RA. Finally, theoretical bounds on the gap between fully digital and hybrid structures are presented, showing that such a gap is highly dependent on the eigenvalues of the channel correlation matrices.

References

  • [1] H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-Free Massive MIMO Versus Small Cells,” IEEE Trans. Wireless Commun., vol. 16, pp. 1834–1850, Mar. 2017.
  • [2] E. Björnson and L. Sanguinetti, “Making Cell-Free Massive MIMO Competitive With MMSE Processing and Centralized Implementation,” IEEE Trans. Wireless Commun., vol. 19, pp. 77–90, Jan. 2020.
  • [3] E. Nayebi, A. Ashikhmin, T. L. Marzetta, H. Yang, and B. D. Rao, “Precoding and Power Optimization in Cell-Free Massive MIMO Systems,” IEEE Trans. on Wireless Commun., vol. 16, pp. 4445–4459, May 2017.
  • [4] M. Bashar, K. Cumanan, A. G. Burr, M. Debbah, and H. Q. Ngo, “On the Uplink Max–Min SINR of Cell-Free Massive MIMO Systems,” IEEE Trans. on Wireless Commun., vol. 18, pp. 2021–2036, Jan. 2019.
  • [5] M. Attarifar, A. Abbasfar, and A. Lozano, “Subset MMSE Receivers for Cell-Free Networks,” IEEE Trans. Wireless Commun., vol. 19, pp. 4183–4194, Jun. 2020.
  • [6] X. Gao, L. Dai, S. Han, C.-L. I, and R. W. Heath, “Energy-Efficient Hybrid Analog and Digital Precoding for MmWave MIMO Systems With Large Antenna Arrays,” IEEE Journal on Sel. Areas in Commun., vol. 34, pp. 998–1009, Mar. 2016.
  • [7] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially Sparse Precoding in Millimeter Wave MIMO Systems,” IEEE Trans. on Wireless Commun., vol. 13, pp. 1499–1513, Jan. 2014.
  • [8] G. Femenias and F. Riera-Palou, “Cell-Free Millimeter-Wave Massive MIMO Systems With Limited Fronthaul Capacity,” IEEE Access, vol. 7, pp. 44596–44612, Mar. 2019.
  • [9] S. Park, J. Park, A. Yazdan, and R. W. Heath, “Exploiting Spatial Channel Covariance for Hybrid Precoding in Massive MIMO Systems,” IEEE Trans. on Sig. Proc., vol. 65, pp. 3818–3832, May 2017.
  • [10] S. Wagner, R. Couillet, M. Debbah, and D. T. M. Slock, “Large System Analysis of Linear Precoding in Correlated MISO Broadcast Channels Under Limited Feedback,” IEEE Trans. on Inf. Th., vol. 58, pp. 4509–4537, Mar. 2012.
  • [11] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall PTR, 1st ed., 1993.
  • [12] H. B. Mann and A. Wald, “On Stochastic Limit and Order Relationships,” Annals of Mathematical Statistics, vol. 14, pp. 217–226, 1943.
  • [13] P. Patcharamaneepakorn, S. Armour, and A. Doufexi, “On the Equivalence Between SLNR and MMSE Precoding Schemes with Single-Antenna Receivers,” IEEE Commun. Lett., vol. 16, no. 7, pp. 1034–1037, 2012.
  • [14] Z. Luo and P. Tseng, “On the convergence of the coordinate descent method for convex differentiable minimization,” J. of Optimization Theory and Applications, vol. 72, pp. 7–35, Jan. 1992.
  • [15] “Rel. 9: Evolved Universal Terrestrial Radio Access (E-UTRA); Further advancements for E-UTRA physical layer aspects,” Tech. Rep. 36.814, 3GPP, Dec. 2017.

Appendix A

Theorem 3.

([10, Theorem 1]) Let 𝐃M×M\boldsymbol{{D}}\in\mathbb{C}^{M\times M} and 𝐒M×M\boldsymbol{{S}}\in\mathbb{C}^{M\times M} be Hermitian nonnegative-definite while 𝐇M×K\boldsymbol{{H}}\in\mathbb{C}^{M\times K} is a random matrix with zero-mean independent column vectors, 𝐡k\boldsymbol{h}_{k}, each with covariance matrix 1M𝐑k\frac{1}{M}\boldsymbol{\mathrm{R}}_{k}. Finally, 𝐃\boldsymbol{{D}} and 𝐑k\boldsymbol{{R}}_{k} have uniformly bounded spectral norm w.r.t. MM. For z>0z>0 and M,KM,K\to\infty,

1Mtr[𝑫(𝑯𝑯+𝑺+z𝑰M)1]1Mtr[𝑫𝑻]a.s.0,\frac{1}{M}\,\mathrm{tr}\!\left[\boldsymbol{{D}}\big{(}\boldsymbol{{H}}\boldsymbol{{H}}^{*}+\boldsymbol{{S}}+z\boldsymbol{{I}}_{M})^{-1}\right]-\frac{1}{M}\,\mathrm{tr}[\boldsymbol{{D}}\boldsymbol{{T}}]\stackrel{{\scriptstyle\text{a.s.}}}{{\to}}0,

where

𝑻=(1Mj=1K𝑹j1+ej+𝑺+z𝑰M)1\boldsymbol{{T}}=\bigg{(}\frac{1}{M}\sum\limits_{j=1}^{K}\frac{\boldsymbol{{R}}_{j}}{1+e_{j}}+\boldsymbol{{S}}+z\boldsymbol{{I}}_{M}\bigg{)}^{\!-1} (44)

with coefficients e_{k}=\lim_{n\to\infty}e_{k}^{(n)} for

ek(n)=1Mtr[𝑹k(1Mj=1K𝑹j1+ej(n1)+𝑺+z𝑰M)1]e_{k}^{(n)}=\frac{1}{M}\,\mathrm{tr}\!\left[\boldsymbol{{R}}_{k}\bigg{(}\frac{1}{M}\sum\limits_{j=1}^{K}\frac{\boldsymbol{{R}}_{j}}{1+e_{j}^{(n-1)}}+\boldsymbol{{S}}+z\boldsymbol{{I}}_{M}\bigg{)}^{\!-1}\right] (45)

with initial values ek(0)=Me_{k}^{(0)}=M.
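The fixed point in Th. 3 is straightforward to implement: iterate (45) until the coefficients eke_{k} stabilize, then evaluate (44). A minimal sketch with a hypothetical helper name:

```python
import numpy as np

def deterministic_equivalent(D, R_list, S, z, tol=1e-10, max_iter=500):
    """Fixed-point iteration of Th. 3 ([10, Th. 1]).

    Returns (1/M) tr[D T] with T as in (44), together with the
    converged coefficients e_k obtained by iterating (45).
    """
    M = D.shape[0]
    e = np.full(len(R_list), float(M))  # initial values e_k^{(0)} = M
    for _ in range(max_iter):
        A = sum(R / (1.0 + ek) for R, ek in zip(R_list, e)) / M
        T = np.linalg.inv(A + S + z * np.eye(M))
        e_new = np.array([np.trace(R @ T).real / M for R in R_list])
        if np.max(np.abs(e_new - e)) < tol:
            e = e_new
            break
        e = e_new
    return np.trace(D @ T).real / M, e
```

As a sanity check, for 𝑹k=𝑰\boldsymbol{{R}}_{k}=\boldsymbol{I} and 𝑺=𝟎\boldsymbol{{S}}=\boldsymbol{0} the fixed point collapses to the scalar equation e=1/((K/M)/(1+e)+z)e=1/\big((K/M)/(1+e)+z\big), which the iteration recovers.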

Appendix B

Theorem 4.

([10, Theorem 2]) Let 𝚽M×M\boldsymbol{\Phi}\in\mathbb{C}^{M\times M} be Hermitian nonnegative-definite. Under the same conditions as Th. 3, for M,KM,K\to\infty,

1Mtr[𝑫(𝑯𝑯+𝑺+z𝑰M)1𝚽(𝑯𝑯+𝑺+z𝑰M)1]\displaystyle\frac{1}{M}\,\mathrm{tr}\!\left[\boldsymbol{{D}}\big{(}\boldsymbol{{H}}\boldsymbol{{H}}^{*}+\boldsymbol{{S}}+z\boldsymbol{{I}}_{M})^{-1}\boldsymbol{\Phi}\big{(}\boldsymbol{{H}}\boldsymbol{{H}}^{*}+\boldsymbol{{S}}+z\boldsymbol{{I}}_{M})^{-1}\right]-
1Mtr[𝑫𝑻(z,𝚽)]a.s.0,\displaystyle\frac{1}{M}\,\mathrm{tr}[\boldsymbol{{D}}\boldsymbol{T}^{{}^{\prime}}(z,\boldsymbol{\Phi})]\stackrel{{\scriptstyle\text{a.s.}}}{{\to}}0,

where 𝐓(z,𝚽)\boldsymbol{T}^{{}^{\prime}}(z,\boldsymbol{\Phi}) is defined as

𝑻(z,𝚽)=𝑻𝚽𝑻+𝑻1Mk=1K𝑹kek(z,𝚽)(1+ek)2𝑻\boldsymbol{T}^{{}^{\prime}}(z,\boldsymbol{\Phi})=\boldsymbol{{T}}\boldsymbol{\Phi}\boldsymbol{{T}}+\boldsymbol{{T}}\frac{1}{M}\sum\limits_{k=1}^{K}\frac{\boldsymbol{R}_{k}e_{k}^{{}^{\prime}}(z,\boldsymbol{\Phi})}{(1+e_{k})^{2}}\boldsymbol{{T}} (46)

with 𝐓\boldsymbol{{T}} and eke_{k} given in Th. 3 for particular zz and 𝐞(z,𝚽)=(e1(z),,eK(z))\boldsymbol{e}^{{}^{\prime}}(z,\boldsymbol{\Phi})=\big{(}e_{1}^{{}^{\prime}}(z),\dots,e_{K}^{{}^{\prime}}(z)\big{)} calculated as

𝒆(z,𝚽)=(𝑰𝑱(z))1𝒗(z,𝚽)\displaystyle\boldsymbol{e}^{{}^{\prime}}(z,\boldsymbol{\Phi})=\big{(}\boldsymbol{I}-\boldsymbol{J}(z)\big{)}^{-1}\boldsymbol{v}(z,\boldsymbol{\Phi}) (47)

with 𝐉(z)K×K\boldsymbol{J}(z)\in\mathbb{C}^{K\times K} and 𝐯(z)K×1\boldsymbol{v}(z)\in\mathbb{C}^{K\times 1} defined as

(𝑱(z))k,l=1Mtr[𝑹k𝑻𝑹l𝑻]M(1+el)2\displaystyle\big{(}\boldsymbol{J}(z)\big{)}_{k,l}=\frac{\frac{1}{M}\mathrm{tr}\big{[}\boldsymbol{R}_{k}\boldsymbol{{T}}\boldsymbol{R}_{l}\boldsymbol{{T}}\big{]}}{M(1+e_{l})^{2}} (48)

and

(𝒗(z,𝚽))k=1Mtr[𝑹k𝑻𝚽𝑻]\displaystyle\big{(}\boldsymbol{v}(z,\boldsymbol{\Phi})\big{)}_{k}=\frac{1}{M}\mathrm{tr}\big{[}\boldsymbol{R}_{k}\boldsymbol{{T}}\boldsymbol{\Phi}\boldsymbol{{T}}\big{]} (49)
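Given 𝑻\boldsymbol{{T}} and the eke_{k} from Th. 3, the quantities of Th. 4 reduce to a single K×KK\times K linear solve. A minimal sketch (helper name ours) builds 𝑱\boldsymbol{J} from (48) and 𝒗\boldsymbol{v} from (49), solves (47) for 𝒆\boldsymbol{e}^{\prime}, and assembles (46):

```python
import numpy as np

def derivative_equivalent(T, e, R_list, Phi):
    """T'(z, Phi) from Th. 4 ([10, Th. 2]), given T and e from Th. 3."""
    M = T.shape[0]
    K = len(R_list)
    J = np.empty((K, K))
    v = np.empty(K)
    for k, Rk in enumerate(R_list):
        v[k] = np.trace(Rk @ T @ Phi @ T).real / M          # (49)
        for l, Rl in enumerate(R_list):
            J[k, l] = (np.trace(Rk @ T @ Rl @ T).real / M) / (M * (1 + e[l]) ** 2)  # (48)
    e_prime = np.linalg.solve(np.eye(K) - J, v)             # (47)
    correction = sum(R * ep / (1 + ek) ** 2
                     for R, ep, ek in zip(R_list, e_prime, e)) / M
    return T @ Phi @ T + T @ correction @ T                 # (46)
```

For scalar (multiple-of-identity) inputs the result can be verified by hand, since every matrix commutes and (46)-(49) collapse to scalar recursions.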

Appendix C Proof of Th. 1

Let us define matrices 𝑷=diag{p1,,pK}\boldsymbol{P}=\text{diag}\{p_{1},\dots,p_{K}\},

𝛀=|k|N((𝑴(s)𝑮^k)𝑷(𝑴(s)𝑮^k)+𝚺k)1,\displaystyle\boldsymbol{\Omega}=|\mathcal{F}_{k}|N\left(\big{(}\boldsymbol{{M}}^{(s)}\circ\boldsymbol{{\hat{G}}}_{k}\big{)}\boldsymbol{{P}}{}\big{(}\boldsymbol{{M}}^{(s)}{}\circ\boldsymbol{{\hat{G}}}_{k}\big{)}^{*}+\boldsymbol{\Sigma}_{k}\right)^{\!-1}, (50)
𝛀k\displaystyle\boldsymbol{\Omega}_{k} =((𝑴(s)𝑮^k)𝑷(𝑴(s)𝑮^k)\displaystyle=\bigg{(}\big{(}\boldsymbol{{M}}^{(s)}\circ\boldsymbol{{\hat{G}}}_{k}\big{)}\boldsymbol{{P}}{}\big{(}\boldsymbol{{M}}^{(s)}{}\circ\boldsymbol{{\hat{G}}}_{k}\big{)}^{*}-
(𝒎k(s)𝒈^k)(𝒎k(s)𝒈^k)pk+𝚺k)1,\displaystyle\mspace{22.0mu}\big{(}\boldsymbol{m}_{k}^{(s)}\circ\boldsymbol{\hat{g}}_{k}\big{)}\big{(}\boldsymbol{m}_{k}^{(s)}\circ\boldsymbol{\hat{g}}_{k}\big{)}^{*}p_{k}+\boldsymbol{\Sigma}_{k}\bigg{)}^{\!-1}, (51)

and 𝛀k=|k|N𝛀k\boldsymbol{\Omega}_{k}^{{}^{\prime}}=|\mathcal{F}_{k}|N\boldsymbol{\Omega}_{k}. Then, (13) can be written as

SINRk\displaystyle\mathrm{SINR}_{k} =𝒈^k𝛀k𝒈^kpk\displaystyle=\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{k}\boldsymbol{\hat{g}}_{k}\,p_{k} (52)
=pk|k|Ntr[𝒈^k𝒈^k𝛀k].\displaystyle=\frac{p_{k}}{|\mathcal{F}_{k}|N}\,\mathrm{tr}\!\left[\boldsymbol{\hat{g}}_{k}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{k}^{{}^{\prime}}\right]. (53)

For |\mathcal{F}_{k}|N, |\mathcal{U}_{m}|\to\infty for all k,mk,m, we have

pk|k|Ntr[𝒈^k𝒈^k𝛀k]\displaystyle\frac{p_{k}}{|\mathcal{F}_{k}|N}\mathrm{tr}\bigg{[}\boldsymbol{\hat{g}}_{k}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{k}^{{}^{\prime}}\bigg{]} (a)pk|k|Ntr[𝚪k(g)𝛀]\displaystyle\stackrel{{\scriptstyle\text{(a)}}}{{\approx}}\frac{p_{k}}{|\mathcal{F}_{k}|N}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{\Omega}\big{]} (54)
(b)pk|k|Ntr[𝚪k(g)𝑻k].\displaystyle\stackrel{{\scriptstyle\text{(b)}}}{{\approx}}\frac{p_{k}}{|\mathcal{F}_{k}|N}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{{T}}_{k}\big{]}. (55)

where (a) follows from [10, Lemmas 4 and 6] and (b) is obtained by applying Th. 3 with the substitutions (i) 𝑫=𝚪k(g)pk\boldsymbol{{D}}=\boldsymbol{\Gamma}_{k}^{(g)}\,p_{k}, (ii) 𝑹j=𝚪j(g)pj\boldsymbol{{R}}_{j}=\boldsymbol{\Gamma}_{j}^{(g)}\,p_{j}, and (iii) 𝑺+z𝑰M=1|k|N𝚺k\boldsymbol{{S}}+z\boldsymbol{{I}}_{M}=\frac{1}{|\mathcal{F}_{k}|N}\boldsymbol{\Sigma}_{k}, where 𝑻k\boldsymbol{{T}}_{k} is defined as

𝑻k=(1|k|Ni=1K𝚪i(g)1+eipi+1|k|N𝚺k)1.\boldsymbol{{T}}_{k}=\bigg{(}\frac{1}{|\mathcal{F}_{k}|N}\sum\limits_{i=1}^{K}\frac{\boldsymbol{\Gamma}_{i}^{(g)}}{1+e_{i}}\,p_{i}+\frac{1}{|\mathcal{F}_{k}|N}\boldsymbol{\Sigma}_{k}\bigg{)}^{\!-1}. (56)

The necessary coefficients can be calculated as ej=limnej(n)e_{j}=\lim_{n\to\infty}e_{j}^{(n)} with

ej(n)\displaystyle e_{j}^{(n)} =pj|j|Ntr[𝚪j(g)(1|j|Ni=1K𝚪i(g)1+eipi+1|j|N𝚺k)1].\displaystyle=\frac{p_{j}}{|\mathcal{F}_{j}|N}\,\mathrm{tr}\Bigg{[}\boldsymbol{\Gamma}_{j}^{(g)}\bigg{(}\frac{1}{|\mathcal{F}_{j}|N}\sum\limits_{i=1}^{K}\frac{\boldsymbol{\Gamma}_{i}^{(g)}}{1+e_{i}}\,p_{i}+\frac{1}{|\mathcal{F}_{j}|N}\boldsymbol{\Sigma}_{k}\bigg{)}^{\!-1}\Bigg{]}. (57)

The fixed-point algorithm can be used to compute ej(n)e_{j}^{(n)} and has been proven to converge [10]. Finally, given that all matrices involved in SINR¯k\overline{\mathrm{SINR}}_{k} are block-diagonal, i.e. 𝑻k=diag{𝑻m,kformk}\boldsymbol{{T}}_{k}=\mathrm{diag}\{\boldsymbol{T}_{m,k}\mspace{4.0mu}\mathrm{for}\mspace{4.0mu}m\in\mathcal{F}_{k}\}, the expression in (19) is obtained, where 𝑻m,k\boldsymbol{T}_{m,k} is defined in (20).

Appendix D Proof of Th. 2

From Eq. (18), we can derive an approximation for each of the terms in the numerator and denominator, respectively. To keep the notation light, we denote by 𝐠^k=𝒎k(s)𝒈^k\boldsymbol{\bf\hat{g}}_{k}=\boldsymbol{m}_{k}^{(s)}\circ\boldsymbol{\hat{g}}_{k} the sparse version of the channel. We also define 𝛀=[𝐆^𝐆^+ρ𝑰]1=1MN𝛀\boldsymbol{\Omega}=\big{[}\boldsymbol{\bf\hat{G}}\boldsymbol{\bf\hat{G}}^{*}+\rho\boldsymbol{I}\big{]}^{-1}=\frac{1}{MN}\boldsymbol{\Omega}^{{}^{\prime}} with 𝛀=[1MN𝐆^𝐆^+ρMN𝑰]1\boldsymbol{\Omega}^{{}^{\prime}}=\big{[}\frac{1}{MN}\boldsymbol{\bf\hat{G}}\boldsymbol{\bf\hat{G}}^{*}+\frac{\rho}{MN}\boldsymbol{I}\big{]}^{-1}. Denote by 𝛀k\boldsymbol{\Omega}_{k} and 𝛀k\boldsymbol{\Omega}_{k}^{{}^{\prime}} the counterparts of 𝛀\boldsymbol{\Omega} and 𝛀\boldsymbol{\Omega}^{{}^{\prime}} after removing the contribution of UE kk (the same applies to 𝛀k,i\boldsymbol{\Omega}_{k,i}, where the contributions of UEs kk and ii are removed). We first calculate the value of λk\lambda_{k}, which ensures that 𝔼{𝑾𝒗k2}=1\mathbb{E}\{||\boldsymbol{W}\boldsymbol{v}_{k}||^{2}\}=1.

λk=1𝔼{𝐠^k𝛀𝑾𝑾𝛀𝐠^k}\displaystyle\lambda_{k}=\frac{1}{\sqrt{\mathbb{E}\{\boldsymbol{\bf\hat{g}}_{k}^{*}\boldsymbol{\Omega}\boldsymbol{W}^{*}\boldsymbol{W}\boldsymbol{\Omega}\boldsymbol{\bf\hat{g}}_{k}\}}} (58)

The term inside the square root can be asymptotically approximated for large MNMN and KK as follows:

𝐠^k𝛀𝑾𝑾𝛀𝐠^k\displaystyle\boldsymbol{\bf\hat{g}}_{k}^{*}\boldsymbol{\Omega}\boldsymbol{W}^{*}\boldsymbol{W}\boldsymbol{\Omega}\boldsymbol{\bf\hat{g}}_{k} =𝐠^k𝛀k𝑾𝑾𝛀k𝐠^k(1+𝐠^k𝛀k𝐠^k)2\displaystyle=\frac{\boldsymbol{\bf\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{k}\boldsymbol{W}^{*}\boldsymbol{W}\boldsymbol{\Omega}_{k}\boldsymbol{\bf\hat{g}}_{k}}{(1+\boldsymbol{\bf\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{k}\boldsymbol{\bf\hat{g}}_{k})^{2}} (59)
(a)1(MN)2tr[𝚪k(g)𝛀k𝑾𝑾𝛀k](1+1MNtr[𝚪k(g)𝛀k])2\displaystyle\stackrel{{\scriptstyle\text{(a)}}}{{\approx}}\frac{\frac{1}{(MN)^{2}}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{\Omega}_{k}^{{}^{\prime}}\boldsymbol{W}^{*}\boldsymbol{W}\boldsymbol{\Omega}_{k}^{{}^{\prime}}\big{]}}{(1+\frac{1}{MN}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{\Omega}_{k}^{{}^{\prime}}\big{]})^{2}} (60)
(b)1(MN)2tr[𝚪k(g)𝑻(ρMN,𝑾𝑾)](1+1MNtr[𝚪k(g)𝑻])2\displaystyle\stackrel{{\scriptstyle\text{(b)}}}{{\approx}}\frac{\frac{1}{(MN)^{2}}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}^{{}^{\prime}}(\frac{\rho}{MN},\boldsymbol{W}^{*}\boldsymbol{W})\big{]}}{(1+\frac{1}{MN}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}\big{]})^{2}} (61)
=(c)δk(1+μk)2\displaystyle\stackrel{{\scriptstyle\text{(c)}}}{{=}}\frac{\delta_{k}}{(1+\mu_{k})^{2}} (62)

where (a) is obtained using [10, Lemma 4] and the fact that 𝛀k=1MN𝛀k\boldsymbol{\Omega}_{k}=\frac{1}{MN}\boldsymbol{\Omega}_{k}^{{}^{\prime}}, and (b) results from [10, Lemma 6] after applying Th. 4 and Th. 3 in the numerator and denominator, respectively, with 𝑫=𝚪k(g)\boldsymbol{D}=\boldsymbol{\Gamma}_{k}^{(g)}, 𝚽=𝑾𝑾\boldsymbol{\Phi}=\boldsymbol{W}^{*}\boldsymbol{W}, 𝑺=𝟎\boldsymbol{S}=\boldsymbol{0}, z=ρMNz=\frac{\rho}{MN}. Finally, (c) defines the values of δk=1(MN)2tr[𝚪k(g)𝑻(ρMN,𝑾𝑾)]\delta_{k}=\frac{1}{(MN)^{2}}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}^{{}^{\prime}}(\frac{\rho}{MN},\boldsymbol{W}^{*}\boldsymbol{W})\big{]} and μk=1MNtr[𝚪k(g)𝑻]\mu_{k}=\frac{1}{MN}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}\big{]}, as they will be used repeatedly later. As a consequence, from the continuous mapping theorem:

λk1δk(1+μk)2\displaystyle\lambda_{k}\approx\frac{1}{\sqrt{\frac{\delta_{k}}{(1+\mu_{k})^{2}}}} (63)

For the numerator of (18), given by |𝔼{𝒈k𝒗k}|2|\mathbb{E}\{\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}\}|^{2}, we can compute an approximated deterministic equivalent for the term inside the expectation in a similar manner as for λk\lambda_{k}:

𝒈k𝒗k\displaystyle\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k} =λk𝒈k𝛀𝐠^k\displaystyle=\lambda_{k}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}\boldsymbol{\bf\hat{g}}_{k} (64)
=(a)λk𝒈k𝛀k𝐠^k1+𝒈k𝛀k𝐠^k\displaystyle\stackrel{{\scriptstyle\text{(a)}}}{{=}}\lambda_{k}\frac{\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{k}\boldsymbol{\bf\hat{g}}_{k}}{1+\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{k}\boldsymbol{\bf\hat{g}}_{k}} (65)
(b)λk1MNtr[𝚪k(g)𝛀k]1+1MNtr[𝚪k(g)𝛀k]\displaystyle\stackrel{{\scriptstyle\text{(b)}}}{{\approx}}\lambda_{k}\frac{\frac{1}{MN}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{\Omega}_{k}^{{}^{\prime}}\big{]}}{1+\frac{1}{MN}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{\Omega}_{k}^{{}^{\prime}}\big{]}} (66)
(c)λkμk1+μk\displaystyle\stackrel{{\scriptstyle\text{(c)}}}{{\approx}}\lambda_{k}\frac{\mu_{k}}{1+\mu_{k}} (67)

where (a) follows from [10, Lemma 1], and (b) is derived by applying [10, Lemma 4] and the fact that 𝛀k=1MN𝛀k\boldsymbol{\Omega}_{k}=\frac{1}{MN}\boldsymbol{\Omega}_{k}^{{}^{\prime}}. Finally, (c) is obtained from the definition of μk\mu_{k} derived previously. From the continuous mapping theorem and substituting the value of λk\lambda_{k} provided in (63), the numerator therefore has an approximate value of

|𝔼{𝒈k𝒗k}|2\displaystyle|\mathbb{E}\{\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}\}|^{2} λk2μk2(1+μk)2\displaystyle\approx\lambda_{k}^{2}\frac{\mu_{k}^{2}}{(1+\mu_{k})^{2}} (68)
=μk2δk\displaystyle=\frac{\mu_{k}^{2}}{\delta_{k}} (69)

For the interfering terms 𝔼{|𝒈k𝒗i|2}\mathbb{E}\{|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}|^{2}\} we can proceed similarly and obtain a deterministic approximation by considering the term inside the expectation as follows:

|𝒈k𝒗i|2\displaystyle|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}|^{2} =λi2|𝒈k𝛀𝐠^i|2\displaystyle=\lambda_{i}^{2}|\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}\boldsymbol{\bf\hat{g}}_{i}|^{2} (70)
=(a)λi2|𝒈k𝛀i𝐠^i|2(1+𝐠^i𝛀i𝐠^i)2\displaystyle\stackrel{{\scriptstyle\text{(a)}}}{{=}}\lambda_{i}^{2}\frac{|\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i}\boldsymbol{\bf\hat{g}}_{i}|^{2}}{(1+\boldsymbol{\bf\hat{g}}_{i}^{*}\boldsymbol{\Omega}_{i}\boldsymbol{\bf\hat{g}}_{i})^{2}} (71)
=(b)λi2|1MN𝒈k𝛀i𝐠^i|2(1+1MN𝐠^i𝛀i𝐠^i)2\displaystyle\stackrel{{\scriptstyle\text{(b)}}}{{=}}\lambda_{i}^{2}\frac{|\frac{1}{MN}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{\bf\hat{g}}_{i}|^{2}}{(1+\frac{1}{MN}\boldsymbol{\bf\hat{g}}_{i}^{*}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{\bf\hat{g}}_{i})^{2}} (72)
(c)λi2|1MN𝒈k𝛀i𝐠^i|2(1+μi)2\displaystyle\stackrel{{\scriptstyle\text{(c)}}}{{\approx}}\lambda_{i}^{2}\frac{|\frac{1}{MN}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{\bf\hat{g}}_{i}|^{2}}{(1+\mu_{i})^{2}} (73)
=(d)1δi|1MN𝒈k𝛀i𝐠^i|2\displaystyle\stackrel{{\scriptstyle\text{(d)}}}{{=}}\frac{1}{\delta_{i}}|\frac{1}{MN}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{\bf\hat{g}}_{i}|^{2} (74)

where (a) follows from [10, Lemma 1], (b) substitutes 𝛀i=1MN𝛀i\boldsymbol{\Omega}_{i}=\frac{1}{MN}\boldsymbol{\Omega}_{i}^{{}^{\prime}}, (c) applies the definition of μk\mu_{k} in the denominator and (d) substitutes the value of λi\lambda_{i} previously derived.

To obtain a deterministic equivalent for the previous equation, first note that

|1MN𝒈k𝛀i𝐠^i|21(MN)2𝒈k𝛀i𝚪i(g)𝛀i𝒈k\displaystyle|\frac{1}{MN}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{\bf\hat{g}}_{i}|^{2}\approx\frac{1}{(MN)^{2}}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{g}_{k} (75)

being a direct consequence of [10, Lemma 4]. After applying the matrix inversion lemma to 𝛀i\boldsymbol{\Omega}_{i}^{{}^{\prime}} to remove its dependency on UE kk, we obtain that

𝛀i=𝛀i,k1MN𝛀i,k𝒈^k𝒈^k𝛀i,k1+1MN𝒈^k𝛀i,k𝒈^k\displaystyle\boldsymbol{\Omega}_{i}^{{}^{\prime}}=\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}-\frac{\frac{1}{MN}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}}{1+\frac{1}{MN}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k}} (76)

Substituting (76) in (75) yields the following:

1(MN)2𝒈k𝛀i𝚪i(g)𝛀i𝒈k=T1+T2+T3\displaystyle\frac{1}{(MN)^{2}}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i}^{{}^{\prime}}\boldsymbol{g}_{k}=\mathrm{T}_{1}+\mathrm{T}_{2}+\mathrm{T}_{3} (77)

where each of the terms is provided below:

T1\displaystyle\mathrm{T}_{1} =1(MN)2𝒈k𝛀i,k𝚪i(g)𝛀i,k𝒈k\displaystyle=\frac{1}{(MN)^{2}}\boldsymbol{g}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{g}_{k} (78)
\displaystyle\stackrel{{\scriptstyle\text{(a)}}}{{\approx}}\frac{1}{(MN)^{2}}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}^{{}^{\prime}}(\frac{\rho}{MN},\boldsymbol{\Gamma}_{i}^{(g)})\big{]} (79)

where (a) combines [10, Lemma 4] and Th. 4 with the substitutions 𝑫=𝚪k(g)\boldsymbol{D}=\boldsymbol{\Gamma}_{k}^{(g)}, 𝚽=𝚪i(g)\boldsymbol{\Phi}=\boldsymbol{\Gamma}_{i}^{(g)}, 𝑺=𝟎\boldsymbol{S}=\boldsymbol{0}, z=ρMNz=\frac{\rho}{MN}. In addition,

T2\displaystyle\mathrm{T}_{2} =1(MN)21(MN)2|𝒈^k𝛀i,k𝒈k|2𝒈^k𝛀i,k𝚪i(g)𝛀i,k𝒈^k(1+1MN𝒈^k𝛀i,k𝒈^k)2\displaystyle=\frac{1}{(MN)^{2}}\frac{\frac{1}{(MN)^{2}}|\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{g}_{k}|^{2}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k}}{(1+\frac{1}{MN}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k})^{2}} (80)
(a)1MNμk21MN𝒈^k𝛀i,k𝚪i(g)𝛀i,k𝒈^k(1+μk)2\displaystyle\stackrel{{\scriptstyle\text{(a)}}}{{\approx}}\frac{1}{MN}\frac{\mu_{k}^{2}\frac{1}{MN}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k}}{(1+\mu_{k})^{2}} (81)
(b)1MNμk21MNtr[𝚪k(g)𝑻(ρMN,𝚪i(g))](1+μk)2\displaystyle\stackrel{{\scriptstyle\text{(b)}}}{{\approx}}\frac{1}{MN}\frac{\mu_{k}^{2}\frac{1}{MN}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}^{{}^{\prime}}(\frac{\rho}{MN},\boldsymbol{\Gamma}_{i}^{(g)})\big{]}}{(1+\mu_{k})^{2}} (82)

where (a) comes from the definition of μk\mu_{k}, and (b) arises from applying [10, Lemma 6] and Th. 4 to the term 1MN𝒈^k𝛀i,k𝚪i(g)𝛀i,k𝒈^k\frac{1}{MN}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k} with the same substitutions as for T1\mathrm{T}_{1}. Finally, the last term can be computed as

T3\displaystyle\mathrm{T}_{3} =2(MN)2{1MN𝒈^k𝛀i,k𝒈k𝒈k𝛀i,k𝚪i(g)𝛀i,k𝒈^k1+1MN𝒈^k𝛀i,k𝒈^k}\displaystyle=-\frac{2}{(MN)^{2}}\mathbb{R}\bigg{\{}\frac{\frac{1}{MN}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{{g}}_{k}\boldsymbol{{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k}}{1+\frac{1}{MN}\boldsymbol{\hat{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k}}\bigg{\}} (83)
\displaystyle\stackrel{{\scriptstyle\text{(a)}}}{{\approx}}-\frac{2}{MN}\mathbb{R}\bigg{\{}\frac{\mu_{k}\frac{1}{MN}\boldsymbol{{g}}_{k}^{*}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\Gamma}_{i}^{(g)}\boldsymbol{\Omega}_{i,k}^{{}^{\prime}}\boldsymbol{\hat{g}}_{k}}{1+\mu_{k}}\bigg{\}} (84)
\displaystyle\stackrel{{\scriptstyle\text{(b)}}}{{\approx}}-\frac{2}{MN}\mathbb{R}\bigg{\{}\frac{\mu_{k}\frac{1}{MN}\mathrm{tr}\big{[}\boldsymbol{\Gamma}_{k}^{(g)}\boldsymbol{T}^{{}^{\prime}}(\frac{\rho}{MN},\boldsymbol{\Gamma}_{i}^{(g)})\big{]}}{1+\mu_{k}}\bigg{\}} (85)

where (a) is obtained from the definition of μk\mu_{k} and (b) follows the same steps as in (b) for T2\mathrm{T}_{2}. Consequently, the interfering terms admit an asymptotic approximation as follows:

\displaystyle\mathbb{E}\{|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}|^{2}\}\approx\frac{\theta_{k,i}}{\delta_{i}} (86)

with θk,i=T1+T2+T3\theta_{k,i}=\mathrm{T}_{1}+\mathrm{T}_{2}+\mathrm{T}_{3}.

Finally, the term var(𝒈k𝒗k)\mathrm{var}(\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}) can be shown to vanish in the asymptotic regime:

\displaystyle\mathrm{var}(\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k})=\mathbb{E}\{|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}|^{2}\}-|\mathbb{E}\{\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}\}|^{2} (87)
\displaystyle\approx\bigg{(}\lambda_{k}\frac{\mu_{k}}{1+\mu_{k}}\bigg{)}^{2}-\bigg{(}\lambda_{k}\frac{\mu_{k}}{1+\mu_{k}}\bigg{)}^{2}=0. (88)

As a consequence, the result in Th. 2 is obtained.

Appendix E Proof of Prop. 1

From Eq. (13), under perfect CSI it can be shown that the SINR achieved by UE kk is:

SINRk=𝒈k(ikK𝒈i𝒈ipi+𝚺k)1𝒈k,\displaystyle\mathrm{SINR}_{k}=\boldsymbol{{g}}_{k}^{*}\bigg{(}\sum\limits_{i\neq k}^{K}\boldsymbol{{g}}_{i}\boldsymbol{{g}}_{i}^{*}p_{i}+\boldsymbol{\Sigma}_{k}\bigg{)}^{-1}\boldsymbol{{g}}_{k}, (89)

where, for simplicity, we assume that 𝑴(s)=𝟏\boldsymbol{{M}}^{(s)}=\boldsymbol{1}, though the same analysis and conclusions are valid for subsets of APs and UEs. Therefore, 𝚺k\boldsymbol{\Sigma}_{k} is a block diagonal matrix 𝚺k=diag{𝚺k,mL×Lformk}\boldsymbol{\Sigma}_{k}=\mathrm{diag}\{\boldsymbol{\Sigma}_{k,m}\in\mathbb{C}^{L\times L}\mspace{4.0mu}\mathrm{for}\mspace{4.0mu}m\in\mathcal{F}_{k}\} where 𝚺k,m=σ2𝑾m𝑾m\boldsymbol{\Sigma}_{k,m}=\sigma^{2}\boldsymbol{W}_{m}^{*}{}\boldsymbol{W}_{m}. Note that 𝒈k=𝑾𝒉k\boldsymbol{{g}}_{k}=\boldsymbol{W}\boldsymbol{{h}}_{k}. As a consequence:

SINRk\displaystyle\mathrm{SINR}_{k} =𝒉k𝑾(ikK𝑾𝒉i𝒉i𝑾pi+σ2𝑾𝑾)1𝑾𝒉k\displaystyle=\boldsymbol{{h}}_{k}^{*}\boldsymbol{W}\bigg{(}\sum\limits_{i\neq k}^{K}\boldsymbol{W}^{*}\boldsymbol{{h}}_{i}\boldsymbol{{h}}_{i}^{*}\boldsymbol{W}p_{i}+\sigma^{2}\boldsymbol{W}^{*}\boldsymbol{W}\bigg{)}^{-1}\boldsymbol{W}\boldsymbol{{h}}_{k} (90)
=𝒉k𝑾(𝑾(ikK𝒉i𝒉ipi+σ2𝑰)𝑾)1𝑾𝒉k.\displaystyle=\boldsymbol{{h}}_{k}^{*}\boldsymbol{W}\bigg{(}\boldsymbol{W}^{*}\Big{(}\sum\limits_{i\neq k}^{K}\boldsymbol{{h}}_{i}\boldsymbol{{h}}_{i}^{*}p_{i}+\sigma^{2}\boldsymbol{I}\Big{)}\boldsymbol{W}\bigg{)}^{-1}\boldsymbol{W}\boldsymbol{{h}}_{k}. (91)

Consider the generic case of rank(𝑾m)=rmL\mathrm{rank}(\boldsymbol{W}_{m})=r_{m}\leq L. It can be easily shown that if rm<L\exists\mspace{4.0mu}r_{m}<L, the inverse (𝑾(ikK𝒉i𝒉ipi+σ2𝑰)𝑾)1\bigg{(}\boldsymbol{W}^{*}\Big{(}\sum\limits_{i\neq k}^{K}\boldsymbol{{h}}_{i}\boldsymbol{{h}}_{i}^{*}p_{i}+\sigma^{2}\boldsymbol{I}\Big{)}\boldsymbol{W}\bigg{)}^{-1} does not exist. As a consequence, each 𝑾m\boldsymbol{W}_{m} must be full rank. We then take the compact SVD 𝑾=𝑼𝑸NM×r\boldsymbol{W}=\boldsymbol{U}\boldsymbol{Q}\in\mathbb{C}^{NM\times r}, where r=mrmr=\sum_{m}r_{m} and both 𝑼\boldsymbol{U} and 𝑸\boldsymbol{Q} are block diagonal. More particularly, 𝑼=diag{𝑼mform=1,,M}\boldsymbol{U}=\mathrm{diag}\{\boldsymbol{U}_{m}\mspace{4.0mu}\mathrm{for}\mspace{4.0mu}m=1,\dots,M\} with each 𝑼mN×rm\boldsymbol{U}_{m}\in\mathbb{C}^{N\times r_{m}} and 𝑼m𝑼m=𝑰\boldsymbol{U}_{m}^{*}\boldsymbol{U}_{m}=\boldsymbol{I}. Similarly, 𝑸=diag{𝑸mform=1,,M}\boldsymbol{Q}=\mathrm{diag}\{\boldsymbol{Q}_{m}\mspace{4.0mu}\mathrm{for}\mspace{4.0mu}m=1,\dots,M\} with each 𝑸mrm×rm\boldsymbol{Q}_{m}\in\mathbb{C}^{r_{m}\times r_{m}}. Then it follows that

SINRk=𝒉k𝑼𝑸(𝑸𝑼(ikK𝒉i𝒉ipi+σ2𝑰)𝑼𝑸)1𝑸𝑼𝒉k=𝒉k𝑼(𝑼(ikK𝒉i𝒉ipi+σ2𝑰)𝑼)1𝑼𝒉k\displaystyle\begin{gathered}\mathrm{SINR}_{k}=\\ \boldsymbol{{h}}_{k}^{*}\boldsymbol{U}\boldsymbol{Q}\bigg{(}\boldsymbol{Q}^{*}\boldsymbol{U}^{*}\Big{(}\sum\limits_{i\neq k}^{K}\boldsymbol{{h}}_{i}\boldsymbol{{h}}_{i}^{*}p_{i}+\sigma^{2}\boldsymbol{I}\Big{)}\boldsymbol{U}\boldsymbol{Q}\bigg{)}^{-1}\boldsymbol{Q}^{*}\boldsymbol{U}^{*}\boldsymbol{{h}}_{k}\\ =\boldsymbol{{h}}_{k}^{*}\boldsymbol{U}\bigg{(}\boldsymbol{U}^{*}\Big{(}\sum\limits_{i\neq k}^{K}\boldsymbol{{h}}_{i}\boldsymbol{{h}}_{i}^{*}p_{i}+\sigma^{2}\boldsymbol{I}\Big{)}\boldsymbol{U}\bigg{)}^{-1}\boldsymbol{U}^{*}\boldsymbol{{h}}_{k}\end{gathered} (95)

As a consequence, the UL SINR after MMSE reception under perfect CSI does not depend on 𝑸\boldsymbol{Q}; hence, any non-singular 𝑸\boldsymbol{Q} maximizes SINRk\mathrm{SINR}_{k}.
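The invariance established in Prop. 1 is easy to check numerically. The snippet below (helper name ours, real-valued channels for simplicity) evaluates the UL MMSE SINR of (89) with effective channels 𝒈=𝑾𝒉\boldsymbol{g}=\boldsymbol{W}^{*}\boldsymbol{h} for two combiners sharing the same column space 𝑼\boldsymbol{U} but different nonsingular mixing matrices 𝑸\boldsymbol{Q}; the resulting SINRs coincide.

```python
import numpy as np

def ul_mmse_sinr(h_k, H_int, W, p, sigma2):
    """UL SINR of (89) with g = W* h, for a hybrid combiner W (perfect CSI)."""
    g_k = W.conj().T @ h_k
    G_i = W.conj().T @ H_int                 # interferers' effective channels
    C = G_i @ np.diag(p) @ G_i.conj().T + sigma2 * W.conj().T @ W
    return (g_k.conj() @ np.linalg.solve(C, g_k)).real

rng = np.random.default_rng(1)
N, L, K = 8, 4, 3
U, _ = np.linalg.qr(rng.standard_normal((N, L)))  # orthonormal columns
Q1 = rng.standard_normal((L, L))                  # two nonsingular mixes
Q2 = rng.standard_normal((L, L))
h = rng.standard_normal((N, K))
p = np.ones(K - 1)
s1 = ul_mmse_sinr(h[:, 0], h[:, 1:], U @ Q1, p, sigma2=0.1)
s2 = ul_mmse_sinr(h[:, 0], h[:, 1:], U @ Q2, p, sigma2=0.1)
# s1 and s2 agree: the SINR depends on range(W) only, not on Q
```

This mirrors the algebra above: the 𝑸\boldsymbol{Q} factors cancel inside the quadratic form, leaving only the projection onto the column space of 𝑼\boldsymbol{U}.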

Appendix F Proof of Prop. 2

Under perfect CSI, the DL-SINR under RZF precoding is

\displaystyle\mathrm{SINR}_{k}=\frac{|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}|^{2}p_{k}}{\sum\limits_{i\neq k}^{K}|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}|^{2}p_{i}+\sigma^{2}} (96)

According to [13], ikK|𝒈k𝒗i|2pi+σ2ikK|𝒈i𝒗k|2pi+σ2\sum\limits_{i\neq k}^{K}|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{i}|^{2}p_{i}+\sigma^{2}\approx\sum\limits_{i\neq k}^{K}|\boldsymbol{g}_{i}^{*}\boldsymbol{v}_{k}|^{2}p_{i}+\sigma^{2}. As a consequence, SINRk\mathrm{SINR}_{k} can be approximately rewritten as

\displaystyle\mathrm{SINR}_{k}\approx\frac{|\boldsymbol{g}_{k}^{*}\boldsymbol{v}_{k}|^{2}p_{k}}{\sum\limits_{i\neq k}^{K}|\boldsymbol{g}_{i}^{*}\boldsymbol{v}_{k}|^{2}p_{i}+\sigma^{2}}. (97)

Again, for simplicity, we assume 𝑴(s)=𝟏\boldsymbol{{M}}^{(s)}=\boldsymbol{1}. Using RZF precoding,

\displaystyle\boldsymbol{V}=\big{[}\boldsymbol{{G}}\boldsymbol{{G}}^{*}+\rho\boldsymbol{I}\big{]}^{-1}\boldsymbol{{G}}\boldsymbol{\Lambda}, (98)

where 𝚲=diag(λ1,,λK)\boldsymbol{\Lambda}=\mathrm{diag}(\lambda_{1},\dots,\lambda_{K}) is such that 𝒗k2=1||\boldsymbol{v}_{k}||^{2}=1. Therefore,

λk=1𝑾[𝑮𝑮+ρ𝑰]1𝒈k2.\displaystyle\lambda_{k}=\frac{1}{\sqrt{||\boldsymbol{W}\big{[}\boldsymbol{{G}}\boldsymbol{{G}}^{*}+\rho\boldsymbol{I}\big{]}^{-1}\boldsymbol{g}_{k}||^{2}}}. (99)

Substituting the previous expression in Eq. (97) we obtain

\displaystyle\mathrm{SINR}_{k}\approx\frac{|\boldsymbol{g}_{k}^{*}\big{[}\boldsymbol{{G}}\boldsymbol{{G}}^{*}+\rho\boldsymbol{I}\big{]}^{-1}\boldsymbol{g}_{k}|^{2}\lambda_{k}^{2}p_{k}}{\sum\limits_{i\neq k}^{K}|\boldsymbol{g}_{i}^{*}\big{[}\boldsymbol{{G}}\boldsymbol{{G}}^{*}+\rho\boldsymbol{I}\big{]}^{-1}\boldsymbol{g}_{k}|^{2}\lambda_{k}^{2}p_{i}+\sigma^{2}} (100)
\displaystyle=\frac{|\boldsymbol{g}_{k}^{*}\big{[}\boldsymbol{{G}}\boldsymbol{{G}}^{*}+\rho\boldsymbol{I}\big{]}^{-1}\boldsymbol{g}_{k}|^{2}p_{k}}{\sum\limits_{i\neq k}^{K}|\boldsymbol{g}_{i}^{*}\big{[}\boldsymbol{{G}}\boldsymbol{{G}}^{*}+\rho\boldsymbol{I}\big{]}^{-1}\boldsymbol{g}_{k}|^{2}p_{i}+\frac{\sigma^{2}}{\lambda_{k}^{2}}} (101)
=𝒉k𝛀𝒉k𝒉k𝛀𝒉kpk𝒉k𝛀(𝑯𝑷𝑯𝒉k𝒉kpk+σ2𝑰)𝛀𝒉k\displaystyle=\frac{\boldsymbol{h}_{k}^{*}\boldsymbol{\Omega}\boldsymbol{h}_{k}\boldsymbol{h}_{k}^{*}\boldsymbol{\Omega}\boldsymbol{h}_{k}p_{k}}{\boldsymbol{h}_{k}^{*}\boldsymbol{\Omega}\big{(}\boldsymbol{H}\boldsymbol{P}\boldsymbol{H}^{*}-\boldsymbol{h}_{k}\boldsymbol{h}_{k}^{*}p_{k}+\sigma^{2}\boldsymbol{I}\big{)}\boldsymbol{\Omega}\boldsymbol{h}_{k}} (102)

where, in the last step, we define $\boldsymbol{\Omega}=\boldsymbol{W}\big[\boldsymbol{G}\boldsymbol{G}^{*}+\rho\boldsymbol{I}\big]^{-1}\boldsymbol{W}^{*}$. Now, by the compact SVD, $\boldsymbol{W}=\boldsymbol{U}\boldsymbol{Q}\in\mathbb{C}^{NM\times r}$, where $r=\sum_{m}r_{m}$ and both $\boldsymbol{U}$ and $\boldsymbol{Q}$ are block diagonal. Let $\mathbf{H}=\boldsymbol{U}^{*}\boldsymbol{H}$ and $\mathbf{h}_{k}=\boldsymbol{U}^{*}\boldsymbol{h}_{k}$. Then $\boldsymbol{h}_{k}^{*}\boldsymbol{\Omega}$ can be written as

\boldsymbol{h}_{k}^{*}\boldsymbol{\Omega} = \boldsymbol{h}_{k}^{*}\boldsymbol{W}\big[\boldsymbol{W}^{*}\boldsymbol{H}\boldsymbol{H}^{*}\boldsymbol{W}+\rho\boldsymbol{I}\big]^{-1}\boldsymbol{W}^{*} \qquad (103)

= \mathbf{h}_{k}^{*}\big[\mathbf{H}\mathbf{H}^{*}+\rho(\boldsymbol{Q}\boldsymbol{Q}^{*})^{-1}\big]^{-1}\boldsymbol{U}^{*}. \qquad (104)
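The step from (103) to (104) rests on the matrix identity $\boldsymbol{Q}\big[\boldsymbol{Q}^{*}\boldsymbol{A}\boldsymbol{Q}+\rho\boldsymbol{I}\big]^{-1}\boldsymbol{Q}^{*}=\big[\boldsymbol{A}+\rho(\boldsymbol{Q}\boldsymbol{Q}^{*})^{-1}\big]^{-1}$ for invertible $\boldsymbol{Q}$. A quick numerical sanity check on a random instance (sizes of our own choosing):

```python
import numpy as np

rng = np.random.default_rng(1)
r, rho = 5, 0.3

# A plays the role of HH^*: random Hermitian positive semidefinite
X = rng.standard_normal((r, r)) + 1j * rng.standard_normal((r, r))
A = X @ X.conj().T
# generic invertible Q
Q = rng.standard_normal((r, r)) + 1j * rng.standard_normal((r, r))

lhs = Q @ np.linalg.inv(Q.conj().T @ A @ Q + rho * np.eye(r)) @ Q.conj().T
rhs = np.linalg.inv(A + rho * np.linalg.inv(Q @ Q.conj().T))
```

Algebraically, factoring $\boldsymbol{Q}^{*}\boldsymbol{A}\boldsymbol{Q}+\rho\boldsymbol{I}=\boldsymbol{Q}^{*}\big(\boldsymbol{A}+\rho\boldsymbol{Q}^{-*}\boldsymbol{Q}^{-1}\big)\boldsymbol{Q}$ and inverting gives the identity directly.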

We define $\boldsymbol{B}=\big[\mathbf{H}\mathbf{H}^{*}+\rho(\boldsymbol{Q}\boldsymbol{Q}^{*})^{-1}\big]^{-1}$. Operating on (102), we obtain

\mathrm{SINR}_{k} \approx \frac{\mathbf{h}_{k}^{*}\boldsymbol{B}\mathbf{h}_{k}\,\mathbf{h}_{k}^{*}\boldsymbol{B}\mathbf{h}_{k}\,p_{k}}{\mathbf{h}_{k}^{*}\boldsymbol{B}\big(\mathbf{H}\boldsymbol{P}\mathbf{H}^{*}+\sigma^{2}\boldsymbol{I}\big)\boldsymbol{B}\mathbf{h}_{k}-\mathbf{h}_{k}^{*}\boldsymbol{B}\mathbf{h}_{k}\,\mathbf{h}_{k}^{*}\boldsymbol{B}\mathbf{h}_{k}\,p_{k}} \qquad (105)

= \frac{\mathrm{K}_{k}}{1-\mathrm{K}_{k}}, \qquad (106)

where $0\leq\mathrm{K}_{k}\leq 1$ with $\mathrm{K}_{k}$ defined as

\mathrm{K}_{k} = \frac{\mathbf{h}_{k}^{*}\boldsymbol{B}\mathbf{h}_{k}\,\mathbf{h}_{k}^{*}\boldsymbol{B}\mathbf{h}_{k}\,p_{k}}{\mathbf{h}_{k}^{*}\boldsymbol{B}\big(\mathbf{H}\boldsymbol{P}\mathbf{H}^{*}+\sigma^{2}\boldsymbol{I}\big)\boldsymbol{B}\mathbf{h}_{k}} \qquad (107)

= \frac{\boldsymbol{b}_{k}^{*}\mathbf{h}_{k}\mathbf{h}_{k}^{*}\boldsymbol{b}_{k}\,p_{k}}{\boldsymbol{b}_{k}^{*}\big(\mathbf{H}\boldsymbol{P}\mathbf{H}^{*}+\sigma^{2}\boldsymbol{I}\big)\boldsymbol{b}_{k}}, \qquad (108)

with $\boldsymbol{b}_{k}=\boldsymbol{B}\mathbf{h}_{k}$. Note that (106) is increasing in $\mathrm{K}_{k}$; thus, maximizing $\mathrm{K}_{k}$ is equivalent to maximizing the SINR. Since $\mathrm{K}_{k}$ is a generalized Rayleigh quotient, the optimal $\boldsymbol{b}_{k}$ maximizing $\mathrm{K}_{k}$ is the eigenvector associated with the maximum eigenvalue of $\big(\mathbf{H}\boldsymbol{P}\mathbf{H}^{*}+\sigma^{2}\boldsymbol{I}\big)^{-1}\mathbf{h}_{k}\mathbf{h}_{k}^{*}$. Given that this matrix is rank-one, it has a single eigenvector with nonzero eigenvalue. As a consequence,

\boldsymbol{b}_{k}^{(\mathrm{max})} = \big(\mathbf{H}\boldsymbol{P}\mathbf{H}^{*}+\sigma^{2}\boldsymbol{I}\big)^{-1}\mathbf{h}_{k}. \qquad (109)
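The Rayleigh-quotient argument is easy to verify numerically: for a random positive definite $\boldsymbol{A}$ (standing in for $\mathbf{H}\boldsymbol{P}\mathbf{H}^{*}+\sigma^{2}\boldsymbol{I}$) and a random $\mathbf{h}_{k}$, the candidate $\boldsymbol{b}=\boldsymbol{A}^{-1}\mathbf{h}_{k}$ attains a quotient value no smaller than any random competitor, and the maximum equals $p_{k}\,\mathbf{h}_{k}^{*}\boldsymbol{A}^{-1}\mathbf{h}_{k}$. Sizes and names below are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
n, pk = 6, 1.7

X = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = X @ X.conj().T + np.eye(n)          # stands in for H P H^* + sigma^2 I (positive definite)
h = rng.standard_normal(n) + 1j * rng.standard_normal(n)

def quotient(b):
    """K(b) = p_k |b^* h|^2 / (b^* A b), a generalized Rayleigh quotient."""
    return float(pk * abs(b.conj() @ h) ** 2 / np.real(b.conj() @ A @ b))

b_star = np.linalg.solve(A, h)          # claimed maximizer b = A^{-1} h
k_max = quotient(b_star)                # should equal p_k h^* A^{-1} h
ok = all(quotient(rng.standard_normal(n) + 1j * rng.standard_normal(n)) <= k_max + 1e-9
         for _ in range(200))
```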

By definition, $\boldsymbol{b}_{k}=\boldsymbol{B}\mathbf{h}_{k}=\big[\mathbf{H}\mathbf{H}^{*}+\rho(\boldsymbol{Q}\boldsymbol{Q}^{*})^{-1}\big]^{-1}\mathbf{h}_{k}$. As a consequence, for $\boldsymbol{b}_{k}=\boldsymbol{b}_{k}^{(\mathrm{max})}$ to hold, the matrix $\boldsymbol{Q}$ must satisfy $\boldsymbol{Q}\boldsymbol{Q}^{*}=\boldsymbol{I}$, i.e., $\boldsymbol{Q}$ must be semi-unitary.
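The chain of this proof can be checked end to end on a random instance. The sketch below (our own sizes and seed) verifies that (i) the DL SINR of the normalized RZF precoder from (98)-(99) equals the $\mathrm{K}_{k}/(1-\mathrm{K}_{k})$ form with $\boldsymbol{b}_{k}=\boldsymbol{B}\mathbf{h}_{k}$ when $\boldsymbol{Q}\boldsymbol{Q}^{*}=\boldsymbol{I}$, and (ii) under the additional assumptions of equal powers $\boldsymbol{P}=p\boldsymbol{I}$ and the common ridge choice $\rho=\sigma^{2}/p$ (assumptions of ours, made so that collinearity is exact), $\boldsymbol{b}_{k}$ is collinear with $\boldsymbol{b}_{k}^{(\mathrm{max})}$.

```python
import numpy as np

rng = np.random.default_rng(3)
NM, r, K = 10, 5, 3
sigma2, pwr = 0.4, 1.5                   # noise power and common UE power p
rho = sigma2 / pwr                       # RZF ridge; rho = sigma^2/p is our assumption

Hc = (rng.standard_normal((NM, K)) + 1j * rng.standard_normal((NM, K))) / np.sqrt(2)
U, _ = np.linalg.qr(rng.standard_normal((NM, r)) + 1j * rng.standard_normal((NM, r)))
Q, _ = np.linalg.qr(rng.standard_normal((r, r)) + 1j * rng.standard_normal((r, r)))
W = U @ Q                                # analog beamformer with Q Q^* = I

G = W.conj().T @ Hc                      # effective channels g_i = W^* h_i
Heff = U.conj().T @ Hc                   # bold-H = U^* H
k = 0

# direct SINR of (97) with the normalized RZF precoder v_k
c = W @ np.linalg.solve(G @ G.conj().T + rho * np.eye(r), G[:, k])
v = c / np.linalg.norm(c)                # ||v_k|| = 1
num = pwr * abs(Hc[:, k].conj() @ v) ** 2
den = sum(pwr * abs(Hc[:, i].conj() @ v) ** 2 for i in range(K) if i != k) + sigma2
sinr_direct = num / den

# the same SINR via K_k/(1 - K_k), with b_k = B h_k and Q Q^* = I
B = np.linalg.inv(Heff @ Heff.conj().T + rho * np.eye(r))
b = B @ Heff[:, k]
Kk = (pwr * abs(b.conj() @ Heff[:, k]) ** 2
      / np.real(b.conj() @ (pwr * Heff @ Heff.conj().T + sigma2 * np.eye(r)) @ b))
sinr_K = Kk / (1 - Kk)

# with P = p I and rho = sigma^2/p, b_k is collinear with b_k^(max)
b_max = np.linalg.solve(pwr * Heff @ Heff.conj().T + sigma2 * np.eye(r), Heff[:, k])
cos = abs(b.conj() @ b_max) / (np.linalg.norm(b) * np.linalg.norm(b_max))
```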

Appendix G Proof of Prop. 3

Let us assume a generic, nonsingular $\boldsymbol{A}_{m}$. Then $\boldsymbol{A}_{m}=\boldsymbol{U}_{1}\boldsymbol{D}_{1}\boldsymbol{V}_{1}^{*}$, with $\boldsymbol{U}_{1}$ and $\boldsymbol{V}_{1}$ unitary. After applying $\boldsymbol{A}_{m}$, the output of the analog beamformer is

\boldsymbol{\hat{W}}_{m}\boldsymbol{A}_{m} = \boldsymbol{\hat{W}}_{m}\boldsymbol{U}_{1}\boldsymbol{D}_{1}\boldsymbol{V}_{1}^{*}. \qquad (110)

Now, let us apply the compensation matrix $\boldsymbol{F}_{m}$. Recall that the compensation matrix is designed to compensate for the matrix preceding it, as shown in Eq. (36). In this case, for a generic $\boldsymbol{A}_{m}$, the compensation matrix of $\boldsymbol{\hat{W}}_{m}\boldsymbol{A}_{m}$ is $\boldsymbol{F}_{m}=\boldsymbol{V}_{1}\boldsymbol{D}_{1}^{-1}\boldsymbol{V}_{1}^{*}$, following Eq. (36). Then, the product of the three matrices is

\boldsymbol{\hat{W}}_{m}\boldsymbol{A}_{m}\boldsymbol{F}_{m} = \boldsymbol{\hat{W}}_{m}\boldsymbol{U}_{1}\boldsymbol{D}_{1}\boldsymbol{V}_{1}^{*}\boldsymbol{V}_{1}\boldsymbol{D}_{1}^{-1}\boldsymbol{V}_{1}^{*} = \boldsymbol{\hat{W}}_{m}\boldsymbol{U}_{1}\boldsymbol{V}_{1}^{*}. \qquad (111)

Note that since both $\boldsymbol{U}_{1}$ and $\boldsymbol{V}_{1}$ are unitary, the optimality of the solution is preserved. As a consequence, $\boldsymbol{\hat{W}}_{m}\boldsymbol{A}_{m}\boldsymbol{F}_{m}$ is also an unconstrained combiner, like the initial $\boldsymbol{\hat{W}}_{m}$, and the output power is unchanged.
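A short numerical check of this proof (dimensions and names are illustrative): for a semi-unitary $\boldsymbol{\hat{W}}_{m}$ and a generic nonsingular $\boldsymbol{A}_{m}$, the product $\boldsymbol{\hat{W}}_{m}\boldsymbol{A}_{m}\boldsymbol{F}_{m}$ indeed reduces to $\boldsymbol{\hat{W}}_{m}\boldsymbol{U}_{1}\boldsymbol{V}_{1}^{*}$, which remains semi-unitary, so the output power is preserved.

```python
import numpy as np

rng = np.random.default_rng(4)
N, L = 8, 4

# unconstrained (semi-unitary) combiner W_hat and a generic nonsingular A
W_hat, _ = np.linalg.qr(rng.standard_normal((N, L)) + 1j * rng.standard_normal((N, L)))
A = rng.standard_normal((L, L)) + 1j * rng.standard_normal((L, L))

U1, s, V1h = np.linalg.svd(A)            # A = U1 diag(s) V1^*
V1 = V1h.conj().T
F = V1 @ np.diag(1.0 / s) @ V1.conj().T  # compensation matrix F = V1 D1^{-1} V1^*

combined = W_hat @ A @ F
target = W_hat @ U1 @ V1h                # W_hat U1 V1^*
gram = combined.conj().T @ combined      # should be I: semi-unitarity preserved
```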

Appendix H Proof of Prop. 4

For simplicity, let us assume $\boldsymbol{M}^{(s)}=\boldsymbol{1}$. Under perfect CSI and maximum ratio combining (MRC), i.e., $\boldsymbol{v}_{k}=\boldsymbol{g}_{k}$ for the unconstrained solution of $\boldsymbol{W}$, the SINR becomes

\mathrm{SINR}_{k} = \frac{|\boldsymbol{g}_{k}^{*}\boldsymbol{g}_{k}|^{2}p_{k}}{\sum_{i\neq k}^{K}|\boldsymbol{g}_{k}^{*}\boldsymbol{g}_{i}|^{2}p_{i}+\sigma^{2}\boldsymbol{g}_{k}^{*}\boldsymbol{g}_{k}}. \qquad (112)

For $M\to\infty$ at a faster pace than $K$, $\boldsymbol{g}_{k}^{*}\boldsymbol{g}_{i}\to 0$ almost surely. As a consequence, the asymptotic SINR achieved by UE $k$ is

\overline{\mathrm{SINR}}_{k} = \frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\mathrm{tr}\Big[\boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m}\Big]. \qquad (113)
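The trace in (113) is the average effective channel gain at AP $m$: for $\boldsymbol{h}_{m,k}\sim\mathcal{CN}(\boldsymbol{0},\boldsymbol{R}_{m,k})$, one has $\mathbb{E}\,||\boldsymbol{W}_{m}^{*}\boldsymbol{h}_{m,k}||^{2}=\mathrm{tr}\big[\boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m}\big]$. A Monte Carlo sketch of this identity (sizes, seed, and sample count are our own):

```python
import numpy as np

rng = np.random.default_rng(5)
N, L, n_samp = 6, 3, 200_000

X = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
R = X @ X.conj().T / N                   # covariance R_{m,k}
W, _ = np.linalg.qr(rng.standard_normal((N, L)) + 1j * rng.standard_normal((N, L)))

# sample h = R^{1/2} z with z ~ CN(0, I)
Rhalf = np.linalg.cholesky(R + 1e-12 * np.eye(N))
Z = (rng.standard_normal((N, n_samp)) + 1j * rng.standard_normal((N, n_samp))) / np.sqrt(2)
Hs = Rhalf @ Z

gain_mc = np.mean(np.sum(np.abs(W.conj().T @ Hs) ** 2, axis=0))  # empirical E||W^* h||^2
gain_tr = float(np.real(np.trace(W.conj().T @ R @ W)))           # tr[W^* R W]
```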

Let $\{\lambda_{m,k}^{(1)},\dots,\lambda_{m,k}^{(N)}\}$ be the eigenvalues of $\boldsymbol{R}_{m,k}$, sorted in descending order. Recall that $\boldsymbol{W}_{m}$ is semi-unitary. Therefore, we can construct a unitary $\boldsymbol{W}_{m}^{(u)}=[\boldsymbol{W}_{m}\;\boldsymbol{W}_{m,0}]$ such that $\boldsymbol{W}_{m,0}^{*}\boldsymbol{W}_{m,0}=\boldsymbol{I}$ and $\boldsymbol{W}_{m}^{*}\boldsymbol{W}_{m,0}=\boldsymbol{0}$. Since $\boldsymbol{W}_{m}^{(u)}$ is unitary, $\boldsymbol{W}_{m}^{(u)*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m}^{(u)}$ has the same eigenvalues as $\boldsymbol{R}_{m,k}$ and can be written as

\boldsymbol{W}_{m}^{(u)*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m}^{(u)} = \left[\begin{array}{cc}\boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m} & \boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m,0}\\ \boldsymbol{W}_{m,0}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m} & \boldsymbol{W}_{m,0}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m,0}\end{array}\right]. \qquad (114)

Denote the eigenvalues of $\boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m}$ by $\mu_{m,k}^{(1)}\geq\mu_{m,k}^{(2)}\geq\dots\geq\mu_{m,k}^{(L)}$. For a fully digital receiver, the asymptotic SINR is

\overline{\mathrm{SINR}}_{k}^{\mathrm{FD}} = \frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\mathrm{tr}\Big[\boldsymbol{R}_{m,k}\Big]. \qquad (118)

By Cauchy's interlacing theorem, the eigenvalues of the leading principal submatrix $\boldsymbol{W}_{m}^{*}\boldsymbol{R}_{m,k}\boldsymbol{W}_{m}$ satisfy

\lambda_{m,k}^{(i)} \geq \mu_{m,k}^{(i)} \geq \lambda_{m,k}^{(N-L+i)} \quad \mathrm{for}\; i=1,\dots,L. \qquad (119)
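The interlacing inequalities in (119) can be verified numerically for a random covariance and a random semi-unitary $\boldsymbol{W}_{m}$ (illustrative sizes and seed):

```python
import numpy as np

rng = np.random.default_rng(6)
N, L = 8, 3

X = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
R = X @ X.conj().T                        # Hermitian PSD, stands in for R_{m,k}
W, _ = np.linalg.qr(rng.standard_normal((N, L)) + 1j * rng.standard_normal((N, L)))

lam = np.sort(np.linalg.eigvalsh(R))[::-1]                 # lambda^{(1)} >= ... >= lambda^{(N)}
mu = np.sort(np.linalg.eigvalsh(W.conj().T @ R @ W))[::-1]  # mu^{(1)} >= ... >= mu^{(L)}

# lambda^{(i)} >= mu^{(i)} >= lambda^{(N-L+i)}, i = 1, ..., L (0-based indexing below)
upper_ok = all(lam[i] >= mu[i] - 1e-9 for i in range(L))
lower_ok = all(mu[i] >= lam[N - L + i] - 1e-9 for i in range(L))
```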

As a consequence, two bounds can be derived. The gap between the hybrid and fully digital SINRs is smallest when $\mu_{m,k}^{(i)}=\lambda_{m,k}^{(i)}$, which yields the lower bound $\delta_{\mathrm{LB}}$:

\delta = \overline{\mathrm{SINR}}_{k}^{\mathrm{FD}}-\overline{\mathrm{SINR}}_{k} \qquad (120)

= \frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\bigg(\sum_{n=1}^{N}\lambda_{m,k}^{(n)}-\sum_{n=1}^{L}\mu_{m,k}^{(n)}\bigg) \qquad (121)

\geq \frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\sum_{n=L+1}^{N}\lambda_{m,k}^{(n)} \qquad (122)

= \delta_{\mathrm{LB}}. \qquad (123)

Conversely, the gap is largest when $\mu_{m,k}^{(i)}=\lambda_{m,k}^{(N-L+i)}$. As a consequence, an upper bound on the gap between hybrid and fully digital can be derived:

\delta = \overline{\mathrm{SINR}}_{k}^{\mathrm{FD}}-\overline{\mathrm{SINR}}_{k} \qquad (124)

= \frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\bigg(\sum_{n=1}^{N}\lambda_{m,k}^{(n)}-\sum_{n=1}^{L}\mu_{m,k}^{(n)}\bigg) \qquad (125)

\leq \frac{p_{k}}{\sigma^{2}}\sum_{m=1}^{M}\bigg(\sum_{n=1}^{L}\big(\lambda_{m,k}^{(n)}-\lambda_{m,k}^{(N-L+n)}\big)+\sum_{n=L+1}^{N}\lambda_{m,k}^{(n)}\bigg) \qquad (126)

= \delta_{\mathrm{UB}}. \qquad (127)
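Both bounds can be checked on the same random instance (single AP, $M=1$, with $p_{k}=\sigma^{2}=1$; sizes and seed are our own): the actual gap $\delta$ always falls between $\delta_{\mathrm{LB}}$ and $\delta_{\mathrm{UB}}$ computed from the eigenvalues of $\boldsymbol{R}_{m,k}$.

```python
import numpy as np

rng = np.random.default_rng(7)
N, L = 8, 3
pk, sigma2 = 1.0, 1.0        # p_k / sigma^2 = 1 for simplicity

X = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
R = X @ X.conj().T                        # covariance R_{m,k}
W, _ = np.linalg.qr(rng.standard_normal((N, L)) + 1j * rng.standard_normal((N, L)))

lam = np.sort(np.linalg.eigvalsh(R))[::-1]  # eigenvalues in descending order
c = pk / sigma2

# actual gap between fully digital (118) and hybrid (113) asymptotic SINRs
delta = c * float(np.real(np.trace(R) - np.trace(W.conj().T @ R @ W)))
delta_lb = c * lam[L:].sum()                               # (122)-(123)
delta_ub = c * ((lam[:L] - lam[N - L:]).sum() + lam[L:].sum())  # (126)-(127)
```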