Smart Hybrid Beamforming and Pilot Assignment for 6G Cell-Free Massive MIMO
Abstract
We investigate Cell-Free massive MIMO networks, where each access point (AP) is equipped with a hybrid analog-digital transceiver, reducing the complexity and cost compared to a fully digital transceiver. Asymptotic approximations for the spectral efficiency are derived for uplink and downlink. Capitalizing on these expressions, a max-min problem is formulated enabling us to optimize the (i) analog beamformer at the APs and (ii) pilot assignment. Simulations show that the optimization of these variables substantially increases the minimum user throughput
Index Terms:
Cell-Free, MIMO, MMSE, RZF, hybrid beamforming, large-scale, optimization, SINRI Introduction
A prospective candidate considered for beyond-5G wireless networks is the cell-free massive MIMO (CF-mMIMO) topology, where every user (UE) potentially connects to every access point (AP), and takes the principles of cell cooperation to the limit; see [1, 2, 3, 4, 5] and the references therein.
In parallel, forthcoming technologies will be operating at higher frequencies (i.e. mmWave or THz bands), and therefore the transceivers complexity experiences a key trade-off: data rate vs power consumption. Additionally, CF networks will cover larger areas compared to cellular systems, and therefore the severity of the path loss requires the APs to be equipped with large arrays to compensate the attenuation, demanding even more power if fully digital structures are used.
A possible solution that has attracted a lot of attention is a hybrid transceiver [6, 7], composed by two stages: (a) the analog part, in which the antennas are connected to a few RF chains by means of phase shifters, and (b) the digital part. While the former stage dramatically reduces the AP complexity and power consumption, the performance decreases as well. Consequently, properly designing the analog beamformer might be a mean to reduce the performance gap with respect to fully digital transceivers. To the best of our knowledge, there are two main works dealing with the construction of the analog beamformer as a function of slow fading channel parameters [8, 9], which is also investigated in this paper and shown to outperform the previous references.
Once the analog part is designed, we investigate the uplink and downlink of two digital benchmarks: (i) minimum mean squared error (MMSE) reception and (ii) regularized zero forcing (RZF) precoding. Asymptotic approximations on the signal-to-interference-and-noise-ratio (SINR) are derived based on [10], and shown to be tight for finite-dimension systems under the previous decoding/precoding. For a given hybrid structure, and capitalizing on the asymptotic approximations, another relevant problem is studied in this paper: pilot assignment, for which a greedy algorithm based on the asymptotic expressions is provided.
Finally, we derive two novel bounds on the gap between hybrid and fully digital structures. It is shown that such bounds only depend on the channel matrix eigenvalues.
II System Model
Consider a CF massive MIMO system composed by APs, each equipped with antennas and RF chains serving single antenna users (UEs). We assume each AP is connected to a central processing unit (CPU) through high capacity fronthaul links. Denote by the channel between AP and UE . Then
(1) |
with being the spatial correlation matrix. Each AP performs hybrid beamforming with the aim of reducing the number of RF chains at the transceivers, and therefore their cost and complexity. Particularly, each AP contains an analog matrix such that , emulating phase shifters and whose entries will be designed later. As a consequence, the effective channel between AP and UE is represented by
(2) |
Hence, with .
II-A Channel Estimation Process
A portion of the total number of resource units, the latter denoted by , is used for channel estimation. During channel uses, UE is assigned a pilot with and the pilot matrix is denoted by . Upon pilot transmission at a certain power , the observations at the th AP are
(3) |
with for being the noise power. Standard MMSE estimation leads to the next estimates [11]
(4) |
with
(5) |
for . It can be verified that with denoting the error, uncorrelated with the estimate. More concretely, with defined by
(6) | ||||
(7) |
and the channel error following with .
II-B Scalable Cell-Free
Although CF networks allow users to establish connectivity to multiple APs, scalability must be taken into account. Therefore only a subset of APs jointly serve a particular user. Hence, we define by the subset of APs involved in the decoding of the th UE and by the subset of UEs treated as signal by AP . Thus, the binary matrix whose entries are
(8) |
accounts for scalability. Provided that each AP observes an -dimensional signal after the hybrid beamforming stage, the expanded version of is with an -dimensional vector of ones. The complementary matrix accounts for the disregarded UEs per AP.
II-C Uplink & Downlink Data Transmission
After data transmission, the signal collected by the APs is with
(9) |
with denoting the Hadamard product, being the effective channel matrix whose entries are . Vector for given UE transmit powers and symbols, denoted by and , respectively. Finally, and where .
In the downlink, the APs jointly precode the users data. More particularly, the precoder intended for UE is denoted by and after data transmission, the signal collected at UE is
(10) |
where .
III Spectral Efficiency Analysis
III-A Uplink MMSE Reception
Provided that for UE only APs are relevant, taking the rows of associated to produces the following reduced signal model
(11) |
where matrices in (11) are the reduced version of the original matrices which contain the rows related to and all columns. Moreover, with being a block diagonal matrix where the diagonal terms are
(12) |
In the uplink, the combiner maximizing the SINR is the MMSE, achieving a maximum value of
(13) |
where and are the th and th columns of , respectively, and a similar definition applies to . As a consequence, after accounting for the pilot overhead , the ergodic spectral efficiency that the th UE can achieve is
(14) |
III-B Downlink RZF Precoding
Various precoding strategies can be used to encode the users data. However, RZF provides an outstanding performance as studied in the literature. More particularly, the subset RZF precoding, denoted by , follows
(15) | ||||
(16) |
with being the regularitzation parameter and . Different formulations can be used for , such as to ensure (i) or (ii) . In our case, since perfect CSI is not available, we use the former formulation. Once User receives , as defined in Eq. (10), the following spectral efficiency can be achieved:
(17) |
with
(18) |
IV Asymptotic Analysis
To evaluate the previous SINR expressions, we consider the asymptotic regime, with finite and investigate the convergence of the spectral efficiency expressions to deterministic limits. Provided that the subsets account for the non-zero entries in the random matrices, it is required that they grow with the network as well, i.e., . The premises for this convergence need the involved matrices to satisfy two technical conditions: (a) the inverse of the resolvent matrix in (13) and (15) to exist, ensured by and , respectively, and that (b) has uniformly bounded spectral norm, for being the element of (8). Under these conditions, the following approximations can be made.
Theorem 1.
For and UL MMSE combining, with given in (19).
(19) |
where
(20) |
The coefficients are obtained iteratively, , given and the recursion in (21).
(21) |
Proof.
The proof can be found in App. C. ∎
Theorem 2.
(26) |
The coefficients are obtained iteratively with , given and the recursion in (27)
(27) |
Moreover, matrix
(28) |
and coefficients are calculated as
(29) |
with and defined as
(30) |
and
(31) |
Proof.
The proof can be found in App. D. ∎
From the continuous mapping theorem [12], the following holds: with the corresponding provided above for and .
V Spectral Efficiency Optimization
Note that the asymptotic SE approximations derived in the previous section only depend on large scale parameters. Therefore, we can formulate different asymptotic optimization problems. However, with the aim of increasing fairness in the network we focus on the following max-min problem:
(32) | ||||
s.t. |
where the optimization variables are two: (i) analog beamforing matrix and (ii) pilot matrix, studied separately.
V-A Analog Beamformer Design
The design of is challenging given the complexity of the SINR. Therefore, directly solving (32) poses a major challenge. However, under perfect CSI, some algebraic properties on can be extracted and therefore used for its design. Concretely, we first disregard the unit-modulus constraint and after SVD decomposition factorizes as with semi-unitary , i.e. .
Proposition 1.
Under perfect CSI UL-MMSE reception, any nonsingular provides maximum SINR.
Proof.
The proof can be found in App. E. ∎
According to [13], . Under the condition that the previous approximation is tight, the following proposition, which is similar to the result obtained in [9] for another metric, can be obtained.
Proposition 2.
Under perfect CSI DL-RZF precoding, the SINR is maximum when is semi-unitary: .
Proof.
The proof can be found in App. F. ∎
In order to full-fill both propositions, for UL and DL, can be set to and therefore meaning that the analog matrix should have orthogonal columns. The idea behind having orthogonal columns is that interference is reduced. To the best of our knowledge, there are two ways of smartly creating explained in [8] and [9], respectively. While the latter is based on perfectly known channels, the former fails to capture the complete spectrum of the channel covariance matrices. In this work, we propose a method that takes into account all possible eigenvectors/eigenvalues of all with the aim of maximizing the minimum average UE power signal, which is shown to maximize the minimum SINR in our simulations. More particularly, with having orthonormal column vectors and containing the eigenvalues of . Note that the average signal power for UE is given by . Therefore, such a expression is maximized whenever the columns of match the eigenvectors of . However, not all UEs and their respective eigenmodes can be captured by . A selection of out of should be made. As a consequence, we define the UE average signal power as
(33) |
where is a binary optimization variable scheduling the eigenvectors to the columns of . Therefore, the following optimization problem with respect to can be formulated:
(34) | ||||
s.t. | ||||
The reverse-delete algorithm is capable of efficiently solving (34) without the need of an exhaustive search. The surviving determine which eigenvectors of which users will compose the columns of . However, note that for does not necessarily have orthogonal columns given that, most likely, eigenvectors from multiple users will be used to construct the analog matrices. As a consequence, neither Prop. 1 nor Prop. 2 are satisfied. Thus, the final unconstrained analog beamformers are obtained by where is the projection of matrix into an orthonormal basis.
Still, is not only composed by phase shifters, i.e. the entries are not roots of unity. Therefore, for given , we aim at solving the following optimization problem:
(35) |
Although the optimal solution is obtained by taking the phase of the eigenvectors in , the orthogonality between columns achieved by would be broken. Therefore, we modify our receiver. We add an orthogonality compensation matrix into our digital processing [9]. More concretely, for a constrained analog beamformer , its SVD results in . The orthogonality compensation matrix, denoted by , is defined as
(36) |
Therefore, adding such a compensation matrix allows us to improve the design of the analog matrix exploiting the following proposition.
Proposition 3.
Assume that instead of using as the analog matrix, is the new analog beamformer with nonsingular. The product between provides the same optimality as and therefore is an optimal unconstrained analog matrix.
Proof.
The proof can be found in App. G ∎
Using the previous proposition, the initial unconstrained beamformer can be replaced by without a performance degradation as long as is nonsingular. As a consequence, we can formulate the following optimization problem:
(37) | ||||||
s.t. |
Thanks to the degrees of freedom added by , the constrained analog beamformer , can be made closer to the unconstrained one . By alternating minimization, we split the previous problem into two sub-problems: (i) find the optimal for fixed and (ii) find the optimal for fixed . The solution to the previous subproblems is
(38) |
(39) |
An iterative process based on the block coordinate descend method follows until convergence is reached [14]. Therefore, a constrained analog matrix will be obtained and thus from Eq. (36) we can create that goes into the baseband (or digital) part. As a consequence, the equivalent channel between AP and UE has an extra component:
(40) |
V-B Pilot Assignment Optimization
The optimal solution to (32) with respect to requires an exhaustive search over the set of possible pilot sequences. However, based on the correlation between effective channels: for , an initial pilot assignment can be made, denoted by . Particularly, a set of users is assigned the same pilot if their normalized cross-correlation, i.e. , is minimized. Afterwards, the greedy algorithm proposed in Alg. 1 combined with the asymptotic approximations can be used to iteratively update the UE pilot assignment in a max-min SINR sense. Additionally, by construction, Alg. 1 converges provided that the cost function (i) is non-decreasing and (ii) is upper bounded.
(41) |
VI Regime
Finally, we focus on the case where . For simplicity, assume and recall that a full digital structure is the one providing the best performance in terms of SE, attained when and . Then, the following can be derived.
Proposition 4.
Define the gap as the difference in SINR between full digital and hybrid. Then, there exist lower and upper bounds for the gap, denoted by and , given by
(42) |
(43) |
Proof.
The proof can be found in App. H ∎
Note that if the channel matrices are rank-deficient, i.e. , the gap can be as small as zero and therefore a hybrid structure would achieve the same performance as digital.
VII Simulation Results
For the purpose of performance evaluation, we consider a wrapped around universe. To generate the channel model, we assume that the APs are deployed in urban environments at around 10 m, matching with the 3GPP Urban Microcell model in [15, Table B.1.2.1-1] at an operating frequency of 2 GHz. The shadowing terms given an AP to different UEs present a certain correlation, given by the model in [15, Table B.1.2.2.1-4]. The number of total channel uses is . Unless otherwise specified, in order to take into account the effects of pilot contamination orthogonal pilots and UEs (i.e. reuse factor of two). Additionally, each AP has antennas. The UE transmit power is set to mW, dBm and . Moreover, to account for scalability, the entry of is 1 if for m, which ensures connectivity to multiple FBSs per GU for the Euclidean distance between AP and UE . Finally, to ensure enough iterations until convergence is reached.
The applicability of Theorems 1 and 2 to finite-dimensional systems is first verified in Figs. 1 and 2, where the approximations are denoted by RMT in the legend. For different network setups, corresponding to , , and , , , the approximations obtained in Th. 1 and 2 respectively are indeed accurate for and orthogonal pilots.


In Fig. 3, we compare the UL pilot assignment obtained by Alg. 1 (Greedy) and a random assignment (RA) for different values of and . For we assume a digital structure while for and the analog matrices are obtained as described in Section V-A. There is a visible improvement after running the greedy algorithm when the set of available pilots is composed by orthogonal pilots. Additionally, the improvement in terms of minimum SE is measured and is of about 60% and 90% for and , , respectively. Similar results are obtained in the DL.

Next, we analyze the performance of our hybrid beamforming method compared to the two existing techniques, called SVD [8] and SLNR [9]. We measure the 95% outage SE which is a key metric in wireless systems for both the UL and DL in Figs. 4 and 5. Clearly, our method outperforms both works in the two links, i.e. UL and DL, with gains in the range of 1-8% and 10-35% in the UL and DL, respectively.


VIII Conclusions
This paper has investigated the use of hybrid transceivers in CF MIMO setups. After deriving asymptotic approximations for both UL and DL, we focused on solving two problems: (i) analog beamformer and (ii) pilot assignment. The solution to the first one is shown to outperform state-of-the-art techniques while the greedy pilot assignment highly outperforms a RA. Finally, theoretical bounds for the gap between full digital and hybrid structures are presented, showing that such a gap is highly dependant on the eigenvalues of the channel correlation matrices.
References
- [1] H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-Free Massive MIMO Versus Small Cells,” IEEE Trans. Wireless Commun., vol. 16, pp. 1834–1850, Mar. 2017.
- [2] E. Björnson and L. Sanguinetti, “Making Cell-Free Massive MIMO Competitive With MMSE Processing and Centralized Implementation,” IEEE Trans. Wireless Commun., vol. 19, pp. 77–90, Jan. 2020.
- [3] E. Nayebi, A. Ashikhmin, T. L. Marzetta, H. Yang, and B. D. Rao, “Precoding and Power Optimization in Cell-Free Massive MIMO Systems,” IEEE Trans. on Wireless Commun., vol. 16, pp. 4445–4459, May 2017.
- [4] M. Bashar, K. Cumanan, A. G. Burr, M. Debbah, and H. Q. Ngo, “On the Uplink Max–Min SINR of Cell-Free Massive MIMO Systems,” IEEE Trans. on Wireless Commun., vol. 18, pp. 2021–2036, Jan. 2019.
- [5] M. Attarifar, A. Abbasfar, and A. Lozano, “Subset MMSE Receivers for Cell-Free Networks,” IEEE Trans. Wireless Commun., vol. 19, pp. 4183–4194, Jun. 2020.
- [6] X. Gao, L. Dai, S. Han, C.-L. I, and R. W. Heath, “Energy-Efficient Hybrid Analog and Digital Precoding for MmWave MIMO Systems With Large Antenna Arrays,” IEEE Journal on Sel. Areas in Commun., vol. 34, pp. 998–1009, Mar. 2016.
- [7] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave mimo systems,” IEEE Trans. on Wireless Commun., vol. 13, pp. 1499–1513, Jan. 2014.
- [8] G. Femenias and F. Riera-Palou, “Cell-Free Millimeter-Wave Massive MIMO Systems With Limited Fronthaul Capacity,” IEEE Access, vol. 7, pp. 44596–44612, Mar. 2019.
- [9] S. Park, J. Park, A. Yazdan, and R. W. Heath, “Exploiting Spatial Channel Covariance for Hybrid Precoding in Massive MIMO Systems,” IEEE Trans. on Sig. Proc., vol. 65, pp. 3818–3832, May 2017.
- [10] S. Wagner, R. Couillet, M. Debbah, and D. T. M. Slock, “Large System Analysis of Linear Precoding in Correlated MISO Broadcast Channels Under Limited Feedback,” IEEE Trans. on Inf. Th., vol. 58, pp. 4509–4537, Mar. 2012.
- [11] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall PTR, 1st ed., 1993.
- [12] H. B. Mann and A. Wald, “On Stochastic Limit and Order Relationships,” Annals of Mathematical Statistics, vol. 14, pp. 217–226, 1943.
- [13] P. Patcharamaneepakorn, S. Armour, and A. Doufexi, “On the Equivalence Between SLNR and MMSE Precoding Schemes with Single-Antenna Receivers,” IEEE Commun. Lett., vol. 16, no. 7, pp. 1034–1037, 2012.
- [14] Z. Luo and P. Tseng, “On the convergence of the coordinate descent method for convex differentiable minimization,” J. of Optimization Theory and Applications, vol. 72, pp. 7–35, Jan. 1992.
- [15] “Rel. 9: Evolved Universal Terrestrial Radio Access (E-UTRA); Further advancements for E-UTRA physical layer aspects,” Tech. Rep. 36.814, 3GPP, Dec. 2017.
Appendix A
Theorem 3.
([10, Theorem 1]) Let and be Hermitian nonnegative-definite while is a random matrix with zero-mean independent column vectors, , each with covariance matrix . Finally, and have uniformly bounded spectral norm w.r.t. . For and ,
where
(44) |
with coefficients for
(45) |
with initial values .
Appendix B
Appendix C Proof of Th. 1
Let us define matrices ,
(50) |
(51) |
and . Then, (13) can be written as
(52) | ||||
(53) |
For , , we have
(54) | ||||
(55) |
where (a) follows from [10, Lemmas 4 and 6] and (b) is obtained after applying Th. 3 by substituting , (ii) , and (iii) while is defined next
(56) |
The necessary coefficients can be calculated as with
(57) |
The fixed-point algorithm can be used to compute and has been proved to converge [10]. Finally, given that all the involved matrices in are block-diagonal, i.e. the expression in (19) is obtained where is defined in (20).
Appendix D Proof of Th. 2
From Eq. (18), we can derive an approximation for each of the terms in the numerator and denominator, respectively. In order not to overload the formulation, we will denote by the sparse version of the channel. We also define with . Denote by and the same as and after removing the contribution of UE (the same applies to where the contributions of UEs and are removed). We first calculate the value of , ensuring that .
(58) |
The term inside the squared root can be asymptotically approximated for large , as follows:
(59) | ||||
(60) | ||||
(61) | ||||
(62) |
where (a) is obtained using [10, Lemma 4] and that , (b) results from [10, Lemma 6] and applying Th. 2 and Th. 1 in the numerator and denominator, respectively, with , , , . Finally, (c) defines the values of and as they will be repeatedly used later. As a consequence, from the continous mapping theorem:
(63) |
For the numerator of (18), given by , we can compute an approximated deterministic equivalent for the term inside the expectation in a similar manner as for :
(64) | ||||
(65) | ||||
(66) | ||||
(67) |
where (a) follows from [10, Lemma 1] (b) is derived applying [10, Lemma 4] and the fact that . Finally, (c) is obtained by applying the definition of previously derived. From the continuous mapping theorem and substituting the value of provided in (63), the numerator therefore has an approximated value of
(68) | ||||
(69) |
For the interfering terms we can proceed similarly and obtain a deterministic approximation by considering the term inside the expectation as follows:
(70) | ||||
(71) | ||||
(72) | ||||
(73) | ||||
(74) |
where (a) follows from [10, Lemma 1], (b) substitutes , (c) applies the definition of in the denominator and (d) substitutes the value of previously derived.
To get a deterministic equivalent for the previous equation, we first know that:
(75) |
being a direct consequence of [10, Lemma 4]. After applying the matrix inversion lemma to to remove the dependency with respect to UE , we obtain that
(76) |
Substituting (76) in (75) yields the following:
(77) |
where each of the terms is provided below:
(78) | ||||
(79) |
where (a) combines both [10, Lemma 4] and Th. 2 with the following substitutions , , , . In addition,
(80) | ||||
(81) | ||||
(82) |
where (a) comes from the definition of , and (b) arises from applying [10, Lemma 6] and Th. 2 to the term with the same substitutions as for . Finally, the last term can be computed as
(83) | ||||
(84) | ||||
(85) |
where (a) is obtained from the definition of and (b) follows the same step as to calculate (b). Consequently, the interfering terms accept an assymptotic approximation as follows:
(86) |
with .
Finally, the term can be shown to approximately converge to zero in the asymptotic regime as follows:
(87) | ||||
(88) |
As a consequence, the result in Th. 2 is obtained.
Appendix E Proof of Prop. 1
From Eq. (13), under perfect CSI it can be shown that the SINR achieved by UE is:
(89) |
where, for simplicity we assume that though the same analysis and conclusion is valid for subsets of APs and UEs. Therefore, is a block diagonal matrix where . Note that . As a consequence:
(90) | ||||
(91) |
Consider the generic case of . It can be easily shown that if does not exist. As a consequence, each must be full rank. After doing the compact SVD on where and both and are block diagonal. More particularly, with each and . Similarly, with each . Then it follows that
(95) |
As a consequence, the UL SINR after MMSE reception under perfect CSI does not depend on . Therefore, any non-singular maximizes .
Appendix F Proof of Prop. 2
Under perfect CSI, the DL-SINR under RZF precoding is
(96) |
According to [13], the term . As a consequence, can be approximately rewritten as
(97) |
Again, and for simplicity, we assume . Using a RZF precoding
(98) |
where such that . Therefore
(99) |
Substituting the previous expression in Eq. (97) we obtain
(100) | ||||
(101) | ||||
(102) |
where, in the last step we define by . Now, by compact SVD where and both and are block diagonal. Let and . Then can be written as
(103) | ||||
(104) |
We define by . Operating on (19), we obtain that
(105) | ||||
(106) |
where with defined as
(107) | ||||
(108) |
with . Note that (105) is an increasing function with respect to . Thus, maximizing is equivalent to maximizing the SINR. Since follows a Rayleigh quotient, the optimal maximizing is the eigenvector associated to the maximum eigenvalue of . Given that the previous matrix is rank-ones, there is only one eigenvector. As a consequence:
(109) |
By definition, . As a consequence, to obtain that , matrix has to satisfy , i.e. being semi-unitary.
Appendix G Proof of Prop. 3
Let us assume a generic and nonsingular . Then, with and being unitary. After adding , the output of the analog beamformer is
(110) |
Now, let us add the compensation matrix . Recall that the compensation matrix tries to somehow compensate the matrix that is in front of it, as shown in Eq. (36). In this case, for a generic , the compensation matrix of is following Eq. (36). Then, the product of the three matrices is:
(111) |
Note that since both and are unitary, we are not modifying the optimality of the solution. As a consequence, is also an unconstrained combiner, as the initial one , that does not change the output power.
Appendix H Proof of Prop. 4
For simplicity, let us assume . Under perfect CSI and maximum ratio combining (MRC), i.e. for the unconstrained solution of , the SINR in (112) becomes
(112) |
For at a faster peace than , almost surely. As a consequence, the asymptotic SINR achieved by UE is
(113) |
Let be the eigenvalues of sorted in descending order. Recall that is semi-unitary. Therefore we can construct a unitary such that and . Provided that is unitary, has the same eigenvalues as and can be written as
(114) | ||||
(117) |
Denote the eigenvalues of by . For a fully digital receiver, the asymptotic SINR is
(118) |
By the Cauchy’s interlacing theorem, the eigenvalues of the leading principal submatrix satisfy
(119) |
As a consequence, two bounds can be derived. A lower bound for the gap between hybrid and full digital occurs when . As a consequence, such a gap, denoted by is:
(120) | ||||
(121) | ||||
(122) | ||||
(123) |
To the contrary, the gap is maximum when . As a consequence, an upper bound on the gap between hybrid and full digital can be derived
(124) | ||||
(125) | ||||
(126) | ||||
(127) |