Covariance-Based Cooperative Activity Detection for Massive Grant-Free Random Access
Abstract
This paper designs a cooperative activity detection framework for massive grant-free random access in the sixth-generation (6G) cell-free wireless networks based on the covariance of the received signals at the access points (APs). In particular, multiple APs cooperatively detect the device activity by only exchanging the low-dimensional intermediate local information with their neighbors. The cooperative activity detection problem is non-smooth and the unknown variables are coupled with each other for which conventional approaches are inapplicable. Therefore, this paper proposes a covariance-based algorithm by exploiting the sparsity-promoting and similarity-promoting terms of the device state vectors among neighboring APs. An approximate splitting approach is proposed based on the proximal gradient method for solving the formulated problem. Simulation results show that the proposed algorithm is efficient for large-scale activity detection problems while requires shorter pilot sequences compared with the state-of-art algorithms in achieving the same system performance.
Index Terms:
Cooperative activity detection, massive access, 6G cell-free wireless networks, covariance-based detection.I Introduction
The massive machine-type communications (mMTC), which is a typical application scenario for 6G wireless networks, aims to meet the demand for massive connectivity for hundreds of billions of Internet-of-Things (IoT) devices. For the massive access, conventional grant-based random access schemes lead to an exceedingly long access latency and a prohibitive signaling overhead. To this end, grant-free random access schemes have been considered as a promising candidate technique for realizing 6G cellular IoT [1], where active devices transmit their data signals without obtaining a grant from the base station (BS) after sending pre-assigned pilot sequences. Hence, the key to grant-free random access is active device detection at the BS based on the received pilot sequences [2].
Inspired by the sporadic characteristics of IoT applications, several compressed sensing-based approaches have been proposed to detect active devices for grant-free random access systems. For instance, in [3] and [4], the approximate message propagation (AMP) algorithms were designed for activity detection in different scenarios by exploiting the statistics of wireless channels. However, the AMP algorithms require high computational complexity. As a result, the authors in [5] proposed a low-complexity dimension reduction-based algorithm, which projects the original device state matrix to a low-dimensional space by exploiting its sparse and low-rank structure. Note that the above approaches in [3]-[5] perform activity detection based on the instantaneous received signals. Recently, a covariance-based algorithm has been proposed to improve the performance of device activity detection in [6], where the detection problem was solved by a coordinate descent algorithm with random sampling. In general, these algorithms, e.g. [3]-[6], exploiting the sparsity structure of the device state matrix, which enjoy reasonable detection performance. However, due to a large number of devices and the limited radio resources in G networks for massive access, the active device detection has been emerging as a challenging problem.
To overcome this challenge, multi-cell massive access with multiple APs were applied to the problem of active device detection. For example, the multi-cell sparse activity detection was proposed in [7], where each AP operates independently to perform activity detection and channel estimation for the devices distributed in its own cell by treating the inter-cell interference as noise. In fact, if the APs can jointly process the pilot sequences received from the devices in neighboring APs, the detection performance can be further improved even with only short pilot sequences. Motivated by this fact, this paper considers a 6G cell-free wireless network, where multiple APs deployed in a vast area to serve all devices located in this area [8, 9]. In particular, cooperative activity detection among the APs requires extra information exchanges in the system. To reduce the amount of associated signaling overhead, this paper designs a scalable computationally efficient algorithm to detect the active devices, which is reliable and robust to AP and/or backhaul link failure and the variation in channel statistics. The main contributions of this paper are as follows:
-
1.
The paper proposes a novel cooperative activity detection framework for grant-free random access in 6G cell-free wireless networks based on the covariance of the received signals.
-
2.
This paper proposes a cooperative massive detection (CMD) algorithm by exploiting the special characteristic of the device state vectors of interest among the neighboring APs, namely joint similarity and sparsity.
-
3.
This paper analyzes the computational complexity and the communication cost of the proposed CMD algorithm which shows its effectiveness in 6G cell-free wireless networks.
II System Model
Consider a 6G cell-free wireless network comprising APs. The APs are equipped with antennas each, serving uniformly distributed single-antenna IoT devices in a vast area. Each AP is connected to several adjacent APs via backhaul links and can only communicate with its one-hop neighbors for reducing the communication load, as shown in Fig. 1. Due to the burst characteristic of IoT applications, only a fraction of IoT devices are active at any given time slot. Let denote the cardinality of a set. We use to denote the set of active devices with being the number of active devices. For convenience, we define as the binary activity indicator with if the th device is active, and otherwise. Moreover, we represent the -dimensional channel vector from the th device to the th AP as , where is the large-scale fading component depending on the devices location, and is the small-scale fading following independent and identically distributed (i.i.d.) complex Gaussian distribution with zero mean and unit variance.

The grant-free random access protocol is adopted in this paper [10]. Specifically, at the beginning of each time slot, the active devices transmit their pilot sequences over the uplink channels simultaneously and then the APs perform the activity detection based on the received signals in a cooperative manner. All pilot sequences, , are generated from i.i.d. complex Gaussian distribution with zero mean and unit variance which are known at the APs in advance. Thus, the received signal at the th AP can be expressed as
(1) |
where denotes the small-scale fading channel matrix, denotes the horizontal stack of all pilot sequences, and is the additive white Gaussian noise (AWGN) marix with i.i.d. entries , where denotes the noise power at each antenna. In this paper, we adopt and to denote conjugate transpose and transpose, respectively. Define as the diagonal entries of , representing the device state vector of the th AP with . The APs detect the active devices by estimating the term . Especially, since the term can be determined by the covariance of the received signal, we aim to design a covariance-based cooperative activity detection algorithm via a limited cooperation among multiple APs.
III Cooperative Massive Detection Algorithm
In this section, we first propose a cooperative detection framework for 6G cell-free wireless networks with a massive number of IoT devices. Then, we design a corresponding cooperative detection algorithm.
III-A Cooperative Massive Detection Framework
For the problem in model (1), the unknown device state vectors for different APs are different. Moreover, there are some common characteristics among the neighboring APs. To enhance the detection performance, we first associate a local estimator with each AP. Incorporating the estimates of the neighboring APs, i.e., sparsity-promoting and the similarity-promoting terms [5, 11], we can modify the local estimator to associate a regularized local cost function with each AP.
Firstly, we design the local estimator. It is well known that the covariance-based massive activity detection is equivalent to recovering the device state vector from the noisy measures with the knowledge of the pre-defined pilot sequence matrix . In general, the estimation of the device state vector can be formulated as a maximum likelihood estimation problem [6]. In particular, for a given , each column of , denoted as , , can be termed as an independent sample having the following multivariate complex Gaussian distribution:
(2) |
where the covariance matrix is calculated by and denotes the identity matrix. For convenience, we define . Then, the likelihood of given can be represented as
(3) |
where and are operators that return the determinant and the trace of a matrix, respectively. By exploiting the Gaussianity, we can obtain the Maximum Likelihood (ML) estimator of at the th AP as follows:
(4) |
where denotes the sample covariance matrix of the received signal of the th AP averaged over different antennas. Based on (4), the maximum likelihood estimation problem can be formulated as .
Secondly, since the activity detection is a typical sparse signal processing problem, we propose a sparsity-promoting term to facilitate cooperative detection. The specific sparsity pattern can be simultaneously observed at different APs, namely the indices of nonzero entries of are consistent for . Because each AP only communicates with its neighbor APs, it cannot obtain the global information about the sparsity pattern. Moreover, it is quite challenging to split this global quantity into several local quantities consisting of components only from the neighboring nodes. In this case, for the th AP, we define a local parameter matrix consisting of the parameter vectors of all its neighbors, which can be directly obtained as follows:
(5) |
where is the index set of neighbors of the th AP except itself, denotes the index set of the neighbors of the AP including itself, and and denote the cardinality of the set and , respectively. Consequently, we aim to impose sparsity constraints on the row vectors of matrix to exploit the joint sparsity. To this end, this paper designs a novel sparsity-promoting term, which is given by
(6) |
where is the penalty parameter, is the th row of , and denotes the norm of a matrix. Herein, is the logarithmic smooth function which can promote row sparsity [5], where the nonzero rows are penalized by minimizing . In this way, a common sparsity profile across the columns of the local parameter matrix is promoted. Although the sparsity-promoting term is imposed on the local parameter matrix , the cooperative nature promotes a common sparsity profile across all columns of the global device state vectors .
Thirdly, we design a similarity-promoting term to improve the detection performance. The supports of the global device state vector for all APs are the same, but the amplitudes of the nonzero entries at the APs are different from each other due to the effects of different path loss. In particular, the device state vectors of neighboring APs have a large number of similar entries and only a relatively small number of distinct entries. Motivated by these observations, we design a similarity-promoting function as follows
(7) |
where are linear weights satisfying the conditions: . is a convex penalty function, minimized at , which encourages similarity between and . Note that the log-likelihood depends on the empirical covariance . In high-dimensional settings, where the length of pilot sequences is larger than the number of AP antennas , will be relatively different from the covariance matrix . By enforcing structural similarity, each can exploit from the fact that neighboring AP estimates should be similar to each other.
The specific expressions in the penalty function form can be set different. In general, it dependents on the assumptions imposed on the problem, one may choose the most appropriate penalty for the data at hand. For example, -norm penalty , where and denote the th element of the vector and the absolute value, respectively. This penalty function encourages the changes of limited number of values between neighbor APs, while the rest of the structure remains the same. In other words, it borrows information aggressively across neighbors, encouraging not only similar structure but also similar values. As a result, this penalty is suitable for massive access where only a small fraction of potential devices to change their states at a time slot and is adopted in this paper.
After defining the similarity-promoting term and sparsity-promoting term, accumulating them into (4) leads to the following novel regularized local cost function at the th AP:
(8) |
where and are the penalty parameters used to enforce sparsity and similarity, respectively. In the following, we design a massive activity detection algorithm to minimize the local cost function at each AP.
III-B A Decentralized Approximate Separating Strategy
Note that the first term of (8), i.e., is differentiable and geodesically convex [12]. However, as stated in the above subsection, is discontinuous, i.e., the third term of the local cost function could be a sum of non-smooth functions, and the second term is also potentially non-differentiable. In addition, the unknown variables for neighboring APs are coupled with each other. These obstacles make the problem intractable to solve and existing algorithms are not applicable to such a problem. In the following, we design a decentralized approximate separating strategy for minimizing the cost function in (8) based on the forward-backward splitting strategy [13], which can handle the non-smooth problem and is especially amenable to solve the high-dimensional activity detection problem due to its fast convergence rate and its conceptual and mathematical simplicity.
Before proceeding, we recall the forward-backward splitting approach for minimizing (8), which is given by the iteration
(9) |
where is the gradient of function , is the step size for the th AP, and denotes the value of in the th iteration. The gradient descent step is the forward step and the proximal step is the backward step [13]. Note that the proximal operator of a function is a mapping function given by: with variables and , and a step-size [14].
Unfortunately, it is prohibitively challenging to directly evaluate the proximal operators with respect to similarity-promoting function and the sum of . Moreover, the calculation of over all the number of neighborhood, , in each iteration is expensive. Motivated by Douglas Rachford splitting in [13], where two of the proximal operators can be updated alternately, this paper aims to handle the proximal operator of function and separately. Specifically, we first calculate an estimator of the subgradient and then incorporate the gradient descent step into the proximal step with respect to sparsity-promoting term for the th AP, which is given by
(10) |
where is a intermediate variable. Then, according to the update rule of Douglas Rachford splitting, we incorporate into proximal operator with respect to similarity-promoting function :
(11) |
The intermediate variable and device state vector iterate alternately and their values approach to each other. When converging to optimality, their values are identical. Afterwards, in order to overcome the difficulty in processing the non-smooth finite sum term and to reduce the computational overhead, this paper proposes a splitting strategy that uses the proximal operator of a single function in each iteration to approximate the proximal operator of the average of non-smooth functions . In mathematical terms, we first choose randomly from the set with probabilities . Then, utilizing the proximal operator, a specific step is introduced as follows
(12) |
where is the estimator of subgradient for the randomly selected th neighbor of the th AP in the th iteration. Let denotes the combiner at th iteration. In the sequel, can be set to , which is a stochastic approximation of controlled by the combiner and the probability of being selected. In this way, we are able to treat the difficult term in (8) with non-smooth finite sum term for any size of cardinality .
Since and converge to the same value, (12) is an accurate approximation of (9) if and hold. Thus, we must ensure that is close to to obtain an accurate estimator. According to the definition of proximal operator, equation (12) satisfies
(13) |
Hence, we can arrive at the following subgradient estimator such that (12) holds:
(14) |
where the right hand side (RHS) of (14) is obtained by replacing the proximal step in (13) by and further reorganizing the left hand side (LHS) of formula (13). Consequently, the subgradient estimator in (10) can be updated as
(15) |
which exploits the fact that and as stated in (12), only a single selected is updated in each iteration.
III-C Derivation of Proximal Operators and Combiners
Since the proximal operator needs to be calculated at each iteration in (10) and (12), it is important to derive closed form expressions for evaluating them exactly. According to the well-known Sherman-Morrison rank-1 update identity [15], we obtain
with , where is the th element of . Applying the well-known determinant identity yields
(17) |
Then, substituting (III-C) and (17) into (4) and taking the derivative of with respect to , we have
(18) |
Correspondingly, the gradient can be derived by computing the following derivative: , where denotes a column vector. By substituting it into (10) and calculating the closed form expression of the proximal operator of , we can obtain the following intermediate recursion
(19) |
with , where is the th element of .
Now, we turn to derive the recursion of in (12). Since in (12) is fully separable, its proximal operator can be evaluated component-wise:
(23) |
where the minimization operator is to preserve the positivity of . Here, and are the th entry of vectors and , respectively. For (15) and (23), the estimation performance depends, to a great extent, on the cooperation strategy specified by the combiner . This paper adopts the following adaptive combiner
(27) |
where is a large constant set beforehand. Note that the term in (27) accounts for the distance between the local estimates of the th AP and its th neighbor. The combiner is inversely proportional to such a distance. When the distance defined above between two APs is large, the th AP tends to decrease the combination weight, or even discard the information from this neighbor. Conversely, the th AP will increase the combination weight when the distance of estimation between two APs is small. For clarity, the pseudo-code of the CMD algorithm is summarized in Algorithm 1.
Once an estimate is obtained, we employ the element-wise thresholding at each AP to determine from which is the -th entry of . In specific, if for a pre-specified threshold , and otherwise [16]. Herein, is the AP closest to device . Note that there exists a particular fixed point of the problem for (10) and (12). Based on the information across the 6G wireless network, it can be shown that the iterates converge to this particular fixed point under certain step-size conditions.
III-D Computational Complexity and Communication Cost
In what follows, the computational complexity and communication cost of the proposed CMD algorithm is analyzed. In each iteration, for an arbitrary AP, the computational complexity mainly arises from the matrix multiplication, and the overall computational complexity of the CMD algorithm is . Although the computational complexity of sample covariance is , it only needs to be calculated once at each time slot before the iteration. For the communication cost, in each iteration, each AP needs to transmit -dimensional intermediate to its neighboring APs. Thus, for all APs, the CMD algorithm needs to exchange parameters. Since the APs exchange intermediate variables instead of the received signal matrix , the communication cost does not grow as the number of AP antennas increases.
Remark 1: It is interesting to emphasize that the computational complexity and the communication cost at each iteration of the CMD algorithm do not grow as the number of each AP antennas, , increases. Note that the CMD algorithm can also be adopted for data detection in unsourced random access [17], where an arbitrary AP only needs to know which messages are sent without identifying which message belongs to which device. Suppose that each active device has bits to send, then the total number of potential devices in the device activity detection problem is replaced by , where the small size is obtained by divided -bit message into blocks. The details please refer to our journal paper. Thus, the computational cost of CMD algorithm reduces to . This implies that the communication cost of the proposed CMD algorithm does not grow by increasing the total number of potential devices, which is an appealing feature for reality IoT networks.
IV Numerical Results
In this section, we present numerical simulations to validate the effectiveness of the proposed CMD algorithm. We simulate the underdetermined 6G cell-free wireless network comprising APs geographically distributed in a vast area to serve potential devices. The AP-to-AP distance is m and the number of connections between APs can be set differently. The positive constants is selected to be . The penalty parameters and are set as and , respectively. The step size and is the same for all APs, , and .
As a performance measure, we adopt the activity error rate (AER). The AER is a sum of the missed detection probability, defined as the probability that a device is active but is declared to be inactive, and the false-alarm probability, defined as the probability that a device is inactive but the detector declares it to be active. As a reference, we compare the proposed CMD algorithm with two baseline schemes: the conventional ML-based multi-cell algorithm [6] and the AMP-based multi-cell algorithm, where each AP only serves its cell’s devices without multi-cell cooperation and treats the inter-cell interference as noise [7].
Fig. 2 depicts the detection performance versus different choices of the number of APs for cooperation. Initially, in the area with a few numbers of cooperation APs, the AER decreases sharply as the number of cooperation APs increases. When the number of cooperation APs continues to increase, the performance improvement diminishes. In addition, it is observed that such a performance saturation point value depends on AP antennas and pilot length , i.e., increasing or helps decrease the saturation point value, which indicates that the AP becomes more capable of detecting the activity with a low communication cost. The reason for this phenomenon is that for an arbitrary AP, more connections result in more intermediate estimates exchange in the proposed CMD algorithm, leading to good detection performance. However, the channel strengths from a specific active device to the far away APs are approximate zero, and the intermediate estimates exchange with the remote AP can not further improve the massive detection performance significantly. The results also indicate that only a small number of APs are required for cooperation which strikes a tradeoff between the detection performance and the communication cost.

In the rest of the simulations, the number of APs for cooperation is set to . Fig. 4 demonstrates the activity detection performance versus different numbers of AP antennas with potential devices , active devices , pilot length , and SNR is set as 10 dB. It is seen that the proposed CMD algorithm provides much lower AER than that of the baseline ones and the performance gap is enlarged as the number of AP antennas increases. In other words, the superiority of the proposed CMD algorithm is evident in massive MIMO systems, which is a key technique for 6G wireless networks. Such an advantage of cooperative strategies mainly comes from that the proposed algorithm exploits the joint sparsity and similarities of the multiple APs, and the closed-form expressions for the proximal operators are derived to achieve higher efficiency. In contrast, ML-based and AMP-based multi-cell approaches ignore such prior information and only perform activity detection for the devices distributed in its own cell, where the inter-cell interference is also a severely limiting factor for reliable activity detection.


Fig. 4 shows the detection performance versus the length of the pilot sequence with potential devices , active devices , antennas at the AP, and SNR is set as 10 dB. From this figure, we observe that the activity detection performance of all the considered algorithms increases as the pilot length increases and the CMD algorithm achieves a substantial performance gain over the ML-based multi-cell algorithm and the AMP-based multi-cell algorithm. Note that the CMD algorithm does not require the knowledge of channel strengths which only needs to estimate a smaller number of unknown parameters, thus, it is more efficient for activity detection than that of the AMP-based multi-cell detection approach. We can also see that the performance gap between the proposed algorithm and the baseline ones is large, especially when the pilot sequence is short.
V Conclusion
This paper designed a grant-free cooperative random access framework for mMTC in 6G cell-free wireless networks based on the covariance of the received signals. By exploiting the special characteristic of the device state vectors of interest, we developed a covariance-based high-accuracy and low-complexity algorithm. Simulation results showed that the proposed algorithm can almost achieve near-optimal activity detection performance.
References
- [1] X. Chen, D. W. K. Ng, W. Yu, E. G. Larsson, N. Al-Dhahir, and R. Schober, “Massive access for 5G and beyond,” IEEE J. Sel. Areas Commun., vol. PP, no. 99, pp. 1-1, Aug. 2020.
- [2] V. W. S. Wong, R. Schober, D. W. K. Ng, and L.-C. Wang, Key Technologies for 5G Wireless Systems. Cambridge, U.K.: Cambridge Univ. Press, 2017.
- [3] X. Shao, X. Chen, C. Zhong, J. Zhao, and Z. Zhang, “A unified design of massive access for cellular Internet of Things,” IEEE Internet of Things J., vol. 6. no. 2, pp. 3934-3947, Apr. 2019.
- [4] L. Liu and W. Yu, “Massive connectivity with massive MIMO-Part I: Device activity detection and channel estimation,” IEEE Trans. Signal Process., vol. 66, no. 11, pp. 2933-2946, Jun. 2018.
- [5] X. Shao, X. Chen, and R. Jia, “A dimension reduction-based joint activity detection and channel estimation algorithm for massive access,” IEEE Trans. Signal Process., vol. 68, pp. 420-435, 2020.
- [6] S. Haghighatshoar, P. Jung, and G. Caire, “A new scaling law for activity detection in massive MIMO systems,” [Online]: arXiv:1803.02288, Mar. 2018.
- [7] Z. Chen, F. Sohrabi, and W. Yu, “Multi-cell sparse activity detection for massive random access: Massive MIMO versus cooperative MIMO,” IEEE Trans. Wireless Commun., vol. 18, no. 8, pp. 1558-2248, Aug. 2019.
- [8] M. Ke, Z. Gao, Y. Wu, X. Gao and K. Wong, “Massive access in cell-free massive MIMO-based Internet of Things: cloud computing and edge computing paradigms,” in IEEE J. Sel. Areas Commun., vol. PP, no. 99, pp. 1-1, Aug. 2020.
- [9] J. Zhang, E. Björnson, M. Matthaiou, D. W. K. Ng, H. Yang and D. J. Love, “Prospective multiple antenna technologies for beyond 5G,” IEEE J. Sel. Areas Commun., vol. 38, no. 8, pp. 1637-1660, Aug. 2020.
- [10] Y. Qiang, X. Shao, and X. Chen, “A model-driven deep learning algorithm for joint activity detection and channel estimation,” IEEE Commun. Lett., vol. PP, no. 99, pp. 1-1, 2020.
- [11] R. Nassif, C. Richard, A. Ferrari, and A. H. Sayed, “Multitask diffusion LMS with sparsity-based regularization,” IEEE Intern. Conf. on Acous. Speech and Signal Process. (ICASSP), Brisbane, QLD, pp. 3516-3520, 2015.
- [12] A. Wiesel, “Geodesic convexity and covariance estimation,” IEEE Trans. Signal Process., vol. 60, no. 12, pp. 6182, 2012.
- [13] N. Parikh and S. Boyd, “Proximal algorithms,” Found. Trends Optim., vol. 1, no. 3, pp. 123-231, 2013.
- [14] A. Defazio, “A simple practical accelerated method for finite sums,” Advances in Neural Information Processing Systems (NIPS), 2016.
- [15] J. Sherman and W. J. Morrison, “Adjustment of an inverse matrix corresponding to a change in one element of a given matrix,” The Annals of Mathematical Statistics, vol. 21, no. 1, pp. 124-127, 1950.
- [16] X. Shao, X. Chen, C. Zhong and Z. Zhang, “Joint activity detection and channel estimation for mmW/THz wideband massive access,” IEEE Intern. Conf. Commun. (ICC), Dublin, Ireland, Jun. 2020, pp. 1-6.
- [17] A. Fengler, G. Caire, P. Jung, and S. Haghighatshoar, “Massive MIMO unsourced random access,” [Online]: http://arxiv.org/abs/1901.00828, Jan. 2019.