A Physics-based and Data-driven Approach for Localized Statistical Channel Modeling
Abstract
Localized channel modeling is crucial for offline performance optimization of 5G cellular networks, but the existing channel models are for general scenarios and do not capture local geographical structures. In this paper, we propose a novel physics-based and data-driven localized statistical channel modeling (LSCM), which is capable of sensing the physical geographical structures of the targeted cellular environment. The proposed channel modeling solely relies on the reference signal receiving power (RSRP) of the user equipment, unlike the traditional methods which use full channel impulse response matrices. The key is to build the relationship between the RSRP and the channel’s angular power spectrum. Based on it, we formulate the task of channel modeling as a sparse recovery problem where the non-zero entries of the sparse vector indicate the channel paths’ powers and angles of departure. A computationally efficient weighted non-negative orthogonal matching pursuit (WNOMP) algorithm is devised for solving the formulated problem. Finally, experiments based on synthetic and real RSRP measurements are presented to examine the performance of the proposed method.
I Introduction
With the rapid development of the fifth generation (5G) mobile communications, cellular network optimization becomes the fundamental means to improve the quality of experience (QoE) of the users in wireless communication [1, 2, 3]. Since network optimization must be performed in the offline fashion, one of the key challenges is how to characterize the large-scale behaviors of the wireless propagation channels between the base station and the user equipment under the network coverage. Thus, a channel model that can capture the localized physical geographical structures of the targeted environment is paramount for the network optimization of the 5G systems.
Unfortunately, the existing channel models commonly used in the 5G systems are not applicable to cellular network optimization. For example, the geometry-based stochastic models (GBSM) have been intensively studied [4, 5, 6]. They are employed for some typical scenarios, such as indoor and outdoor, rural and urban scenarios, but lack the ability to describe a specific and localized environment. As for deterministic channel modeling methods like ray tracing [7], they can describe the localized environment, but the complexity of using Maxwell’s equations is unaffordable for cellular network optimization. Additionally, ray tracing is based on map information, which may not always be available [8].
In this paper, we propose a physics-based and data-driven localized statistical channel modeling (LSCM) method, which is capable of sensing the physical propagation environment for the target region. The proposed LSCM can efficiently provide the statistical information of channel angular power spectrum (APS), thus revealing the localized multi-path structure and topography. In contrast to the existing GBSM which relies on channel impulse response (CIR) measurements, our proposed LSCM considers the measurements of reference signal receiving power (RSRP) [9] from multiple transmit beams only. Compared with the high-dimensional CIR, the benefit of using RSRP is that it can save a lot of storage space during data collection, and enable computationally efficient approaches. However, how to extract the channel statistics from the RSRP data has never been studied before and is a challenging task.
To this end, we first build the relationship between the RSRP and the APS. We show later that extracting the APS can be achieved through solving a sparse recovery problem, where the non-zero entries of the sparse vector indicate the channel paths’ powers and angles of departure. While the classical compressive sensing (CS) theory suggests that orthogonal matching pursuit (OMP) is a good solver [10], its convergence is guaranteed for some well-conditioned matrices, which in general does not hold. On the contrary, the column magnitudes of the coefficient matrix in the formulated problem can vary dramatically, which will significantly deteriorate the channel modeling performance. To solve this intractable sparse recovery problem, we design the weighted non-negative OMP (WNOMP) algorithm. Simulation results based on synthetic data and real-world RSRP measurements demonstrate that the proposed channel modeling method is effective and the WNOMP outperforms the classical methods.

II MIMO Downlink System and RSRP
In this section, we consider the channel modeling of the massive multiple input multiple output (MIMO) downlink communication system in the target grid, followed by the RSRP measured from multiple beams.
II-A Angular discretization channel modeling
The technique of beamforming is exploited in the massive MIMO of 5G systems, yielding high-resolution multi-path channels in the angular domain. Thus, we consider the channel modeling of the massive MIMO downlink communication system with beamforming, where the signal is transmitted from a base station to the user equipment through several propagation paths, as shown in Fig. 1.
Similar to the 3GPP’s technical report [11], we focus on the tilt angle of departure (AoD), azimuth AoD and channel gain from the base station to the user equipment with one receive antenna, ignoring the arrival angle and delay. Suppose the uniform rectangular array of the base station contains antennas. The CIR of antenna from the base station to the user equipment is given by
(1) |
![[Uncaptioned image]](https://cdn.awesomepapers.org/papers/c6456654-8a05-4da5-bd95-8eb983a14dda/table11.png)
The definition of notation in (II-A) is summarized in Table I. The tilt AoD and azimuth AoD over the free space are discretized into angles and angles, respectively. If there does not exist a path in the angles , the corresponding channel gain is zero, otherwise for the path in the channel. It is readily known that the channel gain is sparse, and contains multiple propagation paths in the angular domain.
The channel gain consists of path loss and the shadowing effect. Path loss is determined by the physical environment (distance, carrier frequency, buildings) which is assumed to be relatively static, while shadowing is caused by the obstacles and is usually modeled as the log-normal distribution. Thus, we assume follows the log-normal distribution, with its mean representing path loss and its covariance representing the shadowing effect. is the random phase error between different angles caused by the reflection, diffraction and scattering effect of electromagnetic waves, while is the random phase error of different antennas caused by the imperfect hardware of the antenna array. Inspired by [11], we assume follows the uniform distribution between and , and follows the Gaussian distribution with zero mean and covariance .





II-B RSRP measurements
The channel measurement of our proposed LSCM is the RSRP measured from multiple beams. In the downlink of 5G cellular systems, the RSRP can be measured from synchronization signals block (SSB) beams or channel state information-reference signal (CSI-RS) beams. Denote the precoding matrix of the beam as , whose size is the same as that of the antenna array. The entry of is . Denote the CIR matrix from the base station to the user equipment as , where the entry of is in (II-A). The RSRP of the beam at time is defined as
(2) |
where denotes the transmit power. The quality of the channel can be represented through , and a larger value of reflects better channel quality.
Figure 2 shows an example of channel measurement of RSRP measured from five beams. These five beam patterns are generated according to different beamforming weights, and the directions of main lobe of these five beams are different. The receiver at the target grid is highlighted as a blue point. The target grid can receive five RSRP measurements, corresponding to the received power of five beams. To be specific, the strongest value of RSRP measurements is -59.61 dB in Fig. 2(d), because its main lobe’s direction is , which is close to the angle of departure of the propagation path .
III Physics-based and Data-driven LSCM
In this section, the relation between the first-order statistics of the RSRP and channel gain is established, which is the key of the proposed LSCM. Then, the WNOMP is designed for extracting the channel statistics of the APS from the RSRP.
III-A The first-order statistics of the RSRP and channel gain
As for , it is a random variable inferred from (2), since contains three independent random variables, i.e., , and . Theorem 1 shows the relationship between the first-order statistics of and , i.e., their expectation and , implying the statistical relationship of the RSRP and angular information.
Theorem 1.
(The relationship between the first-order statistics of RSRP and channel gain) Suppose the expectation of is , and the expectation of channel gain is , then we have
(3) |
where
(4) | ||||
(5) |
Proof.
Please refer to Appendix A. ∎
The statistical relationship between the expectation of RSRP measurements and the first-order statistics of channel gain is connected by the coefficient , as revealed by Theorem 1. According to this finding, it is possible to extract the angular information from RSRP measurements if we can calculate from . The APS can be expressed by , where the subscripts and indicate the tilt angle and azimuth angle , respectively.
Based on (3), we reshape with the subscripts as a vector, i.e.,
(6) |
where the length of is . If the path does not exist, then . As there are only several propagation paths from the base station to the user equipment, the channel vector is sparse.
Suppose there are measured beams in total, and define as the vector of the expectation of RSRP measurements from different beams, i.e.,
(7) |
Define the vector as
(8) |
and the coefficient matrix is
(9) |
Then we can recast (3) as
(10) |
III-B The designed WNOMP algorithm
To construct the localized statistical channel model, we want to recover the sparse channel vector from the expectation of RSRP measurements and the coefficient matrix . According to (10), as the expectation of channel gain in terms of different angles , is non-negative, we can formulate the optimization problem as
(11) | ||||
s.t. | ||||
where is the maximum number of non-zero entries, representing the maximum number of propagation paths.
The OMP is an iterative algorithm, which selects one column into the set of indexes of non-zero entries at each iteration. The selection criteria is to select the column that is best correlated with the residual of current iteration. Problem (11) can be solved by the non-negative OMP (NNOMP), as shown in Algorithm 1. The biggest difference between NNOMP and OMP lies in Step 5 of Algorithm 1, where the subproblem of non-negative least squares (NNLS) is solved instead of the unconstrained least squares (ULS). After solving the NNLS subproblem, the entries of in the complement set are forced zero to ensure the sparsity. A fast implementation of NNOMP is proposed in [12] based on the active-set algorithm. The role of is also explained in [12] by using the Karush-Kuhn-Tucker (KKT) condition. The condition means that the column is the descending column, which can be added into the set and reduce the residual.
However, in our case, because our coefficient matrix is not column-normalized, this sparse recovery problem is intractable. The magnitude of each column varies significantly, which influences the accuracy of finding the correct column that is best correlated with the residual. The NNOMP algorithm prefers the large-magnitude columns of coefficient matrix , resulting in a low probability to select the correct index of non-zero entries. For example, although there exists one column that is parallel to the residual, the NNOMP algorithm will still select the wrong column with large magnitude.
To deal with this problem, we design the WNOMP algorithm, which is shown in Algorithm 2. The WNOMP algorithm can strike a good balance between the effect of column magnitude and correlation with the residual by setting the dynamic weight for each selection. The weight of magnitude should be smaller than that of the correlation. If we define the column-wise normalized coefficient matrix as
(12) |
where is the column vector of coefficient matrix . The dynamic weight can be calculated as
(13) |
The dynamic weight plays an important role in the selection of Step 4 of Algorithm 2. An intuitive idea to reduce the influence of column is to use the column-normalized matrix , i.e., the selection in Step 4 of Algorithm 2 is modified as . However, this modification is not good, because the propagation paths are more likely to exist in the angles with large antenna gain of the transmit antenna array in the base station, which means the large-magnitude column corresponds to the non-zero entries with a large probability. As for the selection , it can be viewed as the multiplication of the modification case and , where the correlation with residual contributes little to the selection because the value of is much larger than . As a result, the WNOMP considers these two terms separately, and use to balance them. The Step 7 of Algorithm 2 compresses the set by updating it with the support of , because some components of may vanish due to the activation of the non-negative constraints.




IV Simulation Results and Discussions
In the following, we show numerical results to analyze the performance of the algorithms proposed in this paper. To generate , we set the interval of the tilt angle to be two degrees, and the interval of the azimuth angle to be five degrees, as shown in Fig. 3. Due to the antenna gain , the magnitude (i.e., norm) of each column in varies significantly, making problem (11) intractable. Both synthetic and real-world measurement data are considered.
For synthetic data, the location of non-zero entries of synthetic data are generated following the uniform distribution in the interval . The accuracy is defined as the ratio of the number of correctly recovered non-zero entries to the total number of non-zero entries, i.e., the success of support recovery. Besides the NNOMP and the proposed WNOMP for the norm constraint, we also consider the least absolute shrinkage and selection operator (LASSO) for sparse recovery, which uses regularization and can be solved by proximal gradient descent. The performance of the proposed algorithm is related to the problem size, i.e., , and . As shown in Fig. 4(a), we fix = 32 and = 5, the accuracy decreases when increases because the algorithms may select the wrong column with more choices. Here, we change the value of by picking the columns with the top highest magnitude. The NNOMP performs worst, while WNOMP performs better than LASSO. As shown in Fig. 4(b), the accuracy increases when increases for = 400 and = 5, which indicates that the channel model is more accurate with more beam measurements. As shown in Fig. 4(c), the accuracy decreases when increases for and .

As for the real-world measurement data, they are collected by using the drive-test in street of the Chengdu city, China, as shown in Fig. 5. The carrier frequency is GHz with MHz bandwidth. There are transmit antennas at the base station. Depending on the velocity of the driving car, dozens of samples are collected when passing through a meters grid in the interval of several seconds. For each grid, the user equipment can measure the RSRP of the CSI-RS beams (32 beams in total) and SSB beams (8 beams in total) transmitted from its serving cell and other neighborhood cells. In general, the number of the measured beams in the serving cell is more than the number of the measured beams in the neighborhood cells. Based on , we obtain the RSRP by averaging the samples, then use them and the coefficient matrix to formulate and solve the problem (11).
(LASSO NNOMP WNOMP)
# of grids | MAE (dB) | |
Serving cell | 718 | 6.14 4.45 4.93 |
Neighborhood | 1566 | 7.32 5.85 5.58 |
All grids | 2284 | 6.95 5.41 5.38 |
(LASSO NNOMP WNOMP)
# of grids | MAE (dB) | |
Serving cell | 631 | 6.07 5.37 4.89 |
Neighborhood | 1055 | 11.5 10.6 8.96 |
All grids | 1686 | 9.44 8.68 7.44 |
In particular, to evaluate the proposed channel modeling method in practical systems, we collect the RSRP measurements both before and after rotating the antenna array for certain degrees. The former RSRP data is used for channel modeling by solving (11) and the solution denoted by . Then we construct the coefficient matrix for the array after rotation, denoted by . The estimated RSRP after array rotation is thus given by . Suppose the RSRP after rotating as , we then use the following mean absolute error (MAE)
(14) |
as the performance measure.
As shown in Table II of predicting SSB beams, we consider 2284 grids in total and calculate the average MAE. The MAE of serving cell is better than that of neighborhood cells due to more beam measurements in serving cell. The MAE from 1686 grids of predicting CSI-RS beam is shown in Table III, which is worse than that of the prediction of SSB beam, because the beam pattern of CSI-RS is more complex. Although LASSO performs better than the NNOMP in the synthetic data, its performance deteriorates in the real-world measurement data, where the paths does not follow the uniform distribution. The MAE performance of the proposed WNOMP is better than those of LASSO and NNOMP algorithms, which means that the channel model established by WNOMP is more accurate than the channel models established by LASSO and NNOMP.
V Conclusions
In this paper, we propose the LSCM to sense the physical environment that is localized for the specific geographical location. The proposed LSCM can generate a high-resolution multi-path channel in the angular domain by exploiting the first-order statistics of RSRP measured from multiple beams. The statistical distribution of the APS in the LSCM is consistent with that of true wireless propagation environment. To construct the LSCM, we design the WNOMP algorithm to deal with the coefficient matrix without column-wise normalization at a low computational complexity. Simulation results verify the WNOMP algorithm outperforms the classic LASSO and NNOMP algorithms in terms of accuracy and MAE. Our future work will focus on the joint estimation of the APS by taking advantage of the spatial consistency in multiple grids [13].
Appendix A Proof of Theorem 1
We want to establish the relationship between and . We have
(15) |
where is the expectation of w.r.t. both and . Then, we need to show how is expressed by . According to the definition in (2),
(16) |
where .
The expectation of first term and second term in (16) w.r.t. and are equal. Then, we have
(17) | ||||
(18) |
It is inferred from the distribution of that
(19) |
when . As a result,
(20) | ||||
(21) |
At last, we can obtain
(22) | |||
(23) |
where
(24) |
References
- [1] Y. Liu, W. Huangfu, H. Zhang, and K. Long, “An efficient stochastic gradient descent algorithm to maximize the coverage of cellular networks,” IEEE Trans. Wireless Commun., vol. 18, no. 7, pp. 3424–3436, 2019.
- [2] W. Li, D. López-Pérez, X. Geng, H. Bao, Q. Song, and X. Chen, “A zeroth-order continuation method for antenna tuning in wireless networks,” in Proc. IEEE ICC, 2021, pp. 1–6.
- [3] R. M. Dreifuerst, S. Daulton, Y. Qian, P. Varkey, M. Balandat, S. Kasturia, A. Tomar, A. Yazdan, V. Ponnampalam, and R. W. Heath, “Optimizing coverage and capacity in cellular networks using machine learning,” in Proc. IEEE ICASSP, 2021, pp. 8138–8142.
- [4] S. Wu, C.-X. Wang, M. M. Alwakeel, X. You et al., “A general 3-D non-stationary 5G wireless channel model,” IEEE Trans. Commun., vol. 66, no. 7, pp. 3065–3078, 2017.
- [5] C.-X. Wang, J. Bian, J. Sun, W. Zhang, and M. Zhang, “A survey of 5G channel measurements and models,” IEEE Commun. Surv. Tutor., vol. 20, no. 4, pp. 3142–3168, 2018.
- [6] J. Bian, C.-X. Wang, X. Gao, X. You, and M. Zhang, “A general 3D non-stationary wireless channel model for 5G and beyond,” IEEE Trans. Wireless Commun., vol. 20, no. 5, pp. 3211–3224, 2021.
- [7] S. Hussain and C. Brennan, “Efficient preprocessed ray tracing for 5G mobile transmitter scenarios in urban microcellular environments,” IEEE Trans. Antennas Propagat., vol. 67, no. 5, pp. 3323–3333, 2019.
- [8] A. C. S. Rodriguez, N. Haider, Y. He, and E. Dutkiewicz, “Network optimisation in 5G networks: A radio environment map approach,” IEEE Trans. Veh. Technol., vol. 69, no. 10, pp. 12 043–12 057, 2020.
- [9] C. S. Park and S. Park, “Analysis of RSRP measurement accuracy,” IEEE Commun. Lett., vol. 20, no. 3, pp. 430–433, 2016.
- [10] J. A. Tropp and A. C. Gilbert, “Signal recovery from random measurements via orthogonal matching pursuit,” IEEE Trans. Inf. Theory, vol. 53, no. 12, pp. 4655–4666, 2007.
- [11] 3GPP, “Study on channel model for frequencies from 0.5 to 100 GHz (Release 16),” Technical Report (TR) 38.901, Jan. 2020, version 16.1.0.
- [12] T. T. Nguyen, J. Idier, C. Soussen, and E.-H. Djermoune, “Non-negative orthogonal greedy algorithms,” IEEE Trans. Signal Process., vol. 67, no. 21, pp. 5643–5658, 2019.
- [13] X. Ning, S. Zhang, X. Zheng, and T.-H. Chang, “Multi-grid-based localized statistical channel modeling: A radio map approach,” in Proc. IEEE ISWCS, Oct. 2022.