This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Integrated Location Sensing and Communication for Ultra-Massive MIMO With Hybrid-Field Beam-Squint Effect

Zhen Gao, , Xingyu Zhou, Boyu Ning, , Yu Su, , Tong Qin,
Dusit Niyato
Abstract

The advent of ultra-massive multiple-input-multiple-output systems holds great promise for next-generation communications, yet their channels exhibit hybrid far- and near- field beam-squint (HFBS) effect. In this paper, we not only overcome but also harness the HFBS effect to propose an integrated location sensing and communication (ILSC) framework. During the uplink training stage, user terminals (UTs) transmit reference signals for simultaneous channel estimation and location sensing. This stage leverages an elaborately designed hybrid-field projection matrix to overcome the HFBS effect and estimate the channel in compressive manner. Subsequently, the scatterers’ locations can be sensed from the spherical wavefront based on the channel estimation results. By treating the sensed scatterers as virtual anchors, we employ a weighted least-squares approach to derive UT’s location. Moreover, we propose an iterative refinement mechanism, which utilizes the accurately estimated time difference of arrival of multipath components to enhance location sensing precision. In the following downlink data transmission stage, we leverage the acquired location information to further optimize the hybrid beamformer, which combines the beam broadening and focusing to mitigate the spectral efficiency degradation resulted from the HFBS effect. Extensive simulation experiments demonstrate that the proposed ILSC scheme has superior location sensing and communication performance than conventional methods.

Index Terms:
Ultra-massive multiple-input-multiple-output (UM-MIMO), hybrid far- and near- field, beam squint, millimeter-wave (mmWave), integrated sensing and communication (ISAC).

I Introduction

The next-generation wireless communication is poised to enhance the traditional mobile-broadband and real-time experiences to the next level and unlock novel use cases such as spatial perception and immersive platform [1]. To support these services, it is expected that the new sixth-generation (6G) targets will push existing key performance indicators further in areas such as throughput, latency, and coverage. In this context, revolutionary physical layer (PHY) technologies serve as fundamental building blocks for the performance leap, among which the advanced multiple-input multiple-output (MIMO) technology [2, 3, 4] and spectrum expansion [2, 5] hold great promise.

On the one hand, the evolution toward ultra-massive MIMO (UM-MIMO) emerges as a viable solution for further enhancing spatial degrees of freedom, thereby providing higher spectral efficiency. On the other hand, underutilized spectrum bands, such as millimeter-wave (mmWave) and terahertz (THz), harbor a substantial reservoir of available bandwidth, which enables the establishment of extremely wideband channels with unprecedented transmission rates. Moreover, the shorter wavelengths at these higher frequency bands permit the dense packing of more antennas within the same aperture size, effectively compensating for most of the path loss attenuation by resorting to highly directional beams [5]. Consequently, UM-MIMO and mmWave (THz) technologies mutually complement each other, constituting a foundation of the 6G PHY.

Despite the appealing advantages of mmWave UM-MIMO systems, their channels exhibit the Hybrid Far- and Near- Field Beam-Squint (HFBS) effect due to the substantial increase in the number of antennas and the corresponding bandwidth, urgently necessitating a significant shift in signal processing paradigms. Specifically, electromagnetic wave radiation can be categorized into two regions: Fraunhofer (far-field) and Fresnel (near-field), with their boundary quantified by the Rayleigh distance [6]. The limited antenna aperture of current cellular networks makes the Rayleigh distance negligible for typical cellular networks. Consequently, existing communication technologies predominantly rely on the far-field radiation theories [7]. However, as the antenna aperture sizes dramatically increase in the UM-MIMO systems, the near-field region expands rapidly, and some of the served user terminals (UTs) would fall into the near-field region and some otherwise, leading to the coexistence of the near-field and the far-field propagation [8, 9]. Moreover, the combination of large bandwidth and UM-MIMO introduces a non-negligible propagation delay gap at different antennas for the same received signal filling this array aperture, which could be as large as multiple symbol periods. This phenomenon is termed as the delay-squint effect [9, 11, 10]. From the frequency domain perspective, the delay-squint effect could further introduce the beam-squint effect. To be more specific, the beam direction is a function of the operating frequency, and the undesired beam misalignment for the signals at marginal carrier frequencies occurs that reduces the spatial-frequency consistency [11].

Furthermore, with their broader bandwidth and larger antenna aperture, mmWave UM-MIMO systems inherently yield improved delay and angle resolution. Meanwhile, the hybrid-field propagation effect offers the potential of simultaneous angular and range resolutions by measuring the different antenna elements’ phase difference generated by the spherical electromagnetic wave. As a result, the HFBS effect of mmWave UM-MIMO systems also can be promising solutions for applications such as target localization [9], radar sensing [12], simultaneous localization and mapping (SLAM) [13], and ultimately, integrated sensing and communication (ISAC) [14].

I-A Prior Work

Reference Channel Beam Squint Configuration Functionality
Localization Communication
[12] Near field Single base station \checkmark
[13] Near field Single base station \checkmark
[26] Near field \checkmark Single base station \checkmark
[27] Near field \checkmark Single base station+RIS \checkmark
[36] Near field Single base station \checkmark
[37] Near field Single base station \checkmark
[38] Near field Single base station \checkmark
[39] Near field Single base station+RIS \checkmark
[9] Hybrid far and near field \checkmark Single base station+RIS \checkmark \checkmark
Our work Hybrid far and near field \checkmark Single base station \checkmark \checkmark
TABLE I: Summary of existing works on ILSC

Hybrid beamforming architecture has been widely considered for massive MIMO and UM-MIMO systems due to their low hardware cost achieved by using a much smaller number of radio frequency (RF) chains than antenna elements. [18]. Nevertheless, this architecture significantly decreases the effective baseband measurement dimension of received signals, bringing great challenges to full-dimensional channel state information (CSI) acquisition. To this end, compressive sensing (CS) technique, as an effective tool to recover the sparse signals via underdetermined measurements, has attracted extensive attention for channel estimation in mmWave and THz bands [18, 19, 20]. Under the assumption of the far-field planar wavefront, the mmWave massive MIMO channel exhibits an obvious sparsity pattern in the angular domain. Therefore, the channel estimation can be formulated as a larger number of antennasal sparse signal recovery problem, and solved resorting to the off-the-shelf CS-based algorithms [18, 21, 22]. However, when extended to UM-MIMO systems, the far-field approximation no longer holds, leading to a non-negligible energy dispersion and deteriorating its inherent sparsity. For this purpose, the authors of [23] introduced a subarray piecewise approximate far-field-based sparse channel estimation scheme, which handled the channels of different subarrays independently with the far-field approximation. However, it overlooked the geometrical correlations of different subarrays, thus increasing the parameters to be estimated, and only performing well at a moderate medium antenna array size. Furthermore, a more complex Fresnel approximation was developed based on second-order Taylor expansion, and [25, 24] proposed higher-dimensional projection matrices to enable sparse channel estimation at the cost of increased computational complexity. This is due to the reason that candidate columns steering vectors of projection matrices were determined by both the angular and the range parameters.

Most of the aforementioned schemes have focused on leveraging the structured common sparsity of the spatial-frequency channel to enhance estimation performance, yet neglecting the beam-squint effect. To address this issue, [26] revealed the bilinear pattern of HFBS, which implies that the sparse support set of channels in both the angle and the range domains can be regarded as a linear function against frequency, and developed a bilinear pattern based structured CS algorithm for channel estimation. Additionally, [9, 27, 28] developed a frequency-dependent projection matrix to correct the beams-squint effect and maintained the spatial-frequency consistency in the corresponding transform domain. Besides, for the problem of dispersed beams caused by the beam-squint effect, a common solution is to employ true-time-delay (TTD) to replace phase shifters to generate frequency-dependent phase shift but inducing higher hardware cost [10]. To retain the original cost-effective hybrid beamforming architecture, [29] proposed analog beam broadening strategies to divide the whole array into several virtual subarrays to generate a wider beam and provide constant beamforming gain as much as possible over the whole operating frequency band, thus mitigating the impact of far-field beam-squint to some extent. Furthermore, [30] studied a spatial frequency modulated continuous wave (FMCW)-based phase shift design for massive phased arrays to overcome the near-field beam misfocus problem, and achieve a uniform gain over a wide bandwidth and a higher rate than the standard design in the near-field propagation region.

In 6G mobile communication systems, the use of higher frequency bands, wider bandwidth, and larger number of antennas will pave the way for advanced sensing capabilities with high accuracy and resolution. As a result, future 6G ISAC systems are likely to support a variety of sensing applications, including ultra-high accuracy localization and tracking, simultaneous imaging, mapping, and localization, as well as enhanced human sensing, gesture, and activity recognition[17]. Localization is a typical application of ISAC. Therefore, the concept of integrated localization and sensing communication (ILSC) is proposed to further specify and refine the broader concept of ISAC. In the context of sensing and localization, research efforts towards SLAM and ISAC techniques are well underway. Standardized cellular network localization methods allocate dedicated reference signals, namely the downlink positioning reference signal (PRS) and uplink sounding reference signal (SRS), to facilitate positioning [31]. These signals are instrumental in deriving location information by measuring parameters such as reference signal received power (RSRP), time difference of arrival (TDoA), angle of departure (AoD), angle of arrival (AoA), and multi-cell round trip time (Multi-RTT). Moreover, these measurements further enable precise UT localization, achieving accuracy ranging from a few meters to a few decimeters depending on the specific deployment scenarios [32]. In [33, 34], the authors proposed a joint localization and location-aware beamforming scheme for a semi-passive RIS-assisted system for the single-user and multiple-user scenarios, where the angular parameters were utilized to position the UTs. However, only the LoS path was considered and the delay information was also dismissed. In [35], the author design a joint users’ activity and channel estimation scheme under near-field scenario and sense the users’ location with the TDoA and AoA of the LoS links between multiple subarrays and users. For the near-field propagation region of UM-MIMO systems, the accurate spherical wavefront presents a new way for sensing and localization. Particularly, [36, 37] pioneered to develope a modified two-dimensional version of the multiple signal classification (MUSIC) algorithm for localizing the signal sources with the spherical wavefront sampled by a passive sensor array. Then, [38] derived the theoretical error bound for target tracking by only harnessing the curvature of arrival in phase-difference measurements in the near-field region, and revealed that the range information tends to decrease with increasing target distance. The authors in [39] further proposed a CS-based reconfigurable intelligent surfaces (RIS) aided localization scheme for near-field region depending on the double anchors. Moreover, [12] presented a near-field wideband sensing scheme, which unveiled that coherent processing of spatial domain and frequency measurements contributes to better sensing accuracy compared with [36, 37, 38, 39].

However, the aforementioned schemes necessitated the allocation of dedicated radio resources for positioning signals. To minimize resource overhead and achieve highly integrated functionality, [13] proposed to utilize the beam sweeping procedure in the broadcast synchronization signal block (SSB), where the angular parameters can be extracted for reconstructing images of the propagation environment and providing reliable UT localization service simultaneously. Yet, this method only peroformed well for indoor scenario with smooth concrete wall and required the accurate prior information of the initial positions of UTs and base station (BS), significantly limiting its application scope. Additionally, [9] introduced an integrated uplink channel estimation and localization scheme for the RIS-assisted UM-MIMO system, which synthetically exploited the TDoA and AoA information of hybrid-field spherical wave for accurate UT localization based on the estimated channels, and the localization results were further used to improve the channel estimation performance in turn. However, this method relied on double anchors including the BS and RIS, and also overlooked the design of data transmission for achieving ISAC architecture. Meanwhile, the authors in [14] optimized the ISAC signal to maximize the near-field sensing performance subject to a minimum communication rate while requires the priori target location information and only considered the ideal line-of-sight (LoS) propagation condition. In [15], the UAV-enabled ISAC systems are shown to effectively address the requirements of periodic sensing scenarios. In addition, the authors in [15, 16] have investigated the problem of resource allocation between communication and sensing in ISAC. Additionally, it may be beneficial to explore network-level cooperative ISAC, which leverages multi-cell cooperation to enhance both sensing and communication coverage. As discussed in [40], network-level ISAC can provide additional degrees of freedom for achieving better integration between sensing and communication, thereby improving overall system performance. By coordinating across multiple ISAC base stations, such a network can optimize resource allocation and interference management, offering promising opportunities for overcoming limitations in traditional ISAC setups.

Based on the review above and Table I, there still exists a gap among previous works in investigating ILSC performance with only a single UM-MIMO anchor and considering both HF and beam squint effects simultaneously.

I-B Our Contributions

In this work, we focus on achieving ILSC function for time-division duplexing (TDD) mmWave UM-MIMO systems. To be specific, the main contributions of this paper can be summarized as follows:

  • We conceive a novel frame structure and signal processing flow designed for ILSC. The proposed architecture consists of a training stage and a data transmission stage. During the training stage, UTs transmit the uplink reference signals to acquire CSI. Moreover, without the need for additional radio resources, a BS can simultaneously sense the positions of scatterers based on the estimated channels and further provide accurate UT localization service by exploiting the hybrid-field propagation characteristic. In the following downlink data transmission stage, the BS further employs location information to design beamforming schemes for more efficient payload data transmission.

  • We utilize a projection-based Bayesian CS channel estimation scheme to combat the HFBS effect. By utilizing an elaborate frequency-dependent hybrid-field projection matrix, we ensure that the channel vectors (from one UT’s antenna to all BS’s antennas) represented in the projected domain have the identical sparsity patterns across different sub-carriers and different antennas on the UT side. On this basis, we further consider the generalized multiple measurement vector approximate message passing (GMMV-AMP) algorithm to solve this CS problem, where the expectation-maximization (EM) technique is combined to adaptively learn the unknown hyper-parameters across all subcarriers.

  • We propose to simultaneously locate the scatterers and UT based on the acquired CSI, by leveraging the hybrid-field propagation characteristic of UM-MIMO systems. Exploiting acquired CSI, we simultaneously determine locations of scatterers and a UT without requiring additional radio resources. Specifically, we extract the targets’ angle and distance information from the spherical electromagnetic wavefront based on the estimated channels to sense the scatterers’ locations. Then, by treating these sensed scatterers as virtual anchors (VAs), we can coarsely estimate the UT’s location according to the propagation geometry. To further enhance the sensing performance, we utilize super-resolution relative delay estimation so that the UT’s and scatterers’ location can be refined through an iterative procedure.

  • We introduce a hybrid beamforming design for downlink data transmission that leverages the location sensing results and overcomes the HFBS effect. Specifically, we first develop an analog beamforming codebook. Within this codebook, each beamforming vector is associated with a pair of estimated distance and angle parameters, and thus can focus the signal energy on the sensed position under the hybrid-field effect. To further mitigate the beam-squint effect, each beam is designed to be broadened along both the distance and angle domains simultaneously to cover the beam-squint region, thus providing constant beamforming gain as much as possible over the whole bandwidth and enhance spectral efficiency (SE). Furthermore, we employ the simultaneous orthogonal matching pursuit (SOMP) algorithm to select optimal analog beamforming vectors and design the corresponding digital precoder to enable multi-stream transmission.

I-C Notations

Throughout this paper, scalar variables are represented by normal-face letters, while column vectors and matrices are indicated by boldface lower and upper-case letters, respectively. The transpose, Hermitian transpose, and inversion of matrices are denoted by ()T(\cdot)^{\rm T}, ()H(\cdot)^{\rm H}, and ()1(\cdot)^{-1}, respectively. In addition, |||\cdot| represents the modulus, p\|\cdot\|_{p} denotes the p\ell_{p}-norm, and F\|\cdot\|_{\rm F} signifies the Frobenius-norm. []i[\cdot]_{i} denotes the ii-th element of a vector, and []i,j[\cdot]_{i,j} represents the ii-th row and jj-th column element of a matrix. Furthermore, ||c|\cdot|_{c} indicates the cardinality of a set. 𝒞𝒩(,μ,Σ)\mathcal{CN}(\cdot,\mu,\Sigma) represents the Gaussian distribution with mean μ\mu and variance Σ\Sigma, while 𝒰[a,b]\mathcal{U}[a,b] stands for a uniform distribution between aa and bb. 𝔼()\mathbb{E}(\cdot) and 𝕍()\mathbb{V}(\cdot) calculate the mean and variance of a variable, respectively. ()\partial(\cdot) denotes the first-order partial derivative operation, and diag(𝐚){\rm diag}(\mathbf{a}) constructs a diagonal matrix with elements from vector 𝐚\mathbf{a} placed along its diagonal. Finally, 𝐈n\mathbf{I}_{n} denotes the identity matrix of size n×nn\times n, and 𝟎n×m\mathbf{0}_{n\times m} represents the n×mn\times m zero matrix.

II System Model

Consider an ILSC scenario in mmWave UM-MIMO systems as shown in Fig. 1, where a BS serves UU UTs in TDD mode. The BS and UTs are all equipped with a uniform linear array (ULA)111If uniform planar array is considered, the dimensionality of the parameters becomes too large to run the simulation. Therefore, without changing the core of the problem, we have made a relatively idealized treatment of the practical problem in order to gain an understanding of the essential problem of HFBS effect in channel estimation and localization, and to gain corresponding insights., and comprise NBS(NUT)N_{\rm BS}\ (N_{\rm UT}) antennas and NBSRF(NUTRF)N_{\rm BS}^{\rm RF}\ (N_{\rm UT}^{\rm RF}) RF chains. To reduce the hardware costs and power consumption of UM-MIMO, the BS and UTs all employ hybrid beamforming architecture with a much smaller number of RF chains than that of antennas, i.e., NBSNBSRF(NUTNUTRF)N_{\rm BS}\gg N_{\rm BS}^{\rm RF}\ (N_{\rm UT}\gg N_{\rm UT}^{\rm RF}). Moreover, the cyclic prefix orthogonal frequency division multiplexing (CP-OFDM) with MM subcarriers is adopted to combat the frequency selective fading of the broadband mmWave channels.

Refer to caption
Figure 1: The ILSC scenario for mmWave UM-MIMO systems.

II-A Uplink Reference Signal Transmission

At this stage, multiple UTs transmit mutually orthogonal reference signals in the QQ consecutive time slots in the uplink to acquire CSI. For a specific UT, in order to estimate the mm-th subcarrier’s CSI, the received baseband signal 𝐲~UL[m,q]Nsp\tilde{\mathbf{y}}_{\rm UL}[m,q]\in\mathbb{C}^{N_{s}^{p}} at the BS in the qq-th time slot can be formulated as

𝐲~UL[m,q]=\displaystyle\tilde{\mathbf{y}}_{\rm UL}[m,q]= PtULM𝐖ULH[m,q]𝐇UL[m]𝐟ULRF[q]s[m,q]\displaystyle\frac{P_{t}^{\rm UL}}{M}\mathbf{W}^{\rm H}_{\rm UL}[m,q]\mathbf{H}_{\rm UL}[m]\mathbf{f}_{\rm UL}^{\rm RF}[q]s[m,q]
+𝐖ULH[m,q]𝐧~[m,q],\displaystyle+\mathbf{W}^{\rm H}_{\rm UL}[m,q]\tilde{\mathbf{n}}[m,q], (1)

where 1qQ1\leq q\leq Q and 1mM1\leq m\leq M. In (II-A), 𝐖UL[m,q]\mathbf{W}_{\rm UL}[m,q] represents the BS’s receive combining matrix 𝐖UL[m,q]=𝐖ULRF[q]𝐖ULBB[m,q]NBS×Nsp\mathbf{W}_{\rm UL}[m,q]=\mathbf{W}^{\rm RF}_{\rm UL}[q]\mathbf{W}^{\rm BB}_{\rm UL}[m,q]\in\mathbb{C}^{N_{\rm BS}\times N_{s}^{p}}, in which 𝐖ULRF[m,q]NBS×NBSRF\mathbf{W}^{\rm RF}_{\rm UL}[m,q]\in\mathbb{C}^{N_{\rm BS}\times N_{\rm BS}^{\rm RF}} and 𝐖ULBB[m,q]NBSRF×Nsp\mathbf{W}^{\rm BB}_{\rm UL}[m,q]\in\mathbb{C}^{N_{\rm BS}^{\rm RF}\times N_{s}^{p}} are the analog and digital receive combining matrices respectively, NspN_{s}^{p} is the number of independent received signal streams on each subcarrier. By activating only one RF chain in one time slot for UT, the RF reference signal vector transmitted by the UT can be denoted as 𝐟ULRF[q]NUT×1\mathbf{f}_{\rm UL}^{\rm RF}[q]\in\mathbb{C}^{N_{\rm UT}\times 1}. PtULP_{t}^{\rm UL} is the total uplink transmit power, while s[m,q]s[m,q] is the baseband reference signal with s[m,q]s[m,q]=1s[m,q]s^{*}[m,q]=1. In addition, 𝐇UL[m]NBS×NUT\mathbf{H}_{\rm UL}[m]\in\mathbb{C}^{N_{\rm BS}\times N_{\rm UT}} stands for the corresponding uplink frequency-domain channel matrix, whose modeling approach will be detailed in Sec. II-B, and 𝐧~[m,q]NBS×1\tilde{\mathbf{n}}[m,q]\in\mathbb{C}^{N_{\rm BS}\times 1} is the complex additive white Gaussian noise (AWGN) with the covariance matrix σ2𝐈NBS\sigma^{2}\mathbf{I}_{N_{\rm BS}}, i.e., 𝐧[m,q]𝒞𝒩(𝟎NBS,σ2𝐈NBS){\mathbf{n}}[m,q]\sim\mathcal{CN}(\mathbf{0}_{N_{\rm BS}},\sigma^{2}\mathbf{I}_{N_{\rm BS}}). Due to the constant modulus constraint of the adopted fully-connected RF phase shift network at the BS and UT, the uplink transmit and combining matrices can be expressed as [𝐟ULRF[q]]i1=1NUTejϑi11\left[\mathbf{f}_{\rm UL}^{\rm RF}[q]\right]_{i_{1}}=\frac{1}{\sqrt{N_{\rm UT}}}e^{j\vartheta^{1}_{i_{1}}} and [𝐖ULRF[q]]i1,i2=1NBSejϑi1,i22\left[\mathbf{W}_{\rm UL}^{\rm RF}[q]\right]_{i_{1},i_{2}}=\frac{1}{\sqrt{N_{\rm BS}}}e^{j\vartheta^{2}_{i_{1},i_{2}}} respectively, where ϑi11\vartheta^{1}_{i_{1}} and ϑi1,i22\vartheta^{2}_{i_{1},i_{2}} denote the phase value connecting the i1i_{1}-th antenna and the i2i_{2}-th RF chain, respectively.

𝐇UL[m]=𝐇ULLoS[m]+l=1LβNLoSlNBSNUTLαlej2πλm(rlUT+rlBS)𝐚BS(θlBS,rlBS,m)𝐚UTH(θlUT,rlUT,m),\displaystyle\mathbf{H}_{\rm UL}[m]=\mathbf{H}_{\rm UL}^{\rm LoS}[m]+\sum_{l=1}^{L}\sqrt{\frac{\beta_{\rm NLoS}^{l}N_{\rm BS}N_{\rm UT}}{L}}\alpha_{l}e^{-j\frac{2\pi}{\lambda_{m}}(r_{l}^{\rm UT}+r_{l}^{\rm BS})}\mathbf{a}_{\rm BS}(\theta_{l}^{\rm BS},r_{l}^{\rm BS},m)\mathbf{a}_{\rm UT}^{\rm H}(\theta_{l}^{\rm UT},r_{l}^{\rm UT},m), (2)

II-B Channel Model with HFBS Effect

Considering the channel reciprocity in TDD systems, we focus on the formulation of the uplink channel matrix next. In an mmWave UM-MIMO system with a hyper-scale antenna aperture and ultra-wide bandwidth, the channel exhibits an intrinsic HFBS effect. Our study specifically targets scenarios that are representative of indoor and short-range localization and communication environments, rather than urban macro-cell (UMa) environments. In these indoor settings, both near-field and far-field effects are present, along with significant beam squint effects due to the large antenna aperture and wide bandwidth, which are distinct from the purely far-field propagation considered in UMa scenarios. Under the spherical wavefront assumption, the spatial-frequency domain hybrid near- and far-field channel matrix 𝐇UL[m]\mathbf{H}_{\rm UL}[m] can be accurately modeled as (2) shown at the top of next page, which is a sum of the contributions of (L+1)(L+1) multipath components (MPCs) including one LoS MPC and several non-LoS (NLoS) MPCs reflected by LL scattering clusters [42]. Particularly, for the LoS component 𝐇ULLoS[m]\mathbf{H}_{\rm UL}^{\rm LoS}[m], its (n1,n2)(n_{1},n_{2})-th element can be further written as

[𝐇ULLoS[m]]n1,n2=βLoSej2πλmrn1,n2,\displaystyle\left[\mathbf{H}_{\rm UL}^{\rm LoS}[m]\right]_{n_{1},n_{2}}=\sqrt{\beta_{\rm LoS}}e^{-j\frac{2\pi}{\lambda_{m}}r_{n_{1},n_{2}}}, (3)

where βLoS\beta_{\rm LoS} is the large-scale fading gain of the LoS link, λm\lambda_{m} represents the wavelength of the mm-th subcarrier, and rn1,n2r_{n_{1},n_{2}} denotes the physical distance between the n1n_{1}-th antenna element of the BS and the n2n_{2}-th antenna element of the UT.

Refer to caption
Figure 2: The proposed frame structure for ILSC.

For the ll-th NLoS component, it can be decomposed into the product of frequency-dependent array manifold of BS and UT sides under the spherical wave assumption, which can be further concretely presented as [25]

𝐚BS(θlBS,rlBS,m)=\displaystyle\mathbf{a}_{\rm BS}(\theta_{l}^{\rm BS},r_{l}^{\rm BS},m)=
1NBS[ej2πλm(rl,1BSrlBS),,ej2πλm(rl,NBSBSrlBS)]T.\displaystyle\quad\frac{1}{\sqrt{N_{\rm BS}}}\left[e^{-j\frac{2\pi}{\lambda_{m}}(r_{l,1}^{\rm BS}-r_{l}^{\rm BS})},\dots,e^{-j\frac{2\pi}{\lambda_{m}}(r_{l,N_{\rm BS}}^{\rm BS}-r_{l}^{\rm BS})}\right]^{\rm T}. (4)

Herein, rlBSr_{l}^{\rm BS} and θlBS\theta_{l}^{\rm BS} denote the distance and AoA between the ll-th scatterer and the reference point of BS’s antenna array, respectively. Without loss of generality, the reference point could be set to the center of the antenna array. More generally, rl,n1BSr_{l,n_{1}}^{\rm BS} represents the distance between the ll-th scatterer and the n1n_{1}-th element of BS’s antenna array. Under the spherical wave assumption, rl,n1BSr_{l,n_{1}}^{\rm BS} is a function of both rlBSr_{l}^{\rm BS} and θlBS\theta_{l}^{\rm BS}, i.e., rl,n1BS=(rlBS)2+δn12d22rlBSδn1dsin(θlBS)r_{l,n_{1}}^{\rm BS}=\sqrt{\left({r_{l}^{\rm BS}}\right)^{2}+\delta^{2}_{n_{1}}d^{2}-2r_{l}^{\rm BS}\delta_{n_{1}}d\sin(\theta_{l}^{\rm BS})}, in which δn1=(2n1NBS1)/2\delta_{n_{1}}=(2n_{1}-N_{\rm BS}-1)/2 with n1=1,2,,NBSn_{1}=1,2,\dots,N_{\rm BS}, and dd denotes adjacent antenna spacing commonly with half-wavelength. Similarly, 𝐚UT(θlUT,rlUT,m)\mathbf{a}_{\rm UT}(\theta_{l}^{\rm UT},r_{l}^{\rm UT},m) can be expressed as

𝐚UT(θlUT,rlUT,m)=\displaystyle\mathbf{a}_{\rm UT}(\theta_{l}^{\rm UT},r_{l}^{\rm UT},m)=
1NUT[ej2πλm(rl,1UTrlUT),,ej2πλm(rl,NUTUTrlUT)]T,\displaystyle\;\frac{1}{\sqrt{N_{\rm UT}}}\left[e^{-j\frac{2\pi}{\lambda_{m}}(r_{l,1}^{\rm UT}-r_{l}^{\rm UT})},\dots,e^{-j\frac{2\pi}{\lambda_{m}}(r_{l,N_{\rm UT}}^{\rm UT}-r_{l}^{\rm UT})}\right]^{\rm T}, (5)

where rlUTr_{l}^{\rm UT} and θlUT\theta_{l}^{\rm UT} denote the distance and AoD between the reference point of UT’s antenna array and the ll-th scatterer, respectively. Additionally, in (2), βNLoSl\beta_{\rm NLoS}^{l} and αl𝒞𝒩(0,σα2)\alpha_{l}\sim\mathcal{CN}(0,\sigma_{\alpha}^{2}) are the large-scale fading gain of the NLoS link and small-scale channel gain corresponding to the ll-th NLoS component, respectively.

Note that in practical environments, multi-hop paths experience significant path loss, as shown in [43], where each hop results in approximately 6 dB reflection loss and 12 dB transmission loss in a building scatterer scenario. Consequently, these multi-hop signals are treated as noise, having a minimal effect on both communication and sensing capabilities. Therefore, in the modeling presented in this paper, only single-hop NLoS components are considered.

III Uplink Channel State Information Acquisition

This section introduces the uplink CSI acquisition, i.e., the BS estimates the uplink channels based on the pilot signals transmitted by the UTs. Specifically, we first conceive a dedicated frame structure for ILSC and formulate the channel estimation problem as a structured CS problem. Moreover, we conceive a frequency-dependent hybrid-field projection matrix to unveil the inherent sparse pattern of the channels with HFBS effect. Finally, we develop an algorithm that combines AMP with the EM technique to address this CS problem.

III-A Problem Formulation

The designed frame structure for ILSC is depicted in Fig. 2. Its time-frequency radio resources can be divided into multiple resource elements to convey both reference signals and payload data. Specifically, the first Q=NUTQBSQ=N_{\rm UT}Q_{\rm BS} symbols are allocated for channel training, while the remaining resources are reserved solely for payload data transmission. During the full frame, the discrete Fourier transform (DFT) length of an OFDM symbol is set to Mp=MCPM_{p}=M_{\text{CP}}. Consequently, the symbol duration of the reference signal becomes (Mp+MCP)/Bw(M_{p}+M_{\text{CP}})/B_{w}, where MCPM_{\text{CP}} represents the CP length, and BwB_{w} is the system bandwidth. Furthermore, the reference signals in the training stage consists of QBSQ_{\text{BS}} subframes. Within each subframe, the BS analog receive combiner 𝐖ULRF[q]\mathbf{W}_{\text{UL}}^{\text{RF}}[q] remains constant, and its phase shift of (i1,i2)(i_{1},i_{2})-th elements ϑi1,i22\vartheta^{2}_{i_{1},i_{2}} for 1i1NBS1\leq i_{1}\leq N_{\rm BS} and 1i2NBSRF1\leq i_{2}\leq N_{\rm BS}^{\rm RF} are uniformly distributed in 𝒰[0,2π)\mathcal{U}[0,2\pi). Meanwhile, the digital receive combiner at the BS adopts an identity matrix 𝐖ULBB[m,q]=𝐈NBSRF\mathbf{W}^{\text{BB}}_{\text{UL}}[m,q]=\mathbf{I}_{N_{\text{BS}}^{\text{RF}}}, where Nsp=NBSRFN_{s}^{p}=N_{\text{BS}}^{\text{RF}}. In this setting, UTs transmit NUTN_{\text{UT}} orthogonal reference signals in each subframe, which could be designed as a DFT matrix as 𝐅UL=[𝐟ULRF[1],𝐟ULRF[2],,𝐟ULRF[NUT]]NUT×NUT\mathbf{F}_{\text{UL}}=\left[\mathbf{f}_{\text{UL}}^{\text{RF}}[1],\mathbf{f}_{\text{UL}}^{\text{RF}}[2],\dots,\mathbf{f}_{\text{UL}}^{\text{RF}}[N_{\text{UT}]}\right]\in\mathbb{C}^{N_{\text{UT}}\times N_{\text{UT}}} to meet the prerequisites of constant modulus and orthogonality.

Based on the above assumptions, by collecting 𝐲~UL[m,(q11)NUT+q2]\tilde{\mathbf{y}}_{\rm UL}[m,(q_{1}-1)N_{\rm UT}+q_{2}] for all QBSQ_{\rm BS} subframes together (where q1q_{1} is the subframe index and q2q_{2} is the symbol index in one subframe), the aggregate received signals 𝐘~UL[m]P×NUT\tilde{\mathbf{Y}}_{\rm UL}[m]\in\mathbb{C}^{P\times N_{\rm UT}} can be written as

𝐘~UL[m]=𝐖ULH𝐇UL[m]𝐅UL𝐒[m]+𝐖ULH𝐍~[m],\displaystyle\tilde{\mathbf{Y}}_{\rm UL}[m]=\mathbf{W}^{\rm H}_{\rm UL}\mathbf{H}_{\rm UL}[m]\mathbf{F}_{\rm UL}\mathbf{S}[m]+\mathbf{W}^{\rm H}_{\rm UL}\tilde{\mathbf{N}}[m], (6)

where 𝐘~UL[m]\tilde{\mathbf{Y}}_{\rm UL}[m] successively selects every symbol of the same subframes 1q2NUT1\leq q_{2}\leq N_{\rm UT} as its columns, and stacks different subframes 1q1QBS1\leq q_{1}\leq Q_{\rm BS} on top of each another, and P=QBSNBSRFP=Q_{\rm BS}N_{\rm BS}^{\rm RF} denotes the effective measurement length. Besides, 𝐖UL=[𝐖ULRF[1],𝐖ULRF[2],,𝐖ULRF[QBS]]NBS×P\mathbf{W}_{\rm UL}=\left[\mathbf{W}_{\rm UL}^{\rm RF}[1],\mathbf{W}_{\rm UL}^{\rm RF}[2],\dots,\mathbf{W}_{\rm UL}^{\rm RF}[Q_{\rm BS}]\right]\in\mathbb{C}^{N_{\rm BS}\times P}, 𝐒[m]=PtUL/Mdiag{s[m,1],s[m,2],,s[m,NUT]}\mathbf{S}[m]=\sqrt{P_{t}^{\rm UL}/M}{\rm diag}\left\{s[m,1],s[m,2],\dots,s[m,N_{\rm UT}]\right\} is the stacked baseband reference signal, and 𝐍~[m]NBS×NUT\tilde{\mathbf{N}}[m]\in\mathbb{C}^{N_{\rm BS}\times N_{\rm UT}} stands for the corresponding AWGN for all these QBSQ_{\rm BS} subframes. Finally, the received baseband signal in (6) is post-processed by multiplying 𝐒H[m]𝐅ULH\mathbf{S}^{\rm H}[m]\mathbf{F}_{\rm UL}^{\rm H} right-hand, i.e.,

𝐘UL[m]=MPtUL𝐘~UL[m]𝐒H[m]𝐅ULH=𝐖ULH𝐇UL[m]+𝐍[m],\displaystyle{\mathbf{Y}}_{\rm UL}[m]=\sqrt{\frac{M}{P_{t}^{\rm UL}}}\tilde{\mathbf{Y}}_{\rm UL}[m]\mathbf{S}^{\rm H}[m]\mathbf{F}_{\rm UL}^{\rm H}=\mathbf{W}^{\rm H}_{\rm UL}\mathbf{H}_{\rm UL}[m]+{\mathbf{N}}[m], (7)

in which 𝐍[m]=𝐖ULH𝐍~[m]𝐒H[m]𝐅ULH{\mathbf{N}}[m]=\mathbf{W}^{\rm H}_{\rm UL}\tilde{\mathbf{N}}[m]\mathbf{S}^{\rm H}[m]\mathbf{F}_{\rm UL}^{\rm H} is the effective noise term.

III-B Projection Matrix Design

Refer to caption
(a) The representation of NLoS components with frequency-flat DFT projection matrix.
Refer to caption
(b) The representation of NLoS components with projection matrix defined in (10).
Refer to caption
(c) The piece-wise planar wave approximation of LoS component.
Figure 3: The sparse representation of HFBS channel, where the system parameters are the carrier frequency fc=47GHzf_{c}=47\rm GHz, system bandwidth BW=5GHz\text{BW}=5\text{GHz}, NBS=512N_{\rm BS}=512, NUT=32N_{\rm UT}=32, and M=64M=64 for ease of illustration.

In the case of hybrid far- and near-field propagation effect, the planar wave approximation based projection matrix fails to exactly capture the inherent characteristics of the channel, leading to a degradation in the sparsity of the channels represented in such a projection matrix, as illustrated in Fig. 3(a). To address this issue, a more accurate Fresnel approximation [6] is developed for the distance rl,n1BSr_{l,n_{1}}^{\rm BS} based on a second-order Taylor expansion as

rl,n1BS=(rlBS)2+δn12d22rlBSδn1dsin(θlBS)\displaystyle r_{l,n_{1}}^{\rm BS}=\sqrt{\left({r_{l}^{\rm BS}}\right)^{2}+\delta^{2}_{n_{1}}d^{2}-2r_{l}^{\rm BS}\delta_{n_{1}}d\sin(\theta_{l}^{\rm BS})}
rlBSδn1dsin(θlBS)+δn12d2cos2θlBS2rlBS.\displaystyle\approx r_{l}^{\rm BS}-\delta_{n_{1}}d\sin(\theta_{l}^{\rm BS})+\frac{\delta_{n_{1}}^{2}d^{2}\cos^{2}\theta_{l}^{\rm BS}}{2{r_{l}^{\rm BS}}}. (8)

Based on this approximation, the BS array manifold of each MPC depends on not only its AoD/AoA but also on the distance between the BS and scatterers. To this end, [25, 9] proposed a non-uniform two-dimensional sampling lattice that considers both angles and distances as

Θ={sinθn=2nρNBS+1ρNBS|n=0,1,,ρNBS1,s},\displaystyle\varTheta=\left\{\sin\theta_{n}=\frac{2n-\rho N_{\rm BS}+1}{\rho N_{\rm BS}}\Big{|}n=0,1,\dots,\rho N_{\rm BS}-1,\forall s\right\}, (9a)
={1rn,s=sη4Rrcos2θn|s=0,1,,S1,n},\displaystyle\mathcal{R}=\left\{\frac{1}{r_{n,s}}=\frac{s\eta}{4R_{r}\cos^{2}\theta_{n}}\Big{|}s=0,1,\dots,S-1,\forall n\right\}, (9b)

where ρNBS\rho N_{\rm BS} and SS are the number of grids along the angles and distances, respectively, and Rr=2DBS2/λcR_{r}=2D_{\rm BS}^{2}/\lambda_{c} is the Rayleigh distance corresponding to the antenna array aperture DBSD_{\rm BS}. Furthermore, η\eta is a scaling factor utilized to control the sampling density.

In this way, the projection matrix 𝚽[m]NBS×ρNBSS\boldsymbol{\Phi}[m]\in\mathbb{C}^{N_{\rm BS}\times\rho N_{\rm BS}S} can be designed as

𝚽[m]={𝐚BS(θ,r,m)|θΘ,r,θ,r},\displaystyle\boldsymbol{\Phi}[m]=\left\{\mathbf{a}_{\rm BS}(\theta,r,m)|\theta\in\varTheta,r\in\mathcal{R},\forall\theta,r\right\}, (10)

in which the frequency-selective array manifold 𝐚BS(θ,r,m)\mathbf{a}_{\rm BS}(\theta,r,m) sampled from the defined 2D lattice forms the f(θ,r)f(\theta,r)-th candidate columns of the projection matrix 𝚽[m]\boldsymbol{\Phi}[m]. It is essential to note that when s=0s=0, rn,sr_{n,s}\rightarrow\infty, the defined sampling grids match the traditional virtual-angular lattices in the far-field. This alignment indicates that the projection matrix 𝚽[m]\boldsymbol{\Phi}[m] serves as a generalized representation of channels in both far-field and near-field. Furthermore, in contrast to the frequency-flat projection matrix leading to the offset of sparse support sets across different subcarriers [25], the proposed matrix considers the frequency-dependent behavior of the array manifold to ensure an identical support set across the entire frequency spectrum, as illustrated in Fig. 3(b).

As a result, the NLoS components of the HFBS channel can be readily represented in a sparse manner using the defined projection matrix. Furthermore, while the LoS component in (3) cannot be straightforwardly decomposed into the product of two array manifold vectors, it can be approximated as follows using the piece-wise subarray planar wave for UT’s array as depicted in Fig. 3(c)

𝐇ULLoS[m]βLoS[𝐚BS(1)(m)𝐚UT(1)H(m),,𝐚BS(G)(m)𝐚UT(G)H(m)],\displaystyle\mathbf{H}_{\rm UL}^{\rm LoS}[m]\approx\sqrt{\beta_{\rm LoS}}\left[{\mathbf{a}_{\rm BS}^{(1)}}(m){\mathbf{a}_{\rm UT}^{(1)}}^{\rm H}(m),\dots,{\mathbf{a}_{\rm BS}^{(G)}}(m){\mathbf{a}_{\rm UT}^{(G)}}^{\rm H}(m)\right], (11)

where GG indicates the number of subarrays divided for UT’s array, 𝐚UT(g)(m)=𝐚UT(θgUT,,m){\mathbf{a}_{\rm UT}^{(g)}}(m)=\mathbf{a}_{\rm UT}(\theta_{g}^{\rm UT},\infty,m) and 𝐚BS(g)(m)=(θgBS,rgBS,m){\mathbf{a}_{\rm BS}^{(g)}}(m)=(\theta_{g}^{\rm BS},r_{g}^{\rm BS},m) denote the UT’s array steering vectors of the gg-th subarray and the BS’s array steering vectors, respecitvely. Under this approximation, the LoS component can be represented sparsely with GG components as

𝐇ULLoS[m]βLoS\displaystyle\mathbf{H}_{\rm UL}^{\rm LoS}[m]\approx\sqrt{\beta_{\rm LoS}}
×g=1G𝐚BS(g)(m)[𝟎(g1)NUT/G,𝐚UT(g)(m), 0NUTgNUT/G]H.\displaystyle\times\sum_{g=1}^{G}{\mathbf{a}_{\rm BS}^{(g)}}(m)\left[{\mathbf{0}_{(g-1)N_{\rm UT}/G},\,\mathbf{a}_{\rm UT}^{(g)}}(m),\,\mathbf{0}_{N_{\rm UT}-gN_{\rm UT}/G}\right]^{\rm H}. (12)

Therefore, the channel estimation in (7) is equivalent to a structured sparse signal recovery problem

𝐘UL[m]=𝐀[m]𝐇ULP[m]+𝐍[m],m,\displaystyle{\mathbf{Y}}_{\rm UL}[m]=\mathbf{A}[m]\mathbf{H}_{\rm UL}^{P}[m]+{\mathbf{N}}[m],\,\forall m, (13)

where 𝐀[m]=𝐖ULH𝚽[m]P×ρNBSS\mathbf{A}[m]=\mathbf{W}^{\rm H}_{\rm UL}\boldsymbol{\Phi}[m]\in\mathbb{C}^{P\times\rho N_{\rm BS}S} is the effective sensing matrix and 𝐇ULP[m]ρNBSS×NUT\mathbf{H}_{\rm UL}^{P}[m]\in\mathbb{C}^{\rho N_{\rm BS}S\times N_{\rm UT}} denotes the high-dimensional projected sparse channel matrix.

III-C AMP-EM-Based Channel Estimation

It is well-known that AMP is an efficient iterative Bayesian algorithm for CS problems, which employs low-complexity message passing for approximating the traditional minimum mean square error (MMSE) estimation [44]. For notation brevity, the superscript, subscript, and index mm have been omitted for 𝐘UL[m]{\mathbf{Y}}_{\rm UL}[m] and 𝐇ULP[m]\mathbf{H}_{\rm UL}^{P}[m] hereinafter, and the different entries of matrix is distinguished by its subscript, e.g., hk,n=[𝐇]k,nh_{k,n}=\left[\mathbf{H}\right]_{k,n}. Starting with the MMSE estimate of 𝐇\mathbf{H}, we can obtain its posterior mean as

h^k,n=𝔼{hk,n|𝐘}=hk,np(hk,n|𝐘)dhk,n,k,n,\displaystyle\hat{h}_{k,n}=\mathbb{E}\{{h}_{k,n}|\mathbf{Y}\}=\int{h}_{k,n}p({h}_{k,n}|\mathbf{Y})\,{\rm d}{h}_{k,n},\forall k,n, (14)

where p(hk,n|𝐘)p({h}_{k,n}|\mathbf{Y}) denotes the marginal posterior distribution expressed as follows:

p(hk,n|𝐘)=p(𝐇|𝐘)d𝐇{k,n}.\displaystyle p({h}_{k,n}|\mathbf{Y})=\int p(\mathbf{H}|\mathbf{Y})\,{\rm d}\mathbf{H}_{\sim\{k,n\}}. (15)

(15) requires the high-dimensional integral of joint posterior distribution p(𝐇|𝐘)p(\mathbf{H}|\mathbf{Y}) w.r.t. each variable entry of 𝐇\mathbf{H} except hk,n{h}_{k,n}. Based on the Bayesian theorem, p(𝐇|𝐘)p(\mathbf{H}|\mathbf{Y}) can be represented as

p(𝐇|𝐘)\displaystyle p(\mathbf{H}|\mathbf{Y}) =p(𝐘|𝐇)p0(𝐇)p(𝐘)\displaystyle=\frac{p(\mathbf{Y}|\mathbf{H})\,p_{0}(\mathbf{H})}{p(\mathbf{Y})}
=1p(𝐘)n=1NUT[p=1Pp(yp,n|𝐇)k=1ρNBSSp0(hk,n)].\displaystyle=\frac{1}{p(\mathbf{Y})}\prod_{n=1}^{N_{\rm UT}}\left[\prod_{p=1}^{P}p(y_{p,n}|\mathbf{H})\prod_{k=1}^{\rho N_{\rm BS}S}p_{0}(h_{k,n})\right]. (16)

Under the assumption of AWGN, the above likelihood function p(𝐘|𝐇)p(\mathbf{Y}|\mathbf{H}) obeys complex Gaussian distribution

p(yp,n|𝐇)=1πσ2exp(1σ2|yp,nkap,khk,n|2).\displaystyle p(y_{p,n}|\mathbf{H})=-\frac{1}{\pi\sigma^{2}}{\rm exp}\left(-\frac{1}{\sigma^{2}}\left|y_{p,n}-\sum\nolimits_{k}a_{p,k}h_{k,n}\right|^{2}\right). (17)

By following the classic AMP literature [45], we consider the prior distribution of 𝐇\mathbf{H} to follow an i.i.d. Bernoulli-Gaussian distribution for effectively capturing its sparsity, i.e.,

p0(hk,n)=(1λk,n)δ(hk,n)+λk,n𝒞𝒩(hk,n;μk,n,γk,n),\displaystyle p_{0}(h_{k,n})=(1-\lambda_{k,n})\delta(h_{k,n})+\lambda_{k,n}\mathcal{CN}(h_{k,n};\mu_{k,n},\gamma_{k,n}), (18)

where 0λk,n10\leq\lambda_{k,n}\leq 1 denotes the non-zero probability of hk,nh_{k,n}, and μk,n\mu_{k,n}, γk,n\gamma_{k,n} are the prior mean and variance of a Gaussian distribution, respectively.

The joint distribution p(𝐇,𝐘)p(\mathbf{H},\mathbf{Y}) can be represented by a bipartite graph 𝒢=(𝒱,,)\mathcal{G}=(\mathcal{V},\mathcal{F},\mathcal{E}), which consists of variable nodes 𝒱\mathcal{V}, factor nodes \mathcal{F}, and the corresponding edges \mathcal{E} [46]. This suggests the use of message passing algorithms contributes to rigorous solution of the marginal posterior distribution in (15), and the messages approximation can further reduce the complexity of high-dimensional integral. Therefore, it yields [19]

p(hk,n|𝐘)p0(hk,n)𝒞𝒩(hk,n;Rk,nt,Σk,nt),\displaystyle p({h}_{k,n}|\mathbf{Y})\propto p_{0}({h}_{k,n})\,\mathcal{CN}(h_{k,n};R_{k,n}^{t},\Sigma_{k,n}^{t}), (19)

where tt indicates the iteration index, Σk,nt\Sigma_{k,n}^{t} and Rk,ntR_{k,n}^{t} are the message parameters updated at variable nodes 𝒱\mathcal{V} as

Σk,nt\displaystyle\Sigma_{k,n}^{t} =[p|ap,k|2σ2+Vp,nt]1,\displaystyle=\left[\sum\nolimits_{p}\frac{|a_{p,k}|^{2}}{\sigma^{2}+V_{p,n}^{t}}\right]^{-1}, (20a)
Rk,nt\displaystyle R_{k,n}^{t} =h^k,nt+Σk,ntpap,k(yp,nZp,nt)σ2+Vp,nt,\displaystyle=\hat{h}_{k,n}^{t}+\Sigma_{k,n}^{t}\sum\nolimits_{p}\frac{a_{p,k}^{*}(y_{p,n}-Z_{p,n}^{t})}{\sigma^{2}+V_{p,n}^{t}}, (20b)

while Vp,ntV_{p,n}^{t} and Zp,ntZ_{p,n}^{t} are the message parameters updated at factor nodes \mathcal{F} as

Vp,nt=k|ap,k|2v^k,nt,\displaystyle V_{p,n}^{t}=\sum\nolimits_{k}|a_{p,k}|^{2}\hat{v}_{k,n}^{t}, (21a)
Zp,nt=kap,kh^k,ntVp,ntσ2+Vp,nt1(yp,nZp,nt1).\displaystyle Z_{p,n}^{t}=\sum\nolimits_{k}a_{p,k}\hat{h}_{k,n}^{t}-\frac{V_{p,n}^{t}}{\sigma^{2}+V_{p,n}^{t-1}}(y_{p,n}-Z_{p,n}^{t-1}). (21b)

where h^k,nt\hat{h}_{k,n}^{t} and v^k,nt\hat{v}_{k,n}^{t} are defined in (24). By combining (18) and (19), the marginal posterior distribution can be merged as

p(hk,n|𝐘)=(1πk,nt)δ(hk,n)+πk,nt𝒞𝒩(hk,n;ξk,nt,ζk,nt),\displaystyle p({h}_{k,n}|\mathbf{Y})=(1-\pi_{k,n}^{t})\,\delta({h}_{k,n})+\pi_{k,n}^{t}\,\mathcal{CN}(h_{k,n};\xi_{k,n}^{t},\zeta_{k,n}^{t}), (22)

where

ξk,nt=Σk,ntμk,n+Rk,ntγk,nΣk,nt+γk,n,ζk,nt=Σk,ntγk,nΣk,nt+γk,n,\displaystyle\xi_{k,n}^{t}=\frac{\Sigma_{k,n}^{t}\mu_{k,n}+R_{k,n}^{t}\gamma_{k,n}}{\Sigma_{k,n}^{t}+\gamma_{k,n}},\quad\zeta_{k,n}^{t}=\frac{\Sigma_{k,n}^{t}\gamma_{k,n}}{\Sigma_{k,n}^{t}+\gamma_{k,n}}, (23a)
πk,nt=λk,nλk,n+(1λk,n)exp(k,nt),\displaystyle\pi_{k,n}^{t}=\frac{\lambda_{k,n}}{\lambda_{k,n}+(1-\lambda_{k,n})\,{\rm exp}(-\mathcal{L}^{t}_{k,n})}, (23b)
k,nt=12lnΣk,ntΣk,nt+γk,n+|Rk,nt|22Σk,nt|Rk,ntμk,n|22(Σk,nt+γk,n).\displaystyle\mathcal{L}^{t}_{k,n}=\frac{1}{2}\ln\frac{\Sigma_{k,n}^{t}}{\Sigma_{k,n}^{t}+\gamma_{k,n}}+\frac{\left|R_{k,n}^{t}\right|^{2}}{2\Sigma_{k,n}^{t}}-\frac{\left|R_{k,n}^{t}-\mu_{k,n}\right|^{2}}{2(\Sigma_{k,n}^{t}+\gamma_{k,n})}. (23c)

Then, the posterior mean and variance can be explicitly calculated as

h^k,nt+1=𝔼{hk,n|Rk,nt,Σk,nt}=πk,ntξk,nt,\displaystyle\hat{h}_{k,n}^{t+1}=\mathbb{E}\{{h}_{k,n}|R_{k,n}^{t},\Sigma_{k,n}^{t}\}=\pi_{k,n}^{t}\xi_{k,n}^{t}, (24a)
v^k,nt+1=𝕍{hk,n|Rk,nt,Σk,nt}=πk,nt(|ξk,nt|2+ζk,nt)|h^k,nt+1|2.\displaystyle\hat{v}_{k,n}^{t+1}=\mathbb{V}\{{h}_{k,n}|R_{k,n}^{t},\Sigma_{k,n}^{t}\}=\pi_{k,n}^{t}\left(|\xi_{k,n}^{t}|^{2}+\zeta_{k,n}^{t}\right)-|\hat{h}_{k,n}^{t+1}|^{2}. (24b)

The message switching procedures mentioned in (20)-(24) constitute the fundamental steps of the AMP algorithm. However, several concerns arise when applying the AMP algorithm to practical systems: (a) The hyper-parameters in the prior assumption and the noise variance are often unknown in practice. (b) The column-wise independent solution to the channel estimation problem neglects the common support structure that arises from spatial and frequency correlations. (c) The AMP algorithm is primarily designed for large system limits and typically assumes the use of i.i.d. Gaussian sensing matrices. This assumption is unrealistic for the problem described in (13).

Input: Received reference signal 𝐘UL[m]{\mathbf{Y}}_{\rm UL}[m], sensing matrix 𝐀[m]\mathbf{A}[m], maximum iteration number TiterT_{\rm iter}, damping parameter ϵ\epsilon.
Output: Estimated channel matrix 𝐇^UL[m]\widehat{\mathbf{H}}_{\rm UL}[m], and the average noise variance σ^2¯[m]\overline{\hat{\sigma}^{2}}[m].
1Initialize hyper-parameters 𝜽1\boldsymbol{\theta}^{1} as [21] recommended in case of trapping in local extremum, and initialize other parameters Vp,n0[m]=1V_{p,n}^{0}[m]=1, Zp,n0[m]=yp,nZ_{p,n}^{0}[m]=y_{p,n}, h^k,n1[m]=μk,n1[m]\hat{h}_{k,n}^{1}[m]=\mu_{k,n}^{1}[m], v^k,n1[m]=γk,n1[m]\hat{v}_{k,n}^{1}[m]=\gamma_{k,n}^{1}[m]. Set iteration index t=1t=1.
2for 1tTiter1\leq t\leq T_{\rm iter} do
3       for 1mM1\leq m\leq M do
4             Update factor nodes Vp,nt[m]V_{p,n}^{t}[m], Zp,nt[m]Z_{p,n}^{t}[m] in line with (21) and implement the damping process: Vp,nt[m]ϵVp,nt1[m]+(1ϵ)Vp,nt[m]V_{p,n}^{t}[m]\leftarrow\epsilon V_{p,n}^{t-1}[m]+(1-\epsilon)V_{p,n}^{t}[m], Zp,nt[m]ϵZp,nt1[m]+(1ϵ)Zp,nt[m]Z_{p,n}^{t}[m]\leftarrow\epsilon Z_{p,n}^{t-1}[m]+(1-\epsilon)Z_{p,n}^{t}[m].
5            Update variable nodes Σk,nt[m]\Sigma_{k,n}^{t}[m],Rk,nt[m]R_{k,n}^{t}[m] according to (20), and the posterior mean and variance h^k,nt+1[m]\hat{h}_{k,n}^{t+1}[m], v^k,nt+1[m]\hat{v}_{k,n}^{t+1}[m] according to (24).
6       end for
7      Learn hyper parameters λk,nt+1[m]\lambda_{k,n}^{t+1}[m], μk,nt+1[m]\mu_{k,n}^{t+1}[m], γk,nt+1[m]\gamma_{k,n}^{t+1}[m], and (σp,n2)t+1[m](\sigma^{2}_{p,n})^{t+1}[m] in line with rules in (27), (26).
8      tt+1t\leftarrow t+1.
9 end for
10
Result: Return the estimated channel 𝐇^UL[m]=𝚽[m]𝐇^ULP[m]\widehat{\mathbf{H}}_{\rm UL}[m]=\boldsymbol{\Phi}[m]\widehat{\mathbf{H}}_{\rm UL}^{P}[m], and the average noise variance σ^2¯[m]=1PNUTp,n(σp,n2)Titer+1[m]\overline{\hat{\sigma}^{2}}[m]=\frac{1}{PN_{\rm UT}}\sum_{p,n}(\sigma^{2}_{p,n})^{T_{\rm iter}+1}[m].
Algorithm 1 AMP-EM-Based HFBS Channel Estimation

To overcome the first challenge, EM can be exploited to learn the unknown hyper-parameters 𝜽={λk,n,μk,n,γk,n,σ2,k,n}\boldsymbol{\theta}=\{\lambda_{k,n},\mu_{k,n},\gamma_{k,n},\sigma^{2},\forall k,n\} as [47]

Q(𝜽,𝜽t)\displaystyle Q(\boldsymbol{\theta},\boldsymbol{\theta}^{t}) =𝔼{lnp(𝐇,𝐘)|𝐘;𝜽t},\displaystyle=\mathbb{E}\left\{\ln p(\mathbf{H},\mathbf{Y})|\mathbf{Y};\boldsymbol{\theta}^{t}\right\}, (25a)
𝜽t+1\displaystyle\quad\quad\boldsymbol{\theta}^{t+1} =argmax𝜽Q(𝜽,𝜽t),\displaystyle=\mathop{\arg\max}\limits_{\boldsymbol{\theta}}\,Q(\boldsymbol{\theta},\boldsymbol{\theta}^{t}), (25b)

where 𝔼{|𝐘;𝜽t}\mathbb{E}\left\{\cdot|\mathbf{Y};\boldsymbol{\theta}^{t}\right\} denotes the expectation with respect to the posterior distribution p(𝐇|𝐘;𝜽t)p(\mathbf{H}|\mathbf{Y};\boldsymbol{\theta}^{t}), and its approximation is concluded in (22). By updating one of the elements in (25b) at a time and holding others constant, the learning rules of the hyper-parameters can be summarized as [19]

λk,nt+1=πk,nt,μk,nt+1=kπk,ntξk,ntkπk,nt,\displaystyle\lambda_{k,n}^{t+1}=\pi_{k,n}^{t},\quad\mu_{k,n}^{t+1}=\frac{\sum_{k}\pi_{k,n}^{t}\xi_{k,n}^{t}}{\sum_{k}\pi_{k,n}^{t}}, (26a)
γk,nt+1=kπk,nt(|μk,ntξk,nt|2+ζk,nt)kπk,nt,\displaystyle\gamma_{k,n}^{t+1}=\frac{\sum_{k}\pi_{k,n}^{t}(|\mu_{k,n}^{t}-\xi_{k,n}^{t}|^{2}+\zeta_{k,n}^{t})}{\sum_{k}\pi_{k,n}^{t}}, (26b)
(σp,n2)t+1=1Pp(|yp,nZp,nt|2|1+Vp,nt/(σk,n2)t|2+(σk,n2)tVp,nt(σk,n2)t+Vp,nt).\displaystyle(\sigma^{2}_{p,n})^{t+1}=\frac{1}{P}\sum\nolimits_{p}\left(\frac{|y_{p,n}-Z_{p,n}^{t}|^{2}}{\left|1+V_{p,n}^{t}/(\sigma^{2}_{k,n})^{t}\right|^{2}}+\frac{(\sigma^{2}_{k,n})^{t}V_{p,n}^{t}}{(\sigma^{2}_{k,n})^{t}+V_{p,n}^{t}}\right). (26c)

To further address the second concern aforementioned, it is reasonable to assign a common non-zero probability λk,n\lambda_{k,n} for each different columns and subcarriers of 𝐇\mathbf{H}, which indicates that the learning rule in (26a) and can be refined as

λk,1t+1[1]==λk,NUTt+1[M]=1MNUTn,mλk,nt+1[m].\displaystyle\lambda_{k,1}^{t+1}[1]=\cdots=\lambda_{k,N_{\rm UT}}^{t+1}[M]=\frac{1}{MN_{\rm UT}}\sum_{n,m}\lambda_{k,n}^{t+1}[m]. (27)

Moreover, the damping operation [48] could be introduced to prevent AMP from divergence due to the nonideal sensing matrix 𝐀[m]\mathbf{A}[m] and the medium size problem mentioned in the concern (c). To sum up, the overall procedure of the AMP-EM-based HFBS channel estimation algorithm is presented in Alg. 1.

IV Location Sensing

This section leverages the acquired CSI to achieve simultaneous location sensing of the scatterers and UTs without requiring additional radio resources. Specifically, we first extract the targets’ angle and distance information from the spherical electromagnetic wavefront based on the channel estimation results to sense the scatterers’ locations. Then, treating these sensed scatterers as VAs, the UT’s location can be coarsely estimated according to the propagation geometry. To further enhance the sensing performance, we utilize the estimated relative delay of MPCs to refine the UT’s and scatterers’ location through an iterative procedure.

IV-A VAs Location Mapping

Under the spherical wave assumption, the HFBS channel depends not only on the angle of the target, but also on its distance. Therefore, it is feasible to simultaneously derive the angle and distance features of the target from the channel state information. According to the analysis in Section III-B, under ideal conditions, the non-zero support set of sparse channel 𝐇ULP[m]\mathbf{H}_{\rm UL}^{P}[m] exactly reflects the approximate LoS components and the NLoS MPCs of the HFBS channel, and their angle and distance parameters are associated with the spatial spatial sampling grid defined in (9). In this way, we can map the location of the center of UT’s subarrays and the scatterers base on the channel estimation results.

One of the premise to achieve location mapping lies on accurate knowleadge about MPCs’ number. To this end, we estimate the effective MPCs’ number L=G+LL^{\prime}=G+L based on the minimum description length (MDL) principle, which selects the parametric probability model that fits the dataset best based on the maximum likelihood criterion, so as to estimate the number of signal sources without the need of empirical threshold [50].

Firstly, in order to overcome the rank-deficient problem of covariance matrix caused by coherent multipath signals, the covariance matrix of received pilot signals is calculated with the spatial smoothing technique [49] as

𝐑yyK[m]\displaystyle\mathbf{R}_{\rm yy}^{K}[m] =1K+1i=0K𝐈NUTK,i𝐑yy[m]𝐈NUTK,iT,\displaystyle=\frac{1}{K+1}\sum_{i=0}^{K}\mathbf{I}_{N_{\rm UT}-K,i}\mathbf{R}_{\rm yy}[m]\mathbf{I}_{N_{\rm UT}-K,i}^{\rm T}, (28)

where 𝐑yy[m]𝐘ULH[m]𝐘UL[m]\mathbf{R}_{\rm yy}[m]\approx{\mathbf{Y}}_{\rm UL}^{\rm H}[m]{\mathbf{Y}}_{\rm UL}[m] is the estimated covariance matrix of the original signal, while 𝐈NUTK,i=[𝟎(NUTK)×i𝐈NUTK 0(NUTK)×(Ki)](NUTK)×NUT\mathbf{I}_{N_{\rm UT}-K,i}=\left[\mathbf{0}_{(N_{\rm UT}-K)\times i}\;\mathbf{I}_{N_{\rm UT}-K}\;\mathbf{0}_{(N_{\rm UT}-K)\times(K-i)}\right]\in\mathbb{C}^{(N_{\rm UT}-K)\times N_{\rm UT}}. By the spectrum decomposition theorem, if there are LL^{\star} of the multipath, 𝐑yyK\mathbf{R}_{\rm yy}^{K} can be decomposed into

𝐑yyK=i=1L(λiσn2)𝐮i𝐮iH+σn2𝐈NUTK,\displaystyle\mathbf{R}_{\rm yy}^{K}=\sum_{i=1}^{L^{\star}}(\lambda_{i}-\sigma^{2}_{n})\mathbf{u}_{i}\mathbf{u}_{i}^{\rm H}+\sigma^{2}_{n}\mathbf{I}_{N_{\rm UT}-K}, (29)

where λi\lambda_{i} and 𝐮i\mathbf{u}_{i} are the eigenvalues and eigenvectors of 𝐑yyK\mathbf{R}_{\rm yy}^{K}, respectively. Defines a set of parameters for the received signal model𝜼={λ1,,λL,𝐮1,,𝐮L,σn2}\boldsymbol{\eta}=\left\{\lambda_{1},\dots,\lambda_{L^{\star}},\mathbf{u}_{1},\dots,\mathbf{u}_{L^{\star}},\sigma^{2}_{n}\right\}, the estimate of LL^{\star} can be obtained based on the MDL rule as

L^=argminL[logP(𝐘UL|𝜼)+γ(L)],\displaystyle\hat{L}^{\prime}=-\mathop{\arg\min}\limits_{L^{\star}}\ \left[\log P({\mathbf{Y}}_{\rm UL}|{\boldsymbol{\eta}})+\gamma(L^{\star})\right], (30)

where P(𝐘UL|𝜼)P({\mathbf{Y}}_{\rm UL}|{\boldsymbol{\eta}}) is the joint probability density distribution of the received pilot signals, while γ(L)=12L(2NUTLK)logNBS\gamma(L^{\star})=\frac{1}{2}L^{\star}(2N_{\rm UT}-L^{\star}-K)\log N_{\rm BS}. Under the assumption of complex Gaussian distribution, it can be further expressed as (31), which is shown at the top of the next page, where λ^i\hat{\lambda}_{i} is the estimate of eigenvalue by decomposing the estimated covariance matrix 𝐑yyK\mathbf{R}_{\rm yy}^{K}. By calculating the covariance matrix of the received pilot signal on different subcarriers, MM independent estimation results can be obtained in parallel. Finally, the fusion of different estimation results can be completed by the mode operation.

L^=argminL{NBS(NUTKL)log[1NUTKLi=L+1NUTλ^i(i=L+1NUTλ^i)1NUTKL]+γ(L)},\displaystyle\hat{L}^{\prime}=\mathop{\arg\min}\limits_{L^{\star}}\ \left\{N_{\rm BS}(N_{\rm UT}-K-L^{\star})\log\left[\frac{\frac{1}{N_{\rm UT}-K-L^{\star}}\sum_{i=L^{\star}+1}^{N_{\rm UT}}\hat{\lambda}_{i}}{\left(\prod_{i=L^{\star}+1}^{N_{\rm UT}}\hat{\lambda}_{i}\right)^{\frac{1}{N_{\rm UT}-K-L^{\star}}}}\right]+\gamma(L^{\star})\right\}, (31)

Refer to caption
Figure 4: Propagation geometry of the location sensing problem in the UM-MIMO system.

When the corresponding row energy of the estimated sparse channel is greater than the noise threshold, the MPCs can be considered to exist. Extract the angle and distance parameters of them

(Θ~,~)=argθΘ,r{h^ULP(θ,r)>NUTmσ^2¯[m]},\displaystyle(\tilde{\varTheta},\tilde{\mathcal{R}})=\mathop{\arg}\limits_{\theta\in{\varTheta},r\in{\mathcal{R}}}\left\{\widehat{h}_{\rm UL}^{P}(\theta,r)>N_{\rm UT}\sum_{m}\overline{\hat{\sigma}^{2}}[m]\right\}, (32)

where h^ULP(θ,r)=n=1NUTm=1M|{𝐇^ULP[m]}f(θ,r),n|2\hat{h}_{\rm UL}^{P}(\theta,r)=\sum_{n=1}^{N_{\rm UT}}\sum_{m=1}^{M}\left|\left\{\widehat{\mathbf{H}}_{\rm UL}^{P}[m]\right\}_{f(\theta,r),n}\right|^{2}. Unfortunately, due to the finite-resolution sampling lattice, the energy leakage is an inevitable negative effect. As a result, the grids set (Θ~,~)(\tilde{\varTheta},\tilde{\mathcal{R}}) exhibits a cluster distribution, where L^\hat{L}^{\prime} truthful peaks are surrouded by a few ghost points. In order to divide clusters before peak identification, K-Means clustering algorithm [52] can be exploited to select the spatial grids subset (Θ~l,~l)(\tilde{\varTheta}_{l},\tilde{\mathcal{R}}_{l}) for the l(1lL^)l\,(1\leq l\leq\hat{L}^{\prime})-th effective MPC. In this case, the angle and distance of ll-th MPC corresponding to the summit of the ll-th cluster grids can be represented as

(θ^lBS,r^lBS)=argmaxθΘ~l,r~lh^ULP(θ,r).\displaystyle(\hat{\theta}_{l}^{\rm BS},\hat{r}_{l}^{\rm BS})=\mathop{\arg\max}\limits_{\theta\in\tilde{\varTheta}_{l},r\in\tilde{\mathcal{R}}_{l}}\,\widehat{h}_{\rm UL}^{P}(\theta,r). (33)

Considering the location sensing geometry illustrated in Fig. 4, where the BS is located at the origin of coordinates, we can coarsely map the position of VAs with the orientation and distances measured above as (x^l,y^l)=(r^lBScosθ^lBS,r^lBSsinθ^lBS)(\hat{x}_{l},\hat{y}_{l})=(\hat{r}_{l}^{\rm BS}\cos\hat{\theta}_{l}^{\rm BS},\hat{r}_{l}^{\rm BS}\sin\hat{\theta}_{l}^{\rm BS}), where we treat the sensed UT subarray centers and scatterers equally as VAs, and they can contribute to UT location sensing hereinafter. In order to distinguish the sensed UT subarray centers from the scatterers, we adopt the following method: VAs satisfying the array aperture constraint are inferred as UT subarray centers, and their coordinate set 𝒢={l|1lL^,(x^lx^)2+(y^ly^)2DUT}\mathcal{G}=\{l|1\leq l\leq\hat{L^{\prime}},(\hat{x}_{l}-\hat{x}^{\star})^{2}+(\hat{y}_{l}-\hat{y}^{\star})^{2}\leq D_{\rm UT}\}, where (x^,y^)(\hat{x}^{\star},\hat{y}^{\star}) is the VAs with the maximum energy and thus can be truthfully inferred as LoS components, while DUTD_{\rm UT} is the antenna array aperture of UT. The remains are classified as scatterers and their coordinate set 𝒮={l|1lL^,l𝒢}\mathcal{S}=\{l|1\leq l\leq\hat{L}^{\prime},l\notin\mathcal{G}\}.

IV-B Coarse UT Location Sensing

In order to acquire the UT’s position with the aid of sensed VAs, one of the prerequisites lies in acquiring the orientation between the UT and VAs. For this purpose, motivated by the channel eigenvector decomposition in (2) and LoS approximation in (III-B), and the BS steering vector can be estimated as 𝐚BS(θlBS,rlBS,m)𝐚BS(θ^lBS,r^lBS,m)\mathbf{a}_{\rm BS}(\theta_{l}^{\rm BS},r_{l}^{\rm BS},m)\approx\mathbf{a}_{\rm BS}(\hat{\theta}_{l}^{\rm BS},\hat{r}_{l}^{\rm BS},m), thus the angle and distance can be estimated via matching pursuit as

(θ^lUT,r^lUT)=\displaystyle(\hat{\theta}_{l}^{\rm UT},\hat{r}_{l}^{\rm UT})=
argmaxθ,rmaxm{𝐇^ULP[m]}f(θ^lBS,r^lBS),:𝐚UT(θ,r,m)2.\displaystyle\mathop{\arg\max}\limits_{\theta,r}\mathop{\max}\limits_{m}{\left\|\left\{\widehat{\mathbf{H}}_{\rm UL}^{P}[m]\right\}_{\tiny f(\hat{\theta}_{l}^{\rm BS},\hat{r}_{l}^{\rm BS}),:}\,\mathbf{a}_{\rm UT}^{*}(\theta,r,m)\right\|^{2}}. (34)

It is noteworthy that since the array aperture of UT is much smaller than that of BS, the Rayleigh distance observed from UT could be relatively small so that the uncertainty of r^lUT\hat{r}_{l}^{\rm UT} aggravates. In this context, the geometric propagation model used for UT’s location sensing in Fig. 4 can be represented irrespective of rlUT{r}_{l}^{\rm UT} as

{φ=θlBS+θlUT,l𝒢,(yly)/(xlx)=tan(φθlUT),l𝒮,(yly)/(xlx)=tan(φπ/2),l𝒢,\displaystyle\begin{cases}\varphi=\theta_{l}^{\rm BS}+\theta_{l}^{\rm UT},l\in\mathcal{G},\\ (y_{l}-y)/(x_{l}-x)=\tan\,(\varphi-\theta_{l}^{\rm UT}),l\in\mathcal{S},\\ (y_{l}-y)/(x_{l}-x)=\tan\,(\varphi-\pi/2),l\in\mathcal{G},\end{cases} (35)

where (x,y)(x,y) is the location of UT to be determined, and φ\varphi is the inclination angle of UT’s array. Furthermore, it can be rewritten with a matrix form as

[yl𝒮tan(φθl𝒮UT)xl𝒮yl𝒢tan(φπ/2)xl𝒢]𝒴L×1=[tan(φθl𝒮UT)1tan(φπ/2)1]L×2[xy].\displaystyle\underbrace{\begin{bmatrix}{y}_{l\in\mathcal{S}}-\tan({\varphi}-\theta_{l\in\mathcal{S}}^{\rm UT}){x}_{l\in\mathcal{S}}\\ \vdots\\ {y}_{l\in\mathcal{G}}-\tan({\varphi}-\pi/2){x}_{l\in\mathcal{G}}\end{bmatrix}}_{\mathcal{Y}\in\mathbb{R}^{{L^{\prime}}\times 1}}=\underbrace{\begin{bmatrix}-\tan({\varphi}-{\theta}_{l\in\mathcal{S}}^{\rm UT})&1\\ \vdots&\vdots\\ -\tan({\varphi}-\pi/2)&1\end{bmatrix}}_{\mathcal{H}\in\mathbb{R}^{{L^{\prime}}\times 2}}\begin{bmatrix}{x}\\ {y}\end{bmatrix}. (36)

In view of the non-uniform sampling pattern defined in (9), the further the VAs locate, the sparser the corresponding sampling grid is. If the scatterer falls into a region with sparser spatial sampling grids, the location sensing error for those VAs would aggravate, and their reliability for UT localization would decline. Therefore, one viable solution to the above problem is the weighted least squares (w-LS) method [51], which introduces a weighted matrix assigning higher weights to VAs with smaller sampling interval and lower weights to VAs with larger sampling interval, thereby enhancing the accuracy of the estimation, i.e.,

[x^y^]T=(^T𝐖^)1^T𝐖𝒴^,\displaystyle\left[\hat{x}\;\hat{y}\right]^{\rm T}=\left({\hat{\mathcal{H}}^{\rm T}}\mathbf{W}\hat{\mathcal{H}}\right)^{\rm-1}\hat{\mathcal{H}}^{\rm T}\mathbf{W}\hat{\mathcal{Y}}, (37)

where 𝒴^\hat{\mathcal{Y}} and ^\hat{\mathcal{H}} are estimated by replacing the corresponding element with the sensing results in (33) respectively, while φ^\hat{\varphi} can be estimated as φ^=1|𝒢|cl𝒢(θlBS+θlUT)\hat{\varphi}=\frac{1}{|\mathcal{G}|_{c}}\sum_{l\in\mathcal{G}}\left(\theta_{l}^{\rm BS}+\theta_{l}^{\rm UT}\right). Moreover, the weighted matrix can be expressed as 𝐖=diag{1|r^1BS𝒩(θ^1BS,r^1BS)|2,,1|r^L^BS𝒩(θ^L^BS,r^L^BS)|2}\mathbf{W}={\rm diag}\left\{\frac{1}{|\hat{r}_{1}^{\rm BS}-\mathcal{N}(\hat{\theta}_{1}^{\rm BS},\hat{r}_{1}^{\rm BS})|^{2}},\dots,\frac{1}{|\hat{r}_{\hat{L^{\prime}}}^{\rm BS}-\mathcal{N}(\hat{\theta}_{\hat{L}^{\prime}}^{\rm BS},\hat{r}_{\hat{L}^{\prime}}^{\rm BS})|^{2}}\right\}, and 𝒩(θ,r)\mathcal{N}(\theta,r) denotes the distance parameter of nearest adjacent grid set in (9) corresponding to (θ,r)(\theta,r).

Remark 1

In fact, the above mentioned UT localization method can also be extended to blocked LoS case. In the absence of LoS component, compared with (35), the only difference for location sensing problem lies in the absence of information about LoS orientation (θ1BS(\theta_{1}^{\rm BS}, θ1UT)\theta_{1}^{\rm UT}), which does not yet affect its rigorous LS solution when the number of sensed VAs meets L^3\hat{L}^{\prime}\geq 3.

IV-C Sensing Result Refinement

The aforementioned coarse UT’s location sensing result only depends on the angle and distance information extracted from the spherical wavefront, which applies to both wideband and narrowband sensing tasks in sharp contrast to the traditional TDoA-based methods [51]. On that account, benefiting from the available bandwidth in the broadband mmWave systems, the accurately estimated MPCs’ TDoA information can be further utilized to refine the above location sensing result, whose implementation will be detailed next.

Similar to the angle and distance estimation in (IV-B), the relative delay of the ll-th multipath is firstly obtained by frequency-domain matching filter as

τ^l=1NUTn=1NUTargmaxτ[n]{𝐇^ULP[m]}f(θ^lBS,r^lBS),n,1:m𝐚f(τ[n])2,\displaystyle\hat{\tau}_{l}=\frac{1}{N_{\rm UT}}\sum_{n=1}^{N_{\rm UT}}\mathop{\arg\max}\limits_{\tau[n]}{\left\|\left\{\widehat{\mathbf{H}}_{\rm UL}^{P}[m]\right\}_{f(\hat{\theta}_{l}^{\rm BS},\hat{r}_{l}^{\rm BS}),n,1:m}\,\mathbf{a}_{f}^{*}(\tau[n])\right\|^{2}}, (38)

where 𝐚f(τ)=ej2πτ[f1,f2,,fM]TM×1\mathbf{a}_{f}(\tau)=e^{-j2\pi\tau[f_{1},f_{2},\dots,f_{M}]^{\rm T}}\in\mathbb{C}^{M\times 1} represents the frequency-domain channel response parameterized by relative channel delay τ\tau, while f1,f2,,fMf_{1},f_{2},\dots,f_{M} are the frequencies of the first, second and MM subcarriers, respectively. Considering the different geographical locations of transceivers and the limited clock synchronization accuracy, it is difficult to measure the absolute delay of MPCs, and thus it is more effective to exploit the TDoA of MPCs

τ^lTDoA=τ^ll𝒢τ^l|𝒢|c,l𝒮.\displaystyle\hat{\tau}_{l}^{\rm TDoA}=\hat{\tau}_{l}-\frac{\sum_{l^{\prime}\in\mathcal{G}}\hat{\tau}_{l^{\prime}}}{|\mathcal{G}|_{c}},l\in\mathcal{S}. (39)

In addition, the TDoA of MPCs can also be calculated according to the location sensing results

τ¯lTDoA=x^l2+y^l2+(x^x^l)2+(y^y^l)2x^2+y^2c,\displaystyle\bar{\tau}_{l}^{\rm TDoA}=\frac{\sqrt{\hat{x}_{l}^{2}+\hat{y}_{l}^{2}}+\sqrt{(\hat{x}-\hat{x}_{l})^{2}+(\hat{y}-\hat{y}_{l})^{2}}-\sqrt{\hat{x}^{2}+\hat{y}^{2}}}{c}, (40)

which is the function of coordinates (x^,y^),(x^l,y^l)(\hat{x},\hat{y}),(\hat{x}_{l},\hat{y}_{l}). Owing to the location sensing errors, there exists difference between the calculated TDoA of MPCs and the real TDoA of MPCs, and thus define the loss function as

(x^,y^)=l𝒮|τ^lTDoAτ¯lTDoA(x^,y^)|2,\displaystyle\mathcal{L}(\hat{x},\hat{y})=\sum_{l\in\mathcal{S}}\left|\hat{\tau}_{l}^{\rm TDoA}-\bar{\tau}_{l}^{\rm TDoA}(\hat{x},\hat{y})\right|^{2}, (41)

Following the maximum likelihood principle, the optimal UT and scatterers coordinates tend to minimize the above loss function, but it is intractable to directly derive its optimal closed-form solution. To this end, a feasible solution is to use gradient descent algorithm iteratively update UT and scatterers coordinates (x^,y^)(\hat{x},\hat{y}) and (x^l,y^l)l𝒮(\hat{x}_{l},\hat{y}_{l})_{l\in\mathcal{S}}[9]. Therefore, in the tt-th iteration, one of the parameters in the coordinates is updated each time, while the other parameters are hold unchanged. For instance, the update of the UT’s coordinate can be represented as

x^t=x^t1Δx(x^,y^t1)x^|x^=x^t1,\displaystyle\hat{x}^{t}=\hat{x}^{t-1}-\Delta_{x}\cdot\left.\frac{\partial\,\mathcal{L}(\hat{x},\hat{y}^{t-1})}{\partial\,\hat{x}}\right|_{\hat{x}=\hat{x}^{t-1}}, (42a)
y^t=y^t1Δy(x^t1,y^)y^|y^=y^t1.\displaystyle\hat{y}^{t}=\hat{y}^{t-1}-\Delta_{y}\cdot\left.\frac{\partial\,\mathcal{L}(\hat{x}^{t-1},\hat{y})}{\partial\,\hat{y}}\right|_{\hat{y}=\hat{y}^{t-1}}. (42b)

To guarantee the convergence of the loss function, an effective approach is Armijo-Goldstein backtracking line search [55] to determine the update step size Δx\Delta_{x} and Δy\Delta_{y}. Besides, the location of scatterers can be updated similar to (42). In this case, with a small number of iterations, the refined coordinates are inclined to converge to the appoximately optimal solution. So far, we have finished the discussion about the proposed location sensing scheme for UM-MIMO systems, and its complete procedures are summarized in Alg. 2.

In summary, our approach enhances localization accuracy by leveraging the TDOA technique with multiple scatterers as virtual anchors. By calculating the time differences of arrival from these scatterers to the user, and incorporating the scatterers’ known positions obtained through Alg. 2, our method refines the localization results. This supplementary use of TDOA, despite the presence of only a single BS, effectively enhances the robustness of the localization in environments with significant multipath effects.

Input: Received reference signal 𝐘UL[m]{\mathbf{Y}}_{\rm UL}[m], estimated sparse channel matrix 𝐇^ULP[m]\widehat{\mathbf{H}}_{\rm UL}^{P}[m], the average noise variance σ^2¯[m]\overline{\hat{\sigma}^{2}}[m], and the maximum refinement iteration number TgrdT_{\rm grd}.
1
Output: The estimated UT and scatterers’ position (x¯,y¯)(\bar{x},\bar{y}) and (x¯l,y¯l),l𝒮(\bar{x}_{l},\bar{y}_{l}),l\in\mathcal{S}.
2
3Initialize to estimate the number of MPCs L^\hat{L}^{\prime} in accordance with (28)-(30).
4VAs Location Mapping:
5Identify effective scatterers grids set (Θ~,~)(\tilde{\varTheta},\tilde{\mathcal{R}}) following (32).
6Utilize K-Means algorithm to cluster scatteres grids subset (Θ~l,~l)(\tilde{\varTheta}_{l},\tilde{\mathcal{R}}_{l}) for the l(1lL^)l\,(1\leq l\leq\hat{L}^{\prime})-th MPC.
7Identify the truthful VAs coordinates (θ^lBS,r^lBS)(\hat{\theta}_{l}^{\rm BS},\hat{r}_{l}^{\rm BS}) in (33), and distinguish scatterers and subarray center set 𝒮\mathcal{S} and 𝒢\mathcal{G}.
8Coarse UT Location Sensing:
9Estimate AoDs θ^lUT\hat{\theta}_{l}^{\rm UT} for identified VAs following (IV-B).
10Sense UT coarse location (x^,y^)(\hat{x},\hat{y}) exploiting VAs in (37).
11Sensing Results Refinement:
12Measure TDoA for identified MPCs through (38)-(39).
13for 1tTgrd1\leq t\leq T_{\rm grd} do
14      
15      Calculate the gradient and Armijo-Goldstein backtracking line search step size Δx\Delta_{x} and Δy\Delta_{y}.
16      Update the coordinates of UT and scatterers by (42).
17 end for
18
Result: The refined UT and scatterers’ positions (x¯,y¯)=(x¯Tgrd,y¯Tgrd)({\bar{x}},{\bar{y}})=(\bar{x}^{T_{\rm grd}},\bar{y}^{T_{\rm grd}}) and (x¯l,y¯l)=(x¯lTgrd,y^lTgrd),l𝒮({\bar{x}}_{l},{\bar{y}}_{l})=(\bar{x}^{T_{\rm grd}}_{l},\hat{y}^{T_{\rm grd}}_{l}),l\in\mathcal{S}.
Algorithm 2 Proposed Location Sensing Scheme

V Hybrid Beamforming Design for Downlink Data Transmission

To fully unleash the potential of UM-MIMO for more efficient data transmission, this section designs a beamforming scheme attempting to mitigate the spectral efficiency degradation caused by HFBS issue.

V-A Analog Codebook Design

For the downlink data transmission stage, the received baseband data 𝐲DL[m]Nsd{\mathbf{y}}_{\rm DL}[m]\in\mathbb{C}^{N_{s}^{d}} at the UT on the mm-th subcarrier can be formulated as the following problem similar to (II-A) as222It is assumed that the BS serves the different UTs using orthogonal time-frequency resources, and thus the beamforming design is optimized for a single specific UT to disclose the impact of HFBS issue.

𝐲DL[m]=𝐖DLH[m]𝐇DL[m]𝐅DL[m]𝐬[m]+𝐖DLH[m]𝐧[m],\displaystyle\mathbf{y}_{\rm DL}[m]=\mathbf{W}^{\rm H}_{\rm DL}[m]\mathbf{H}_{\rm DL}[m]\mathbf{F}_{\rm DL}[m]\mathbf{s}[m]+\mathbf{W}^{\rm H}_{\rm DL}[m]\mathbf{n}[m], (43)

Different from the parameters defined in (II-A), 𝐅DL[m]\mathbf{F}_{\rm DL}[m] represents the BS’s transmit beamforming matrix 𝐅DL[m]=𝐅DLRF𝐅DLBB[m]NBS×Nsd\mathbf{F}_{\rm DL}[m]=\mathbf{F}^{\rm RF}_{\rm DL}\mathbf{F}^{\rm BB}_{\rm DL}[m]\in\mathbb{C}^{N_{\rm BS}\times N_{s}^{d}} with the power constraint 𝐅DL[m]F2PtDL/M\|\mathbf{F}_{\rm DL}[m]\|_{F}^{2}\leq P_{t}^{\rm DL}/M, in which 𝐅DLRFNBS×NBSRF\mathbf{F}^{\rm RF}_{\rm DL}\in\mathbb{C}^{N_{\rm BS}\times N_{\rm BS}^{\rm RF}} and 𝐅DLBB[m]NBSRF×Nsd\mathbf{F}^{\rm BB}_{\rm DL}[m]\in\mathbb{C}^{N_{\rm BS}^{\rm RF}\times N_{s}^{d}} are the analog and digital beamforming matrices, respectively, and NsdN_{s}^{d} is the number of simultaneous transmitted data streams. Similarly, 𝐖DL[m]\mathbf{W}_{\rm DL}[m] denotes the UT’s receive combining matrix 𝐖DL[m]=𝐖DLRF𝐖DLBB[m]NUT×Nsd\mathbf{W}_{\rm DL}[m]=\mathbf{W}^{\rm RF}_{\rm DL}\mathbf{W}^{\rm BB}_{\rm DL}[m]\in\mathbb{C}^{N_{\rm UT}\times N_{s}^{d}}, where 𝐖DLRFNUT×NUTRF\mathbf{W}^{\rm RF}_{\rm DL}\in\mathbb{C}^{N_{\rm UT}\times N_{\rm UT}^{\rm RF}} and 𝐖DLBB[m]NUTRF×Nsd\mathbf{W}^{\rm BB}_{\rm DL}[m]\in\mathbb{C}^{N_{\rm UT}^{\rm RF}\times N_{s}^{d}} are the analog and digital combining matrices, while 𝐬[m]\mathbf{s}[m] is the payload data stream. In the data transmission stage, we seek to optimize the precoder and combiner that maximizes the beamforming gain over the whole bandwidth and enhance the spectral efficiency (SE) expressed in (44) shown at the top of the next page, where 𝐑n[m]=σ2𝐖DLH[m]𝐖DL[m]\mathbf{R}_{n}[m]=\sigma^{2}\mathbf{W}^{\rm H}_{\rm DL}[m]\mathbf{W}_{\rm DL}[m] is the noise covariance matrix after combining.

R=1Mm=1Mlog2(|𝐈Nsd+𝐑n1[m]𝐖DLH[m]𝐇DL[m]𝐅DL[m]𝐅DLH[m]𝐇DLH[m]𝐖DL[m]|)\displaystyle R=\frac{1}{M}\sum_{m=1}^{M}\log_{2}\left(\left|\mathbf{I}_{N_{s}^{d}}+\mathbf{R}_{n}^{-1}[m]\mathbf{W}^{\rm H}_{\rm DL}[m]\mathbf{H}_{\rm DL}[m]\mathbf{F}_{\rm DL}[m]\mathbf{F}_{\rm DL}^{\rm H}[m]\mathbf{H}_{\rm DL}^{\rm H}[m]\mathbf{W}_{\rm DL}[m]\right|\right) (44)
g0(r,θ,m)=1NBS|n=1NBSej(2πλmδn2d2cos2θ2r2πλcδn2d2cos2θ02r0)j(2πλmδndsinθ2πλcδndsinθ0)|\displaystyle g_{0}(r,\theta,m)=\frac{1}{N_{\rm BS}}\left|\sum_{n=1}^{N_{\rm BS}}e^{j\left(\frac{2\pi}{\lambda_{m}}\frac{\delta_{n}^{2}d^{2}\cos^{2}\theta}{2r}-\frac{2\pi}{\lambda_{c}}\frac{\delta_{n}^{2}d^{2}\cos^{2}\theta_{0}}{2r_{0}}\right)-j\left(\frac{2\pi}{\lambda_{m}}\delta_{n}d\sin\theta-\frac{2\pi}{\lambda_{c}}\delta_{n}d\sin\theta_{0}\right)}\right| (47)

Overcoming beam squint requires that each RF phase shifter generate different phase shift across different subcarriers to form frequency-dispered beamforming vectors. However, the RF phase shifter can only generate the frequency-independent phase shift, greatly limiting its beamforming ability especially in the presence of beam-squint effect. In this case, traditional analog narrow beams design approach that focus on optimizing for center-frequency suffers from serious performance degradation. Therefore, elaborate design of the analog beams codebook is pivotal to an efficient beamforming scheme for hybrid UM-MIMO systems in the presence of HFBS effect.

Without loss of generality, we assume that the reference frequency for designing the analog phase shifts equals the center frequency fcf_{c}. Particularly, in the case of NUT=NBSRF=1N_{\rm UT}=N_{\rm BS}^{\rm RF}=1, when the beamforming 𝐟DLRF(r0,θ0)\mathbf{f}_{\rm DL}^{\rm RF}(r_{0},\theta_{0}) is designed to focus on the postion (r0,θ0)(r_{0},\theta_{0}) for the reference frequency, then 𝐟DLRF(r0,θ0)=1NBSej2πλc[r0,1,r0,2,,r0,NBS]T\mathbf{f}_{\rm DL}^{\rm RF}(r_{0},\theta_{0})=\frac{1}{\sqrt{N_{\rm BS}}}e^{-j\frac{2\pi}{\lambda_{c}}[r_{0,1},r_{0,2},\dots,r_{0,N_{\rm BS}}]^{\rm T}}, where r0,n=r02+δn2d22r0δndsinθ0r_{0,n}=\sqrt{r_{0}^{2}+\delta^{2}_{n}d^{2}-2r_{0}\delta_{n}d\sin\theta_{0}}. In this case, the beamforming gain at the given position (r,θ)(r,\theta) on the mm-th subcarrier can be computed as

g0(r,θ,m)\displaystyle g_{0}(r,\theta,m) =|𝐚BSH(r,θ,m)𝐟DLRF(r0,θ0)|,\displaystyle=\left|\mathbf{a}_{\rm BS}^{\rm H}(r,\theta,m)\mathbf{f}_{\rm DL}^{\rm RF}(r_{0},\theta_{0})\right|,
=1NBS|n=1NBSej2πλmrnj2πλcr0,n|.\displaystyle=\frac{1}{N_{\rm BS}}\left|\sum_{n=1}^{N_{\rm BS}}e^{j\frac{2\pi}{\lambda_{m}}r_{n}-j\frac{2\pi}{\lambda_{c}}r_{0,n}}\right|. (46)

By means of Fresnel approximation mentioned in (III-B), (V-A) can be further derived as (47), which is shown at the top of the next page.

Refer to caption
(a) Trajectory of beam-squint positions across different subcarriers.
Refer to caption
(b) Beamforming gain variation versus angles at the optimal distance g0(r~m,θ,m)g_{0}(\tilde{r}_{m},\theta,m).
Refer to caption
(c) Beamforming gain variation versus distances at the optimal angle g0(r,θ~m,m)g_{0}(r,\tilde{\theta}_{m},m).
Figure 5: Illustration of hybrid-field beam-squint effect, where the system parameters are fc=47GHzf_{c}=47\rm GHz, BW=5GHz\text{BW}=5\text{GHz}, NBS=512N_{\rm BS}=512, NUT=NBSRF=1N_{\rm UT}=N_{\rm BS}^{\rm RF}=1, and M=8M=8 without regard to large-scale path loss for easy demonstration. The transmit beamforming is set to focus on the location (r,θ)=(7.01m,46.6)(r,\theta)=(7.01m,46.6^{\circ}) at the center frequency.

Denote the beam-squint position that maximize the beamforming gain on the mm-th subcarrier as (r~m,θ~m)(\tilde{r}_{m},\tilde{\theta}_{m}), which can be expressed as [11]

sinθ~m\displaystyle\sin\tilde{\theta}_{m} =sinθ0λmλc,\displaystyle=\sin\theta_{0}\cdot\frac{\lambda_{m}}{\lambda_{c}}, (48a)
r~m\displaystyle\tilde{r}_{m} =r0λcλm1sin2θ~m1sin2θ0.\displaystyle=r_{0}\cdot\frac{\lambda_{c}}{\lambda_{m}}\cdot\frac{1-\sin^{2}\tilde{\theta}_{m}}{1-\sin^{2}\theta_{0}}. (48b)

(48) indicates that distinct from the far-field propagation, HFBS effct includes angle squint effect and distance squint effect, respectively. Fig. 5(a) illustrates the beamforming gain distribution g0(r,θ,m)g_{0}(r,\theta,m) with different positions and subcarriers (r,θ,m)(r,\theta,m) via a heatmap, and the beam-squint trajectory of (r~m,θ~m)(\tilde{r}_{m},\tilde{\theta}_{m}) plotted by a red-star line. To further provide insights into analog beamforming design, we investigate to decouple the variation of beamforming gain along distances or angles, i.e., g0(r~m,θ,m)g_{0}(\tilde{r}_{m},\theta,m) and g0(r,θ~m,m)g_{0}(r,\tilde{\theta}_{m},m), respectively, and disclose the sepreate effects of angle squint and distance squint. Substitute the beam-squint positions derived in (48), g0(r~m,θ,m)g_{0}(\tilde{r}_{m},\theta,m) and g0(r,θ~m,m)g_{0}(r,\tilde{\theta}_{m},m) can be expressed as [25]

g0(r~m,θ,m)\displaystyle g_{0}(\tilde{r}_{m},\theta,m) =|1NBSn=1NBSej(2πλmδndsinθ2πλcδndsinθ0)|\displaystyle=\left|\frac{1}{N_{\rm BS}}\sum_{n=1}^{N_{\rm BS}}e^{-j\left(\frac{2\pi}{\lambda_{m}}\delta_{n}d\sin\theta-\frac{2\pi}{\lambda_{c}}\delta_{n}d\sin\theta_{0}\right)}\right|
=|sin(πdNBSλmsinθπdNBSλcsinθ0)NBSsin(πdλmsinθπdλcsinθ0)|,\displaystyle=\left|\frac{\sin\left(\frac{\pi dN_{\rm BS}}{\lambda_{m}}\sin\theta-\frac{\pi dN_{\rm BS}}{\lambda_{c}}\sin\theta_{0}\right)}{N_{\rm BS}\sin\left(\frac{\pi d}{\lambda_{m}}\sin\theta-\frac{\pi d}{\lambda_{c}}\sin\theta_{0}\right)}\right|, (49a)
g0(r,θ~m,m)\displaystyle g_{0}(r,\tilde{\theta}_{m},m) =|1NBSn=1NBSej(2πλmδn2d2cos2θ~m2r2πλcδn2d2cos2θ02r0)|\displaystyle=\left|\frac{1}{N_{\rm BS}}\sum_{n=1}^{N_{\rm BS}}e^{j\left(\frac{2\pi}{\lambda_{m}}\frac{\delta_{n}^{2}d^{2}\cos^{2}\tilde{\theta}_{m}}{2r}-\frac{2\pi}{\lambda_{c}}\frac{\delta_{n}^{2}d^{2}\cos^{2}\theta_{0}}{2r_{0}}\right)}\right|
|1NBSNBS/2NBS/2ejπn2(d2cos2θ~mλmrd2cos2θ0λcr0)dn|,\displaystyle\approx\left|\frac{1}{N_{\rm BS}}\int_{-N_{\rm BS}/2}^{N_{\rm BS}/2}e^{j\pi n^{2}\left(\frac{d^{2}\cos^{2}\tilde{\theta}_{m}}{\lambda_{m}r}-\frac{d^{2}\cos^{2}\theta_{0}}{\lambda_{c}r_{0}}\,\right)}{\rm d}n\right|, (49b)

which are displayed in Fig. 5(b) and Fig. 5(c), respectively. On the one hand, it can be observed that the angle squint effect causes the beamforming gain of non-central subcarriers w.r.t. position (r0,θ0)(r_{0},\theta_{0}) to fall out of the main lobe and attenuate substantially. On the other hand, the distance squint effect also leads the received energy associated with non-central subcarriers on position (r0,θ0)(r_{0},\theta_{0}) to decline by half.

Besides, according to the conclusion derived from (48), to enable the generated beam to focus on the expected location (r0,θ0)(r_{0},\theta_{0}) for the mm-th subcarrier, 𝐟DLRF(rm,θm)\mathbf{f}_{\rm DL}^{\rm RF}(r_{m}^{\star},\theta_{m}^{\star}) should meet

sinθm\displaystyle\sin\theta_{m}^{\star} =sinθ0λcλm,\displaystyle=\sin\theta_{0}\cdot\frac{\lambda_{c}}{\lambda_{m}}, (50a)
1rm\displaystyle\frac{1}{r_{m}^{\star}} =1r0λcλm1sin2θm1sin2θ0.\displaystyle=\frac{1}{r_{0}}\cdot\frac{\lambda_{c}}{\lambda_{m}}\cdot\frac{1-\sin^{2}\theta_{m}^{\star}}{1-\sin^{2}\theta_{0}}. (50b)

Therefore, to provide constant beamforming gain as much as possible over the whole bandwidth and maximize SE, one intuitive compromise is to generate beams 𝐛(r0,θ0)\mathbf{b}(r_{0},\theta_{0}) covering all (rm,θm),1mM(r_{m}^{\star},\theta_{m}^{\star}),1\leq m\leq M so that the traditional narrow beams are required to be broadened along both the distance and angle at the reference frequency, with the maximum angle and distance intervals Δsinθ0=|sinθMsinθ1|\Delta\sin\theta_{0}=|\sin\theta_{M}^{\star}-\sin\theta_{1}^{\star}| and Δ1r0=|1rM1r1|\Delta\frac{1}{r_{0}}=|\frac{1}{r_{M}^{\star}}-\frac{1}{r_{1}^{\star}}|, respectively.

The generation of beam 𝐛(r0,θ0)\mathbf{b}(r_{0},\theta_{0}) can employ a subarray based beam aggregation method [29], which divides the antenna array into G=NBSNsubG=\lceil\frac{N_{\rm BS}}{N_{\rm sub}}\rceil subarrays and each with NsubN_{\rm sub} elements, and synthesizes the sub-beams generated by different subarrays. Specifically, according to the array signal processing theory, the 3dB beam width of an NsubN_{\rm sub}-elements subarray is Δsubsinθ=1Nsub\Delta_{\rm sub}\sin\theta=\frac{1}{N_{\rm sub}} and Δsub1r=1.556Nsub2λccos2θ0\Delta_{\rm sub}\frac{1}{r}=\frac{1.556}{N_{\rm sub}^{2}\lambda_{c}\cos^{2}\theta_{0}} along the normalized angle and distance domains, respectively [56]. In line with the requirements for beam broadening region derived in (50), the number of subarrays GG should satisfy

1NsubGΔsinθ0,\displaystyle\frac{1}{N_{\rm sub}}\cdot G\geq\Delta\sin\theta_{0}, (51a)
1.556Nsub2λccos2θ0GΔ1r0.\displaystyle\frac{1.556}{N_{\rm sub}^{2}\lambda_{c}\cos^{2}\theta_{0}}\cdot G\geq\Delta\frac{1}{r_{0}}. (51b)

And the beamforming vector 𝐛(r0,θ0)\mathbf{b}(r_{0},\theta_{0}) can be generated subarray-wisely for the gg-th virtual subarray as

[𝐛(r0,θ0)](g1)Nsub+n=\displaystyle{\left[\mathbf{b}(r_{0},\theta_{0})\right]_{(g-1)N_{\rm sub}+n}=}
1NBSej2πλc[δ[(g1)Nsub+n]sinθg+δ[(g1)Nsub+n]2d2cos2θv2rg],\displaystyle{\frac{1}{\sqrt{N_{\rm BS}}}e^{-j\frac{2\pi}{\lambda_{c}}\left[-\delta_{[(g-1)N_{\rm sub}+n]}\sin\theta_{g}+\frac{\delta_{[(g-1)N_{\rm sub}+n]}^{2}d^{2}\cos^{2}\theta_{v}}{2r_{g}}\right]},} (52)

where sinθg=sinθ0+2g12Nsub\sin\theta_{g}=\sin\theta_{0}+\frac{2g-1}{2N_{\rm sub}} and 1rg=1r0+0.778(2g1)Nsub2λccos2θ0\frac{1}{r_{g}}=\frac{1}{r_{0}}+\frac{0.778(2g-1)}{N_{\rm sub}^{2}\lambda_{c}\cos^{2}\theta_{0}}. In this problem, in order to maximize the beam gain and improve SE, the beam energy should be focused on UT or the scatterers as much as possible. Therefore, the analog beamforming codebook used to optimize RF phase shifts can be constructed as 𝐁={𝐛(r¯BS,θ¯BS)}{𝐛(r¯lBS,θ¯lBS)|l𝒮}{𝐛(r^lBS,θ^lBS)|l𝒢}NBS×(L^+1)\mathbf{B}=\left\{\mathbf{b}(\bar{r}^{\rm BS},\bar{\theta}^{\rm BS})\right\}\cup\left\{\mathbf{b}(\bar{r}_{l}^{\rm BS},\bar{\theta}_{l}^{\rm BS})|\,l\in\mathcal{S}\right\}\cup\left\{\mathbf{b}(\hat{r}_{l}^{\rm BS},\hat{\theta}_{l}^{\rm BS})|\,l\in\mathcal{G}\right\}\in\mathbb{C}^{N_{\rm BS}\times(\hat{L}^{\prime}+1)}, which collects all UT’s estimated location, scatterers’ location, and the estimated UT’s subarray center, while r¯BS=x¯2+y¯2\bar{r}^{\rm BS}=\sqrt{\bar{x}^{2}+\bar{y}^{2}}, θ¯BS=arctan(y¯/x¯)\bar{\theta}^{\rm BS}=\text{arctan}(\bar{y}/\bar{x}) and r¯lBS=x¯l2+y¯l2\bar{r}_{l}^{\rm BS}=\sqrt{\bar{x}_{l}^{2}+\bar{y}_{l}^{2}}, θ¯lBS=arctan(y¯l/x¯l)\bar{\theta}_{l}^{\rm BS}=\text{arctan}(\bar{y}_{l}/\bar{x}_{l}), respectively.

V-B Hybrid Beamforming Design with Designed Codebook

Thanks to the mmWave UM-MIMO channel sparsity, the designed analog beamforming codebook is capable of approximately spaning the column space of the unconstrained full-digital beamforming matrix 𝐅DLopt[m]\mathbf{F}_{\rm DL}^{\rm opt}[m] on the m-th subcarrier for 1mM1\leq m\leq M, where 𝐅DLopt[m]\mathbf{F}_{\rm DL}^{\rm opt}[m] composes of the first NsdN_{s}^{d} columns of the right singular matrix of 𝐇DL[m]\mathbf{H}_{\rm DL}[m] [41]. Therefore, the hybrid beamforming design problem under the HFBS effect is equivalent to selecting the NBSRFN_{\rm BS}^{\rm RF} best-matched RF beamforming vectors and finding their optimal baseband combination, which can be concretely expressed as

(𝚲~,𝐅~DLBB[m])\displaystyle\left(\tilde{\boldsymbol{\Lambda}},\tilde{\mathbf{F}}_{\rm DL}^{\rm BB}[m]\right) =argmin𝚲,𝐅DLBB[m]m=1M𝐅DLopt[m]𝐁𝚲𝐅DLBB[m]F,\displaystyle=\mathop{\arg\min}_{\boldsymbol{\Lambda},\mathbf{F}_{\rm DL}^{\rm BB}[m]}\,\sum_{m=1}^{M}\|\mathbf{F}_{\rm DL}^{\rm opt}[m]-\mathbf{B}\boldsymbol{\Lambda}\mathbf{F}_{\rm DL}^{\rm BB}[m]\|_{F},
s.t.diag(𝚲𝚲H)0=NBSRF,\displaystyle{\rm s.t.}\quad\|{\rm diag}\left(\boldsymbol{\Lambda}\boldsymbol{\Lambda}^{\rm H}\right)\|_{0}=N_{\rm BS}^{\rm RF},
𝐁𝚲𝐅DLBB[m]F2PtDL/M,\displaystyle\quad\quad\;\|\mathbf{B}\boldsymbol{\Lambda}\mathbf{F}_{\rm DL}^{\rm BB}[m]\|_{F}^{2}\leq P_{t}^{\rm DL}/M, (53)

where 𝚲\boldsymbol{\Lambda} is a selection matrix used to select beamforming vectors from the codebook 𝐁\mathbf{B} for constructing the analog beamformer as 𝐅DLRF=𝐁𝚲\mathbf{F}_{\rm DL}^{\rm RF}=\mathbf{B}\boldsymbol{\Lambda}. Considering the sparsity of selection matrix 𝚲\boldsymbol{\Lambda}, the resulting problem formulation is identical to the optimization problem of sparse signal recovery. Therefore, we straightforwardly exploit the method based on the classic concept of SOMP [41] to obatain the solution of (𝐅DLRF~,𝐅~DLBB[m]),1mM\left(\tilde{\mathbf{F}_{\rm DL}^{\rm RF}},\tilde{\mathbf{F}}_{\rm DL}^{\rm BB}[m]\right),1\leq m\leq M. For the receiver at the UT, the optimal full-digital combiner at each subcarrier is the MMSE combiner 𝐖DLMMSE[m]\mathbf{W}_{\rm DL}^{\rm MMSE}[m] in the absence of any hardware restrictions [41]. And the hybrid combiner design (𝐖DLRF~,𝐖~DLBB[m]),1mM\left(\tilde{\mathbf{W}_{\rm DL}^{\rm RF}},\tilde{\mathbf{W}}_{\rm DL}^{\rm BB}[m]\right),1\leq m\leq M at the receiver can be acquired in a similar way to approximate 𝐖DLMMSE[m]\mathbf{W}_{\rm DL}^{\rm MMSE}[m].

VI Simulation Experiments

This section conducts extensive simulation experiments to validate the superiority of our proposed ILSC schemes and compares theri performance with the counterparts in previous literature.

VI-A Simulation Setup

First of all, we present the main system parameters used in simulation experiments in TABLE II 333 Tens up to a hundred GHz will be obtained in THz band for 6G and beyond era[60]. Although the current 5G NR does not have continuous 5 GHz bandwidth, this parameter setting is used to verify the effectiveness of our proposed ILSC scheme, which can be applied to address the HFBS problem in future networks. unless stated otherwise. In addition, the noise power spectrum density at the receivers is N0=174dBm/HzN_{0}=-174\,\rm{dBm/Hz}, and the power of AWGN is accordingly calculated as σ2=N0×BW=77dBm\sigma^{2}=N_{0}\times\rm{BW}=-77\,\rm{dBm}. The UT’s and scatterers’ position can be simulated in line with the angle and distance distribution given in TABLE II. For the MPCs resulted from scatterers, we employ the hybrid-field channel model in (2) to generate them. Here, we only count single-hop paths and assume that the small-scale channel gain σα2=1\sigma_{\alpha}^{2}=1. The large-scale path loss coefficients of MPCs are determined according to the empirical model provided in Table 7.4.1-1 of [57]. In the uplink, the transmit power of UT PtULP_{t}^{\rm UL} ensures the received signal-to-noise-ratio (SNR) to meet 30dB30\,\rm dB. It is important to note that our study focuses on indoor and short-range localization and communication scenarios, rather than UMa environments. Thus, the simulation parameters are chosen to reflect these indoor settings.

TABLE II:
Parameter Value
Number of BS antenna NBSN_{\rm BS} 512
Number of UT antenna NUTN_{\rm UT} 32
Number of BS RF chains NBSRFN_{\rm BS}^{\rm RF} 4
Number of UT RF chains NUTRFN_{\rm UT}^{\rm RF} 4
Carrier frequency fcf_{c} 47GHz47\,\rm GHz
System bandwidth BW{\rm BW} 5GHz5\,\rm GHz
Number of subcarriers MM 64
Maximum mumber of MPCs LL 6
Scatterer angle θlBS\theta_{l}^{\rm BS} distribution 𝒰[π3,π3]\mathcal{U}[-\frac{\pi}{3},\frac{\pi}{3}]
Scatterer distance rlBSr_{l}^{\rm BS} distribution 𝒰[5,20](m)\mathcal{U}[5,20](m)
UT angle θ\theta distribution 𝒰[π3,π3]\mathcal{U}[-\frac{\pi}{3},\frac{\pi}{3}]
UT distance rr distribution 𝒰[5,50](m)\mathcal{U}[5,50](m)
Sampling lattice parameter (ρ,S,η)(\rho,S,\eta) (2,20,1.5)(2,20,1.5)
AMP damping factor ϵ\epsilon 0.8
AMP maximum iteration number TiterT_{\rm iter} 100
Sensing refinement iteration number TiterT_{\rm iter}^{\prime} 10

VI-B Channel Estimation Results

Next, we investigate the performance of the proposed uplink CSI acquistion scheme by evaluating the normalized mean square error (NMSE) between the real channel matrix 𝐇UL[m]\mathbf{H}_{\rm UL}[m] and its estimation 𝐇^UL[m]\widehat{\mathbf{H}}_{\rm UL}[m], i.e.,

NMSE=𝔼{1Mm=1M𝐇UL[m]𝐇^UL[m]F2𝐇UL[m]F2}.\displaystyle\text{NMSE}=\mathbb{E}\left\{\frac{1}{M}\sum_{m=1}^{M}\frac{\|\mathbf{H}_{\rm UL}[m]-\widehat{\mathbf{H}}_{\rm UL}[m]\|_{F}^{2}}{\|\mathbf{H}_{\rm UL}[m]\|_{F}^{2}}\right\}. (54)

For comparison, we consider the baseline schemes including

  1. (i)

    traditional far-field DFT projection matrix based schemes using OMP algorithm [18];

  2. (ii)

    far-field frequency-selective projection matrix based schemes with beam-squint effect into consideration [27];

  3. (iii)

    subarray piecewise far-field approximate projection matrix based schemes [23];

  4. (iv)

    near-field polar-domain projection matrix based schemes using OMP algorithm with the oracle information of MPCs number (L+1)(L+1) [25] or AMP algorithm.

Fig. 6 presents channel estimation performance comparison results versus measurement length PP and scatterers distance respectively, while in Fig. 6(b), the measurement length is P=72P=72. On the one hand, the performance degradation of traditional far-field sparse channel estimation scheme proves that the channel characteristics in hybrid-field region is different from those in conventional far-field region, and the adopted hybrid-field projection matrix derived based on the Fresnel approximation can better capture its sparsity especially when the scatterers’ or UT’s distance is small. Although dividing the UM-MIMO array into multiple small subarrays and approximate their channel with far-field planar wave is a feasible solution, we still observe that its performance enhancement is limited as it increases the number of angle parameters that need to be estimated to some extent. On the other hand, the significant performance loss caused by the traditional frequency-flat projection matrices illustrates that the beam-squint effect destroys the common support set property of the channels, while utilizing the proposed frequency-dependent projection can address this issue. Moreover, the adopted AMP-EM algorithm shows evident superiority over OMP algorithm since it better exploits the statistical prior information of HFBS channel and AWGN, and can also learn the information of MPCs’ number adaptively.

Refer to caption
(a) The performance comparison of different channel estimation schemes against different pilot length
Refer to caption
(b) The performance comparison of different channel estimation schemes against different scatterers distance
Figure 6: Channel estimation performance comparison.
Method Metrics Figure 7(a) Figure 7(b)
RMSEθ\textrm{RMSE}_{\theta}[dB] RMSEr\textrm{RMSE}_{r}[dB] RMSEθ\textrm{RMSE}_{\theta}[dB] RMSEr\textrm{RMSE}_{r}[dB]
Scatterers localization -11.2081 -5.6610 -10.4287 2.3688
LoS-only localization -9.6738 4.1412 -5.4379 6.5823
VAs-aided localization -9.3817 -1.8237 -16.5758 4.8653
Refined Localization -9.6900 -16.9680 -17.6700 -22.4413
TABLE III: Comparison of RMSE values for different scatterers’ localization errors.
Refer to caption
(a) Case that UT and scatterers are closer to the BS
Refer to caption
(b) Case that UT and scatterers are far away from the BS, where one scatterer has been wrongly clustered
Figure 7: A visualization of UT and scatterers location sensing performance.
Refer to caption
(a) RMSE performance of UT’s angle
Refer to caption
(b) RMSE performance of UT’s distance
Figure 8: Location sensing performance against different pilot length.

VI-C Location Sensing Results

Refer to caption
(a) CDF of RMSEθ{\rm RMSE}_{\theta}
Refer to caption
(b) CDF of RMSEr{\rm RMSE}_{r}
Figure 9: Empirical CDF w.r.t. the RMSE of angle and distance estimation.
Refer to caption
(a) RMSE performance of UT’s angle
Refer to caption
(b) RMSE performance of UT’s distance
Figure 10: Localization performance comparison against different uplink SNRs.
Refer to caption
(a) RMSE performance of UT’s angle
Refer to caption
(b) RMSE performance of UT’s distance
Figure 11: Localization performance comparison against different numbers of BS antennas.

To evaluate the performance of location sensing, we adopt the root mean square error (RMSE) of UT’s angle and distance estimation as the criteria, which can be defined as

RMSEθ=𝔼{(θ~BSθBS)2},\displaystyle\text{RMSE}_{\theta}=\sqrt{\mathbb{E}\left\{\left(\tilde{\theta}^{\rm BS}-\theta^{\rm BS}\right)^{2}\right\}}, (55a)
RMSEr=𝔼{(r~BSrBS)2},\displaystyle\text{RMSE}_{r}=\sqrt{\mathbb{E}\left\{\left(\tilde{r}^{\rm BS}-r^{\rm BS}\right)^{2}\right\}}, (55b)

In addition, we consider the following comparison schemes:

  1. (i)

    location sensing with only LoS information [39];

  2. (ii)

    location sensing with the aid of VAs according to (37);

  3. (iii)

    Refined location sensing with TDoA of MPCs according to (42).

In order to intuitively show the performance of location sensing, we compare the estimated locations of scatterers and UT with their true positions for a single realization with measurement length P=72P=72 in Fig. 7. Besides, Table III shows the RMSE performance of the localization for scatterers and UT in Fig. 7, and Table III could also demonstrate the impact of scatterers localization errors on various localization methods. In Fig. 7(a) and Table III, we observe that when UT and scatterers are closer to the BS, all UT and scatterers location can be accurately estimated. By contrast, when UT and scatterers are relatively far away from the BS as shown in Fig. 7(b), the accuracy of scatterers sensing is impaired and missing detection arises due to the non-uniform sampling grid pattern. From Table II, we can conclude that the scatterers localization errors have a significant impact on localization algorithms that rely solely on geometric information. Algorithms utilizing TDoA information, which is our proposed refined localization method, show more resilience to scatterers localization errors. Besides, it can be seen that in Fig. 7(b), there exist 6 MPCs but the estimated number of MPCs is 5. However, its impact on UT location sensing accuracy is insignificant for the VAs-aided localization and refined localization schemes, benefiting from the adopted weighting strategy and high-precision TDoA measurement.

Furthermore, Fig. 8 numerically exhibits the UT sensing performance with the criteria RMSEθ\text{RMSE}_{\theta} and RMSEr\text{RMSE}_{r}. It can be observed that with the increase of pilot overhead, the UT location sensing accuracy witnesses sustained improvement until convergence, which indicates that superior channel estimation results contributes to better location sensing performance with a certain range. Owing to ultra-high angular resolution of UM-MIMO array, utilizing LoS information allows high-precision angle estimation, while its distance estimation performance is limited. This is due to the non-uniform spatial sampling lattice, where the grid interval is inversely proportional to the distance. Thus, the straightforward grid mapping with LoS information will inevitably result in worse distance resolution when the target distance increases. To remedy this, joint utilization of VAs can lead to distance estimation enhancement but at a slight cost of angle estimation performance degradation. Particularly worth mentioning is that the aforementioned proposed methods do not depend on the delay information and are thus also suitable for narrowband systems, which offers a new location sensing method for band-limited systems.

For the broadband systems, the available TDoA information of MPCs can further promote the localization accuracy to the sub-centimeter level. Meanwhile, we investigate the location sensing performance in the absence of LoS link, where the number of NLoS MPCs is set to L=6L=6. We can observe a slight loss in angle estimation accuracy and similar distance estimation performance, which validates the effectiveness of the proposed location sensing method under different channel conditions. On this basis, we further plot the empirical cumulative distribution function (CDF) of RMSEθ\text{RMSE}_{\theta} and RMSEr\text{RMSE}_{r} in Fig. 9 with measurement length P=72P=72. Both of them have relatively large variances owing to the non-uniform spatial sampling grid. Take the proposed TDoA-assisted localization method for example, for the best case, RMSEθ\text{RMSE}_{\theta} and RMSEr\text{RMSE}_{r} can reach 50dB-50\,\rm dB and 30dB-30\,\rm dB, respectively. While for the worst case, RMSEθ\text{RMSE}_{\theta} and RMSEr\text{RMSE}_{r} degrade to 23dB-23\,\rm dB and 0dB0\,\rm dB. Anyway, from the statistical average view, the satisfactory performance can still be achieved.

Fig. 10 shows the localization performance comparison against different uplink SNR levels with the number of BS antennas and the number of measurements are set as NBS=512N_{\textrm{BS}}=512 and P=80P=80, respectively. It can be seen from the figure that the performance of the localization get better as the uplink SNR increases. Besides, the results demonstrate that our proposed refined TDoA-assisted localization method consistently outperforms other approaches across different SNR levels, both in terms of angle and distance estimation, which demonstrates the superiority and the robustness of our methods under various noise conditions.

Fig.  11 shows the localization performance comparison against different numbers of BS antennas with the measurement length set as P=80P=80. As evident from both subplots, all localization algorithms demonstrate gradually improving performance as the number of BS antennas increases. The improvement in localization accuracy with more antennas can be attributed to the enhanced spatial resolution and diversity provided by larger antenna arrays, which allow for more precise angle and distance estimation. Besides, methods utilizing TDoA information consistently outperform the method only utilizing the geometric information. The substantial performance improvement demonstrates the importance of incorporating TDoA information in localization problems

VI-D Beamforming Design Results

Refer to caption
Figure 12: SE comparison across the whole frequency band.
Refer to caption
(a) Case with LoS component
Refer to caption
(b) Case without LoS component
Figure 13: SE performance comparison with different transmit power PtDLP_{t}^{\rm DL}.
Refer to caption
(a) Case with LoS component
Refer to caption
(b) Case without LoS component
Figure 14: SE performance comparison with different bandwidth BW\rm BW.

Finally, we assess the effectiveness of beamforming design for data transmission by the SE criterion defined in (44). The baseline schemes for comparison include

  1. (i)

    DFT-codebook based spatially sparse precoding scheme [41];

  2. (ii)

    near-field codebook based beam focusing scheme [58];

  3. (iii)

    principal component analysis (PCA)-based beamforming scheme [59];

  4. (iv)

    far-field beam boradening scheme [29];

  5. (v)

    ideal steering vector (for the center frequency) scheme.

All of the schemes are based on the estimated channel and location sensing results, while the measurement length is set as P=80P=80.

Fig. 12 presents SE results calculated against different subcarriers, where the downlink transmit power is set to PtDL=30dBmP_{t}^{\rm DL}=30\,\rm dBm. As we can see, the ideal steering vector at the center frequency utilized for analog phase shifts design can ensure approximately optimal beamforming design at the center frequency, but with severe loss at the marginal subcarriers. The traditional DFT codebook-based spatially sparse precoding scheme undergoes the worst performance across the entire frequency band due to its mismatch with the channel’s HFBS feature. The near-field codebook beamforming scheme can remedy the aforementioned drawback but suffers from the problems of the beam-squint effect. Similarly, the far-field beam-broadening scheme handles the probelms caused by the beam-squint effect partially but overlooks the coexisting hybrid-field effect. The proposed scheme jointly optimizes beamforming design by fully considering the HFBS effect and thus achieves the approximately consistent SE performance across the entire bandwidth.

Fig. 13 further compares SE performance with different transmit power in the statistical average view. In the presence of LoS component, the proposed method and the PCA scheme similarly achieve the best performance, while the proposed one maintains the superiority of lower computational complexity without matrix inversion for analog codebook design. The proposed method further expands its advantages in the blocked LoS case and outperforms all other baseline schemes. In Fig. 14, we investigate the SE performance comparison with different bandwidth. When employing a relatively narrow band, all schemes except the far-field assumption based ones harvest satisfactory performance. With the increase of available bandwidth, the performance of all beamforming schemes deteriorates rapidly due to the practical hardware constraint of frequency-independent analog phase shifters. Nevertheless, among them, the proposed one degrades most sluggishly and exhibits best robustness to the HFBS effect.

VII Conclusion

In this paper, we not only overcomed but also exploited the HFBS nature of UM-MIMO systems and investigated a novel ILSC scheme. We commenced with proposing an ILSC signal frame structure consisting of uplink training and data transmission stage. In the uplink training stage, we utilized the designed projection matrix to achieve reliable sparse channel estimation under the HFBS effect, and further sensed the scatterers’ location by extracting the angular and range parameters simultaneously from the spherical electromagnetic wave. Taking the sensed scatterers as VAs, we conceived a w-LS method to coarsely estimate the the UT’s location in line with its propagation geometry. Leveraging the accurately estimated TDoA of MPCs, we further refined the location sensing results to a sub-decimeter level. For the downlink data transmission beamforming design, we utilized the acquired channel and location information, and combined beam broadening and focusing strategy to mitigate the spectral efficiency degradation resulted from HFBS effect, and achieve constant beamforming gain as much as possible over the whole bandwidth and enhance SE. Finally, we carried out simulation experiments to validate the superiority of the proposed ILSC scheme, and the results showed that the proposed schemes outperformed the baseline algorithms in terms of the channel estimation accuracy, location sensing performance, and communication SE.

References

  • [1] Qualcomm, Qualcomm Whitepaper Vision, Market Drivers, and Research Directions on the Path to 6G. [Online]. Available: https://www.qualcomm.com/content/dam/qcomm-martech/dm-assets/documents/Qualcomm-Whitepaper-Vision-market-drivers-and-research-directions-on-the-path-to-6G.pdf.
  • [2] S. A. Busari et al., “Millimeter-wave massive MIMO communication for future wireless systems: A Survey,” IEEE Commun. Surv. Tutor., vol. 20, no. 2, pp. 836-869, Q2nd. 2018.
  • [3] Y. Wang et al., ”Transformer-empowered 6G intelligent networks: From massive MIMO processing to semantic communication,” IEEE Wireless Commun., vol. 30, no. 6, pp. 127-135, Dec. 2023.
  • [4] Z. Wang et al., “Extremely large-scale MIMO: Fundamentals, challenges, solutions, and future directions,” IEEE Wireless Commun., early access, Apr. 10, 2023, doi: 10.1109/MWC.132.2200443.
  • [5] Z. Wan et al., “Terahertz massive MIMO with holographic reconfigurable intelligent surfaces,” IEEE Trans. Commun., vol. 69, no. 7, pp. 4732-4750, Jul. 2021.
  • [6] K. T. Selvan and R. Janaswamy, “Fraunhofer and Fresnel distances: Unified derivation for aperture antennas,” IEEE Antennas Propag. Mag., vol. 59, no. 4, pp. 12–15, Aug. 2017.
  • [7] Z. Gao et al., “Integrated sensing and communication with mmWave massive MIMO: A compressed sampling perspective,” IEEE Trans. Wireless Commun., vol. 22, no. 3, pp. 1745-1762, Mar. 2023.
  • [8] Y. Chen, R. Li, C. Han, S. Sun, and M. Tao, “Hybrid spherical- and planar-wave channel modeling and estimation for terahertz integrated UM-MIMO and IRS systems,” IEEE Trans. Wireless Commun., vol. 22, no. 12, pp. 9746-9761, Dec. 2023.
  • [9] Z. Li, Z. Gao, and T. Li, “Sensing user’s channel and location with terahertz extra-large reconfigurable intelligent surface under hybrid-field beam squint effect,” IEEE J. Sel. Topics Signal Process., vol. 17, no. 4, pp. 893-911, Jul. 2023.
  • [10] A. Liao et al., “Terahertz ultra-massive MIMO-based aeronautical communications in space-air-ground integrated networks,” IEEE J. Sel. Areas Commun., vol. 39, no. 6, pp. 1741-1767, Jun. 2021.
  • [11] F. Gao, L. Xu, and S. Ma, “Integrated sensing and communications with joint beam-squint and beam-split for mmWave/THz massive MIMO,” IEEE Trans. Commun., vol. 71, no. 5, pp. 2963-2976, May 2023.
  • [12] A. Sakhnini, S. De Bast, M. Guenach, A. Bourdoux, H. Sahli, and S. Pollin, “Near-field coherent radar sensing using a massive MIMO communication testbed,” IEEE Trans. Wireless Commun., vol. 21, no. 8, pp. 6256-6270, Aug. 2022.
  • [13] J. Yang, C. Wen, J. Xu, H. Que, H. Wei and S. Jin, “Angle-based SLAM on 5G mmWave systems: Design, implementation, and measurement,” IEEE Internet Things J., vol. 10, no. 20, pp. 17755-17771, Oct.15, 2023.
  • [14] Z. Wang, X. Mu, and Y. Liu, “Near-field integrated sensing and communications,” IEEE Commun. Lett., vol. 27, no. 8, pp. 2048-2052, Aug. 2023.
  • [15] K. Meng, Q. Wu, S. Ma, W. Chen, K. Wang and J. Li, “Throughput maximization for UAV-enabled integrated periodic sensing and communication,” IEEE Trans. Wireless Commun., vol. 22, no. 1, pp. 671-687, Jan. 2023.
  • [16] K. Meng, Q. Wu, W. Chen, and D. Li, “Sensing-assisted communication in vehicular networks with intelligent surface,” IEEE Trans. Veh. Technol., vol. 73, no. 1, pp. 876-893, Jan. 2024.
  • [17] A. Bayesteh, J. He, Y. Chen, P. Zhu, J. Ma, A. W. Shaban, Z. Yu, Y. Zhang, Z. Zhou, and G. Wang, “Integrated sensing and communication (ISAC)—From concept to practice,” Communications of Huawei Research, pp. 4–25, 2022.
  • [18] J. Lee, G. Gil, and Y. Lee, “Channel estimation via orthogonal matching pursuit for hybrid MIMO systems in millimeter wave communications,” IEEE Trans. Commun., vol. 64, no. 6, pp. 2370-2386, Jun. 2016.
  • [19] M. Ke, Z. Gao, Y. Wu, X. Gao, and R. Schober, “Compressive sensing based adaptive active user detection and channel estimation: Massive access meets massive MIMO,” IEEE Trans. Signal Process., vol. 68, pp. 764–779, Jan. 2020.
  • [20] Z. Gao, S. Liu, Y. Su, Z. Li, and D. Zheng, ”Hybrid knowledge-data driven channel semantic acquisition and beamforming for cell-free massive MIMO,” IEEE J. Sel. Topics Signal Process., vol. 17, no. 5, pp. 964-979, Sept. 2023.
  • [21] S. Wu, Z. Ni, X. Meng, and L. Kuang, “Block expectation propagation for downlink channel estimation in massive MIMO systems,” IEEE Commun. Lett., vol. 20, no. 11, pp. 2225–2228, Nov. 2016.
  • [22] Z. Gao et al., ”Compressive sensing-based grant-free massive access for 6G massive communication,” IEEE Internet Things J., vol. 11, no. 5, pp. 7411-7435, Mar. 2024.
  • [23] Y. Mei et al., “Massive access in extra large-scale MIMO with mixed-ADC over near-field channels,” IEEE Trans Veh. Technol., vol. 72, no. 9, pp. 12373-12378, Sept. 2023.
  • [24] Y. Han, S. Jin, C. Wen, and X. Ma, “Channel estimation for extremely large-scale massive MIMO systems,” IEEE Wireless Commun. Lett., vol. 9, no. 5, pp. 633-637, May 2020.
  • [25] M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?”, IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022.
  • [26] M. Cui and L. Dai, “Near-field wideband channel estimation for extremely large-scale MIMO,” arXiv preprint, arXiv:2212.08401, 2022.
  • [27] S. Ma, W. Shen, J. An, and L. Hanzo, “Wideband channel estimation for IRS-aided systems in the face of beam squint,” IEEE Trans Wireless Commun., vol. 20, no. 10, pp. 6240-6253, Oct. 2021.
  • [28] K. Wang et al., ”Knowledge and data dual-driven channel estimation and feedback for ultra-massive MIMO systems under hybrid field beam squint effect,” in IEEE Trans. Wireless Commun., early access, 2024.
  • [29] F. Gao, B. Wang, C. Xing, J. An, and G. Y. Li, “Wideband beamforming for hybrid massive MIMO terahertz communications,” IEEE J. Sel. Areas Commun., vol. 39, no. 6, pp. 1725-1740, Jun. 2021.
  • [30] N. J. Myers and R. W. Heath, “Infocus: A spatial coding technique to mitigate misfocus in near-field LoS beamforming,” IEEE Trans. Wireless Commun., vol. 21, no. 4, pp. 2193-2209, Apr. 2022.
  • [31] S. Dwivedi et al., “Positioning in 5G networks,” IEEE Commun. Mag., vol. 59, no. 11, pp. 38-44, Nov. 2021.
  • [32] A. Shahmansoori, G. Garcia, G. Destino, G. Seco-Granados, and H. Wymeersch, “Position and orientation estimation through millimeter-wave MIMO in 5G systems,” IEEE Trans. Wireless Commun., vol. 17, no. 3, pp. 1822-1835, Mar. 2018.
  • [33] X. Hu, C. Liu, M. Peng and C. Zhong, “IRS-based integrated location sensing and communication for mmWave SIMO systems,” IEEE Trans. Wireless Commun., vol. 22, no. 6, pp. 4132-4145, Jun. 2023.
  • [34] Z. Yu, X. Hu, C. Liu, M. Peng and C. Zhong, “Location sensing and beamforming design for IRS-enabled multi-user ISAC systems,” IEEE Trans. Signal Process., vol. 70, pp. 5178-5193, 2022.
  • [35] L. Qiao et al., “Sensing user’s activity, channel, and location with near-field extra-large-scale MIMO,” IEEE Trans. Commun., vol. 72, no. 2, pp. 890-906, Feb. 2024.
  • [36] Y. Huang and M. Barkat, “Near-field multiple source localization by passive sensor array,” IEEE Trans. Antennas and Propag., vol. 39, no. 7, pp. 968-975, Jul. 1991.
  • [37] J. Liang and D. Liu, “Passive localization of mixed near-field and far-field sources using two-stage MUSIC algorithm,” IEEE Trans. Signal Process., vol. 58, no. 1, pp. 108-120, Jan. 2010.
  • [38] A. Guerra, F. Guidi, D. Dardari, and P. M. Djurić, “Near-field tracking with large antenna arrays: Fundamental limits and practical algorithms,” IEEE Trans. Signal Process., vol. 69, pp. 5723-5738, Aug. 2021.
  • [39] O. Rinchi, A. Elzanaty, and M. Alouini, “Compressive near-field localization for multipath RIS-aided environments,” IEEE Commun. Lett., vol. 26, no. 6, pp. 1268-1272, Jun. 2022.
  • [40] K. Meng, et al., “Cooperative ISAC Networks: Opportunities and Challenges,” arXiv preprint arXiv:2405.06305, May 2024.
  • [41] O. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE Trans Wireless Commun., vol. 13, no. 3, pp. 1499-1513, Mar. 2014.
  • [42] Y. Liu, Z. Wang, J. Xu, C. Ouyang, X. Mu, and R. Schober, “Near-field communications: A tutorial review,” IEEE Open J. Commun. Soc., vol. 4, pp. 1999-2049, Aug. 2023.
  • [43] K. R. Schaubach, N. J. Davis, and T. S. Rappaport, ”A ray tracing method for predicting path loss and delay spread in microcellular environments,” in Proc. Veh. Technol. Soc. 42nd VTS Conf. Front. Technol., Denver, CO, USA, 1992, vol. 2, pp. 932-935.
  • [44] D. Donoho, A. Maleki, and A.Montanari, “Message passing algorithms for compressed sensing: I. motivation and construction,” in Proc. Inf. Theory Workshop, Jan. 2010, pp. 1–5.
  • [45] X. Meng et al., “Efficient recovery of structured sparse signals via approximate message passing with structured spike and slab prior,” IEEE China Commun., vol. 15, no. 6, pp. 1–17, Jun. 2018.
  • [46] F. R. Kschischang, B. J. Frey, and H. A. Loeliger, “Factor graph and the sum-product algorithm,” IEEE Trans. Inf. Theory, vol. 47, no. 2, pp. 498–519, Feb. 2001.
  • [47] J. Vila and P. Schniter, “Expectation-maximization Gaussian-mixture approximate message passing,” IEEE Trans. Signal Process., vol. 61, no. 19, pp. 4658–4672, Oct. 2013.
  • [48] S. Rangan, P. Schniter, A. K. Fletcher and S. Sarkar, ”On the convergence of approximate message passing with arbitrary matrices,” IEEE Trans. on Inf. Theory, vol. 65, no. 9, pp. 5339-5351, Sept. 2019.
  • [49] T. Shan, M. Wax and T. Kailath, “On spatial smoothing for direction-of-arrival estimation of coherent signals,” IEEE Trans. Acoust., Speech, Signal Process., vol. 33, no. 4, pp. 806-811, Aug. 1985.
  • [50] M. Wax and T. Kailath, “Detection of signals by information theoretic criteria,” IEEE Trans. Acoust., Speech, Signal Process., vol. 33, no. 2, pp. 387-392, Apr. 1985.
  • [51] K. Cheung, H. So, W. Ma, and Y. Chan, “Least squares algorithms for time-of-arrival-based mobile location,” IEEE Trans. Signal Process., vol. 52, no. 4, pp. 1121-1130, Apr. 2004.
  • [52] T. Kanungo et al., “An efficient k-means clustering algorithm: Analysis and implementation,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 7, pp. 881-892, Jul. 2002.
  • [53] X. Shao, C. You, W. Ma, X. Chen, and R. Zhang, “Target sensing with intelligent reflecting surface: Architecture and performance,” IEEE J. Sel. Areas Commun., vol. 40, no. 7, pp. 2070-2084, Jul. 2022.
  • [54] T. Cai and L. Wang, “Orthogonal matching pursuit for sparse signal recovery with noise,” IEEE Trans. Inf. Theory, vol. 57, no. 7, pp. 4680-4688, Jul. 2011.
  • [55] J. Nocedal and S. J. Wright, Numerical Optimization, New York, NY, USA: Springer, 2006, pp. 448–492.
  • [56] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Commun. Mag., vol. 61, no. 1, pp. 40-46, Jan. 2023.
  • [57] 3GPP, “Technical specification group radio access network; Study on channel model for frequencies from 0.5 to 100 GHz,” 3rd Generation Partnership Project (3GPP), Technical Report (TR) 38.901, Mar. 2022, version 17.0.0.
  • [58] Y. Zhang, X. Wu and C. You, “Fast near-field beam training for extremely large-scale array,” IEEE Wireless Commun. Lett., vol. 11, no. 12, pp. 2625-2629, Dec. 2022.
  • [59] Y. Sun et al., “Principal component analysis-based broadband hybrid precoding for millimeter-wave massive MIMO systems,”IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6331-6346, Oct. 2020.
  • [60] A. Shafie et al., “Terahertz communications for 6G and beyond wireless networks: Challenges, key advancements, and opportunities,” IEEE Netw., pp. 1–8, May 2022.