This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\UseRawInputEncoding

Low Time Complexity Near-Field Channel and Position Estimations

Xiyuan Liu,  Qingqing Wu,  Rui Wang,  Jun Wu X. Liu is with the College of Electronics and Information Engineering, Tongji University, Shanghai 201804, China (e-mail: [email protected]). Q. Wu is with the Department of Electronic Engineering, Shanghai Jiao Tong University, 200240, China (e-mail: [email protected]). R. Wang is with the College of Electronics and Information Engineering, Tongji University, Shanghai 201804, China, and also with the Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai 201804, China (e-mail: [email protected]). J. Wu is with the School of Computer Science, Fudan University, Shanghai 200433, China, and also with the Shanghai Qi Zhi Institute, Shanghai 200030, China (e-mail: [email protected]).
Abstract

With the application of high-frequency communication and extremely large MIMO (XL-MIMO), the near-field effect has become increasingly apparent. The near-field channel estimation and position estimation problems both rely on the Angle of Arrival (AoA) and the Curvature of Arrival (CoA) estimation. However, in the near-field channel model, the coupling of AoA and CoA information poses a challenge to the estimation of the near-field channel. This paper proposes a Joint Autocorrelation and Cross-correlation (JAC) scheme to decouple AoA and CoA estimation. Based on the JAC scheme, we propose two specific near-field estimation algorithms, namely Inverse Sinc Function (JAC-ISF) and Gradient Descent (JAC-GD) algorithms. Finally, we analyzed the time complexity of the JAC scheme and the cramer-rao lower bound (CRLB) for near-field position estimation. The simulation experiment results show that the algorithm designed based on JAC scheme can solve the problem of coupled CoA and AoA information in near-field estimation, thereby improving the algorithm performance. The JAC-GD algorithm shows significant performance in channel estimation and position estimation at different SNRs, snapshot points, and communication distances compared to other algorithms. This indicates that the JAC-GD algorithm can achieve more accurate channel and position estimation results while saving time overhead.

Index Terms:
Near-field, position estimation, channel estimation, AoA, CoA, XL-MIMO.

I Introduction

Since the concept of MIMO was introduced, the scale of MIMO systems has been expanding, evolving from massive MIMO [1] to extremely large MIMO (XL-MIMO) [2, 3, 4, 5, 6, 7]. This progression is primarily driven by two key factors. First, the frequency band of communication has expanded from the original sub-6 GHz to high-frequency bands like millimeter wave and terahertz [8, 9, 10, 11]. Due to the limited scattering and diffraction capabilities of high-frequency signals [2, 12], XL-MIMO becomes crucial as it provides beamforming gain to compensate for path loss in high-frequency signal propagation. Simultaneously, the shorter wavelength of high-frequency signals allows the integration of larger MIMOs within a limited physical size. Second, the adoption of hybrid beamforming and intelligent reflecting surface technologies has significantly reduced the average cost of MIMO [13, 14]. Consequently, XL-MIMO is deemed necessary, feasible, and practical in the 6G communication scenario.

However, XL-MIMO can introduce significant near-field effects [15, 16, 17, 18]. It is widely acknowledged that the boundary between the far-field and near-field is defined by the Rayleigh distance, expressed as 2D2λ\frac{2D^{2}}{\lambda} [19], where DD represents the array size, and λ\lambda is the wavelength of the signal. In XL-MIMO and high-frequency communication scenarios, the larger DD and smaller λ\lambda result in a substantially increased Rayleigh distance compared to traditional communication scenarios [20, 21]. This leads to more users being in the near-field range. The near-field effect introduces significant challenges to beamforming design, codebook design [7, 22], and the beam training process [23], emerging as the primary bottleneck in current high-frequency communication and XL-MIMO scenarios.

The main distinction between the near-field and far-field lies in the fact that electromagnetic waves exhibit spherical wave characteristics in the near-field, as opposed to plane waves in the far-field [6]. Consequently, in the far-field, electromagnetic wave characteristics can be solely described through the angles of arrival (AoA) or departure (AoD). In the near-field region, owing to the characteristics of spherical waves, the AoAs observed by antennas at different positions of the array vary [19] [24]. To elucidate the AoA variations, the introduction of the concept of curvature of arrival (CoA) becomes necessary to assist in delineating the spatial characteristics of electromagnetic waves in near-field channels [25]. Generally, the CoA of spherical waves in the near-field region can be considered constant [2]. Therefore, combining CoA information with AoA information at the reference antenna enables the description of the characteristics of the near-field channel.

However, the introduction of CoA significantly complicates the near-field problem compared to the far-field problem. There are two primary challenges in beam training within near-field scenarios. First, the joint estimation of CoA and AoA introduces a multiplicative time complexity, resulting in a quadratic form of the number of antennas. This complexity arises for the reason that the array’s resolution for both AoA and CoA is proportional to the array’s aperture, denoted as Dλ\frac{D}{\lambda}. In the context of XL-MIMO, the time complexity of separately estimating AoA or CoA is 𝒪(N)\mathcal{O}(N), where NN represents the number of antennas in the array. When estimating AoA and CoA simultaneously, the time complexity becomes 𝒪(N2)\mathcal{O}(N^{2}) due to their combination in forming electromagnetic waves of different shapes. Given that XL-MIMO involves a larger value of NN compared to Massive MIMO, the near-field beam training problem becomes exceptionally time-consuming, exacerbated by the quadratic time complexity and larger NN. Consequently, designing an efficient beam training scheme in the near-field becomes a considerable challenge. The second primary challenge faced by near-field beam training is the coupling of CoA and AoA information, presenting difficulties in orthogonal decomposition of the near-field space [26].

However, if orthogonal decomposition cannot be achieved in the near field, it can lead to the following two main problems. The first problem is the further increase in the time complexity of near-field beamforming. Without the ability to orthogonally decompose space, it becomes impossible to explore the entire space with a finite number of beams without duplication or leakage. Consequently, it becomes necessary to repeat a certain area to enhance the spatial coverage of the beam training process, introducing significant redundancy and consuming additional time [27]. The second issue is that when estimating coupled variables, the two variables can affect each other, making the estimation problem more complex. This has had a serious impact on issues such as near-field codebook design, beam training, channel estimation, and position estimation.

To address these gaps, We need to search for received signal statistics that are only related to AoA or CoA. We search for such statistics based on the dual properties of time-domain signal processing and spatial signal processing to decouple near-field channel parameters. Based on these statistics, we design a scheme for near-field channel estimation and position estimation, and then based on this scheme, we can design specific estimation algorithms. Our main contributions are as follows:

  • We propose a Joint Autocorrelation and Cross-correlation (JAC) scheme for estimating the coupled near-field CoA and AoA information. Firstly, based on the dual relationship between digital signal processing (DSP) and array signal processing (ASP), we have identified the dual problem of the near-field problem, namely the Doppler problem. Therefore, we refer to the near-field problem as the spatial Doppler problem. Inspired by the idea that the frequency change of time-domain signals can be solved by calculating the coherence time, we found that the CoA information of the near-field can be obtained by solving the spatial autocorrelation function of the signal. So, we proposed a scheme of first solving the CoA based on the spatial autocorrelation function of the received signal, and then substituting the obtained CoA to solve the AoA. Due to the fact that the process of solving AoA is similar to the principle of far-field scenarios, which is to find the direction space with the maximum cross-correlation value with the channel, we call this scheme the JAC scheme. Compared with other existing near-field estimation algorithms, the JAC scheme is not affected by AoA when solving CoA. Therefore, we can perform orthogonal decomposition separately in the space of CoA and AoA. This allows some algorithms based on orthogonal decomposition to be transferred to near-field use through the JAC scheme.

  • We proposed the Inverse Sinc Function (JAC-ISF) algorithm. Since the autocorrelation function of the near-field received signal is in the form of a sincsinc function, we can use the arcsinc{\rm arc}sinc function to solve for CoA. However, since the sincsinc function is not a monotonic function, we need to intercept the values of the sincsinc function between (1,0)(-1,0) to form the arcsinc{\rm arc}sinc function. The advantage of JAC-ISF algorithm is that it is easy to implement because it does not require iterative or optimization operations. The disadvantage is that we only used the main lobe data of the autocorrelation function and did not use all the data, resulting in a certain degree of information loss and a decrease in algorithm accuracy.

  • We proposed the Gradient Descent (JAC-GD) algorithm. In this algorithm, we use the difference between the autocorrelation function and the sinc function as the loss function and employ gradient descent to estimate the value of CoA. Compared to the JAC-ISF algorithm, the JAC-GD algorithm is more complex, but due to its utilization of all autocorrelation function information, it has higher accuracy.

  • We analyzed the time complexity of the JAC scheme and the Cramer-Rao Lower Bound (CRLB) for near-field position estimation. Simulation results show that the JAC-GD algorithm is closer to CRLB compared to other algorithms, and its performance is significantly better than other algorithms at different SNRs and communication distances. This demonstrates that the JAC-GD algorithm achieves better performance while reducing power and time overhead.

The remainder of this paper is organized as follows. Section II presents the system model. Section III introduces the JAC scheme, while Section IV propose two near-field channel and position estimation algorithms. We analyze the cramer-rao lower bound (CRLB) and time complexity in Section V. Simulation results are shown in Section VI while the conclusion is presented in Section VII.

Notations: In this paper, scalars, vectors, and matrices are denoted by italic letters, bold-face lower-case, and upper-case letters, respectively. The space of x×y{x}\times{y} complex-valued matrices is denoted by x×y\mathbb{C}^{{x}\times{y}}. The function sinc(x)sinc(x) is sin(x)x\frac{\sin(x)}{x}. For a complex-value vector 𝒙\bm{x}, 𝒙𝒚{\bm{x}}\otimes{\bm{y}} denotes the Kronecker product of 𝒙\bm{x} and 𝒚\bm{y} while 𝒙2\|{\bm{x}}\|_{2} denotes its binary norm and diag(𝒙)({\bm{x}}) denotes a diagonal matrix with each diagonal entry being the corresponding entry in 𝒙\bm{x}. 𝒙𝒚{\bm{x}}\cdot{\bm{y}} denotes the dot product between these two vectors, while the cross product between 𝒙\bm{x} and 𝒚\bm{y} is represented by 𝒙𝒚{\bm{x}}{\bm{y}} or 𝒙×𝒚{\bm{x}}\times{\bm{y}}. 𝒙i{\bm{x}}_{i} is the iith entry of 𝒙\bm{x}. For a function 𝒚=H(𝒙){\bm{y}}=H({\bm{x}}), H1(𝒚)H^{-1}({\bm{y}}) denotes its inverse function. For a general matrix 𝑨\bm{A}, 𝑨{\bm{A}}^{*}, 𝑨H{\bm{A}}^{H}, 𝑨i,j{\bm{A}}_{i,j} and det(𝑨){\rm det}(\bm{A}) denote its conjugate, conjugate transpose, the (i,j)(i,j)th entry, and the determinant of the 𝑨\bm{A} respectively. ȷ\jmath denotes the imaginary unit, i.e., ȷ2=1{\jmath}^{2}=-1.

II System model

As shown in Figure 1, we consider the channel model in the uplink scenario. The base station (BS) is a uniform linear array with NN antennas located on the xx-axis, and its reference antenna is located at the origin. The user (UE) is a single antenna located at point 𝒑=(px,0,pz){\bm{p}}=(p_{x},0,p_{z}) on the xOzxOz plane. θ\theta and nn are the elevation angle of the reference antenna towards the user and the index of the antenna, respectively. The received signal on BS can be written as:

𝒀=𝒉𝒔H+𝑵σ,{\bm{Y}}={\bm{h}}{\bm{s}}^{H}+{\bm{N}}_{\sigma}, (1)

where 𝒀N×T\bm{Y}\in\mathbb{C}^{N\times T}, 𝒉N×1\bm{h}\in\mathbb{C}^{N\times 1}, 𝒔T×1\bm{s}\in\mathbb{C}^{T\times 1} and 𝑵σN×T{\bm{N}}_{\sigma}\in\mathbb{C}^{N\times T} are the received signal matrix, the channel, the transmitted signal at TT time slot and the Gaussian white noise with power σ\sigma, respectively. Due to the weak scattering, diffraction, and reflection abilities of high-frequency signals, the channel is often sparse, and its power is dominated by the line of sight (LoS) path [23]. Therefore, we only consider the LoS path and treat the non line of sight (NLoS) path as noise. The 𝒔\bm{s} in equation (1) is:

𝒔=[s(1),s(2),,s(t),,s(T)],\bm{s}=[s(1),s(2),\cdots,s(t),\cdots,s(T)],\;\; (2)

where s(t)s(t) can be written as:

s(t)=𝝆teȷ(2πft+𝝍t),s(t)=\bm{\rho}_{t}e^{\jmath(2\pi ft+\bm{\psi}_{t})}, (3)

where 𝝆T×1\bm{\rho}\in\mathbb{C}^{T\times 1}, 𝝍T×1\bm{\psi}\in\mathbb{C}^{T\times 1} and ff are the amplitude, phase of the transmitted signal and the carrier frequency, respectively. For channel 𝒉\bm{h}, without loss of generality, we use a continuous function h(xn)h(x_{n}) to represent it, where xnx_{n} is the xaxisx-{\rm axis} coordinate of the nnth antenna. h(xn)h(x_{n}) can be written as:

h(xn)=ϱneȷkϕ(xn)=ϱneȷk(𝒑𝒙n2𝒑2),\begin{split}&h(x_{n})=\bm{\varrho}_{n}e^{\jmath k\phi(x_{n})}\\ &=\bm{\varrho}_{n}e^{-\jmath k\left(\|{\bm{p}}-{\bm{x}}_{n}\|_{2}-\|{\bm{p}}\|_{2}\right)},\end{split} (4)

where ϱN×1\bm{\varrho}\in\mathbb{C}^{N\times 1} and kϕ(xn)k\phi(x_{n}) are the channel gain and relative delay of the transmitted signal for each subchannel, respectively. ϱ\bm{\varrho} is mainly affected by large-scale fading, while small-scale fading such as near-field effects have little impact on it. Therefore, without loss of generality, we take ϱ=𝟏\bm{\varrho}=\bm{1}. In near-field channel estimation and localization problems, we mainly focus on ϕ(xn)\phi(x_{n}). For 𝒑𝒙n2\|{\bm{p}}-{\bm{x}}_{n}\|_{2} in ϕ(xn)\phi(x_{n}), we use the binomial theorem to expand:

𝒑𝒙n2=(r2+(2rsinθxn+xn2))12=r+12r1(2rsinθxn+xn2)18r3(2rsinθxn+xn2)2+116r5(2rsinθxn+xn2)3+=rxnsinθ+xn2cos2θ2r+xn3sinθcos2θ2r2+o(xn3),\displaystyle\begin{split}&\|{\bm{p}}-{\bm{x}}_{n}\|_{2}\\ &=\left(r^{2}+\left(-2r\sin\theta x_{n}+x_{n}^{2}\right)\right)^{\frac{1}{2}}\\ &=r+\frac{1}{2}r^{-1}\left(-2r\sin\theta x_{n}+x_{n}^{2}\right)\\ &-\frac{1}{8}r^{-3}\left(-2r\sin\theta x_{n}+x_{n}^{2}\right)^{2}\\ &+\frac{1}{16}r^{-5}\left(-2r\sin\theta x_{n}+x_{n}^{2}\right)^{3}+\cdots\\ &=r-x_{n}\sin\theta+\frac{x_{n}^{2}\cos^{2}\theta}{2r}+\frac{x_{n}^{3}\sin\theta\cos^{2}\theta}{2r^{2}}+o(x_{n}^{3}),\end{split} (5)

where r=𝒑2r=\|\bm{p}\|_{2} is the distance between UE and the refrence antenna of the BS. Based on the relationship with xnx_{n}, we categorize the terms in equation (5) as constant, linear, quadratic, and higher-order, respectively. Generally, we consider the phase error of h(xn)h(x_{n}) negligible when it is less than π8\frac{\pi}{8}. The omitted portion of equation (5) represents the infinitesimal of the first few terms; hence, our discussion will focus on these initial terms.

When xn3cos2θsinθ2r2>λ16\frac{x_{n}^{3}\cos^{2}\theta\sin\theta}{2r^{2}}>\frac{\lambda}{16}, leading to r<0.62(Nd)3λr<0.62\sqrt{\frac{(Nd)^{3}}{\lambda}} [19, 28, 29], electromagnetic waves exhibit electrical resistance [21] and cannot radiate energy outward. Consequently, in communication scenarios, as long as the signal can propagate, it must satisfy xn3cos2θsinθ2r2<λ16\frac{x_{n}^{3}\cos^{2}\theta\sin\theta}{2r^{2}}<\frac{\lambda}{16}. Hence, higher-order terms can always be neglected in the communication channel model.

When xn2cos2θ2r<λ16\frac{x_{n}^{2}\cos^{2}\theta}{2r}<\frac{\lambda}{16} holds, r>2(Nd)2λr>\frac{2(Nd)^{2}}{\lambda}. This condition characterizes the Fraunhofer region, also known as the far-field region, where 𝒉[n]{\bm{h}}[n] becomes a function of θ\theta independent of rr. Conversely, when r<2(Nd)2λr<\frac{2(Nd)^{2}}{\lambda}, the corresponding region is termed the Fresnel or near-field region.

To sum up, the phase function of the near-field channel is quadratic, while the far-field channel is a linear function. Thus, the far-field channel and near-field channel can be parameterized as:

hfar(xn)=eȷkp2xnh_{\rm far}(x_{n})=e^{\jmath kp_{2}x_{n}} (6)

and

hnear(xn)=eȷk(p1xn2+p2xn),h_{\rm near}(x_{n})=e^{\jmath k(p_{1}x_{n}^{2}+p_{2}x_{n})}, (7)

respectively. We can observe that the far-field model is a special case of the near-field model when p1=0p_{1}=0. Therefore, designing parameter estimation algorithms based on equation (7) can achieve both far-field channel estimation and near-field channel estimation.

Refer to caption
Figure 1: Near-field channel model for ULA communication system.

III JAC Scheme

III-A The Physical Meaning of Near-field Channel Parameters

In order to find better near-field channel estimation methods, we will analyze the physical meaning of near-field channel parameters in this section. According to equation (4), the UE’s location information is contained in ϕ(x)\phi(x). According to equations (5) and (7), we can obtain:

ϕ(x)=p1x2+p2x,\phi(x)=p_{1}x^{2}+p_{2}x, (8)

where

p1=cos2θ2r,p_{1}=-\frac{\cos^{2}\theta}{2r}, (9)

and

p2=sinθ.p_{2}=\sin\theta. (10)

We can observe that the distance and angle information of the user are coupled in p1p_{1}, while the information of p1p_{1} and p2p_{2} are coupled in ϕ\phi. Therefore, if the distance and angle of the user are estimated directly on ϕ\phi, there will be a phenomenon of mutual coupling between parameters.

Refer to caption
Figure 2: Parameters for controlling the curvature and direction of spherical waves.

We address the parameter coupling phenomenon in near-field channels from two aspects. Firstly, we directly estimate p1p_{1} and p2p_{2}, and then estimate the UE’s position using equations (9) and (10) after obtaining p1p_{1} and p2p_{2}. Secondly, we will find a method to decouple p1p_{1} and p2p_{2} based on their physical meanings. We use β(x)\beta(x) to represent the direction in which the antenna at position (x,0,0)(x,0,0) looks towards the UE, that is:

β(x)=sinθx,β(x)(1,1),\beta(x)=\sin\theta_{x},\,\,\,\,\,\beta(x)\in(-1,1), (11)

where θx\theta_{x} is the elevation angle of the antenna at position (x,0,0)(x,0,0) towards the UE. According to equation (8), the following holds:

β(x)=dϕ(x)dx=2p1x+p2.\beta(x)=\frac{{\rm d}\phi(x)}{{\rm d}x}=2p_{1}x+p_{2}. (12)

Since β(0)=p2\beta(0)=p_{2}, p2p_{2} represents the channel direction at the reference antenna. We define the channel direction at the reference antenna as the angel of arrival (AoA) and the rate of change of a channel with respect to space as the curvature of arrival (CoA) of the channel, which is:

CoA=dβ(x)dx=2p1.{\rm CoA}=\frac{{\rm d}\beta(x)}{{\rm d}x}=2p_{1}. (13)

From equation (13), it can be seen that the physical meaning of p1p_{1} is the curvature of the channel. Fig. 2 shows the relationship between p1p_{1} and p2p_{2} and UE’s location. It can be seen that the size of p1p_{1} affects the CoA of the wavefront, regardless of its direction while p2p_{2} represents the direction of the tangent plane of the wavefront at the reference antenna, without affecting the curvature of the wavefront. Therefore, as long as we find a method that can estimate channel CoA and AoA separately, we can decouple p1p_{1} and p2p_{2}.

III-B The Relationship Between Near-field Problems and Doppler Problems

In this section, we explore the dual relationship between the near-field problem and the Doppler problem to find a method for estimating the near-field channel CoA, and then use the estimated CoA information to estimate the AoA.

For the time-domain function h(t)=eȷϕ(t){h}(t)=e^{\jmath\phi(t)}, if ϕ(t)\phi^{\prime}(t) is a constant value, then h(t){h}(t) is stationary, and its frequency spectrum H(ω){H}(\omega) does not change with time. If there is Doppler effect in the channel, h(t){h}(t) becomes non-stationary. At this point, ϕ(t)\phi^{\prime}(t) is not a constant value, and H(ω){H}(\omega) also changes with time. The coherence time of h(t){h}(t) can reflect the speed at which H(ω){H}(\omega) changes over time.

In near-field problems, the channel also exhibits properties similar to the Doppler effect. We use κ=kxk\kappa=\frac{k_{x}}{k} to represent the direction of the channel, where kxk_{x} is the projection of kk in the xx-axis direction. We select a small interval (x0,x0+Δx)(x_{0},x_{0}+\Delta x) on the array, where ϕ(x)\phi(x) can be seen as a linear function. The pattern on this interval can be written as:

H(κ)|x=x0=1Δx|x0x0+Δxeȷkϕ(x)eȷkκxdx|1Δx|x0x0+Δxeȷk(ϕ(x0)+ϕ(x)(xx0))eȷkκ(xx0)dx|=1Δx|0Δxeȷk(β(x0)κ)xdx|=1Δx|0Δxeȷk(β(x0)κ)xdx|=sinc(β(x0)kΔx2).\displaystyle\begin{split}&H(\kappa)|_{x=x_{0}}\\ &=\frac{1}{\Delta x}\left|\int_{x_{0}}^{x_{0}+\Delta x}e^{\jmath k\phi(x)}e^{-\jmath k\kappa x}{\rm d}x\right|\\ &\frac{1}{\Delta x}\left|\int_{x_{0}}^{x_{0}+\Delta x}e^{\jmath k(\phi(x_{0})+\phi^{\prime}(x)(x-x_{0}))}e^{\jmath k\kappa(x-x_{0})}{\rm d}x\right|\\ &=\frac{1}{\Delta x}\left|\int_{0}^{\Delta x}e^{\jmath k(\beta(x_{0})-\kappa)x}{\rm d}x\right|\\ &=\frac{1}{\Delta x}\left|\int_{0}^{\Delta x}e^{\jmath k(\beta(x_{0})-\kappa)x}{\rm d}x\right|\\ &=sinc(\beta(x_{0})\frac{k\Delta x}{2}).\end{split} (14)

When β(x)\beta(x) is a constant independent of xx, H(κ)H(\kappa) does not change with the variation of xx, and at this time h(x)h(x) is spatially stationary. When β(x)\beta(x) is not a constant, h(x)h(x) is spatially non-stationary.

Refer to caption
Figure 3: Spatial Doppler phenomenon.

We can compare the time signal and spatial signal and find that the Doppler effect causes h(t)h(t) to be non-stationary in time, and the near-field effect causes h(x)h(x) to be non-stationary in space. Therefore, we call the near-field effect the spatial Doppler effect. Hence, we can imitate the time-domain Doppler estimation method and propose its dual spatial Doppler estimation method. Thus, we proposed the concept of coherent space by imitating the concept of coherent time. According to equation (6), when the distance from the user to the reference antenna is greater than the Rayleigh distance, the channel model can be considered as a far-field model. At this point, p1=0p_{1}=0 and β(x)=p2\beta(x)=p_{2} is a constant, so the signal is spatial stationary, meaning that the signal is coherent in the space of the array. Therefore, we define coherent space as the range where the channel can be viewed as a far-field on the line of the array. We use χ0\chi_{0} to represent the length of the coherent space. As shown in Fig. 3, the coherence space size of the channel varies for users at different locations. For user 3, its distance from the reference antenna is equal to the Rayleigh distance, therefore χ0user3=D\chi_{0}^{{\rm user}3}=D, where DD is the physical size of the array. For user 4, it is located in the far-field region of the array, therefore χ0user4>D\chi_{0}^{{\rm user}4}>D. The electromagnetic waves emitted by user 4 can be regarded as plane waves on the array. For near-field users, such as user 1 and user 2, the coherence space of their channels is smaller than the array size, so the channels cannot be considered spatially coherent, i.e. plane wave models cannot be used. And because user 1 is closer to the array than user 2, χ0user1<χ0user2\chi_{0}^{{\rm user}1}<\chi_{0}^{{\rm user}2}.

Therefore, we found that by measuring the spatial coherence of the signals received by the array, we can determine the distance from the user to the reference antenna. If the signals on the entire array are coherent, it indicates that the user is in the far field, while if only a portion is coherent, it indicates that the user is in the near field of the array. The smaller the coherence space, the greater the rate of change of channel direction with space, that is, the larger the CoA. So we can solve p1p_{1} by doing autocorrelation on the signal. In the next subsection, we will provide a detailed introduction to the functional relationship between signal autocorrelation and p1p_{1}.

III-C Estimation Model of Near-field Channel

We define the amplitude spectrum of the spatial autocorrelation function of the received signal as:

c(χ)=1DD~|D~Dh(x)h(xχ)dx|=1DD~|D~Deȷk(p1x2+p2x)eȷk(p1(xχ)2+p2(xχ))dx|=1DD~|D~Deȷk2p1χxeȷk(p1χ2+p2χ)dx|=1DD~|D~Deȷk2p1χxdx|=|sinc(kp1χ(DD~))|,\displaystyle\begin{split}&c\left(\chi\right)=\frac{1}{D-\widetilde{D}}\left|\int_{\widetilde{D}}^{D}h\left(x\right)h^{*}\left(x-\chi\right){\rm d}x\right|\\ &=\frac{1}{D-\widetilde{D}}\left|\int_{\widetilde{D}}^{D}e^{\jmath k\left(p_{1}x^{2}+p_{2}x\right)}e^{-\jmath k\left(p_{1}\left(x-\chi\right)^{2}+p_{2}\left(x-\chi\right)\right)}{\rm d}x\right|\\ &=\frac{1}{D-\widetilde{D}}\left|\int_{\widetilde{D}}^{D}e^{\jmath k2p_{1}\chi x}e^{\jmath k\left(-p_{1}\chi^{2}+p_{2}\chi\right)}{\rm d}x\right|\\ &=\frac{1}{D-\widetilde{D}}\left|\int_{\widetilde{D}}^{D}e^{\jmath k2p_{1}\chi x}{\rm d}x\right|\\ &=\left|sinc\left(kp_{1}\chi\left(D-\widetilde{D}\right)\right)\right|,\end{split} (15)

where χ\chi represents the displacement in the autocorrelation function. According to equation (15), we can write the solution of p1p_{1} as:

p1=minp1|c(χ)|sinc(kp1χ(DD~))||.p_{1}=\min_{p_{1}}\left|c(\chi)-\left|sinc\left(kp_{1}\chi\left(D-\widetilde{D}\right)\right)\right|\right|. (16)

After estimating the parameter p1p_{1}, we can construct a new signal form to transform the near-field problem into a far-field problem:

h~(x)=eȷkp^1x2h(x).\widetilde{h}(x)=e^{\jmath k\hat{p}_{1}x^{2}}h(x). (17)

At this point, h~(x)\widetilde{h}(x) is equivalent to a far-field channel, so we can use traditional far-field channel estimation methods to estimate parameter p2p_{2}. Finally, based on equation (7), we can obtain the estimated channel h^\hat{h}, and based on equations (9) and (10), we can obtain θ^\hat{\theta} and r^\hat{r}.

In summary, we decompose the near-field estimation process into two steps: estimating p1p_{1} and p2p_{2}. The process of estimating p1p_{1} is an estimation method similar to the Doppler effect, which uses autocorrelation function for estimation. The process of estimating p2p_{2} is essentially an estimation of the directional pattern, which is to determine which direction the signal has the greatest cross-correlation with. Therefore, the estimation model proposed in this paper is named the joint auto-correlation and cross-correlation (JAC) scheme.

IV Near Field Localization Algorithms Based on JAC Scheme

The JAC model is mainly divided into two stages, with the second stage essentially being far-field AoA estimation. Therefore, in this section, we propose two algorithms to solve the CoA estimation problem in the first stage of the JAC scheme.

IV-A Inverse sinc function algorithm

In this part, we use the inverse function of the sinc function to solve the parameter p1p_{1}, so we call this algorithm the JAC-ISF algorithm.

According to equation (15), the analytical solution of p1p_{1} can be expressed as the inverse function of the sincsinc function. However, the sincsinc function is not a monotonic function and does not have an inverse function. Therefore, we need to truncate the sincsinc function and use its monotonic interval for channel estimation. Firstly, we can discard the sidelobes and only retain the main lobe. This is because near-field scenes are often accompanied by the presence of XL-MIMO. In XL-MIMO scenario, the sidelobe amplitude of the sincsinc function is much lower than that of the main lobe, so the anti noise performance of the sidelobes is worse. Secondly, according to equation (9), we know that p1p_{1} must be a negative value. Therefore, we only need to retain the part of the sincsinc function (1,0)(-1,0). On this interval, the sincsinc function is a monotonically increasing function and therefore has an inverse function. Thus, arcsinc(x){\rm arc}sinc(x) can be written as:

arcsinc(x)={y|x=sinc(y),y(1,0)}.{\rm arc}sinc(x)=\{y|x=sinc(y),y\in(-1,0)\}. (18)

Fig. 4 shows the graph of the arcsinc{\rm arc}sinc function.

Refer to caption
Figure 4: The arcsinc{\rm arc}sinc function.

Since we need to compare the autocorrelation function with the sincsinc function to obtain the value of p1p_{1}, the autocorrelation function also needs to capture points within the main lobe. According to equation (15), the estimated value of the autocorrelation function can be written as:

c^[η]=1T1Nξt=1Tn=ξ+1N𝒀[n,t]𝒀[nη,t],η[1,ξ],\hat{c}[\eta]=\frac{1}{T}\frac{1}{N-\xi}\sum_{t=1}^{T}\sum_{n=\xi+1}^{N}{\bm{Y}}[n,t]{\bm{Y}}^{*}[n-\eta,t],\,\,\,\eta\in[1,\xi], (19)

where ξ\xi\in\mathbb{Z} and η\eta are the number of points in the autocorrelation function and the number of shifts when performing autocorrelation functions. We can see that when the value of ξ\xi is small, the estimated value of the autocorrelation function is more accurate. However, the number of points in the autocorrelation function decreases, resulting in insufficient data for estimating p1p_{1}. On the contrary, if the value of ξ\xi is large, it will lead to inaccurate estimation of the autocorrelation function. In practical applications, if there is prior information, the appropriate ξ\xi should be selected based on the prior information. In order to consider a more general situation, we choose ξ=N2\xi=\lfloor\frac{N}{2}\rfloor. Then, we need to determine whether the autocorrelation function has zero points to determine if all data is within the main lobe. Due to the fact that autocorrelation functions may not necessarily have zero points when η\eta is an integer, we cannot extract data within the main lobe by finding η\eta values that satisfy c[η]=0c[\eta]=0. Therefore, we select a small value111Here we have chosen δ=0.1\delta=0.1 based on experience. In fact, the choice of δ\delta does not have a significant impact on the performance of the algorithm, so it is sufficient to choose the appropriate one when applying this algorithm. δ=0.1\delta=0.1 and use Υ\Upsilon to represent the set of η\eta that satisfies c[η]δc[\eta]\leq\delta, that is:

Υ={η|c[η]δ}.\Upsilon=\{\eta|c[\eta]\leq\delta\}. (20)

Thus, the number of points retained for the autocorrelation function is:

Nη={minΥ,Υ,ξ,Υ=.N_{\eta}=\left\{\begin{aligned} \min{\Upsilon},\,\,\,\Upsilon\neq\emptyset,\\ \xi,\,\,\,\Upsilon=\emptyset.\end{aligned}\right. (21)

Substituting χ=ηd\chi=\eta d, according to equations (15) and (16), the estimated value of p1p_{1} is:

p^1=1Nηη=1Nηarcsinc(c^[η])kηd2(NNη).\hat{p}_{1}=\frac{1}{N_{\eta}}\sum_{\eta=1}^{N_{\eta}}\frac{{\rm arc}sinc(\hat{c}[\eta])}{k\eta d^{2}(N-N_{\eta})}. (22)

According to equation (17), we use the estimated p1p_{1} to construct an equivalent far-field signal:

𝒀~[n,t]=eȷkp1(nd)2𝒀[n,t].\widetilde{\bm{Y}}[n,t]=e^{\jmath kp_{1}(nd)^{2}}{\bm{Y}}[n,t]. (23)

Then we use the MUltiple SIgnal Classificatio (MUSIC) algorithm on 𝒀~\widetilde{\bm{Y}} to estimate the direction of the maximum eigenvalue, which is p2p_{2}. Finally, the estimated channel is:

𝒉^[n]=eȷk(p^1n2d2+p^2nd).\hat{\bm{h}}[n]=e^{\jmath k(\hat{p}_{1}n^{2}d^{2}+\hat{p}_{2}nd)}. (24)

Since this algorithm obtains p1p_{1} by inverting the sincsinc function, we refer to it as the Inverse Sinc Function (ISF) algorithm. The details of this algorithm are summarized in Algorithm 1.

Algorithm 1 JAC-ISF algorithm
1:  Require: 𝒀\bm{Y}, NN, TT, ξ\xi.
2:  Estimate c[η]c[\eta] based on equation (19).
3:  Determine Υ\Upsilon based on equations (20).
4:  According to equation (21), determine the number of data NηN_{\eta} used to estimate p1p_{1}.
5:  According to equation (22), obtain the estimated value p^1\hat{p}_{1} of p1p_{1}.
6:  According to equation (23), form the equivalent far-field signal 𝒀~\widetilde{\bm{Y}}.
7:  Estimate parameter p2p_{2} using the MUSIC algorithm.
8:   Get h^\hat{h} according to equation (24).
9:  Get θ^\hat{\theta} and r^\hat{r} according to equations (9) and (10).

IV-B Gradient descent method for calculating CoA

In this subsection, we use gradient descent to solve the parameter p1p_{1}. Therefore, we call this algorithm the JAC-GD algorithm.

Similar to the JAC-ISF algorithm, we obtain an estimate of the autocorrelation function using equation (19). According to equation (16), we set the loss function as:

Loss=η=1ξ|c^[η]|sinc(kp1ηd2(Nξ))||.Loss=\sum_{\eta=1}^{\xi}\left|\hat{c}[\eta]-\left|sinc(kp_{1}\eta d^{2}(N-\xi))\right|\right|. (25)

The gradient of the Loss function is:

grad=dLossdp1.grad=\frac{{\rm d}Loss}{{\rm d}p_{1}}. (26)

We update p1p_{1} based on the learning rate α\alpha, that is:

p1(nitr)=p1(nitr1)α(nitr)grad,p_{1}(n_{\rm itr})=p_{1}(n_{\rm itr}-1)-\alpha(n_{\rm itr})grad, (27)

where nitrn_{\rm itr} is the index of the number of iterations. We adopt the learning rate of inverse time decay, that is:

α(nitr)=α011+γ(nitr1),\alpha(n_{\rm itr})=\alpha_{0}\frac{1}{1+\gamma(n_{\rm itr}-1)}, (28)

where α0\alpha_{0} is the initial learning rate and γ\gamma is the decay factor. The details of this algorithm are summarized in Algorithm 2.

Algorithm 2 JAC-GD algorithm
1:  Require: 𝒀\bm{Y}, NN, TT, α0\alpha_{0}, γ\gamma, NitrN_{\rm itr} and ξ\xi
2:  Initialize: P1=0P_{1}=0.
3:  Obtain the c^[η]\hat{c}[\eta] using equation (19).
4:  for nitr=1:Nitrn_{\rm itr}=1:N_{\rm itr} do
5:     Obtain the loss function through equation (25).
6:     Calculate the current gradient as equation (26).
7:     Update α\alpha according to equation (28).
8:     Update p1p_{1} according to equation (27).
9:  end for
10:  Obtain the equivalent far-field channel 𝒉~\tilde{\bm{h}} according to equation (23).
11:  Estimate p2p_{2} using MUSIC algorithm.
12:  Obtain the estimated channel 𝒉^\hat{\bm{h}} based on equation (24).
13:  Obtain the polar coordinates of the UE’s position based on equations (9) and (10).

V Cramer’s lower bound and time complexity analysis

V-A Analysis of Cramer’s Lower bound for Angle and Distance

We define the unknown parameter vector as:

ϵ1=[p1,p2,𝝍,𝝆,σ2]\epsilon_{1}=[p_{1},p_{2},{\bm{\psi}},{\bm{\rho}},\sigma^{2}] (29)

The probability density function of the received signal is:

p(𝒀|ϵ1)=1πNTdet(𝑹)e(𝒀𝝁H)𝑹1(𝒀𝝁),p({\bm{Y}}|\epsilon_{1})=\frac{1}{\pi^{NT}{\rm det}(\bm{R})}e^{-({\bm{Y}}-{\bm{\mu}}^{H}){\bm{R}}^{-1}({\bm{Y}}-{\bm{\mu}})}, (30)

where 𝑹=σ2𝑰NL{\bm{R}=\sigma^{2}{\bm{I}}_{NL}} and

𝝁=[s(1)hnearT(p1,p2),s(2)hnearT(p1,p2),,s(T)hnearT(p1,p2)].\begin{split}&{\bm{\mu}}=[s(1)h_{\rm near}^{\rm T}(p_{1},p_{2}),s(2)h_{\rm near}^{\rm T}(p_{1},p_{2}),\\ &\cdots,s(T)h_{\rm near}^{\rm T}(p_{1},p_{2})].\end{split} (31)

We define 𝔼{(ϵ^1ϵ1)(ϵ^1ϵ1)T}\mathbb{E}\{(\hat{\bm{\epsilon}}_{1}-{\bm{\epsilon}}_{1})(\hat{\bm{\epsilon}}_{1}-{\bm{\epsilon}}_{1})^{\rm T}\} as the covariance matrix of the unknown parameter vector ϵ\bm{\epsilon}, where ϵ^\hat{\bm{\epsilon}} is the unbiased estimator of ϵ\bm{\epsilon}. The covariance matrix has the following inequality relationship:

MSE([ϵ1]i)=𝔼{([ϵ^1]i[ϵ1]i)2}CRLB([ϵ1]i),{\rm MSE}([{\bm{\epsilon}}_{1}]_{i})=\mathbb{E}\{([\hat{\bm{\epsilon}}_{1}]_{i}-[{\bm{\epsilon}}_{1}]_{i})^{2}\}\geq{\rm CRLB}([{\bm{\epsilon}}_{1}]_{i}), (32)

where the CRLB\rm CRLB is cramer-rao lower bound (CRLB) and satisfies:

CRLB([ϵ1]i)=[FIM1(ϵ1)]i,i.{\rm CRLB}([{\bm{\epsilon}}_{1}]_{i})=[{\rm FIM}^{-1}({\bm{\epsilon}}_{1})]_{i,i}. (33)

The Fisher information matrix FIM1(ϵ1){\rm FIM}^{-1}({\bm{\epsilon}}_{1}) can be obtained by the Slepian-Bangs equation [30], which is shown in (34):

[FIM(ϵ1)]i,i=2{μH[ϵ1]i𝑹1μ[ϵ1]j}+Tr{𝑹1𝑹[ϵ1]i𝑹1𝑹[ϵ1]j}.\displaystyle\begin{split}[{\rm FIM}({\bm{\epsilon}}_{1})]_{i,i}&=2\mathcal{R}\{\frac{\partial\mu^{\rm H}}{\partial[{\bm{\epsilon}}_{1}]_{i}}{\bm{R}}^{-1}\frac{\partial\mu}{\partial[{\bm{\epsilon}}_{1}]_{j}}\}\\ &+{\rm Tr}\{{\bm{R}}^{-1}\frac{\partial{\bm{R}}}{\partial[{\bm{\epsilon}}_{1}]_{i}}{\bm{R}}^{-1}\frac{\partial{\bm{R}}}{\partial[{\bm{\epsilon}}_{1}]_{j}}\}.\end{split} (34)

When TT is large, the dimensionality of the FIMs also becomes large, making the inverse operation more difficult. Therefore, we solve FIM1{\rm FIM}^{-1} by solving the Schur complement of the matrix. After simplification, we can obtain the CRLB of unknown parameters p1p_{1} and p2p_{2}:

CRLB(p1)=[FIM1(ϵ1)]1,1=σ22k2f(0)𝝆22f(0)f(2)f2(2)(f(0)f(2)f2(1))(f(0)f(4)f2(2))(f(0)f(3)f(1)f(2))2,\begin{split}&{\rm CRLB}(p_{1})=[{\rm FIM}^{-1}({\bm{\epsilon}}_{1})]_{1,1}=\frac{\sigma^{2}}{2k^{2}}\frac{f(0)}{\|{\bm{\rho}}\|_{2}^{2}}\\ &\frac{f(0)f(2)-f^{2}(2)}{(f(0)f(2)-f^{2}(1))(f(0)f(4)-f^{2}(2))(f(0)f(3)-f(1)f(2))^{2}},\end{split} (35)
CRLB(p2)=[FIM1(ϵ1)]2,2=σ22k2f(0)𝝆22f(0)f(4)f2(2)(f(0)f(2)f2(1))(f(0)f(4)f2(2))(f(0)f(3)f(1)f(2))2,\begin{split}&{\rm CRLB}(p_{2})=[{\rm FIM}^{-1}({\bm{\epsilon}}_{1})]_{2,2}=\frac{\sigma^{2}}{2k^{2}}\frac{f(0)}{\|{\bm{\rho}}\|_{2}^{2}}\\ &\frac{f(0)f(4)-f^{2}(2)}{(f(0)f(2)-f^{2}(1))(f(0)f(4)-f^{2}(2))(f(0)f(3)-f(1)f(2))^{2}},\end{split} (36)

where f(x)=n=1N(nd)xf(x)=\sum_{n=1}^{N}(nd)^{x}.

For the localization problem, we want to know the CRLB of θ\theta and rr, so we define the parameter vector ϵ2\bm{\epsilon}_{2} as:

ϵ2=g(ϵ1)=[θ,r,𝝍T,𝝆T].\bm{\epsilon}_{2}=g(\bm{\epsilon}_{1})=[\theta,r,\bm{\psi}^{\rm T},\bm{\rho}^{\rm T}]. (37)

We exploit the theory of parameter transformation for CRLB of ϵ\bm{\epsilon} [31]:

CRLB(ϵ2)=g(ϵ1)ϵ1CRLB(ϵ1)(g(ϵ1)ϵ1)T.{\rm CRLB}(\bm{\epsilon}_{2})=\frac{\partial g(\bm{\epsilon}_{1})}{\bm{\epsilon}_{1}}{\rm CRLB}(\bm{\epsilon}_{1})(\frac{\partial g(\bm{\epsilon}_{1})}{\bm{\epsilon}_{1}})^{\rm T}. (38)

By combining equations (9) and (10), we can obtain the CRLB of θ\theta and rr as follows:

CRLB(θ)=σ221k2d2cos2(θ)f(0)𝝆22f(0)f(4)f2(2)(f(0)f(2)f2(1))(f(0)f(4)f2(2))(f(0)f(3)f(1)f(2))2,\begin{split}&{\rm CRLB}(\theta)=\frac{\sigma^{2}}{2}\frac{1}{k^{2}d^{2}\cos^{2}(\theta)}\frac{f(0)}{\|\bm{\rho}\|_{2}^{2}}\\ &\frac{f(0)f(4)-f^{2}(2)}{(f(0)f(2)-f^{2}(1))(f(0)f(4)-f^{2}(2))(f(0)f(3)-f(1)f(2))^{2}},\end{split} (39)

and (40).

According to equation (40), we can see that the larger rr, the higher the CRLB\rm CRLB, and when rr\rightarrow\infty, the CRLB(r){\rm CRLB}(r)\rightarrow\infty. This is consistent with the conclusion that the array cannot determine the distance from the user to the array in far-field situations.

CRLB(r)=2σ2f(0)𝝆22r2k2d4cos4θd2sin2θ(f(0)f(4)f2(2))2rdsinθ(f(1)f(2)f(0)f(3))+r2(f(0)f(2)f2(1))(f(0)f(2)f2(1))(f(0)f(4)f2(2))(f(0)f(3)f(1)f(2))2.{\rm CRLB}(r)=2\sigma^{2}\frac{f(0)}{\|\bm{\rho}\|_{2}^{2}}\frac{r^{2}}{k^{2}d^{4}\cos^{4}\theta}\frac{d^{2}\sin^{2}\theta(f(0)f(4)-f^{2}(2))-2rd\sin\theta(f(1)f(2)-f(0)f(3))+r^{2}(f(0)f(2)-f^{2}(1))}{(f(0)f(2)-f^{2}(1))(f(0)f(4)-f^{2}(2))-(f(0)f(3)-f(1)f(2))^{2}}. (40)

V-B Time complexity analysis of JAC-ISF algorithm and JAC-GD algorithm

The computational complexities of the proposed ISF algorithm is analyzed, which are summarized in Table I. For solving and the autocorrelation function, i.e. equation (19), the time complexity is 𝒪(TN)\mathcal{O}(TN). The time complexity of the process of extracting the autocorrelation function in equations (20) and (21) are 𝒪(N)\mathcal{O}(N). The time complexity of using autocorrelation function to estimate p1p_{1} in equation (22) is 𝒪(Nη)\mathcal{O}(N_{\eta}), but sincsinc NηN_{\eta} is proportional to NN, the time complexity is 𝒪(N)\mathcal{O}(N). In summary, the total time complexity of estimateing p1p_{1} is 𝒪(TN)\mathcal{O}(TN). The process of estimating p2p_{2} is the MUSIC algorithm, so the time complexity is 𝒪(TN)\mathcal{O}(TN). Finally, the total time complexity of the ISF algorithm is 𝒪(TN)\mathcal{O}(TN).

In the P-SOWP algorithm, SS refers to the number of grids divided by distance. Although [32] mentions that SS is not a large number compared to NN. However, as NN increases, SS also increases synchronously. This is because when NN increases, the distance resolution of the array increases. In the time complexity of the P-SIGW algorithm, NiterN_{\rm iter} refers to the number of iterations of the algorithm.

Through the comparison of time complexity in Table I, we found that the time complexity of most near-field estimation algorithms is the time complexity of distance estimation multiplied by the time complexity of angle estimation [33]. This is because most algorithms use the principle of cross-correlation to estimate the near-field channel. The principle of cross-correlation refers to determining which space a user is in by examining the correlation between channels and different spaces. This type of channel estimation algorithm based on cross-correlation relies on orthogonal spatial decomposition. In near-field problems, it is difficult to divide the entire space into different orthogonal spaces due to the coupling of distance and direction variables. Moreover, due to the addition of distance as a dimension for measuring space, the time complexity of near-field algorithms exhibits multiplicative time complexity. The JAC model and ISF algorithm proposed in this paper decouple two parameters in near-field channel estimation, and make the time complexity of the entire algorithm exhibit additive time complexity in terms of angle and distance.

TABLE I: Time Complexity of Near Field Algorithms
Algorithm name Time complexity
Proposed ISF 𝒪(TN)\mathcal{O}(TN)
P-SOWP 𝒪(TNS)\mathcal{O}(TNS)
P-SIGW 𝒪(TNS)+𝒪(NiterT2)\mathcal{O}(TNS)+\mathcal{O}(N_{\rm iter}T^{2})

VI Simulation Results

In this section, numerical results are provided to validate the effectiveness of the proposed algorithm. In our simulations, the BS is a linear array with N=200N=200. The carrier frequency is 30GHz. Under this setting, the Rayleigh distance of BS is 200200m. In common cellular scenarios, most users are located within the near-field range of this scenario. Due to the poor scattering, diffraction, and reflection ability of high-frequency signals, we only use information from the LoS path for channel estimation, and the NLoS path is considered as the noise component.

VI-A The Results of Channel Estimation

We use two metrics to measure the performance of the proposed ISF algorithm, which are achievable rate and normalized minimum mean square error (NMSE) criterion. The achievable rate refers to the information rate that the BS can receive by utilizing the estimated channel for beamforming, that is:

Rate=log2(1+PsNσ2|𝒉H𝒉^|2𝒉^22),{\rm Rate}=log_{2}(1+\frac{P_{s}N}{\sigma^{2}}\frac{|{\bm{h}}^{H}\hat{\bm{h}}|^{2}}{\|{\hat{\bm{h}}}\|_{2}^{2}}), (41)

where PsP_{s} is the power of source signal. If the Channel State Information (CSI) is perfect, the achievable rate is:

Ratemax=log2(1+PsNσ2).{\rm Rate}_{\rm max}=log_{2}(1+\frac{P_{s}N}{\sigma^{2}}). (42)

The NMSE is:

NMSE=𝔼(𝒉𝒉^22𝒉22).{\rm NMSE}=\mathbb{E}(\frac{\|{\bm{h}}-\hat{\bm{h}}\|_{2}^{2}}{\|{\bm{h}}\|_{2}^{2}}). (43)

For a comparative analysis, the proposed codebook schemes are compared with the following schemes:

  • SWOMP: It is an angle domain on-grid algorithm [34]. It only searches for angles and not distances, making it suitable for far-field scenario rather than near-field.

  • SS-SIGW-OLS: It is an angle domain off-grid algorithm [35], which is only applicable to far-field search like the SWOMP algorithm. Due to its lack of grid quantization errors, its algorithm has higher precision.

  • P-SOWP: It is an on-grid algorithm in polar domain [32]. This algorithm divides the near-field space into different grids based on angle and distance, and then searches for which grid the UE is in. The angle division method is the same as the far-field, which is uniformly divided. The division method in terms of distance is non-uniform, with denser grids at closer distances.

  • P-SIGW: This is an off-grid algorithm in the polar domain [32], which is based on the P-SOWP algorithm and uses gradient descent to continue searching within the grid, ultimately determining the location of the UE.

  • MUSIC: This is a classic algorithm for far-field spatial spectrum estimation. Because in both JAC-ISF and JAC-GD algorithms, the estimation of p2p_{2} is based on the MUSIC algorithm, we use the MUSIC algorithm as a comparison scheme to demonstrate to what extent the estimation of p1p_{1} in the JAC scheme compensates for the error in estimating the near-field using the MUSIC algorithm.

Fig. 5 shows the comparison of achievable rates between proposed JAC-ISF, JAC-GD algorithms and MUSIC algorithm at different SNRs. All algorithms are performed with a snapshot points of 32. In this simulation experiment, the UE is located on the zz-axis at a distance of 10 m to 50 m, which belonging to a deeper near-field range. We can see that when SNR=-10 dB, due to excessive noise, JAC-ISF cannot accurately estimate p1p_{1}, resulting in its performance being similar to the MUSIC algorithm. This indicates that the estimation of p1p_{1} is close to complete failure at -10 dB. However, as the SNR increases, the JAC-ISF algorithm begins to significantly outperform the MUSIC algorithm. Especially when the SNR exceeds 5 dB, the performance of the JAC-ISF algorithm almost conforms to the bound curve. This experiment proves that, except for a few extreme scenarios, the JAC-ISF algorithm is effective in most cases. For JAC-GD, we can see that its performance is also better than the MUSIC algorithm at -10 dB, indicating that JAC-GD is still effective at low signal-to-noise ratios. And JAC-GD algorithm is closer to the solution under perfect CSI than JAC-ISF algorithm, which indicates the excellent performance of JAC-GD.

Refer to caption
Figure 5: Comparison of achievable rates of algorithms under different SNR with T=32T=32 and |𝒑|(10,50)|{\bm{p}}|\in(10,50).

Fig. 6 shows the NMSE performance under different SNRs. The Fig. 6 (a) shows the curve for a snapshot points of 32. We can see that the proposed JAC-ISF and JAC-GD algorithms perform significantly better than other algorithms in the low SNR regime, but their performance limits are not as good as P-SIGW in the high SNR regime. This indicates that the JAC-ISF and JAC-GD algorithms have strong anti noise performance and are more stable in the presence of high noise levels. In the high SNR regime, the JAC-GD and JAC-ISF algorithms perform better than the SWOMP, SS-SIGW-OLS, and P-SOWP algorithms, but worse than the P-SIGW algorithm. This indicates that the upper performance limits of the JAC-ISF and JAC-GD algorithms are not as good as the P-SIGW algorithm, but in the SNR regime of normal communication scenarios, the JAC-ISF and JAC-GD algorithms perform better. Fig. 6 (b) shows the algorithm performance when the TT is 8. We found that compared to Fig. 6 (a), the P-SIGW algorithm requires higher SNRs to exceed the performance of JAC-ISF and JAC-GD. At this point, only when the SNR reaches 14 dB and 20 dB or above, can the performance of the p-algorithm surpass that of JAC-ISF and JAC-GD, respectively. In addition, we found that the performance curves of JAC-ISF and JAC-GD are not significantly different in Fig. 6 (a) and (b). This indicates that the performance of JAC-ISF and JAC-GD algorithms is less affected by the number of snapshots, and subsequent results will also demonstrate this.

Refer to caption
(a) T=32T=32
Refer to caption
(b) T=8T=8
Figure 6: Comparison of NMSE performance between ISF algorithm and other algorithms under different SNRs.

Fig. 7 shows the NMSE performance vers distances. We can see that under the same snapshot points, the performance of the JAC-ISF algorithm is significantly better than other algorithms except the JAC-GD. We can find that when the communication distance is greater than 20 m, the JAC-ISF algorithm can save twice the number of snapshots compared to the P-SIGW algorithm. This indicates that the JAC-ISF algorithm is not only easy to implement, but also has low time complexity. For the JAC-GD algorithm, we can see that its performance at T=8T=8 is even better than that of P-SIGW at T=32T=32. This demonstrates the significant advantage of the JAC-GD algorithm in the low SNR regime.

Refer to caption
Figure 7: Comparison of NMSE at different distances with SNR=5{\rm SNR}=5.

Fig. 8 shows the performance comparison of various algorithms under different snapshot points, with the UE distance set to 10 m to 100 m. As illustrated in the Fig. 8, the performance of the JAC-ISF and JAC-GD algorithms remain basically unchanged as the number of snapshots increases. This is because in XL-MIMO, the number of spatial samples is relatively large, so even with just one snapshot, the spatial autocorrelation function can be accurately estimated, thereby accurately estimating the channel.

Refer to caption
Figure 8: Comparison of achievable rate at different snapshot points with SNR=5\rm SNR=5.

VI-B The Results of Position Estimation

In the problem of location estimation, we use the Root Mean Squared Error (RMSE) criterion to measure algorithm performance, which can be written as:

RMSE=𝒑𝒑^22,{\rm RMSE}=\|{\bm{p}}-{\hat{\bm{p}}}\|_{2}^{2}, (44)

where the 𝒑^{\hat{\bm{p}}} is the estimated UE’s position, which can be obtained using θ^\hat{\theta} and r^\hat{r}.

Refer to caption
Figure 9: Comparison of RMSE at different SNRs with T=8T=8.

Fig. 9-11 show the RMSE performance of various algorithms at different SNR, snapshot points, and distance. Due to the CRLB depends on rr in equation (40), we only present CRLB when comparing performance at different positions. The Fig. 9 shows the performance comparison at T=8T=8. We can see that the JAC-GD algorithm performs well at low SNR regime. The JAC-ISF algorithm performs slightly better than P-SIGW in the low SNR range, but as the SNR increases, the trend of accuracy improvement in the JAC-ISF algorithm is not as fast as the other two algorithms. This shows that the JAC-ISF algorithm has performance advantages when resources are scarce, while the JAC-GD algorithm has performance advantages in most scenarios. The conclusion of Fig. 10 is similar to the above conclusion, that the JAC-ISF algorithm has performance advantages at less snapshot points, while the JAC-GD algorithm generally has performance advantages. In Fig. 11, we present the estimation results of rr and θ\theta separately, and plot the CRLB as a benchmark. We can see that the JAC-GD algorithm is closest to the CRLB. This demonstrates that the JAC-GD algorithm not only has low time complexity, but also has significant performance advantages.

Refer to caption
Figure 10: Comparison of RMSE with different snapshot points with SNR=5{\rm SNR}=5.
Refer to caption
(a) θ\theta
Refer to caption
(b) rr
Figure 11: Comparison of RMSE at different distances with SNR=5{\rm SNR}=5.

VII Conclusion

In this paper, we proposed a novel model and algorithm for near-field uplink channel estimation based on autocorrelation function, which solved the parameter coupling problem of near-field channel estimation with linear time complexity. Specifically, we first decomposed the near-field channel and concluded that near-field channel estimation only requires estimating the parameters p1p_{1} and p2p_{2}. By analyzing the physical meanings of p1p_{1} and p2p_{2}, we found that essentially we need to estimate the CoA and AoA information of the incoming wave. Through deduction, we found that the estimation problem of CoA is similar to that of Doppler. Therefore, by analogy with the definition of coherence time, we provided the definition of the coherence space of a channel. Based on the above analysis, we got the conclusion that p1p_{1} can be solved by computing the autocorrelation function of the received signal. By solving the autocorrelation function, we found that it is only related to p1p_{1} and independent of p2p_{2}, which means we successfully decoupled p1p_{1} and p2p_{2}. After estimating p1p_{1}, the process of estimating p2p_{2} degenerated into a far-field channel estimation problem. Thus, we used the classical MUSIC algorithm to estimate p2p_{2}. Based on the above model, we proposed two algorithms, i.e. the JAC-ISF and JAC-GD algorithm. Extensive simulation results under various practical setups demonstrated that the JAC-ISF and JAC-GD algorithm has lower time complexity compared to other algorithms and can achieved better performance with less snapshot points. In the future, we will design near-field algorithms for more complex scenarios based on the JAC model.

References

  • [1] T. L. Marzetta, “Massive MIMO: an introduction,” Bell Labs Tech. J., vol. 20, pp. 11–22, Mar. 2015.
  • [2] E. Björnson and L. Sanguinetti, “Power scaling laws and near-field behaviors of massive MIMO and intelligent reflecting surfaces,” IEEE Open Journal of the Communications Society, vol. 1, pp. 1306–1324, Sep. 2020.
  • [3] E. Björnson, L. Sanguinetti, H. Wymeersch, J. Hoydis, and T. L. Marzetta, “Massive MIMO is a reality - what is next?: Five promising research directions for antenna arrays,” Digit. Signal Process., vol. 94, pp. 3–20, Nov. 2019.
  • [4] A. Amiri, M. Angjelichinoski, E. de Carvalho, and R. W. Heath, “Extremely large aperture massive MIMO: Low complexity receiver architectures,” in 2018 IEEE Globecom Workshops (GC Wkshps), Dec. 2018.
  • [5] H. Wang, A. Kosasih, C.-K. Wen, S. Jin, and W. Hardjawana, “Expectation propagation detector for extra-large scale massive MIMO,” IEEE Transactions on Wireless Communications, vol. 19, no. 3, pp. 2036–2051, Mar. 2020.
  • [6] X. Wei and L. Dai, “Channel estimation for extremely large-scale massive MIMO: far-field, near-field, or hybrid-field?” IEEE Commun. Lett., vol. 26, no. 1, pp. 177–181, Nov. 2022.
  • [7] X. Zhang, H. Zhang, J. Zhang, C. Li, Y. Huang, and L. Yang, “Codebook design for extremely large-scale MIMO systems: Near-field and far-field,” IEEE Trans. Commun., vol. 72, no. 2, pp. 1191–1206, Feb. 2024.
  • [8] C. Wang, X. You, X. Gao, X. Zhu, Z. Li, C. Zhang, H. Wang, Y. Huang, Y. Chen, H. Haas, J. S. Thompson, E. G. Larsson, M. D. Renzo, W. Tong, P. Zhu, X. Shen, H. V. Poor, and L. Hanzo, “On the road to 6G: Visions, requirements, key technologies, and testbeds,” IEEE Commun. Surv. Tutorials, vol. 25, no. 2, pp. 905–974, Feb. 2023.
  • [9] N. Shlezinger, G. C. Alexandropoulos, M. F. Imani, Y. C. Eldar, and D. R. Smith, “Dynamic metasurface antennas for 6G extreme massive MIMO communications,” IEEE Wireless Communications, vol. 28, no. 2, pp. 106–113, Jan. 2021.
  • [10] N. Yang and A. Shafie, “Terahertz communications for massive connectivity and security in 6G and beyond era,” IEEE Communications Magazine, vol. 62, no. 2, pp. 72–78, Feb. 2022.
  • [11] Z. Chen, C. Han, Y. Wu, L. Li, C. Huang, Z. Zhang, G. Wang, and W. Tong, “Terahertz wireless communications for 2030 and beyond: A cutting-edge frontier,” IEEE Communications Magazine, vol. 59, no. 11, pp. 66–72, Nov. 2021.
  • [12] Y. Pan, C. Pan, S. Jin, and J. Wang, “RIS-aided near-field localization and channel estimation for the Terahertz system,” IEEE Journal of Selected Topics in Signal Processing, vol. 17, no. 4, pp. 878–892, Jun. 2023.
  • [13] B. Zheng and R. Zhang, “Simultaneous transmit diversity and passive beamforming with large-scale intelligent reflecting surface,” IEEE Trans. Wirel. Commun., vol. 22, no. 2, pp. 920–933, Feb. 2023.
  • [14] Q. Wu and R. Zhang, “Intelligent reflecting surface enhanced wireless network via joint active and passive beamforming,” IEEE Trans. Wirel. Commun., vol. 18, no. 11, pp. 5394–5409, Nov. 2019.
  • [15] X. Peng, L. Zhao, Y. Jiang, J. Liu, and W. Li, “Channel estimation for extremely large-scale massive MIMO systems in hybrid-field channel,” in IEEE/CIC International Conference on Communications in China, ICCC 2023, Dalian, China, August 10-12, 2023.   IEEE, Aug. 2023, pp. 1–6.
  • [16] A. de Jesus Torres, L. Sanguinetti, and E. Björnson, “Near- and far-field communications with large intelligent surfaces,” in 54th Asilomar Conference on Signals, Systems, and Computers, ACSCC 2020, Pacific Grove, CA, USA, November 1-4, 2020, M. B. Matthews, Ed.   IEEE, Nov. 2020, pp. 564–568.
  • [17] H. Zhang, N. Shlezinger, F. Guidi, D. Dardari, M. F. Imani, and Y. C. Eldar, “Near-field wireless power transfer for 6G internet of everything mobile networks: Opportunities and challenges,” IEEE Communications Magazine, vol. 60, no. 3, pp. 12–18, Mar. 2022.
  • [18] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Communications Magazine, vol. 61, no. 1, pp. 40–46, Sep. 2023.
  • [19] K. T. Selvan and R. Janaswamy, “Fraunhofer and fresnel distances: Unified derivation for aperture antennas,” IEEE Antennas and Propagation Magazine, vol. 59, no. 4, pp. 12–15, Aug. 2017.
  • [20] T. Wang, K. Zhang, Y. Zhang, H. Tong, and C. Yin, “Near-field beam management in lis-assisted mmWave systems,” in 13th International Conference on Wireless Communications and Signal Processing, WCSP 2021, Changsha, China, October 20-22, 2021.   IEEE, Oct. 2021, pp. 1–6.
  • [21] S. Hu, M. C. Ilter, and H. Wang, “Near-field beamforming for large intelligent surfaces,” in 2022 IEEE 33rd Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Kyoto, Japan, September 12-15, 2022.   IEEE, Jun. 2022, pp. 1367–1373.
  • [22] F. Zheng, “Extremely large-scale array systems: Near-filed codebook design and performance analysis,” ArXiv, vol. abs/2306.01458, 2023.
  • [23] Y. Xie, B. Ning, L. Li, and Z. Chen, “Near-field beam training in THz communications: The merits of uniform circular array,” IEEE Wireless Communications Letters, vol. 12, no. 4, pp. 575–579, Jan. 2023.
  • [24] S. Hu, H. Wang, and M. C. Ilter, “Design of near-field beamforming for large intelligent surfaces,” IEEE Transactions on Wireless Communications, vol. 23, no. 1, pp. 762–774, Jun. 2023.
  • [25] B. Friedlander, “Localization of signals in the near-field of an antenna array,” IEEE Trans. Signal Process., vol. 67, no. 15, pp. 3885–3893, Aug. 2019.
  • [26] A. Kosasih, . T. Demir, and E. Björnson, “Parametric near-field channel estimation for extremely large aperture arrays,” in 2023 57th Asilomar Conference on Signals, Systems, and Computers, Oct. 2023, pp. 162–166.
  • [27] X. Guo, Y. Chen, and Y. Wang, “Compressed channel estimation for near-field XL-MIMO using triple parametric decomposition,” IEEE Transactions on Vehicular Technology, vol. 72, no. 11, pp. 15 040–15 045, May. 2023.
  • [28] E. Björnson, Ö. T. Demir, and L. Sanguinetti, “A primer on near-field beamforming for arrays and reconfigurable intelligent surfaces,” in 55th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2021, Pacific Grove, CA, USA, October 31 - November 3, 2021.   IEEE, Oct. 2021, pp. 105–112.
  • [29] Y. Jiang, F. Gao, M. Jian, S. Zhang, and W. Zhang, “Reconfigurable intelligent surface for near field communications: Beamforming and sensing,” IEEE Trans. Wirel. Commun., vol. 22, no. 5, pp. 3447–3459, May. 2023.
  • [30] S. L. Collier, “Fisher information for a complex gaussian random variable: beamforming applications for wave propagation in a random medium,” IEEE Trans. Signal Process., vol. 53, no. 11, pp. 4236–4248, November. 2005.
  • [31] S. K. Sengijpta, “Fundamentals of statistical signal processing: Estimation theory,” Control Engineering Practice, vol. 37, no. 4, pp. 465–466, 1994.
  • [32] M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: far-field or near-field?” IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022.
  • [33] L. Li, H. Li, Z. Chen, W. Chen, and S. Li, “An analytical range-angle dependent beam focusing model for Terahertz linear antenna array,” IEEE Wireless Communications Letters, vol. 11, no. 9, pp. 1870–1874, Jun. 2022.
  • [34] J. Rodríguez-Fernández, N. González-Prelcic, K. Venugopal, and R. W. Heath, “Frequency-domain compressive channel estimation for frequency-selective hybrid millimeter wave MIMO systems,” IEEE Transactions on Wireless Communications, vol. 17, no. 5, pp. 2946–2960, Mar. 2018.
  • [35] N. González-Prelcic, H. Xie, J. Palacios, and T. Shimizu, “Wideband channel tracking and hybrid precoding for mmWave MIMO systems,” IEEE Transactions on Wireless Communications, vol. 20, no. 4, pp. 2161–2174, Oct. 2021.