This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Fluid Antennas-Enabled Multiuser Uplink: A Low-Complexity Gradient Descent for Total Transmit Power Minimization

Guojie Hu, Qingqing Wu, Senior Member, IEEE, Kui Xu, Member, IEEE, Jian Ouyang, Member, IEEE, Jiangbo Si, Senior Member, IEEE, Yunlong Cai, Senior Member, IEEE, and Naofal Al-Dhahir, Fellow, IEEE This work was supported in part by the Natural Science Foundations of China under Grants 62201606. Guojie Hu is with the College of Communication Engineering, Rocket Force University of Engineering, Xi’an 710025, China (email: [email protected]). Qingqing Wu is with the Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China ([email protected]). Kui Xu is with the College of Communications Engineering, the Army of Engineering University, Nanjing 210007, China ([email protected]). Jian Ouyang is with the Institute of Signal Processing and Transmission, Nanjing University of Posts and Telecommunications, Nanjing 210003, China ([email protected]). Jiangbo Si is with the Integrated Service Networks Lab of Xidian University, Xi’an 710100, China ([email protected]). Yunlong Cai is with the College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China (email: [email protected]). Naofal Al-Dhahir is with the Department of Electrical and Computer Engineering, The University of Texas at Dallas, Richardson, TX 75080 USA ([email protected]).
Abstract

We investigate multiuser uplink communications from multiple single-antenna users to a base station (BS), which is equipped with multiple fluid antennas (FAs) and adopts zero-forcing receivers to decode multiple signals. We aim to optimize antennas’ positions at the BS, to minimize the total transmit power of all users subject to the minimum rate requirement. After applying transformations, we show that the problem is equivalent to minimizing the sum of each eigenvalue’s reciprocal of a matrix, which is a function of all antennas’ positions at the BS. Subsequently, the projected gradient descent (PGD) method is utilized to find a locally optimal solution. In particular, different from the latest related work, we exploit the eigenvalue decomposition to successfully derive a closed-form gradient for the PGD, which facilitates the practical implementation greatly. We demonstrate by simulations that via careful optimization for all antennas’ positions in our proposed design, the total transmit power of all users can be decreased significantly as compared to competitive benchmarks.

Index Terms:
Fluid antennas, multiuser uplink, total transmit power minimization, projected gradient descent.

I Introduction

Beamforming, which exploits the degree of freedom (DoF) in the spatial domain, is a powerful technique for improving system capacity [1]. In conventional beamforming, positions of antennas at transceivers are fixed which may limit the gains of beamforming depending on channel conditions.

To mitigate the above deficiency, the intelligent reflecting surface (IRS) technique has been proposed and proven to be capable of reconfiguring wireless channels by adjusting passive IRS reflecting coefficients [2]. As another promising technology, fluid antennas (FAs) [3]-[6] has emerged recently. Although its operating principle is different from that of the IRS, FAs can also reshape channel environments artificially, by adaptively adjusting positions of all antennas (connected to the radio frequency chains via flexible cables) supported by the stepper motors or servos. Unlike antenna selection (AS) which requires more candidate antennas, higher hardware cost and larger overhead of channel estimation, and concurrently unlike rotatable uniform linear array (RULA) which just mechanically rotates the transmit/receive array and cannot fully exploit spatial channel variation, FAs fully exploites the channel variation resulting from changes in antennas’ positions to achieve a higher spatial diversity without causing additional hardware or algorithm cost [7]. Driven by these potential advantages, earlier works have applied the technology of FAs to further enhance capacities of multiple-input multiple-output (MIMO) systems [7]-[8], multiuser uplink/downlink communications [9]-[10], physical-layer security systems [11] or interference networks [12].

Refer to caption
Figure 1: Illustration of the system model.

In this letter, as in [9], we focus on FAs-enabled classical multiuser uplink communications. Specifically, we assume multiple single-antenna users that intend to concurrently transmit their signals to a base station (BS), which is equipped with FAs and adopts the widely used zero-forcing (ZF) receivers to detect multiple signals. By carefully optimizing positions of all antennas at the BS, our goal is to minimize the total transmit power of all users subject to the minimum rate requirement for each user. The formulated problem is highly non-convex, and we develop a projected gradient descent (PGD) method to find a locally optimal solution. Unlike [9] which exploits the original definition-based method to compute the gradient in each iteration, the key contribution of this letter is that we successfully derive a closed-form gradient in each iteration with the help of the eigenvalue decomposition. This novelty greatly accelerates the implementation of the PGD method. Numerical results are performed to demonstrate that our proposed method with FAs can significantly decrease the total transmit power of all users as compared to competitive benchmarks.

II System Model and Problem Formulation

As shown in Fig. 1, we consider multiuser uplink communications from MM single-antenna users {Ui}i=1M\left\{{{{\rm{U}}_{i}}}\right\}_{i=1}^{M} to the BS equipped with NN FAs distributed along a linear dimension, with NMN\geq M. Consider the line-of-sight (LoS) propagation environment, the channel vector between the BS and Ui{{{\rm{U}}_{i}}} is denoted by111As shown in the follows, the simple LoS environment is considered here since we aim to demonstrate that our proposed design of FAs’ movements just relies on the slow-changing property of statistical channel state information (CSI). In the simulations, we will show the effectiveness of the proposed FAs’ movement rule when facing random Rician fading channels.

𝐡i(𝐱)=[ej2πλx1sinθi,ej2πλx2sinθi,,ej2πλxNsinθi]T,\begin{split}{}{{\bf{h}}_{i}}({\bf{x}})={\left[{{e^{j\frac{{2\pi}}{\lambda}{x_{1}}\sin{\theta_{i}}}},{e^{j\frac{{2\pi}}{\lambda}{x_{2}}\sin{\theta_{i}}}},...,{e^{j\frac{{2\pi}}{\lambda}{x_{N}}\sin{\theta_{i}}}}}\right]^{T}},\end{split} (1)

where λ\lambda is the signal wavelength, θi{{\theta_{i}}} is the angle of arrival (AoA) to the BS at Ui{\rm{U}}_{i}, and xnx_{n} denotes the adjustable position of the nn-th antenna at the BS, with 𝐱=[x1,x2,,xN]TN×1{\bf{x}}={\left[{{x_{1}},{x_{2}},...,{x_{N}}}\right]^{T}}\in{{\mathbb{R}}^{N\times 1}}. For the multiuser uplink, the received signals 𝐲M×1{\bf{y}}\in{{\mathbb{C}}^{M\times 1}} at the BS can be expressed as

𝐲=𝐖H𝐇(𝐱)𝐏1/2𝐬+𝐖H𝐧,\begin{split}{}{\bf{y}}={{\bf{W}}^{H}}{\bf{H}}({\bf{x}}){{\bf{P}}^{1/2}}{\bf{s}}+{{\bf{W}}^{H}}{\bf{n}},\end{split} (2)

where 𝐇(𝐱)=[𝐡1(𝐱),𝐡2(𝐱),,𝐡M(𝐱)]N×M{\bf{H}}({\bf{x}})=\left[{{{\bf{h}}_{1}}({\bf{x}}),{{\bf{h}}_{2}}({\bf{x}}),...,{{\bf{h}}_{M}}({\bf{x}})}\right]\in{{\mathbb{C}}^{N\times M}}, 𝐏1/2=diag{[P1,P2,,PM]}{{\bf{P}}^{1/2}}={\rm{diag}}\left\{{\left[{\sqrt{{P_{1}}},\sqrt{{P_{2}}},...,\sqrt{{P_{M}}}}\right]}\right\}, in which PiP_{i} denotes the transmit power of Ui{\rm{U}}_{i}, 𝐬=[s1,s2,,sM]TM×1{\bf{s}}={\left[{{s_{1}},{s_{2}},...,{s_{M}}}\right]^{T}}\in{{\mathbb{C}}^{M\times 1}}, in which sis_{i} denotes the transmitted signal of Ui{\rm{U}}_{i} and 𝔼[|si|2]=1,i=1,,M{\mathbb{E}}\left[{{{\left|{{s_{i}}}\right|}^{2}}}\right]=1,\forall i=1,...,M. In addition, 𝐖=[𝐰1,𝐰2,,𝐰M]N×M{\bf{W}}=\left[{{{\bf{w}}_{1}},{{\bf{w}}_{2}},...,{{\bf{w}}_{M}}}\right]\in{{\mathbb{C}}^{N\times M}} is the receive combining matrix at the BS, in which 𝐰i{{\bf{w}}_{i}} is the combining vector for the signal sis_{i}, and 𝐧=[n1,n2,,nN]T{\bf{n}}={\left[{{n_{1}},{n_{2}},...,{n_{N}}}\right]^{T}}, in which nin_{i} is the additive white Gaussian noise at the ii-th BS antenna, with ni𝒞𝒩(0,σ2){n_{i}}\sim{\cal C}{\cal N}(0,{\sigma^{2}}). Based on (2), the received signal-to-interference-plus-noise ratio (SINR) of the signal sis_{i} at the BS is derived as

γi=Pi|𝐰iH𝐡i(𝐱)|2k=1,kiMPk|𝐰iH𝐡k(𝐱)|2+𝐰i22σ2.\begin{split}{}{\gamma_{i}}=\frac{{{P_{i}}{{\left|{{\bf{w}}_{i}^{H}{{\bf{h}}_{i}}({\bf{x}})}\right|}^{2}}}}{{\sum\nolimits_{k=1,k\neq i}^{M}{{P_{k}}{{\left|{{\bf{w}}_{i}^{H}{{\bf{h}}_{k}}({\bf{x}})}\right|}^{2}}+\left\|{{{\bf{w}}_{i}}}\right\|_{2}^{2}{\sigma^{2}}}}}.\end{split} (3)

In this letter, we assume that the BS adopts the widely used linear ZF detector for processing multiple signals, due to its low implementation complexity especially when number of antennas at the BS is large. Based on this, the receive combining matrix 𝐖{\bf{W}} is accordingly expressed as

𝐖=𝐇(𝐱)(𝐇(𝐱)H𝐇(𝐱))1.\begin{split}{}{\bf{W}}={\bf{H}}({\bf{x}}){\left({{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right)^{-1}}.\end{split} (4)

Substituting (4) into (3), the received SINR of the signal sis_{i} is given by

γi=Pi[𝐇(𝐱)(𝐇(𝐱)H𝐇(𝐱))1]:,i22σ2.\begin{split}{}{\gamma_{i}}=\frac{{{P_{i}}}}{{\left\|{{{\left[{{\bf{H}}({\bf{x}}){{\left({{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right)}^{-1}}}\right]}_{:,i}}}\right\|_{2}^{2}{\sigma^{2}}}}.\end{split} (5)

Our goal is to optimize the positions of FAs at the BS, i.e., 𝐱{\bf{x}}, to minimize the total transmit power of MM users subject to a minimum achievable rate requirement for each user. Hence, the optimization problem is formulated as222In this work, only antennas’ positions are optimized for total power minimization. Consider the case where receiving beamforming and antennas’ positions are jointly optimized, the generalized Bender’s decomposition [13] can be exploited for obtaining the globally optimal solution.

(P1):min𝐱,𝐏i=1MPi\displaystyle({\rm{P1}}):{\rm{}}\mathop{\min}\limits_{{\bf{x}},{\bf{P}}}\ \sum\nolimits_{i=1}^{M}{{P_{i}}} (6a{\rm{6a}})
s.t.log2(1+γi)ri,i=1,,M,\displaystyle\ {\rm{s.t.}}\quad{\log_{2}}(1+{\gamma_{i}})\geq{r_{i}},\forall i=1,...,M, (6b{\rm{6b}})
𝐱𝒞,\displaystyle\quad\quad\ \ {\bf{x}}\in{\cal C}, (6c{\rm{6c}})

where rir_{i} in the constraint (6b) denotes the minimum rate requirement for Ui{\rm{U}}_{i}, and 𝒞{\cal C} in (6c) denotes the feasible moving region for NN antennas at the BS. More specifically, denote the total span for the movement of FAs as LL and without loss of generality set 0x1<x2<<xNL0\leq{x_{1}}<{x_{2}}<...<{x_{N}}\leq L. Then, consider: i) the minimum distance between any two FAs to avoid the coupling effect as dmin{{d_{\min}}} [7], [8], i.e., |xixj|dmin\left|{{x_{i}}-{x_{j}}}\right|\geq{d_{\min}}, ij\forall i\neq j; ii) the movement span should be the same for each antenna, we can conveniently set 𝒞=Δ{xi[Fi,Gi]}i=1N{\cal C}\buildrel\Delta\over{=}\left\{{{x_{i}}\in[{F_{i}},{G_{i}}]}\right\}_{i=1}^{N}, where

Fi=L(N1)dminN(i1)+(i1)dmin,Gi=L(N1)dminNi+(i1)dmin,\begin{split}{}{F_{i}}=&\frac{{L-(N-1){d_{\min}}}}{N}(i-1)+(i-1){d_{\min}},\\ {G_{i}}=&\frac{{L-(N-1){d_{\min}}}}{N}i+(i-1){d_{\min}},\end{split}

from which we have 0=F1<G1<F2<G2<<FN<GN=L0={F_{1}}<{G_{1}}<{F_{2}}<{G_{2}}<...<{F_{N}}<{G_{N}}=L and GiFi=L(N1)dminN{G_{i}}-{F_{i}}=\frac{{L-(N-1){d_{\min}}}}{N}, i=1,,N\forall i=1,...,N. The feasible movement region for each FA is illustrated in Fig. 2 for better understanding.

Refer to caption
Figure 2: Feasible movement region for each FA.
λi{𝐙}xn=Re[[𝐕1]i,:xn𝐙[𝐕]:,i+[𝐕1]i,:𝐙xn[𝐕]:,i+[𝐕1]i,:𝐙[𝐕]:,ixn]=(a)Re[[𝐕1]i,:xnλi{𝐙}[𝐕]:,i+[𝐕1]i,:𝐙xn[𝐕]:,i+λi{𝐙}[𝐕1]i,:[𝐕]:,ixn].\begin{split}{}\frac{{\partial{\lambda_{i}}\left\{{\bf{Z}}\right\}}}{{\partial{x_{n}}}}=&{\rm{Re}}\left[{\frac{{\partial{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}}}{{\partial{x_{n}}}}{\bf{Z}}{{\left[{\bf{V}}\right]}_{:,i}}+{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}{{\left[{\bf{V}}\right]}_{:,i}}+{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}{\bf{Z}}\frac{{\partial{{\left[{\bf{V}}\right]}_{:,i}}}}{{\partial{x_{n}}}}}\right]\\ \mathop{=}\limits^{(a)}&{\rm{Re}}\left[{\frac{{\partial{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}}}{{\partial{x_{n}}}}{\lambda_{i}}\left\{{\bf{Z}}\right\}{{\left[{\bf{V}}\right]}_{:,i}}+{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}{{\left[{\bf{V}}\right]}_{:,i}}+{\lambda_{i}}\left\{{\bf{Z}}\right\}{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{{\left[{\bf{V}}\right]}_{:,i}}}}{{\partial{x_{n}}}}}\right].\end{split} (13)
𝐱tf(𝐱)=[i=1M[𝐕1]i,:𝐙x1[𝐕]:,iλi2{𝐙},i=1M[𝐕1]i,:𝐙x2[𝐕]:,iλi2{𝐙},,i=1M[𝐕1]i,:𝐙xN[𝐕]:,iλi2{𝐙}]𝐱=𝐱tT.\begin{split}{}{\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}})=\left[{\sum\nolimits_{i=1}^{M}{\frac{{-{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{1}}}}{{\left[{\bf{V}}\right]}_{:,i}}}}{{\lambda_{i}^{2}\left\{{\bf{Z}}\right\}}}},\sum\nolimits_{i=1}^{M}{\frac{{-{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{2}}}}{{\left[{\bf{V}}\right]}_{:,i}}}}{{\lambda_{i}^{2}\left\{{\bf{Z}}\right\}}}},...,\sum\nolimits_{i=1}^{M}{\frac{{-{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{N}}}}{{\left[{\bf{V}}\right]}_{:,i}}}}{{\lambda_{i}^{2}\left\{{\bf{Z}}\right\}}}}}\right]_{{\bf{x}}={{\bf{x}}^{t}}}^{T}.\end{split} (18)

Based on (6b), it can be shown that PiP_{i} should satisfy

Pi[𝐇(𝐱)(𝐇(𝐱)H𝐇(𝐱))1]:,i22εiσ2,\begin{split}{}{P_{i}}\geq\left\|{{{\left[{{\bf{H}}({\bf{x}}){{\left({{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right)}^{-1}}}\right]}_{:,i}}}\right\|_{2}^{2}{\varepsilon_{i}}{\sigma^{2}},\end{split} (7)

where εi=(2ri1){\varepsilon_{i}}=({2^{{r_{i}}}}-1). According to (7), we can equivalently replace the objective of (P1) as [9]

i=1M[𝐇(𝐱)(𝐇(𝐱)H𝐇(𝐱))1]:,i22εiσ2=𝐇(𝐱)(𝐇(𝐱)H𝐇(𝐱))1𝛀1/2F2=tr{(𝛀1𝐇(𝐱)H𝐇(𝐱))1}=i=1M1λi{𝛀1𝐇(𝐱)H𝐇(𝐱)}=Δf(𝐱),\begin{split}{}&\sum\nolimits_{i=1}^{M}{\left\|{{{\left[{{\bf{H}}({\bf{x}}){{\left({{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right)}^{-1}}}\right]}_{:,i}}}\right\|_{2}^{2}{\varepsilon_{i}}{\sigma^{2}}}\\ =&\left\|{{\bf{H}}({\bf{x}}){{\left({{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right)}^{-1}}{{\bf{\Omega}}^{1/2}}}\right\|_{\rm{F}}^{2}\\ =&{\rm{tr}}\left\{{{{\left({{{\bf{\Omega}}^{-1}}{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right)}^{-1}}}\right\}\\ =&\sum\nolimits_{i=1}^{M}{\frac{1}{{{\lambda_{i}}\left\{{{{\bf{\Omega}}^{-1}}{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right\}}}}\buildrel\Delta\over{=}f({\bf{x}}),\end{split} (8)

where 𝛀=diag{[ε1σ2,ε2σ2,,εMσ2]}{\bf{\Omega}}={\rm{diag}}\left\{{\left[{{\varepsilon_{1}}{\sigma^{2}},{\varepsilon_{2}}{\sigma^{2}},...,{\varepsilon_{M}}{\sigma^{2}}}\right]}\right\} and λi{𝛀1𝐇(𝐱)H𝐇(𝐱)}{{\lambda_{i}}\left\{{{{\bf{\Omega}}^{-1}}{\bf{H}}{{({\bf{x}})}^{H}}{\bf{H}}({\bf{x}})}\right\}} denotes the ii-th eigenvalue of the matrix 𝛀1𝐇(𝐱)H𝐇(𝐱)=Δ𝐙M×M{{\bf{\Omega}}^{-1}}{\bf{H}}{({\bf{x}})^{H}}{\bf{H}}({\bf{x}})\buildrel\Delta\over{=}{\bf{Z}}\in{{\mathbb{C}}^{M\times M}}. Therefore, problem (P1) can be equivalently reformulated as

(P2):min𝐱f(𝐱)\displaystyle({\rm{P2}}):{\rm{}}\mathop{\min}\limits_{{\bf{x}}}\ f({\bf{x}}) (9a{\rm{9a}})
s.t.𝐱𝒞.\displaystyle\ {\rm{s.t.}}\quad{\bf{x}}\in{\cal C}. (9b{\rm{9b}})

Remark 1: Problem (P2) is highly non-convex because its objective is neither convex or concave, which cannot be solved via standard convex optimization techniques. Motivated by this, the authors in [9] try to solve (P2) by resorting to the PGD method, which handles the simple unconstrained or constrained problems well and is not sensitive to concavity or convexity of the objective. However, [9] computes the gradient based on the original definition shown in its equation (12), which has the large implementation complexity. In the next section, we show how to reduce the complexity significantly.

Remark 2: Considering the LoS environment, the BS can easily estimate the CSI by just estimating the AoAs to itself at MM users based on some mature algorithms, such as MUSIC. Based on this, the BS can directly optimize FAs’ positions via the proposed algorithm and then feedback each user the required transmit power based on (7) with optimized 𝐱{\bf{x}}.333In addition, even the general Rician fading is considered, the BS still optimizes FAs’ positions in advance based on the estimated AoAs. Then, in the communication process, all antennas’ positions are not changed and each user sends pilot signals to the BS for uplink channel estimations. When the BS successfully estimates the instantaneous CSI, it can tell each user the required transmit power based on (7). Since no antennas’ movements are involved, the consumed time for estimate-feedback is much smaller than the channel coherence time (CCT), especially for the low-mobility scenario where CCT is relatively larger [14].

III Algorithm Design for Solving (P2)

In this letter, we still exploit the PGD method to find a locally optimal solution to (P2). Specifically, using PGD, the update rule for 𝐱{\bf{x}} in the t+1t+1-th iteration is given by

𝐱t+1=𝐱tδ𝐱tf(𝐱),𝐱t+1={𝐱t+1,𝒞},\begin{split}{}{{\bf{x}}^{t+1}}=&{{\bf{x}}^{t}}-\delta{\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}),\\ {{\bf{x}}^{t+1}}=&{\cal B}\left\{{{{\bf{x}}^{t+1}},{\cal C}}\right\},\end{split} (10)

where 𝐱t+1{{\bf{x}}^{t+1}} in the first equation is the original updated 𝐱{\bf{x}}, and 𝐱t+1{{\bf{x}}^{t+1}} in the second equation is the additional update (if necessary) via the projection function {}{\cal B}\left\{\cdot\right\} as explained later, which ensures that the solutions for FAs’ positions in each iteration always satisfy the constraint in (9b). Further, 𝐱tf(𝐱){\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}) denotes the gradient of f(𝐱)f({\bf{x}}) at 𝐱t{{{\bf{x}}^{t}}}, and δ\delta is the step size for the gradient descent.

A. Computing 𝐱tf(𝐱){\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}): Note that 𝐱f(𝐱)=[f(𝐱)x1,,f(𝐱)xN]T{\nabla_{\bf{x}}}f({\bf{x}})={\left[{\frac{{\partial f({\bf{x}})}}{{\partial{x_{1}}}},...,\frac{{\partial f({\bf{x}})}}{{\partial{x_{N}}}}}\right]^{T}}. Using the chain rule, f(𝐱)xn\frac{{\partial f({\bf{x}})}}{{\partial{x_{n}}}}, n=1,,N\forall n=1,...,N, can be derived as

f(𝐱)xn=i=1M1λi2{𝐙}λi{𝐙}xn.\begin{split}{}\frac{{\partial f({\bf{x}})}}{{\partial{x_{n}}}}=\sum\nolimits_{i=1}^{M}{\frac{{-1}}{{\lambda_{i}^{2}\left\{{\bf{Z}}\right\}}}}\frac{{\partial{\lambda_{i}}\left\{{\bf{Z}}\right\}}}{{\partial{x_{n}}}}.\end{split} (11)

Based on (11), to compute 𝐱f(𝐱){\nabla_{\bf{x}}}f({\bf{x}}), the key is to derive a closed-form expression for λi{𝐙}xn\frac{{\partial{\lambda_{i}}\left\{{\bf{Z}}\right\}}}{{\partial{x_{n}}}}, i=1,,M\forall i=1,...,M and n=1,,Nn=1,...,N.

To proceed, let us denote 𝐙=𝐕𝐃𝐕1{\bf{Z}}={\bf{VD}}{{\bf{V}}^{-1}} as the eigenvalue decomposition of the matrix 𝐙{\bf{Z}}, where 𝐕M×M{\bf{V}}\in{{\mathbb{C}}^{M\times M}} consists of linearly independent columns with unit norm, and 𝐃=diag{[λ1{𝐙},,λM{𝐙}]}{\bf{D}}={\rm{diag}}\left\{{\left[{{\lambda_{1}}\left\{{\bf{Z}}\right\},...,{\lambda_{M}}\left\{{\bf{Z}}\right\}}\right]}\right\}. Then, we can equivalently express λi{𝐙}{\lambda_{i}}\left\{{\bf{Z}}\right\} as

λi{𝐙}=[𝐕1]i,:𝐙[𝐕]:,i.\begin{split}{}{\lambda_{i}}\left\{{\bf{Z}}\right\}={\left[{{{\bf{V}}^{-1}}}\right]_{i,:}}{\bf{Z}}{\left[{\bf{V}}\right]_{:,i}}.\end{split} (12)

Based on (12), λi{𝐙}xn\frac{{\partial{\lambda_{i}}\left\{{\bf{Z}}\right\}}}{{\partial{x_{n}}}} can be expanded as in (13), where =(a)\mathop{=}\limits^{(a)} is established since 𝐙[𝐕]:,i=λi{𝐙}[𝐕]:,i{\bf{Z}}{\left[{\bf{V}}\right]_{:,i}}={\lambda_{i}}\left\{{\bf{Z}}\right\}{\left[{\bf{V}}\right]_{:,i}} and [𝐕1]i,:𝐙=λi{𝐙}[𝐕1]i,:{\left[{{{\bf{V}}^{-1}}}\right]_{i,:}}{\bf{Z}}={\lambda_{i}}\left\{{\bf{Z}}\right\}{\left[{{{\bf{V}}^{-1}}}\right]_{i,:}}. Then, further note that the sum of the first and third terms in (13) equals

[𝐕1]i,:xnλi{𝐙}[𝐕]:,i+λi{𝐙}[𝐕1]i,:[𝐕]:,ixn=λi{𝐙}[[𝐕1]i,:[𝐕]:,i]xn=(b)0,\begin{split}{}&\frac{{\partial{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}}}{{\partial{x_{n}}}}{\lambda_{i}}\left\{{\bf{Z}}\right\}{\left[{\bf{V}}\right]_{:,i}}+{\lambda_{i}}\left\{{\bf{Z}}\right\}{\left[{{{\bf{V}}^{-1}}}\right]_{i,:}}\frac{{\partial{{\left[{\bf{V}}\right]}_{:,i}}}}{{\partial{x_{n}}}}\\ =&{\lambda_{i}}\left\{{\bf{Z}}\right\}\frac{{\partial\left[{{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}{{\left[{\bf{V}}\right]}_{:,i}}}\right]}}{{\partial{x_{n}}}}\mathop{=}\limits^{(b)}0,\end{split} (14)

where =(b)\mathop{=}\limits^{(b)} is established since [𝐕1]i,:[𝐕]:,i{{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}{{\left[{\bf{V}}\right]}_{:,i}}} always equals the constant one and thus is not relevant to xnx_{n} in any situation. Based on (13) and (14), λi{𝐙}xn\frac{{\partial{\lambda_{i}}\left\{{\bf{Z}}\right\}}}{{\partial{x_{n}}}} can be simplified as

λi{𝐙}xn=Re[[𝐕1]i,:𝐙xn[𝐕]:,i]=(c)[𝐕1]i,:𝐙xn[𝐕]:,i,\begin{split}{}\frac{{\partial{\lambda_{i}}\left\{{\bf{Z}}\right\}}}{{\partial{x_{n}}}}=&{\rm{Re}}\left[{{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}{{\left[{\bf{V}}\right]}_{:,i}}}\right]\\ \mathop{=}\limits^{(c)}&{{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}{{\left[{\bf{V}}\right]}_{:,i}}},\end{split} (15)

where =(c)\mathop{=}\limits^{(c)} is established since [𝐕1]i,:𝐙xn[𝐕]:,i{{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}{{\left[{\bf{V}}\right]}_{:,i}}} is a real number. Recall that 𝐙=𝛀1𝐇(𝐱)H𝐇(𝐱){\bf{Z}}={{\bf{\Omega}}^{-1}}{\bf{H}}{({\bf{x}})^{H}}{\bf{H}}({\bf{x}}) and 𝛀1=diag{[1/(ε1σ2),1/(ε2σ2),,1/(εMσ2)]}{{\bf{\Omega}}^{-1}}={\rm{diag}}\left\{{\left[{1/({\varepsilon_{1}}{\sigma^{2}}),1/({\varepsilon_{2}}{\sigma^{2}}),...,1/({\varepsilon_{M}}{\sigma^{2}})}\right]}\right\}. The element in the ii-th row and jj-th column of 𝐙{\bf{Z}} based on (1) can be derived as

[𝐙]i,j=1εiσ2k=1Nej2πλxk(sinθjsinθi),\begin{split}{}{\left[{\bf{Z}}\right]_{i,j}}=\frac{1}{{{\varepsilon_{i}}{\sigma^{2}}}}\sum\nolimits_{k=1}^{N}{{e^{j\frac{{2\pi}}{\lambda}{x_{k}}\left({\sin{\theta_{j}}-\sin{\theta_{i}}}\right)}}},\end{split} (16)

based on which it is easy to derive the element in the ii-th row and jj-th column of 𝐙xn\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}} as

[𝐙xn]i,j=[𝐙]i,jxn=1εiσ22πλ(sinθjsinθi)ej[2πλxn(sinθjsinθi)+π2].\begin{split}{}&{\left[{\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}}\right]_{i,j}}=\frac{{\partial{{\left[{\bf{Z}}\right]}_{i,j}}}}{{\partial{x_{n}}}}\\ =&\frac{1}{{{\varepsilon_{i}}{\sigma^{2}}}}\frac{{2\pi}}{\lambda}\left({\sin{\theta_{j}}-\sin{\theta_{i}}}\right){e^{j\left[{\frac{{2\pi}}{\lambda}{x_{n}}\left({\sin{\theta_{j}}-\sin{\theta_{i}}}\right)+\frac{\pi}{2}}\right]}}.\end{split} (17)

Finally, by substituting the known 𝐙xn{\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}} into (15) and then substituting (15) into (11), the gradient 𝐱f(𝐱){\nabla_{\bf{x}}}f({\bf{x}}) at 𝐱t{{\bf{x}}^{t}} can be computed as in (18).

Algorithm 1 BLS for a Feasible δ\delta in the tt-th iteration
1:Input: 𝐱t1{{\bf{x}}^{t-1}}, δ>0\delta>0, 0<ρ<10<\rho<1.
2:Repeat:
3:𝐱t={𝐱t1δ𝐱tf(𝐱),𝒞}{{\bf{x}}^{t}}={\cal B}\left\{{{{\bf{x}}^{t-1}}-\delta{\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}),{\cal C}}\right\}.
4: If f(𝐱t)>f(𝐱t1)δ𝐱t1f(𝐱)22f({{\bf{x}}^{t}})>f({{\bf{x}}^{t-1}})-\delta\left\|{{\nabla_{{{\bf{x}}^{t-1}}}}f({\bf{x}})}\right\|_{2}^{2}, update δρδ\delta\leftarrow\rho\delta.
5:End
6:Until: f(𝐱t)f(𝐱t1)δ𝐱t1f(𝐱)22f({{\bf{x}}^{t}})\leq f({{\bf{x}}^{t-1}})-\delta\left\|{{\nabla_{{{\bf{x}}^{t-1}}}}f({\bf{x}})}\right\|_{2}^{2}.
Algorithm 2 The Overall Algorithm for Solving (P2)
1:Input: t=1t=1, 𝐱1𝒞{{\bf{x}}^{1}}\in{\cal C}.
2:Repeat:
3: Perform eigenvalue decomposition on [𝐙]𝐱=𝐱t{\left[{\bf{Z}}\right]_{{\bf{x}}={{\bf{x}}^{t}}}} and  compute [{𝐙/xn}n=1N]𝐱=𝐱t{\left[{\left\{{\partial{\bf{Z}}/\partial{x_{n}}}\right\}_{n=1}^{N}}\right]_{{\bf{x}}={{\bf{x}}^{t}}}} to obtain 𝐱tf(𝐱){\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}).
4: Determine a feasible δ\delta based on Algorithm 1;
5:tt+1t\leftarrow t+1;
6: Update 𝐱t={𝐱t1δ𝐱t1f(𝐱),𝒞}{{\bf{x}}^{t}}={\cal B}\left\{{{{\bf{x}}^{t-1}}-\delta{\nabla_{{{\bf{x}}^{t-1}}}}f({\bf{x}}),{\cal C}}\right\}.
7:End
8:Until: |f(𝐱t)f(𝐱t1)|τ\left|{f({{\bf{x}}^{t}})-f({{\bf{x}}^{t-1}})}\right|\leq\tau.

B. Determining the feasible step size: In the PGD method, a correct setting for the step size in each iteration is important for realizing convergence. Specifically, the feasible δ\delta in each iteration should satisfy δ1/L𝐱\delta\leq 1/{L_{\bf{x}}}, where L𝐱{L_{\bf{x}}} is a Lipschitz constant for 𝐱f(𝐱){\nabla_{\bf{x}}}f({\bf{x}}), which satisfies 𝐱f(𝐱)𝐱f(𝐱)2L𝐱𝐱𝐱2{\left\|{{\nabla_{\bf{x}}}f({\bf{x}})-{\nabla_{{\bf{x^{\prime}}}}}f({\bf{x}})}\right\|_{2}}\leq{L_{\bf{x}}}{\left\|{{\bf{x}}-{\bf{x^{\prime}}}}\right\|_{2}}, 𝐱,𝐱𝒞\forall{\bf{x}},{\bf{x^{\prime}}}\in{\cal C} [15]. Since the structure of 𝐱f(𝐱){{\nabla_{\bf{x}}}f({\bf{x}})} is much complex, generally L𝐱{L_{\bf{x}}} is difficult to determine. Based on this fact, we can instead exploit the backtracking line search (BLS) [16] to find a feasible δ\delta. The details are shown in Algorithm 1, where ρ\rho denotes the shrinking factor.

C. Determining the projection function {}{\cal B}\left\{\cdot\right\}: Recall that the projection function mainly ensures that NN FAs only move in their respective feasible regions. Therefore, according to the rule of nearest distance, {𝐱t+1,𝒞}{\cal B}\left\{{{{\bf{x}}^{t+1}},{\cal C}}\right\} can be determined as

{𝐱t+1,C}xit+1=min(max(Fi,xit+1),Gi).\begin{split}{}&{\cal B}\left\{{{{\bf{x}}^{t+1}},C}\right\}\triangleright x_{i}^{t+1}=\min\left({\max({F_{i}},x_{i}^{t+1}),{G_{i}}}\right).\end{split} (19)

D. The algorithm, complexity analysis and comparison: The overall setups for solving problem (P2) are summarized in Algorithm 2, where τ\tau denotes the prescribed accuracy. Generally, the PGD based minimization may lead to sightly different total transmit power for different initialization 𝐱1{{\bf{x}}^{1}} and step-sizes δ\delta. This is mainly because the PGD may converge to a local minimum of the objective, which is an unavoidable phenomenon arising in non-convex optimization problems. Nevertheless, this phenomenon can be well solved by randomly generating numerous different 𝐱1{{\bf{x}}^{1}} and then selecting the one which produces the minimum power.

Complexity Analysis: To simplify the analysis while still capturing the complexity of Algorithm 2, we here focus on the number of complex multiplications required in each iteration. Specifically, the complexity of the eigenvalue decomposition for [𝐙]𝐱=𝐱t{\left[{\bf{Z}}\right]_{{\bf{x}}={{\bf{x}}^{t}}}} is about 𝒪(M3){\cal O}({M^{3}}) [9]. Further, calculating i=1M[𝐕1]i,:𝐙xn[𝐕]:,i/λi2{𝐙}\sum\nolimits_{i=1}^{M}{-{{\left[{{{\bf{V}}^{-1}}}\right]}_{i,:}}\frac{{\partial{\bf{Z}}}}{{\partial{x_{n}}}}{{\left[{\bf{V}}\right]}_{:,i}}/\lambda_{i}^{2}\left\{{\bf{Z}}\right\}}, n=1,,N\forall n=1,...,N, requires 𝒪(M2){\cal O}({M^{2}}) complex multiplications, leading to the complexity of computing 𝐱tf(𝐱){\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}) as 𝒪(M2N){\cal O}({M^{2}}N). In addition, the complexity of finding a feasible δ\delta is about 𝒪(TinnerN){\cal O}({T_{{\rm{inner}}}}N), where NN is the complexity of computing δ𝐱tf(𝐱)\delta{\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}) in step 3 of Algorithm 1, and Tinner{T_{{\rm{inner}}}} is the maximum number of iterations for BLS. Hence, the total complexity of Algorithm 2 is about

𝒪(Touter(M3+M2N+TinnerN)),\begin{split}{}{\cal O}\left({{T_{{\rm{outer}}}}\left({{M^{3}}+{M^{2}}N+{T_{{\rm{inner}}}}N}\right)}\right),\end{split}

where Touter{{T_{{\rm{outer}}}}} is the maximum number of iterations for repeatedly implementing steps 3-5 in Algorithm 2.

Complexity Comparison: As a comparison, if the original definition based method [9] is exploited to compute the gradient, i.e.,

f(𝐱)xn|𝐱=𝐱t=limε0f(x1t,,xnt+ε,,xNt)f(𝐱t)ε,\begin{split}{}&\frac{{\partial f({\bf{x}})}}{{\partial{x_{n}}}}{|_{{\bf{x}}={{\bf{x}}^{t}}}}=\mathop{\lim}\limits_{\varepsilon\to 0}\frac{{f(x_{1}^{t},...,x_{n}^{t}+\varepsilon,...,x_{N}^{t})-f({{\bf{x}}^{t}})}}{\varepsilon},\end{split} (20)

the corresponding complexity will become larger. Specifically, given 𝐱t{{{\bf{x}}^{t}}} and ε\varepsilon, using the eigenvalue decomposition to obtain f(x1t,,xnt+ε,,xNt){f(x_{1}^{t},...,x_{n}^{t}+\varepsilon,...,x_{N}^{t})} for all n=1,,Nn=1,...,N requires a complexity of 𝒪(NM3){\cal O}(N{M^{3}}). Similarly, the complexity of obtaining f(𝐱t){f({{\bf{x}}^{t}})} is 𝒪(M3){\cal O}({M^{3}}). Therefore, the complexity of obtaining 𝐱tf(𝐱){\nabla_{{{\bf{x}}^{t}}}}f({\bf{x}}) is about 𝒪((N+1)M3){\cal O}((N+1){M^{3}}), and then the total complexity of Algorithm 2 becomes

𝒪(Touter(M3+M3N+TinnerN)),{\cal O}\left({{T_{{\rm{outer}}}}\left({{M^{3}}+{M^{3}}N+{T_{{\rm{inner}}}}N}\right)}\right),

which is clearly higher than the complexity of Algorithm 2 in this work, especially when MM is large. We compare the above two complexities versus MM in Fig. 3 for better illustration, where we set Touter=Tinner=10{T_{{\rm{outer}}}}={T_{{\rm{inner}}}}=10 and N=30N=30.

Refer to caption
Figure 3: Computational complexity of the proposed design and the original definition based method.
Refer to caption
Figure 4: The convergence behavior of the proposed PGD method.
Refer to caption

(a)

Refer to caption

(b)

Refer to caption

(c)

Figure 5: Average total transmit power versus (a) NN; (b) LL; (c) KK.

IV Simulation Results

In this section, we present numerical results to demonstrate the effectiveness of the proposed design over the general Rician fading, in which the channel vector between the BS and Ui{\rm{U}}_{i} is 𝐡^i(𝐱)=K/(K+1)𝐡i(𝐱)+1/(K+1)𝐡~i{\widehat{\bf{h}}_{i}}({\bf{x}})=\sqrt{K/(K+1)}{{\bf{h}}_{i}}({\bf{x}})+\sqrt{1/(K+1)}{\widetilde{\bf{h}}_{i}}, where KK is the Rician factor, 𝐡i(𝐱){{\bf{h}}_{i}}({\bf{x}}) is given in (1), and each element of 𝐡~iN×1{\widetilde{\bf{h}}_{i}}\in{{\mathbb{C}}^{N\times 1}} is i.i.d. complex Gaussian distributed with zero mean and unit variance. Under this setup, optimal 𝐱{\bf{x}} (denoted as 𝐱LoS{{\bf{x}}^{{\rm{LoS}}}}) is still obtained based on statistical AoAs, while the objective of total transmit power becomes 𝔼[i=1M1λi{𝛀1𝐇^(𝐱LoS)H𝐇^(𝐱LoS)}]{\mathbb{E}}\left[{\sum\nolimits_{i=1}^{M}{\frac{1}{{{\lambda_{i}}\left\{{{{\bf{\Omega}}^{-1}}\widehat{\bf{H}}{{({{\bf{x}}^{{\rm{LoS}}}})}^{H}}\widehat{\bf{H}}({{\bf{x}}^{{\rm{LoS}}}})}\right\}}}}}\right], with 𝐇^(𝐱LoS)=[𝐡^1(𝐱LoS),,𝐡^M(𝐱LoS)]\widehat{\bf{H}}({{\bf{x}}^{{\rm{LoS}}}})=\left[{{{\widehat{\bf{h}}}_{1}}({{\bf{x}}^{{\rm{LoS}}}}),...,{{\widehat{\bf{h}}}_{M}}({{\bf{x}}^{{\rm{LoS}}}})}\right]. For convincing comparisons, we further consider three widely used benchmarks:

  • RPA: The line segment of length LL is quantized into 2L+12L+1 discrete locations with equal-distance 0.5λ0.5\lambda, and NN out of these 2L+12L+1 locations are optimally selected for antenna positions.

  • FPA: Each antenna has a fixed position, i.e., xi=(i1)dmin{x_{i}}=(i-1){d_{\min}}.

  • Minimum mean square error (MMSE) combining: The BS will exploit MMSE combining to detect multiple signals, where positions of all antennas are optimized employing the method in [9], but base on statistical AoAs.

For the system parameters, we set the minimum distance between any two adjacent FAs as dmin=0.5λ{d_{\min}}=0.5\lambda, and without prejudice to the conclusion, λ\lambda is set to 1 for simplification. We consider M=3M=3 users and the AoAs are θ1=π/16{\theta_{1}}=\pi/16, θ2=π/10{\theta_{2}}=\pi/10 and θ3=π/2{\theta_{3}}=\pi/2, respectively. In addition, the noise power is set as σ2=1{\sigma^{2}}=1 for normalizing the large-scale channel fading power.

Fig. 4 first illustrates the convergence behavior of our proposed design under the LoS channels and for the case of N=4N=4 and ri=1{r_{i}}=1, i=1,2,3\forall i=1,2,3. Corresponding to different L=2.5,3.5,4.5L=2.5,3.5,4.5, the initial condition for the iteration is set as 𝐱1=[0,L/3,2L/3,L]T{{\bf{x}}^{1}}={[0,L/3,2L/3,L]^{T}}. As we can observe, the total transmit power of all users rapidly converges to a constant within dozens of iterations. Therefore, the proposed design is computationally efficient which may be suitable for the practical implementation.

Fig. 5(a) compares the total transmit power of four schemes with respect to (w.r.t.) number of transmit antennas at the BS (NN) for the case of ri=1{r_{i}}=1, i=1,2,3\forall i=1,2,3 and K=10K=10. We can observe that: i) as NN increases, the BS can better distinguish signals in different directions and achieve higher reception gains, which in turn allows the users to transmit their signals with less power; ii) compared to FPA and RPA, the proposed design can optimally exploit the additional spatial DoF, so that the resulting total transmit power can be minimized; iii) as NN increases, the performance gap between RPA and the proposed design decreases. The reason is that when LL is fixed, each antenna can just move in a smaller region when NN increases, which implies that there may be not much performance difference from discrete positions selection in RPA or optimal FAs’ movements in the proposed design; iv) as reported in [9], due to the more powerful detection ability, MMSE combining outperforms the proposed design slightly.

Fig. 5(b) shows the total transmit power w.r.t. the span of FAs’ movement (LL) for the case of ri=1{r_{i}}=1, i=1,2,3\forall i=1,2,3 and K=10K=10, from which it is observed that when LL increases, the total transmit power of RPA, the proposed design and MMSE combining first becomes smaller and then converges to a constant. This phenomenon reveals that it is not necessary to expand LL indefinitely and only a limited span is enough to achieve the optimal performance.

Finally, Fig. 5(c) shows the total transmit power w.r.t. the Rician factor KK for the case of ri=1{r_{i}}=1, i=1,2,3\forall i=1,2,3, N=5N=5 and L=5.5L=5.5, from which it is observed that no matter whether KK is large (the LoS condition is dominate for each channel between the user and the BS) or small (the random Rayleigh fading is dominate for each channel between the user and the BS), our proposed design with statistical AoAs always achieves pretty good performance compared to FPA and RPA, indicating that our proposed design is not sensitive w.r.t. random fading components in Rician channels.

V Conclusion

This letter considers multiuser uplink communication supported by the FAs-enabled base station, which exploits zero-forcing receivers to decode multiple signals. The objective is to optimize the FAs’ positions at the BS, to minimize the total transmit power of all users subject to the minimum rate requirement. We develop a projected gradient descent method to iteratively find a locally optimal solution, at significantly reduced complexity compared to state of the art since a closed-form gradient is derived successfully. Results show the performance superiority of our proposed design compared to several benchmarks.

References

  • [1] W. Xia, et. al., “A deep learning framework for optimization of MISO downlink beamforming,” IEEE Trans. Commun., vol. 68, no. 3, pp. 1866-1880, March 2020.
  • [2] Q. Wu, et. al., “Intelligent reflecting surface enhanced wireless network via joint active and passive beamforming,” IEEE Trans. Wireless Commun., vol. 18, no. 11, pp. 5394-5409, Nov. 2019.
  • [3] L. Zhu, et. al., “Movable antennas for wireless communication: Opportunities and challenges,” IEEE Commun. Mag., early access. DOI: 10.1109/MCOM.001.2300212.
  • [4] W. K. New, et. al., “Fluid antenna system: New insights on outage probability and diversity gain,” IEEE Trans. Wireless Commun., early access. DOI: 10.1109/TWC.2023.3276245.
  • [5] K. K. Wong, et. al., “Fluid antenna systems,” IEEE Trans. Wireless Commun., vol. 20, No. 3, pp. 1950-1962, March 2021.
  • [6] K. K. Wong, et. al., “Fluid antenna multiple access,” IEEE Trans. Wireless Commun., vol. 21, No. 7, pp. 4801-4815, July 2022.
  • [7] W. Ma, et. al., “MIMO capacity characterization for movable antenna systems,” IEEE Trans. Wireless Commun., early access. DOI: 10.1109/TWC.2023.3307696.
  • [8] Y. Ye, et. al., “Fluid antenna-assisted MIMO transmission exploiting statistical CSI,” IEEE Commun. Lett., early access. DOI: 10.1109/LCOMM.2023.3336805.
  • [9] L. Zhu, et. al., “Movable-antenna enhanced multiuser communication via antenna position optimization,” arXiv: 2302.06978, 2023.
  • [10] Z. Xiao, et. al., “Multiuser communications with movable-antenna base station: Joint antenna positioning, receive combining, and power control,” arXiv: 2308.09512, 2023.
  • [11] G. Hu, et. al., “Secure wireless communication via movable-antenna array,” arXiv: 2311.07104, 2023.
  • [12] L. Zhu, et. al., “Movable-antenna array enhanced beamforming: Achieving full array gain with null steering,” IEEE Commun. Lett., early access. DOI: 10.1109/LCOMM.2023.3323656.
  • [13] Y. Wu, et. al., “Movable antenna-enhanced multiuser communication: Optimal discrete antenna positioning and beamforming,” arXiv: 2308.02304, 2023.
  • [14] T. Van Chien, et. al., “Uplink power control in massive MIMO with double scattering channels,” IEEE Trans. Wireless Commun., vol. 21, no. 3, pp. 1989-2005, March 2022.
  • [15] A. Beck, et. al., “A fast iterative shrinkage-thresholding algorithm for linear inverse problems,” SIAM J. Imag. Sci., vol. 2, no. 1, pp. 183-202, 2009.
  • [16] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge, U.K.: Cambridge Univ. Press, 2004.