This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\fail

UAV Communications
for Sustainable Federated Learning

Quoc-Viet Pham, Ming Zeng, Rukhsana Ruby, Thien Huynh-The, and Won-Joo Hwang Q.-V. Pham is with the Korean Southeast Center for the 4th Industrial Revolution Leader Education, Pusan National University, Korea. M. Zeng is with the Department of Electrical Engineering and Computer Engineering, Laval University, Canada. R. Ruby is with the College of Computer Science and Software Engineering, Shenzhen University, China. T. Huynh-The is with the ICT Convergence Research Center, Kumoh National Institute of Technology, Korea. W.-J. Hwang is with the Department of Biomedical Convergence Engineering, Pusan National University, Korea. E-mails: {[email protected], [email protected], [email protected], [email protected], [email protected]}.This work was supported by a National Research Foundation of Korea (NRF) Grant funded by the Korean Government (MSIT) under Grants NRF-2019R1C1C1006143 and NRF-2019R1I1A3A01060518.
Abstract

Federated learning (FL), invented by Google in 2016, has become a hot research trend. However, enabling FL in wireless networks has to overcome the limited battery challenge of mobile users. In this regard, we propose to apply unmanned aerial vehicle (UAV)-empowered wireless power transfer to enable sustainable FL-based wireless networks. The objective is to maximize the UAV transmit power efficiency, via a joint optimization of transmission time and bandwidth allocation, power control, and the UAV placement. Directly solving the formulated problem is challenging, due to the coupling of variables. Hence, we leverage the decomposition technique and a successive convex approximation approach to develop an efficient algorithm, namely UAV for sustainable FL (UAV-SFL). Finally, simulations illustrate the potential of our proposed UAV-SFL approach in providing a sustainable solution for FL-based wireless networks, and in reducing the UAV transmit power by 32.95%, 63.18%, and 78.81% compared with the benchmarks.

Index Terms:
Edge Computing, Energy Harvesting, Federated Learning, Sustainability, UAV Communications.

I Introduction

To overcome challenges of data privacy in conventional deep learning, the concept of federated learning (FL) has been invented by Google in 2016 [1]. In FL, users need not share their data with the central server but only information of the local models is transmitted to the server. Thanks to its characteristics, FL has been adopted in many applications such as Google keyboard suggestions [2], medicine [3], and mobile communications [4]. Although FL has found many benefits in enabling privacy-preserving learning solutions, it also poses significant challenges such as resource management, robustness, and incentive mechanisms [5]. Moreover, powering mobile users with limited battery capacity is of vital importance to enable FL as users may run out of battery during the training process. In this regard, wireless powered communication networks (WPCNs) represent a promising solution, owing to their ability to charge the users wirelessly. Nonetheless, conventional WPCNs often suffer from several challenges, such as doubly near-far issue (i.e., far users need to spend higher power with smaller harvested energy) and severe performance degradation over long distance [6]. To address these challenges, unmanned aerial vehicle (UAV) enabled WPCNs have been introduced, which often supports line-of-sight connections with flexible deployment and controllable mobility [6], thus improving the network performance.

Over the last two years, various studies have been dedicated to 1) FL-WPCNs [7] and 2) FL-UAV communications [8, 9, 10]. As the very first work on FL-WPCNs, Tran et al.[7] considered the application of lightwave (i.e., visible and infrared light) power transfer to wirelessly power mobile users, which then utilize the harvested power to upload their local models to the learning server over radio channels. This work illustrated that users are replenished with enough energy to perform FL computation. However, the framework in [7] requires synchronizations among users, which is a challenging assumption as users may have different computing capabilities and data sizes. With regard to FL-UAV communications, the works in [8, 9, 10] employed FL to enable UAV applications such as quality sensing with UAV swarms [8], secure UAV crowdsensing with blockchain [9], and massive UAV communications [10].

Despite promising results, we are not aware of any work that focuses UAV-enabled wireless-powered FL networks (UAV-WPFNs). Such UAV-WPFNs are very promising in circumstances (e.g., large farms, mountains, and hard-to-reach areas), where a number of users with a limited battery capacity are deployed while no central server is available for data collection and model training. In this work, we propose to leverage the UAV to wirelessly power FL users so as to enhance the sustainability of FL-based networks. We aim to maximize the transmit power efficiency of the UAV by jointly optimizing transmission time allocation, bandwidth allocation, power control, and UAV placement. To address the formulated non-convex problem, we propose an iterative algorithm, namely UAV-SFL to achieve the maximum power efficiency of the UAV. At each iteration, the time allocation and the UAV transmit power are updated using the analogy of the bi-section rule, whereas power and bandwidth allocation of the users, and the UAV placement are updated by solving a convex feasibility problem. Simulation results show the sustainability of the considered UAV wireless-powered FL network, and its superiority over the benchmarks.

II Problem Formulation

II-A Network Model

We consider a network setup with a rotary-wing single-antenna UAV and KK single-antenna users (e.g., mobile users, sensors, and Internet-of-Things (IoT) devices), which are randomly distributed within an area of interest. We denote the set of users as 𝒦={1,,K}\mathcal{K}=\{1,\dots,K\}. The UAV equipped with a mobile edge computing (MEC) server provides computing services and broadcasts energy to the users. Each user has an on-board computing processor to train the local model using its own data and has an energy circuit to harvest energy transmitted by the UAV. Similar to [11], we assume that the UAV can simultaneously broadcast energy and receives the local models from users. Meanwhile, each user can simultaneously harvest energy from the UAV, perform FL tasks, and communicate with the server.

The rotary-wing UAV can hover at a given altitude HH while its coordinate can be optimized to establish LoS connections and maximize the network performance. Denote by 𝐪={x,y}\mathbf{q}=\{x,y\} and 𝐪k={xk,yk}\mathbf{q}_{k}=\{x_{k},y_{k}\} the respective coordinates of the UAV and user kk. The channel gain gkg_{k} of user kk can be calculated as gk=β^0(dk/d0)αg_{k}=\hat{\beta}_{0}(d_{k}/d_{0})^{-\alpha}, where β^0\hat{\beta}_{0} is the reference channel gain at d0=1d_{0}=1 m and dkd_{k} is the distance to the UAV. Here, the distance dkd_{k} is given as dk=H2+𝐪𝐪k22d_{k}=\sqrt{H^{2}+\|\mathbf{q}-\mathbf{q}_{k}\|_{2}^{2}}.

The energy of user kk harvested from the UAV can be given as Ek=η0TPgkE_{k}=\eta_{0}TPg_{k}, where TT is the time duration, PP is the transmit power of the UAV, and 0<η010<\eta_{0}\leq 1 is the energy conversion efficiency of user kk. This linear model has been adopted in many works (e.g. [11] and [12]) to show meaningful insights of UAV-enabled wireless powered communications. The non-linear energy harvesting model, as considered in our work [13], is an interesting topic for future work.

II-B Local Computing Model

Each user has a local dataset, which is used to train the local model and is not shared with the server to protect user privacy. About local computation of user kk, we denote as fkf_{k} (in CPU cycles per second) the CPU computing capability, DkD_{k} as the number of data samples, and CkC_{k} as the number of CPU cycles needed to process a data sample. The computation time for one local iteration is tkcp=CkDk/fkt_{k}^{\text{cp}}=C_{k}D_{k}/f_{k}. The corresponding energy consumption is Ekcp=ζkCkDkfk2E_{k}^{\text{cp}}=\zeta_{k}C_{k}D_{k}f_{k}^{2}, where ζk\zeta_{k} is a coefficient depending on the hardware and chip architecture [14].

II-C Communication Model

After local computation, each user uploads its local model to the server for aggregation. We adopt frequency division multiple access (FDMA) for the uplink operation because FL with FDMA can be implemented in asynchronous manner [15, 16]. Other multiple access techniques such as rate splitting multiple access (RSMA) and non-orthogonal multiple access (NOMA) can be also applied with the consideration of synchronizations among users.

For user kk, the achievable rate is given as Rk=bklog2(1+pkgk/bkn0)R_{k}=b_{k}\log_{2}\left(1+{p_{k}g_{k}}/{b_{k}n_{0}}\right), where bkb_{k} is the allocated bandwidth, pkp_{k} is the transmit power, and n0n_{0} is the noise power spectral density. Denote by ss the constant data size of the local model parameter and gradient that user kk needs to upload to the server. To ensure the transmission of ss within the uploading time tkcmt_{k}^{\text{cm}}, the constraint sRktkcms\leq R_{k}t_{k}^{\text{cm}} should hold. The corresponding energy consumed by user kk is Ekcm=tkcmpkE_{k}^{\text{cm}}=t_{k}^{\text{cm}}p_{k}.

II-D Problem Formulation

We aim at developing a resource allocation framework to maximize the power efficiency of the UAV so as to enable sustainable FL systems. In particular, we are interested in the following optimization problem.

minP,𝐩,𝐟,𝐛,𝐭,𝐪\displaystyle\underset{P,\mathbf{p},\mathbf{f},\mathbf{b},\mathbf{t},\mathbf{q}}{\min} P\displaystyle P (1a)
s.t. tkcmbklog2(1+pkgkbkn0)s,k𝒦,\displaystyle t_{k}^{\text{cm}}b_{k}\log_{2}\left(1+\frac{p_{k}g_{k}}{b_{k}n_{0}}\right)\geq s,\forall k\in\mathcal{K}, (1b)
NkEkcp+EkcmEk,k𝒦,\displaystyle N_{k}E_{k}^{\text{cp}}+E_{k}^{\text{cm}}\leq E_{k},\forall k\in\mathcal{K}, (1c)
0PPmax,\displaystyle 0\leq P\leq P^{\max}, (1d)
0pkpkmax,k𝒦,\displaystyle 0\leq p_{k}\leq p_{k}^{\max},\forall k\in\mathcal{K}, (1e)
k𝒦bkB,\displaystyle\sum\nolimits_{k\in\mathcal{K}}b_{k}\leq B, (1f)
tkcpNk+tkcmT,k𝒦,\displaystyle t_{k}^{\text{cp}}N_{k}+t_{k}^{\text{cm}}\leq T,\forall k\in\mathcal{K}, (1g)
fkminfkfkmax,k𝒦,\displaystyle f_{k}^{\min}\leq f_{k}\leq f_{k}^{\max},\forall k\in\mathcal{K}, (1h)
𝐪22x2+y2C2,\displaystyle\lVert\mathbf{q}\rVert_{2}^{2}\triangleq x^{2}+y^{2}\leq C^{2}, (1i)

where 𝐩={p1,,pK}\mathbf{p}=\{p_{1},\dots,p_{K}\}, 𝐟={f1,,fK}\mathbf{f}=\{f_{1},\dots,f_{K}\}, 𝐛={b1,,bK}\mathbf{b}=\{b_{1},\dots,b_{K}\}, and 𝐭={t1cm,,tKcm}\mathbf{t}=\{t_{1}^{\text{cm}},\dots,t_{K}^{\text{cm}}\}. In the above problem, (1b) indicates the constraint on the data rate transmission, (1c) implies that the energy harvested from the UAV should be greater than the energy consumed for local computation and communication, (1d) and (1e) indicate the feasible range of the transmit power due to power budgets of the UAV and users, and (1f) denotes the bandwidth constraint with BB being the system bandwidth. Constraint (1g) ensures that each global round should be completed within the time frame TT, with NkN_{k} being the number of local iterations required by each user. We do emphasize that NkN_{k} can be preset, and the value is decided by the target accuracy of the local model [15, 7]. The CPU frequency of each user is constrained in (1h) with fkminf_{k}^{\min} and fkmaxf_{k}^{\max} being the minimum and maximum CPU frequency of user kk, respectively. Finally, (1i) expresses that the UAV should be placed within a circular disc with radius CC. Solving problem (1) directly is challenging since multiple optimization variables are coupled. Moreover, the optimization of UAV location arises the non-linearity of (1) caused by the channel gain model and the log2()\log_{2}(\cdot) function of the achievable rates.

III Proposed Solution

To overcome the non-linearity due to the UAV placement we introduce auxiliary variables 𝐮k={uk}k𝒦\mathbf{u}_{k}=\{u_{k}\}_{k\in\mathcal{K}} and rewrite the problem (1) as follows:

minP,𝐩,𝐟,𝐛,𝐭,𝐪,𝐮\displaystyle\underset{P,\mathbf{p},\mathbf{f},\mathbf{b},\mathbf{t},\mathbf{q},\mathbf{u}}{\min} P\displaystyle P (2a)
s.t. tkbklog2(1+pkg0bkuk)s,k𝒦,\displaystyle t_{k}b_{k}\log_{2}\left(1+\frac{p_{k}g_{0}}{b_{k}u_{k}}\right)\geq s,\forall k\in\mathcal{K}, (2b)
NkζkCkDkfk2+tkpkPβ0uk,k𝒦,\displaystyle N_{k}\zeta_{k}C_{k}D_{k}f_{k}^{2}+t_{k}p_{k}\leq\frac{P\beta_{0}}{u_{k}},\forall k\in\mathcal{K}, (2c)
NkCkDkfk+tkT,k𝒦,\displaystyle\frac{N_{k}C_{k}D_{k}}{f_{k}}+t_{k}\leq T,\forall k\in\mathcal{K}, (2d)
uk(H2+𝐪𝐪k22)α/2,k𝒦,\displaystyle u_{k}\geq\left(H^{2}+\|\mathbf{q}-\mathbf{q}_{k}\|_{2}^{2}\right)^{\alpha/2},\forall k\in\mathcal{K}, (2e)
(1d),(1e),(1f),(1h),(1i),\displaystyle\eqref{OptPrb:powerUAV},\eqref{OptPrb:power},\eqref{OptPrb:bAllocation},\eqref{OptPrb:CPUfre},\eqref{OptPrb:coordinate}, (2f)

where β0=η0Tβ^0\beta_{0}=\eta_{0}T\hat{\beta}_{0} and g0=β^0/n0g_{0}=\hat{\beta}_{0}/n_{0}. We also replace tkcmt_{k}^{\text{cm}} by tkt_{k} for ease of presentation. It is however still difficult to solve the problem (2). In this work, we leverage the block successive upper-bound minimization (BSUM) method proposed in [17] to solve (2). Particularly, we divide the entire variables of the problem (2) into three blocks: (𝐭,𝐟)(\mathbf{t},\mathbf{f}), PP, and (𝐩,𝐛,𝐪,𝐮)(\mathbf{p},\mathbf{b},\mathbf{q},\mathbf{u}), and as will be shown in Algorithm 1, these blocks can be updated alternately in an iterative manner.

For the first block variable (𝐭,𝐟)(\mathbf{t},\mathbf{f}), optimizing the problem (2) is equivalent to minimizing the energy consumed for both local computation and communication. From (2b), the minimum transmission time of user kk at the update step κ\kappa is given as

tkmin,(κ)=sbklog2(1+pkg0/bkuk).t_{k}^{\min,(\kappa)}=\frac{s}{b_{k}\log_{2}\left(1+p_{k}g_{0}/b_{k}u_{k}\right)}. (3)

Also from (2d), at the first update step (i.e. κ=0\kappa=0), the transmission time of user kk is limited by the following upper-bound value

tkmax,(κ)=TNkCkDkfkmax.t_{k}^{\max,(\kappa)}=T-\frac{N_{k}C_{k}D_{k}}{f_{k}^{\max}}. (4)

Therefore, the transmission time of user kk is a feasible point within the range [tkmin,(κ),tkmax,(κ)][t_{k}^{\min,(\kappa)},t_{k}^{\max,(\kappa)}]. As the transmission energy of each user becomes smaller when the transmission time is smaller (as Ekcm=tkpkE_{k}^{\text{cm}}=t_{k}p_{k}) and there is one additional constraint (2c), the transmission time can be updated using the analogy of the bi-section rule. In particular, at the update step κ\kappa, the transmission time of user kk is set as follows:

tk(κ)=(tkmax,(κ)+tkmin,(κ))/2,t_{k}^{(\kappa)}=({t_{k}^{\max,(\kappa)}+t_{k}^{\min,(\kappa)}})/{2}{\color[rgb]{0,0,1},} (5)

and the maximum transmission time of user kk for the next update step is set as tkmax,(κ+1)=tk(κ)t_{k}^{\max,(\kappa+1)}=t_{k}^{(\kappa)}, Here, we should note that constraint (2c) of the block variable (𝐭,𝐟)(\mathbf{t},\mathbf{f}) is handled via the optimization of the other blocks. For a given transmission time tk(κ)t_{k}^{(\kappa)}, we can infer from (1h) and (2d) that the optimal CPU frequency of user kk should satisfy fk(κ)=min{max{f¯kmin,fkmin},fkmax}f_{k}^{(\kappa)}=\min\{\max\{\underline{f}_{k}^{\min},f_{k}^{\min}\},f_{k}^{\max}\} in order to minimize the energy consumed for local computation, where

f¯kmin=NkCkDkTtk(κ).\underline{f}_{k}^{\min}=\frac{N_{k}C_{k}D_{k}}{T-t_{k}^{(\kappa)}}. (6)

Similarly for the block of the transmit power PP of the UAV, at the first update step, the maximum transmit power is set as P(κ=0)max=PmaxP^{\max}_{(\kappa=0)}=P^{\max} and the minimum power can be derived from (2c) as P(κ)min=max{Pk(κ)}P^{\min}_{(\kappa)}=\max\{P_{k}^{(\kappa)}\}, where

Pk(κ)=ukβ0(NkζkCkDkfk2+tkpk).P_{k}^{(\kappa)}\ =\frac{u_{k}}{\beta_{0}}\left({\color[rgb]{0,0,0}N_{k}}\zeta_{k}C_{k}D_{k}f_{k}^{2}+t_{k}p_{k}\right). (7)

Similar to the update rule of the transmission time, the transmit power of the UAV can be updated as follows:

P(κ)=(P(κ)max+P(κ)min)/2.P^{(\kappa)}=({P^{\max}_{(\kappa)}+P^{\min}_{(\kappa)}})/{2}. (8)

From (8), the transmit power of the UAV decreases over the course of optimization. The reason is that the energy consumed for both local computation and transmission gets smaller, as optimized in the first block phase. And the maximum transmit power of the UAV for the next update step is set as P(κ+1)max=P(κ)minP^{\max}_{(\kappa+1)}=P^{\min}_{(\kappa)}.

For given (𝐭,𝐟,P)(\mathbf{t},\mathbf{f},P), the problem can be equivalently represented as the following feasibility problem:

find\displaystyle\operatorname*{find} {𝐩,𝐛,𝐪,𝐮}\displaystyle\{\mathbf{p},\mathbf{b},\mathbf{q},\mathbf{u}\} (9a)
s.t. bklog2(1+pkg0bkuk)s/tk,k𝒦,\displaystyle b_{k}\log_{2}\left(1+\frac{p_{k}g_{0}}{b_{k}u_{k}}\right)\geq s/t_{k},\forall k\in\mathcal{K}, (9b)
NkζkCkDkfk2+tkpkPβ0uk,k𝒦,\displaystyle{\color[rgb]{0,0,0}N_{k}}\zeta_{k}C_{k}D_{k}f_{k}^{2}+t_{k}p_{k}\leq\frac{P\beta_{0}}{u_{k}},\forall k\in\mathcal{K}, (9c)
uk(H2+𝐪𝐪k22)α/2,k𝒦,\displaystyle u_{k}\geq\left(H^{2}+\|\mathbf{q}-\mathbf{q}_{k}\|_{2}^{2}\right)^{\alpha/2},\forall k\in\mathcal{K}, (9d)
(1e),(1f),(1i).\displaystyle\eqref{OptPrb:power},\eqref{OptPrb:bAllocation},\eqref{OptPrb:coordinate}. (9e)

This problem is non-convex, but the feasible set composed of (1e), (1f), (1i), and (9d) is convex. To convexify (9b), we introduce the following lemma, and its proof can be found in [18].

Lemma 1.

For every x>0x>0, y>0y>0, τ>0\tau>0, x¯>0\bar{x}>0, y¯>0\bar{y}>0, τ¯>0\bar{\tau}>0, we have the following inequality

τln(1+1xy)\displaystyle\tau\ln\left(1+\frac{1}{xy}\right)\geq 2τ¯ln(1+1x¯y¯)+τ¯1+x¯y¯(2xx¯yy¯)\displaystyle 2\bar{\tau}\ln\left(1+\frac{1}{\bar{x}\bar{y}}\right)+\frac{\bar{\tau}}{1+\bar{x}\bar{y}}\left(2-\frac{x}{\bar{x}}-\frac{y}{\bar{y}}\right)
τ¯2ln(1+1/x¯y¯)τ.\displaystyle-\frac{\bar{\tau}^{2}\ln(1+1/\bar{x}\bar{y})}{\tau}. (10)

With regard to constraint (9b), at the update step κ\kappa, applying the inequality (10) with τ=bk,x=uk/pkg0,y=bk,τ¯=bk(κ1),x¯=uk(κ1)/pk(κ1)g0,y¯=bk(κ1)\tau=b_{k},x=u_{k}/p_{k}g_{0},y=b_{k},\bar{\tau}=b_{k}^{(\kappa-1)},\bar{x}=u_{k}^{(\kappa-1)}/p_{k}^{(\kappa-1)}g_{0},\bar{y}=b_{k}^{(\kappa-1)} yields

bklog2(1+pkg0bkuk)\displaystyle b_{k}\log_{2}\left(1+\frac{p_{k}g_{0}}{b_{k}u_{k}}\right)\geq
1ln(2)(λk+μk(2ukpkpk(κ1)uk(κ1)bkbk(κ1))νkbk),\displaystyle\;\frac{1}{\ln(2)}\left(\lambda_{k}+\mu_{k}\left(2-\frac{u_{k}}{p_{k}}\frac{p_{k}^{(\kappa-1)}}{u_{k}^{(\kappa-1)}}-\frac{b_{k}}{b_{k}^{(\kappa-1)}}\right)-\frac{\nu_{k}}{b_{k}}\right), (11)

with

λk=2bk(κ1)ln(1+pk(κ1)g0bk(κ1)uk(κ1)),\displaystyle\lambda_{k}=2b_{k}^{(\kappa-1)}\ln\left(1+\frac{p_{k}^{(\kappa-1)}g_{0}}{b_{k}^{(\kappa-1)}u_{k}^{(\kappa-1)}}\right), (12)
μk=bk(κ1)(1+bk(κ1)uk(κ1)pk(κ1)g0)1,\displaystyle\mu_{k}=b_{k}^{(\kappa-1)}\left(1+\frac{b_{k}^{(\kappa-1)}u_{k}^{(\kappa-1)}}{p_{k}^{(\kappa-1)}g_{0}}\right)^{-1}, (13)
νk=(bk(κ1))2ln(1+pk(κ1)g0bk(κ1)uk(κ1)).\displaystyle\nu_{k}=(b_{k}^{(\kappa-1)})^{2}\ln\left(1+\frac{p_{k}^{(\kappa-1)}g_{0}}{b_{k}^{(\kappa-1)}u_{k}^{(\kappa-1)}}\right). (14)

Further, we have the following inequality

ukpkpk(κ1)uk(κ1)\displaystyle\frac{u_{k}}{p_{k}}\frac{p_{k}^{(\kappa-1)}}{u_{k}^{(\kappa-1)}}
=14((ukuk(κ1)+pk(κ1)pk)2(ukuk(κ1)pk(κ1)pk)2)\displaystyle=\frac{1}{4}\left(\left(\frac{u_{k}}{u_{k}^{(\kappa-1)}}+\frac{p_{k}^{(\kappa-1)}}{p_{k}}\right)^{2}-\left(\frac{u_{k}}{u_{k}^{(\kappa-1)}}-\frac{p_{k}^{(\kappa-1)}}{p_{k}}\right)^{2}\right)
14(ukuk(κ1)+pk(κ1)pk)2πk(κ)(pk,uk).\displaystyle\leq\frac{1}{4}\left(\frac{u_{k}}{u_{k}^{(\kappa-1)}}+\frac{p_{k}^{(\kappa-1)}}{p_{k}}\right)^{2}\triangleq\pi_{k}^{(\kappa)}(p_{k},u_{k}). (15)

Therefore, (III) can be transformed into the following

bklog2(1+pkg0bkuk)\displaystyle b_{k}\log_{2}\left(1+\frac{p_{k}g_{0}}{b_{k}u_{k}}\right)
1ln(2)(λk+μk(2πk(κ)(pk,uk)bkbk(κ1))νkbk)\displaystyle\geq\frac{1}{\ln(2)}\left(\lambda_{k}+\mu_{k}\left(2-\pi_{k}^{(\kappa)}(p_{k},u_{k})-\frac{b_{k}}{b_{k}^{(\kappa-1)}}\right)-\frac{\nu_{k}}{b_{k}}\right)
Rk(κ)(bk,pk,uk).\displaystyle\triangleq R_{k}^{(\kappa)}(b_{k},p_{k},u_{k}). (16)

With regard to (9c), since both sides are convex expressions, this constraint is non-convex. Fortunately, we can leverage the first-order optimality condition to approximate the right hand side of (9c) as follows [19]:

Pβ0ukPβ0uk(κ1)Pβ0(uk(κ1))2(ukuk(κ1))ϕk(κ)(uk).\frac{P\beta_{0}}{u_{k}}\geq\frac{P\beta_{0}}{u_{k}^{(\kappa-1)}}-\frac{P\beta_{0}}{\left(u_{k}^{(\kappa-1)}\right)^{2}}\left(u_{k}-u_{k}^{(\kappa-1)}\right)\triangleq\phi_{k}^{(\kappa)}(u_{k}). (17)

As a result, at the iteration κ\kappa, constraint (9c) of user kk can be approximated as

NkζkCkDkfk2+tkpkϕk(κ)(uk).{{\color[rgb]{0,0,0}N_{k}}\zeta_{k}C_{k}D_{k}f_{k}^{2}}+t_{k}p_{k}\leq\phi_{k}^{(\kappa)}(u_{k}). (18)

To summarize, we solve the following convex problem to update the block variable (𝐩(κ),𝐛(κ),𝐪(κ),𝐮(κ))(\mathbf{p}^{(\kappa)},\mathbf{b}^{(\kappa)},\mathbf{q}^{(\kappa)},\mathbf{u}^{(\kappa)}).

find\displaystyle\operatorname*{find} {𝐩,𝐛,𝐪,𝐮}\displaystyle\{\mathbf{p},\mathbf{b},\mathbf{q},\mathbf{u}\} (19a)
s.t. Rk(κ)(bk,pk,uk)s/tk,k𝒦,\displaystyle R_{k}^{(\kappa)}(b_{k},p_{k},u_{k})\geq s/t_{k},\forall k\in\mathcal{K}, (19b)
NkζkCkDkfk2+tkpkϕk(κ)(uk),k𝒦,\displaystyle{{\color[rgb]{0,0,0}N_{k}}\zeta_{k}C_{k}D_{k}f_{k}^{2}}+t_{k}p_{k}\leq\phi_{k}^{(\kappa)}(u_{k}),\forall k\in\mathcal{K}, (19c)
(1e),(1f),(1i),(9d).\displaystyle\eqref{OptPrb:power},\eqref{OptPrb:bAllocation},\eqref{OptPrb:coordinate},\eqref{OptPrb2:auxvar}. (19d)
Algorithm 1 Proposed Algorithm
1:Set the iteration index κ=0\kappa=0 and initialize a feasible solution (P(κ),𝐩(κ),𝐟(κ),𝐛(κ),𝐭(κ),𝐪(κ))(P^{(\kappa)},\mathbf{p}^{(\kappa)},\mathbf{f}^{(\kappa)},\mathbf{b}^{(\kappa)},\mathbf{t}^{(\kappa)},\mathbf{q}^{(\kappa)}) for the problem (1).
2:repeat
3:     Set κκ+1\kappa\leftarrow\kappa+1.
4:     Update tk(κ)t_{k}^{(\kappa)} according to (5).
5:     Update fk(κ)f_{k}^{(\kappa)} with f¯kmin\underline{f}_{k}^{\min} in (6).
6:     Update PP based on (8).
7:     Solve (19) to update (𝐩(κ),𝐛(κ),𝐪(κ))(\mathbf{p}^{(\kappa)},\mathbf{b}^{(\kappa)},\mathbf{q}^{(\kappa)}).
8:until |P(κ)P(κ1)|/P(κ1)ϵ|P^{(}\kappa)-P^{(}\kappa-1)|/P^{(}\kappa-1)\leq\epsilon.

The proposed algorithm is summarized in Algorithm 1. In each iteration, block variables are updated cyclically. The algorithm stops when the relative tolerance is less than a preset threshold ϵ\epsilon. As the transmit power of the UAV is reduced after each update step, Algorithm 1 converges to the final solution after a number of steps. At each step, the computational complexity of solving the convex problem (19) is 𝒪((3K+2)2(4K+2)2.5+(4K+2)3.5)\mathcal{O}((3K+2)^{2}(4K+2)^{2.5}+(4K+2)^{3.5}) as it involves 4K+24K+2 constraints and 3K+23K+2 variables [18].

IV Simulation Results

For validating our proposed method, namely UAV-SFL, we deploy a network, in which K=25K=25 users are randomly distributed within a circular disc with a radius of 5050 m, and the rotary-wing UAV hovers at a preset altitude of 2020 m. The system bandwidth B=20B=20 MHz, the respective maximum powers of users and the UAV are 1010 dBm and 3636 dBm, and the energy harvesting efficiency η0=0.9\eta_{0}=0.9. For local computation, the maximum and minimum CPU frequencies fkmax=1f_{k}^{\max}=1 GHz and fkmin=0.1f_{k}^{\min}=0.1 GHz for all users, the data size s=100s=100 Kb, the coefficient of chip ζ=1028\zeta=10^{-28}, the local data DkD_{k} and the workload CkC_{k} are randomly selected from the ranges [5,10][5,10] Mb and [10,20][10,20] cycles per bit, the maximum number of local iterations Nk=4N_{k}=4, and the time frame T=8T=8 s.

Refer to caption
Figure 1: Convergence evolution of the proposed algorithm.

Firstly, we present the convergence evolution of UAV-SFL in Fig. 1. This figure shows that Algorithm 1 can converge to the final solution after a reasonable number of iterations. We recall that at each step, we solve the convex feasibility problem (19), and further update time allocation, transmit power of the UAV, and CPU frequencies using linear computations. Consequently, the total complexity of our proposed UAV-SFL algorithm is very reasonable.

Refer to caption
(a)
Refer to caption
(b)
Figure 2: Performance vs. maximum number of local iterations. (a) computation and transmission time. (b) transmit power of the UAV.

Secondly, in Fig. 2a, we show the average transmission and computation time when varying the maximum number of local iterations. The figure indicates that the higher the maximum number of local iteration is, the larger the computation time is, while the smaller the transmission time is. This is reasonable since the computation time is directly proportional to NkN_{k}, while the total completion time is required to be less than the time frame TT, as indicated in (1g). We can also observe from Fig. 2b that the transmit power of the UAV increases as NkN_{k} increases. The reason is that the majority of the total harvested energy is for executing local computation, and thus the UAV should increase its transmit power to ensure that each user can harvest enough energy to perform FL tasks. Therefore, the proposed UAV-SFL approach can adaptively adjust the transmit power of the UAV to wirelessly power users to complete their FL tasks within the time frame.

Refer to caption
(a)
Refer to caption
(b)
Figure 3: Comparison of UAV-SFL with the other approaches.

Finally, we draw a comparison with three other approaches, including 1) a scheme with Fixed CPU Frequencies (labeled as FF), 2) a scheme with Fixed transmission Time (labeled as FT), and 3) a scheme with Fixed UAV Placement (labeled as FUP). From Fig. 3a, the transmit power of the UAV increases as the data size ss increases. This is because the transmission time and users’ transmit power should increase to complete the local model transmission, which consequently demands a higher transmit power of the UAV. The figure also shows that the transmit power of the UAV derived from our UAV-SFL approach via jointly optimizing the UAV placement, time, and CPU frequencies is significantly lower than those of three benchmarks. On average, UAV-SFL is better than FUP, FF, and FT by 32.95%, 63.18%, and 78.81%, respectively. Fig. 3b plots the transmit power of the UAV as a function of the system bandwidth. From this figure, there is just a minor change in the transmit power of the UAV as the system bandwidth increases. This is because the UAV does not change its optimized coordinate much when the system bandwidth changes [18]. With regard to our UAV-SFL scheme, the transmit power of the UAV slightly reduces as the system bandwidth increases. The reason for this is that increasing the system bandwidth results in larger achievable rates, which reduces the energy needed for transmission and consequently decreases the transmit power of the UAV. Again, this figure shows the superiority of the proposed UAV-SFL approach over the benchmarks.

V Conclusion

In this article, we have considered a network model, where the UAV is deployed to wirelessly power users so as to support sustainable FL-based wireless networks. We have developed an efficient algorithm to obtain the solution with a convergence guarantee. Presented simulation results show that the proposed scheme outperforms the benchmarks with fixed CPU frequencies, transmission time and UAV placement. Several promising directions can be stemmed from our current work such as consideration of other multiple-access techniques (e.g., NOMA and RSMA), fixed-wing UAVs, UAV-assisted visible light communications, and multi-UAV WPFNs. Moreover, as this work is considered for one global round, the extension to multiple global rounds is very promising.

References

  • [1] J. Konečný, H. B. McMahan, F. X. Yu, P. Richtarik, A. T. Suresh, and D. Bacon, “Federated learning: Strategies for improving communication efficiency,” in NIPS Workshop on Private Multi-Party Machine Learning, Barcelona, Spain, Dec. 2016, pp. 1–10.
  • [2] T. Yang, G. Andrew, H. Eichner, H. Sun, W. Li, N. Kong, D. Ramage, and F. Beaufays, “Applied federated learning: Improving google keyboard query suggestions,” arXiv preprint arXiv:1812.02903, 2018.
  • [3] M. J. Sheller, B. Edwards, G. A. Reina, J. Martin, S. Pati, A. Kotrotsou, M. Milchenko, W. Xu, D. Marcus, R. R. Colen et al., “Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data,” Scientific reports, vol. 10, no. 1, pp. 1–12, 2020.
  • [4] Y. Liu, X. Yuan, Z. Xiong, J. Kang, X. Wang, and D. Niyato, “Federated learning for 6G communications: Challenges, methods, and future directions,” China Communications, vol. 17, no. 9, pp. 105–118, 2020.
  • [5] W. Y. B. Lim, N. C. Luong, D. T. Hoang, Y. Jiao, Y.-C. Liang, Q. Yang, D. Niyato, and C. Miao, “Federated learning in mobile edge networks: A comprehensive survey,” IEEE Communications Surveys & Tutorials, vol. 22, no. 3, pp. 2031–2063, 2020.
  • [6] L. Xie, J. Xu, and R. Zhang, “Throughput maximization for UAV-enabled wireless powered communication networks,” IEEE Internet of Things Journal, vol. 6, no. 2, pp. 1690–1703, 2019.
  • [7] H. Tran, G. Kaddoum, H. Elgala, C. Abou-Rjeily, and H. Kaushal, “Lightwave power transfer for federated learning-based wireless networks,” IEEE Communications Letters, vol. 24, no. 7, pp. 1472–1476, 2020.
  • [8] Y. Liu, J. Nie, X. Li, S. H. Ahmed, W. Y. B. Lim, and C. Miao, “Federated learning in the sky: Aerial-ground air quality sensing framework with UAV swarms,” IEEE Internet of Things Journal, 2020, in press.
  • [9] Y. Wang, Z. Su, N. Zhang, and A. Benslimane, “Learning in the air: Secure federated learning for UAV-assisted crowdsensing,” IEEE Transactions on Network Science and Engineering, 2020, in press.
  • [10] H. Shiri, J. Park, and M. Bennis, “Communication-efficient massive UAV online path control: Federated learning meets mean-field game theory,” IEEE Transactions on Communications, vol. 68, no. 11, pp. 6840–6857, 2020.
  • [11] F. Zhou, Y. Wu, R. Q. Hu, and Y. Qian, “Computation rate maximization in UAV-enabled wireless-powered mobile-edge computing systems,” IEEE Journal on Selected Areas in Communications, vol. 36, no. 9, pp. 1927–1941, 2018.
  • [12] Z. Yang, W. Xu, and M. Shikh-Bahaei, “Energy efficient UAV communication with energy harvesting,” IEEE Transactions on Vehicular Technology, vol. 69, no. 2, pp. 1913–1927, 2019.
  • [13] T.-T. Nguyen, V.-D. Nguyen, Q.-V. Pham, J.-H. Lee, and Y.-H. Kim, “Resource allocation for AF relaying wireless-powered networks with nonlinear energy harvester,” IEEE Communications Letters, vol. 25, no. 1, pp. 229–233, 2021.
  • [14] Q.-V. Pham, H. T. Nguyen, Z. Han, and W.-J. Hwang, “Coalitional games for computation offloading in NOMA-enabled multi-access edge computing,” IEEE Transactions on Vehicular Technology, vol. 69, no. 2, pp. 1982–1993, 2020.
  • [15] Z. Yang, M. Chen, W. Saad, C. S. Hong, and M. Shikh-Bahaei, “Energy efficient federated learning over wireless communication networks,” IEEE Transactions on Wireless Communications, vol. 20, no. 3, pp. 1935-1949, 2020.
  • [16] M. Chen, Z. Yang, W. Saad, C. Yin, H. V. Poor, and S. Cui, “A joint learning and communications framework for federated learning over wireless networks,” IEEE Transactions on Wireless Communications, vol. 20, no. 1, pp. 269–283, 2021.
  • [17] M. Hong, M. Razaviyayn, Z.-Q. Luo, and J.-S. Pang, “A unified algorithmic framework for block-structured optimization involving big data: With applications in machine learning and signal processing,” IEEE Signal Processing Magazine, vol. 33, no. 1, pp. 57–77, 2016.
  • [18] A. A. Nasir, H. D. Tuan, T. Q. Duong, and H. V. Poor, “UAV-enabled communication using NOMA,” IEEE Transactions on Communications, vol. 67, no. 7, pp. 5126–5138, 2019.
  • [19] S. Boyd, S. P. Boyd, and L. Vandenberghe, Convex optimization.   Cambridge university press, 2004.