This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

QoS-Driven Resource Optimization for Intelligent Fog Radio Access Network: A Dynamic Power Allocation Perspective

J. Yu, and R. Wang are with the School of Electronics and Information Engineering at Tongji University, Shanghai, 201804, P. R. China (e-mails: [email protected], [email protected]). J. Wu is with the School of Computer Science and Technology at Fudan University, Shanghai, 201203, P. R. China (e-mail: [email protected]). Jun Yu, Rui Wang, Senior Member, IEEE, Jun Wu, Senior Member, IEEE
Abstract

The fog radio access network (Fog-RAN) has been considered a promising wireless access architecture to help shorten the communication delay and relieve the large data delivery burden over the backhaul links. However, limited by conventional inflexible communication design, Fog-RAN cannot be used in some complex communication scenarios. In this study, we focus on investigating a more intelligent Fog-RAN to assist the communication in a high-speed railway environment. Due to the train’s continuously moving, the communication should be designed intelligently to adapt to channel variation. Specifically, we dynamically optimize the power allocation in the remote radio heads (RRHs) to minimize the total network power cost considering multiple quality-of-service (QoS) requirements and channel variation. The impact of caching on the power allocation is considered. The dynamic power optimization is analyzed to obtain a closed-form solution in certain cases. The inherent tradeoff among the total network cost, delay and delivery content size is further discussed. To evaluate the performance of the proposed dynamic power allocation, we present an invariant power allocation counterpart as a performance comparison benchmark. The result of our simulation reveals that dynamic power allocation can significantly outperform the invariant power allocation scheme, especially with a random caching strategy or limited caching resources at the RRHs.

Index Terms:
Fog-RAN, caching, dynamic power allocation, quality-of-service.

I Introduction

As an evolution of the conventional cloud radio access network (C-RAN) [1, 2], the fog radio access network (Fog-RAN) has been regarded as a promising new wireless access network architecture, which can help relieve the large data traffic burden in the backhaul links of a cellular network and satisfy the stricter delay requirements in beyond-fifth-generation (B5G) and sixth-generation (6G) wireless communications [3, 4]. The key technology of the Fog-RAN to achieve this performance enhancement is the introduction of caching resources on the edge devices, i.e., the remote radio heads (RRHs), of the access network. With some popular contents cached at the RRHs, the frequently requested data by the terminals can be instantly acquired from the RRHs without needing to be fetched from remote servers. This scenario avoids possible traffic congestion over the backhaul links and further shortens the content delivery delay.

Due to the advantage of the Fog-RAN, the corresponding studies have recently received much attention. Current studies on the Fog-RAN mainly focus on two aspects: performance characterization and resource optimization. Here, the resources include caching, beamforming, power and computation resources. The authors in [5] investigated the delay and energy efficiency of the Fog-RAN by considering the hybrid caching strategy, which combines coded cached, nonpartitioned cached and uncached files. The hybrid caching strategy was further optimized to balance the delay and energy efficiency. Successful transmission probability (STP) is often used a performance metric to characterize the impact of content caching at an RRH. In [6], the authors first derived the STP of the Fog-RAN system with a proactive probabilistic caching strategy. The caching probability for different contents was further optimized to maximize the STP. In [7], Fog-RAN-assisted transmission was studied in a heterogeneous Fog-RAN wireless network. Different from [6], the authors in [7] optimized the caching strategy in both the RRHs and mobile users, aiming to optimize the STP. The power allocation of the Fog-RAN was studied in [8, 9, 10, 11]. In [8], the authors proposed using the nonorthogonal multiple access (NOMA) technique to distinguish multiple users in the Fog-RAN. An improved fractional transmit power allocation algorithm was developed for the three-user NOMA-assisted Fog-RAN system. In [9], the latency minimization problem for the Fog-RAN was formulated with a dynamic user demand. To understand the demand of the user and to intelligently perform the joint optimization of the proactive cache strategy and power allocation, a deep reinforcement learning approach was used. The authors in [10] jointly optimized the power and the model selection for uplink transmission of the Fog-RAN. The reinforcement learning approach was used to solved the nonconvex mixed-integer programming problem. Subchannel assignment and power control were jointly optimized in [11] for a mmWave-based Fog-RAN. The alternative direction method of multipliers (ADMM) was utilized to solve the power control problem.

The beamforming design problem for the Fog-RAN was investigated in [12, 13, 14]. In particular, the uplink Fog-RAN was studied [12], aiming to maximize the delivery rate subject to the constraints, including the backhaul capacity, transmit power, and file size. A two-layer transmission scheme including the cache level and network level was applied for content transmission. Both the centralized algorithm and the decentralized algorithm were proposed for optimizing the beamforming problem. In [13], the authors considered the Fog-RAN with multicast transmission. The beamforming design problem was constructed to minimize the network total cost subject to the power and signal-to-interference-plus-noise ratio (SINR) constraints. The sparse beamforming vector was found with the convex-concave procedure. The authors in [14] further extended the channel model in [13] to a scenario with both multicast and unicast transmission. A branch-and-bound algorithm was utilized to find the global optimal solution.

Computational resource allocation was considered in [15, 16, 17, 18] for performance improvement. In [15], edge/cloud computing and edge computing task migration were included in the design problem to improve the quality-of-service (QoS) at the users. Optimizing the user association and computational offloading, the communication resources and computational resources could be balanced. A reinforcement-learning-based approach was used in [16] to optimize the computational resources with the objective of reducing the latency and energy consumption. In [17], the authors proposed using the computational resources at the edge devices of the Fog-RAN to create cooperative downlink transmission to decrease the latency. An order-optimal upload-download communication latency pair was characterized to evaluate the network performance. In [18], the authors formulated a joint decision problem for communication, caching and computing resources. The problem was modeled as a multiple-choice multidimensional knapsack problem, and was then solved by the Lagrangian dual decomposition approach.

As we summarized above, even though a large number of contributions to the Fog-RAN have been reported, the investigations are still not sufficient to address all challenges under different application scenarios. A key shortcoming of the existing work is that current studied Fog-RAN are not intelligent enough, leading to that it cannot be used in some complex dynamic communication scenarios. A typical scenario is the high-speed railway scenario [19]. The technology of the high-speed railway, identified as a typical wireless communication scenario for future cellular networks, has developed rapidly on a global scale [20]. How to use the Fog-RAN architecture to provide reliable and low-latency communication services is an interesting topic that will require in-depth studies.

To compensate for the deficiencies of the current research on the Fog-RAN, we aim to design a more intelligent Fog-RAN to assist the communication in a high-speed railway environment. In specific, we investigate the Fog-RAN-assisted downlink data transmission in the high-speed railway scenario. Because of the time-varying channel state information, the corresponding studies become more challenging. Although wireless communication in the high-speed railway scenario has received much attention [21, 22, 23, 24, 25], to our knowledge, few works have studied the Fog-RAN in the high-speed railway scenario. This lack motivates the study of this work.

In the considered high-speed railway Fog-RAN, we assume that the train is cooperatively served by multiple RRHs. By taking the time-varying channel into account, we investigate an intelligently dynamic power allocation to minimize the cost of the entire network power, which includes the transmit power at the RRHs and the power consumed over the backhaul links, subject to a few quality-of-service (QoS) requirements. With this goal, the caching placement on the RRHs affects the physical layer power allocation. To find a reasonable solution to the considered nonconvex problem, smoothed l0l_{0}-norm approximation is employed to convert the nonconvex problem into a tractable form. By analyzing the unique channel properties of the considered high-speed railway Fog-RAN, we provide a closed-form solution in certain special cases. With the obtained solution, the inherent tradeoff among the total network cost, delay and delivery content size is further discussed. Moreover, the invariant power allocation counterpart is derived to evaluate the performance of the proposed dynamic power allocation. Simulation results reveal that the intended dynamic power allocation is able to significantly outperform the invariant power allocation scheme. The performance gain is more pronounced when the random caching strategy is used or the caching resources at the RRHs are limited.

The organization of the remainder of the paper is shown below. In Section II, we present the channel model as well as problem formulation of the considered high-speed railway Fog-RAN. In Section III, optimization of the dynamic power allocation is discussed. In Section IV, we analyze the tradeoff among the total network cost, delay and delivery content size. In Section V, we present the invariant power allocation counterpart. In Section VI, extensive numerical results are stated to demonstrate the performance. Finally, we conclude the paper in Section VII.

II Channel Model and Problem Formulation

Refer to caption
Figure 1: Fog-RAN-assisted high-speed communication system.

In this section, we present the channel model of the considered Fog-RAN. After that, the dynamic optimization problem is formulated.

II-A Channel model

As illustrated in Fig. 1, we consider a dynamic wireless transmission scenario of a train running at high speed along a straight-line railway. To provide the required high data rate and low latency downlink transmission, the train is served by a Fog-RAN in which multiple base stations, i.e., RRHs, are uniformly deployed along one side of the road at intervals of the same size. The RRHs are assumed to be connected to the same baseband unit (BBU). The BBU is responsible for the RRH resource allocation as well as the serving-RRH switch. At any time, the train is served by the two nearest RRHs. Because the service repeats periodically for different pairs of neighboring RRHs, with no loss of generality, only one time interval is considered, where the train is assisted by RRH 11 and RRH 22.

It is assumed that the contents requested by the train include LL different popular contents of the same size QQ. All RRHs have a limited local storage size, and all LL contents cannot be cached in a single RRH simultaneously. Suppose FnF_{n} as the local storage size of RRH nn; thus, we have Fn<LQF_{n}<LQ. According to the requested frequency of the contents, the popularity distribution of the contents is denoted by 𝐩=[p1,p2,,pL]{\bf p}=[p_{1},p_{2},\cdots,p_{L}], where pl(0,1)p_{l}\in(0,1) denotes the popularity of content ll. With unchanged generality, in this study, we make the assumption that p1p2pLp_{1}\geq p_{2}\geq\cdots\geq p_{L} and that the content popularity follows a Zipf distribution given by pl=lηl=1Llηp_{l}=\frac{l^{-\eta}}{\sum^{L}_{l=1}l^{-\eta}}, where η\eta denotes the shaping parameter defining the skewness of the popularity distribution [13, 26, 7]. According to the content popularity distribution, the RRHs can store the requested contents with different caching strategies. Two caching strategies that widely used are the popularity-aware caching (PopC) strategy and random caching (RndC) strategy. The PopC strategy allows each RRH to cache the most popular contents until its storage size is fully utilized, while the RndC strategy asks each RRH to cache the contents randomly with identical probabilities regardless of the content popularity distribution. Therefore, to identify the used caching strategy, we define a cache placement matrix C𝔹N×LC\in\mathbb{{B}}^{N\times L} with cn,l0,1c_{n,l}\in{0,1}. Specifically, cn,l=1c_{n,l}=1 occurs if content ll is cached in RRH nn; otherwise, content ll is not cached in RRH nn. To satisfy the RRH storage size constraints, we have l=1Fcn,lQFn,n\sum_{l=1}^{F}c_{n,l}Q\leq F_{n},\forall n.

We consider the Fog-RAN-assisted downlink transmission described in Fig. 1. We assume dd equal intervals between every two RRHs that d0d_{0} is the distance between each RRH and the road, and that hh is the height of the transmit antenna on each RRH. The train moves down the railway at a unchanging velocity vv. To determine the coordinates of the RRHs, we assume that there is an original point oo and the system time when the train passes through oo equals to 0. The coordinates of RRH nn are represented as (ln,d0)(l_{n},d_{0}). During the time interval (0,T](0,T], RRH 11 and 22 serve the train. According to the geometric structure of the system, the distance between RRH nn and the train can be denoted by

dn(t)=(vtln)2+d02+h2,t(0,T].d_{n}(t)=\sqrt{(vt-l_{n})^{2}+d_{0}^{2}+h^{2}},~{}~{}t\in(0,T]. (1)

When t>Tt>T, the BBU coordinates the handoff process, and the train is served by a new set of RRHs. However, because the communications over different time intervals are periodic, with no loss of generality, we next focus only on the dynamic resource allocation over time interval t(0,T]t\in(0,T].

During the time interval t(0,T]t\in(0,T], we assumed that a requested content has to be delivered from the RRHs to the train. Denote x(t)x(t) as the modulated signal for the requested content transmitted in the downlink. Here, we assume that signals transmitted from different RRHs are sent over an orthogonal bandwidth. Moreover, assume that signal x(t)x(t) is a stochastic process with mean of zero and with unit variance. Thus, the baseband signal transmitted from RRH nn at time tt can be represented by

yn(t)=Pn(t)hn(t)x(t)+nn(t),y_{n}(t)=\sqrt{P_{n}(t)}h_{n}(t)x(t)+n_{n}(t), (2)

where Pn(t)P_{n}(t) denotes the instantaneous transmit power at RRH nn, hn(t)h_{n}(t) shows the instantaneous channel coefficient, and nn(t)n_{n}(t) represents the additive complex cycle symmetric Gaussian noise at the train following CN(0,σ2)CN(0,\sigma^{2}). In this study, we assume that the train runs in an open area and that the channel coefficient is dominated by the line-of-sight (LOS) component without any scatter. In this way, the propagation attenuation model can be represented as hn(t)=Gdnα(t)h_{n}(t)=\frac{\sqrt{G}}{d^{\alpha}_{n}(t)}, where GG shows the constant channel gain and α\alpha represents the path-loss exponent.

At the receiver, the received signals can be combined using a maximal ratio combiner. The instantaneous achievable rate at time tt can be given as

C(t)=Blog2(1+n𝒩Pn(t)|hn(t)|2σ2),C(t)=B\log_{2}\left(1+\sum_{n\in\mathcal{N}}\frac{P_{n}(t)|h_{n}(t)|^{2}}{\sigma^{2}}\right), (3)

where 𝒩={1,2}\mathcal{N}=\{1,2\} and BB denotes the frequency bandwidth that is distributed for every channel between an RRH and the train.

II-B Problem formulation

In the considered Fog-RAN-assisted downlink transmission, if the content requested by the train has been cached at the serving RRH, the serving RRH is able to instantly convey the content to the train; if not, fetching the content from the BBU via backhaul links is required for the RRH , which consumes extra backhaul transmission resources. Denote the instantaneous content delivery rate over the backhaul link between RRH nn and the train at time tt as Rn(t)R_{n}(t). The target of dynamic power allocation is to reduce the entire network power cost as much as we can, which includes the backhaul power consumption and RRH convey power consumption. Specifically, assuming that content ll is requested by the train, the backhaul power consumption can be represented as

Costb=0Tn=12β0TPn(t)𝑑t0(1cn,l)Rn(t)dt.{\rm Cost}_{b}=\int_{0}^{T}\sum^{2}_{n=1}\beta||\int_{0}^{T}P_{n}(t)dt||_{0}(1-c_{n,l})R_{n}(t)dt. (4)

Here, we consider that only the backhaul link associated with the active RRH at which the requested content is not cached consumes extra power. In (4), we use the term 0TPn(t)𝑑t0||\int_{0}^{T}P_{n}(t)dt||_{0} to indicate the active RRH, which implies that if Pn(t)P_{n}(t) is not always zero over time period (0,T](0,T], RRH nn is an active RRH. 1cn,l1-c_{n,l} is used to indicate the impact of content ll caching on the backhaul link associated with RRH nn. We observe that if cn,l=1c_{n,l}=1, which means that content ll has been at RRH nn, then 1cn,l=01-c_{n,l}=0 indicates that no extra power is needed for backhaul link nn; otherwise, extra power is required for backhaul link nn. Parameter β\beta in (4) is a constant number establishing a relationship between the rate Rn(t)R_{n}(t) and the cost of power.

In addition, the entire RRH transmit power consumed over time period (0,T](0,T] can be denoted as

Costp=0Tn𝒩Pn(t)dt.{\rm Cost}_{p}=\int_{0}^{T}\sum_{n\in\mathcal{N}}P_{n}(t)dt. (5)

In this way, the network power cost in total can be written as

Cost=Costb+Costp.{\rm Cost}={\rm Cost}_{b}+{\rm Cost}_{p}. (6)

In addition to minimizing the total network power consumption, our dynamic power allocation also considers several QoS-related constraints. The most important one is the delay requirement. Basically, for an RRH that does not cache the requested content, the delay contains two hops: one for the backhaul link and another for the wireless transmission link between this RRH and the train. However, as we here assume that Rn(t)C(t)R_{n}(t)\geq C(t) always succeeds, the instantaneous delay can be written as

τ(t)=1C(t).\tau(t)=\frac{1}{C(t)}. (7)

In brief,the overall dynamic power allocation problem is formulated as

minPn(t)Cost\displaystyle\min\limits_{P_{n}(t)}{\rm Cost} (8a)
s.t.\displaystyle s.t. 1T0TPn(t)𝑑tPn,avgn𝒩\displaystyle\frac{1}{T}\int_{0}^{T}P_{n}(t)dt\leq P_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N} (8e)
τ(t)τmax\displaystyle\tau(t)\leq\tau_{\rm max}
0TC(t)𝑑tQ\displaystyle\int_{0}^{T}C(t)dt\geq Q
Pn(t)0,\displaystyle P_{n}(t)\geq 0,

where (8e) denotes the average power constraint of every RRH, with Pn,avgP_{n,{\rm avg}} being the maximum average power at RRH nn; (8e) denotes the instantaneous transmission delay requirement, with τmax\tau_{\rm max} being the maximum delay requirement; (8e) is the requested content delivery that requires to be completed through the network during the time period (0,T](0,T]; and (8e) shows that the instantaneous power at each time tt should not be negative. Our final objective is to minimize the total network cost through maximizing the instantaneous power at every RRH and the instantaneous transmission rate over each backhaul link.

III Dynamic Power Allocation for High-Speed Railway Fog-RAN

In this section, our solution to (8) is presented. Different from the widely studied conventional static power allocation problem, our considered dynamic is more challenging. The reason includes the following aspects. First, because our optimization problem considered the backhaul consumption, a nonconvex l0l_{0}-norm function is introduced, which results in the nonconvexity of our problem. Moreover, our optimization variable is a function with respect to time tt, and the constraints in problem (8) include an integration form. Basically, the dynamic optimization problems are widely applied in the fields of smart power systems, robotics, etc. [27]. In what follows, by using certain approximations, we present several ways to find the optimal solution to the approximated problem.

To address the nonconvex objective function in (8), we utilize the continuous smooth log-function to approximate the l0l_{0}-norm function as [13]

x0log(xθ+1)log(1θ+1),||x||_{0}\approx\frac{\log\left(\frac{x}{\theta}+1\right)}{\log\left(\frac{1}{\theta}+1\right)}, (9)

where the function of θ\theta is to control the smoothness of the approximation. A smaller value of θ\theta results in a better approximation, while leads to a worse smooth function, vice versa. With (9), the nonconvex part Costb{\rm Cost}_{b} in (8) can be written as

Costb\displaystyle{\rm Cost}_{b}\approx cβn=1N(1cn,f)log(0TPn(t)𝑑t+θθ)\displaystyle c\beta\sum^{N}_{n=1}(1-c_{n,f})\log\left(\frac{\int_{0}^{T}P_{n}(t)dt+\theta}{\theta}\right) (10)
×0TRn(t)dt,\displaystyle\times\int_{0}^{T}R_{n}(t)dt,

where c=1log(1θ+1)c=\frac{1}{\log\left(\frac{1}{\theta}+1\right)}. With (10), the objective function in (8) can be rewritten as

Costappro1\displaystyle{\rm Cost}_{\rm appro1}\approx 0Tn𝒩Pn(t)dt+n𝒩bn\displaystyle\int_{0}^{T}\sum_{n\in\mathcal{N}}P_{n}(t)dt+\sum_{n\in\mathcal{N}}b_{n} (11)
×log(0TPn(t)𝑑t+θθ),\displaystyle\times\log\left(\frac{\int_{0}^{T}P_{n}(t)dt+\theta}{\theta}\right),

where bn=cβ(1cn,f)0TRn(t)𝑑tb_{n}=c\beta(1-c_{n,f})\int_{0}^{T}R_{n}(t)dt.

It is observed that the approximated function Costappro1{\rm Cost}_{\rm appro1} is still nonconvex with respect to Pn(t)P_{n}(t) as it involves the summation of concave functions log(0TPn(t)𝑑t+θθ)\log\left(\frac{\int_{0}^{T}P_{n}(t)dt+\theta}{\theta}\right). To address this problem, we apply the majorization-minimization (MM) theory to find the upper bound of Costappro1{\rm Cost}_{\rm appro1}. Considering that the logarithmic function is a concave function, the MM theory applied in our case involves using its first-order Taylor expansion as the upper bound. Then, in the MM algorithm, a solution to (8) is generated through minimizing the following upper-bounded function:

Costappro20Tn𝒩Pn(t)dt+n𝒩bn[log(θ+0TPn0(t)𝑑tθ)+0TPn(t)𝑑t0TPn0(t)𝑑tθ+0TPn0(t)𝑑t],\begin{split}&{\rm Cost}_{\rm appro2}\approx\int_{0}^{T}\sum_{n\in\mathcal{N}}P_{n}(t)dt+\sum_{n\in\mathcal{N}}b_{n}\bigg{[}\\ &\log\Big{(}\frac{\theta+\int_{0}^{T}P^{0}_{n}(t)dt}{\theta}\Big{)}+\frac{\int_{0}^{T}P_{n}(t)dt-\int_{0}^{T}P^{0}_{n}(t)dt}{\theta+\int_{0}^{T}P^{0}_{n}(t)dt}\bigg{]}\end{split}, (12)

where 0TPn0(t)𝑑t\int_{0}^{T}P^{0}_{n}(t)dt denotes a basis point of the Taylor expansion of log(0TPn(t)𝑑t+θθ)\log\left(\frac{\int_{0}^{T}P_{n}(t)dt+\theta}{\theta}\right).

Assuming kn=1+bn1θ+0TPn0(t)𝑑tk_{n}=1+b_{n}\frac{1}{\theta+\int_{0}^{T}P^{0}_{n}(t)dt},solving (8) finally reduces to solving the following convex problem:

minPn(t)0T(n𝒩knPn(t))𝑑t\displaystyle\min\limits_{P_{n}(t)}\int_{0}^{T}\bigg{(}\sum_{n\in\mathcal{N}}k_{n}P_{n}(t)\bigg{)}dt (13a)
s.t.\displaystyle s.t. 1T0TPn(t)𝑑tPn,avgn𝒩\displaystyle\frac{1}{T}\int_{0}^{T}P_{n}(t)dt\leq P_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N} (13e)
C(t)1τmax\displaystyle C(t)\geq\frac{1}{\tau_{\rm max}}
0TC(t)𝑑tQ\displaystyle\int_{0}^{T}C(t)dt\geq Q
Pn(t)0.\displaystyle P_{n}(t)\geq 0.

For solving (13), firstly, we obtain the following theorem.

Theorem 1.

If TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q, we have C(t)=1τmaxC(t)=\frac{1}{\tau_{\rm max}} at the optimal solution, and the optimal solution of (13) can be represented as

P1(t)=a~0(t)a~2(t)P2(t),P_{1}(t)=\tilde{a}_{0}(t)-\tilde{a}_{2}(t)P_{2}(t), (14)

where a~n(t)\tilde{a}_{n}(t) is as defined in (17). Denote the critical time points {t,t′′}\{t^{\prime},t^{\prime\prime}\} and t~\tilde{t}^{\prime} as defined in (29) and (31), respectively; the optimal P2(t)P_{2}(t) is given in the following four cases:

  • If 0Ta~3(t)𝑑tTP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt\leq TP_{2,{\rm avg}} and B0B\leq 0

    P2(t)={00<t<ta~3(t)ttTP^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&0<t<t^{\prime}\\ \tilde{a}_{3}(t)&t^{\prime}\leq t\leq T\\ \end{array}\right.
  • If 0Ta~3(t)𝑑tTP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt\leq TP_{2,{\rm avg}} and B>0B>0

    P2(t)={00<t<min{t,t′′}a~3(t)min{t,t′′}tTP^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&0<t<\min\{t^{\prime},t^{\prime\prime}\}\\ \tilde{a}_{3}(t)&\min\{t^{\prime},t^{\prime\prime}\}\leq t\leq T\\ \end{array}\right.
  • If 0Ta~3(t)𝑑t>TP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt>TP_{2,{\rm avg}} and B0B\leq 0

    P2(t)={00<t<max{t,t~}a~3(t)max{t,t~}tTP^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&0<t<\max\{t^{\prime},\tilde{t}^{\prime}\}\\ \tilde{a}_{3}(t)&\max\{t^{\prime},\tilde{t}^{\prime}\}\leq t\leq T\\ \end{array}\right. (15)
  • If 0Ta~3(t)𝑑t>TP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt>TP_{2,{\rm avg}} and B0B\leq 0

    • If tt~t^{\prime}\geq\tilde{t}^{\prime}, the optimal solution is given by (15).

    • Otherwise, the optimal solution can be obtained by solving linear program problem (33).

Proof.

It is noted that if TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q succeeds, for condition (13e), we have 0TC(t)𝑑t0T1τmax𝑑t=TτmaxQ\int_{0}^{T}C(t)dt\geq\int_{0}^{T}\frac{1}{\tau_{\rm max}}dt=\frac{T}{\tau_{\rm max}}\geq Q. This implies that condition (13e) in (13) is redundant. Next, we show that at the optimal solution, (13e) must be active. We prove this result using a contradiction. Assume that at the optimal solution of (13), the optimal Pn(t)P_{n}(t) makes C(t)>1τmaxC(t)>\frac{1}{\tau_{\rm max}} at some time tt. Now, we can multiply Pn(t)P_{n}(t) by a coefficient δn(t)(0,1)\delta_{n}(t)\in(0,1), which can further reduce the value of the objective function while not violating the constraints. According to the above analysis, under the condition of TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q, problem (13) is equivalent to

minPn(t)0T(n𝒩knPn(t))𝑑t\displaystyle\min\limits_{P_{n}(t)}\int_{0}^{T}\bigg{(}\sum_{n\in\mathcal{N}}k_{n}P_{n}(t)\bigg{)}dt (16a)
s.t.\displaystyle s.t. 0TPn(t)𝑑tTPn,avgn𝒩\displaystyle\int_{0}^{T}P_{n}(t)dt\leq TP_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N} (16d)
Blog2(1+n𝒩GPn(t)dn(t)ασ2)=1τmax\displaystyle B\log_{2}\Big{(}1+\sum_{n\in\mathcal{N}}\frac{GP_{n}(t)}{d_{n}(t)^{\alpha}\sigma^{2}}\Big{)}=\frac{1}{\tau_{\rm max}}
Pn(t)0,n.\displaystyle P_{n}(t)\geq 0,\forall n.

With the equality constraint (16d), by denoting an(t)=Gdn(t)ασ2a_{n}(t)=\frac{G}{d_{n}(t)^{\alpha}\sigma^{2}}, we have

P1(t)=a~0(t)a~2(t)P2(t),P_{1}(t)=\tilde{a}_{0}(t)-\tilde{a}_{2}(t)P_{2}(t), (17)

where a~0(t)=21Bτmax1a1(t)\tilde{a}_{0}(t)=\frac{2^{\frac{1}{B\tau_{\rm max}}}-1}{a_{1}(t)} and a~2(t)=a2(t)a1(t)\tilde{a}_{2}(t)=\frac{a_{2}(t)}{a_{1}(t)}. Problem (16) can be further transformed into

minP2(t)0T(k2k1a~2(t))P2(t)𝑑t\displaystyle\min\limits_{P_{2}(t)}\int_{0}^{T}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t)\bigg{)}P_{2}(t)dt (18a)
s.t.\displaystyle s.t. 0TP2(t)𝑑tTP2,avg\displaystyle\int_{0}^{T}P_{2}(t)dt\leq TP_{2,{\rm avg}} (18d)
B0Ta~2(t)P2(t)𝑑tA\displaystyle B\leq\int_{0}^{T}\tilde{a}_{2}(t)P_{2}(t)dt\leq A
0P2(t)a~3(t),\displaystyle 0\leq P_{2}(t)\leq\tilde{a}_{3}(t),

where A=0Ta~0(t)𝑑tA=\int_{0}^{T}\tilde{a}_{0}(t)dt, B=0Ta~0(t)𝑑tTP1,avgB=\int_{0}^{T}\tilde{a}_{0}(t)dt-TP_{1,{\rm avg}} and a~3(t)=a~0(t)a~2(t)\tilde{a}_{3}(t)=\frac{\tilde{a}_{0}(t)}{\tilde{a}_{2}(t)}. Constraint (18d) comes from the fact that 00TP1(t)𝑑tTP1,avg0\leq\int_{0}^{T}P_{1}(t)dt\leq TP_{1,{\rm avg}}, while (18d) comes from the fact that P1(t)0P_{1}(t)\geq 0.

According to the definitions of a~3(t)\tilde{a}_{3}(t) and a~2(t)\tilde{a}_{2}(t), we observe that if constraint (18d) is satisfied, we have

0Ta~2(t)P2(t)𝑑t0Ta~2(t)a~3(t)𝑑t=A.\int_{0}^{T}\tilde{a}_{2}(t)P_{2}(t)dt\leq\int_{0}^{T}\tilde{a}_{2}(t)\tilde{a}_{3}(t)dt=A. (19)

Then, problem (18) can be equivalent to

minP2(t)0T(k2k1a~2(t))P2(t)𝑑t\displaystyle\min\limits_{P_{2}(t)}\int_{0}^{T}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t)\bigg{)}P_{2}(t)dt (20a)
s.t.\displaystyle s.t. 0TP2(t)𝑑tTP2,avg\displaystyle\int_{0}^{T}P_{2}(t)dt\leq TP_{2,{\rm avg}} (20d)
0Ta~2(t)P2(t)𝑑tB\displaystyle\int_{0}^{T}\tilde{a}_{2}(t)P_{2}(t)dt\geq B
0P2(t)a~3(t).\displaystyle 0\leq P_{2}(t)\leq\tilde{a}_{3}(t).

To find the analytical solution to (20), we next discuss certain particular cases that help simplify the problem.

Case 1) when 0Ta~3(t)𝑑tTP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt\leq TP_{2,{\rm avg}} and B0B\leq 0 are satisfied.

In this case, constraints (33d) and (33d) are redundant. Problem (20) reduces to

minPn(t)\displaystyle\min\limits_{P_{n}(t)} 0T(k2k1a~2(t))P2(t)𝑑t\displaystyle\int_{0}^{T}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t)\bigg{)}P_{2}(t)dt (21a)
s.t.\displaystyle s.t. 0P2(t)a~3(t).\displaystyle 0\leq P_{2}(t)\leq\tilde{a}_{3}(t). (21b)

It is noted that the optimal solution to (21) depends on the sign of k2k1a~2(t)k_{2}-k_{1}\tilde{a}_{2}(t). To minimize the objective function, the optimal solution to (21) is given as

P2(t)={0t{t|k2k1a~2(t)>0}a~3(t)t{t|k2k1a~2(t)0}.P^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&t\in\{t|k_{2}-k_{1}\tilde{a}_{2}(t)>0\}\\ \tilde{a}_{3}(t)&t\in\{t|k_{2}-k_{1}\tilde{a}_{2}(t)\leq 0\}\\ \end{array}\right.. (22)

Case 2) when 0Ta~3(t)𝑑tTP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt\leq TP_{2,{\rm avg}} and B>0B>0 are satisfied.

In this case, constraint (33d) is redundant. Problem (20) reduces to

minP2(t)0T(k2k1a~2(t))P2(t)𝑑t\displaystyle\min\limits_{P_{2}(t)}\int_{0}^{T}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t)\bigg{)}P_{2}(t)dt (23a)
s.t.\displaystyle s.t. 0Ta~2(t)P2(t)𝑑tB\displaystyle\int_{0}^{T}\tilde{a}_{2}(t)P_{2}(t)dt\geq B (23c)
0P2(t)a~3(t).\displaystyle 0\leq P_{2}(t)\leq\tilde{a}_{3}(t).

Within the time period (0,T](0,T], as the train departs from RRH 11 and move closer to RRH 22, we see that functions k2k1a~2(t)k_{2}-k_{1}\tilde{a}_{2}(t), a~2(t)\tilde{a}_{2}(t), and a~3(t)\tilde{a}_{3}(t) are decreasing, increasing and decreasing functions with respect to tt, respectively. According to this observation, the two critical time points are defined as

t{t|k2k1a~2(t)=0}t′′{t|0Ta~2(t)a~3(t)𝑑t=0Ta~0(t)𝑑t=B}.\begin{split}t^{\prime}&\triangleq\{t|k_{2}-k_{1}\tilde{a}_{2}(t)=0\}\\ t^{\prime\prime}&\triangleq\left\{t|\int_{0}^{T}\tilde{a}_{2}(t)\tilde{a}_{3}(t)dt=\int_{0}^{T}\tilde{a}_{0}(t)dt=B\right\}.\end{split} (24)

It is noted that for the time region (t,T](t^{\prime},T], P2(t)P_{2}(t) should be equal to a~3(t)\tilde{a}_{3}(t) to minimize the value of the objective. Therefore, in the scenario with tt′′t^{\prime}\geq t^{\prime\prime}, the optimal solution is presented as

P2(t)={0t<t′′a~3(t)tt′′.P^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&t<t^{\prime\prime}\\ \tilde{a}_{3}(t)&t\geq t^{\prime\prime}\\ \end{array}\right.. (25)

For the scenario with t<t′′t^{\prime}<t^{\prime\prime}, it is easy to see that for the time region over t(t′′,T]t\in(t^{\prime\prime},T], the optimal P2(t)P_{2}(t) should be equal to a~3(t)\tilde{a}_{3}(t) as given in (25). However, this kind of solution cannot satisfy the constraint (23c). To this end, we need to find a certain P2(t)P_{2}(t) in the time period t(0,t′′]t\in(0,t^{\prime\prime}] to satisfy

0t′′a~2(t)P2(t)𝑑t=Bt′′Ta~0(t)𝑑t.\int_{0}^{t^{\prime\prime}}\tilde{a}_{2}(t)P_{2}(t)dt=B-\int_{t^{\prime\prime}}^{T}\tilde{a}_{0}(t)dt. (26)

Next we show that the optimal solution over t(0,t′′]t\in(0,t^{\prime\prime}] is

P2(t)={0t<ta~3(t)t(t,t′′].P^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&t<t^{\prime}\\ \tilde{a}_{3}(t)&t\in(t^{\prime},t^{\prime\prime}]\\ \end{array}\right.. (27)

We prove this result with contradiction analysis. Assume that there is a new optimal solution P2(t)P^{**}_{2}(t) over t(0,t′′]t\in(0,t^{\prime\prime}] given by

P2(t)={0t<t~<a~3(t)t(t~,t′′].P^{**}_{2}(t)=\left\{\begin{array}[]{cc}0&t<\tilde{t}\\ <\tilde{a}_{3}(t)&t\in(\tilde{t},t^{\prime\prime}]\\ \end{array}\right.. (28)

To satisfy (26), we have t~<t\tilde{t}<t^{\prime}. However, k2k1a~2(t)k_{2}-k_{1}\tilde{a}_{2}(t) is a decreasing function over tt. Hence, 0t′′(k2k1a~2(t))P2(t)𝑑t\int_{0}^{t^{\prime\prime}}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t)\bigg{)}P^{**}_{2}(t)dt must be larger than 0t′′(k2k1a~2(t))P2(t)𝑑t\int_{0}^{t^{\prime\prime}}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t)\bigg{)}P^{*}_{2}(t)dt with P2(t)P^{*}_{2}(t) given in (27), which implies that P2(t)P^{**}_{2}(t) cannot be an optimal solution. According to the above analysis, the optimal solution for Case 2 is denoted as

P2(t)={0t<min{t,t′′}a~3(t)tmin{t,t′′}.P^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&t<\min\{t^{\prime},t^{\prime\prime}\}\\ \tilde{a}_{3}(t)&t\geq\min\{t^{\prime},t^{\prime\prime}\}\\ \end{array}\right.. (29)

Case 3) when 0Ta~3(t)𝑑t>TP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt>TP_{2,{\rm avg}} and B0B\leq 0 are satisfied.

In this case, the constraint (33d) is redundant. Problem (20) reduces to

minP2(t)0T(k2k1a~2(t))P2(t)𝑑t\displaystyle\min\limits_{P_{2}(t)}\int_{0}^{T}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t)\bigg{)}P_{2}(t)dt (30a)
s.t.\displaystyle s.t. 0TP2(t)𝑑tTP2,avg\displaystyle\int_{0}^{T}P_{2}(t)dt\leq TP_{2,{\rm avg}} (30c)
0P2(t)a~3(t).\displaystyle 0\leq P_{2}(t)\leq\tilde{a}_{3}(t).

We define t~\tilde{t}^{\prime} as

t~{t|t~Ta~3(t)𝑑t=TP2,avg}.\tilde{t}^{\prime}\triangleq\left\{t|\int_{\tilde{t}^{\prime}}^{T}\tilde{a}_{3}(t)dt=TP_{2,{\rm avg}}\right\}. (31)

Again, because k2k1a~2(t)k_{2}-k_{1}\tilde{a}_{2}(t) is a decreasing function, similar to the analysis in Case 2, the optimal solution to problem (30) is presented as

P2(t)={0t<max{t,t~}a~3(t)tmax{t,t~}.P^{*}_{2}(t)=\left\{\begin{array}[]{cc}0&t<\max\{t^{\prime},\tilde{t}^{\prime}\}\\ \tilde{a}_{3}(t)&t\geq\max\{t^{\prime},\tilde{t}^{\prime}\}\\ \end{array}\right.. (32)

It is noted that in (32), the critical time point is max{t,t~}\max\{t^{\prime},\tilde{t}^{\prime}\}, which is different from Case 2 due to the inequality constraint (30c).

Case 4) when 0Ta~3(t)𝑑t>TP2,avg\int_{0}^{T}\tilde{a}_{3}(t)dt>TP_{2,{\rm avg}} and B>0B>0 are satisfied.

In this case, according to Lemma 3, the feasibility of problem (20) requires t~t′′\tilde{t}^{\prime}\leq t^{\prime\prime}. Under this condition, if tt~t^{\prime}\geq\tilde{t}^{\prime}, we see that the optimal solution is equal to (29) and that constraint (33d) is redundant. Otherwise, the optimal solution can be approximately obtained through solving the following linear programming:

minP2(tm)m=1M(k2k1a~2(tm))P2(tm)t\displaystyle\min\limits_{P_{2}(t_{m})}\sum^{M}_{m=1}\bigg{(}k_{2}-k_{1}\tilde{a}_{2}(t_{m})\bigg{)}P_{2}(t_{m}){\bigtriangleup t} (33a)
s.t.\displaystyle s.t. m=1MP2(tm)tTP2,avg\displaystyle\sum^{M}_{m=1}P_{2}(t_{m}){\bigtriangleup t}\leq TP_{2,{\rm avg}} (33d)
m=1Ma~2(tm)P2(tm)tB\displaystyle\sum^{M}_{m=1}\tilde{a}_{2}(t_{m})P_{2}(t_{m}){\bigtriangleup t}\geq B
0P2(tm)a~3(tm),m.\displaystyle 0\leq P_{2}(t_{m})\leq\tilde{a}_{3}(t_{m}),\forall m.

The linear problem is obtained by sampling the time period t(0,T]t\in(0,T] as the discrete time points {t1,t2,,tM}\{t_{1},t_{2},\cdots,t_{M}\} with the adjacent sampling point interval given by t{\bigtriangleup t}. It is noted that t{\bigtriangleup t} is sufficiently small; thus, we can obtain an approximately optimal solution via (33).

Now, by combining the solutions of Case 1-Case 4, we obtain the final optimal solution. This completes the proof of Theorem 1.

Consider a special case where the train is assisted by only RRH 11 over the time period (0,T](0,T], i.e., 𝒩={1}\mathcal{N}=\{1\}. Theorem 1 can be reduced to the following lemma.

Lemma 1.

If TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q and 𝒩={1}\mathcal{N}=\{1\}, we have C(t)=1τmaxC(t)=\frac{1}{\tau_{\rm max}} at the optimal solution, and the optimal solution to (13) can be denoted as

P1(t)=(21Bτmax1)d1α(t)σ2G.P_{1}(t)=\left(2^{\frac{1}{B\tau_{\rm max}}}-1\right)\frac{d^{\alpha}_{1}(t)\sigma^{2}}{G}. (34)
Proof.

When 𝒩={1}\mathcal{N}=\{1\}, problem (13) can be written as

minP1(t)0T(k1P1(t))𝑑t\displaystyle\min\limits_{P_{1}(t)}\int_{0}^{T}\bigg{(}k_{1}P_{1}(t)\bigg{)}dt (35a)
s.t.\displaystyle s.t. 1T0TP1(t)𝑑tP1,avg\displaystyle\frac{1}{T}\int_{0}^{T}P_{1}(t)dt\leq P_{1,{\rm avg}}~{}~{} (35d)
C(t)=1τmax\displaystyle C(t)=\frac{1}{\tau_{\rm max}}
P1(t)0.\displaystyle P_{1}(t)\geq 0.

If the problem is feasible, the solution is determined by constraint (35d), which completes the proof of Lemma 1. ∎

If condition TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q is not satisfied, the conclusions shown in Theorem 1 and Lemma 1 will not be applicable. To propose an efficient way to address this problem, we first give the following lemma.

Lemma 2.

If Tτmax<Q\frac{T}{\tau_{\rm max}}<Q, the optimal solution to (13) can be represented as

Pn(t)=Pn,1(t)+Pn,2(t),P_{n}(t)=P_{n,1}(t)+P_{n,2}(t), (36)

where Pn,1(t)P_{n,1}(t) is the optimal solution obtained by solving (16) and Pn,2(t)P_{n,2}(t) is the solution to

minPn,2(t)0T(n𝒩knPn,2(t))𝑑t\displaystyle\min\limits_{P_{n,2}(t)}\int_{0}^{T}\bigg{(}\sum_{n\in\mathcal{N}}k_{n}P_{n,2}(t)\bigg{)}dt (37a)
s.t.\displaystyle s.t. 1T0TPn,2(t)𝑑tbnn𝒩\displaystyle\frac{1}{T}\int_{0}^{T}P_{n,2}(t)dt\leq b_{n}~{}~{}\forall n\in\mathcal{N} (37e)
n𝒩κn(t)Pn,2(t)0\displaystyle\sum_{n\in\mathcal{N}}\kappa_{n}(t)P_{n,2}(t)\geq 0
0TBlog2(cn(t)+n𝒩κn(t)Pn,2(t))𝑑tQ\displaystyle\int_{0}^{T}B\log_{2}\left(c_{n}(t)+\sum_{n\in\mathcal{N}}\kappa_{n}(t)P_{n,2}(t)\right)dt\geq Q~{}~{}~{}~{}~{}~{}~{}
Pn,2(t)Pn,1(t),\displaystyle P_{n,2}(t)\geq-P^{*}_{n,1}(t),

where bn=Pn,avg1T0TPn,1(t)𝑑tb_{n}=P_{n,{\rm avg}}-\frac{1}{T}\int_{0}^{T}P^{*}_{n,1}(t)dt, Gdnα(t)σ2=κn(t)\frac{G}{d^{\alpha}_{n}(t)\sigma^{2}}=\kappa_{n}(t) and cn(t)=1+n=1Nκn(t)Pn,1(t)c_{n}(t)=1+\sum^{N}_{n=1}\kappa_{n}(t)P^{*}_{n,1}(t), with Pn,1(t)P^{*}_{n,1}(t) being the optimal solution to Pn,1(t)P_{n,1}(t).

Proof.

It is noted that for any feasible Pn(t)P_{n}(t) of (13), we can always decompose it into a sum of Pn,1(t)P_{n,1}(t) and Pn,2(t)P_{n,2}(t). Basically, the choice of Pn,1(t)P_{n,1}(t) and Pn,2(t)P_{n,2}(t) can be arbitrary as long as their sum is equal to Pn(t)P_{n}(t). Without reduction of generality, we assume that the term Pn,1(t)P_{n,1}(t) is chosen as the optimal solution to (16), denoted by Pn,1(t)P^{*}_{n,1}(t). Then, problem (13) changes to

minPn,2(t)0T(n𝒩kn(Pn,1(t)+Pn,2(t)))𝑑t\displaystyle\min\limits_{P_{n,2}(t)}\int_{0}^{T}\bigg{(}\sum_{n\in\mathcal{N}}k_{n}(P^{*}_{n,1}(t)+P_{n,2}(t))\bigg{)}dt (38a)
s.t.\displaystyle s.t. 1T0T(Pn,1(t)+Pn,2(t))𝑑tPn,avgn𝒩\displaystyle\frac{1}{T}\int_{0}^{T}\bigg{(}P^{*}_{n,1}(t)+P_{n,2}(t)\bigg{)}dt\leq P_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N}~{}~{}~{}~{}~{}~{}~{} (38e)
Blog2(1+n𝒩κn(t)Pn,1(t)+\displaystyle B\log_{2}\left(1+\sum_{n\in\mathcal{N}}\kappa_{n}(t)P^{*}_{n,1}(t)+\right.
n𝒩κn(t)Pn,2(t))dt1τmax\displaystyle~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}\left.\sum_{n\in\mathcal{N}}\kappa_{n}(t)P_{n,2}(t)\right)dt\geq\frac{1}{\tau_{\rm max}}
0TBlog2(1+n𝒩κn(t)Pn,1(t)+\displaystyle\int_{0}^{T}B\log_{2}\left(1+\sum_{n\in\mathcal{N}}\kappa_{n}(t)P^{*}_{n,1}(t)+\right.
n𝒩κn(t)Pn,2(t))dtQ\displaystyle~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}\left.\sum_{n\in\mathcal{N}}\kappa_{n}(t)P_{n,2}(t)\right)dt\geq Q
Pn,1(t)+Pn,2(t)0.\displaystyle P^{*}_{n,1}(t)+P_{n,2}(t)\geq 0.

Problem (38) involves only solving variable Pn,2(t)P_{n,2}(t). We next rewrite the constraints in (38) by analyzing the value range of Pn,2(t)P_{n,2}(t). Because we set Blog2(1+n𝒩GPn,1(t)dnα(t)σ2)=1τmaxB\log_{2}\left(1+\sum_{n\in\mathcal{N}}\frac{GP^{*}_{n,1}(t)}{d^{\alpha}_{n}(t)\sigma^{2}}\right)=\frac{1}{\tau_{\rm max}}, it is observed that n𝒩κn(t)Pn,2(t)0\sum_{n\in\mathcal{N}}\kappa_{n}(t)P_{n,2}(t)\geq 0 must be satisfied; otherwise, the delay constraint (38e) is violated. Then, by rewriting constraints (38e) and (38e) as (37e) and (37e), respectively, we have Problem (37). This completes the proof of Lemma 2.

With Lemma 2, solving Pn(t)P_{n}(t) from (13) under the condition Tτmax<Q\frac{T}{\tau_{\rm max}}<Q reduces to finding Pn,2(t)P_{n,2}(t) from (37). The result in Lemma 2 will be useful in the performance tradeoff analysis in Section IV.

We can easily observe that problem (37) is a convex problem. Then, to obtain the optimal solution, we construct an algorithm based on the Karush-Kuhn-Tucker (KKT) conditions. To proceed, firstly, the Lagrangian function is presented as

L=0T(n𝒩(knPn,2(t)+μ1,n(Pn,2(t)Tbn))μ2(Blog2(cn(t)+n𝒩κn(t)Pn,2(t))Q))dtμ3(t)n=1Nκn(t)Pn,2(t),\begin{split}L=&\int_{0}^{T}\bigg{(}\sum_{n\in\mathcal{N}}\Big{(}k_{n}P_{n,2}(t)+\mu_{1,n}(P_{n,2}(t)-Tb_{n})\Big{)}\\ &-\mu_{2}\Big{(}B\log_{2}(c_{n}(t)+\sum_{n\in\mathcal{N}}\kappa_{n}(t)P_{n,2}(t))-Q\Big{)}\bigg{)}dt\\ &-\mu_{3}(t)\sum^{N}_{n=1}\kappa_{n}(t)P_{n,2}(t),\end{split} (39)

where μ1,n\mu_{1,n}, μ2\mu_{2} and μ3(t)\mu_{3}(t) are non-negative multipliers related to the constraints (37e), (37e) and (37e), respectively. To minimize the Lagrangian function, it is necessary to differentiate the Lagrangian function with respect to Pn,2(t)P_{n,2}(t) and set the derivative to zero for each time tt, which is,

LPn,2(t)=\displaystyle\frac{\partial L}{\partial P_{n,2}(t)}= kn+μ1,nμ3(t)κn(t)\displaystyle k_{n}+\mu_{1,n}-\mu_{3}(t)\kappa_{n}(t)- (40)
μ2Blog2κn(t)cn(t)+n𝒩κn(t)Pn,2(t)=0.\displaystyle\frac{\mu_{2}B}{\log 2}\frac{\kappa_{n}(t)}{c_{n}(t)+\sum_{n\in\mathcal{N}}\kappa_{n}(t)P_{n,2}(t)}=0.

Through combining with constraint (37e), the solution can be obtained given by

Pn,2(t)=\displaystyle P_{n,2}(t)= [(μ2GBlog2dn(t)ασ2(kn+μ1μ3(t)κn(t))cn(t)\displaystyle\left[\left(\frac{\mu_{2}GB}{\log 2d_{n}(t)^{\alpha}\sigma^{2}(k_{n}+\mu_{1}-\mu_{3}(t)\kappa_{n}(t))}-c_{n}(t)-\right.\right. (41)
mnGPm,2(t)dm(t)ασ2)×dn(t)ασ2G,Pn,1(t)]+.\displaystyle\left.\left.\sum_{m\neq n}\frac{GP_{m,2}(t)}{d_{m}(t)^{\alpha}\sigma^{2}}\right)\times\frac{d_{n}(t)^{\alpha}\sigma^{2}}{G},-P^{*}_{n,1}(t)\right]^{+}.

(41) shows that Pn,2(t)P_{n,2}(t) values with different nn are coupled with each other, and the final solution of Pn,2(t)P_{n,2}(t) can be obtained by iteratively updating them until convergence. During the iteration, Lagrangian multipliers μ1,n\mu_{1,n}, μ2\mu_{2} and μ3(t)\mu_{3}(t) can be obtained via the subgradient technique.

Another way to solve (37) is to sample the time period to generate certain discrete time points. If the time interval between two adjacent discrete points is small enough, the obtained result can approximately be considered as a solution to (37). Denote the discrete time points as {t1,t2,,tM}\{t_{1},t_{2},\cdots,t_{M}\} and the adjacent sampling point interval as t{\bigtriangleup t}; the power Pn,2(tm)P_{n,2}(t_{m}) can be efficiently obtained through solving the following problem:

minPn,2(tm)m=1Mn𝒩knPn,2(tm)t\displaystyle\min\limits_{P_{n,2}(t_{m})}\sum^{M}_{m=1}\sum_{n\in\mathcal{N}}k_{n}P_{n,2}(t_{m}){\bigtriangleup t} (42a)
s.t.\displaystyle s.t. m=1MPn,2(tm)tTbnn𝒩\displaystyle\sum^{M}_{m=1}P_{n,2}(t_{m}){\bigtriangleup t}\leq Tb_{n}~{}~{}\forall n\in\mathcal{N} (42e)
n=1Nκn(tm)Pn,2(tm)0m\displaystyle\sum^{N}_{n=1}\kappa_{n}(t_{m})P_{n,2}(t_{m})\geq 0~{}~{}\forall m
m=1MBlog2(cn(tm)+n=1Nκn(tm)\displaystyle\sum^{M}_{m=1}B\log_{2}\bigg{(}c_{n}(t_{m})+\sum^{N}_{n=1}\kappa_{n}(t_{m})
×Pn,2(tm))tQ\displaystyle\times P_{n,2}(t_{m})\bigg{)}{\bigtriangleup t}\geq Q
Pn,2(tm)Pn,1(tm)m.\displaystyle P_{n,2}(t_{m})\geq-P^{*}_{n,1}(t_{m})~{}\forall m.

It is noted that problem (42) has only one nonlinear convex constraint (42e), and thus can be efficiently solved by an interior point algorithm using the software CVX [28].

To summarize, the algorithm we proposed to solve (13) is as follows:

Algorithm 1 Dynamic power optimization in problem (13)
  • Input: BS interval dd, speed v0v_{0}, BS height h0h_{0}, time interval (0,T](0,T], BS coordinate (ln,d0)(l_{n},d_{0}), content size QQ, local storage size FnF_{n}, cache placement matrix CC, bandwidth BB, noise power σ2\sigma^{2}, backhaul cost ratio β\beta, backhaul rate Rn(t)R_{n}(t), delay requirement τmax\tau_{\rm max}, average power of BS Pn,avgP_{n,{\rm avg}}.

  • Initialization: Basis point of the Taylor expansion of Pn(t)P_{n}(t), that is, Pn0(t)P^{0}_{n}(t).

  • Output: Dynamic power allocation Pn(t)P_{n}(t).

  • While not convergence do

    • Update power Pn(t)P_{n}(t);

      1. 1.

        Update Pn(t)P_{n}(t) using Theorem 1 if TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q;

      2. 2.

        Update Pn(t)P_{n}(t) using KKT conditions or solving (42) if Tτmax<Q\frac{T}{\tau_{\rm max}}<Q;

    • Update Pn0(t)P^{0}_{n}(t) using Pn(t)P_{n}(t);

  • End while

IV Cost, Delay and Delivery Content Size Tradeoff Analysis

In our system design, our target is to minimize the total network cost subject to the delay constraint and total content delivery constraint. Both constraints actually determine the total cost consumed by the system. In fact, there is inherently a tradeoff between the total network cost, delay and delivery content size. In this section, we present the tradeoff analysis of the system, which may help simplify the design of the system.

We first present the tradeoff between the total network cost and the delay requirement for a given delivery content size.

Proposition 1.

In the considered system design problem (13), for a given delivery content size QQ, we have

  • when TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q, the total network cost increases as the delay requirement becomes strict.

  • when Tτmax<Q\frac{T}{\tau_{\rm max}}<Q, if we denote the optimal solution to (13) as Pn(t)P^{*}_{n}(t), which is decomposed into the sum Pn,1(t)+Pn,2(t)P^{*}_{n,1}(t)+P^{*}_{n,2}(t), with Pn,1(t)P^{*}_{n,1}(t) being the optimal power term obtained by solving (16), there exists a delay requirement region

    𝝉maxreg=[1τ,τmax],\displaystyle{\bm{\tau}}^{\rm reg}_{\rm max}=\left[\frac{1}{\tau},\tau_{\rm max}\right], (43)

    where τ=Blog2(1+n𝒩GPn,1(t)dnα(t)σ2+mintn𝒩GPn,2(t)dnα(t)σ2)\tau=B\log_{2}(1+\sum_{n\in\mathcal{N}}\frac{GP^{*}_{n,1}(t)}{d^{\alpha}_{n}(t)\sigma^{2}}+\min_{t}\sum_{n\in\mathcal{N}}\frac{GP^{*}_{n,2}(t)}{d^{\alpha}_{n}(t)\sigma^{2}}) such that the total network cost does not increase with the decrease in delay.

Proof.

Under the condition TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q, the original system design problem (13) is equivalent to problem (16), where we have only the power constraint and the delay constraint . At the optimal solution, the delay constraint is always active. Hence, if we decrease the value of τmax\tau_{\rm max} to achieve a strict delay requirement, the total network cost must be increased.

If under the condition Tτmax<Q\frac{T}{\tau_{\rm max}}<Q, for a given delay requirement τmax\tau_{\rm max}, the optimal solution is Pn(t)P^{*}_{n}(t), which is decomposed into the sum Pn,1(t)+Pn,2(t)P^{*}_{n,1}(t)+P^{*}_{n,2}(t), with Pn,1(t)P^{*}_{n,1}(t). We first prove that the solution Pn(t)P^{*}_{n}(t) with its decomposition Pn,1(t)P^{*}_{n,1}(t) and Pn,2(t)P^{*}_{n,2}(t) is also the solution to (13) for a smaller given delay requirement τmax𝝉maxreg\tau^{\prime}_{\rm max}\in{\bm{\tau}}^{\rm reg}_{\rm max}. Then, we prove that this solution is the optimal solution.

It is noted that as Blog2(1+n𝒩Pn,1(t)|hn(t)|2σ2)=τmaxB\log_{2}\left(1+\sum_{n\in\mathcal{N}}\frac{P^{*}_{n,1}(t)|h_{n}(t)|^{2}}{\sigma^{2}}\right)=\tau_{\rm max}, we have n𝒩Pn,2(t)|hn(t)|2σ20\sum_{n\in\mathcal{N}}\frac{P^{*}_{n,2}(t)|h_{n}(t)|^{2}}{\sigma^{2}}\geq 0, as claimed in Lemma 2. This further produces

Blog2(1+n𝒩Pn(t)|hn(t)|2σ2)\displaystyle B\log_{2}\left(1+\sum_{n\in\mathcal{N}}\frac{P^{*}_{n}(t)|h_{n}(t)|^{2}}{\sigma^{2}}\right) (44)
\displaystyle\geq Blog2(1+n𝒩GPn,1(t)dnα(t)σ2+mintn𝒩GPn,2(t)dnα(t)σ2)\displaystyle B\log_{2}\left(1+\sum_{n\in\mathcal{N}}\frac{GP^{*}_{n,1}(t)}{d^{\alpha}_{n}(t)\sigma^{2}}+\min_{t}\sum_{n\in\mathcal{N}}\frac{GP^{*}_{n,2}(t)}{d^{\alpha}_{n}(t)\sigma^{2}}\right)
\displaystyle\geq 1τmax.\displaystyle\frac{1}{\tau_{\rm max}}.

Thus, if we change the delay requirement τmax\tau_{\rm max} to any smaller value τmax\tau^{\prime}_{\rm max} in region 𝝉maxreg{\bm{\tau}}^{\rm reg}_{\rm max}, Pn(t)P^{*}_{n}(t) can still be a solution to the design problem (13), and they have the same total network cost.

In the following, we prove that Pn(t)P^{*}_{n}(t) is the optimal solution to the design problem (13) with a smaller delay requirement τmax𝝉maxreg\tau^{\prime}_{\rm max}\in{\bm{\tau}}^{\rm reg}_{\rm max}, Pn(t)P^{*}_{n}(t). It is noted that if we decrease the delay requirement τmax\tau_{\rm max} to τmax\tau^{\prime}_{\rm max}, the design problem (13) with delay requirement τmax\tau_{\rm max} has a smaller feasible region than that of the design problem with delay requirement τmax\tau^{\prime}_{\rm max}. Then, the value of the objective function, i.e., the total network cost, of the former cannot be smaller than that of the latter. This indicates that Pn(t)P^{*}_{n}(t) is still the optimal solution to the design problem with delay requirement τmax\tau^{\prime}_{\rm max}, which completes the proof of Proposition . ∎

Next, we discuss the tradeoff between the total network cost and delivery content size for a given delay requirement.

Proposition 2.

In the considered system design problem (13), for a given delay requirement τmax\tau_{\rm max}, we have

  • when QTτmaxQ\leq\frac{T}{\tau_{\rm max}}, increasing the delivery content size will not lead to an increase in the total network cost.

  • when Q>TτmaxQ>\frac{T}{\tau_{\rm max}}, increasing the delivery content size must also increase the total network cost.

Proof.

Under the condition QTτmaxQ\leq\frac{T}{\tau_{\rm max}}, because constraint (13e) is the same as (13), the system cost is determined only by the delay requirement. Therefore, increasing the delivery content size will not lead the total network cost to increase.

Under the condition Q>TτmaxQ>\frac{T}{\tau_{\rm max}}, based on the result presented in Lemma 2, with a constant power Pn,1(t)P^{*}_{n,1}(t), we have to increase the value of n𝒩Pn,2(t)|hn(t)|2σ2\sum_{n\in\mathcal{N}}\frac{P^{*}_{n,2}(t)|h_{n}(t)|^{2}}{\sigma^{2}} to meet constraint (13e) with larger QQ. This leads to a larger total network cost. It is noted that when increasing the size of delivery content in (13), we cannot find a solution to Pn,2(t)P^{*}_{n,2}(t) that satisfies constraint (13e) while not increasing the value of the objective function in (13). We prove this conclusion by using a contradiction statement. Assume that with a delivery content size QQ^{\prime}, the optimal solution to (13) is Pn(t)=Pn,1(t)+Pn,2(t){P^{\prime}}^{*}_{n}(t)={P^{\prime}}^{*}_{n,1}(t)+{P^{\prime}}^{*}_{n,2}(t). Now, assume that with a larger delivery content size Q′′Q^{\prime\prime}, the optimal solution to (13) is P′′n(t)=Pn,1(t)+P′′n,2(t){P^{\prime\prime}}^{*}_{n}(t)={P^{\prime}}^{*}_{n,1}(t)+{P^{\prime\prime}}^{*}_{n,2}(t). If Pn(t){P^{\prime}}^{*}_{n}(t) and P′′n(t){P^{\prime\prime}}^{*}_{n}(t) have the same objective function value in (13), we can always obtain a new solution to (13) with a delivery content size QQ^{\prime} as P~n(t)=Pn,1(t)+αP′′n,2(t)\tilde{{P^{\prime}}}^{*}_{n}(t)={P^{\prime}}^{*}_{n,1}(t)+\alpha{P^{\prime\prime}}^{*}_{n,2}(t), where α\alpha is a positive value smaller than 11. This new solution P~n(t)\tilde{{P^{\prime}}}^{*}_{n}(t) produces a smaller objective function value than Pn(t){P^{\prime}}^{*}_{n}(t). This contradicts the fact that Pn(t){P^{\prime}}^{*}_{n}(t) is the optimal solution, which completes the proof of Proposition 2. ∎

V Invariant Power Optimization with QoS Constraints

As another simple power allocation scheme, we consider a constant power optimization design where power does not vary with the channel. Under this situation, the overall optimization problem can be modified as

minPn\displaystyle\min\limits_{P_{n}} Tn𝒩Pn+\displaystyle T\sum_{n\in\mathcal{N}}P_{n}+
0Tn𝒩βPn0(1cn,f)Rn(t)dt\displaystyle\int_{0}^{T}\sum_{n\in\mathcal{N}}\beta||P_{n}||_{0}(1-c_{n,f})R_{n}(t)dt
s.t.\displaystyle s.t. 0PnPn,avgn𝒩\displaystyle 0\leq P_{n}\leq P_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N} (45e)
1C(t)τmax\displaystyle\frac{1}{C(t)}\leq\tau_{\rm max}
0TC(t)𝑑tQ\displaystyle\int_{0}^{T}C(t)dt\geq Q
Pn(t)0n𝒩,\displaystyle P_{n}(t)\geq 0~{}~{}\forall n\in\mathcal{N},

where C(t)=Blog2(1+n𝒩GPndn(t)ασ2)C(t)=B\log_{2}\left(1+\sum_{n\in\mathcal{N}}\frac{GP_{n}}{d_{n}(t)^{\alpha}\sigma^{2}}\right). Similar to the solution we presented in Section III, we determine PnP_{n} by solving the following problem:

minPnn𝒩knPn\displaystyle\min\limits_{P_{n}}\sum_{n\in\mathcal{N}}k^{\prime}_{n}P_{n} (46a)
s.t.\displaystyle s.t. 0PnPn,avgn𝒩\displaystyle 0\leq P_{n}\leq P_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N} (46d)
1C(t)τmax\displaystyle\frac{1}{C(t)}\leq\tau_{\rm max}
0TC(t)𝑑tQ,\displaystyle\int_{0}^{T}C(t)dt\geq Q,

where kn=T+1log(1/θ+1)β(1cn,f)0TRn(t)𝑑tθ+Pn0k^{\prime}_{n}=T+\frac{1}{\log(1/\theta+1)}\frac{\beta(1-c_{n,f})\int_{0}^{T}R_{n}(t)dt}{\theta+P^{0}_{n}}, with Pn0P^{0}_{n} being a basis point of the Taylor expansion.

To solve (46), two specific cases are considered. If TτmaxQ\frac{T}{\tau_{\rm max}}\geq Q, we have C(t)=1τmaxC(t)=\frac{1}{\tau_{\rm max}}. Then, constraint (46d) is redundant. PnP_{n} can be found by solving

minPnn𝒩knPn\displaystyle\min\limits_{P_{n}}\sum_{n\in\mathcal{N}}k^{\prime}_{n}P_{n} (47a)
s.t.\displaystyle s.t. 0PnPn,avgn𝒩\displaystyle 0\leq P_{n}\leq P_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N} (47c)
n𝒩GPndn(t)ασ221τmaxB1.\displaystyle\sum_{n\in\mathcal{N}}\frac{GP_{n}}{d_{n}(t)^{\alpha}\sigma^{2}}\geq 2^{\frac{1}{\tau_{\rm max}B}}-1.

It is noted in (47) that constraint (47c) should be satisfied for any arbitrary tt, and thus causes (47) to contain infinite constraints. We next solve (47) as problem (42) by sampling the time period at certain discrete time points. Denote the discrete time points {t1,t2,,tM}\{t_{1},t_{2},\cdots,t_{M}\} and the adjacent sampling point interval as t{\bigtriangleup t}; the power PnP_{n} can be efficiently obtained through solving the following linear programming problem:

minPn,1n𝒩knPn\displaystyle\min\limits_{P_{n,1}}\sum_{n\in\mathcal{N}}k^{\prime}_{n}P_{n} (48a)
s.t.\displaystyle s.t. 0PnPn,avgn𝒩\displaystyle 0\leq P_{n}\leq P_{n,{\rm avg}}~{}~{}\forall n\in\mathcal{N} (48c)
n𝒩GPndn(tm)ασ221τmaxB1,i.\displaystyle\sum_{n\in\mathcal{N}}\frac{GP_{n}}{d_{n}(t_{m})^{\alpha}\sigma^{2}}\geq 2^{\frac{1}{\tau_{\rm max}B}}-1,\forall i.

If Tτmax<Q\frac{T}{\tau_{\rm max}}<Q, similar to its dynamic counterpart, the power can be denoted as Pn=Pn,1+Pn,2P_{n}=P_{n,1}+P_{n,2} where Pn,1P_{n,1} is used to activate the constraint (48c) in (48). Then, Pn,2P_{n,2} can be obtained by using the Lagrangian method or time period sampling approach.

VI Numerical Results

In the following, we provide numerical results to show the superiority of using dynamic power allocation in a high-speed moving transmission scenario. In particular, we compare the performance of dynamic power allocation and invariant power optimization with respect to different caching schemes. The parameter settings for our simulation are summarized in TABLE I. Moreover, we consider three caching strategies to illustrate the effect of caching on the performance, that is, PopC, RndC and the noncaching scheme (NonC). Because the PopC caching strategy asks each RRH to cache the most popular contents until its storage is full, based on our parameter settings, RRH 11 and RRH 22 both cache contents {1,2,3,4,5}\{1,2,3,4,5\}. NonC assumes that the RRH has no storage resources and that no content is cached at the RRH.

TABLE I: Parameter settings in simulations
Parameter Notation Value
Height of RRH antennas hh 2020 m
Interval between two RRHs dd 10001000 m
Distance between RRH and road d0d_{0} 100100 m
Coordinates of RRH 11 (l1,d0)(l_{1},d_{0}) (200m,100m)(-200~{}\rm{m},100~{}\rm{m})
Coordinates of RRH 22 (l2,d0)(l_{2},d_{0}) (800m,100m)(800~{}\rm{m},100~{}\rm{m})
Path-loss exponent α\alpha 0.80.8
Channel gain GG 22
Train speed v0v_{0} 200Km/h200~{}\rm{Km/h}
Ratio of backhaul power cost and rate β\beta 2.82.8
Noise power σ2\sigma^{2} 11
Content number LL 1515
Content size QQ 11
RRH storage size FnF_{n} 55
Shaping parameter of content popularity η\eta 11

In Fig. 2, the convergence behavior of the proposed algorithm is illustrated. We can observe that the proposed algorithm converges fast in no more than five iterations. Three curves with different delay requirements also demonstrate that a decrease in delay can significantly increase the total network power cost.

In Fig. 3, the dynamic power is illustrated with the time-varying channel. As the train departs from RRH 1 and moves closer to RRH 2, we see that the channel gain of h1(t)h_{1}(t) decreases as time passes, while the channel gain of h2(t)h_{2}(t) increases with time. To satisfy the QoS requirements, the power at RRH 11 gradually increases to compensate for the channel gain loss of h1(t)h_{1}(t) while the power at RRH 22 remains zero. As the train moves closer to RRH 22, RRH 22 begins to serve it, and the power at RRH 11 becomes zero to maintain a lower network power cost. In particular, when the train gets close to RRH 22, the required power at RRH 22 gradually decreases as the quality of channel h1(t)h_{1}(t) improves.

Refer to caption
Figure 2: Convergence behavior of the proposed Algorithm 1 at an average SNR P1,avgσ2=P2,avgσ2=10\frac{P_{1,{\rm avg}}}{\sigma^{2}}=\frac{P_{2,{\rm avg}}}{\sigma^{2}}=10 dB.
Refer to caption
Figure 3: Power variance with a dynamic channel.
Refer to caption
Figure 4: Total power cost for different delay requirements at an average SNR P1,avgσ2=P2,avgσ2=10\frac{P_{1,{\rm avg}}}{\sigma^{2}}=\frac{P_{2,{\rm avg}}}{\sigma^{2}}=10 dB.
Refer to caption
Figure 5: Total power cost for different delay requirements and η\eta at an average SNR P1,avgσ2=P2,avgσ2=10\frac{P_{1,{\rm avg}}}{\sigma^{2}}=\frac{P_{2,{\rm avg}}}{\sigma^{2}}=10 dB.
Refer to caption
Figure 6: Total power cost for different train speeds at an average SNR P1,avgσ2=P2,avgσ2=10\frac{P_{1,{\rm avg}}}{\sigma^{2}}=\frac{P_{2,{\rm avg}}}{\sigma^{2}}=10 dB.

In Fig. 4, we illustrate the effect of the delay requirements on the total network power cost for different caching strategies. It is observed that a stricter delay requirement enhances the total network power cost and that the proposed dynamic power allocation significantly outperforms the invariant power allocation. Moreover, the PopC caching strategy has the lowest total power cost. The RndC strategy performs worse than the PopC strategy. The NonC scheme has the maximum total power cost. The result implies that caching at the RRH is beneficial for reducing the network power consumption and that caching popular contents is more helpful.

Fig. 5 further illustrates the effect of the shaping parameter of popularity η\eta. In general, a larger η\eta implies a larger popularity difference among the contents. We observe that a larger η\eta produces a lower total network power for the PopC caching strategy. This observation is reasonable as the request contents are very likely to be cached at the RRHs, which can reduce the backhaul power consumption. However, for the RndC strategy, we see that the change in η\eta has little effect on the total network power cost. The main reason is that the RndC strategy does not consider the content popularity and caches all contents with equal probabilities.

In Fig. 6, we show the effect of train speed on the total network power consumption. It is observed that the total network power cost increases with the speed, especially for dynamic power allocation. In the given time period (0,T](0,T], a higher speed implies that a longer distance will be covered by the train. This increases the total power consumption.

VII Conclusions

In this paper, we studied the dynamic power allocation of the Fog-RAN-assisted high-speed railway system. For a given caching strategy, we optimized the instantaneous power allocation at the RRHs with the aim to minimize the network power consumption subject in total to several QoS constraints. By analyzing the dynamic power optimization problem, we derived the analytical power solution. Our results showed that caching at the RRHs can significantly reduce the total network power consumption. More so, the dynamic power allocation is significantly superior to the invariant one, as it takes the time-varying characteristic of the channel into consideration.

Appendix A Feasibility analysis of problem (20)

Lemma 3.

For problem (20), if t~>t′′\tilde{t}^{\prime}>t^{\prime\prime}, the optimization problem is infeasible.

Proof.

With an assumption t~>t′′\tilde{t}^{\prime}>t^{\prime\prime}, we have t′′Ta~3(t)𝑑t>TP2,avg\int_{t^{\prime\prime}}^{T}\tilde{a}_{3}(t)dt>TP_{2,{\rm avg}}. Next, we prove the infeasibility of problem (20) by contradiction analysis. Assume that we have a feasible solution P2(t)P^{\prime}_{2}(t) that satisfies P2(t)<a~3(t)P^{\prime}_{2}(t)<\tilde{a}_{3}(t) for t(t′′,T]t\in(t^{\prime\prime},T] and

0t′′a~2(t)P2(t)𝑑t+t′′Ta~2(t)P2(t)𝑑t=t′′Ta~2(t)a~3(t)𝑑t=B.\int_{0}^{t^{\prime\prime}}\tilde{a}_{2}(t)P^{\prime}_{2}(t)dt+\int_{t^{\prime\prime}}^{T}\tilde{a}_{2}(t)P^{\prime}_{2}(t)dt=\int_{t^{\prime\prime}}^{T}\tilde{a}_{2}(t)\tilde{a}_{3}(t)dt=B. (49)

Because a~2(t)\tilde{a}_{2}(t) is an increasing function, when (49) is satisfied, although t′′TP2(t)𝑑t<TP2,avg\int_{t^{\prime\prime}}^{T}P^{\prime}_{2}(t)dt<TP_{2,{\rm avg}}, we have

0t′′P2(t)𝑑t>t′′Ta~3(t)𝑑tt′′TP2(t)𝑑t>TP2,avgt′′TP2(t)𝑑t,\begin{split}\int_{0}^{t^{\prime\prime}}P^{\prime}_{2}(t)dt&>\int_{t^{\prime\prime}}^{T}\tilde{a}_{3}(t)dt-\int_{t^{\prime\prime}}^{T}P^{\prime}_{2}(t)dt\\ &>TP_{2,{\rm avg}}-\int_{t^{\prime\prime}}^{T}P^{\prime}_{2}(t)dt,\end{split} (50)

which implies that P2(t)P^{\prime}_{2}(t) cannot be a feasible solution. We thus complete the proof of Lemma 3. ∎

References

  • [1] Z. Zhao, M. Peng, Z. Ding, W. Wang, and H. V. Poor, “Cluster content caching: An energy-efficient approach to improve quality of service in cloud radio access networks,” IEEE Journal on Selected Areas in Communications, vol. 34, no. 5, pp. 1207–1221, 2016.
  • [2] D. Yan, R. Wang, E. Liu, and Q. Hou, “Admm-based robust beamforming design for downlink cloud radio access networks,” IEEE Access, vol. 6, pp. 27 912–27 922, 2018.
  • [3] M. Peng and K. Zhang, “Recent advances in fog radio access networks: Performance analysis and radio resource allocation,” IEEE Access, vol. 4, pp. 5003–5009, 2016.
  • [4] Z. Zhao, C. Feng, H. H. Yang, and X. Luo, “Federated-learning-enabled intelligent fog radio access networks: Fundamental theory, key techniques, and future trends,” IEEE Wireless Communications, vol. 27, no. 2, pp. 22–28, 2020.
  • [5] Y. Jiang, C. Wan, M. Tao, F. C. Zheng, P. Zhu, X. Gao, and X. You, “Analysis and optimization of fog radio access networks with hybrid caching: Delay and energy efficiency,” IEEE Transactions on Wireless Communications, pp. 1–1, 2020.
  • [6] R. Wang, R. Li, P. Wang, and E. Liu, “Analysis and optimization of caching in fog radio access networks,” IEEE Transactions on Vehicular Technology, vol. 68, no. 8, pp. 8279–8283, 2019.
  • [7] R. Wang, R. Li, E. Liu, and P. Wang, “Performance analysis and optimization of caching placement in heterogeneous wireless networks,” IEEE Communications Letters, vol. 23, no. 10, pp. 1883–1887, 2019.
  • [8] W. Bai, T. Yao, H. Zhang, and V. C. M. Leung, “Research on channel power allocation of fog wireless access network based on noma,” IEEE Access, vol. 7, pp. 32 867–32 873, 2019.
  • [9] G. M. S. Rahman, M. Peng, S. Yan, and T. Dang, “Learning based joint cache and power allocation in fog radio access networks,” IEEE Transactions on Vehicular Technology, vol. 69, no. 4, pp. 4401–4411, 2020.
  • [10] H. Xiang, M. Peng, Y. Sun, and S. Yan, “Mode selection and resource allocation in sliced fog radio access networks: A reinforcement learning approach,” IEEE Transactions on Vehicular Technology, vol. 69, no. 4, pp. 4271–4284, 2020.
  • [11] H. Zhang, L. Zhu, K. Long, and X. Li, “Energy efficient resource allocation in millimeter-wave-based fog radio access networks,” in 2018 2nd URSI Atlantic Radio Science Meeting (AT-RASC), 2018, pp. 1–4.
  • [12] S. He, C. Qi, Y. Huang, Q. Hou, and A. Nallanathan, “Two-level transmission scheme for cache-enabled fog radio access networks,” IEEE Transactions on Communications, vol. 67, no. 1, pp. 445–456, 2019.
  • [13] M. Tao, E. Chen, H. Zhou, and W. Yu, “Content-centric sparse multicast beamforming for cache-enabled cloud RAN,” IEEE Transactions on Wireless Communications, vol. 15, no. 9, pp. 6118–6131, 2016.
  • [14] E. Chen, M. Tao, and Y. Liu, “Joint base station clustering and beamforming for non-orthogonal multicast and unicast transmission with backhaul constraints,” IEEE Transactions on Wireless Communications, vol. 17, no. 9, pp. 6265–6279, 2018.
  • [15] Y. Ma, H. Wang, J. Xiong, J. Diao, and D. Ma, “Joint allocation on communication and computing resources for fog radio access networks,” IEEE Access, vol. 8, pp. 108 310–108 323, 2020.
  • [16] N. Khumalo, O. Oyerinde, and L. Mfupe, “Reinforcement learning-based computation resource allocation scheme for 5g fog-radio access network,” in 2020 Fifth International Conference on Fog and Mobile Edge Computing (FMEC), 2020, pp. 353–355.
  • [17] K. Li, M. Tao, and Z. Chen, “Exploiting computation replication for mobile edge computing: A fundamental computation-communication tradeoff study,” IEEE Transactions on Wireless Communications, vol. 19, no. 7, pp. 4563–4578, 2020.
  • [18] T. Dang and M. Peng, “Joint radio communication, caching, and computing design for mobile virtual reality delivery in fog radio access networks,” IEEE Journal on Selected Areas in Communications, vol. 37, no. 7, pp. 1594–1607, 2019.
  • [19] B. Ai, K. Guan, M. Rupp, T. Kurner, X. Cheng, X. Yin, Q. Wang, G. Ma, Y. Li, L. Xiong, and J. Ding, “Future railway services-oriented mobile communications network,” IEEE Communications Magazine, vol. 53, no. 10, pp. 78–85, 2015.
  • [20] J. Wu and P. Fan, “A survey on high mobility wireless communications: Challenges, opportunities and solutions,” IEEE Access, vol. 4, pp. 450–476, 2016.
  • [21] P. Muneer and S. M. Sameer, “Joint ML estimation of CFO and channel, and a low complexity turbo equalization technique for high mobility OFDMA uplinks,” IEEE Transactions on Wireless Communications, vol. 14, no. 7, pp. 3642–3654, 2015.
  • [22] J. Wang, H. Zhu, and N. J. Gomes, “Distributed antenna systems for mobile communications in high speed trains,” IEEE Journal on Selected Areas in Communications, vol. 30, no. 4, pp. 675–683, 2012.
  • [23] T. Li, K. Xiong, P. Fan, and K. B. Letaief, “Service-oriented power allocation for high-speed railway wireless communications,” IEEE Access, vol. 5, pp. 8343–8356, 2017.
  • [24] C. Zhang, P. Fan, K. Xiong, and P. Fan, “Optimal power allocation with delay constraint for signal transmission from a moving train to base stations in high-speed railway scenarios,” IEEE Transactions on Vehicular Technology, vol. 64, no. 12, pp. 5775–5788, 2015.
  • [25] X. Liu and D. Qiao, “Location-fair beamforming for high speed railway communication systems,” IEEE Access, vol. 6, pp. 28 632–28 642, 2018.
  • [26] R. Wang, R. Li, P. Wang, and E. Liu, “Analysis and optimization of caching in fog radio access networks,” IEEE Transactions on Vehicular Technology, vol. 68, no. 8, pp. 8279–8283, 2019.
  • [27] A. Simonetto, E. Dall’Anese, S. Paternain, G. Leus, and G. B. Giannakis, “Time-varying convex optimization: Time-structured algorithms and applications,” Proceedings of the IEEE, pp. 1–17, 2020.
  • [28] M. Grant and S. Boyd, “CVX: Matlab software for disciplined convex programming, version 2.1,” http://cvxr.com/cvx, Mar. 2014.