Latency Minimization for mmWave D2D Mobile Edge Computing Systems: Joint Task Allocation and Hybrid Beamforming Design
Abstract
Mobile edge computing (MEC) and millimeter wave (mmWave) communications are capable of significantly reducing the network’s delay and enhancing its capacity. In this paper we investigate a mmWave and device-to-device (D2D) assisted MEC system, in which user A carries out some computational tasks and shares the results with user B with the aid of a base station (BS). We assume partial offloading model and the task can be partitioned into two portions: the first part is computed locally at user A, while the second part is transmitted to the BS and computed by the MEC server. The computational results are then sent to user B through a D2D link and via the link from the BS to user B, respectively. To support computation offloading, both the users and the BS are equipped with multiple antennas and employ analog and digital (A/D) hybrid beamforming. Moreover, we propose a novel two-timescale joint hybrid beamforming and task allocation algorithm to reduce the system latency whilst cut down the required signaling overhead. Specifically, the high-dimensional analog beamforming matrices are updated in a frame-based manner based on the channel state information (CSI) samples, where each frame consists of a number of time slots, while the low-dimensional digital beamforming matrices and the offloading ratio are optimized more frequently relied on the low-dimensional effective channel matrices in each time slot. A stochastic successive convex approximation (SSCA) based algorithm is developed to design the long-term analog beamforming matrices. As for the short-term variables, the digital beamforming matrices are optimized relying on the innovative penalty-concave convex procedure (penalty-CCCP) for handling the mmWave non-linear transmit power constraint, and the offloading ratio can be obtained via the derived closed-form solution. Simulation results verify the effectiveness of the proposed algorithm by comparing the benchmarks.
Index Terms:
Mobile edge computing, D2D, mmWave, latency minimization.I Introduction
Given the rapid growth of computational-intensive mobile applications such as virtual reality (VR) [1], augmented reality (AR) [2], automatic driving [3], and face recognition [4], conventional remote cloud computing centers tend to struggle in meeting the stringent latency requirements of next-generation wireless systems [5]. Mobile edge computing (MEC) - which supports servers at the base station (BS) of cellular networks - has emerged as a promising solution [6, 7]. Thanks to the proximity of the mobile devices to the server, users can directly offload the computational-intensive tasks to the edge server without passing the back-haul networks, which significantly reduces the end-to-end delay and the network burden[8, 9, 10, 11, 12, 13, 14, 15, 16, 17]. Specifically, the works in [8, 9, 10, 11, 12] considered the binary offloading in MEC systems. The authors of [8] studied an energy efficient binary offloading problem and designed optimal scheduling policies for both the mobile execution and cloud execution. An MEC system combined with energy harvesting techniques has been investigated in [9, 10]. In [11], the authors proposed a general framework for offloading tasks from a single user to multiple access points. Moreover, the authors of [12] investigated a joint design problem of the computation offloading decision, the resource allocation, and the content caching strategy. The partial offloading schemes have been proposed to further improve the performance of MEC systems [13, 14, 15, 16, 17]. In [13], an optimal resource allocation scheme has been proposed for a multi-user MEC system based on time-division multiple access (TDMA) and orthogonal frequency-division multiple access (OFDMA), respectively. The authors of [14] studied a multi-user TDMA partial offloading MEC system, and derived the optimal solution to the delay minimization problem. By taking user cooperation into consideration, the authors of [15] investigated an energy-efficient problem for both binary offloading and partial offloading. To improve edge cloud efficiency with limited communication and computation capacities, the collaboration between cloud computing and edge computing was studied in [16, 17]. Furthermore, the authors of [18, 19, 20] investigated the intelligent reflecting surface (IRS) assisted MEC systems to improve the network efficiency.
However, MEC needs frequent data exchange between the mobile devices and the edge server, which requires a large communication capacity of the radio access network. Taking the -degree immersive VR as an example, even under the HEVC video compression rate, a bit rate of up to 1 Gbps [21] is needed to match the million pixel human-eye accuracy, which is challenging for the current 5th generation (5G) mobile communication technology [22]. Therefore, it is necessary to further enhance the system capacity for beyond 5G MEC systems. The millimeter wave (mmWave) and device-to-device (D2D) communications are exactly two promising techniques. MmWave has tremendous spectral resources and can achieve multi-gigabit transmission capacity. Moreover, at this short wavelength it is possible to integrate a large number of antenna elements in a compact space [23], thus achieving significant beamforming gain. D2D communication supports multiplex of the cellular spectrum, which allows mobile devices in proximity to communicate directly. It features high data rate, low latency, and high throughput [24], which fits the communication needs of MEC systems well. As a result, a number of solutions applying D2D techniques to MEC systems have been proposed to enable direct data transmission and computational resource sharing [25, 26, 28, 27].
To the best of our knowledge, the mmWave and D2D assisted MEC has not been well investigated in the literature. Although the D2D-aided MEC systems have been studied in the aforementioned works [25, 26, 28, 27], they assumed sub-6GHZ band and the tremendous mmWave spectral resource has not been considered to further enhance the capacity. Moreover, despite the tremendous benefits brought by the integration of mmWave, D2D and MEC, there are more challenges compared with existing works. To elaborate, 1) The challenges of the physical layer signal processing incurred in the mmWave frequency band, such as hybrid analog and digital (A/D) beamforming [31, 29, 33, 34, 32, 30], associated non-linear power consumption model, CSI acquisition etc. 2) The design of practical protocols to avoid the occurrence of transmission collisions that may happen in simultaneous uplink/downlink and D2D transmissions. 3) The design of efficient algorithms to solve the challenging non-convex optimization problems.
Hence, to fill this research blank and tackle the above challenges, we investigate a mmWave and D2D assisted MEC system, where user A processes the computational tasks to be solved and then shares the results with user B with the aid of a BS. The investigated model is general and its typical application scenarios include vehicle to vehicle communication [35], VR/AR gaming [36], and ultra high definition video transmission [37]. We assume partial offloading model as in [13, 14, 15, 16, 17], i.e., the task of user A can be partitioned into two portions: the first part is computed locally at user A, while the second part is transmitted to the BS and computed by the MEC server. The computation results are then sent to user B through a D2D link and the link from the BS to user B, respectively. In order to support computation offloading, both the users and the BS are equipped with multiple antennas and employ A/D hybrid beamforming. However, directly solving it by using the single-timescale algorithm requires very high complexity and a large amount of CSI feedback. Thus, we propose a novel two-timescale joint hybrid beamforming and task allocation algorithm to reduce the system latency whilst cut down the required signaling overhead. Specifically, the high-dimensional analog beamforming matrices are updated in a frame-based manner based on the channel state information (CSI) samples, where each frame consists of multiple time slots, while the low-dimensional digital beamforming matrices and the offloading ratio are optimized more frequently relied on the low-dimensional effective channel matrices in each time slot. We respectively formulate a long-term weighted ergodic channel capacity maximization problem and a short-term latency minimization problem for practical design. Our main contributions are summarized as follows:
-
•
We study a novel scenario that combines MEC with mmWave and D2D to significantly reduce the delay. We consider the raw data and result transmission in the uplink, the downlink, and the D2D link in details and make practical protocols to avoid collisions.
-
•
For the long-term weighted ergodic channel capacity maximization problem, a stochastic successive convex approximation (SSCA) based algorithm is developed for designing the analog beamforming matrices, which employs surrogate functions to approximate the original problem and converges to a stationary feasible solution.
-
•
Regarding the design of the digital beamforming matrices and offloading ratio, we equivalently decompose the short-term latency minimization problem into several decoupled subproblems. For the subproblems w.r.t. the digital beamforming matrices, an efficient penalty-CCCP based algorithm is proposed to tackle the nonlinear mmWave transmit power constraints. We also develop a low-complexity heuristic algorithm to design the digital beamforming matrices for performance-complexity trade-off.
-
•
The closed-form expressions of offloading ratio are derived based on classified discussion. We compare our proposed joint design algorithm of the task allocation and hybrid beamforming with the conventional algorithms in the simulation. The results verify the effectiveness of our proposed joint design algorithm.
The paper is structured as follows. Section II describes the system model. Section III formulates the two-timescale problem under investigation. The long-term analog beamforming design problem is solved in Section IV while the solutions to the design of the short-term digital beamforming matrices and the optimal offloading ratio are given in Section V. Section VI presents the simulation results and Section VII concludes this paper.
Notations: Scalars, vectors and matrices are denoted by lower case, boldface lower case and boldface upper case letters, respectively. represents an identity matrix and denotes an all-zero matrix. For a matrix , , , , and denote its transpose, conjugate, conjugate transpose, Moore-Penrose inverse and Frobenius norm, respectively. For a square matrix , and denotes its trace and inverse, respectively, while means that is positive (negative) semi-definite. For a vector , represents its Euclidean norm. () denotes the real (imaginary) part of a variable. denotes the absolute value of a complex scalar. denotes the space of complex (real) matrices. denotes the angle operator.
II System Model
In this section, we introduce the investigated system model. As shown in Fig. 1, we consider a system consisting of user A, user B and a BS with a MEC server. User A aims to process computation tasks and share the results with user B with the aid of the BS. We assume partial offloading model as as [13, 14, 15, 16, 17], i.e., user A has a total of bits task to be processed, and this task can be divided into two parts: bits and bits, where denotes the offloading ratio. The first part is transmitted to the BS and computed by the MEC server, and the second part is computed at the local CPU of user A. The computational results are transmitted to user B through the D2D link and the downlink (between the BS and user B), respectively.111It is worth mentioning that unlike the works in [28] where the authors utilize the computation resource of both the MEC server and the D2D users, we do not use the computation resource of user B because transmitting the raw data is time-consuming while the computing capacity at user B has no advantages over that of the BS. Both the users and the BS are equipped with A/D hybrid beamformers and work in mmWave band (The hybrid beamforming architecture of the BS is not plotted here since it is similar with that of the users).
II-A Computation model
In this paper, we adopt a general compression model and denote the computational results as bits, where denotes the compression ratio for the computation task and can be chosen as different values based on the category of the task and the adopted algorithm [38]. Defining , , , and for convenience of notation, where and stand for the local computing capacity and the edge computing capacity (computing capacity of the MEC server), respectively, and , and represent the transmission rates of the uplink (from user A to the BS), downlink and D2D link, respectively. We express different delays as follows,
-
•
The local computing time: .
-
•
The computing time at the MEC server: .
-
•
The offloading time from user A to the BS: .
-
•
The delay for transmitting the computational result from the BS to user B: .
-
•
The delay for transmitting the computational result from user A to user B: .
Let us consider the process of computation and transmission more concretely. As shown in Fig. 2, there are four cases in total.
-
•
Case 1: and . In this case, the local computing at user A finishes before the edge offloading. Thus user A has to wait until the task offloading is over to send the local computing result to user B through the D2D link. Moreover, in this case the transmission of the local computing result ends before the edge computing. Hence, the BS can send the edge computing results to user B directly without waiting until the transmission of D2D link is over.
-
•
Case 2: and . In this case, the local computing at user A also finishes before the edge offloading. Hence, similar with Case 1, user A has to wait until the task offloading is over. However, we consider that the edge computing finishes before the transmission of the local computing result. Under this situation, the BS has to wait until the D2D link transmission is over to send the edge computing results to user B. Otherwise, collisions would happen at user B.
-
•
Case 3: and . In this case, the edge offloading ends before the local computing. Hence user A can send the local computing results to user B through the D2D link directly since the communication resource is available at the moment. Moreover, in this case the edge computing finishes before the D2D transmission of the local computing result from user A to user B. Thus, the BS has to wait until the D2D link transmission is over to send the edge computing result to user B.222It is also possible that the edge computing finishes before the local computing. However, if the BS transmits the edge computing results to user B immediately, collisions may happen because user A does not know when the BS finishes its transmission and may send the local computing results to user B simultaneously. Thus, we assume that user A has a priority to transmit results to user B compared to the BS, even if the computation at the BS ends earlier.
-
•
Case 4: and . In this case, the edge offloading ends before the local computing, and the D2D link transmission finishes before the edge computing. As a result, no wait happens.
According to the four cases discussed above, we obtain the expression for the overall system delay as follows,
(1) |
II-B Communication model
Consider the three mmWave links that adopt hybrid A/D beamforming structures. User A and user B are equipped with , antennas, respectively, and , RF chains, respectively, while the BS has antennas and RF chains. Let , and denote the data symbols that transmitted from user A to the BS, the BS to user B and user A to user B, respectively. The received signal at the uplink, the downlink and the D2D link can be written as333 Here we adopt a single analog beamforming matrix at user A and at user B for different transmission phases, because this scheme can avoid frequent hand-off of the analog beamforming matrices with acceptable performance loss. Moreover, although we introduce the proposed algorithm under this case, it can be readily extended to the situation that there are independent analog beamforming matrices for different transmission stages.
respectively, where and represent the transmitting digital beamforming matrices of user A for the uplink and the D2D link, respectively. represents the long-term transmitting analog beamforming matrices of user A. and represent the receiving digital beamforming matrices of user B for the downlink and the D2D link, respectively. represents the long-term receiving analog beamforming matrices of user B. and represent the receiving and transmitting digital beamforming vectors at the BS, respectively, and and represent the long-term receiving and transmitting analog beamforming matrices at the BS, respectively. , , and denote the channel matrices of the uplink, downlink and D2D link, respectively, , , and denote the zero mean additive white Gaussian noise of the uplink, downlink and D2D link, respectively.
With the above definitions, we write the transmission rate for the uplink , downlink , and D2D link , respectively as (2)-(4)444We do not include the receiving digital beamformers in the rate expressions because it is well-known that the optimal digital receivers (i.e., minimum mean square error (MMSE) receivers) can achieve the maximum system rate, see [48] for more details.,
(2) | |||||
(3) | |||||
(4) |
where , and represent the bandwidth of the uplink, downlink and D2D link, respectively.
In practice, the relationship between the circuit power and the output power may be non-linear due to the working mode of RF power amplifiers (PA) in mmWave band [39]. Hence, it is necessary to take the non-linear energy efficiency of PAs into consideration. Specifically, we consider the Doherty PA in this paper, which is one of the most widely used PA architecture in high frequency band that has enhanced energy efficiency and linearity [40]. The relationship between the output power and the actual PA power consumption is given by [41]
(5) |
where is the maximum output power of the PA. In order to compute the total power consumption of all PAs, we must calculate the output power of each PA first, which is given by
(6) |
(7) |
(8) |
where and represent the output power of the th PA of user A in the uplink and D2D link, respectively, and represents the output power of the th PA of the BS in the downlink. By substituting (6)-(8) into (5), the power consumption of each PA, i.e. , and can be obtained.
III Problem Formulation
Based on the system model introduced above, we provide the two-timescale latency minimization problem in this section. We first introduce the proposed two-timescale scheme. Then, we formulate the long-term optimization problem and the short-term optimization problem, respectively. For the former, we seek to maximize the weighted ergodic channel capacity. As for the latter, we minimize the overall system latency. The details are given as follows.
III-A Two-timescale scheme
In a typical mobile radio environment, the channel matrices appearing in the system model of Section II will exhibit a random behavior and change more or less rapidly over time. The joint design of A/D hybrid beamforming matrices and offloading ratio for each channel instance is not realistic for implementation since as it requires repeated application of a search-based algorithm with extremely high computational complexity [42]. Moreover, this approach entails a huge amount of overhead in the estimation and exchange of real-time CSI information, and is likely to be very sensitive to CSI delays. Therefore, to circumvent these difficulties, we hereby propose a practical two-timescale hybrid beamforming scheme that takes into account changes in both the instantaneous CSI and their local statistics. Let us define the following concepts of timescales:
-
•
Long-timescale: The time interval over which the channel statistics555In this work, channel statistics refer to the moments or distribution of the channel fading realizations. The proposed long-term beamforming design only has to obtain a single (potentially outdated) channel sample at each frame. By observing one channel sample at each time, our proposed algorithm can automatically learn the channel statistics and converge to a stationary point of the considered stochastic optimization problem. are assumed constant.
-
•
Short-timescale: The time interval over which the channel gains are assumed constant, i.e. the channel coherence time.
As illustrated in Fig. 3, the time domain is divided into a number of super frames within which the channel statistics are invariant. Each super frame consist of frames, and each frame is further divided into time slots. In our proposed approach, to reduce CSI overhead, we only make use of one complete estimated CSI at the end of each frame, while we employ a so-called effective CSI (the multiplication of the analog beamforming matrices and the CSI matrices, take the uplink as an example, while ) with reduce dimension within each time slot. The short-timescale digital beamforming matrices and the offloading ratio are optimized in each time slot by using the effective real-time channel matrices with reduced dimension, and the long-timescale analog beamforming matrices are updated at the end of each frame based on estimated (possibly outdated) CSI. In the following, we formulate the long-term optimization problem and the short-term optimization problem, respectively.
Remark 1.
We assume the tasks can be finished within a channel coherence time. If the user has too many tasks and cannot finish in a single time slot, then he can allocate his tasks to multiple time slots and ensure as much tasks being finished in a time slot as possible. Hence we focus on the latency minimization in a single time slot in this paper.
III-B Problem formulation
Note that the long-timescale analog beamforming matrices should be optimized based on the CSI statistics over a long-term scale, and we cannot directly optimize them by minimizing the system latency that relies on the optimal digital beamforming matrices and offloading ratio for all channel realizations. To overcome this difficulty, we propose to optimize the analog beamforming matrices by maximizing the weighted ergodic sum capacity, that does not depend on the short-term variables. Then, we minimize the system latency by optimizing the digital beamforming matrices and offloading ratio in each time slot.666Note that it makes sense that the maximization of the channel capacity by designing long-term analog beamforming matrices can help minimize the system latency and this formulation is more suitable for practical design. Moreover, we validate the effectiveness of the proposed algorithm in our simulation.
1) The long-term master problem for designing analog beamforming matrices yields
(9) |
where we define , , and for convenience to meet the unit modulus constraint, and the weight and can be empirically chosen based on the transmission tasks of the corresponding link. Specifically, we choose the weight as , where , and denote the total number of transmission bits of the uplink, the downlink and the D2D link in the last super frame. Please note that this formulation is quite similar with that of maximizing the queue-length-weighted sum rate, which is widely adopted in the area of wireless resource scheduling [43, 44, 45], and it is also reasonable to use the number of transmission data in the last super frame because this information is available at the BS and the statistics between two adjacent super frames are much alike. By defining
(10) | |||||
(11) | |||||
(12) |
then, the expressions of the ergodic channel capacity for the link between user A and the BS, the link between the BS and user B, and the link between user A and user B, are given by , , and , respectively [46], and .
2) The short-term optimization problem for designing the digital beamforming matrices and offloading ratio can be expressed as
(13a) | ||||
s.t. | (13b) | |||
(13c) | ||||
(13d) | ||||
(13e) |
where denotes the set of the short-term optimization variables. (13c) and (13d) denote the transmit power constraints at user A and the BS, respectively. (13e) denotes the output power constraints of the PAs.
IV Long-term analog beamforming design
In this section, we introduce the proposed long-term analog beamforming design algorithm for solving . The original problem cannot be solved straightforwardly due to the stochastic and non-convex objective function. However, based on the theoretical framework exposed in [47], we seek to approximate the original objective function (9) by using a quadratic surrogate function. Specifically, at the end of each channel frame , the channel samples , and are obtained and the surrogate objective function is updated based on these channel samples and the approximated gradients as follows,
(14) |
where is a constant, and , , , and denote the approximation of the objective function , the partial derivatives , , and , respectively. The quantities can be updated based on the following expressions
(15) |
(16) |
(17) |
(18) |
(19) |
with initial value , , , and . The expressions of the partial derivatives are given in Appendix A, and is a sequence of the parameters to be properly chosen. Subsequently, we aim to solve the approximated problem at time frame , which is given by
(20) |
It is readily seen that this is a convex problem and the solution is given by
(21) |
Then, the long-term variables are updated as
(22) |
where similarly denotes a sequence of parameters. Based on [47], the convergence can be guaranteed if we choose and by following the conditions
(23) |
Then the proposed SSCA-based algorithm can be guaranteed to converge to a stationary solution of . We summarize the proposed long-term analog beamforming design algorithm in Algorithm 1, and its complexity is dominated by the procedure of updating the surrogate functions, which is given by .
V short-term digital beamforming and offloading ratio design
In this section, we introduce the proposed algorithm for solving 2. We first decompose 2 into several subproblems that are easier to solve. Then, for the subproblems regarding the digital beamforming design, we propose a penalty-CCCP based algorithm to handle the non-linear power consumption constraints. As for the subproblems regarding the offloading ratio design, we derive closed-form solution via classification and discussion. The details are as follows.
V-A Problem decomposition
Due to the fact that the transmissions occur over orthogonal time in a single time slot, the transmission rates of the uplink, downlink and D2D link, i.e. , , are independent of each other. Furthermore, since the overall system delay (13a) is nonincreasing with , and , we can maximize the transmission rates first, and then optimize the offloading ratio. Hence the short-term latency minimization problem 2 can be equivalently decomposed into the following two parts.
-
•
The digital beamforming design subproblems for the uplink, the downlink and the D2D link, respectively, are provided as:
(24a) s.t. (24b) (24c) (25a) s.t. (25b) (25c) (26a) s.t. (26b) (26c) -
•
The offloading ratio optimization subproblem is given by:
(27)
Although we have decomposed the original problem into several more tractable one, there are still some challenges, i.e. the non-linear power consumption constraint in and the multi-case piece-wise objective function in . In the following two subsections, we introduce our proposed algorithms for tackling these issues.
V-B Short-term digital beamforming design
In this subsection, we introduce the proposed short-term digital beamformer design algorithm for solving . Since these subproblems are essentially the same, we focus on the uplink to introduce our proposed algorithm. First, we equivalently transform into a more tractable form based on the celebrated weighted minimum mean square error (WMMSE) method [48] as follows,
(28a) | ||||
s.t. | (28b) |
where is an auxiliary variable satisfying and
(29) |
Then, to tackle the non-linear power constraint, we introduce two auxiliary variables and , and equivalently convert (28) as
(30a) | ||||
s.t. | (30b) | |||
(30c) | ||||
(30d) | ||||
(30e) | ||||
(30f) |
where is the set of optimization variables and ,, while is defined as
(31) |
Then, we solve the transformed problem based on the penalty-CCCP framework, the detailed introduction of which can be found in [49]. By penalizing the equality constraints (30b), (30e) and (30f) into the objective function (30a), we obtain the penalized problem shown as follows.
(32a) | ||||
s.t. |
where denotes a penalty coefficient. Referring to the penalty-CCCP, the proposed algorithm contains two loops, where the penalty coefficient is adjusted in the outer loop, while in the inner loop the optimization variables are updated in a block coordinate descent fashion. In each block of the inner loop, we aim to decompose the resulting penalized problem into a number of subproblems, which can be solved easily in parallel. To this end, we divide the optimization variables into three blocks, i.e. , , . The detailed solutions within each block are given in Appendix B and we summarize the proposed penalty-CCCP based algorithm for uplink short-term digital beamforming design in Algorithm 2. The complexity is given by , where and denote the iteration numbers for the outer and inner loops, respectively.
Although the computational complexity in a single iteration is low, the required iteration number may be large for the double loop nature of the penalty-CCCP. Hence, we also propose a low-complexity heuristic algorithm for the short-term digital beamformer design. Ignoring the non-linear power constraint, the uplink digital beamforming design problem yields
(33a) | ||||
s.t. | (33b) |
where . Since the long-term analog beamformer is fixed, we further view as the effective channel matrix, where and are the diagonal singular value matrix and the right singular vector matrix of , respectively. This problem has a well-known water-filling solution which is given by
(34) |
where is the set of the right singular vectors corresponding to the largest singular values of , and is the diagonal power allocation matrix. Since this low-complexity algorithm does not consider the nonlinear transmit power constraint directly, we scale the digital beamforming matrix to satisfy constraint (24b) and (24c), where the scaling factor can be conveniently found by using the bisection search. The computational complexity of the proposed low-complexity short-term digital beamforming design algorithm is given by .
V-C Optimization of offloading ratio
In this subsection, we aim to optimize the offloading ratio by solving 4. Referring to the analysis of different cases for timelines shown in Fig. 2, it is readily seen that is a linear piece-wise function of , and we can derive the optimal expression for through classification and analysis. Based on the four cases shown in Fig. 2, let us rewrite the expression of with being the variable as (35). In order to derive the optimal , we need to analyze all possible situations. It is apparent that when grows from 0 to 1, Case 1 and Case 3 will happen for sure, while the occurrence of Case 2 depends on the condition of , which can be simplified to . Similar to Case 2, the occurrence of Case 4 depends on , which can be simplified to . As we can see, the criteria of Case 2 and Case 4 contradict each other and thus only one of them can appear. Hence, there are two possible situations in general.
(35a) | |||||
(35b) | |||||
(35c) | |||||
(35d) |
i) Situation A:
In this situation, Case 2 does not happen. Thus, the expression of consists of (35c), (35d) and (35a) in order as increases from 0 to 1. However, note that the line function (35a) is the same as (35d). Therefore, the expression of in situation A essentially consists of two line segments. It is apparent that (35a) or (35d) is nondecreasing, while the monotonicity of (35c) depends on whether is positive or negative:
-
1)
: In this case, (35c) is a nondecreasing function of , thus the whole function is nondecreasing and we have .
-
2)
: In this case, the expression of consists of a decreasing line followed by an increasing one, thus the optimal should be at the turning point, i.e. .
ii) Situation B:
In this situation, Case 4 does not happen. So the expression of consists of (35c), (35b) and (35a) in order. Since (35a) is a nondecreasing line segment, we can conclude that . Then, let us focus on the monotonicity of (35c) and (35b). There are four kinds of situations in total:
-
1)
and : In this case, is monotonically decreasing when . Thus we obtain .
-
2)
and : In this case, is decreasing first when and then increasing when . Thus we obtain .
-
3)
and : This case is impossible because from we obtain while from we obtain .
-
4)
and : In this case, is nondecreasing in the whole feasible region of . Thus we obtain .
By following the above discussion, we obtain the final result of the optimal . The above analysis is summarized in a flowchart given as Fig. 4.
The complexity of the proposed short-term variables design algorithm for solving is dominated by the penalty-CCCP algorithm, whose complexity is given above. Regarding the convergence, according to the detailed convergence analysis of the penalty-CCCP algorithm [49], the proposed short-term digital beamforming algorithms converge to the stationary solutions of , and . Moreover, considering the optimality of the derived offloading ratio and the fact that is non-increasing with the transmission rate , and . The proposed joint short-term digital beamforming and offloading ratio design algorithm converge to a stationary point of .
Remark 2.
We assume that the design algorithm is implemented at the BS. Specifically, in each time slot, the effective uplink CSI matrix is estimated at the BS. The effective downlink CSI matrix and D2D CSI matrix are estimated at user B and fed back to the BS. Moreover, user A sends the necessary information such as the local computing capacity and the task compression ratio to the BS and the BS conducts the short-term digital beamforming and task allocation algorithm. In each frame, the full channel samples , and are collected by the BS using the similar channel estimation strategy and Algorithm 1 is then performed.
VI Simulation results
In this section, we present simulation results to verify the effectiveness of our proposed algorithm. The simulation parameters are provided as follows unless otherwise stated. The number of antennas at the BS server is set as , while the number of antennas at the users is . The number of RF chains at the BS is , with user RF chain numbers set as . We set the number of data streams as , and , respectively. For the mmWave channel model, we employ the generally used extended Salch-Valenzuela geometric model [50]. Specifically, the channel matrix is given by
(36) |
where and are the number of transmit and receive antennas, respectively. is the number of distinguishable paths, is the complex gain of the th path, and are the receive and transmit antenna array response vectors, where and are the azimuth angles of arrival and departure, respectively. is the maximum Doppler shift, and is the delay. The expression of the response vector is given by
(37) |
where , is the wavelength and is the antenna spacing set as . We assume that there are 1 line-of-sight (LOS) path and 15 non-line-of-sight (NLOS) path, and the gain for the LOS path is , while the gain for the NLOS path is . We set the Doppler shift as and the transmission delay as according to [42].
The BS is located at and the positions of user A and user B are set to and , respectively, with and . The path loss is modeled as , where is the path loss at the reference distance and is set to dB, is the link distance, and is the path loss exponent where we set it for the uplink, the downlink and the D2D link as , and , respectively. The power of additive white Gaussian noise is assumed to be and the power budgets of the BS and user A are and , respectively. The maximum power output of the PA is set to and we assume that the bandwidth of the three links are MHz [51]. The number of computation tasks is bits and the compression ratio is chosen as , which is a typical value for tasks like such as MPEG4 video (2D data and point cloud data) compression [52]. The computation capacities of the local CPU and the edge server are Mbps and Mbps, respectively. The number of frames in a channel statics coherence time and the number of time slots in a frame is set to and , respectively. As for the algorithm parameters, the long-term analog beamforming design algorithm is updated based on and . For the short-term digital beamforming design, the tolerance of accuracy is set as , and . The initial value of the penalty coefficient is and the control parameter is .
We first study the convergence behavior of the proposed algorithm. Fig. 5 presents the convergence performance of the proposed long-term analog beamforming design Algorithm 1. The left y axis shows the value of the weighted ergodic channel capacity, which converges rapidly within 100 iterations. We also provide the value of the system latency, which is shown along the right y axis. As we can see, the system latency converges almost synchronously with the weighted ergodic capacity and decreases significantly when the iteration number increases, which validates the effectiveness of convergence for the proposed long-term algorithm. Fig. 6 presents the convergence performance of the proposed short-term digital beamforming design Algorithm 2. As shown in Fig. 6LABEL:sub@fig:3a, the objective function converges within about 40 iterations. Fig. 6LABEL:sub@fig:3b shows the penalty terms versus the number of iterations, which finally decreases to a level less than , indicating that the constraint is satisfied at the convergence point, thereby verifying the effectiveness of the proposed penalty-CCCP algorithm for handling the non-linear power constraint.


In the following, we analyze the performance of our proposed two-timescale joint hybrid beamforming and offloading ratio design algorithm. We provide the following benchmarks for comparison,
-
•
Two-timescale heuristic beamforming: This scheme adopts Algorithm 1 for the long-term analog beamforming design and the derived optimal solution for the offloading ratio design, and the proposed heuristic low-complexity algorithm is employed for the short-term digital beamforming design.
-
•
Two-timescale binary offloading: This scheme adopts Algorithm 1 for the long-term analog beamforming design and Algorithm 2 for the short-term digital beamforming design. Moreover, the binary offloading strategy, i.e. selecting the one that has the lowest delay between the local computing scheme () and the edge computing scheme (), is employed.
-
•
Single-timescale OMP: This scheme adopts the OMP algorithm [29] for the hybrid beamforming design and the optimal solution for the offloading ratio design in each time slot.
-
•
Single-timescale CM: Similar to the single-timescale OMP, however, it employs the CM algorithm [30] for the A/D hybrid beamforming design.
-
•
Single-timescale AO: Similar to the single-timescale OMP, however, it employs the AO algorithm [31] for the A/D hybrid beamforming design.
We assume that the CSI delay is proportional to the size of the required CSI matrix as [53]. Specifically, the size of the CSI overhead for the two-timescale algorithm in a frame is given by , where is the number of quantization bits for each element of the CSI matrices, while that of the single-timescale algorithm is given by . Fig. 7 illustrates the CSI overhead versus the number of antennas at the BS . We can see that the proposed two-timescale algorithm remarkably reduces the required signaling overhead, especially when is large. In the simulation, we set the CSI delay of the single-timescale algorithm as ms. Then, the CSI delay of the two-timescale algorithm can be computed as ms.
Fig. 8 shows the latency performance of different algorithms versus the CSI delay. As we can see, the proposed two-timescale algorithms vary slightly when the CSI delay increases, while the conventional single-timescale algorithms degrade dramatically with the CSI delay. We also observe that the proposed algorithm outperforms the other compared algorithms when the CSI delay is larger than ms. Fig. 9 compares the delay of different algorithms versus the transmit power of user A, i.e., . We observe that our proposed algorithm provides evident superiority over the single-timescale algorithms and the binary offloading algorithm. When the transmit power is large, the proposed heuristic short-term beamforming algorithm achieves close performance as the proposed short-term penalty-CCCP based algorithm. However, when the transmit power is small, the gap between the heuristic low-complexity algorithm and the penalty-CCCP based algorithm becomes large for the impact of the non-linear PAs becomes evident.
Fig. 10 presents the latency of various algorithms when the user position changes. We observe that our proposed algorithm still achieves the lowest latency performance, and as the distance between the BS and the users increases, the latency of different schemes gradually converges to the small level. This is not hard to understand because when the users are far from the BS, the users will offload less tasks to the edge server, for transmitting the raw data through the uplink is time-consuming. When the distance is quite large, the offloading ratio will be close to 0, i.e. the local computing scheme. In fact, by observing the two-timescale binary offloading algorithm, we can find that the local computing scheme starts to outperform the edge computing scheme if is larger than 80m.
Fig. 11 indicates the delay of different algorithms versus the computation resource ratio , where and is fixed to 200MHz. We present the latency of our proposed algorithm under two transmit power settings, i.e. mW and mW. As we can see, when mW, the proposed algorithm and the binary offloading algorithm vary slightly, even when is large. However, when mW, the proposed algorithm and the binary offloading algorithm decrease evidently with . This is because when the system latency is limited by the transmission rate, simply increasing the computing resource will not significantly reduce the delay, which motivates us to alleviate the system bottleneck instead of simply raising the edge computing capacity.
Fig. 12 shows the latency performance of the analyzed algorithms versus different quantization bits of the analog beamformer. We observe that latency of all algorithms decreases with the increasing of quantization bits. Moreover, the proposed algorithm only needs 4 or 5 quantization bits to achieve near performance with that of infinite quantization level, which means that the developed algorithm is efficient in practice.
Fig. 13 illustrates the delay of different algorithms versus the Rician factor , which is defined as . It is observed that when increases, the delay of the proposed two-timescale algorithm decreases significantly. This is because a larger Rician factor means a more deterministic channel. Hence the proposed stochastic optimization algorithm will perform better. We also observe that the latency of the single-timescale AO algorithm and the CM algorithm decreases with , which is due to the fact that as the channel approaches rank-1, these two algorithms can find near optimal solutions. However, because of the CSI delay, our proposed two-timescale algorithm still outperforms them. The performance of the single-timescale OMP algorithm degrades severely with because the OMP algorithm assumes that the LOS and NLOS components have the same gain and is not suitable for channels with large Rician factors. In contrast, our developed algorithm does not assume a specific channel model and can be applied to a variety of channels.
VII Conclusion
In this paper, we investigated a mmWave and D2D assisted MEC system, in which user A aims to process computation tasks and share the results with another user B with the aid of BS. We proposed a two-timescale algorithm where the analog beamforming matrices are updated at a long-timescale and the digital beamforming matrices and the offloading ratio are optimized at a short-timescale to reduce the required CSI overhead and minimize the system latency. We developed a SSCA-based algorithm to design the long-term analog beamforming matrices. The short-term digital beamforming matrices have been optimized relying on the concept of the penalty-CCCP for dealing with the mmWave non-linear transmit power constraint, and the offloading ratio has been obtained via the closed-form solution. We carried out the optimality and computational complexity analysis for the long-term and short-term design algorithms, respectively. Simulation results have been provided to verify the effectiveness of our proposed joint design algorithm. Extending the mmWave and D2D assisted MEC system to the multi-user case and more specific computation model is worthy of further investigation.
Appendix A Derivation of the gradients
Based on the rules of matrix computation, we express the derivatives associated with the long-term analog beamforming matrices as follows
(38) |
(39) |
(40) |
(41) |
and the derivatives associated with the analog beamforming matrices are given by
(42) |
(43) |
(44) |
(45) |
where
(46) |
(47) |
and
(48) |
Moreover, we have , , and for the real value objective function. By substituting (42)-(45) into (38)-(41), we finally obtain the derivatives for the long-term analog beamforming matrices.
Appendix B Derivation of updating steps in the inner loop of Algorithm 1
In this appendix, we provide the detailed solutions for updating the block of variables in the proposed penalty-CCCP based short-term digital beamforming algorithm.
Block 1: We update and in parallel with the other variables fixed. The subproblem with regard to is given by
(49) |
which is an unconstrained problem. By applying the first order optimality condition, the optimal solution to is given by
(50) |
The subproblem w.r.t. is given by
(51a) | ||||
s.t. |
This is a convex problem and the optimal solution can be expressed as
(52) |
Block 2: We update and in parallel by fixing the other variables. The subproblem of is given by
(53) |
By checking the first order optimality condition, the optimal can be expressed as
(54) |
where the last equality holds from substituting the optimal value of , i.e. (50) into (29).
The subproblem w.r.t. is given by
(55a) | ||||
s.t. |
It is readily seen that the problem can be divided into parallel subproblems, which yields
(56a) | ||||
s.t. | (56b) |
Since the objective function is piecewise, we need to discuss different situations and make comparison. Defining and whose expressions are given by (57) and (58), respectively,
(57) |
(58) |
where , we can express the optimal solution to as
(59) |
Block 3 We update with the other variables fixed. The subproblem regarding is given by
(60) |
By expanding the last term of (60) and ignoring the constant. We rewrite (60) as
(61) |
Note that the last term of (61) is concave. Hence we can approximate the original problem using the CCCP [54]. Through the first order Taylor expansion, we provide a tight upper bound of (61) as follows
(62) |
where is the current value of variable . By applying the first order optimality condition and setting , we express the solution to as
(63) |
References
- [1] F. Guo, F. R. Yu, H. Zhang, H. Ji, V. C. M. Leung, and X. Li, “An adaptive wireless virtual reality framework in future wireless networks: A distributed learning approach,” IEEE Trans. Veh. Technol., vol. 69, no. 8, pp. 8514-8528, Aug. 2020.
- [2] J. Ren, Y. He, G. Huang, G. Yu, Y. Cai, and Z. Zhang, “An edge computing based architecture for mobile augmented reality,” IEEE Netw., vol. 33, no. 4, pp. 162-169, Aug. 2019.
- [3] J. Zhang, H. Guo, J. Liu, and Y. Zhang, “Task offloading in vehicular edge computing networks: A load-balancing solution,” IEEE Trans. Veh. Technol., vol. 69, no. 2, pp. 2092-2104, Feb. 2020.
- [4] T. Chan, K. Jia, S. Gao, J. Lu, Z. Zeng, and Y. Ma, “PCANet: A simple deep learning baseline for image classification?,” IEEE Trans. Image Process., vol. 24, no. 12, pp. 5017-5032, Dec. 2015.
- [5] S. Abolfazli, Z. Sanaei, E. Ahmed, A. Gani, and R. Buyya, “Cloud-based augmentation for mobile devices: Motivation, taxonomies, and open challenges,” IEEE Commun. Surv. Tuts., vol. 16, no. 1, pp. 337-368, 1st Quart., 2014.
- [6] Y. C. Hu, M. Patel, D. Sabella, N. Sprecher, and V. Young, “Mobile edge computing: A key technology towards 5G,” ETSI White Paper. no. 11, Sept. 2015.
- [7] Y. Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief, “A survey on mobile edge computing: The communication perspective,” IEEE Commun. Surv. Tuts., vol. 19, no. 4, pp. 2322-2358, 4th quart., 2017.
- [8] W. Zhang, Y. Wen, K. Guan, D. Kilper, H. Luo, and D. O. Wu, “Energy-optimal mobile cloud computing under stochastic wireless channel,” IEEE Trans. Wireless Commun., vol. 12, no. 9, pp. 4569-4581, Sept. 2013.
- [9] Y. Mao, J. Zhang, and K. B. Letaief, “Dynamic computation offloading for mobile-edge computing with energy harvesting devices,” IEEE J. Sel. Areas Commun., vol. 34, no. 12, pp. 3590-3605, Dec. 2016.
- [10] C. You, K. Huang, and H. Chae, “Energy efficient mobile cloud computing powered by wireless energy transfer,” IEEE J. Sel. Areas Commun., vol. 34, no. 5, pp. 1757-1771, May 2016.
- [11] T. Q. Dinh, J. Tang, Q. D. La, and T. Q. S. Quek, “Offloading in mobile edge computing: Task allocation and computational frequency scaling,” IEEE Trans. Commun., vol. 65, no. 8, pp. 3571-3584, Aug. 2017.
- [12] C. Wang, C. Liang, F. R. Yu, Q. Chen, and L. Tang, “Computation offloading and resource allocation in wireless cellular networks with mobile edge computing,” IEEE Trans. Wireless Commun., vol. 16, no. 8, pp. 4924-4938, Aug. 2017.
- [13] C. You, K. Huang, H. Chae, and B. Kim, “Energy-efficient resource allocation for mobile-edge computation offloading,” IEEE Trans. Wireless Commun., vol. 16, no. 3, pp. 1397-1411, Mar. 2017.
- [14] J. Ren, G. Yu, Y. Cai, and Y. He, “Latency optimization for resource allocation in mobile-edge computation offloading,” IEEE Trans. Wireless Commun., vol. 17, no. 8, pp. 5506-5519, Aug. 2018.
- [15] X. Cao, F. Wang, J. Xu, R. Zhang, and S. Cui, “Joint computation and communication cooperation for energy-efficient mobile edge computing,” IEEE Int. Things J., vol. 6, no. 3, pp. 4188-4200, Jun. 2019.
- [16] J. Ren, G. Yu, Y. He, and G. Y. Li, “Collaborative cloud and edge computing for latency minimization,” IEEE Trans. Veh. Technol., vol. 68, no. 5, pp. 5031-5044, May 2019.
- [17] J. Zhao, Q. Li, Y. Gong, and K. Zhang, “Computation offloading and resource allocation for cloud assisted mobile edge computing in vehicular networks,” IEEE Trans. Veh. Technol., vol. 68, no. 8, pp. 7944-7956, Aug. 2019.
- [18] X. Cao, B. Yang, H. Zhang, C. Huang, C. Yuen, and Z. Han, “Reconfigurable-intelligent-surface-assisted MAC for wireless networks: Protocol design, analysis, and optimization,” IEEE Int. Things J., vol. 8, no. 18, pp. 14171-14186, Sept. 2021.
- [19] X. Cao, B. Yang, C. Huang, C. Yuen, Y. Zhang, D. Niyato, and Z. Han, “Converged reconfigurable intelligent surface and mobile edge computing for space information networks," IEEE Netw., vol. 35, no. 4, pp. 42-48, Jul./Aug. 2021.
- [20] T. Bai, C. Pan, Y. Deng, M. Elkashlan, A. Nallanathan, and L. Hanzo, “Latency minimization for intelligent reflecting surface aided mobile edge computing,” IEEE J. Sel. Areas Commun., vol. 38, no. 11, pp. 2666-2682, Nov. 2020.
- [21] M. S. Elbamby, C. Perfecto, M. Bennis, and K. Doppler, “Toward low-latency and ultra-reliable virtual reality,” IEEE Netw., vol. 32, no. 2, pp. 78-84, Mar./Apr. 2018.
- [22] M. S. Elbamby, C. Perfecto, M. Bennis, and K. Doppler, “Edge computing meets millimeter-wave enabled VR: Paving the way to cutting the cord,” Proc. WCNC, 2018, pp. 1-6.
- [23] S. A. Busari, K. M. S. Huq, S. Mumtaz, L. Dai, and J. Rodriguez, “Millimeter-wave massive MIMO communication for future wireless systems: A survey,” IEEE Commun. Surv. Tuts., vol. 20, no. 2, pp. 836-869, 2nd quart., 2018.
- [24] A. Bazzi, B. M. Masini, A. Zanella, and I. Thibault, “On the performance of IEEE 802.11p and LTE-V2V for the cooperative awareness of connected vehicles,” IEEE Trans. Veh. Technol., vol. 66, no. 11, pp. 10419-10432, Nov. 2017.
- [25] Y. Li, L. Sun, and W. Wang, “Exploring device-to-device communication for mobile cloud computing,” in Proc. ICC, Sydney, NSW, 2014, pp. 2239-2244.
- [26] W. Hu, and G. Cao, “Quality-aware traffic offloading in wireless networks,” IEEE Trans. Mobile Comput., vol. 16, no. 11, pp. 3182-3195, Nov. 2017.
- [27] Y. Tao, C. You, P. Zhang, and K. Huang, “Stochastic control of computation offloading to a helper with a dynamically loaded CPU,” IEEE Trans. Wireless Commun., vol. 18, no. 2, pp. 1247-1262, Feb. 2019.
- [28] Y. He, J. Ren, G. Yu, and Y. Cai, “D2D communications meet mobile edge computing for enhanced computation capacity in cellular networks,” IEEE Trans. Wireless Commun., vol. 18, no. 3, pp. 1750-1763, Mar. 2019.
- [29] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE Trans. Wireless Commun., vol. 13, no. 3, pp. 1499-1513, Mar. 2014.
- [30] J. Zhang, M. Haardt, I. Soloveychik, and A. Wiesel, “A channel matching based hybrid analog-digital strategy for massive multi-user MIMO downlink systems,” in Proc. SAM, Jul. 2016, pp. 1-5.
- [31] X. Yu, J. Shen, J. Zhang, and K. B. Letaief, “Alternating minimization algorithms for hybrid precoding in millimeter wave MIMO systems,” IEEE J. Sel. Topics in Signal Process., vol. 10, no. 3, pp. 485-500, Apr. 2016.
- [32] F. Sohrabi, and W. Yu, “Hybrid digital and analog beamforming design for large-scale antenna arrays,” IEEE J. Sel. Topics in Signal Process., vol. 10, no. 3, pp. 501-513, Apr. 2016.
- [33] Q. Shi, and M. Hong, “Spectral efficiency optimization for millimeter wave multiuser MIMO systems,” IEEE J. Sel. Topics in Signal Process., vol. 12, no. 3, pp. 455-468, Jun. 2018.
- [34] Y. Cai, Y. Xu, Q. Shi, B. Champagne, and L. Hanzo, “Robust joint hybrid transceiver design for millimeter wave full-duplex MIMO relay systems,” IEEE Trans. Wireless Commun., vol. 18, no. 2, pp. 1199-1215, Feb. 2019.
- [35] J. Choi, V. Va, N. Gonzalez-Prelcic, R. Daniels, C. R. Bhat, and R. W. Heath, “Millimeter-wave vehicular communication to support massive automotive sensing,” IEEE Commun. Magazine, vol. 54, no. 12, pp. 160-167, Dec. 2016.
- [36] H. Huang, B. Liu, L. Chen, W. Xiang, M. Hu, and Y. Tao, “D2D-assisted VR video pre-caching strategy,” IEEE Access, vol. 6, pp. 61886-61895, 2018.
- [37] A. Asadi, Q. Wang, and V. Mancuso, “A survey on device-to-device communication in cellular networks,” IEEE Commun. Surv. Tuts., vol. 16, no. 4, pp. 1801-1819, 4th quart., 2014.
- [38] J. Ren, Y. Ruan, and G. Yu, “Data transmission in mobile edge networks: Whether and where to compress?,” IEEE Commun. Lett., vol. 23, no. 3, pp. 490-493, Mar. 2019.
- [39] S. C. Cripps, RF Power Amplifier for Wireless Communication, Norwod, MA, USA: Artech House, 2006.
- [40] C. Fager, W. Hallberg, M. Özen, K. Andersson, K. Buisman, and D. Gustafsson, “Design of linear and efficient power amplifiers by generalization of the Doherty theory,” Proc. PAWR, Phoenix, AZ, 2017, pp. 29-32.
- [41] C. Lin and G. Y. Li, “Energy-efficient design of indoor mmWave and sub-THz systems with antenna arrays,” IEEE Trans. Commun., vol. 15, no. 7, pp. 4660-4672, Jul. 2016.
- [42] Y. Cai, K. Xu, A. Liu, M. Zhao, B. Champagne, and L. Hanzo, “Two-timescale hybrid analog-digital beamforming for mmWave full-duplex MIMO multiple-relay aided systems,” IEEE J. Sel. Areas Commun., vol. 38, no. 9, pp. 2086-2103, Sept. 2020.
- [43] Y. Cui, V. K. N. Lau, R. Wang, H. Huang, and S. Zhang, “A survey on delay-aware resource control for wireless systems-large deviation theory, stochastic lyapunov drift, and distributed stochastic learning,” IEEE Trans. Inf. Theory, vol. 58, no. 3, pp. 1677-1701, Mar. 2012.
- [44] K. Kar, S. Sarkar, A. Ghavami, and X. Luo, “Delay guarantees for throughput-optimal wireless link scheduling,” IEEE Trans. Autom. Control, vol. 57, no. 11, pp. 2906-2911, Nov. 2012.
- [45] M. J. Neely, “Delay analysis for maximal scheduling with flow control in wireless networks with bursty traffic,” IEEE/ACM Trans. Netw., vol. 17, no. 4, pp. 1146-1159, Aug. 2009.
- [46] T. L. Marzetta and B. M. Hochwald, “Capacity of a mobile multiple-antenna communication link in Rayleigh flat fading,” IEEE Trans. Inf. Theory, vol. 45, no. 1, pp. 139-157, Jan. 1999.
- [47] A. Liu, V. K. N. Lau, and B. Kananian, “Stochastic successive convex approximation for non-convex constrained stochastic optimization,” IEEE Trans. Signal Process., vol. 67, no. 16, pp. 4189-4203, Aug. 15, 2019.
- [48] Q. Shi, M. Razaviyayn, Z. Luo, and C. He, “An iteratively weighted MMSE approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,” IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4331-4340, Sept. 2011.
- [49] Y. Cai, Q. Shi, B. Champagne, and G. Y. Li, “Joint transceiver design for secure downlink communications over an amplify-and-forward MIMO relay,” IEEE Trans. Commun., vol. 65, no. 9, pp. 3691-3704, Sept. 2017.
- [50] P. Smulders, and L. Correia, “Characterization of propagation in 60 GHz radio channels,” Electron. Commun. Eng. J., vol. 9, no. 2, pp. 73-80, Apr. 1997.
- [51] M. E. Hajj, G. El Zein, G. Zaharia, H. Farhat, and S. Sadek, “Angular measurements and analysis of the indoor propagation channel at 60 GHz,” Proc. WiMob, Barcelona, Spain, 2019, pp. 121-126.
- [52] S. Schwarz et al., “Emerging MPEG standards for point cloud compression,” IEEE J. Emerg. Sel. Topics Circuits Syst., vol. 9, no. 1, pp. 133-148, Mar. 2019.
- [53] A. Liu, and V. K. N. Lau, “Impact of CSI knowledge on the codebook-based hybrid beamforming in massive MIMO,” IEEE Trans. Signal Process., vol. 64, no. 24, pp. 6545-6556, Dec. 15, 2016.
- [54] A. L. Yuille, and A. Rangarajan, “The concave-convex procedure,” Neural Computaion, vol. 15, no. 4, pp. 915-936, Apr. 2003.