\doiinfo

10.1109/OJCOMS.2022.3219557 \correspCORRESPONDING AUTHOR: Abhay Mohan M. V. (e-mail: [email protected]).

Interference-Aware Accurate Signal Recovery in sub-1 GHz UHF Band Reuse-1 Cellular OFDMA Downlinks

ABHAY MOHAN M. V. \authorrefmark1 and K. GIRIDHAR\authorrefmark1 Department of Electrical Engineering, Indian Institute of Technology Madras, Chennai 600 036, India

Abstract

Reuse-1 systems operating in the sub-1 GHz UHF band are limited by substantial co-channel interference (CCI). In such orthogonal frequency division multiple access (OFDMA) cellular systems, the inter-sector or inter-tower interference (ITI) makes accurate signal recovery quite challenging as sub-1 GHz bands only support single-input single-output (SISO) links. Interference-aware receiver algorithms are essential to mitigate the ITI in such low-frequency bands. Such algorithms enable ubiquitous mobile broadband access over the entire homeland, say with $>95\%$ geographical coverage with quality of service guarantees. One element of the interference-aware signal recovery is the least-squares-based joint channel estimation scheme that uses non-orthogonal pilot subcarriers. This estimator is then compared with a variant that uses orthogonal pilot subcarriers to bring out the advantage of this joint estimator. It is shown that the proposed joint estimator requires fewer pilots to be well-determined when compared to its under-determined orthogonal counterpart. Moreover, it is easy to implement and does not require any knowledge of channel statistics. This work also derives a compensation factor needed for the interference-aware detector in the presence of inter-carrier interference (ICI) originating from multiple transmitters. Simulation results show that the proposed joint channel estimator outperforms traditional estimators at moderate to high frequency selectivity. The proposed compensation factor to the joint detector is found to be essential for recovering the transmitted signal in the absence of phase-tracking pilots.

{IEEEkeywords}

Carrier frequency offset compensation, inter-carrier interference, interference-aware receivers, inter-tower interference, joint channel estimation, joint detection, OFDMA, UHF cellular systems.

1 INTRODUCTION

\IEEEPARstart

Broadband cellular networks based on the current 4G-Long Term Evolution (LTE) or the emerging 5G-New Radio (NR) wireless standards use universal frequency reuse or reuse-1 [1, 2]. In such OFDM/OFDMA block-modulated networks, all cell towers and sectors use the same frequency resource¹¹1The word sector is used to refer to the full $360^{\circ}$ region served by a cell site, or to simply one part of that region (typically a $120^{\circ}$ portion, when three sectors are deployed per cell site). In this paper, the term inter-tower interference (ITI) also subsumes the inter-sector interference that could be present at the sector boundaries. to provide a higher sum throughput. Co-channel interference (CCI), which arises in such ultra-dense networks, is a crucial bottleneck that restricts the throughput of the cell-edge user [3]. CCI manifests as inter-cell or inter-sector interference, broadly referred to as inter-tower interference (ITI). This contribution is focussed on a downlink cellular model with ITI. Downlink power control, interference coordination, beamforming, etc. (see, for example, [4, 5, 6, 7, 8]) are used to reduce this ITI and improve cell-edge user throughput.

OFDMA networks deployed in the ultra-high frequency (UHF), particularly the sub-1 GHz bands, can provide excellent geographical coverage and are used for supplementary uplink in 5G [9], [10]. The 3GPP study item [11] has identified typical deployment scenarios for next-generation access technologies. Eleven of the twelve scenarios identified have an option for sub-6 GHz deployment, and three particularly consider sub-1 GHz bands alone. The scenarios include rural deployment with inter-site distances or ISD of 1 to 5 km, extreme long-distance coverage in low density areas with ISD of the order of 100 km, and urban coverage for massive machine-type communications (mMTC). The importance of these bands for rural connectivity is also studied in [12] and [13]. This frequency band has also been examined for suitability for wideband Short-Range Devices (SRDs) in [14],[15] as well as low-power, long-range IoT [16]. The band also finds use in deploying resilient machine-to-machine systems, including smart grids, which are critical to the national infrastructure [17]. The IEEE 802.22 and 802.11af standards, which are expected to play a significant role in bridging the rural connectivity for 6G networks, also operate in this band [18], [19].

However, the sub-1 GHz bands do not support spatial filtering/beamforming because the large wavelengths involved make single input single output (SISO) the only realistic deployment. The receiver will likely pick up significant ITI, especially in the cell-edge region, even after employing interference coordination and power control algorithms. The ITI is especially detrimental if the aim is to provide coverage $>95\%$ . In such cases, cells have to be designed in an overlapping manner. This overlap causes high CCI at the cell boundaries and has to be overcome purely by better signal processing algorithms for (i) carrier recovery, (ii) channel estimation, and (iii) data detection. The motivation of our work is to ensure accurate signal recovery in the presence of strong ITI by addressing each of these three aspects.

1.1 Literature Survey

1.1.1 Channel Estimation Schemes

OFDM channel estimation typically consists of obtaining the initial estimates at the pilot locations and then interpolating these estimates over the data subcarriers. The channel interpolation can be done using various methods such as linear interpolation, transform domain techniques, Wiener filtering, etc. [20]. The initial estimate quality will affect the quality of the interpolated channel.

The initial channel estimates can be obtained using frequency or time-domain least-squares, the linear minimum mean squared error (LMMSE) approach [21], the maximum likelihood (ML) approach [22] or the Bayesian MMSE approach [23]. The ML and time-domain least squares solutions (also called modified least squares or mLS) are found to be equivalent, and the MMSE solution reduces to ML at high signal-to-noise ratios (SNRs) [20, 23]. Thus, the mLS/ML is a minimum variance unbiased estimator that yields the best performance without knowledge about the channel correlation matrix. The mLS typically uses implicit DFT interpolation but can also work with other interpolation methods.

Signal detection in the presence of interference requires estimates of the channels of the interfering signals and that of the desired signal. These channels can be estimated by allotting them non-intersecting sets of frequency orthogonal pilot subcarriers like those described in [24]. However, such designs are not flexible as they would have to reserve pilot sets for the maximum number of BS towers that the user equipment (UE) is likely to detect. The number of such towers is calculated based on inter-site distances and path loss calculations alone. The combination of shadowing, fading, and dynamic interference control techniques can reduce the number of towers that interfere with the UE. A design based on the maximum number of BS towers in a region would be spectrally inefficient as it would require more pilot subcarriers in total.

The pilot-on-pilot allocation scheme multiplexes the pilot positions of the desired signal and interferers [24]. The pilots originating from different towers would then be separated with the help of codes rather than frequency orthogonality. Interfering pilots are either suppressed or cancelled out altogether in such an allocation [25, 26, 27]. The 5G-NR standards [28], for example, define the demodulation reference signal (DM-RS) sequence with such a pilot-on-pilot allocation by using orthogonal cover codes (OCCs). The channel is assumed to be static over the time and frequency resources over which the OCC is applied. This assumption makes DM-RS channel estimation suffer from performance degradation in channels with moderate to high selectivity [29, 30].

The sounding reference signal (SRS) in 5G also uses pilot-on-pilot arrangement [28]. However, the ports use distinct phase-shifted pilot sequences [31] rather than relying on OCC to separate the ports. The phase-shifted pilots result in time-shifted CIR for each port, which can then be separated by windowing the CIR [32]. However, this approach suffers from an error floor caused due to interpolation error that arises from zero padding in the CIR [33].

1.1.2 Management of CCI and ICI

Co-channel interference must either be avoided or suppressed to decode the received signal. The transmitter powers at the different towers can be optimized by solving a system of equations as in [34] to minimize the CCI. The idea is to reduce the interference power while keeping the quality at the desired receiver at an acceptable level. While this method utilizes knowledge of channel states of the neighboring cells, recent works like [4] have proposed strategies that do not require this knowledge. CCI is suppressed in [7] by a combination of interference coordination and beamforming. Beamforming-based techniques use multi-antenna systems and cannot be practically deployed in the sub-1 GHz band. Power control faces limitations in the presence of multiple high-power interferers, as the transmitted power of a tower cannot be reduced without the associated cost of reduced throughput in its home cell.

Thus, either successive interference cancellation (SIC) or joint detection has to be employed to deal with interference at the receiver [35]. When the signal-to-interference ratio (SIR) is very high, the signal can be decoded satisfactorily by ignoring the interference. On the other hand, SIC can be employed when interference power exceeds the signal power by a large margin [36]. A joint detector such as the one described in [35, 37] is the best option when the signal and interference powers are comparable. Such detectors typically utilize a joint loglikelihood ratio (JLLR) metric [38] that accounts for the interferer constellations and the signal constellation in the minimum distance calculation.

The local oscillators of the different towers seen by the receiver induce frequency offsets with varying signs and magnitudes on the receiver. The total frequency synchronization error is comprised of the error in local oscillator frequencies and the relative Doppler of the mobile UE relative to the BS [39]. The term carrier frequency offset (CFO) will be used in our work to denote this combined error. CFO error is the major contributor²²2Inter-Block Interference (IBI) can also contribute to ICI in OFDMA cellular systems. In our work, we assume that there is sufficient cyclic prefix available on the downlink ITI signals from GPS synchronized BS to ensure that the UE determines an IBI-free window. to inter-carrier interference (ICI) in an OFDM or OFDMA system.

ICI cancellation literature focuses on removing or suppressing ICI in the time or frequency domains. Frequency domain approaches are limited and more complex as they mostly require reconstructing the ICI to cancel them [40]. On the other hand, a time-domain algorithm reduces the CFO before it is formed by the FFT operation [41]. Some methods, like [42], combine time domain and frequency domain approaches. Other methods for multi-transmitter CFO compensation involve singular value decomposition [43], lattice filtering [44], iterative design [45], etc. Complete compensation for the CFOs is not possible when multiple transmitters are present, as in the case of a reuse-1 system [45].

The downlink multi-transmitter CFO problem is distinct from CFO correction in the absence of co-channel interferers for uplink OFDMA links [46, 47]. CCI comes into the picture on the uplink in MIMO systems that use spatial multiplexing [48, 40]. Based on the downlink CFO estimate, the uplink compensation schemes generally assume some amount of precompensation by the UE [40]. UE-specific precompensation is not possible in the downlink as the towers simultaneously cater to multiple UEs.

The ICI contribution of multiple CFOs in the downlink is studied extensively in Coordinated Multi-Point (CoMP) literature [49, 50, 51, 52, 53]. A receiver in a CoMP system sees multiple towers from nearby cells that transmit desired information as a form of diversity [49]. The degradation due to multi-transmitter CFO is studied in [50]. The ICI that arises from downlink transmitters is an issue that reuse-1 systems share with CoMP schemes. The ICI compensation proposed in [51] utilizes a time-domain derotation factor that is a nonlinear function of the CFOs. Some corrections are proposed to this derotation factor in [52]. These methods assume that the difference between the minimum and maximum CFOs is less than half of the subcarrier bandwidth. The CFO correction is shifted to the transmitter side in [53] by first de-biasing the various CFOs at the receiver and then feeding back the residual to the respective towers. However, [53] assumes a quasi-static, non-selective channel and a single-carrier system.

Refer to caption — Figure 1: Block diagram describing a system with three ITI signals ( $M$ =4) and the proposed receiver structure.

1.2 Key Contributions

This work describes an algorithm that jointly estimates the desired and interference channels in a spectrally efficient manner. The proposed estimator employs the comb-type pilot-on-pilot design strategy described in [24]. The pilot pattern used at the different towers may be random PSK/QAM sequences. The signal model in this work enables joint estimation by accounting for both interferer and desired signal pilots. The proposed channel estimator is an extension of the mLS estimator [21]. It will be shown that the combination of mLS and pilot-on-pilot configuration makes this estimator spectrally efficient.

The time-domain methods, such as those described in [41, 51], and [52], do not entirely compensate for multi-transmitter CFO even when used in conjunction with frequency domain approaches (as seen in [42]). The residual CFOs cause phase rotations in the signal and interferer constellations. The interference-aware joint detector proposed in this work tracks and compensates for this phase resulting from the residual CFO.

The key contributions of this work are captured below with reference to the block diagram in Fig. 1.

1.

We first propose a joint channel estimation algorithm (labeled (A) in Fig. 1) that can estimate the desired and interference channels in a spectrally efficient manner. This estimator is shown to outperform null-on-pilot and existing pilot-on-pilot-based estimation algorithms [30, 31, 33]. A comparison of the Cramer-Rao lower bound (CRLB) indicates that there is no degradation in using the pilot-on-pilot estimator compared to a conventional null-on-pilot estimator like [21]. This estimator is shown to be well-defined even when the conventional orthogonal estimator may be under-determined. The proposed estimator is also shown to be a minimum variance unbiased estimator (MVUE).
2.

The second contribution is the inclusion of a phase correction factor (labeled (B) in Fig. 1) in the JLLR-based detection framework discussed in [35] and [38] to work in the presence of distinct CFOs. The phase due to the residual CFO will have to be tracked with dedicated pilots in every OFDM symbol in the absence of this factor. Such pilots are unnecessary overhead and reduces spectral efficiency. Since the LLR is calculated considering the ITI structure, this joint detector manages both interference and CFO errors elegantly using only a single receive antenna.

The proposed approaches for channel estimation and signal detection complement each other because the channel estimates carry information on the distortion caused due to the CFOs to the JLLR-based detector. The joint detector has a simple mechanism to track and compensate for the incremental “phase ramp” caused due to the residual CFO error. It is shown that the absence of the modification to the JLLR block can lead to an unacceptably high block error rate in the scenarios considered.

1.3 Related References and Novelty

The proposed estimator can be considered an extension of the mLS estimator [21] for the case when the received signal contains CCI. The work in [54] proposes a similar estimator but requires a specialized training sequence with an entire OFDM symbol reserved for pilots. The proposed estimator requires fewer pilots to estimate the channel and is shown to be MVUE. The present work also identifies the condition for joint estimator superiority over an orthogonal-pilot-based estimator.

Unlike existing pilot-on-pilot-based methods, the proposed method does not enforce orthogonality using OCC [30] or phase-shifted pilots [31, 33]. Simulation results show that the proposed design is not vulnerable to frequency selectivity. The proposed joint channel estimation scheme allows overlapping multipath delays and does not require interference-free initial channel estimates as in [55]. The algorithm proposed in [56] requires the knowledge of the maximum delay spread, whereas our work makes no such assumption. An uplink scenario involving coordinated joint detection of $K$ users by $M$ tower equipment is described in [40]. In the downlink model considered here, the UE only needs to decode the message from one of the towers while modeling the interference structure caused by the other ITI terms.

To the best of our knowledge, this is the first work to propose a joint channel estimation framework requiring fewer pilots than its orthogonal counterpart. Unlike an orthogonal pilot system, the proposed estimator needs to be designed for the number of interferers seen by the UE rather than the worst case. The proposed modification to the JLLR expression compensates for the common phase error from the residual CFO and is required when frequency synchronization errors are present.

This paper is organized as follows. Section II describes the system model under consideration. The key components of the proposed interference-aware receiver are presented in Section III. Section IV presents some numerical simulation results and the results of a link-level simulation. The work is concluded in Section V.

Notations

Bold symbols denote vectors or matrices. Uppercase letters usually indicate the frequency domain, and lowercase letters are for the time domain. A hat on top of a parameter (e.g., $\hat{x}$ ) will represent an estimate of the parameter $x$ . The subscripts $k$ and $m$ are used to denote the $k^{th}$ subcarrier and $m^{th}$ tower, respectively. Finally, the OFDM symbol index shall be denoted by $i$ , and $j$ shall be reserved for the unit imaginary number.

2 System Model

An $N$ subcarrier reuse-1 downlink OFDMA system with a cyclic prefix (CP) of $N_{cp}$ samples is considered. Assume that the system operates in the sub-1 GHz UHF band. The link between the base station/tower and the UE is assumed to be single-input single-output (SISO). Although the UE is near $N_{T}$ base stations/towers, not all of them are visible.

Consider the situation where only $M<N_{T}$ towers are typically visible to the UE due to long-term fading, shadowing, interference management or avoidance schemes, etc. A preamble symbol³³3The estimate of $M$ is obtained from a banded preamble design, where each of the $N_{T}$ towers occupy non-overlapping (orthogonal) subcarriers. As explained in Section IV-4.3, a filter bank can be used at the receiver to detect the presence of interferers. with $N_{T}$ bands can be used to estimate $M$ . The towers are numbered from $0,1,...M-1$ , and the $0^{th}$ tower denotes the one to which the UE is communicating. The other $M-1$ towers transmit co-channel interfering signals since they occupy the same time-frequency resources. With the UE carrier frequency acting as the reference, the CFOs of the $M$ towers are denoted as $\{\Delta f_{0},\Delta f_{1},...\Delta f_{M-1}\}$ and the subcarrier bandwidth normalized CFOs are $\{\epsilon_{0},\epsilon_{1},...\epsilon_{M-1}\}$ respectively. The data is assumed to be coded. The system is described by the block diagram in Fig. 1 for $M=4$ .

A mathematical model for the system can be developed based on the interference-free system model described in [57]. When all UEs have different CFOs, it is observed that the individual constellations of the different transmitters are rotated by an amount corresponding to the respective offset. For the signal belonging to the $m^{th}$ tower, the CFO-induced phase at the $n^{th}$ sample of the $i^{th}$ received OFDM symbol shall be $c_{i,m}(n)=e^{j2\pi\frac{\left(i(N+N_{cp})+n\right)\epsilon_{m}}{N}}$ . Then, the time domain expression for the received OFDM symbol before CP removal is given by

y_{i}(n)=\sum\limits_{m=0}^{M-1}c_{i,m}(n)\ s_{i,m}(n)+w_{i}(n)

(1)

where $s_{i,m}$ denotes the faded OFDM symbol from the $m^{th}$ tower. Let $x_{i,m}$ be the transmit symbol and $h_{i,m}$ be the channel impulse response (CIR) for the $m^{th}$ downlink signal. Then $s_{i,m}$ is the convolution of $x_{i,m}$ and $h_{i,m}$ . The last term $w_{i}(n)\in\mathcal{N}(0,\sigma^{2})$ in (1) is an additive white Gaussian noise (AWGN) sample. With the symbol index $i$ omitted for notational convenience, the frequency domain measurements for the $k^{th}$ subcarrier can be written as follows

Y[k]=\sum\limits_{m=0}^{M-1}C_{m}A_{m}H_{k,m}X_{m}[k]+ICI+W[k]\\ C_{m}:=e^{j\left(2\pi\frac{i(N+N_{cp})+N_{cp}+(N-1)/2}{N}\right)\epsilon_{m}},\\ A_{m}:=\frac{sin(\pi\epsilon_{m})}{Nsin\left(\frac{\pi\epsilon_{m}}{N}\right)}\hfill

(2)

where $C_{m}$ is the CFO-induced phase distortion, $A_{m}$ is the corresponding amplitude distortion, $H_{k,m}$ is the channel frequency response (CFR) and $X_{m}[k]$ is the modulated symbol placed in the $k^{th}$ subcarrier by the $m^{th}$ tower. The term $ICI$ in (2) is the total inter-carrier interference caused by the CFOs. The measurement noise is zero mean and Gaussian with $W[k]\sim\mathcal{N}(0,\sigma^{2})$ . The phase terms independent of the OFDM symbol index $i$ in the above expression can be absorbed into the frequency response, which is now denoted by $H_{k,m}^{\prime}$ , to yield

Y[k]=C_{0}^{\prime}H_{k,0}^{\prime}X_{0}[k]+\sum\limits_{m=1}^{M-1}C_{m}^{\prime}H_{k,m}^{\prime}X_{m}[k]\\ +\sum\limits_{m=0}^{M-1}C_{m}^{\prime}I_{l,k}(m)+W[k]

(3)

where $C_{m}^{\prime}:=e^{j2\pi\frac{\left(i(N+N_{cp})\right)}{N}\epsilon_{m}}$ is the symbol index dependent component of the phase distortion. The signal term ( $m=0$ ) and the co-channel interference terms ( $m>0$ ) are shown separately in this equation. Here, $I_{l,k}(m)$ is the ICI term between subcarriers $l$ and $k$ from the $m^{th}$ tower. This term shall be examined in more detail in Section III-B, where the time-domain compensation scheme for ICI is derived.

Observe that $H_{k,m}^{\prime}$ in the above expression is an attenuated and phase-rotated version of $H_{k,m}$ in (2). $H_{k,m}^{\prime}$ can be estimated for every $P^{th}$ OFDM symbol using pilot-aided channel estimation. Starting from the pilot-containing symbol, $i$ takes index values $0,1,2,...P-1$ , and again wraps back to $0$ when the channel is estimated again. This is because $C_{m}^{\prime}$ also gets absorbed into $\hat{H}_{k,m}^{\prime}$ during estimation. $C_{m}^{\prime}$ produces a “phase ramp” called CFO-induced common phase error for the OFDM symbols following the pilot-laden symbol. This $\epsilon_{m}$ -dependent phase ramping term $C_{m}^{\prime}$ will cause a time-progressive phase shift for the data symbols received after the pilot-laden symbol. This term must be compensated for while modeling the interference-aware detector.

3 Proposed Interference-Aware Receiver

The data bits are coded, modulated, and loaded into OFDM subcarriers before transmission. The signals from the different towers are assumed to arrive within the CP duration of the OFDM symbol. This is reasonable at the cell edge since the strong interferers are almost equally distant from the UE. In other words, assuming IBI-free timing recovery at the UE is possible even with strong ITI. The objective is to recover the transmitted data bits from the frequency-domain received symbol (3). The following subsections look at algorithms proposed for joint channel estimation and frequency offset compensation.

3.1 Joint Channel Estimation

The choice of the mLS algorithm used to construct the proposed joint channel estimator shall now be justified. In the time domain, the multipath components will be limited to a sparse subset of the first $N_{cp}$ samples of the estimated channel impulse response. Even if the multipath components’ exact location is unknown, only $N_{cp}$ samples need to be estimated in the worst case. The samples after $N_{cp}$ can be made zero as the CIR is usually limited to the cyclic prefix length.

The traditional least squares formulation performs poorly in the presence of noise, as the number of estimates and the number of pilots ( $N_{p}$ ) are equal. Suppose $N_{p}$ pilots are used in mLS. Since $N_{cp}$ is usually just a fraction of $N_{p}$ , high-quality channel estimates are obtained as the estimation problem becomes overdetermined. If reduction of pilot overhead is the primary consideration, the number of pilots needed for estimating the channel can be reduced from $N_{p}$ to $N_{cp}$ . The mLS is suitable to estimate interference channels compared to traditional least squares due to this reduction in the pilot requirement. For an $M$ tower system, the number of pilots required reduces from $MN_{p}$ to $MN_{cp}$ . The fourth subsection shows that the proposed joint estimator can further reduce the number of pilots required in interference channels.

To derive the joint estimator, (24) is rewritten in vector form (note that for the symbols with pilots, $i=0$ and $\tilde{C}_{m}^{\prime}=1$ ) as follows:

\mathbf{Y}^{(D)}=\mathbf{X_{0}}\mathbf{F_{N_{cp}}}\mathbf{\tilde{h}_{0}}^{\prime}+\sum\limits_{m=1}^{M-1}\mathbf{X_{m}}\mathbf{F_{N_{cp}}}\mathbf{\tilde{h}_{m}}^{\prime}+\sum\limits_{m=0}^{M-1}\mathbf{\tilde{I}_{m}}+\mathbf{\tilde{W}}

(4)

Here $\mathbf{Y}^{(D)}$ is an $N\times 1$ OFDM received signal vector in the frequency domain, with each entry of the vector corresponding to a particular subcarrier. The superscript ^(D) and variables with tilde $(\ \tilde{}\ )$ indicate that derotation was done in the time domain to reduce the ICI. More details on derotation schemes shall be provided in the next section. $\mathbf{F}_{N_{cp}}$ is the subsampled DFT matrix consisting of columns from 1 to $N_{cp}$ of the full $N$ -point DFT matrix $\mathbf{F}$ . $\mathbf{X_{m}}$ is a diagonal matrix with modulated data or pilots of the $m^{th}$ BS tower as its diagonal elements, and $\mathbf{\tilde{h}_{m}}^{\prime}$ is the $N_{cp}\times 1$ CIR vector corresponding to the $N\times 1$ CFR vector $\mathbf{\tilde{H}_{m}^{\prime}}$ .

The proposed joint estimator allows a comb-type pilot arrangement in the OFDM symbols with pilots. Equation (4) can then be rewritten in matrix form as follows. The corresponding matrix dimensions are also provided.

\displaystyle\begin{split}&\mathbf{Y}^{(D)}_{p}=\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M}+\mathbf{\tilde{I}+\mathbf{\tilde{W}}}\quad\quad\quad\hfill(N_{p}\times 1)\\ &\mathbf{X_{p}}=\left[\mathbf{X_{0}}\ \mathbf{X_{1}}\ ...\ \mathbf{X_{M-1}}\right]\quad\quad\quad\quad\quad\quad\ \hfill(N_{p}\times MN_{p})\\ &\mathbf{F}_{pN_{cp}M}=\begin{bmatrix}\mathbf{F}_{pN_{cp}}\quad\mathbf{0}\quad...\quad\mathbf{0}\\ \mathbf{0}\quad\mathbf{F}_{pN_{cp}}\quad...\quad\mathbf{0}\\ ...\quad...\quad...\quad...\\ \mathbf{0}\quad\mathbf{0}\quad...\quad\mathbf{F}_{pN_{cp}}\\ \end{bmatrix}\quad\quad\ \hfill(MN_{p}\times MN_{cp})\\ &\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}=\begin{bmatrix}\mathbf{\tilde{h}^{\prime}}_{0}\ \mathbf{\tilde{h}^{\prime}}_{1}\ ...\ \mathbf{\tilde{h}^{\prime}}_{M-1}\\ \end{bmatrix}^{T}\quad\quad\quad\quad\quad\ \ \hfill(MN_{cp}\times 1)\end{split}

(5)

The subscript _p denotes that only pilot subcarriers are selected for those matrices or vectors. The rows corresponding to pilot subcarriers are selected from $\mathbf{F}_{N_{cp}}$ to get $\mathbf{F}_{pN_{cp}}$ . The objective now is to solve the following least-squares problem.

\min\limits_{\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}}\|\mathbf{Y}^{(D)}_{p}-\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M}\|^{2}

(6)

This minimization can be solved by differentiating the function with respect to $\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}$ and setting the result equal to zero.

-2(\mathbf{Y}^{(D)}_{p})^{H}\boldsymbol{X_{p}}\mathbf{F}_{pN_{cp}M}+2{(\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})}^{H}\mathbf{F}_{pN_{cp}M}^{H}\boldsymbol{X_{p}}^{H}\boldsymbol{X_{p}}\mathbf{F}_{pN_{cp}M}=0.

(7)

This can be rewritten as

{\mathbf{F}_{pN_{cp}M}}^{H}\boldsymbol{X_{p}}^{H}\boldsymbol{X_{p}}\mathbf{F}_{pN_{cp}M}\ {\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}}=\mathbf{F}_{pN_{cp}M}^{H}\boldsymbol{X_{p}}^{H}\mathbf{Y}^{(D)}_{p}.

(8)

The estimate of $\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}$ can be obtained by multiplying throughout by the regularized inverse of ${\mathbf{F}_{pN_{cp}M}}^{H}\boldsymbol{X_{p}}^{H}\boldsymbol{X_{p}}{\mathbf{F}_{pN_{cp}M}}$ .


	$\displaystyle\mathbf{\hat{h}}_{JmLS}=\mathbf{J}_{M}\mathbf{Y}^{(D)}_{p}$	(9a)
where

	$\displaystyle\mathbf{J}_{M}=\left({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{X_{p}}^{H}\mathbf{X_{p}}\mathbf{F}_{pN_{cp}M}+\alpha\mathbf{I}\right)^{-1}{\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{X_{p}}^{H}$	(9b)

Here, $\alpha>0$ is a Tikhonov regularization factor [58]. When guard subcarriers are present in the OFDM symbol, the reduced DFT matrix loses orthogonality, and the inverse becomes ill-conditioned. The regularization parameter $\alpha$ improves the condition number of this inverse. $\alpha$ can be chosen according to the Hoerl-Kennard-Baldwin formula [59, 60]. As per this formula, the regularization factor would be,

\alpha=\frac{MN_{cp}}{{\mathbf{\hat{h}}_{JmLS}}^{H}\mathbf{\hat{h}}_{JmLS}}\hat{\sigma}^{2}

(10)

Here, $MN_{cp}$ is the number of parameters to be estimated. $\hat{\sigma}^{2}$ is the estimated noise variance. As the regularization factor requires an estimate of the channel impulse response, ${\mathbf{\hat{h}}_{JmLS}}^{H}\mathbf{\hat{h}}_{JmLS}$ can be initialized to 1 the first time JmLS is applied. The previous CIR estimate can be used in the subsequent JmLS estimations. The initialization value of 1 corresponds to the assumption of a single path with no fading or attenuation. The noise variance estimate is also required for the joint detector, and can be estimated by leaving a few subcarriers blank.

$\mathbf{J}_{M}$ shall be called the joint modified least squares (JmLS) estimate for the remainder of this paper. The CFRs can be obtained from the JmLS time domain estimate by multiplying each $N_{cp}\times 1$ subvectors in $\mathbf{\hat{h}}_{JmLS}$ with $\mathbf{F}_{N_{cp}}$ . This is equivalent to taking an $N$ point FFT of each such subvector in $\mathbf{\hat{h}}_{JmLS}$ .

3.1.1 CRLB of the joint CIR estimate

The pilots are assumed to be equally spaced in the frequency domain from band-edge to band-edge to derive the CRLB. The residual ICI term $\mathbf{\tilde{I}}$ is merged with the noise term $\mathbf{\tilde{W}}$ . Under these assumptions, following the steps given in Appendix 6, CRLB of the total variance of an unbiased joint estimator of $\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}$ is given by

CRLB_{tot}(\mathbf{\hat{h}_{J}})=Tr\left(({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{F}_{pN_{cp}M})^{-1}\right)\sigma^{2}

(11)

where $Tr(.)$ denotes the trace of a matrix. If the pilot spacing is assumed to be a power of 2, as given in Appendix 6, then

CRLB_{tot}(\mathbf{\hat{h}_{J}})=\frac{MN_{cp}}{N_{p}^{(J)}}\sigma^{2}

(12)

where the number of joint estimator pilots $N_{p}^{(J)}=N/2^{i}$ , $i$ being a non-negative integer. Equation (12) is only approximate since the inverse will not be a pure identity matrix in the presence of guard subcarriers. Equation (11) can be used in such cases.

3.1.2 JmLS as a Minimum Variance Unbiased Estimator (MVUE)

The JmLS estimator can be shown to be unbiased by simply substituting (5) into (9a) and performing the expectation operation. The MVUE property of an unbiased estimator can be proven if its MSE is shown to be equal to the CRLB. The total MSE of the JmLS estimates is now computed as follows:

MSE_{{}_{\Sigma}}=Tr\left(E\left[(\mathbf{\hat{h}}_{JmLS}-\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})(\mathbf{\hat{h}}_{JmLS}-\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})^{H}\right]\right)

(13)

The following expression is obtained by applying the steps given in Appendix 7. For unit amplitude pilots,

MSE_{{}_{\Sigma}}=Tr\left(\left({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{F}_{pN_{cp}M}+\alpha\mathbf{I}\right)^{-1}\right)\sigma^{2}

(14)

Equation (14) is the same as the CRLB expression (12) except for the regularization factor. When pilots are band-edge to band-edge, and the pilot spacing is assumed as per the CRLB derivation, regularization is unnecessary. For $\alpha=0$ , (14) becomes equal to (11). Thus the proposed JmLS estimator in (9a) achieves CRLB and is an MVUE as well as an efficient estimator.

3.1.3 CRLB of Orthogonal CIR estimation scheme

The CRLB of the joint estimator can now be compared to the mLS algorithm with a null-on-pilot allocation. An orthogonal pilot pattern with $N_{p}^{(O)}$ pilots per BS tower can be compared to the JmLS pilot-on-pilot pattern with a total of $N_{p}^{(J)}$ pilots. This orthogonal scheme can estimate CIR using mLS for each tower. This algorithm shall be called orthogonal modified least squares (OmLS) to distinguish it from the joint estimation scheme. The OmLS estimate is described by


	$\displaystyle\mathbf{\hat{h}}_{OmLS}=\mathbf{O}_{m}\mathbf{Y}^{(D)}_{p,m}$		(15a)
	$\displaystyle\mathbf{O}_{m}=\left(\mathbf{F}_{pN_{cp}}^{H}\mathbf{X_{m}}^{H}\mathbf{X_{m}}\mathbf{F}_{pN_{cp}}+\alpha\mathbf{I}\right)^{-1}\mathbf{F}_{pN_{cp}}^{H}\mathbf{X_{m}}^{H}$		(15b)

$\mathbf{Y}^{(D)}_{p,m}$ is the vector consisting of only those subcarriers in $\mathbf{Y}^{(D)}_{p}$ that correspond to the $m^{th}$ tower as per the null-on-pilot scheme. The CRLB of OmLS is the special case of JmLS with $M=1$ , except for a pilot boosting factor which shall be explained shortly. The CRLB of the total variance of the estimate for the $m^{th}$ user is then

CRLB(\mathbf{\hat{h}}_{orth,m})=Tr\left(({\mathbf{F}_{pN_{cp}}}^{H}E\left[{\mathbf{X_{m}}}^{H}{\mathbf{X_{m}}}\right]{\mathbf{F}_{pN_{cp}}})^{-1}\right)\sigma^{2}

(16)

The number of BS towers $M$ seen by the UE changes with time and location. OmLS can then only be designed for $N_{T}>M$ towers, the maximum number a UE is likely to see at the cell edge. Since $(N_{T}-1)N_{p}^{(O)}$ subcarriers are nulls in OmLS, the power of the $N_{p}^{(O)}$ pilot subcarriers can be boosted by a factor $\beta=N_{T}$ . This means $E\left[{\mathbf{X_{m}}}^{H}{\mathbf{X_{m}}}\right]=\beta\mathbf{I}$ . The expression in (16) can be further simplified in terms of the pilot boosting factor and the number of OmLS pilots.

\displaystyle CRLB(\mathbf{\hat{h}}_{orth,m})

\displaystyle=Tr\left((\beta N_{p}^{(O)}\mathbf{I})^{-1}\right)\sigma^{2}=\dfrac{N_{cp}}{\beta N_{p}^{(O)}}\sigma^{2}

(17)

JmLS and OmLS are allotted the same total number of pilot subcarriers to enable a fair comparison. That is, for $N_{T}$ BS towers, $N_{p}^{(O)}=N_{p}^{(J)}/N_{T}$ . The aggregate CRLB for the CIRs of all the users is obtained by substituting the value of $\beta$ .

\displaystyle\sum_{m}CRLB(\mathbf{\hat{h}}_{orth,m})=\dfrac{MN_{cp}}{\ N_{p}^{(J)}}\sigma^{2}

(18)

This CRLB is identical to the CRLB derived for JmLS in (12) and it shows that there is no degradation by going for a joint estimator instead of using orthogonal pilots. The next subsection shows that JmLS has an advantage over OmLS in terms of the number of pilots needed.

3.1.4 Condition for JmLS superiority over OmLS

The model in (5) can be rewritten as a system of equations.

\mathbf{X_{p}}^{H}\mathbf{Y}^{(D)}_{p}=\mathbf{X_{p}}^{H}\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M}+\mathbf{\tilde{I}+\mathbf{\tilde{W}}}

(19)

This is a system with $MN_{p}^{(J)}$ equations and $MN_{cp}$ unknowns. As $N_{p}^{(J)}=N_{T}N_{p}^{(O)}$ , JmLS would be solvable when $N_{p}^{(O)}N_{T}>N_{cp}$ . When $N_{p}^{(O)}<N_{cp}$ , the OmLS estimator is underdetermined as it would have fewer equations than the number of unknowns. This means that the matrix inverted in (15b) will not be full rank. Thus, JmLS will be able to perform better than OmLS whenever the following set of inequalities hold true.

\displaystyle\begin{split}N_{p}^{(O)}N_{T}/N_{cp}&>1\\ N_{p}^{(O)}/N_{cp}&<1\end{split}

(20)

It follows that JmLS can work with fewer pilots than OmLS when multiple towers are present.

3.1.5 Comparison of Complexity

The estimator matrix $\mathbf{J}_{M}$ in (9a) is an $MN_{cp}\times N_{p}^{(J)}$ matrix. Both $\mathbf{X_{p}}$ and $\mathbf{F}_{pN_{cp}M}$ are required to calculate this matrix. These are computed with the individual pilot matrices $\{\mathbf{X_{m}}\}$ , and the subsampled DFT matrix $\mathbf{F}_{pN_{cp}}$ . Since the transmitted pilot sequence and their locations are known at the receiver, $\mathbf{J}_{M}$ can be precalculated and stored. $\mathbf{J}_{M}$ should be computed for different $M$ values and the various tower combinations that the receiver is likely to see. The appropriate estimator matrix can then be chosen by determining which $M$ out of the $N_{T}$ possible bands in the banded preamble (as explained in footnote 3 and the simulation section) contain signals of significant strength. The OmLS estimator matrices for the $N_{T}$ towers can also be precalculated and stored using similar logic.

The OmLS and JmLS estimations reduce to matrix multiplications by employing the precalculated estimator matrices at the receiver. A well-determined OmLS-based estimator requires $N_{p}^{(O)}=N_{cp}$ subcarriers per tower and $N_{T}N_{cp}$ subcarriers in total. However, JmLS can work with slightly more than $N_{cp}$ pilots. The total number of pilots in OmLS and JmLS are kept equal in this work for a fair comparison of the schemes. This means that JmLS is well-determined and will perform well, while OmLS is underdetermined. For this design, the complexity of JmLS in terms of complex multiplications is $O(MN_{cp}N_{p}^{(J)})$ . The corresponding complexity of OmLS to estimate the CIR for all the channels is $M\times O(N_{cp}N_{p}^{(O)})=O(MN_{cp}N_{p}^{(O)})$ .

In the case described above, $N_{p}^{(J)}=N_{T}N_{p}^{(O)}=N_{cp}$ which implies $N_{p}^{(O)}=N_{cp}/{N_{T}}$ . Although OmLS is less complex than JmLS, the estimates are meaningless as $N_{p}^{(O)}<N_{cp}$ . Thus, JmLS requires more complex multiplications to produce reliable channel estimates. On the other hand, OmLS is less complex but underdetermined. This higher complexity can be justified as the JmLS estimator can work with fewer pilots than OmLS. If the total number of pilots of both estimators are fixed such that the equations are well determined in either case, $N_{p}^{(J)}$ becomes equal to $N_{cp}$ and $N_{p}^{(O)}=N_{cp}$ . In this case, OmLS and JmLS would have the complexity of the order of $O(MN_{cp}^{2})$ . However, OmLS would require $N_{T}$ times more pilots in total.

3.2 Frequency offset compensation

3.2.1 Time-domain ICI Compensation Scheme

Time-domain compensation is a computationally simple first step towards reducing the ICI power at the receiver. The CFOs should be estimated at the receiver as their true values are unknown. This work estimates CFOs using a banded preamble, as explained in the simulation section. These estimates are then used to perform offset correction in the time domain. Some of the choices for the compensation factor are:

1.

Desired signal CFO [41].
2.

A function of the arctan of the CFOs as proposed in CoMP literature [51, 52].
3.

Mean or weighted mean of the CFOs can also serve as a near-optimal derotator as proposed in this work.

In CCI-free systems, ICI is mostly removed with the help of the time-domain scheme described in [41]. In this algorithm, the received time domain signal (1) is multiplied with the conjugate of $c_{i,0}(n)$ to remove the effect of CFO. This is equivalent to “de-rotating” the signal that was rotated by the CFO, and hence this approach is called signal derotation (SD). Some amount of ICI shall remain as an estimate of $\epsilon_{0}$ is used in practice.

Although the optimal time-domain derotation factor for a co-channel interference system has not been explored in the literature, there are several studies made in the related field of downlink CoMP. These studies have been outlined in the literature survey. In [51], the derotator proposed is the arctan of the ratio of the weighted sum of sines and cosines of the normalized CFOs. A later study [52] corrected this expression and found that even for the case of $M=2$ , certain cross terms should be accounted for within the arctan expression. It is shown that for $M=2$ and similar fading conditions, the derotator reduces to the mean of the normalized CFOs. For dissimilar fading conditions, the derotator CFO shall move towards the CFO of the signal with the larger fading factor, i.e., lower path loss. This corrected expression has not been derived for $M>2$ .

In this work, a simple linear function of the CFOs is shown to be a near-optimal derotator by using a slightly different set of approximations than the ones used in [51] and [52]. The proposed derotator is a weighted mean of the normalized CFOs and shall be called the weighted mean derotator (WMD). The formulation of the optimization problem and the steps to arrive at this solution is given in Appendix 8. When the user equipment is near the cell edge, the desired and interferer channels have similar path loss. The weighting factors of the CFOs of the different towers become equal in such scenarios, and the WMD reduces to a simple mean derotator (MD). Their expressions are given by

\displaystyle\epsilon_{wmd}=\frac{\sum\limits_{m=0}^{M-1}P_{m}\epsilon_{m}}{\sum\limits_{m=0}^{M-1}P_{m}}

(21)

and

\displaystyle\epsilon_{md}=\frac{1}{M}\sum\limits_{m=0}^{M-1}\epsilon_{m}

(22)

As mentioned at the start of this subsection, the normalized CFOs used in the MD and WMD expressions are replaced by their estimates obtained using a preamble described in section 4-4.3. The simulation section explains how the weights used in the WMD formulation can also be obtained from this preamble. It will be shown through simulations that there is no loss in terms of block-error rate (BLER) by using the mean derotator over WMD.

ICI compensation can be carried out using either the proposed derotation factor or the arctan-based derotator. For any derotation factor $\epsilon_{d}$ , the time-domain derotation is given by

y_{D,i}(n)=e^{-j2\pi\frac{\left(i(N+N_{cp})+n\right)\epsilon_{d}}{N}}y_{i}(n)

(23)

Here $i$ is the OFDM symbol index. $i$ is 0 for OFDM symbols in which channel estimation takes place (pilot-containing symbols). For symbols that do not contain pilots, $i$ increments by 1 for every symbol until the next pilot-containing symbol is reached. In place of (3), the following new equation is obtained after derotation

Y_{D}[k]=\tilde{C}_{0}^{\prime}\tilde{H}_{k,0}^{\prime}X_{0}[k]+\sum\limits_{m=1}^{M-1}\tilde{C}_{m}^{\prime}\tilde{H}_{k,m}^{\prime}X_{m}[k]\\ +\sum\limits_{m=0}^{M-1}\tilde{C}_{m}^{\prime}\tilde{I}_{l,k}(m)+\tilde{W}[k]

(24)

$Y_{D}$ is the frequency domain received signal that has undergone time-domain derotation and $\tilde{C}_{m}^{\prime}:=e^{j2\pi\frac{i(N+N_{cp})}{N}(\epsilon_{m}-\epsilon_{d})}$ . The tilde in the above terms indicate that $\epsilon_{m}$ has been replaced with $\epsilon_{m}-\epsilon_{d}$ .

3.2.2 Frequency offset compensation for joint detector

A maximum likelihood detector that compensates for the effect of residual CFO phase ramp can be derived using the model described in (24). The joint detector can be formulated while including the effect of residual CFO as

LLR_{0,\lambda,k}=ln\left(\dfrac{P\left(b_{\lambda}\left(X_{0}[k]\right)=1\;\middle|\;Y_{D}[k],\mathbf{\tilde{C}}^{\prime},\mathbf{\tilde{H}}_{k}^{\prime}\right)}{P\left(b_{\lambda}\left(X_{0}[k]\right)=0\;\middle|\;Y_{D}[k],\mathbf{\tilde{C}}^{\prime},\mathbf{\tilde{H}}_{k}^{\prime}\right)}\right).

(25)

where the notation $b_{\lambda}(.)$ denotes the $\lambda^{th}$ bit of the modulated symbol on the $k^{th}$ subcarrier. The probabilities in the numerator and denominator would be Gaussian, assuming that the residual ICI does not much distort the noise distribution. Then, the Bayes theorem can be applied while assuming equal prior probabilities for the constellation points. Following the steps given in Appendix 9, the Max-Log-MAP approximation [61] for the solution is given by

LLR_{0,\lambda,k}\approx\min\limits_{\begin{subarray}{c}X_{0}\in\mathbf{X}_{0}^{\lambda_{1}},\\ X_{m}\in\mathbf{X_{m}}\\ m\neq 0\end{subarray}}\frac{1}{\sigma^{2}}\|Y_{D}[k]-\tilde{C}^{\prime}_{0}\ \tilde{H}_{k,0}X_{0}[k]\\ -\sum\limits_{m=1}^{M-1}\tilde{C}^{\prime}_{m}\ \tilde{H}_{k,m}X_{m}[k]\|^{2}\\ -\min\limits_{\begin{subarray}{c}X_{0}^{\prime}\in\mathbf{X}_{0}^{\lambda_{0}},\\ X_{m}[k]\in\mathbf{X_{m}}\\ m\neq 0\end{subarray}}\frac{1}{\sigma^{2}}\|Y_{D}[k]-\tilde{C}^{\prime}_{0}\ \tilde{H}_{k,0}X_{0}^{\prime}[k]\\ -\sum\limits_{m=1}^{M-1}\tilde{C}^{\prime}_{m}\ \tilde{H}_{k,m}X_{m}[k]\|^{2}

(26)

The solution is a modified version of the joint log-likelihood ratio (JLLR) [38] detector and shall be called offset-corrected JLLR (OC-JLLR). The residual ICI is considered a part of the effective noise in the above approximation. The LLR expression includes the phase ramp term $\tilde{C}_{m}^{\prime}$ that tracks the phase change in the data symbols caused due to CFO. It will be shown that block error rate flooring occurs due to CFO-induced ICI in the absence of this term in the JLLR expression. Equation (26) can be interpreted as the minimum distance receiver for a “super-constellation” derived from the desired and interference signals.

4 Simulation Results and Discussion

4.1 Time-domain ICI Compensation Scheme

The extent of residual ICI after compensation depends on the time-domain ICI compensation technique used. Numerical simulations are performed to compare the average residual ICI power of schemes such as signal derotation (SD) [41], arctan-based derotation (TD) [51, 52], the proposed mean and weighted mean derotation (MD, WMD), and a brute-force search for the optimal derotator. The simulation parameters are described in Table 1.

The amount of CFO experienced by signals from various towers can be quantified as follows. The oscillator at the UE has significantly lower accuracy requirements than the BS [39]. Typical numbers for oscillator accuracy in 5G are 10 parts per million (ppm) of the carrier frequency at the UE [62] and as low as 0.1 ppm for base stations that cover a moderate area [39]. It is then evident that the CFO will have the same sign for all the ITI terms. The CFO would be within $10\pm 0.1$ ppm per the tightest specifications in 5G-NR.

In the first scenario considered, the normalized CFOs are chosen from the interval $[\epsilon_{min},\epsilon_{max}]$ . Here, $\epsilon_{min}=\frac{(\Delta f_{UE}-\Delta f_{BS})f_{c}}{\Delta f_{sc}}$ and $\epsilon_{max}=\frac{(\Delta f_{UE}+\Delta f_{BS})f_{c}}{\Delta f_{sc}}$ . $\Delta f_{sc}$ is the OFDM subcarrier spacing, $\Delta f_{UE}$ is the local oscillator error of the UE in ppm, $\Delta f_{BS}$ is the local oscillator error of the BS in ppm, and $f_{c}$ is the carrier frequency. As $\Delta f_{UE}$ is at least an order lower than $\Delta f_{BS}$ (as UE typically has poorer quality equipment), both $\epsilon_{min}$ and $\epsilon_{max}$ are positive. Thus, the normalized CFO varies over a relatively small range. For $\Delta f_{UE}=10$ ppm and $\Delta f_{BS}=1$ ppm, the normalized CFO ranges between $0.3$ and $0.36$ . Thus, the CFOs will have the same sign. Time domain compensation is most effective in this scenario since the common frequency offset can be removed via derotation. The average residual ICI corresponding to CFO ranging from $10\pm 0.1$ ppm to $10\pm 1$ ppm are plotted in Fig. 2. The case when derotation is absent is also plotted.

Table 1: Numerical Simulation Parameters

Parameter	Value
Carrier Frequency	500 MHz
Max. no. of Towers ( $N_{T}$ )	8
No. of Towers ( $M$ )	4
Relative received powers	[0 3 -3 0] dB
CFO range	10 $\pm$ 0.1 ppm to
	10 $\pm$ 1 ppm
Subcarrier Bandwidth	15 kHz
FFT Size	2048
Channel Model	TDL-B
RMS Delay Spread	300 ns
CP length ( $N_{cp}$ )	72 samples

It can be seen that the residual ICI is very high ( $5$ dB) in the absence of derotation. Among the derotation techniques, signal derotation performs worst. WMD is very close to the optimum derotation found by brute force search. It outperforms the arctan-based derotator proposed in [51] by approximately 0.5 dB. The mean derotator and the arctan-based derotator have very similar performances.

The carrier frequency offsets are randomly drawn from $[-\epsilon_{max},\epsilon_{max}]$ when no assumptions are made regarding the transmitter (base station) and receiver (UE) oscillator accuracy. In this case, the average residual ICI power is plotted in Fig. 3 for various values of $\epsilon_{max}$ . Since the CFOs are randomly distributed around 0, the mean value will also be close to zero. This means that the extent of offset compensation via derotation is limited. The level of residual ICI is seen to be higher in this scenario. It is seen that the case without any ICI compensation (no derotation) performs better than the signal derotation technique. This is because the mean value of the CFOs is near zero and ‘no derotation’ is the same as derotating with zero. Depending on the CFO of the desired signal, SD could perform the derotation with the minimum or maximum value among the CFOs. This kind of derotation can further amplify the ICI rather than reduce it. Hence it is clear that in such scenarios, it is better not to perform derotation than derotating with the desired signal CFO.

In Fig. 3, WMD is seen to be near-optimal even for a random distribution of CFO. The arctan-based derotator (TD) [51] performs well for $\epsilon_{max}<0.3$ but degrades for larger $\epsilon_{max}$ . This degradation is because the assumptions made during the derivation of this technique require that $\epsilon_{max}-\epsilon_{min}\leq 0.5\implies\epsilon_{max}\leq 0.25$ . The WMD uses a different set of assumptions and approximations (as given in Appendix 8) and is seen to perform close to the optimal derotator in practice.

4.2 JmLS Channel Estimation Performance

The MSE performance of the JmLS channel estimator is studied in this section. First, the total MSE of the CIR for all the towers is calculated using link-level simulations. This is plotted along with the theoretical CRLB on the CIR of the joint estimator given by (30). $N_{T}=8$ and $M\leq 4$ is considered for this simulation. The JmLS channel estimator uses $N_{p}^{(J)}=512$ band-edge to band-edge pilots. The corresponding OmLS design uses $N_{p}^{(O)}=64$ pilots per tower as the system has to be designed for eight towers. The OmLS estimator is underdetermined since $N_{p}^{(O)}<N_{cp}$ but the JmLS estimator is well conditioned. Even though the CRLB of the OmLS estimator is not defined as the equations are underdetermined, the MSE can still be found. The results in Fig. 4 show that the JmLS estimator MSE coincides with the CRLB for the joint estimator and decreases linearly with increasing SNR. On the other hand, the MSE of the OmLS estimator does not improve with SNR because the number of pilots are insufficient.

The following results are based on a calibrated link-level simulation of an OFDMA framework with parameters shown in Table 2. Some parameter values have changed from Table 1. The UE now occupies only a fraction of the total usable subcarriers.

Estimators are now compared in terms of the average per-subcarrier MSE of the CFR. JmLS is compared with the OmLS estimator and two existing pilot-on-pilot estimation algorithms. These are the robust MMSE algorithm [63] which can be used to refine and interpolate channel estimates obtained using 5G DM-RS, and the windowed DFT-based algorithm [32, 31, 33] used to extract channel information from phase-shifted pilots employed in 5G SRS.

The literature survey explains that the DM-RS uses pilot-on-pilot allocation by spreading pilot subcarriers across time and frequency. OCCs are used to spread the pilots and multiplex DM-RS ports in the same set of pilot subcarrier resources. These ports may correspond to the channels of the spatially multiplexed layers of the same link or the channels of different links. In the reuse-1 scenario, the desired and interferer channels are assigned to distinct DM-RS ports. The first step in the receiver is to perform pilot ‘despreading’ and least-squares estimation as described in [30]. These estimates are subsequently refined and interpolated using robust MMSE or Weiner filtering as described in [63]. The DMRS-MMSE estimator is assumed to know the true value of delay spread and SNR for calculating the Weiner filter weights.

A pilot sequence based on double symbol type-II DM-RS [28] is used as one of the benchmarks to compare the performance of the proposed JmLS scheme. One of the three CDM groups is reserved for pilots to accommodate $M=4$ towers. If one DM-RS additional position⁴⁴4The terms such as ‘type-II’, ‘CDM group’, ‘additional position’, etc. are specific to 5G. More details can be found in [28] and [30]. is considered, this amounts to four pilot-containing symbols per slot. A 20 PRB allocation would have $20\text{ PRB }\times 12\times\frac{1}{3}\text{ pilot subcarriers per PRB}=80$ pilot subcarriers per pilot-containing symbol. The pilot overhead is the same as that for JmLS in this configuration of DM-RS.

The second pilot-on-pilot scheme in the literature is used in 5G SRS channel estimation. Phase-shifted sequences are used to separate channel impulse responses in time so that they can be windowed out in the receiver [33]. This approach causes interpolation errors due to zero padding in practical non-sample-spaced channels. In addition to OmLS and DMRS-MMSE, phase-shifted pilots coupled with windowed DFT-based channel estimation shall also be compared with the JmLS scheme. This approach shall be labeled as SRS-DFT, although it is not specific to SRS. Once again, the total number of pilots shall be the same as JmLS.

Table 2: OFDMA Link-Level Simulation Parameters

Parameter	Value
Time of Flights (for $M=4$ )	[0 150 300 450] ns
Used Subcarriers	240 (20 PRBs)
No. of JmLS Pilots $(N_{p}^{(J)})$	80
No. of OmLS Pilots $(N_{p}^{(O)})$	10

The result in Fig. 5 compares the performance of JmLS with that of OmLS, DMRS-MMSE, and SRS-DFT. For a 30 ns delay spread, DMRS-MMSE is marginally superior to the proposed JmLS estimator. However, at 300 ns delay spread, it is clear that DMRS-MMSE floors with a high MSE while the JmLS performance does not change. That is, the DMRS-MMSE is not suitable for moderate to highly frequency selective channels. The DM-RS scheme cannot track the frequency selectivity well because it will effectively yield only $2$ estimates per PRB. JmLS, with $80$ pilots in $240$ subcarriers, gives $4$ estimates per PRB. JmLS has twice the number of estimates and hence does not floor at high delay spreads.

The OmLS MSE floors at high SNRs because the number of pilots per tower is the total number of pilots divided by $N_{T}$ , which is 8 in this case. Thus, the number of pilots per tower becomes insufficient to estimate the channel with good quality when the allocation is orthogonal. JmLS MSE does not floor because it depends on the visible number of towers rather than the maximum number of towers, i.e., $M$ rather than $N_{T}$ . OmLS floors at a lower MSE than DMRS-MMSE as mLS-based schemes that estimate the channel in time-domain have a natural advantage due to the rejection of the noise samples beyond the cyclic prefix length.

The SRS-DFT scheme also floors as the TDL-B channel model is non-sample-spaced. The MSE performance of the SRS-DFT scheme shown in Fig. 5 is similar to that found in [33]. It is seen that JmLS is the only one among the four schemes whose performance does not floor, irrespective of the delay spread used. It is also spectrally efficient as it provides the best estimation quality with the least number of pilots.

4.3 Link-Level Simulation Performance

The proposed receiver is now evaluated in terms of coded BLER performance. LDPC codes of rate 1/2 are used for this simulation. The signal modulation from each tower is assumed to be known at the receiver. When this information is unknown, modulation order classification techniques such as those described in [64] and [65] could be used. The noise variance, $\sigma^{2}$ , can be estimated either from a few subcarriers left purposefully blank or using guard subcarriers if the adjacent channel interference is low. The first approach is used in the simulations presented in this paper.

The link-level simulator follows the frame structure described in 5G standards [28] with 15 kHz subcarrier spacing. Every frame has ten slots, and every slot has 14 OFDM symbols. The simulations use a time-varying channel with Doppler corresponding to 10 kmph for the low-mobility scenario and 100 kmph for the high-mobility scenario. The channel is estimated in the $1^{st}$ , $5^{th}$ , $9^{th}$ and $14^{th}$ symbols in this simulation setup. Linear interpolation is used to provide the CFR estimates in the remaining symbols.

A new realization is generated for each tower-to-UE channel in every frame. The CFO relative to each BS tower is randomly generated for every frame while staying within the limit of $10\pm 0.2$ ppm relative to the carrier frequency. As mentioned in Section III-3.2.1, the tightest constraint on the BS oscillator accuracy is 0.1 ppm in the 5G specs. The current simulations exceed the error expected in such systems. Simulations are run for a minimum of 5000 frames or until twenty block errors are detected at a particular SNR.

The CFOs corresponding to different towers are estimated at the start of each frame with the help of a two-symbol preamble. The two symbols in this preamble are identical. Each preamble symbol consists of $N_{T}$ orthogonal bands in the frequency domain. Each band is a block of subcarriers with guard subcarriers in between. In the frequency domain, subcarriers of each band are loaded with a Zadoff-Chu sequence of appropriate length.

The received time-domain samples corresponding to the two-symbol preamble are first filtered using a filter bank to separate the preamble signals belonging to each tower/band. The power of this filtered signal can be used to detect the presence or absence of an ITI signal, thus providing an estimate of $M$ . This power can also be used to define the weights for performing WMD and as an indicator to select the appropriate estimator matrix $\mathbf{J}_{M}$ .

The timing and frequency synchronization procedure is implemented using the algorithm proposed in [66]. The signal from the band corresponding to the desired tower can be cross-correlated with a time-domain preamble signal to estimate the timing offset of the signal from the desired tower. The first and second preamble symbols can be identified using this timing offset. The times of arrival of significant interferers are also estimated to determine an appropriate inter-block-interference (IBI)-free window. Since the two symbols will be identical except for the CFO and doppler induced phase shifts, the combined phase effect of CFO and doppler can be estimated by measuring the phase difference as suggested in [66]. The proposed phase derotation scheme is now applied, followed by OFDM demodulation.

The preamble sequence is not helpful for channel estimation since it is narrowband. Thus, the next step at the receiver is the channel estimation in the four symbols of a slot, followed by estimate interpolation over the entire slot. The channel estimation is done for the desired signal and the interfering signals. The estimates of the channel, noise variance, and CFOs are passed to the OC-JLLR detector to obtain the LLR values. The BLER is computed after LDPC decoding.

The simulation results for various scenarios are presented in the following subsections.

4.3.1 Low Mobility, $N_{T}=8$ , $M=4$

The desired signal is modulated with 4-QAM. Three ITI signals are also present with powers of 3, -3, and 0 dB relative to the desired signal, each modulated with 4-QAM. This means that the interferer powers are high and comparable to the desired signal power. The ITI signals are assumed to arrive at the time of flights given in Table 2. In pilot-carrying OFDM symbols, a comb-type pilot arrangement interlaced with data is assumed with a pilot subcarrier separation of 3. The number of used subcarriers and the number of pilots per OFDM symbol are given in Table 2. A TDL-B channel, as defined in [67] and scaled with an RMS delay spread of 300 ns (long delay spread channel), is assumed. In this scenario, the UE is assumed to have low mobility with speed restricted to below 10 kmph.

The performance of the OmLS, JmLS, and DMRS-MMSE channel estimation schemes are compared in terms of coded block error rate in Fig. 6. The SRS-DFT approach is not simulated here since its performance is found to be flooring similar to OmLS in Fig. 5. The performance of the joint detector fed with the true channel knowledge is also plotted for reference.

It is seen that at low delay spreads, DMRS-MMSE is superior and is close to the performance for the case of true channel knowledge. This is unsurprising as the robust-MMSE interpolation is known to be near-optimal. The true values of the delay spread and SNR are given to the MMSE interpolator. Because of this, it can achieve near-ideal performance at low delay spreads. However, even at an rms delay spread of $300$ ns, DMRS-MMSE fails as the number of estimates becomes insufficient.

By comparing Fig. 5 and Fig. 6, it is seen that the BLER performance of JmLS improves at high delay spreads even though there is no improvement in the MSE performance. This is because high frequency selectivity is known to be beneficial for the performance of error correction codes [68]. A similar BLER improvement is also seen in the case of true channel knowledge when the delay spread increases.

The ICI compensation scheme proposed in this work comprises both a time-domain derotation scheme and the offset correction in the joint detector. It was found in Fig. 2 that WMD performs close to the optimum time-domain derotator, and the MD performs close to the arctan derotator proposed in [51]. The SD has the worst performance among the time-domain derotators. The simulations results that follow shall only contain the results for SD, MD, and WMD schemes to avoid confusion. Further, only the OmLS and JmLS channel estimation schemes shall be included in future comparisons.

It is seen in Fig. 7 that since the present design satisfies (20), the OmLS estimator has an unacceptable performance resulting in a BLER close to unity. This simulation shows that JmLS can perform well in scenarios where OmLS fails. The decoder fails in the absence of offset correction term as suggested by the proposed OC-JLLR in (26). All blocks will now be in error, irrespective of SNR. This is shown by the result indicated as “pure JLLR” in Fig. 7. As a benchmark for comparison, the BLER when CFO is absent and when perfect channel knowledge is available at the receiver is also plotted.

The simulation results show that WMD and MD do not differ much in terms of BLER. The weights for WMD are almost equal in the cell-edge scenario, as the path losses from the different towers are comparable. It is seen that there is a difference of just 0.5 dB in terms of ICI cancellation for the two schemes. However, both perform better than the signal derotation (SD) approach used in interference-free systems. The SD approach floors more at high SNRs than WMD or MD.

4.3.2 Low Mobility, $N_{T}=8$ , $M=3$ , 16-QAM

In this scenario, the desired signal is modulated with 16-QAM. Two ITI signals are also present with powers of 3 and -3 dB relative to the desired signal, each modulated with 4-QAM. The remaining assumptions in the previous section also hold here.

The inferences drawn from this result (shown in Fig. 8) are similar to the $M=4$ case. It is seen that 16-QAM with two 4-QAM interferers perform slightly better than 4-QAM with three 4-QAM interferers even though the super-constellation order is the same. One reason for this behavior is that 16-QAM has better minimum-distance properties than the irregular super-constellation formed by adding two faded 4-QAM signals. In addition, the detector for $M=3$ requires only 3 sets of channel estimates and hence has less estimation error.

4.3.3 Low Mobility, $N_{T}=8$ , $M=2$ , 64-QAM

The last scenario considered in the low mobility case is when the desired signal is modulated with 64-QAM. A single interfering signal whose power is 3 dB above the desired signal and modulated with 4-QAM is also present. This case is plotted in Fig. 9.

Although super-constellation remains the same, the performance for $M=2$ is superior to the previous two scenarios. This result also shows that a joint detector is feasible even for large constellation sizes like 64-QAM, but the number of interferers it can tolerate will reduce accordingly. For example, simulations show that it is not possible to accommodate two 4-QAM interferers on top of a 64-QAM desired signal for the code rate under consideration. Since high-order constellations are typically seen towards the center of the cell where interference is minimal, the results show that joint detectors can handle the interferences that are likely to be seen in the cell edge.

4.3.4 High Mobility, TDL-E LOS channel

The final scenario uses a TDL-E channel model with a line-of-sight (LOS) Rician path and the usual non-line-of-sight (NLOS) Rayleigh multipaths. The UE is assumed to be mobile with a maximum speed of 100 kmph. The delay spread scaling factor considered for this channel is 100 ns since the TDL-E model is more spread out in terms of multipath. If a 300 ns scaling factor were to be considered as in the previous cases, this channel would have significant inter-block interference.

The results plotted in Fig. 10 consider a 4-QAM signal with 4-QAM interferer(s) for $M=$ 2, 3, and 4. WMD is not plotted in these results since previous simulations have shown no significant difference between WMD and MD in terms of BLER for the given interference profile. The performance of OmLS is similar to the results shown so far and is not plotted here.

The $M=4$ result (denoted by 4T in the figure) shows that the proposed MD is superior to SD even in the presence of a significant Doppler component. Referring back to Fig. 2, it is seen that when the BS oscillator error is 0.2 ppm, the residual ICI power is around -30 dB for SD. This means that the ICI is comparable to noise at an SNR of 30 dB. Hence, SD starts to floor near 30 dB SNR. SD would floor sooner for higher CFOs, as predicted by Fig. 2. The effect of the derotation factor is not as pronounced for this particular code rate when $M<4$ .

The results show that the SNR required to reach a particular BLER increases with increasing $M$ . The first reason for this is an increase in the order of super-constellation. A secondary reason is the accumulation of errors in channel estimation of both the desired and interferer channels. This error accumulates linearly with $M$ .

5 Conclusion

Interference-aware receiver algorithms that aid accurate signal recovery even in strong ITI were proposed in this work. It described a joint channel estimation scheme that uses a non-orthogonal pilot pattern to estimate the desired and interferer channels efficiently. The CRLB of the proposed estimator was derived and compared with its orthogonal counterpart. This joint estimator was shown to reduce the number of pilots by almost a factor of $N_{T}$ . The condition under which the proposed estimator becomes superior to the orthogonal estimator was also derived, and the computational complexity of the estimator was analyzed. A low-complexity time-domain CFO compensation scheme that performs close to the optimal time-domain scheme was proposed. Although it does not have significant performance advantages over the existing arctan-based compensation, the proposed derotator can be easily computed as it is a linear function of the CFOs. Irrespective of the time-domain derotator used, it should be coupled with a CFO correction factor to compensate for the phase shift caused due to the residual CFO. This compensation factor was derived for the JLLR detector in this work.

Simulation results reveal that JmLS works with fewer pilots than OmLS, provided that condition (20) is met. For the same pilot overhead, JmLS is shown to be advantageous at moderate to high delay spreads compared to an MMSE estimator that uses DM-RS pilots. All data blocks are erroneous in the absence of the proposed CFO correction factor for the JLLR. The proposed techniques enable high-quality signal recovery in interference-limited reuse-1 cellular OFDMA systems operating in the sub-1 GHz UHF band. {appendices}

6 CRLB of joint channel estimate

Ignoring the ICI term, the noise vector can be rewritten as

\mathbf{\tilde{W}}\approx\mathbf{Y}^{(D)}_{p}-\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M}

(27)

Since $\mathbf{\tilde{W}}$ is a Gaussian vector, the log likelihood function of the data for the parameter $\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}$ is given by

\displaystyle\begin{split}&ln\ p(\mathbf{Y}^{(D)}_{p};\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})\ =\ -\frac{N}{2}ln\ (2\pi\sigma^{2})-\\ &\frac{(\mathbf{Y}^{(D)}_{p}-\mathbf{X_{p}}\mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M})^{H}(\mathbf{Y}^{(D)}_{p}-\mathbf{X_{p}}\mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M})}{2\sigma^{2}}\end{split}

(28)

The Fisher information matrix [69] is then given by

\mathbf{\mathcal{I}}(\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})=-E\left[\nabla_{\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}}^{2}ln\ p(\mathbf{Y}^{(D)}_{p};\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})\right]\\ =\frac{({\mathbf{F}_{pN_{cp}M}}^{H}E\left[\mathbf{X_{p}}^{H}\mathbf{X_{p}}\right]\mathbf{F}_{pN_{cp}M})}{\sigma^{2}}.

(29)

CRLB of the total variance of an unbiased estimator of $\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}$ is found as the trace of the inverse of the Fisher information matrix (29).

CRLB_{tot}(\mathbf{\hat{h}_{J}})=Tr\left(({\mathbf{F}_{pN_{cp}M}}^{H}E\left[\mathbf{X_{p}}^{H}\mathbf{X_{p}}\right]\mathbf{F}_{pN_{cp}M})^{-1}\right)\sigma^{2}

(30)

Here $\mathbf{\hat{h}_{J}}$ is an unbiased joint estimator of the CIR. Looking at (30) for the case with unit-amplitude pilots such as 4-QAM, $\mathbf{X_{i}}^{H}\mathbf{X_{i}}=\mathbf{I}$ . Although $\mathbf{X_{i}}^{H}\mathbf{X_{j}}\neq\mathbf{0}$ for any two integers $i\neq j,\ E\left[\mathbf{X_{i}}^{H}\mathbf{X_{j}}\right]=\mathbf{0}$ . $\mathbf{X_{i}}^{H}\mathbf{X_{j}}$ is the product of two diagonal matrices with unit-amplitude sequences in the diagonal. This product results in another unit-amplitude sequence. The pilot sequences are assumed to be generated from a constellation with equal probability for all symbols. It then follows that the expected value of such a sequence is zero. Thus, $E[\mathbf{X_{p}}^{H}\mathbf{X_{p}}]=\mathbf{I}$ and (30) can be rewritten as

CRLB_{tot}(\mathbf{\hat{h}_{J}})=Tr\left(({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{F}_{pN_{cp}M})^{-1}\right)\sigma^{2}

(31)

for the case of 4-QAM or any pilot sequence of unit amplitude. The above expression can be further simplified with the help of the following result.

Lemma 1.

For a subsampled DFT matrix $\mathbf{F}_{pN_{cp}}$ of size $N_{p}^{(J)}\times N_{cp}$ , the product $\mathbf{F}_{pN_{cp}}^{H}\mathbf{F}_{pN_{cp}}=N_{p}^{(J)}\mathbf{I}$ , where $N_{p}^{(J)}$ is the number of joint estimator pilots.

Proof.

It follows from the definition of N-point DFT that $\mathbf{F}^{H}\mathbf{F}=N\mathbf{I}$ . Now, recall $\mathbf{F_{N_{cp}}}$ defined previously as a column subsampled version of $\mathbf{F}$ . It is clear that any two columns of $\mathbf{F_{N_{cp}}}$ are orthogonal to each other. Thus, $\mathbf{F_{N_{cp}}}^{H}\mathbf{F_{N_{cp}}}=N\mathbf{I_{N_{cp}}}$ . Now, define

\displaystyle\begin{split}\mathbf{F_{pN_{cp}}}=[F(i,k)],\quad&i=np\text{ and }n=0,1,2...\\ &k=0,1...N_{cp}-1.\end{split}

(32)

Here $p$ is the pilot spacing, i.e., number of subcarriers between consecutive pilots.

F(i,k)=exp\left(\dfrac{-j2\pi ki}{N}\right)=exp\left(\dfrac{-j2\pi kn}{N/p}\right)

(33)

When the pilot spacing is such that the number of pilots, $N/p$ , is an integer, (33) shows that the N-point DFT becomes an $N/p$ point DFT. This happens when $p$ is a power of 2. In such cases, it follows that $\mathbf{F}_{pN_{cp}}^{H}\mathbf{F}_{pN_{cp}}=\frac{N}{p}\mathbf{I}=N_{p}^{(J)}\mathbf{I}$ . ∎

This result is now applicable in the inverse term in (31).

\displaystyle\left({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{F}_{pN_{cp}M}\right)^{-1}

\displaystyle=\frac{1}{N_{p}^{(J)}}\mathbf{I}_{MN_{cp}\times MN_{cp}}

(34)

Then, (31) can be rewritten as

CRLB_{tot}(\mathbf{\hat{h}_{J}})=\frac{MN_{cp}}{N_{p}^{(J)}}\sigma^{2}

(35)

This is a lower bound on the total variance across all channels.

7 MSE of JmLS estimate

The MSE of the joint estimate is given by the expression below.

\displaystyle\begin{split}MSE_{{}_{\Sigma}}&=Tr\left(E\left[(\mathbf{\hat{h}}_{JmLS}-\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})(\mathbf{\hat{h}}_{JmLS}-\mathbf{\tilde{h}^{\prime}}_{N_{cp}M})^{H}\right]\right)\\ &=Tr\left(E\left[A+B+C+D\right]\right)\end{split}

(36)

where,

\displaystyle\begin{split}A=&\mathbf{J}_{M}\left(\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M}+\mathbf{\mathbf{W}}\right)\times\\ &\left(\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\ \mathbf{\tilde{h}^{\prime}}_{N_{cp}M}+\mathbf{\mathbf{W}}\right)^{H}\mathbf{J}_{M}^{H}\\ =&\mathbf{J}_{M}\left(\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}{\mathbf{\tilde{h^{\prime}}}_{N_{cp}M}}^{H}\mathbf{F}_{pN_{cp}M}^{H}\mathbf{X_{p}}^{H}+\right.\\ &\left.\mathbf{F}_{pN_{cp}M}\mathbf{W}^{H}+\mathbf{W}\mathbf{\tilde{h^{\prime}}}_{N_{cp}M}^{H}\mathbf{F}_{pN_{cp}M}^{H}\mathbf{X_{p}}^{H}+\mathbf{W}\mathbf{W}^{H}\right)\mathbf{J}_{M}^{H}\end{split}

(37)

The following equations simplify the expression for the expected value of A.

\displaystyle\begin{split}E[\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}\mathbf{\tilde{h^{\prime}}}_{N_{cp}M}^{H}]&=\sigma_{h}^{2}\mathbf{I}\\ \mathbf{J}_{M}\mathbf{X_{p}}\mathbf{F}_{pN_{cp}M}&=\mathbf{F}_{pN_{cp}M}^{H}\mathbf{X_{p}}^{H}\mathbf{J}_{M}^{H}=\mathbf{I}\\ E[\mathbf{W}]&=\mathbf{0}\\ E[\mathbf{W}\mathbf{W}^{H}]&=\sigma^{2}\mathbf{I}\\ \mathbf{J}_{M}\mathbf{J}_{M}^{H}&=\left({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{X_{p}}^{H}\mathbf{X_{p}}\mathbf{F}_{pN_{cp}M}+\alpha\mathbf{I}\right)^{-1}\end{split}

(38)

These relations can now be used on $A,B,C$ and $D$ .

\displaystyle\begin{split}&E[A]=\sigma_{h}^{2}\mathbf{I}+\sigma^{2}\left({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{X_{p}}^{H}\mathbf{X_{p}}\mathbf{F}_{pN_{cp}M}+\alpha\mathbf{I}\right)^{-1}\\ &B=-\mathbf{J}_{M}\left(\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}+\mathbf{\mathbf{W}}\right)\mathbf{\tilde{h^{\prime}}}_{N_{cp}M}^{H}\\ &E[B]=-\sigma_{h}^{2}\mathbf{I}\hskip 36.98866pt\left(\because\mathbf{J}_{M}\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}=\mathbf{I}\right)\\ &C=-\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}\left(\mathbf{X_{p}}\ \mathbf{F}_{pN_{cp}M}\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}+\mathbf{\mathbf{W}}\right)^{H}\mathbf{J}_{M}^{H}\\ &E[C]=-\sigma_{h}^{2}\mathbf{I}\\ &D=\mathbf{\tilde{h}^{\prime}}_{N_{cp}M}\mathbf{\tilde{h^{\prime}}}_{N_{cp}M}^{H}\\ &E[D]=\sigma_{h}^{2}\mathbf{I}\end{split}

(39)

Thus,

\displaystyle\begin{split}MSE_{{}_{\Sigma}}&=Tr\left(\sigma^{2}\left({\mathbf{F}_{pN_{cp}M}}^{H}\mathbf{X_{p}}^{H}\mathbf{X_{p}}\mathbf{F}_{pN_{cp}M}+\alpha\mathbf{I}\right)^{-1}\right)\\ &=\sigma^{2}\frac{MN_{cp}}{N_{p}^{(J)}}\end{split}

(40)

where the simplified expression (with $\mathbf{X_{p}}^{H}\mathbf{X_{p}}=\mathbf{I}$ ) is valid for unit-amplitude pilot sequences such as 4-QAM. As shown in Lemma 1 of Appendix 6, $\mathbf{F}_{pN_{cp}M}^{H}\mathbf{F}_{pN_{cp}M}=N_{p}^{(J)}\mathbf{I}$ . As $\mathbf{I}$ is an $MN_{cp}\times MN_{cp}$ matrix here, its trace is equal to $MN_{cp}$ .

8 Time-domain derotation factor

Referring to the system model described in (3) and the expression for ICI given in [57], the total ICI at the $k^{th}$ subcarrier is given by

\displaystyle\begin{split}ICI(k)&=\sum\limits_{m=0}^{M-1}C_{m}^{\prime}e^{j2\pi\frac{N_{cp}}{N}\epsilon_{m}}\\ &\times\sum\limits_{l=0,l\neq k}^{N-1}\left(\frac{sin(\pi(l-k+\epsilon_{m}))}{Nsin(\pi\frac{(l-k+\epsilon_{m})}{N})}e^{j\pi(l-k+\epsilon_{m})\frac{N-1}{N}}\right)\\ &\times H_{k,m}X_{m}[k]\end{split}

(41)

The objective of reducing the ICI power has been realized in prior works [51, 52] by employing a series of approximations and upper bounds on the ICI power. This paper uses a similar approach to obtain a different expression by employing different approximations. As a first step, it follows from the inequality of arithmetic and geometric means [70] that $a_{1}^{2}+a_{2}^{2}\geq 2a_{1}a_{2}\ \forall\ a_{1},a_{2}\in\mathbb{C}$ . This means that $(a_{1}+a_{2})^{2}\leq 2(a_{1}^{2}+a_{2}^{2})$ . Repeated application of this property can be used to prove that $\left(\sum\limits_{m=0}^{M-1}a_{m}\right)^{2}\leq M\ \sum\limits_{m=0}^{M-1}a_{m}^{2}$ . This upper bound can be applied to the expression for the ICI power at the $k^{th}$ subcarrier.

\displaystyle|ICI(k)|^{2}=

\displaystyle\left|\sum\limits_{m=0}^{M-1}f_{1}(\epsilon_{m})\right|^{2}\leq M\sum\limits_{m=0}^{M-1}|f_{1}(\epsilon_{m})|^{2}\hskip 85.35826pt

(42)

where

\displaystyle\begin{split}|f_{1}(\epsilon_{m})|^{2}=&\left|\sum\limits_{l=0,l\neq k}^{N-1}\left(\frac{sin(\pi(l-k+\epsilon_{m}))}{Nsin(\frac{\pi(l-k+\epsilon_{m})}{N})}e^{j\pi(l-k+\epsilon_{m})\frac{N-1}{N}}\right)\right|^{2}\\ &\times|H_{k,m}|^{2}|X_{m}[k]|^{2}\text{\hskip 14.22636pt}\end{split}

(43)

because $|ab|^{2}=|a|^{2}|b|^{2}$ and $|C_{m}^{\prime}e^{j2\pi\frac{N_{cp}}{N}\epsilon_{m}}|=1$ as it is a pure phase term. A sinc function can be substituted for the ratio of sines in the above expression. This is possible because the FFT size $N$ is large and $sin(\frac{\theta}{N})\approx\frac{\theta}{N}$ .

\displaystyle\begin{split}|f_{1}(\epsilon_{m})|^{2}=&\left|\sum\limits_{l=0,l\neq k}^{N-1}sinc(l-k+\epsilon_{m})e^{j\pi(l-k+\epsilon_{m})\frac{N-1}{N}}\right|^{2}\\ &\times|H_{k,m}|^{2}|X_{m}[k]|^{2}\\ \leq&\left(\sum\limits_{l=0,l\neq k}^{N-1}\left|sinc(l-k+\epsilon_{m})\right|\right)^{2}|H_{k,m}|^{2}|X_{m}[k]|^{2}\end{split}

(44)

where the absolute value is taken inside the summation by employing the triangle inequality [71]. It is now evident that the value of $\epsilon_{m}$ can be reduced to reduce the ICI. This is because, for $max(|\boldsymbol{\epsilon}|)\leq 0.5$ and $l\neq k$ , the absolute value of sinc reduces with decreasing $\epsilon_{m}$ . On applying a time domain derotation factor $\epsilon_{*}$ , the quantity $\epsilon_{m}$ in the equation gets replaced with $\epsilon_{m}-\epsilon_{*}$ . We search for $\epsilon_{*}$ between $min(\boldsymbol{\epsilon})$ and $max(\boldsymbol{\epsilon})$ as the optimum value is known to lie within this range [52]. Thus, the ICI power at the $k^{th}$ subcarrier is upper bounded by

\displaystyle\begin{split}|ICI(k)|^{2}\leq&\sum\limits_{m=0}^{M-1}\left(\sum\limits_{l=0,l\neq k}^{N-1}\left|sinc(l-k+\epsilon_{m}-\epsilon_{*})\right|\right)^{2}f_{2}(k,m)\\ f_{2}(k,m)=&M|H_{k,m}|^{2}|X_{m}[k]|^{2}\end{split}

(45)

The inequality of arithmetic and geometric means [70] can once again be used to upper bound the above square of a summation with a scaled version of summation of squares.

\displaystyle\begin{split}|ICI(k)|^{2}\leq&\sum\limits_{m=0}^{M-1}\ \sum\limits_{l=0,l\neq k}^{N-1}\left(sinc(l-k+\epsilon_{m}-\epsilon_{*})\right)^{2}f_{3}(k,m)\\ f_{3}(k,m)=&MN|H_{k,m}|^{2}|X_{m}[k]|^{2}\end{split}

(46)

By following the approximation used in [51] and [72] for large $N$ , $\sum\limits_{l=0,l\neq k}^{N-1}\left(sinc(l-k+\epsilon_{m}-\epsilon_{*})\right)^{2}\approx 1-$ $sinc^{2}(\epsilon_{m}-\epsilon_{*}))$ . Since $N$ is the total number of subcarriers in the OFDM system, it is usually of the order of $10^{3}$ and the approximation will hold. The next step is to find the optimum derotator $\epsilon_{*}$ that minimizes the upper bound on ICI power given above. Then, the cost function is

\displaystyle C(\epsilon_{*})=

\displaystyle\sum\limits_{m=0}^{M-1}(1-sinc^{2}(\epsilon_{m}-\epsilon_{*}))f_{3}(k,m)

(47)

Using the first two terms of the Maclaurin expansion for $sinc(x)$ , the approximation $sinc(x)\approx 1-\frac{(\pi x)^{2}}{3!}$ can be made. Then $sinc^{2}(x)\approx 1-2\frac{(\pi x)^{2}}{3!}+(\frac{(\pi x)^{2}}{3!})^{2}$ . Here, $x=\epsilon_{m}-\epsilon_{*}$ . The higher order terms of the Maclaurin expansion will become negligible when the sinc function is squared. Substituting this expansion in (47),

\displaystyle C(\epsilon_{*})=

\displaystyle\sum\limits_{m=0}^{M-1}\left(2\frac{(\pi(\epsilon_{m}-\epsilon_{*}))^{2}}{3!}-\frac{(\pi^{4}(\epsilon_{m}-\epsilon_{*})^{4})}{(3!)^{2}}\right)f_{3}(k,m)

(48)

Differentiating the cost function,

\displaystyle\begin{split}\frac{\partial C(\epsilon_{*})}{\partial\epsilon_{*}}&=\sum\limits_{m=0}^{M-1}\left(\frac{-4\pi(\epsilon_{m}-\epsilon_{*})}{3!}+\frac{4(\pi^{4}(\epsilon_{m}-\epsilon_{*})^{3})}{(3!)^{2}}\right)f_{3}(k,m)\\ &=\sum\limits_{m=0}^{M-1}\left(-((\epsilon_{m}-\epsilon_{*}))+\frac{(\pi^{3}(\epsilon_{m}-\epsilon_{*})^{3})}{3!}\right)f_{3}(k,m)\end{split}

(49)

In a 5GNR-type deployment, the oscillator at the UE has significantly lower accuracy requirements than the BS [39]. Oscillator accuracy at the UE is of the order of 10 parts per million (ppm) of the carrier frequency [62] and as low as 0.1 ppm at the base station side [39]. Thus, the CFO will have the same sign for all the ITI terms. The CFO of the different towers will vary between 9 ppm and 11 ppm if a BS CFO of 1 ppm is considered. For the maximum carrier frequency of 1 GHz (as sub-1 GHz systems are considered in this work) and a subcarrier spacing of 15 kHz, the normalized CFO would be between $0.6$ and $0.73$ . This means that $\epsilon_{m}-\epsilon_{*}$ is of the order of 0.1. The cubed term in (49) will then be two orders of magnitude lower than the linear term and can hence be ignored while finding the minima. Substituting for $f_{3}(k,m)$ and equating the derivative to zero,

\displaystyle\sum\limits_{m=0}^{M-1}\left(\epsilon_{m}-\epsilon_{*}\right)|H_{k,m}|^{2}|X_{m}[k]|^{2}=0

(50)

Since the derotation is done in the time domain, the dependence on the index $k$ needs to be removed. Thus, the terms dependent on $k$ are averaged over all subcarriers.

\displaystyle\begin{split}&\sum\limits_{m=0}^{M-1}\left(\epsilon_{m}-\epsilon_{*}\right)\frac{1}{N}\sum\limits_{k=0}^{N-1}|H_{k,m}|^{2}|X_{m}[k]|^{2}=0\\ \implies&\sum\limits_{m=0}^{M-1}\left(\epsilon_{m}-\epsilon_{*}\right)P_{m}=0\end{split}

(51)

where $P_{m}$ is the power of the $k^{th}$ channel scaled by the mean value of the constellation of the $m^{th}$ tower. This mean value is independent of the index $m$ as it will be common across all the towers. This is because the transmitted power per OFDM symbol is the same for all base stations. Thus,

\displaystyle\epsilon_{*,opt}=\frac{\sum\limits_{m=0}^{M-1}\epsilon_{m}P_{m}}{\sum\limits_{m=0}^{M-1}P_{m}}

(52)

9 Offset Corrected Joint Detector

Bayes Theorem is applied to the joint detector formulation described in (25).

\displaystyle\begin{split}&LLR_{0,\lambda,k}=\hfill\\ &ln\left(\dfrac{P\left(Y_{D}[k]\;\middle|\;b_{\lambda}\left(X_{0}[k]\right)=1,\mathbf{\tilde{C}}^{\prime},\mathbf{\tilde{H}}_{k}^{\prime}\right)P(b_{\lambda}\left(X_{0}[k]\right)=1)}{P\left(Y_{D}[k]\;\middle|\;b_{\lambda}\left(X_{0}[k]\right)=0,\mathbf{\tilde{C}}^{\prime},\mathbf{\tilde{H}}_{k}^{\prime}\right)P(b_{\lambda}\left(X_{0}[k]\right)=0)}\right)\end{split}

(53)

Summation is done over all possibilities for which $b_{\lambda}\left(X_{0}[k]\right)=1$ in the numerator and $b_{\lambda}\left(X_{0}[k]\right)=0$ in the denominator,assuming equal prior probabilities for constellation points. That is, the marginal distribution is found from the joint distribution.

\displaystyle\begin{split}&LLR_{0,\lambda,k}=\\ &ln\left(\dfrac{\sum\limits_{X_{0}\in\boldsymbol{X_{0}^{(1,\lambda)}}}\sum\limits_{\begin{subarray}{c}X_{m}\in\boldsymbol{X_{m}},\\ m\neq 0\end{subarray}}P\left(Y_{D}[k]\;\middle|\;\boldsymbol{C},\boldsymbol{\tilde{H}}_{k}^{\prime},\boldsymbol{X[k]}\right)}{\sum\limits_{X_{0}^{\prime}\in\boldsymbol{X_{0}^{(0,\lambda)}}}\sum\limits_{\begin{subarray}{c}X_{m}\in\boldsymbol{X_{m}}\\ m\neq 0\end{subarray}}P\left(Y_{D}[k]\;\middle|\;\boldsymbol{C},\boldsymbol{\tilde{H}}_{k}^{\prime},\boldsymbol{X[k]}\right)}\right)\end{split}

(54)

$\boldsymbol{X_{0}^{(1,\lambda)}}$ denotes the subset of the constellation of $\boldsymbol{X_{0}}$ for which the $\lambda^{th}$ bit is 1, and $\boldsymbol{X_{0}^{(0,\lambda)}}$ denotes the subset of the constellation of $\boldsymbol{X_{0}}$ for which the $\lambda^{th}$ bit is 0. Assuming that the residual ICI does not distort the Gaussianity of the noise heavily, the JLLR equation becomes:

\displaystyle\begin{split}&LLR_{0,\lambda,k}=\\ &ln\left\{\dfrac{\sum\limits_{\begin{subarray}{c}X_{0}\in\boldsymbol{\boldsymbol{X_{0}^{(1,\lambda)}}}\\ X_{m}\in\boldsymbol{X_{m}}\\ m\neq 0\end{subarray}}exp\left(\begin{array}[]{l}-\frac{1}{\sigma^{2}}\|Y_{D}[k]-\tilde{C}^{\prime}_{0}\tilde{H}_{k,0}X_{0}[k]+\\ \ \ \quad\sum\limits_{\begin{subarray}{c}m\in\mathcal{M}\\ m\neq 0\end{subarray}}\tilde{C}^{\prime}_{m}\tilde{H}_{k,m}X_{m}[k]\|^{2}\end{array}\right)}{\sum\limits_{\begin{subarray}{c}X_{0}^{\prime}\in\boldsymbol{\boldsymbol{X_{0}^{(0,\lambda)}}}\\ X_{m}\in\boldsymbol{X_{m}}\\ m\neq 0\end{subarray}}exp\left(\begin{array}[]{l}-\frac{1}{\sigma^{2}}\|Y_{D}[k]-\tilde{C}^{\prime}_{0}\tilde{H}_{k,0}X_{0}^{\prime}[k]+\\ \ \ \quad\sum\limits_{\begin{subarray}{c}m\in\mathcal{M}\\ m\neq 0\end{subarray}}\tilde{C}^{\prime}_{m}\tilde{H}_{k,m}X_{m}[k]\|^{2}\end{array}\right)}\right\}\end{split}

(55)

The complexity of the detector can be reduced by using the suboptimal Max-Log-MAP approximation proposed in [61],

LLR_{0,\lambda,k}\approx\min\limits_{\begin{subarray}{c}X_{0}\in\mathbf{X}_{0}^{\lambda_{1}},\\ X_{m}\in\mathbf{X_{m}}\\ m\neq 0\end{subarray}}\frac{1}{\sigma^{2}}\|Y_{D}[k]-\tilde{C}^{\prime}_{0}\ \tilde{H}_{k,0}X_{0}[k]\\ -\sum\limits_{m=1}^{M-1}\tilde{C}^{\prime}_{m}\ \tilde{H}_{k,m}X_{m}[k]\|^{2}\\ -\min\limits_{\begin{subarray}{c}X_{0}^{\prime}\in\mathbf{X}_{0}^{\lambda_{0}},\\ X_{m}[k]\in\mathbf{X_{m}}\\ m\neq 0\end{subarray}}\frac{1}{\sigma^{2}}\|Y_{D}[k]-\tilde{C}^{\prime}_{0}\ \tilde{H}_{k,0}X_{0}^{\prime}[k]\\ -\sum\limits_{m=1}^{M-1}\tilde{C}^{\prime}_{m}\ \tilde{H}_{k,m}X_{m}[k]\|^{2}

(56)

This expression shall be called the offset-corrected joint log likelihood ratio (OC-JLLR) expression.

ACKNOWLEDGMENT

The authors would like to thank the anonymous peer reviewers for their valuable feedback.

References

[1] A. S. Ameen, A. Doufexi, and A. R. Nix, “Proposed ITU-R compatible inter-site and inter-sector interference models for LTE-Advanced networks,” IEEE Trans. Veh. Technol., vol. 69, no. 12, pp. 14 304–14 315, 2020.
[2] X. Qi, S. Khattak, A. Zaib, and I. Khan, “Energy efficient resource allocation for 5G heterogeneous networks using genetic algorithm,” IEEE Access, vol. 9, pp. 160 510–160 520, 2021.
[3] J. Cao, T. Peng, Z. Qi, R. Duan, Y. Yuan, and W. Wang, “Interference management in ultradense networks: A user-centric coalition formation game approach,” IEEE Trans. Veh. Technol., vol. 67, no. 6, pp. 5188–5202, 2018.
[4] L. Xiao, H. Zhang, Y. Xiao, X. Wan, S. Liu, L.-C. Wang, and H. V. Poor, “Reinforcement learning-based downlink interference control for ultra-dense small cells,” IEEE Trans. Wireless Commun., vol. 19, no. 1, pp. 423–434, 2020.
[5] Y. Wang, G. Feng, Y. Sun, S. Qin, and Y.-C. Liang, “Decentralized learning based indoor interference mitigation for 5G-and-Beyond systems,” IEEE Trans. Veh. Technol., vol. 69, no. 10, pp. 12 124–12 135, 2020.
[6] O. A. Antwi and A. Acakpovi, “Evaluation of inter-cell interference and BER on a downlink PDSCH of the LTE network,” in 2019 International Conference on Computing, Computational Modelling and Applications (ICCMA), 2019, pp. 96–965.
[7] M. C. Necker, “Towards frequency reuse-1 cellular FDM/TDM systems,” in Proceedings of the 9th ACM international symposium on Modeling analysis and simulation of wireless and mobile systems. ACM, 2006, pp. 338–346.
[8] S. Moon and I. Hwang, “Hybrid beamforming with joint transmission for New Radio-based communication systems,” in 2019 International Conference on Information and Communication Technology Convergence (ICTC), 2019, pp. 1066–1069.
[9] 5G; NR; Overall description - Stage-2, 3GPP TS 38.300 version 16.4.0. Release 14, Jan. 2021.
[10] User Equipment (UE) radio transmission and reception; Part 1: Range 1 Standalone, 3GPP TS 38.101-1 version 16.5.0. Release 16, Nov. 2020.
[11] 5G; Study on Scenarios and Requirements for Next Generation Access Technologies, 3GPP TS 38.913 version 14.2.0. Release 14, May. 2017.
[12] S. Koratagere Anantha Kumar, R. W. Stewart, D. Crawford, and S. Chaudhari, “Techno-economic study of 5G network slicing to improve rural connectivity in India,” IEEE Open Journal of the Communications Society, vol. 2, pp. 2645–2659, 2021.
[13] J. Jeon, G. Lee, A. A. Ibrahim, J. Yuan, G. Xu, J. Cho, E. Onggosanusi, Y. Kim, J. Lee, and J. C. Zhang, “MIMO evolution toward 6G: Modular massive MIMO in low-frequency bands,” IEEE Commun. Mag., vol. 59, no. 11, pp. 52–58, 2021.
[14] Electromagnetic compatibility and Radio spectrum Matters (ERM); System Reference document (SRdoc); Technical characteristics and spectrum requirements of wideband SRDs with advanced spectrum sharing capability for operation in the UHF 870 - 876 MHz and 915 - 921 MHz frequency bands, ETSI TR 103 245 version 1.1.1. Nov. 2014.
[15] ECC Report 189, “Future spectrum demand for short range devices in the UHF frequency bands,” Electronic Communications Committee, Tech. Rep., 2014, [Online]. Available: https://docdb.cept.org/download/723. Accessed on: July 08, 2022.
[16] R. Hussain, M. Abou-Khousa, M. Umar Khan, and M. S. Sharawi, “A compact sub-1 GHz IoT antenna design with wide tuning capabilities,” in 2021 15th European Conference on Antennas and Propagation (EuCAP), 2021, pp. 1–3.
[17] System Reference document (SRdoc); Critical Infrastructure Utility Operations requirements for Smart Grid systems, other radio systems, and future radio spectrum access arrangements below 1,5 GHz, eTSI TR 103 492 version 1.1.1. Jan. 2019.
[18] E. Yaacoub and M.-S. Alouini, “A key 6G challenge and opportunity—Connecting the base of the pyramid: A survey on rural connectivity,” Proc. IEEE, vol. 108, no. 4, pp. 533–582, 2020.
[19] E. Semaan, E. Tejedor, R. K. Kochhar, and S. Magnusson, “Realizing the 6G vision - Why is spectrum fundamental?” June 2022, the Ericsson Blog. [Online]. Available: https://www.ericsson.com/en/blog/2022/6/6g-spectrum-why-its-fundamental. Accessed on: June 27, 2022.
[20] M. K. Ozdemir and H. Arslan, “Channel estimation for wireless OFDM systems,” IEEE Communications Surveys & Tutorials, vol. 9, no. 2, pp. 18–48, 2007.
[21] J.-J. Van De Beek, O. Edfors, M. Sandell, S. K. Wilson, and P. O. Borjesson, “On channel estimation in OFDM systems,” in 1995 IEEE 45th Vehicular Technology Conference. Countdown to the Wireless Twenty-First Century, vol. 2. IEEE, 1995, pp. 815–819.
[22] E. Larsson, G. Liu, J. Li, and G. Giannakis, “Joint symbol timing and channel estimation for OFDM based WLANs,” IEEE Commun. Lett., vol. 5, no. 8, pp. 325–327, 2001.
[23] M. Morelli and U. Mengali, “A comparison of pilot-aided channel estimation methods for OFDM systems,” IEEE Trans. Signal Process., vol. 49, no. 12, pp. 3065–3073, 2001.
[24] P. Karunakaran, P. Suryasarman, V. Ramaswamy, K. Kuchi, J. K. Milleth, and B. Ramamurthi, “On pilot design for interference limited OFDM systems,” in 2011 8th International Symposium on Wireless Communication Systems, 2011, pp. 522–526.
[25] H. R. Abdelkader and A. E. Elmahdy, “Interference mitigation in MIMO systems with imperfect CSI using beamforming based precoding algorithm,” in 16th International Computer Engineering Conference (ICENCO), 2020, pp. 197–202.
[26] S. Pratschner, E. Zöchmann, and M. Rupp, “Low complexity estimation of frequency selective channels for the LTE-A uplink,” IEEE Wireless Commun. Lett., vol. 4, no. 6, pp. 673–676, 2015.
[27] S. Pratschner, S. Schwarz, and M. Rupp, “Single-user and multi-user MIMO channel estimation for LTE-Advanced uplink,” in 2017 IEEE International Conference on Communications (ICC), 2017, pp. 1–6.
[28] 5G; NR; Physical channels and modulation, 3GPP TS 38.211 version 16.2.0. Release 16, Jun. 2020.
[29] D. Kong, X.-G. Xia, P. Liu, and Q. Zhu, “MMSE channel estimation for two-port demodulation reference signals in new radio,” arXiv preprint arXiv:2007.14168, 2020.
[30] R. Preethi, M. V. Abhay Mohan, and K. Giridhar, “Exploiting implicit OVSF structure in DM-RS for improved channel estimation in 5G NR systems,” in 2022 IEEE 95th Vehicular Technology Conference: (VTC2022-Spring), 2022, pp. 1–6.
[31] Y. Li, “Simplified channel estimation for OFDM systems with multiple transmit antennas,” IEEE Trans. Wireless Commun., vol. 1, no. 1, pp. 67–75, 2002.
[32] H. Tran, T.-A. Mai, S. Dang, and H.-A. Ngo, “Large-scale MU-MIMO uplink channel estimation using sounding reference signal,” in 2018 2nd International Conference on Recent Advances in Signal Processing, Telecommunications & Computing (SigTelCom), 2018, pp. 107–110.
[33] G. Auer, A. Dammann, and S. Sand, “Channel estimation for OFDM systems with multiple transmit antennas by exploiting the properties of the discrete fourier transform,” in 14th IEEE Proceedings on Personal, Indoor and Mobile Radio Communications, 2003. PIMRC 2003., vol. 2, 2003, pp. 1954–1958 vol.2.
[34] S. Pietrzyk and G. Janssen, “Subcarrier allocation and power control for QoS provision in the presence of CCI for the downlink of cellular OFDMA systems,” in The 57th IEEE Semiannual Vehicular Technology Conference, 2003. VTC 2003-Spring., vol. 4, 2003, pp. 2221–2225 vol.4.
[35] J. Lee, D. Toumpakaris, and W. Yu, “Interference mitigation via joint detection,” IEEE J. Sel. Areas Commun., vol. 29, no. 6, pp. 1172–1184, 2011.
[36] S. Vanka, S. Srinivasa, Z. Gong, P. Vizi, K. Stamatiou, and M. Haenggi, “Superposition coding strategies: Design and experimental evaluation,” IEEE Trans. Wireless Commun., vol. 11, no. 7, pp. 2628–2639, 2012.
[37] R. Ghaffar and R. Knopp, “Interference suppression strategy for cell-edge users in the downlink,” IEEE Trans. Wireless Commun., vol. 11, no. 1, pp. 154–165, 2012.
[38] Vishnu O. C., V. Kumar, M. Madhusoodanan, and K. Giridhar, “Joint uplink LLR computation, quantization, and combining for load-balanced small cells,” in 2016 IEEE Global Communications Conference (GLOBECOM). IEEE, 2016, pp. 1–6.
[39] J.-C. Lin, “Synchronization requirements for 5G: An overview of standards and specifications for cellular networks,” IEEE Veh. Technol. Mag., vol. 13, no. 3, pp. 91–99, 2018.
[40] V. Kotzsch, J. Holfeld, and G. Fettweis, “Joint detection and CFO compensation in asynchronous multi-user MIMO-OFDM systems,” in VTC Spring 2009-IEEE 69th Vehicular Technology Conference. IEEE, 2009, pp. 1–5.
[41] T.-D. Chiueh and P.-Y. Tsai, OFDM baseband receiver design for wireless communications. John Wiley & Sons, 2008.
[42] W. Han and G. Wang, “Multiuser interference suppression for the uplink of interleaved MIMO OFDMA,” in 2013 International Conference on Communications, Circuits and Systems (ICCCAS), vol. 1, 2013, pp. 29–33.
[43] A. Santra and K. Hari, “A novel subspace based method for compensation of multiple CFOs in uplink MIMO OFDM systems,” IEEE Commun. Lett., vol. 21, no. 9, pp. 1993–1996, 2017.
[44] M. Schlamann, L. Haering, and A. Czylwik, “Lattice filter-based frequency-domain equalizer with multiple frequency offset compensation for MIMO SDMA OFDM systems,” in OFDM 2014; 18th International OFDM Workshop 2014 (InOWo’14), 2014, pp. 1–6.
[45] E.-S. Jeon, J. Seo, J. Yang, and D. Kim, “Iterative joint detection, ICI cancelation and estimation of multiple CFOs and channels for DVB-T2 in MISO transmission mode,” IEEE Trans. Broadcast., vol. 60, no. 1, pp. 29–37, 2014.
[46] M. Morelli, “Timing and frequency synchronization for the uplink of an OFDMA system,” IEEE Trans. Commun., vol. 52, no. 2, pp. 296–306, 2004.
[47] D. Huang and K. Letaief, “An interference-cancellation scheme for carrier frequency offsets correction in OFDMA systems,” IEEE Trans. Commun., vol. 53, no. 7, pp. 1155–1165, 2005.
[48] M. Schellmann and V. Jungnickel, “Effects of multiple users’ CFOs in OFDM-SDMA up-link - an interference model,” in 2006 IEEE International Conference on Communications, vol. 10, 2006, pp. 4642–4647.
[49] P. Marsch and G. P. Fettweis, Coordinated Multi-Point in Mobile Communications: From theory to practice. Cambridge University Press, 2011.
[50] K. Manolakis, L. Thiele, C. Oberli, T. Haustein, and V. Jungnickel, “Impairment modeling for joint transmission CoMP,” in 2011 2nd International Conference on Wireless Communication, Vehicular Technology, Information Theory and Aerospace & Electronic Systems Technology (Wireless VITAE), 2011, pp. 1–5.
[51] K. Deng, Y. Tang, S. Shao, and K. Sun, “Correction of carrier frequency offsets in OFDM-based spatial multiplexing MIMO with distributed transmit antennas,” IEEE Trans. Veh. Technol., vol. 58, no. 4, pp. 2072–2077, 2009.
[52] Y.-S. Yang, W.-C. Huang, C.-P. Li, H.-J. Li, and G. L. Stuber, “A low-complexity transceiver structure with multiple CFOs compensation for OFDM-based coordinated multi-point systems,” IEEE Trans. Commun., vol. 63, no. 7, pp. 2658–2670, 2015.
[53] B. W. Zarikoff and J. K. Cavers, “Coordinated multi-cell systems: Carrier frequency offset estimation and correction,” IEEE J. Sel. Areas Commun., vol. 28, no. 9, pp. 1490–1501, 2010.
[54] V.-D. Nguyen, M. Patzold, F. Maehara, H. Haas, and M.-V. Pham, “Channel estimation and interference cancellation for MIMO-OFDM systems,” IEICE Transactions on Communications, vol. 90, no. 2, pp. 277–290, 2007.
[55] M. Raghavendra, S. Bhashyam, and K. Giridhar, “Interference rejection for parametric channel estimation in reuse-1 cellular OFDM systems,” IEEE Trans. Veh. Technol., vol. 58, no. 8, pp. 4342–4352, 2009.
[56] A. Jeremic, T. A. Thomas, and A. Nehorai, “OFDM channel estimation in the presence of interference,” IEEE Trans. Signal Process., vol. 52, no. 12, pp. 3429–3439, 2004.
[57] Y. S. Cho, J. Kim, W. Y. Yang, and C. G. Kang, MIMO-OFDM wireless communications with MATLAB. John Wiley & Sons, 2010.
[58] N. Galatsanos and A. Katsaggelos, “Cross-validation and other criteria for estimating the regularizing parameter,” in [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing, 1991, pp. 3021–3024 vol.4.
[59] A. E. Hoerl, R. W. Kannard, and K. F. Baldwin, “Ridge regression: some simulations,” Communications in Statistics-Theory and Methods, vol. 4, no. 2, pp. 105–123, 1975.
[60] K. Qian, “On the determination of proper regularization parameter,” B.S. thesis, The University of Stuttgart, 2017.
[61] P. Robertson, P. Hoeher, and E. Villebrun, “Optimal and sub-optimal maximum a posteriori algorithms suitable for turbo decoding,” European Transactions on Telecommunications, vol. 8, no. 2, pp. 119–125, 1997.
[62] T. Yuichi, “Performance requirements for reference clocks for 5G communications,” Seiko Epson Corporation, Tech. Rep., 2020.
[63] P. Hoeher, S. Kaiser, and P. Robertson, “Two-dimensional pilot-symbol-aided channel estimation by Wiener filtering,” in 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, 1997, pp. 1845–1848 vol.3.
[64] A. Gomaa, L. M. Jalloul, M. M. Mansour, K. Gomadam, and D. Tujkovic, “Max-Log-MAP optimal MU-MIMO receiver for joint data detection and interferer modulation classification,” IEEE Commun. Lett., vol. 20, no. 7, pp. 1389–1392, 2016.
[65] S. Desai and K. Giridhar, “Low complexity quasi-MLM modulation classification based optimal overloaded MU-MIMO receiver,” in IEEE 91st Vehicular Technology Conference (VTC2020-Spring), 2020, pp. 1–5.
[66] T. Schmidl and D. Cox, “Robust frequency and timing synchronization for OFDM,” IEEE Trans. Commun., vol. 45, no. 12, pp. 1613–1621, 1997.
[67] 5G; Study on channel model for frequencies from 0.5 to 100 GHz, 3GPP TS 38.901 version 16.1.0. Release 14, Jan. 2020.
[68] H. Gacanin and F. Adachi, “Frequency-Domain Interleaving for OFDM/TDM Using MMSE-FDE,” in 2008 IEEE International Conference on Communications, 2008, pp. 3790–3794.
[69] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice Hall PTR, 1993.
[70] K.-M. Chong, “The arithmetic mean—geometric mean inequality: A new proof,” Mathematics Magazine, vol. 49, no. 2, pp. 87–88, 1976.
[71] W. Rudin et al., Principles of mathematical analysis. McGraw-hill New York, 1976, vol. 3.
[72] T. Pollet, M. Van Bladel, and M. Moeneclaey, “BER sensitivity of OFDM systems to carrier frequency offset and Wiener phase noise,” IEEE Trans. Commun., vol. 43, no. 2/3/4, pp. 191–193, 1995.

{IEEEbiography}

[ [Uncaptioned image] ]ABHAY MOHAN M V received the B.Tech degree in electronics and communication engineering from the Government Engineering College-Palakkad, Kerala, India, in 2014 and the M.Tech degree in Communication Engineering and Signal Processing from the Government Engineering College-Thrissur, Kerala, India, in 2017. He is currently pursuing the Ph.D. degree in electrical engineering from the Indian Institute of Technology Madras, Chennai, India.

{IEEEbiography}

[ [Uncaptioned image] ]K. GIRIDHAR received the B.Sc. degree in applied sciences from the PSG College of Technology, Coimbatore, India, in 1985, the ME degree in electrical communications from the Indian Institute of Science, Bangalore, India, in 1989 and the PhD degree in electrical engineering from the University of California, Santa Barbara, USA in 1993.

He was a member of research staff at CRL, Bharat Electronics, Bangalore, India, between 1989-90 and a research affiliate in electrical engineering at Stanford University, California, USA, between 1993-94. He is currently a Professor at the Indian Institute of Technology Madras (IITM), Chennai, India. His research interests are broadly in the areas of adaptive signal processing and wireless communications systems, with an emphasis on various transceiver algorithms and custom air-interface design for strategic communications. More recently, his research group has focused on non-orthogonal spectrum sharing and distributed radar signal processing.

Dr. Giridhar has been a consultant for many telecom and signal processing industries and research labs in India, and has taken sabbaticals with Gigabit Communications Inc, San Jose in summer of 2000, with Beceem Communications Inc, Santa Clara, between 2004-05, and with Data Patterns (India) Ltd, Chennai between 2018-19. He is a founding Director of QallCover Communications LLP, Chennai, and has been a visiting faculty at Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, and at Stanford University.

Interference-Aware Accurate Signal Recovery in sub-1 GHz UHF Band Reuse-1 Cellular OFDMA Downlinks

Abstract

1 INTRODUCTION

1.1 Literature Survey

1.1.1 Channel Estimation Schemes

1.1.2 Management of CCI and ICI

1.2 Key Contributions

1.3 Related References and Novelty

Notations

2 System Model

3 Proposed Interference-Aware Receiver

3.1 Joint Channel Estimation

3.1.1 CRLB of the joint CIR estimate

3.1.2 JmLS as a Minimum Variance Unbiased Estimator (MVUE)

3.1.3 CRLB of Orthogonal CIR estimation scheme

3.1.4 Condition for JmLS superiority over OmLS

3.1.5 Comparison of Complexity

3.2 Frequency offset compensation

3.2.1 Time-domain ICI Compensation Scheme

3.2.2 Frequency offset compensation for joint detector

4 Simulation Results and Discussion

4.1 Time-domain ICI Compensation Scheme

4.2 JmLS Channel Estimation Performance

4.3 Link-Level Simulation Performance

4.3.1 Low Mobility, NT=8N_{T}=8, M=4M=4

4.3.2 Low Mobility, NT=8N_{T}=8, M=3M=3, 16-QAM

4.3.3 Low Mobility, NT=8N_{T}=8, M=2M=2, 64-QAM

4.3.4 High Mobility, TDL-E LOS channel

5 Conclusion

6 CRLB of joint channel estimate

Lemma 1.

Proof.

7 MSE of JmLS estimate

8 Time-domain derotation factor

9 Offset Corrected Joint Detector

ACKNOWLEDGMENT

References

4.3.1 Low Mobility, $N_{T}=8$ , $M=4$

4.3.2 Low Mobility, $N_{T}=8$ , $M=3$ , 16-QAM

4.3.3 Low Mobility, $N_{T}=8$ , $M=2$ , 64-QAM