Terahertz Wireless Communications with Flexible Index Modulation Aided Pilot Design
Abstract
Terahertz (THz) wireless communication is envisioned as a promising technology, which is capable of providing ultra-high-rate transmission up to Terabit per second. However, some hardware imperfections, which are generally neglected in the existing literature concerning lower data rates and traditional operating frequencies, cannot be overlooked in the THz systems. Hardware imperfections usually consist of phase noise, in-phase/quadrature imbalance, and nonlinearity of power amplifier. Due to the time-variant characteristic of phase noise, frequent pilot insertion is required, leading to decreased spectral efficiency. In this paper, to address this issue, a novel pilot design strategy is proposed based on index modulation (IM), where the positions of pilots are flexibly changed in the data frame, and additional information bits can be conveyed by indices of pilots. Furthermore, a turbo receiving algorithm is developed, which jointly performs the detection of pilot indices and channel estimation in an iterative manner. It is shown that the proposed turbo receiver works well even under the situation where the prior knowledge of channel state information is outdated. Analytical and simulation results validate that the proposed schemes achieve significant enhancement of bit-error rate performance and channel estimation accuracy, whilst attaining higher spectral efficiency in comparison with its classical counterpart.
Index Terms:
Terahertz wireless communications, hardware imperfections, index modulation, pilot design, turbo receiver.I Introduction
With the escalating requirement of wireless data throughput and proliferation of multimedia terminals, exploitation of Terahertz (THz) spectrum has attracted enormous attentions from the global community due to its potential of achieving terabit-per-second (Tbps) data transmission [1, 5, 6, 2, 3, 4]. This motivates the emergence of numerous bandwidth-consuming applications, such as high-definition holographic video meeting, virtual/augmented reality, and wireless backhaul, etc. [5, 6, 7]. Besides, THz beams are extremely narrow in order to overcome the severe signal attenuation, which adds difficulty to eavesdropping in the wiretap scenario, enabling secure data transmission for military applications [5]. Furthermore, THz waves are also considered for joint radar and communication (JRC) systems due to its desirable resolution and highly directional beams [1]. Thanks to the considerable merits above, THz wireless communication has been regarded as a promising candidate for the next-generation networks [8].
However, there still exist tremendous challenges for commercialization of THz wireless communications. Firstly, in comparison with communication systems operating at lower-frequency bands, THz signals suffer from more severe attenuation induced by free-space path loss and molecular absorption effects from atmospheric particles like [9]. To address this issue, high-gain directional antennas (or antenna arrays with beamforming) are often invoked to maintain both the coverage and the quality of THz signals [8, 10]. Aside from the channel peculiarities, another critical point that limits the performance of THz wireless communications is hardware imperfection, which includes local-oscillator (LO) phase noise, in-phase/quadrature (I/Q) imbalance, and nonlinearity of the power amplifier [11, 12]. These impairments are generally omitted in low-rate systems by assuming ideal RF front ends [13]. However, they cannot be overlooked in THz communication systems due to the well-known “THz Gap” posing considerable difficulty to THz hardware fabrication [1], which have a detrimental impact on the performance of THz communication systems [12]111Granting that the hardware impairments can be mitigated with compensation techniques, there still exists residual imperfections that cannot be fully eliminated. Neglecting these residual imperfections could cause substantial performance loss [13, 22].. Hence, researchers have recently started to investigate the hardware imperfection issue regarding THz communication systems. In [12], distortions induced by the hardware imperfections were approximately modelled as circularly symmetric complex Gaussian (CSCG) variables. This model was further introduced to the beamforming-aided multiple-input single-output (MISO) and THz spatial modulation (THz-SM) systems in [14] and [15], respectively, where the deleterious impact of hardware impairments is validated, and the corresponding receiving algorithms were developed to mitigate the resultant performance degradation. Except for imperfections of the mixer, the LO and the amplifier, additional nonlinearity induced by the frequency-multiplier-last architecture for THz wave generation was also considered in [16] and [17], where pre-compensation schemes and modified receiver design were proposed correspondingly.

Aside from the performance loss caused by hardware imperfections, due to the time-variant property of strong phase noise mentioned in [18] and [19], channel-tracking pilots are required to be frequently utilized in order to timely update the equivalent channel fading coefficients in THz communication systems. Conventional channel estimation methods usually insert preamble sequences with fixed positions periodically in the data frame [20, 21], which causes undesirable spectral efficiency loss222Although large bandwidth is usually employed in THz wireless communications, the system spectral efficiency has to be sufficiently high in order to achieve over 100 Gbps data rate. Besides, only slight spectral efficiency loss could result in considerable data rate degradation in THz communication systems with tens of GHz bandwidth.. In this paper, a THz-band single-carrier communication system under strong hardware impairments is investigated. To compensate for the throughput loss caused by frequent channel tracking, whilst attaining high channel estimation accuracy simultaneously, index modulation (IM) is introduced to pilot design, which is capable of achieving superior spectrum and energy efficiency gains for wireless communication systems [23, 24, 25, 26, 27]. More specifically, aside from the data symbol bits, additional information can be implicitly conveyed by the subcarrier activation patterns in orthogonal frequency division multiplexing (OFDM) systems, e.g., the “on/off” status of subcarriers [25], index of used constellation alphabet for each subcarrier [26], and positions of inserted pilots in the frequency domain [27]. Based on the IM philosophy, a novel flexible pilot design strategy is proposed. More specifically, unlike classical training sequences with fixed position, the IM-aided pilots are flexibly inserted into the data frame, and their positions are determined by the conveyed information bits, referred to as index bits. In other words, additional binary bits can be conveyed by the indices of pilots, which cancels out the throughput loss, and even achieves higher spectral efficiency than data transmission without pilot sequences. To detect the indices of pilot symbols at the receiver, referred to as index pattern, distinguishable constellation alphabets are employed for the pilots and data symbols, respectively. Then a turbo receiving algorithm is proposed by updating the index pattern of pilots and the estimate of channel state information (CSI) iteratively, which ensures superior precision of index pattern detection and channel estimation. Furthermore, the extension of the proposed flexible IM-aided pilot design to frequency-selective fading channels considering THz communication systems with ultra-broad bandwidth, is briefly discussed. To validate the feasibility of the proposed turbo receiving algorithm, intuitive performance analysis is carried out from a geometrical point of view, followed by complexity evaluation. Simulation results demonstrate that the proposed flexible IM-aided pilot design and the corresponding turbo receiving algorithm are capable of enhancing both the channel estimation accuracy and bit-error rate (BER) performance at higher spectral efficiency, compared with its conventional counterpart with fixed pilot assignment. Analytical and simulative results also show that the proposed turbo receiving algorithm still works well when the prior knowledge of CSI fed into the iterative receiver is outdated.
The remainder of the paper is organized as follows. Section II presents the transceiver model of the THz single-carrier communication system with hardware imperfections. Then the proposed flexible IM-aided pilots and the turbo receiving algorithm are illustrated in Section III. Afterwards, performance analysis and simulation results are provided in Section IV and V, respectively. Finally, Section VI draws the conclusion.
Notation: and denote the transpose and the conjugate transpose. , and stand for the 2-norm, the floor operator, and the length of a line segment, respectively. Besides, represents the binomial coefficient.


II System Model
The transceiver model of the THz single-carrier communication system is illustrated in Fig. 1. Note that aside from the use of directional antennas, large antenna arrays with beamforming could be adopted to compensate for the severe path loss of THz signals, which are actually equivalent to a single high-gain directional antenna if the same signal is transmitted on each antenna element for diversity gain. Hence, a single-input single-output (SISO) scenario is assumed in this paper for simplicity. At the transmitter, the incoming bits are firstly modulated as -ary data symbols, which are then combined with the pilots, generating an -dimensional signal vector denoted as with average transmit power . Afterwards, for is processed by the mixer and power amplifier to generate THz waves for data transmission, which induces non-negligible phase noise, I/Q imbalance and nonlinearity, leading to severe signal distortions at the transmitter side. For simplicity, the nonlinearity of the power amplifier is assumed to be already compensated [28, 29]. According to the mathematical models for phase noise and I/Q imbalance [18, 20], the distorted signal can be formulated as
(1) |
where we have
(2) |
and denote the amplitude and phase imbalance of the I/Q branches, respectively. Besides, is the phase noise component at the transmitter. For ultra-high-rate THz communication systems, the impacts of phase noise on transmitted signals are considered to be frequency-dependent [11, 30]. On one hand, the lower-frequency components in the power spectral density (PSD) of phase noise contribute to phase errors of the THz signals; On the other hand, the higher-frequency components cause inter-carrier interference (ICI) to the adjacent carriers [10]. Since we focus on a single-carrier THz system in this paper, only the rotational effects of phase noise are taken into consideration as illustrated in (2). In [18], the process of for follows a random-walk model, where the phase noise changes in a symbol-wise manner. However, for THz wireless communications, since ultra-wide bandwidth is usually employed, leading to extremely short symbol period, it is reasonable to assume a block-wise changing model for the phase noise, which keeps constant within each data block, whilst varies over adjacent blocks randomly. More specifically, the data frame is equally separated into -length blocks for , where equals . Then phase noise terms within each data block are assumed to be constant, shown as
(3) |
whilst the phase noise for adjacent data blocks follow the random-walk model, expressed by
(4) |
where is uniformly distributed in , and represents the random variation of phase noise between the -th and -th data blocks, following Gaussian distribution as .
To mitigate the severe signal attenuation induced by path loss and molecular absorption effects, high-gain directional antennas are employed at the THz transceiver, which indicates that multi-path effects are negligible. Therefore, a line-of-sight (LoS) flat-fading channel is assumed [31]333It has been stated in [31] that the coherence bandwidth exceeds 60 GHz for THz communication systems at 300 GHz equipped with directional antennas. Therefore, the frequency-selective fading due to spreading and molecular absorption loss is not mainly considered in this paper., denoted as for . Based on the THz channel characteristics [12], the overall path gain can be expressed as
(5) |
where and denote the spreading loss and molecular absorption loss, and is the beam misalignment fading originated from the narrow beam-width of directional antennas. The expressions of these coefficients are detailed in [12], thus omitted here for brevity.
According to Fig. 1, aside from the additive white Gaussian noise (AWGN) component following , additional phase noise and I/Q imbalance are induced at the receiver side. For simplicity, these imperfections are well approximated as an additive distortion term following , where is the average power of the received signals, and denotes the level of hardware imperfections at the receiver [13, 12]. Then the distorted signal for can be formulated as444In this paper, the approximate distortion model is only applied to characterize the hardware imperfections at the receiver, because it is consider to be less dominant than the counterpart at the transmitter, as explained in [32].
(6) |
By substituting (1) into (6), the expression of can be further derived as
(7) |
where is the equivalent channel fading vector, and denotes the distortion-plus-noise component, following . Finally, is processed by the signal detector and a constellation demapper to demodulate the information bits, where pilot-assisted channel estimation is performed.

III Proposed Schemes
Based on the block-wise random-walk property of phase noise, it is assumed that the equivalent channel fading vector in (7) remains approximately constant within each data block, denoted as for , whilst changes happen across different blocks due to random fluctuations of phase noise as well as possible variations of the physical channel coefficients.
III-A Classical Pilot Assignment
Classical channel estimation methods usually employ a preamble sequence inserted at the beginning of each data block [20, 21], as illustrated in Fig. 2 (a), where denotes the preamble length for each data block. Then the channel estimate of the -th data block can be obtained with least-squared (LS) algorithm as [33]
(8) |
where is the preamble matrix consisting of the pilot symbols and their conjugates, formulated as
(9) |
and is defined as the corresponding received signal vector. Based on the block structure in Fig. 2 (a), the system spectral efficiency can be calculated as
(10) |
It is readily seen from (10) that the classical fixed preamble results in throughput degradation.
III-B Proposed Flexible IM-Aided Pilot Design
Index bits | Indices | Subblocks |
---|---|---|
To address this issue, a flexible pilot design strategy based on IM is proposed for the THz communication system, as illustrated in Fig. 2 (b) and Fig. 3. At the transmitter, the incoming bits for the -th data block is firstly divided into data symbol bits and index bits. On one hand, the binary bits are fed into a constellation mapper of an -ary alphabet , yielding the data symbol vector . On the other hand, pilots are generated using an -ary constellation alphabet for channel estimation in each data block. Note that . Unlike classical preamble assignment, the proposed IM-aided pilots are no longer fixed, whose indices in the data block are determined by the index bits with an index selector. More specifically, as presented in Fig. 2 (b), the -th data block for is further split into subblocks with size of , denoted as for . In each subblock, aside from the data symbols, there are totally pilots drawn from , denoted as , corresponding to the index set . Here is determined by the incoming bits according to the mapping table between the index bits and the pilot positions, satisfying , which can be exemplified by Table I with and . By such arrangement, additional bits are conveyed by the pilot indices in each subblock. Hence, the system spectral efficiency applying the proposed flexible IM-aided pilots becomes
(11) |
which indicates that the proposed pilot design is capable of compensating for the throughput loss induced by pilots, and even attaining higher spectral efficiency than transmission without using pilots, given that .

In order to detect the indices of pilots for channel estimation and index bit recovery, the constellation alphabets employed for data symbols and pilots should be distinguishable, i.e., . As an example, by considering the phase-shift-keying (PSK) format, one feasible constellation design for is illustrated in Fig. 4, where and . Note that denotes the average power ratio between and , and is set as in this example, which however is not guaranteed to be optimal. The impacts of on the system BER performance are discussed as follows.
Firstly, the normalized minimum Euclidean distance between constellations in and , which determines the performance of index pattern detection [24], can be expressed as
(12) |
whose minimum value equals as . Then it can be seen that, the value of should be sufficiently away from to enlarge , thus improving the accuracy of the index pattern detection. Besides, with the increase of , the average transmit power of pilots becomes larger, which enhances the channel estimation accuracy. Note that for our proposed flexible IM-aided pilot design, channel estimation errors not only negatively affect demodulation of data symbols, but also contribute to poor index pattern detection, which leads to increased BER of index bits, and further causes additional errors of data symbol bits due to wrong detections of the index pattern. Hence, needs to be sufficiently large for accurate channel estimation.
Furthermore, in the -th data block, by assuming that its channel estimation error is tolerable to correctly detect the index pattern, the channel estimate using LS criterion can be expressed as
(13) |
where , and denotes the distortion-plus-noise term following . Here is the -by- identity matrix. By assuming that each candidate of is chosen as the pilot symbol equiprobably, the channel estimation error, defined as , approximately follows CSCG distribution as , where the transmitted power is set as without loss of generality. Afterwards, the received signal for data symbol detection can be formulated as
(14) | ||||
where we have the equivalent noise term written as
(15) |
following . Then the received signal-to-noise ratio (SNR) can be approximately written as
(16) |
It can be seen in (16) that, as increases to infinity, the SNR value approaches zero, causing significant BER performance degradation. Hence, an intuitive guideline for the power allocation issue is that, should be set to be sufficiently over to ensure accuracy of channel estimation and index pattern detection, but cannot be too large in case of undesirable SNR degradation. This finding can be well supported by Fig. 11, where the system BER performance versus is simulated. Note that a sophisticated trade-off is required in this optimal constellation design (power allocation) of and for the PSK format as Fig. 4, which may be out of the scope of the main idea in this paper. Hence, the derivative work of the optimal power allocation issue is set aside as our future work for detailed investigation. Instead, numerical methods can be applied to obtain the value of that optimizes the BER performance as illustrated in Fig. 11.
III-C Turbo Receiving Algorithm for Index Pattern Detection and Channel Estimation
As shown in Fig. 3, a turbo receiving algorithm is developed to jointly estimate the index pattern and the channel fading vector in an iterative manner, which can be clearly illustrated with mathematical induction as below.
Initialization: Before data transmission, a preamble of length , is utilized to obtain an initial channel estimate with LS algorithm. Then is employed as the prior knowledge of CSI to detect the index pattern of pilots for the 1-st data block.
Coarse index pattern detection: For the -th data block (), is employed as its prior knowledge of CSI. The received signals in the -th data subblock can be formulated as
(17) |
where
(18) | ||||
Here follows CSCG distribution as . Based on (17), a symbol-wise log-likelihood ratio (LLR) detector can be applied to obtain the positions of pilots [24], where the logarithm ratio between a posteriori probabilities of the event that the symbol is modulated by or is calculated as
(19) |
Since we have and , (19) can be further derived as
(20) |
where the likelihood function is formulated as
(21) |
By substituting (21) into (20), the final expression of the LLR value can be derived as
(22) | ||||
where stands for the LLR value calculated using as the channel estimate, and is the estimate of . After all the LLR values of the -th subblock are obtained using (22), the indices of pilots in each subblock can be coarsely determined by
(23) | ||||
where , and the operator finds the largest LLR values. Then the resultant is defined as the coarse estimate of the pilot index pattern for the -th data block.
Iterative joint index pattern detection and channel estimation: At the -th iteration (), the positions of the flexible IM-aided pilots are firstly determined by , and then they are employed to update the channel estimate, where the resultant CSI is further fed back for updating the pilot index pattern. More specifically, for the -th subblock, the received symbols of the indices are concatenated together, as illustrated in (III-C) (on the top of next page),
(24) |
which is then utilized to estimate the channel fading vector
(25) |
where is an pilot matrix defined as
(26) |
By replacing with in (22), the LLR values of the -th subblock are re-calculated, denoted as for . Then (23) is applied to these latest LLR values, where the pilot indices of the -th subblock are updated as , satisfying . By repeating the operations above on each of the subblocks, the output index pattern of the -th data block in the -th iteration can be obtained, expressed as .
With the iterations going on, when equals , it is implied that the detection of the pilot index pattern has converged. Therefore, the stopping criterion for the iterations is set as . By defining as the final output of the proposed turbo receiving algortihm for the -th data block, the fine channel estimate, denoted as , can be calculated using LS criterion as (25) with all the detected flexible pilots corresponding to . Afterwards, and for are fed into the subsequent stages for signal demodulation. Besides, is also employed as the prior knowledge of CSI for coarse index pattern detection of the -th data block. The aforementioned turbo receiving algorithm can be summarized as Algorithm 1, where the initialization phase is omitted for brevity.
Remark 1: As illustrated in (III-C)-(26), is employed as the extrinsic information in the -th iteration to update the index pattern of the -th subblock, instead of . Such arrangement is to eliminate the positive feedback that hinders the convergence of the proposed receiving algorithm, inspired by the principles of iterative decoding for turbo codes.
Remark 2: For the -th and -th data blocks, when the physical channel coefficients and I/Q imbalance parameters remain approximately unchanged, it is reasonable to employ as the prior CSI knowledge of the -th data block for coarse index pattern detection, since there only exists a relatively small phase noise difference between and . On the other hand, for fast-fading channel, where the physical channel coefficient also varies across adjacent data blocks, is considered to be outdated for the -th block. Under such “worst” case, it is validated that the proposed turbo receiving algorithm is still capable of achieving superior performance of index pattern detection and channel estimation. Aside from the relevant simulation results, an intuitive geometrical analysis is provided below to support the validity of the proposed receiving algorithm under relatively fast-fading channel.


III-D Extension to Frequency-Selective Fading Channel
In this paper, flat-fading channel is mainly considered for THz communication systems with high-gain directional antennas [31]. However, the multi-path effects in the THz band, which are explicitly described in [34], cannot be fully eliminated by highly directional transmission under certain cases [35]. For instance, there may exist possible reflectors close to the LoS path, yielding additional reflected paths; Besides, combined reflection and transmission of THz signals on a multi-layer objects like window glass also contribute to multiple available paths. Furthermore, when the communication bandwidth is sufficiently high, frequency-selective channel fading should be considered even for LoS scenario without multi-path propagation [36]. Hence, in this subsection, the proposed flexible IM-aided pilot design is extended to the frequency-selective fading channel, where some guidelines about the generalized design will be provided.
Several assumptions are made under the frequency-selective fading channel: a) The length of channel impulse response (CIR) is assumed to be known, denoted as ; b) The CIR keeps approximately constant within each data block of length , since the symbol period is extremely short due to the ultra-high communication bandwidth. Under these assumptions, the generalized data block structure with the proposed flexible IM-aided pilots is modified as illustrated in Fig. 5. On one hand, for each data block of length with classical fixed preamble in Fig. 5 (a), a pilot sequence is added at the beginning of the block, consisting of the cyclic prefix (CP) part of length and pilot symbol part with length . By considering the existence of I/Q imbalance at the transmitter, should be no less than for channel estimation [20]. Besides, CP of length is also applied to the data symbol part. On the other hand, unlike its classical counterpart, the pilot sequence is no longer fixed in each of the data block with the proposed flexible IM-aided pilots, as shown in Fig. 5 (b). By such operations, additional information bits can be conveyed by the positions of pilots to compensate for the spectral efficiency loss caused by channel estimation, which have totally possible choices for each data block. Then each data block could convey additional index bits, yielding the system spectral efficiency calculated as
(27) | ||||
where denotes the modulation order of data symbols. At the receiver, for the -th data block, the index pattern can be coarsely estimated using the channel estimate of the -th data block as the prior knowledge of CSI. Note that a fixed pilot sequence is required to provide initial channel estimate for the -st data block, as stated in the Initialization phase in Section III-C. Before coarse index pattern detection, equalization techniques such as frequency-domain equalization (FDE) or decision-feedback equalization (DFE) based on zero-forcing (ZF) or minimum mean squared error (MMSE) criterions are employed to eliminate the inter-symbol interference (ISI) caused by the frequency selectivity of THz channels [36]. Afterwards, a correlation-based method could be utilized to detect the index of pilot sequences. More specifically, sliding correlation is performed on the equalized signal vector, denoted as , using the pilot sequence (including CP) , which yields cross-correlation values for as
(28) |
Then the start index of the pilot sequence in the -th data block can be obtained as
(29) |
Finally, the index bits can be recovered from using a look-up table between different index patterns and binary bits, and the demodulation of data symbols is straightforward. The aforementioned extension of the proposed flexible IM-aided pilot design to the THz frequency-selective fading channel will be discussed into more detail in our near-future research.
IV Performance Analysis
IV-A Geometrical Analysis
Under relatively fast-fading channel, there exists apparent difference between the physical channel coefficients of the -th and -th data blocks for , aside from the variations of phase noise. Then , which is served as the prior CSI knowledge for the -th data block, is likely to be outdated. To validate the feasibility of the proposed turbo receiving algorithm, an intuitive geometrical demonstration is provided to figure out the error probability of the LLR-based index pattern detection. For tractable analysis, several assumptions are made as below: (1) is approximated as zero; (2) and are employed for modulation, where is empirically set to be larger than ; (3) There is only phase difference in the physical channel coefficients between adjacent blocks, whilst the amplitude remains approximately unchanged555Due to the ultra-wide bandwidth employed in the THz communication system, the change of transmission distance is marginal across hundreds of symbol periods, where the path gain is approximately constant.. Then the received signal model in (7) can be simplified as (the indices are omitted for brevity)
(30) |
Without loss of generality, we consider a pilot symbol is transmitted, and set the corresponding channel fading coefficient as . Then the outdated prior knowledge of CSI is written as , where is phase difference from the actual channel, and can be set within due to the rotational symmetry of the PSK constellation alphabets. Based on this model, the LLR value for the pilot symbol , denoted as , is calculated as
(31) | ||||
When the signal-to-distortion-and-noise ratio (SDNR) corresponding to the thermal noise and hardware imperfections at the receiver is sufficiently high, (31) can be further approximated as
(32) | ||||
It is seen from (32) that, at high-SDNR region, the accuracy of index pattern detection is mainly dependent on the minimum Euclidean distances between and the constellation points in and with a clockwise rotation of . For the sake of clarity, the employed constellation alphabets are drawn in Fig. 6, where the point A denotes the received signal for sufficiently high SDNRs. Then it can be observed that
(33) |
and
(34) |
According to (33) and (34), when , the pilot symbol is likely to be wrongly detected if point A is closer to C than B, where B and C correspond to and , respectively. Define as the value of within satisfying , then it can be seen from Fig. 6 that the sub-interval of in meeting yields . Besides, can be obtained by solving
(35) |
whose solution within with respect to the value of is presented in Fig. 7.
On the other hand, for , and correspond to and with a clockwise phase rotation of , respectively. Similar to , is likely to be wrongly detected if A is closer to than . Due to the symmetric property of the constellation alphabets, the sub-interval of that meets can be readily obtained as . Therefore, the overall interval width of causing wrong detection of at high-SDNR region, termed as wrong detection region, can be approximated as . For instance, when , it is seen from Fig. 7 that the width of wrong detection area is less than , which means the possibility for correct index pattern detection approximately exceeds , if is uniformly distributed in . Hence, it can be concluded that, when the SDNR is sufficiently high, with proper design of and , the indices of flexible IM-aided pilots can be correctly detected even using outdated CSI knowledge, which enables the proposed turbo receiver to gradually enhance the precision of the index pattern detection through iterations.


IV-B Complexity Analysis
For classical pilot assignment, according to (8), the computational complexity of channel estimation can be calculated as in terms of the number of complex multiplications per data block. On the other hand, for the proposed flexible IM-aided pilot design, the complexity of the proposed turbo receiving algorithm is investigated. Since the LS channel estimation in the initialization phase only performs once before transmission, it can be omitted in the calculation of detection complexity. Besides, for coarse index pattern detection, complex multiplications are required for each data block. Moreover, for iterative joint index pattern detection and channel estimation, in each iteration, the numbers of complex multiplications for the LLR detector and LS channel estimator per data block are equal to and , respectively. Hence, the total count of required complex multiplications per data block yields , where represents the iteration times. Although additional overhead is induced by the proposed turbo receiving algorithm compared with the conventional counterpart, is usually less than for fast fading channel according to the numerical results in Section V, which is expected to be further reduced under the scenario of quasi-static fading channel. Besides, the number of complex multiplications per data block only increases linearly with the system parameters including , , , and . Therefore, the complexity of the proposed turbo receiving algorithm is acceptable, considering its remarkable benefits over classical pilot assignment.

V Numerical Results
The channel estimation accuracy and BER performance of the THz communication system with the proposed flexible IM-aided pilots and turbo receiving algorithm are evaluated via Monte Carlo simulations, compared with its conventional counterpart with fixed pilot assignment. Besides, the latency induced by the proposed turbo receiver is also investigated. The parameters used in simulations are listed as below. A single-carrier THz communication system at 300 GHz with the bandwidth of 10 GHz is considered, where the transmission distance is set as m [31]. In simulations, the amplitude of the physical channel fading coefficient is assumed to be approximately constant, whilst the phase term randomly changes across different data blocks, which keeps constant within each block, following independent uniform distribution between . For the frame structure, the block length is set as , which can be further split into subblocks of length . The preamble length of classical pilot assignment equals for each data block, whilst , and are set for the proposed pilot design. In addition, for hardware imperfections, the amplitude and phase imbalances of the I/Q branch at the transmitter are set as and . Besides, for is assumed to be independently identically distributed Gaussian variables following . Besides, the level of hardware impairments at the receiver is set as dB. Note that these parameters of hardware impairments are on the same order as those in the existing literature [16, 12, 18]. Moreover, in simulations, and are adopted by the proposed scheme, whilst QPSK is applied for both data symbols and pilots in its conventional counterpart. Based on the aforementioned parameters, the system spectral efficiencies of the proposed pilot design and its conventional counterpart are calculated as and bit/s/Hz, respectively.

Figure 8 presents the mean-square error (MSE) results for channel estimation of the proposed turbo receiving algorithm with flexible IM-aided pilots, using LS and minimum mean-square error (MMSE) estimators with classical pilot assignment as the benchmarks. Here is set as , and denotes the signal-to-noise ratio (SNR) per bit. A lower bound for the achievable MSE of the proposed turbo receiving algorithm is obtained by performing LS estimation with all the flexible pilots, assuming that their indices are perfectly detected by the receiver. Besides, the maximum iteration number is fixed as , respectively, where the stopping criterion for iterations stated in Section III-C is not employed. It is clear that the estimation MSE for the proposed turbo receiving algorithm declines with the increase of , which gradually converges to the lower bound. Explicitly, when equals , the MSE curve fits the lower bound quite well as is larger than dB. This demonstrates that, with the aid of the proposed turbo receiver, the flexible IM-aided pilots are capable of approaching the estimation accuracy of fixed preamble assignment with the same pilot length, whilst fully compensating for the spectral efficiency loss at the same time. Moreover, the proposed scheme improves the estimation accuracy significantly at even higher spectral efficiency over its conventional counterparts based on LS and MMSE channel estimation, which attains MSE of lower than in most of the SNR region. This is because more pilot symbols can be utilized for channel estimation by employing the flexible IM-aided pilot design without sacrificing the spectral efficiency.
Besides, the BER performances of the THz communication system employing flexible IM-aided pilots and the proposed turbo receiver with different numbers of iterations are presented in Fig. 9, where equals . When , severe error floor is witnessed for the proposed scheme due to the poor detection accuracy of the pilot index pattern. With the increase of , the performance of the proposed scheme enhances significantly thanks to the reduced error of the index pattern detection and channel estimation, which is aligned with the MSE results in Fig. 8. Besides, the proposed strategy with is capable of achieving about dB and dB performance gains over its conventional counterparts using LS an MMSE channel estimators at the BER of , respectively, whilst enhancing the spectral efficiency by about bit/s/Hz at the same time. This validates the remarkable advantages of the proposed flexible IM-aided pilot design and the turbo receiving algorithm in attaining desirable channel estimation accuracy and reliable high-rate data transmission. Moreover, the system performance with the proposed turbo receiver employing the iteration stopping criterion is also simulated, which achieves almost the same performance as that of with reduced computational complexity and latency.
For latency analysis, the statistics of the iteration times required for the proposed turbo receiving algorithm with the stopping criterion of iterations are presented in Fig. 10, where the bar corresponding to iterations denotes the proportion of simulated data blocks requiring iterations for convergence. Besides, equals , and the maximal iteration number is set as for brevity. It is observed from Fig. 10 that, although iterations are required in the turbo receiving algorithm for some data blocks, the corresponding proportion is less than at different SNRs. Furthermore, this proportion is gradually decreased to about as increases to dB. On the other hand, the cases of account for the proportion over when is larger than dB. Hence, it is indicated that, although additional time delay is induced by the proposed turbo receiving algorithm under fast fading channel, the resultant latency is moderate especially at high SNRs, since it is more likely to spend less than iterations for convergence. Besides, such latency is expected to be lower under the scenario of quasi-static fading channel, where the difference between the channel fading vectors of adjacent data blocks is marginal.

Figure 11 shows the BER performance of the THz communication system using flexible IM-aided pilots and the turbo receiving algorithm with respect to the value of . It is observed that, when , the overall BER value is higher than , for the reason that the average transmit power of pilots is low, causing poor channel estimation accuracy. Furthermore, for , the BER of index bits gradually declines as becomes larger, due to higher allocated power for the pilot symbols and also the enlarged distance between and from according to (12). Besides, for the data symbol bits, there is also performance improvement with the increase of until despite its reduced power, thanks to the enhanced accuracy of the index pattern detection and channel estimation. As continues to enlarge, the performance of data part begins to deteriorate. Since the data symbol bits account for the majority of transmitted information in most cases, the trend of the overall BER with respect to is generally dependent on the data symbol bits, as illustrated in Fig. 11. Clearly, there is a trade-off to determine the optimal for reliable data transmission, which is aligned with the analytical work in Section III-B. Numerically, an optimal value of can be approximately obtained as from the BER curve of overall bit errors.

VI Conclusion
In THz wireless communication systems with hardware imperfections, due to the time-variant property of the strong phase noise as well as possible fluctuations of the physical channel coefficient, pilots are required to be frequently inserted for channel tracking, leading to undesirable throughput loss. To compensate for the spectral efficiency loss, whilst attaining high accuracy of channel estimation simultaneously, a novel flexible pilot design strategy is proposed based on the IM technique, where the positions of pilots are no longer fixed in the data frame, so that additional information bits could be conveyed by their indices. At the receiver, a turbo receiving algorithm is developed for iterative index pattern detection and channel estimation, which is capable of achieving superior performance even with outdated prior knowledge of CSI. To validate the feasibility of the proposed pilot design as well as the turbo receiving algorithm, an intuitive geometrical analysis is performed, followed by the computational complexity analysis. Analytical and simulative results demonstrate that the proposed flexible IM-aided pilot design with the turbo receiving algorithm could guarantee the accuracy of channel estimation. Besides, it is capable of improving the BER performance with higher spectral efficiency, in comparison with its conventional counterpart.
Aside from the use of highly directional antennas, the multiple-input multiple-output (MIMO) architecture with beamforming/combining techniques is capable of further enhancing the coverage of the THz communication systems. For single-stream MIMO systems being quite suitable for THz wireless transmission due to low hardware cost and power consumption, the proposed flexible IM-aided pilot design is also applicable, where the beamforming vector design needs careful investigation. The detailed research will be set aside as our future work.
References
- [1] H. Elayan, O. Amin, B. Shihada, R. M. Shubair, and M. Alouini, “Terahertz band: The last piece of RF spectrum puzzle for communication systems,” IEEE Open J. Commun. Society, vol. 1, pp. 1–32, Jan. 2020.
- [2] Z. Chen et al., “A survey on terahertz communications,” China Commun., vol. 16, no. 2, pp. 1–35, Feb. 2019.
- [3] X. Yu et al., “Exploring THz band for high speed wireless communications,” in Proc. IEEE IRMMW-THz, Copenhagen, Sep. 2016, pp. 1–2.
- [4] K. Huang and Z. Wang, “Terahertz Terabit Wireless Communication,” IEEE Microwave Mag., vol. 12, no. 4, pp. 108–116, Jun. 2011.
- [5] I. F. Akyildiz, J. M. Jornet, and C. Han, “Terahertz band: Next frontier for wireless communications,” Phys. Commun., vol. 12, pp. 16–32, Jan. 2014.
- [6] K. Tekbiyik, A. R. Ekti, G. K. Kurt, and A. Gorcin, “Terahertz band communication systems: Challenges, novelties and standardization efforts,” Phy. Commun., vol. 35, pp. 100700.
- [7] A. -A. A. Boulogeorgos et al., “Terahertz technologies to deliver optical network quality of experience in wireless systems Beyond 5G,” IEEE Commun. Mag., vol. 56, no. 6, pp. 144–151, Jun. 2018.
- [8] T. S. Rappaport et al., “Wireless communications and applications above 100 GHz: Opportunities and challenges for 6G and beyond,” IEEE Access, vol. 7, pp. 78729–78757, 2019.
- [9] H. Song and T. Nagatsuma, “Present and future of terahertz communications,” IEEE Trans. THz Sci. Technol., vol. 1, no. 1, pp. 256–263, Sep. 2011.
- [10] E. N. Papasotiriou, A. A. Boulogeorgos, and A. Alexiou, “Performance Analysis of THz Wireless Systems in the Presence of Antenna Misalignment and Phase Noise,” IEEE Commun. Lett., vol. 24, no. 6, pp. 1211–1215, Jun. 2020.
- [11] T. Schenk, RF Imperfections in High-Rate Wireless Systems. The Netherlands: Springer, 2008.
- [12] A. -A. A. Boulogeorgos, E. N. Papasotiriou, and A. Alexiou, “Analytical performance assessment of THz wireless systems,” IEEE Access, vol. 7, pp. 11436–11453, Jan. 2019.
- [13] E. Björnson, M. Matthaiou, and M. Debbah, “A new look at dual-hop relaying: Performance limits with hardware impairments,” IEEE Trans. Commun., vol. 61, no. 11, pp. 4512–4525, Nov. 2013.
- [14] A. -A. A. Boulogeorgos and A. Alexiou, “Analytical performance evaluation of beamforming under transceivers hardware imperfections,” in Proc. IEEE WCNC, Marrakesh, Morocco, Apr. 2019.
- [15] T. Mao, Q. Wang and Z. Wang, “Spatial Modulation for Terahertz Communication Systems With Hardware Impairments,” IEEE Trans. Veh. Technol., vol. 69, no. 4, pp. 4553–4557, Apr. 2020.
- [16] Y. R. Ramadan, H. Minn, and M. E. Abdelgelil, “Precompensation and system parameters estimation for low-cost nonlinear Tera-Hertz transmitters in the presence of I/Q imbalance,” IEEE Access, vol. 6, pp. 51814–51833, Oct. 2018.
- [17] T. Mao, Q. Wang, and Z. Wang, “Receiver design for the low-cost Terahertz communication system with hardware impairment,” in Proc. IEEE ICC 2020, Virtual Conference, Jun. 2020.
- [18] G. Colavolpe, A. Barbieri, and G. Caire, “Algorithms for iterative decoding in the presence of strong phase noise,” IEEE J. Sel. Areas Commun., vol. 23, no. 9, pp. 1748–1757, Sep. 2005.
- [19] R. Krishnan, A. Amat, T. Eriksson, and G. Colavolpe, “Constellation optimization in the presence of strong phase noise,” IEEE Trans. Commun., vol. 61, no. 12, pp. 5056–5066, Dec. 2013.
- [20] W. Hou and M. Jiang, “Enhanced joint channel and IQ imbalance parameter estimation for mobile communications,” IEEE Commun. Lett., vol. 17, no. 7, Jul. 2013.
- [21] S. Buzzi, M. Lops, and S. Sardellitti, “Performance of iterative data detection and channel estimation for single-antenna and multiple-antennas wireless communications,” IEEE Trans. Veh. Technol., vol. 53, no. 4, pp. 1085–1104, Jul. 2004.
- [22] C. Studer, M. Wenk, and A. Burg, “MIMO transmission with residual transmit-RF impairments,” in Proc. 2010 Intern. ITG WSA, Bremen, 2010, pp. 189–196.
- [23] E. Basar, M. Wen, R. Mesleh, M. Di Renzo, Y. Xiao, and H. Haas, “Index modulation techniques for next-generation wireless networks,” IEEE Access, vol. 5, pp. 16693–16746, 2017.
- [24] T. Mao, Q. Wang, Z. Wang, and S. Chen, “Novel Index Modulation Techniques: A Survey,” IEEE Commun. Surveys & Tut., vol. 21, no. 1, pp. 315–348, Firstquarter 2019.
- [25] E. Basar, U. Aygolu, E. Panayirci and H. V. Poor, “Orthogonal Frequency Division Multiplexing With Index Modulation,” IEEE Trans. Signal Process., vol. 61, no. 22, pp. 5536–5549, Nov. 2013.
- [26] M. Wen, E. Basar, Q. Li, B. Zheng, and M. Zhang, “Multiple-Mode Orthogonal Frequency Division Multiplexing With Index Modulation,” IEEE Trans. Commun., vol. 65, no. 9, pp. 3892–3906, Sep. 2017.
- [27] Q. Li, M. Wen, Y. Zhang, J. Li, F. Chen, and F. Ji, “Pilot insertion with index modulation for OFDM-Based vehicular communications,” in Proc. 2018 IEEE GlobalSIP, Anaheim, CA, USA, Nov. 2018, pp. 1204–1208.
- [28] J. Qi and S. Aissa, “Analysis and compensation of power amplifier nonlinearity in MIMO transmit diversity systems,” IEEE Trans. Veh. Technol., vol. 59, no. 6, pp. 2921–2931, Jul. 2010.
- [29] M. Shabany and P. G. Gulak, “Efficient compensation of the nonlinearity of solid-state power amplifiers using adaptive sequential Monte Carlo methods,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 55, no. 10, pp. 3270–3283, Nov. 2008.
- [30] H. Steendam, “The effect of synchronisation errors on multicarrier systems,” Ph.D. dissertation, Universiteit Gent, 2000.
- [31] C. Han, A. O. Bicen, and I. F. Akyildiz, “Multi-wideband waveform design for distance-adaptive wireless communications in the Terahertz band,” IEEE Trans. Signal Process., vol. 64, no. 4, pp. 910–922, Feb. 2016.
- [32] X. Zhang, M. Matthaiou, M. Coldrey, and E. Bjornson, “Impact of residual transmit RF impairments on training-based MIMO systems,” IEEE Trans. Commun., vol. 63, no. 8, pp. 2899–2911, Aug. 2015.
- [33] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory, 1st ed. Englewood Cliffs, NJ, USA: Prentice-Hall, 1993.
- [34] C. Han, A. O. Bicen, and I. F. Akyildiz, “Multi-ray channel modeling and wideband characterization for wireless communications in the terahertz band,” IEEE Trans. Wirel. Commun., vol. 14, no. 5, pp. 2402–2412, May 2015.
- [35] B. Peng, S. Wesemann, K. Guan, W. Templ, and T. Kurner, “Precoding and detection for broadband single carrier terahertz massive MIMO systems using LSQR algorithm,” IEEE Trans. Wirel. Commun., vol. 18, no. 2, pp. 1026–1040, Feb. 2019.
- [36] V. Schram, Y. Wu, M. Kolleshi, and W. Gerstacker, “Comparison of transmission concepts for indoor THz communication systems,” in Proc. 2020 Third International Workshop on Mobile Terahertz Systems (IWMTS), Essen, Germany, 2020, pp. 1–5.