This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Comments on “A New ML Based Interference Cancellation Technique for Layered Space-Time Codes”

Hufei Zhu,  Wen Chen,  Hufei Zhu is with Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, P. R. China and Huawei Technologies Co., Ltd., Shenzhen 518129, P. R. China, e-mail: [email protected];Wen Chen is with Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, P.R. China and SEU SKL for Mobile Communications, e-mail: [email protected] work is supported by NSF China #60972031, by SEU SKL project #W200907, by Huawei Funding #YJCB2009024WL and #YJCB2008048WL, by New Century Excellent Talent in Universities #NCET-06-0386, and by National 973 project #2009CB824900.
Abstract

In this comment, we justify that the computational complexity proposed in the paper ”A New ML Based Interference Cancellation Technique for Layered Space-Time Codes” (IEEE Trans. on Communications, vol. 57, no. 4, pp. 930-936, 2009) is O(N3)O(N^{3}) rather than the claimed O(N2)O(N^{2}), where NN is the number of receive antennas.

A maximum likelihood (ML) based interference cancellation (IC) detector was proposed in [1] for double space-time transmit diversity (DSTTD), which consists of two Alamouti’s space-time block codes (STBC) units [2]. In many application areas of interest, the computational complexity of the detector in [1] can be less than that of the conventional minimum mean squared error (MMSE) IC detector for DSTTD [3]. However, the complexity claimed in [1] needs to be modified, as will be discussed in this comment.

Let NN denote the number of receive antennas. In [1], the theoretical analysis gaves a complexity of O(N2)O(N^{2}) (i.e. 7N2+62N1037N^{2}+62N-103 real multiplications and 12N2+47N10312N^{2}+47N-103 real additions) [1, Table \@slowromancapi@], while numerical experiments were not carried out to verify the given complexity. In what follows, we show that the complexity is not O(N2)O(N^{2}), but O(N3)O(N^{3}), and then give the exact complexity that is verified by our numerical experiments.

Firstly, we show that a complexity of O(N3)O(N^{3}) is required to perform the orthonormalization process by equations (9), (13), (14) and (15) in [1]. Let ()T(\bullet)^{T} and ()H(\bullet)^{H} denote transpose and conjugate transpose of a vector, respectively. Equation (9) in [1] defines the basis vectors

𝐯i=[aibi𝐞iT]T,{\bf{v}}_{i}=\left[{\begin{array}[]{*{20}c}{a_{i}}&{b_{i}}&{\bf e}_{i}^{T}\\ \end{array}}\right]^{T}, (1)

where i=1,2,,2N2i=1,2,\cdots,2N-2, and 𝐞i{\bf e}_{i} is the (2N2)×1(2N-2)\times 1 vector with the ithi^{th} element to be 11 and all others to be zero. Equation (13) in [1] utilizes 𝐯1{\bf{v}}_{1} and 𝐯2{\bf{v}}_{2}, which is

𝛉1=𝐯1/𝐯1,𝛉2=𝐯2/𝐯2.{\bm{\uptheta}}_{1}={\bf{v}}_{1}/\left\|{\bf{v}}_{1}\right\|,\quad{\bm{\uptheta}}_{2}={\bf{v}}_{2}/\left\|{\bf{v}}_{2}\right\|. (2)

Moreover, we represent equations (14) and (15) in [1] as

𝛉2n1=(𝐯2n1j=12n2c2n1j𝛉j)/,\displaystyle{\bm{\uptheta}}_{2n-1}=\left({{\bf{v}}_{2n-1}-\sum\nolimits_{j=1}^{2n-2}{c_{2n-1}^{j}{\bm{\uptheta}}_{j}}}\right)/\left\|\cdots\right\|, (3a)
𝛉2n=(𝐯2nj=12n2c2nj𝛉j)/,\displaystyle{\bm{\uptheta}}_{2n}=\left({{\bf{v}}_{2n}-\sum\nolimits_{j=1}^{2n-2}{c_{2n}^{j}{\bm{\uptheta}}_{j}}}\right)/\left\|\cdots\right\|, (3b)

where

c2n1j=𝛉jH𝐯2n1,\displaystyle{c_{2n-1}^{j}={{\bm{\uptheta}}_{j}^{H}{\bf{v}}_{2n-1}}}, (4a)
c2nj=𝛉jH𝐯2n,\displaystyle{c_{2n}^{j}={{\bm{\uptheta}}_{j}^{H}{\bf{v}}_{2n}}}, (4b)

and n=2,3,,N1n=2,3,\cdots,N-1. It can be seen that [𝛉2n1𝛉2n]\left[{\begin{array}[]{*{20}c}{{\bm{\uptheta}}_{2n-1}}&{{\bm{\uptheta}}_{2n}}\\ \end{array}}\right] consists of 2×22\times 2 Alamouti sub-blocks [4]. Thus we can obtain 𝛉2n{\bm{\uptheta}}_{2n} from 𝛉2n1{\bm{\uptheta}}_{2n-1}, to avoid computing (3b) and (4b).

Let 𝛉i,j,,k{\bm{\uptheta}}\sim\left\lfloor{i,j,\cdots,k}\right\rfloor denote that only the ith,jth,,kthi^{th},j^{th},\cdots,k^{th} entries in the vector 𝛉{\bm{\uptheta}} are non-zero. From (1), we obtain

𝐯i1,2,i+2,{\bf{v}}_{i}\sim\left\lfloor{1,2,i+2}\right\rfloor, (5)

where i=1,2,,2N2i=1,2,\cdots,2N-2. From (2) and (5), we obtain

𝛉11,2,3,𝛉21,2,4.{\bm{\uptheta}}_{1}\sim\left\lfloor{1,2,3}\right\rfloor,\quad{\bm{\uptheta}}_{2}\sim\left\lfloor{1,2,4}\right\rfloor. (6)

Let n=2n=2 in (3b) to obtain

𝛉3=(𝐯3c31𝛉1c32𝛉2)/1,2,3,4,5=15{\bm{\uptheta}}_{3}=\left({{\bf{v}}_{3}-c_{3}^{1}{\bm{\uptheta}}_{1}-c_{3}^{2}{\bm{\uptheta}}_{2}}\right)/\left\|\cdots\right\|\\ \sim\left\lfloor{1,2,3,4,5}\right\rfloor=\left\lfloor{1-5}\right\rfloor (7)

and

𝛉4=(𝐯4c41𝛉1c42𝛉2)/14,6,{\bm{\uptheta}}_{4}=\left({{\bf{v}}_{4}-c_{4}^{1}{\bm{\uptheta}}_{1}-c_{4}^{2}{\bm{\uptheta}}_{2}}\right)/\left\|\cdots\right\|\sim\left\lfloor{1-4,6}\right\rfloor, (8)

where (5) and (6) are utilized. From (6)-(8), it can be seen that for n=1,2n=1,2, we have

𝛉2n11(2n+1),𝛉2n12n,2n+2.{\bm{\uptheta}}_{2n-1}\sim\left\lfloor{1-(2n+1)}\right\rfloor,\quad{\bm{\uptheta}}_{2n}\sim\left\lfloor{1-2n,2n+2}\right\rfloor. (9)

Assume for any nn, 𝛉2n1{\bm{\uptheta}}_{2n-1} and 𝛉2n{\bm{\uptheta}}_{2n} satisfy (9). This assumption will be verified in this paragraph. From (3b), it can be seen that 𝛉2(n+1)1{\bm{\uptheta}}_{2(n+1)-1} includes the sum of 𝛉2n1{\bm{\uptheta}}_{2n-1}, 𝛉2n{\bm{\uptheta}}_{2n} and 𝐯2(n+1)1{\bf{v}}_{2(n+1)-1}, while 𝛉2(n+1){\bm{\uptheta}}_{2(n+1)} includes the sum of 𝛉2n1{\bm{\uptheta}}_{2n-1}, 𝛉2n{\bm{\uptheta}}_{2n} and 𝐯2(n+1){\bf{v}}_{2(n+1)}. From (5) and the assumption (9), we can conclude that 𝛉2(n+1)1{\bm{\uptheta}}_{2(n+1)-1} and 𝛉2(n+1){\bm{\uptheta}}_{2(n+1)} also satisfy (9). Then the assumption (9), which is valid for n=1,2n=1,2, is still valid for all the subsequent (n+1)(n+1)s where n=1,2,,N2n=1,2,\cdots,N-2. Thus we have verified the assumption (9) for any nn.

It can be seen from (9) that in (3a), c2n1j𝛉jc_{2n-1}^{j}{\bm{\uptheta}}_{j} requires more than jj multiplications, while j=12n2c2n1j𝛉j\sum\nolimits_{j=1}^{2n-2}{c_{2n-1}^{j}{\bm{\uptheta}}_{j}} requires more than j=12n2j2n2\sum\nolimits_{j=1}^{2n-2}{j}\approx 2n^{2} multiplications. Then totally it requires more than n=2N12n223N3\sum\nolimits_{n=2}^{N-1}{2n^{2}}\approx\frac{2}{3}N^{3} multiplications to compute (3a) for n=2,3,,N1n=2,3,\cdots,N-1. Thus we have shown that the actual complexities of the detector in [1] should be at least O(N3)O(N^{3}).

TABLE I: The computational complexities of the equations in [1]
Equation Number Complex Multiplications Complex Additions Real Multiplications Real Additions
(9) and (11) 4(N-1) 2(N-1) 4(N-1)+4 3
(13) 9 4
(14) 23N(N1)(N2)\frac{2}{3}N(N-1)(N-2) 23N(N1)(N2)\frac{2}{3}N(N-1)(N-2) (6N+5)(N2)(6N+5)(N-2) 2(N+1)(N2)2(N+1)(N-2)
(23) 2(N1)N2(N-1)N 2(N1)N2(N-1)N 4(N1)4(N-1)
(25) 2(N1)N2(N-1)N 2(N1)N2(N-1)N 4(N1)4(N-1)
(28) 4N4N 4N4N
Sum 23N3+2N2+163N4\frac{2}{3}N^{3}+2N^{2}+\frac{16}{3}N-4 23N3+2N2+103N2\frac{2}{3}N^{3}+2N^{2}+\frac{10}{3}N-2 6N2+5N96N^{2}+5N-9 2N22N+32N^{2}-2N+3

The dominant computations of the ML based IC detector [1] come from equations (9), (11), (13), (14), (23), (25) and (28) in [1], of which the complexities are listed in Table \@slowromancapi@. One complex multiplication takes four real multiplications and two real additions, while one complex addition needs two real additions. Therefore, it can be seen from Table \@slowromancapi@ that the complexities of the detector are equivalent to

83N3+14N2+793N25\frac{8}{3}N^{3}+14N^{2}+\frac{79}{3}N-25 (10)

real multiplications and

83N3+10N2+463N9\frac{8}{3}N^{3}+10N^{2}+\frac{46}{3}N-9 (11)

real additions. The total complexity is the sum of real multiplications and additions [1], which is

163N3+24N2+1253N34\frac{16}{3}N^{3}+24N^{2}+\frac{125}{3}N-34 (12)

floating-point operations (flops). We also carried out numerical experiments to count the flops required by the detector in [1]. The results of our numerical experiments are identical to those computed by (12), i.e., our numerical experiments have accurately verified (12).

TABLE II: Complexity comparison
The ML based IC The MMSE IC
detector for DSTTD [1] detector for DSTTD [3]
Real Real Total Real Real Total
N Mult. Add. Flops Mult. Add. Flops
2 105 83 188 128 135 263
3 252 199 451 360 369 729
4 475 383 858 768 770 1538
5 790 651 1441 1400 1380 2780
6 1213 1019 2232 2304 2241 4545
7 1760 1503 3263 3528 3395 6923
8 2447 2119 4566 5120 4884 10004

Table \@slowromancapi@ in [1] compared the complexities of the ML based IC detector for DSTTD in [1] and the conventional MMSE IC detector for DSTTD in [3]. From (10), (11) and (12), it can be seen that Table \@slowromancapi@ in [1] should be modified to Table \@slowromancapii@ in this comment, where the total complexity of the MMSE IC detector in [3] is

15N3+732N232N15N^{3}+\frac{73}{2}N^{2}-\frac{3}{2}N (13)

flops [1]. From Table \@slowromancapii@, it can be seen that the complexity of the detector proposed in [1] is about 2.22.2 times smaller than that of the MMSE IC detector [3] when the number of receive antennas is 88.

References

  • [1] S. Jung and J. Lee, “A New ML Based Interference Cancellation Technique for Layered Space-Time Codes”, IEEE Trans. on Communications, vol. 57, no. 4, pp. 930-936, 2009.
  • [2] S. M. Alamouti., “A simple transmit diversity technique for wireless communications,” IEEE J. Select. Areas Commun., vol. 16, no. 8, pp. 1451-1458, 1998.
  • [3] A. F. Naguib, N. Seshadri and A. R. Calderbank, “Applications of space-time block codes and interference suppression for high capacity and high data rate wireless systems”, Signals, Systems &\& Computers, 1998, Conference Record of the Thirty-Second Asilomar Conference, vol.2, pp. 1803-1810, Nov. 1998.
  • [4] A. H. Sayed, W. M., Younis and A. Tarighat, “An invariant matrix structure in multiantenna communications”, IEEE Signal Processing Letters, vol. 12, no. 11, pp. 749-752, 2005.