Least-Squares Iterative PAR Reduction for Point-to-Point ... - TrSys

Report 3 Downloads 29 Views
Least-Squares Iterative PAR Reduction for Point-to-Point Large-Scale MIMO-OFDM Systems Abdul Wakeel and Werner Henkel Transmission Systems Group (TrSyS) Jacobs University Bremen Bremen 28759, Germany Email: {a.wakeel and w.henkel}@jacobs-university.de

Abstract—This paper addresses peak-to-average ratio (PAR) reduction in orthogonal frequency division multiplexing (OFDM) based large-scale multiple-input multiple-output (MIMO) systems. We propose a new technique for PAR reduction in point-topoint scenarios. The last eigenchannels of a massive MIMO channel are often very weak. Not using them for data transmission will offer redundancy for PAR reduction. These eigenchannels are used to approximate the peaks which exceed a given target value in a least squares fashion. This approximate exceedence model is then subtracted from the original signal in time domain for PAR reduction. It has been shown through simulation results that a considerable gain can be obtained with the proposed method with marginal increase in the average power and almost negligible loss in the channel capacity. Index Terms—Large-scale multiple-input multiple-output (MIMO), singular value decomposition (SVD), precoding, orthogonal frequency division multiplexing (OFDM), peak-to-average ratio (PAR), least squares estimation.

I. I NTRODUCTION With the advent of new applications, the demand for high data rates are increasing day by day. New concepts and new technologies have been introduced to meet the public demands. One of the promising technique which has gained a lot of popularity in modern day communication systems is orthogonal frequency division multiplexing (OFDM), which offers high data rates flexibly. Moreover, due to its simple implementation, robustness against frequency selective fading in its coded form, and easy treatment of inter-symbol interference, OFDM is making its way into all wireless and wireline communication systems. To reap the benefits of spatial multiplexing and provide high through puts, massive or large-scale multiple input multiple output (MIMO) is a concept for future wireless communication systems. The concept was first proposed in [2] for multiuser MIMO (MU-MIMO) systems. The transmitter at the base station (BS) was supposed to have a large number of transmit antennas serving single antenna terminal units (TU). However, we adopt the term large-scale MIMO and extend it to a pointto-point MIMO system, where the transmitter and the receiver are supposed to be equipped with large number of antennas, in the range of hundred (e.g. communication between two BSs). Thus, OFDM-based large-scale MIMO will be a choice for future wireless communication technologies for point-to-point communications. However, a major drawback of OFDM is its

high peak-to-average ratio (PAR), which is defined as PAR =

maxk |xk |2 , E{|xk |2 }

(1)

where maxk |xk |2 represents the maximum instantaneous peak power and E{|xk |2 } denotes the average power of the OFDM signal. The high PAR of the OFDM signal is due to the statistical independence of the carriers, which results in an almost Gaussian distributed time-domain signal, which exhibits a high PAR. This high PAR drives the power amplifier (PA) to operate in its non-linear range. It has been mentioned in [2], that large-scale MIMO systems will be equipped with low-power low-cost radio-frequency (RF) components. For low-power RF components to operate in its linear range, it is necessary to lower down the dynamic range of the transmitted signal. Different techniques have been proposed for PAR reduction in MIMO-OFDM systems, equipped with a small number of antennas. The well-known techniques amongst them are Selected Mapping (SLM) [4], [5], Partial Transmit Sequences (PTS) [6], and Tone Reservation (TR) [8]. However, for largescale MU-MIMO-OFDM systems, a first approach has been made in [9], [10]. In this paper, we propose a new method for PAR reduction in point-to-point (PtP) large-scale MIMO-OFDM systems using the weakest eigenchannel(s) to estimate and model the threshold excursion caused by the remaining dimensions. This model function is then subtracted from the original signal in time domain for PAR reduction. The procedure is iterated to reach the PAR target value. The notations used throughout this paper are capital and lowercase letters for DFT domain and time-domain representations, respectively. Boldface letters are used to represent matrices and vectors. The rest of this paper is organized as follows. In Section II, we present the key idea. The system model is discussed in Section III. Section IV provides the core idea of estimating the exceedence values above a certain threshold. The simulation results for our approach are shown in Section V. Section VI finally concludes the paper. II. K EY IDEA Let H(n) be an Mr × Mt channel gain matrix in DFT domain at carrier n known to both the transmitter and the receiver. Using singular value decomposition (SVD), the channel

matrix H(n) for a point-to-point large-scale MIMO can be rephrased as H(n) = U(n) · Λ(n) · VH (n) ,

(2)

where V(n) and U(n) are unitary pre-processing and postprocessing matrices, respectively, (VH · V = UH · U = I) and Λ(n) is a diagonal matrix of the singular values of H(n). For Mt = Mr , Λ(n) can be written as   σ1,1 (n) 0 0 0   0 σ2,2 (n) 0 0   Λ(n) =   . (3) .. . . . .   . . ... . 0 0 0 σMt ,Mt (n) Usually, the singular values are sorted in a descending order along the diagonals by the SVD algorithm, i.e., σ1,1 > σ2,2 > . . . > σMt ,Mt . For large-scale MIMO-OFDM systems, the last singular value is typically so small that the corresponding eigenchannels are hardly suited for data transmission. Not using them would offer redundancy for peak-to-average ratio (PAR) reduction without a lot of cost in data rate. Throughout this paper, it is assumed that the last eigenchannel is too weak and is hence utilized for PAR reduction. III. S YSTEM

Fig. 1.

MODEL

Transmitter system model of MIMO-OFDM for LS algorithm

Figure 1 shows transmitter model with some basic components for a point-to-point MIMO-OFDM. For a largescale MIMO, the number of transmit antennas can be in the range of hundred, however, for the purpose of this paper, we have considered a 40 × 40 MIMO-OFDM system, i.e., Mt = Mr = 40. As shown in Fig. 1, the input vector X(n) = (X1,n , X2,n , · · · , XMt ,n )T , where Xµ,n is the input symbol at the µth (µ = 1, 2, · · · , Mt ) spatial channel and the nth carrier is pre-processed using a preprocessing matrix and converted into a parallel data stream using a serial to parallel converter. The time-domain signal is obtained taking an IFFT. The timedomain signal is padded with a cyclic prefix (CP) to mitigate the effect of inter-symbol interference (ISI) after parallel-toserial conversion. This signal is transmitted over a MIMO channel with a channel gain matrix H. At the receiver, reverse processes are used to obtain an estimate of the original signal. For better understanding, we give the input-output relation in frequency domain here. Let Y(n) = (Y1,n , Y2,n , · · · , YMr ,n )T be the output vector, where Yρ,n is the output symbol of the ρth (ρ = 1, 2, · · · , Mr ) spatial channel. The transmit signal X(n)

is pre-multiplied by V(n), whereas the signal at the receiver is post-multiplied by UH (n) to obtain the output Y(n) as shown in Fig. 2.

Fig. 2.

SVD MIMO diagonalization

I

I

z }| { }| { z Y(n) = U(n)H · U(n) · Λ(n) · VH (n) · V(n) · X(n) , (4) {z } | {z } | H(n)

˜ X(n)

which can also be written as

Y(n) = Λ(n) · X(n) , For the Mt × Mr = 40 × 40 case, Eq. (4) is rephrased as   X1,n σ1,1 (n) 0 0 0   X2,n  0 σ (n) 0 0 2,2   Y(n) =   .. ..   . . 0 0 0 XMt ,n 0 0 0 σMt ,Mt (n) (5) It is assumed that only the last spatial dimension is reserved for PAR reduction while the other dimensions are used for data transmission. Let S(n) denote the first dimensions used for data transmission, i.e., T S(n) = X1,n X2,n . . . XMt −1,n 0 . For estimating and modeling the peak values which exceed a given threshold, we use R(n), defined as, T , R(n) = 0 0 . . . 0 XMt ,n

whereas XMt ,n should not be confused with the input data, it contains the modeled corrective signal that is used for PAR reduction. As shown in Fig. 1, the input data vector S(n) and corrective signal R(n) are pre-processed as T ˜ S(n) = V(n)·S(n) = V(n)· X1,n . . . XMt −1,n 0 (6) and T ˜ . R(n) = V(n) · R(n) = V(n) · 0 . . . 0 XMt ,n (7) The two are then transformed into time domain and subtracted, i.e., ˜ = ˜s − ˜r = F−1 VX = F−1 V(S − R) , x (8) using a block-IDFT matrix F−1 with blocks of diagonal submatrices with Mt = Mr (= 40) identical diagonal elements ˜ ... are obtained by wn,k = √1N ej2πkn/N . All vectors X, X, ˜ concatenating the vectors X(n), X(n), .... Now, the goal is to estimate and model the exceedence values by R in DFT domain to finally subtract the model.



   . 

for n = 0, ..., N − 1, which in more compact form can be written as ϕ= V·R , (12)

t

where V is a block-diagonal unitary matrix. The column vector ϕ shall approximate E in a least-squares sense at all frequencies representing the determined exceedence peaks, i.e., min ||ϕ − E||2 . (13) R

Eq. (13) can be solved for R as a least squares estimation,

e(t )

R = V−1 · E .

Inside R, only the components at positions Mt (i), i = 1, ..., N will be kept, eliminating (zeroing) all others. The modified R is then pre-coded, transformed into time domain using the IFFT, and is subtracted from the original signal for PAR reduction. 2) 2nd approach: As shown, Eq. (7) essentially only addresses the last columns of the blocks V(i) inside the V matrix. Thus, Eq. (7) is rephrased as

t

Fig. 3.

IV. M ODELING

Representation of exceedence values

THE PEAK EXCURSIONS BY THE LAST SPATIAL DIMENSION

Let ˜ s be the transmitted signal in time domain with no data transmission on the reserved eigenchannels. The proposed algorithm searches for any peak value that exceeds a PAR target value. These exceedence excursions are then represented and modeled by the last spatial dimension in frequency domain. Let e be a vector representing the exceeding excursions in time domain on all spatial dimensions of a large-scale MIMOOFDM transmitted signal, i.e., e = ((e1,1 e2,1 · · · eMt ,1 ), · · · , (e1,n e2,n · · · eMt ,n ))T ,

(9)

where 

for xµ,k ≤ xtarget , xµ,k − e · xtarget for xµ,k > xtarget . (10) Let E be the DFT-domain counterpart of e, i.e., E = F · e, where F is a block DFT matrix. We will use two equivalent approaches to estimate the peak values exceeding the given threshold. 1) 1st approach: In order to estimate and model the exceedence excursions by the reserved spatial dimension, Eq. (7) can be reformulated as   0   ..    .   V(0) 0 ... 0  XMt ,0   0    V(1) . . . 0    ..  ..   =ϕ, . . . .. ..   .  ···   0   0 ··· · · · V(N − 1)   ..   . XMt ,N −1 (11) eµ,k =

0

j∠(xµ,k )

(14)

 V (0) :Mt 0   ..  . 0

0 V:Mt (1) ··· 0

... ... .. . ···

0 0 .. . V:Mt (N − 1)

XMt ,0    XMt ,1  =ϕ,  ..   . XMt ,N−1 (15) 

where V:Mt (n) represents the Mt th column ( : stands for all rows) of the nth V matrix, which can be represented in a compact form as ϕ = V:Mt · RMt ,

(16)

where V:Mt is a block diagonal matrix with the Mt th column of V(n) at the nth diagonal, and RMt contains only the last spatial components. In a least-squares sense, the column vector ϕ shall approximate E according to min ||ϕ − E||2 .

RMt

(17)

In Eq. (17), the number of equations are more than the number of unknowns. Thus, this leads to a pseudo-inverse solution. Solving Eq. (17) for RMt in least-squares sense results in H H ·E, · V:Mt )−1 · V:M RMt = (V:M t t

(18)

where H stands for Hermitian (conjugate transpose). Both approaches are equivalent and, of course, deliver the same result. The first one, however, is the less complex, since no pseudoinverse needs to be computed. The procedure is iterated, as the peaks exceeding the target value are not estimated exactly by the least-squares approach in one iteration. The least squares algorithm is described as follows. Least Squares algorithm 1) Initialize S to be the DFT-domain information vector, with the reserved dimension set to zero. Precode it using the pre-processing matrix as shown in Eq. 6. Set i = 0. ˜ 2) Initialize the time-domain solution ˜s, i.e., ˜s = IFFT(S).

˜s(i+1) = ˜s(i) − ˜r ,

(19)

i = i + 1 and go to Step 3. i is the iteration counter and ˜s denotes the Mt · N long timedomain vector, where Mt is the number of transmit antennas and N is the IFFT length. The structure follows (9). It is very unlikely for all antenna channels to encounter a value exceeding the threshold at the same time. Hence, the leastsquares approach will not approximate the peak, but yield a result reduced by a factor of M , since it tries, at the same time, to approximate the zeros (no exceedences) in the other antenna channels.1 This is the drawback of an l2 norm instead of l∞ . We introduce a weighting factor γ in the algorithm, which leads to optimum performance for γ = M . The weighting can be realized by modifying steps 3 or 5 of the presented LS algorithm. Modifying Step 3 means weighting the peak limit excursions before mapping them onto the last spatial dimension. Equation (10) becomes  0 for xµ,k ≤ xtarget , eµ,k = γ(xµ,k − ej∠(xµ,k ) · xtarget ) for xµ,k > xtarget . (20) Alternatively, one could instead write Step 5, Eq. (19) as ˜s(i+1) = ˜s(i) − γ˜r .

(21)

V. R ESULTS AND DISCUSSION A massive MIMO-OFDM systems is expected to be equipped with a large number of antennas (up to one hundred or more), however, for our simulation results we have considered a 40 × 40 MIMO-OFDM system with N = 128 subcarriers and 16-QAM modulation. It is assumed that the channel is perfectly known at the transmitter and the receiver (perfect CSI). The channel matrix H is taken from [4], which described by a matrix polynomial H(z) = PlH −1can be −k H z , where Hk is an Mr × Mt matrix having k k=0 complex fading channel coefficients. The channel coefficients are i.i.d. complex Gaussian distributed with zero mean and variance 1/lH , with lH = 20. Due to transmitter-sided precoding, the average power is distributed over all spatial dimensions. Moreover, for PAR reduction a time-domain signal ˜r is added to the original signal ˜s, thus, the average power increases per iteration. The PAR after applying the algorithm is defined in here as PAR = 1 Illustration:

max∀µ ,∀k |xµ,k + γrµ,k |2 , σ2

(22)

Assume e and M −1 zeros to be approximated in least-squares d sense. This means dx [(x − e)2 + (M − 1)(x − 0)2 ] = 0 =⇒ x = e/M .

where k is the sample index, and σ 2 = E∀µ ,∀k {|xµ,k |2 } is chosen to be the average power (over all spatial dimensions) without any PAR measures, i.e., with an unused spatial dimension and without any increase in the average power after the algorithm. As a performance measure, we consider the complementary cumulative distribution function (ccdf), which is the probability that the current PAR exceeds a certain threshold τth , i.e., P r{P AR > τth } [4]. Figure 4 shows the simulation results for a massive MIMOOFDM system with a target PAR value of 6.5 dB, and a weighting factor γ = 30. A gain of approximately 5.0 dB is obtained at 10−5 with as few as 8 iterations. Moreover, Fig. 4 clearly shows a gain of 3.2 dB with the first iteration and as much as 4.2 dB with only 3 iterations. The higher number of iterations show that the algorithm converges to the target value. However, the computational complexity of the proposed algorithm increases with the number of iterations, as more IFFT/ FFTs are needed to alternate between time and DFT domains. As mentioned earlier, for modeling the exceedence excur-

0

10

PAR original 1st iteration 2nd iteration 3rd iteration 5th iteration 8th iteration

−1

10

−2

10 CCDF (PAR)

3) Initialize e to represent the exceedence excursion according to Eq. (10) and transform it into DFT domain, i.e., E = FFT(e). 4) Approximate E by the last spatial dimension using equations (14) or (18). Precode it with the first three dimensions set to zero as shown in Eq. (7) and transform it into time domain applying the IFFT modulator, i.e., ˜ ˜ r = F−1 R. 5) Update the time-domain vector

−3

10

−4

10

−5

10

5

6

7

8

9

10

11

12

PAR / dB

Fig. 4. CCDF(PAR) of the LS algorithm for a PAR target value of 6.5 dB without mean power constraints (γ = 30)

sions, our algorithm searches for the peak values that exceeds a given target value on all spatial dimensions in time domain. These values are transformed into DFT domain, and are estimated by the last spatial dimension. The estimated model is re-transformed into time domain and are subtracted from the original signal for PAR reduction. However, in doing so, two facts needs to be taken into consideration: 1) Capacity loss: Loss in channel capacity due to the reserved eigenchannels. 2) Relative mean power increase ∆E: Increase in the aver-

age power per iteration due to addition of the corrective signal, and performance of the algorithm with a mean power constraint.

4

5.5

x 10

Massive MIMO−OFDM channel capacity Massive MIMO−OFDM channel capacity with reserved eigenchannels Massive MIMO−OFDM channel capacity with 2 % reserved tones Massive MIMO−OFDM channel capacity with 5 % reserved tones Massive MIMO−OFDM channel capacity with 10 % reserved tones

5

A. Capacity analysis of the proposed algorithm

min(Mr ,Mt )

X

CMIMO =

log2 (1 + αi σi2 ) ,

(23)

Sum capacity (bits/MIMO−OFDM symbol)

4.5

The proposed algorithm reserves the weakest eigenchannel (spatial dimension) to model the peak excursion due to the other spatial dimensions. Thus, there is a slight loss in the channel capacity. SVD decomposes the channel matrix H into independent parallel subchannels, thus, the capacity of a largescale MIMO channel is equal to the sum of the capacities of the independent parallel subchannels,

4

3.5

3

2.5

2

i=1

where σi2 is the ith singular value of the channel matrix H, min(Mr , Mt ) is the minimum of the number of receive or transmit antennas, and αi = Φi /ζ 2 denotes the transmittersided signal-to-noise ratio (SNR) of the ith parallel subchannel. For a MIMO-OFDM channel, (23) is formulated as CT =

N min(M r ,Mt ) X X

n=1

log2 (1 + αi (n)σi2 (n)) ,

(24)

N min(M r ,Mt )−1 X X

n=1

log2 (1 + αi (n)σi2 (n)) .

(25)

i=1

Figure 5 shows the capacity curves for a 40 × 40 MIMOOFDM channel with and without reserved eigenchannels (using (24) and (25), respectively). It is obvious from Fig. 5 that at low SNR values the capacity curve for reserved eigenchannels overlap the actual capacity curve while there is a negligible capacity loss at higher SNR values. Figure 5 also shows capacity curves for 2%, 5%, and 10% reserved tones for large-scale MIMO-OFDM obtained using the following equation CT =

r ,Mt ) X X min(M log2 (1 + αi (n)σi2 (n)) .

N −θ carriers

1

0

5

10

15

20

25

SNR/dB

Fig. 5. Capacity curve of a 40×40 MIMO-OFDM channel with and without reserved eigenchannels and reserved tones (2%, 5%, and 10%), averaged over 100000 channel models

i=1

where σi2 (n) and αi (n) are the singular values and the SNRs at the nth subcarrier, respectively. In case of an LS algorithm with no data transmission on the last spatial dimension, Eq. (24) becomes CLS =

1.5

(26)

i=1

where, θ is the number of reserved tones. It is clear from the figure that the capacity loss is very high for tone reservation. The capacity loss in our approach is far lower than for 2% reserved tones even at higher SNR values. B. Mean power effects of the proposed algorithm In order to reduce the clipping probability of the signal ˜s, a time-domain signal ˜r is added to ˜s for PAR reduction. However, in doing so, the mean transmit power is increased.

Thus, the relative mean transmit power increase ∆E is defined as [7] E{||˜si + ˜ri ||22 } ∆E = 10 log10 , (27) E{||˜s||22 } where E{||˜s||22 } = σ 2 and E{||˜si + ˜ri ||22 } are the average powers at the ith iteration. Figure 6 shows the mean power increase ∆E per iteration in dB for different PAR target values. Obviously, the increase in average power is high for low PAR target values, as more peaks are approximated. However, the increase in the average power is still marginal, in a range of one-two tenth of a decibel for as many as 10 iterations. The simulation results under different mean power constraints are shown in Fig. 7. A gain of approximately 5.2 dB and 4.8 dB is obtained at 10−5 for a slight increase in the mean power, i.e., ∆E = 0.5 dB and 0.4 dB, respectively. VI. C ONCLUSIONS We presented a novel PAR reduction algorithm for OFDM based massive MIMO systems in point-to-point scenarios. The proposed least-squares iterative approach exploits the massive degree of freedom (DoF) of a massive MIMO by reserving the weakest eigenchannels. These eigenchannels are used to estimate and model the peak excursion values exceeding a chosen PAR target value. This model function is then subtracted from the original signal in time domain for PAR reduction. We proposed two different approaches to estimate and model the peak values, which are actually equivalent. Simulation results shows that a gain of approximately 5.0 dB

Target value 7.5 dB Target value 7.0 dB Target value 6.5 dB 0.25

∆E (dB)

0.2

0.15

0.1

0.05

0

1

2

3

4

5 6 Number of iterations

7

8

9

10

Fig. 6. Mean power increase ∆E, in dB, per iteration for different target values, weighting factor γ = 30

0

10

PAR original ∆E = 0.1 dB ∆E = 0.2 dB ∆E = 0.3 dB ∆E = 0.4 dB ∆E = 0.5 dB

−1

10

−2

CCDF (PAR)

10

−3

10

−4

10

−5

10

5

6

7

8

9 PAR / dB

10

11

12

13

Fig. 7. CCDF(PAR) of the LS algorithm with a weighting factor γ = 30, for a PAR target value of 6.5 dB with different mean power constraints ∆E

is obtained with as few as 8 iterations at a CCDF of 10−5 . This gain is obtained at the expense of a minor increase in average power and almost no loss in the channel capacity. R EFERENCES [1] S.H. Han, and J.H. Lee, “An overview of peak-to-average power ratio reduction techniques for multicarrier transmission,” IEEE Wireless Communications, pp. 56–64, April 2005.

[2] T.L. Marzetta, “Noncooperative cellular wireless with unlimited number of base station antennas,” IEEE Transactions on Wireless Communications, vol. 9, No. 11, pp. 3590–3600, November 2010. [3] E.G. Larson, F. Tufvesson, O. Edfors, and T.L. Marzetta, “Massive MIMO for next generation wireless systems,” arXiv:1304.6690v2, May 2013. [4] C. Siegl and R.F.H. Fischer, “Peak-to-average ratio reduction in multi-user OFDM,” International Symposium on Information Theory, Nice, France, June 24-29, 2007. [5] R.F.H. Fischer and, M. Hoch “Peak-to-average ratio reduction in MIMOOFDM,” IEEE International Conference on Communication, Glasgow, Scotland, June 24-28, 2007. [6] C. Siegl and R.F.H Fischer, “Partial transmit sequences for peak-toaverage power ratio reduction in multiantenna OFDM,” EURASIP Journal on Wireless Communications and Networking, vol. 2008. [7] J. Tellado, Peak-to-average power reduction for multicarrier modulation, Ph.D. thesis, Stanford University, 1999. [8] W. Henkel, A. Wakeel, and M. Taseska, “Peak-to-average ratio reduction with tone reservation in multi-user and MIMO-OFDM,” 1st IEEE International Communication Conference China (ICCC), Beijing, 2012. [9] S.K. Muhammad and E.G. Larsson, “Per antenna constant envelop precoding for large multi-user MIMO systems,” arXiv:1111.3752v1, January 2012. [10] C. Studes and E.G. Larsson, “PAR-aware larege-scale multi-user MIMOOFDM downlink,” in Proc. of the 9th International Symposium on Wireless Communication Systems (ISWCS), Paris, France, August 2012. [11] A. Goldsmith, Wireless Communications, Cambridge University Press 2005. [12] E. Telatar, “Capacity of multi-antenna Gaussian channels,” Eur. Trans. Telecomm. ETT, vol. 10, no. 6, pp.585-596, November 1998.