229x Filetype PDF File size 1.85 MB Source: lucris.lub.lu.se
Approximative Matrix Inverse Computations for Very-large MIMO and Applications to
Linear Pre-coding Systems
Prabhu, Hemanth; Rodrigues, Joachim; Edfors, Ove; Rusek, Fredrik
Published in:
[Host publication title missing]
2013
Link to publication
Citation for published version (APA):
Prabhu, H., Rodrigues, J., Edfors, O., & Rusek, F. (2013). Approximative Matrix Inverse Computations for Very-
large MIMO and Applications to Linear Pre-coding Systems. In [Host publication title missing] (pp. 2710-2715).
IEEE - Institute of Electrical and Electronics Engineers Inc..
Total number of authors:
4
General rights
Unless other specific re-use rights are stated the following general rights apply:
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors
and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the
legal requirements associated with these rights.
• Users may download and print one copy of any publication from the public portal for the purpose of private study
or research.
• You may not further distribute the material or use it for any profit-making activity or commercial gain
• You may freely distribute the URL identifying the publication in the public portal
Read more about Creative commons licenses: https://creativecommons.org/licenses/
Take down policy
If you believe that this document breaches copyright please contact us providing details, and we will remove
access to the work immediately and investigate your claim.
Approximative Matrix Inverse Computations for
Very-large MIMO and Applications to Linear
Pre-coding Systems
Hemanth Prabhu, Joachim Rodrigues, Ove Edfors, and Fredrik Rusek
Department of Electrical and Information Technology, Lund University, Sweden
{Hemanth.Prabhu, Joachim.Rodrigues, Ove.Edfors, Fredrik.Rusek}@eit.lth.se
Abstract—In very-large multiple-input multiple-output slower rate than inner products of the propagation vectors
(MIMO) systems, the base station (BS) is equipped with very with themselves when the number of antennas grow, i.e.,
large number of antennas as compared to previously considered the user channels are asymptotically orthogonal. In [3], mea-
systems. There are various advantages of increasing the number surements in a realistic propagation environment for large
of antennas, and some schemes require handling large matrices
for joint processing (pre-coding) at the BS. The dirty paper array of antennas at a BS (up to 128 antennas at the BS
coding (DPC) is an optimal pre-coding scheme and has a and 26 different single antennas users) are performed. It
very high complexity. However, with increasing number of was shown that by using reasonably large antenna arrays it
BS antennas, linear pre-coding performance tends to that of is possible to decorrelate single user channels. Furthermore,
the optimal DPC. Although linear pre-coding is less complex in [4], residential area measurements for very-large MIMO
than DPC, there is a need to compute pseudo inverses of
large matrices. In this paper we present a low complexity system were performed, showing linear pre-coding sum rates
approximation of down-link Zero Forcing (ZF) linear pre-coding of up to 98% of those achieved by dirty paper coding (DPC),
for very-large multi-user MIMO systems. Approximation for BS to Mobile Station (MS) antenna ratios as low as 10.
using a Neumann series expansion is opted for inversion of
matrices over traditional exact computations, by making use
of special properties of the matrices, thereby reducing the cost Although there is a clear benefit of scaling up the number
of hardware. With this approximation of linear pre-coding, of BS antennas, including an almost (near) optimal linear pre-
we can significantly reduce the computational complexity for coding, the hardware cost and signal processing complexity
large enough systems, i.e., where we have enough BS antenna
elements. For the investigated case of 8 users, we obtain 90% can be very high. When using linear pre-coding, such as
of the full ZF sum rate, with lower computational complexity, Zero-Forcing (ZF), we can operate at the above mentioned
when the number of BS antennas per user is about 20 or more. BS/MS antenna ratios around 10 and the main source of
I. INTRODUCTION complexity for ZF pre-coding becomes the inverse of a K×K
Multiple-Input Multiple-Output (MIMO) techniques for matrix, where K is the number of users. The assumption of
wireless communication offer high data rates and reliabil- a significantly higher number of antenna elements does not
ity through the utilization of multiple transmit and receive affect the dimensions of this matrix, but it does offer the
antennas. These techniques are becoming more mature and opportunity to carry out the matrix inverse by much simpler
have been incorporated in advanced standards like LTE (Long means than outright inversion.
Term Evolution) Release 10 [1] to meet the International
Mobile Telecommunications-Advanced (IMT-A) requirements There are various hardware implementations for matrix
of gigabits-per-sec data rates. Basically, the more antennas the inversion using different algorithms, QR-Gram Schmidt [5],
transceivers are equipped with, the better performance can QR-Givens Rotation [6], and Gauss-Jordan [7]. While these
be obtained in terms of data rate, diversity (reliability) and methods are generic and work well for any type of matrix,
spectral efficiency. we can exploit the special structure of matrices appearing in
In [2], a Multi-User (MU) MIMO system with (an assump- very-large MIMO to reduce the complexity of the linear pre-
tion of) unlimited number of base station (BS) antennas in coding and make it more hardware efficient. To meet these
a multi-cell environment is investigated. It is shown that all objectives, approximations (using Neumann series) of matrix
the effects of uncorrelated noise and fast fading disappear, as inversion is opted rather than computing the exact inverse.
does the intra-cell interference. The assumption of an unlim- We describe the general setting in which our pre-coding is
ited number of BS antennas greatly simplifies the theoretical assumed to operate, discuss the linear class of pre-coders, and
analysis. However, it is obvious that in a practical system the finally focus on complexity of the ZF pre-coder, when using
numberofantennas cannot be arbitrarily large due to physical, QR-Gram Schmidt and Neumann series expansion to perform
cost, and power constraints. matrix inversion. We derive and compare complexity both in
The theoretical analysis in [2], assumes that inner products terms of the number of operations and the energy required to
between propagation vectors of different users grow at a perform the computations.
where P is a K × K diagonal matrix for power allocation.
Thesumrate is maximized by optimizing the power allocation
under the constraint that Tr(P) = 1, where Tr(·) is the trace
Pre-Coding operator.
Although optimal sum rate is achieved by DPC, this
(Base Station) approach is too resource expensive to be implemented in
Central Processing hardware and is used as a benchmark for ZF, Minimum Mean
Unit
Square Error (MMSE) and low complexity approximations of
ZF. The pre-coding matrix F can be decomposed as
1 √
F = √ W P; (4)
γ
Fig. 1: System model of a MU-MIMO system with M-antenna
base station and K single-antenna mobiles stations. where W represents a particular linear pre-coding algorithm,
and γ = ||Wp(P)||2 , is a power normalization factor, where
F
|| • ||F is Frobenius Norm.
II. SYSTEM DESCRIPTION A. ZF pre-coding scheme
The system model and the pre-coding in the following ZF linear pre-coding transmits user signals towards the
section is in line with the corresponding description in [8]. intended user with nulls steered in the direction of other users.
A MU-MIMO system model consists of a BS equipped with The ZF pre-coder is given as
Mantennas, which is simultaneously serving K single users
antenna, as shown in Fig.1. The received vector y of size W =H†; (5)
ZF
K×1is described as † H H −1
√ where H = H (HH ) is the pseudo-inverse of the
y = ρHz+n; (1) channel matrix H. A perfect CSI at the transmitter and nulling
makes this scheme interference free, and the sum rate is given
where H is the K ×M propagationmatrix of complex-valued as
K
channel coefficients, z is an M × 1 transmit vector, and n is X ρP
C = max log 1+ i : (6)
an additive noise vector with Independent and Identically Dis- ZF 2 γ
Tr(P)=1 i=1
tributed (IID) zero-mean and unit-variance complex Gaussian
random variables. The scalar ρ is a measure of the Signal-to- As the number of BS antennas M increases, H tends to
Noise Ratio (SNR) of the link, which is proportional to the have nearly orthogonal columns as the terminals are not
transmitted power divided by the noise-variance. Furthermore, correlated due to their physical separation. This assures that
it also absorbs various normalizing constants. The total trans- the performance of ZF pre-coding will be close to that of
mit power is normalized and independent of the number of optimal DPC pre-coding. However, ZF pre-coding requires
antennas M, the transmit vector z satisfies E{||z||2} = 1. computation of the pseudo-inverse (in (5)), which requires the
The pre-coding process at the transmit side is specified as computationally expensive inversion of a K ×K matrix.
z = Fx; (2) B. MMSE pre-coding scheme
MMSE pre-coding can trade interference reduction for
where F is a M ×K pre-coding matrix, x a K × 1 vector signal power inefficiency. The MMSE pre-coder is given as
containing user symbols, as described in [4].
Although the very-large MU-MIMO model is similar to a W =HH HHH+αI −1 ; (7)
MMSE
standard MIMO model, the increased number of BS antennas where α = K=ρ. At low SNRs (large α) the MMSE ap-
has several consequences. Things that were random before, H
proaches a Matched Filter (MF) pre-coder, i.e., W =H ,
now start to look deterministic. For example, the distribution MF
of the singular values of the channel matrix approaches a and at high SNRs (low α) it approaches the ZF pre-coder.
deterministic function [9]. Another observed property is that C. Low Complexity Pre-Coding
very wide (or tall) matrices tend to be very well conditioned.
A problem with both ZF and MMSE pre-coding is the
III. LINEAR PRE-CODING SCHEMES inverse operation of the K ×K matrix. Since the complexity
The optimal sum rate in the downlink of a MU-MIMO for both linear pre-coders is similar (when α is not large in
system with prefect channel state information (CSI) can be (7)), in this paper we analyse impact of low complexity (ap-
achieved by the interference pre-subtraction coding technique proximations) only on ZF pre-coder. A standard and expensive
called DPC [10]. The optimal sum rate is given as approach would be to compute the exact inverse of the matrix
Z(,HHH)in
C = max log det(I +ρHHPH);
DPC 2 (3) † H H −1 H −1
Tr(P)=1 W =H =H (HH ) =H (Z) : (8)
ZF
However, as the number of BS and MS antennas (M and The modification is described as follows. The scalar multi-
K) increases, the eigenvalues of the matrix Z converges to plication by δ=(M + K) in (12) is represented as a diagonal
a fixed deterministic distribution known as the Marchenko- matrix
Pastur distribution. Now following the analysis in [8], the D = δ I :
MP M+K K
largest and the smallest eigenvalues of Z converge to
Using this notation, (12) is rewritten as
1 2 1 2
λ (Z) −→ 1+√ ; λ (Z) −→ 1−√ ; L
max min
β β −1 X n
Z ≈ (I −D Z) D ; (13)
K MP MP
where (β = M=K), as M and K grows to infinity. By scaling n=0
the Z matrix with a factor β , the eigenvalues are found the accuracy of the approximation,for a given number of terms
1+β
in the region (L), depend on the size of the eigenvalues of (I − DMPZ).
√ Thesmaller their magnitude, the faster the convergence. Given
λ β Z −→ 1+2 β ; this, we want to pre-condition our matrix Z so that it will lead
max 1+β 1+β
to a fast convergence for a finite M and K system.
√ Now, assume that we want to pre-condition it with a diag-
λ β Z −→ 1−2 β : (9)
min 1+β 1+β onal matrix D, with non-zero diagonal entries. In principle,
Hence,theeigenvaluesofI −β=(1+β)Z = I −Z=(M+K) we would like to calculate the eigenvalues of (I − DZ) and
√ K √ K optimize D so that the magnitudes of the eigenvalues are as
lie in the range [−2 β=(1+β);2 β=(1+β)], where I is small as possible. This, however, is a complex and non-trivial
K
an K×K identity matrix. By asymptotically increasing β, the task. We will therefore use Gershgorins circle theorem [12] to
eigenvalues of I −Z=(M+K)lie in the range derive an upper bound of the magnitude of the eigenvalues.
K
" √ √ # By keeping this bound small, by selecting D, we can also
lim −2 β ; 2 β −→ [−0;0]: (10) guarantee that the magnitude of the eigenvalues are small.
β→+∞ 1+β 1+β In this derivation of the ”optimal” D we will assume that the
Hermitian matrix Z = HHH is diagonallydominant,meaning
Therefore, as β grows, the faster is the convergence of that the magnitude of the diagonal elements z are larger than
ii
1 n the sum of the magnitude of the off-diagonal elements in the
lim I − Z ≃0 : (11) P
n→∞ K M+K K same row, zij;i 6= j, namely that |zii| > |zij|. The
i6=j
It is known that if a matrix satisfies (11), its inverse can be largest magnitude of any eigenvalue of (I − DZ) is upper
bounded by
expressed by Neumann series [11] as
L X !
n
X max|λ | ≤ max |1−d z |+d |z | ; (14)
−1 δ δ i i ii i ij
Z ≈ I − Z ; (12) i i
M+K K M+K i6=j
n=0
and under the condition of a diagonally dominant Z, the
with equality when L grown to infinity, and δ < 1 is a smallest upper bound is obtained if d = 1=z . For this
attenuation factor introduced, since for finite M and K the i ii
selection of D we also have that max|λ | < 1, which
i
eigenvalues of channel realizations may lie outside the range i
specified in (9). For a feasible implementation of a matrix guarantees convergence of the Neumann series. Hence, our
inversion using Neumann series the number of iterations (L) final approximation of the inverse of a diagonally dominant
needs to be finite (or small). Z is a matrix D = diag(1=z11;1=z22;::::;1=zkk), the inverse
Theinverseof Z is approximatedby a summation of powers can be expressed using Neumann series as
of a matrix (or matrix multiplications) (12), which has a L
3 −1 X n
Z ≈ (I −DZ) D: (15)
complexity order O((L − 1) · K ). Although the complexity K
order can be equal or higher (depending on L) than computing n=0
the exact inverse (direct inversion, QR based etc), matrix Afast (or accelerated) way to compute the series (15) and
multiplications are preferable in hardware compared to exact (12), up to L = 2P −1 terms, where P is an integer, is to use
inversion. the identity
The convergence of (11) is based on the fact that the
L P−1
eigenvalues lie in the range given by (9) as M and K grows X n Y n
−1 2
Z ≈ (I −DZ) D= (I +(I −DZ) ) D;
asymptotically. However, for practical systems with finite M K
n=0 n=0
and K the eigenvalues may lie outside this range. In addition (16)
to what is described in [8], we introduce one modification of which leads to a numerical complexity proportional to the
the Neumann series inversion. It is based on the fact that the logarithm of the number or terms in the truncated series. In
closer the eigenvalues of our matrix are to 1, the faster the terms of number of matrix multiplications, the brute force
convergence of the series in (12). computation of the inverse using (15) (or (12)) would require
no reviews yet
Please Login to review.