Professional Documents
Culture Documents
Chapter 23
Aspects of OFDM-Based 3G
LTE Terminal Implementation
Wen Xu
Infineon Technologies AG, Germany
Jens Berkmann
Infineon Technologies AG, Germany
Cecilia Carbonelli
Infineon Technologies AG, Germany
Christian Drewes
Infineon Technologies AG, Germany
Axel Huebner
Infineon Technologies AG, Germany
abstract
3GPP standardized an evolved UTRAN (E-UTRAN) within the release 8 Long Term Evolution (LTE)
project. Targets include higher spectral efficiency, lower latency, and higher peak data rate in comparison with previous 3GPP air interfaces. The E-UTRAN air interface is based on OFDMA and MIMO
in downlink and on SCFDMA in uplink. Main challenges for a terminal implementation include an efficient realization of fast and precise synchronization, MIMO channel estimation and equalization, and
a turbo decoder for data rates of up to 75 Mbps per spatial MIMO stream. In this study, the authors
outline the current 3GPP LTE standard and highlight some implementation details of an LTE terminal.
Efficient sample algorithms are presented for key components in the baseband signal processing including synchronization, cell search, channel estimation and equalization, and turbo channel decoder. Their
performances, computational and memory requirements, and relevant implementation challenges are
discussed.
DOI: 10.4018/978-1-61520-674-2.ch024
Copyright 2010, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
1. introduction
The mobile radio network technology family of
the 3GPP (3rd Generation Partnership Project) as
well as its predecessor ETSI (European Telecommunications Standards Institute), including GSM/
EDGE (Global System for Mobile communications/Enhanced Data rate for GSM Evolution)
and UMTS/HSPA (Universal Mobile Telecommunication System/High Speed Packet Access)
technologies, now accounts for nearly 90% of
all mobile subscribers worldwide. The further
increasing demand on high data rates in new
applications such as mobile TV, online gaming,
multimedia streaming, etc., has motivated the
3GPP to work on the long term evolution (LTE)
project since late 2004. Overall target was to select
and specify technology that would keep 3GPPs
technologies at the forefront of mobile wireless
well into the next decade.
Key objectives of the 3GPP LTE, whose radio
access is called Evolved UMTS Terrestrial Radio
Access Network (E-UTRAN), include substantially improved end-user throughputs, sector capacity,
reduced user plane latency, significantly improved
user experience with full mobility, simplified
lower-cost network and reduced User Equipment
(UE) complexity. Currently, first 3GPP LTE specification is being finalized within 3GPP release 8.
Specifically, the physical layer has become quite
stable recently for a first implementation.
The air interface of E-UTRAN is based on
OFDMA (Orthogonal Frequency Division Multiple Access) and MIMO (Multiple-Input Multiple
Output) in downlink (DL) and on SCFDMA
(Single Carrier Frequency Division Multiple Access) in uplink (UL) direction. Main challenges
for a terminal implementation include efficient
realization of the synchronization, channel estimation and equalization, and the turbo decoding
algorithms. We show that for quick and robust
synchronization and cell search, algorithms based
on auto- and cross-correlation provide the best
performance-complexity trade-off. Although the
inner receiver processing, mainly channel estimation and equalization, can nicely and straightforwardly be parallelized due to frequency domain
processing, careful algorithm design is required
to achieve low complexity and high performance.
Due to the high data rate of up to 75 Mbps per
spatial MIMO stream, the turbo decoder design
demands a special consideration. In addition,
flexibility for different MIMO modes, low power
consumption and small silicon area need to be
taken into account in the implementation of most
of the core algorithms (Berkmann, Carbonelli,
Dietrich, Drewes & Xu, 2008).
This chapter is structured as follows: In section 2, we first give an overview on the 3GPP
LTE system and its evolution to LTE-advanced,
especially the physical layer. Then in the following
sections, according to the functional signal flow,
efficient sample algorithms for key components in
the baseband signal processing are presented. The
computational and memory requirements for the
example implementations are evaluated, and the
challenges are highlighted. Specifically, section 3
deals with LTE-relevant synchronization and cell
search, including symbol timing and identification,
frame timing and cell identification based on the
primary and secondary synchronization signals.
In section 4, channel estimation based on the LTE
reference signals is described. The algorithm used
is the 2x 1-dimentional Wiener filtering In section
5, different equalization algorithms such as the
linear MMSE, and the non-linear M-algorithm, the
tree search based fixed sphere decoder (FSD) and
parallel smart candidate adding (PSCA) algorithm
are compared. The LTE-specific turbo coding and
decoding algorithm including the newly specified
turbo interleaver, and its implementation, are
discussed in section 6. Finally, some concluding
remarks are given in section 7.
527
2. overview of lte
and lte-advanced
Historical roots
During the last twenty years the Global System
for Mobile Communications (GSM) evolved into
the most successful cellular radio system. It is
followed by the Universal Mobile Telecommunications System (UMTS), sometimes also referred
to as Wideband Code-Division Multiple-Access
(WCDMA). Both systems have been specified by
the 3rd Generation Partnership Project (3GPP).
Until recently, most of the efforts in evolving
cellular radio systems were spent on enhancing
basic circuit switched voice telephony services.
In the meantime, more focus is put on the respective packet-switched data extensions to GSM and
WCDMA: General Packet Radio Service (GPRS),
Enhanced Data Rates for GSM Evolution (EDGE),
and High Speed Packet Access (HSPA).
Figure 1 shows the evolution of major digital
cellular radio systems without considering first
generation (analogue) cellular radio systems.
Among them only a few were mainstream systems,
including all 3GPP-based systems starting with
GSM. By end of 2008, these 3GPP-based systems
account to almost 90% of all worldwide cellular
radio subscriptions. Operators have either started
or planned to launch commercial services based
528
introduction to lte
Key objectives of LTE include amongst others, see
also 3GPP TR 25.913 (2006-03), improved enduser experience by increased spectral efficiency
and peak and cell-edge data rates. This results in
an overall increased end-user throughput. Some
specific properties of LTE are (3GPP TS 36.101,
2008-09; 3GPP TS 36.211, 2008-09; 3GPP TS
36.306, 2008-5)
529
530
baseband implementation
of an lte terminal
Here, we only consider baseband processing
and omit all analogue components, higher layer
protocols and application processing. From a
baseband signal processing perspective, the
main challenge for a UE implementation is in the
downlink receiver. The receiver can be separated
531
532
3. syncHronization and
cell identification
In cellular systems, synchronization is the very
first task when a UE tries to establish a radio
connection with a network. It is well-known
that OFDM-based systems are very sensitive to
frequency and timing offsets/errors between the
received signal and the local references used for
signal demodulation. The frequency and timing
offsets can be caused by for example oscillator
mismatch at transmitter and receiver, Doppler
effects, multipath propagation, etc. Frequency
offset can destroy orthogonality among subcarriers
and generate inter-channel interference (ICI) and
multiple access interference. Timing offset can
result in severe inter-block interference (IBI). To
avoid a performance loss, frequency and timing
offset must be precisely estimated and adequately
compensated. Specifically in the OFDM-based
LTE system, the whole synchronization is usually
accomplished first in DL and then in UL.
1)
LTE-A is planned to be backwards compatible to LTE. That means that any LTE terminal
can operate in an LTE-A network. The current
study item phase of LTE-A within 3GPP will take
until the end of 2009 (3GPP RP-090067, 2009).
Afterwards the 3GPP work item phase will start
to produce a set of LTE-A specifications. First
specifications are expected to be finalised by
December 2010 within the scope of 3GPP release
10 (3GPP RP-080756, 2008).
2)
533
to the acquired frame structure during the initial DL synchronization. The eNB will then
receive the PRACH and estimate the transmit
timing of the UE. Based on this, the eNB
informs the UE to adjust its transmit timing
(timing advance command). Simultaneously,
the UE identification is performed.
Here, well focus on DL synchronization in
FDD mode. As shown in Figure 3, the coarse
timing and frequency synchronization are initially performed to estimate the begin of the radio
frame (BOF) and the carrier frequency offset
(CFO). The begin of the OFDM symbol (BOS)
needs also be estimated at the same time. Unless
otherwise stated, a time-domain OFDM symbol
includes the CP. The BOS therefore indicates the
beginning of the CP. An overview on the OFDM
synchronization can be found in (Morelli, Kuo
& Pun, 2007).
system model
We start with a perfect synchronized OFDM system with a DFT size N and CP length NCP. Given
the transmitted time domain signal s(n) and a
channel with the overall channel impulse response
(CIR) h(l) (l = 0, 1, , Nch1, Nch = maximum
channel delay spread), the received time domain
baseband signal r(n) in an OFDM symbol, after
removing CP, can be written as
r (n ) =
N ch -1
h(l )s(n - l ) + z (n ),
0 n N -1
R(k ) = H (k )S (k ) + Z (k ),
(2)
with S(k), R(k), H(k) and Z(k) being the N-point
DFT of s(n), r(n), h(n) and z(n), respectively. The
N-point DFT is defined here as
S (k ) = DFTN {s(n )} =
534
N -1
s(n )e
-j
2 pnk
N
n =0
Here we assume a multipath propagation channel with the channel coherence time much larger
than the OFDM symbol duration, which indicates
that the CIR remains constant at least for several
consecutive symbols. To justify this assumption
we take the worst case of the LTE as an example.
The carrier frequency is f0 = 2.69 GHz and the UE
moves with v = 500 km/h (the highest vehicular
speed considered in LTE), then the Doppler spread
of the channel equals fD = vf0/c 1.25 kHz (c = speed
of light). The channel coherence time becomes t
= 1/fD 0.8 ms which is much larger than Tsymb =
0.07 ms, the approximate OFDM symbol duration
in LTE. For lower mobile speeds, the coherence
time becomes proportionally larger.
Now consider the received signal has a timing
offset in multiples of the sampling period TS,2
and a normalized carrier frequency offset (CFO)
= NTSfd (fd = CFO in Hz), compared with the local
references. The received signal becomes
r (n ) = e
2 pn e N ch -1
N
(3)
l =0
(1)
0 k N -1
(4)
2)
3)
CP length.
Extended CP for f = 7.5 kHz: Each slot
has Ksymb = 3 symbols and each symbol has
an equal symbol length Nsymb = 2N + 2NeCP
where the CP length is 2NeCP.
1
W
W -1
r (n + m)r (n + P + m)
*
m =0
(5)
C AC (n )
(6)
E 0 (n )E1 (n )
with
E 0 (n ) =
1
W
W -1
E1 (n ) =
1
W
W -1
r (n + m)
(7)
m =0
r (n + P + m)
m =0
(8)
535
'
C AC
(n ) =
Ksymb -1 K
slot
-1
C
l =0
k =0
E 0' (n ) =
1
Ksymb Kslot
Ksymb -1 Kslot -1
l =0
k =0
e =
1
C AC (q)
2p
(10)
E 0 (n + lN symb + kN slot )
E1' (n ) =
1
Ksymb Kslot
Ksymb -1 Kslot -1
l =0
k =0
E1 (n + lN symb + kN slot )
(14)
(11)
536
(n + lN symb + kN slot )
(13)
AC
(12)
(9)
1
Ksymb Kslot
'
rAC
(n ) =
'
C AC
(n )
E 0' (n )E1' (n )
(15)
(16)
(1)
with N ID
= 0, 1, , 167 being the cell group
(2 )
ID and N ID = 0, 1, 2 being the physical layer
ID (sometimes called sector ID) within a cell
group.
The information on the physical layer ID
(2 )
(1)
N ID within a group and the group ID N ID
are
carried through two DL SCH signals, the primary
synchronization channel (P-SCH) and secondary
synchronization channel (S-SCH) signal, respectively. Both P-SCH and S-SCH occupy 72 center
subcarriers, where in LTE no signal is transmitted
on the DC subcarrier (see Figure 4). The P-SCH
is imbedded in the last OFDM symbol and S-SCH
in the second last OFDM symbol, of subframe 0
and 5 in each frame. For convenience, we divide
the frame into two: the first half frame consisting of subframe 0 4 and the second half frame
consisting of subframe 5 9. Each half frame has
a duration of 5 ms and exactly one P-SCH in it.
From the position of the P-SCH the begin of the
half frame is known, but not the BOF. In LTE,
cell
N ID
can be computed in two steps:
537
-j
puk (k +1)
N ZC
0 k < N ZC
(17)
538
plk
N
Due to many attractive properties, the ZC sequence is used in LTE to generate the DL P-SCH,
UL RACH preample as well as demodulation/
sounding reference signals. For P-SCH signals in
frequency domain, NZC = 63 is chosen. In order to
avoid high interference from local oscillator leakage, du(k) at DC subcarrier K1 = (NZC 1)/2= 31 is
set to zero. In this way, a P-SCH signal has actually
62 non-zero elements in frequency domain. In
addition, 10 zero-valued subcarriers are inserted
at the lower and higher subcarrier boundaries of
the P-SCH, i.e. Su(k) = 0 for k = 5, , 1 and
k = 63, , 67, as guard regions. It can be shown
that the ZC sequence is symmetric with respect to
k = K1, the index of the DC subcarrier, i.e.
du (k ) = du (N ZC - k )
(18)
k = 1, 2,..., K1
otherwiese
(19)
Then we have
Su (k ) = Su (-k ) = Su (N - k )
(20)
The latter equation results from the periodicity of Su(k) and su(n) when an N-point DFT is
applied. Consider
su (n ) = IDFTN {Su (k )} =
1
N
K1
k =-K1
Su (k )e
2 pnk
N
(21)
It is easy to verify
su (n ) = su (-n ) = su (N - n )
(22)
Figure 5. Frequency and time domain P-SCH signals (N=128, u=34), with ampl = amplitude, imag
= imaginary part, real = real part
N n =0
N
mean. It should be noticed that the P-SCH signal
is not a ZC sequence any more. For different
u, the P-SCH signals are in general no longer
orthogonal to each other. However, they can still
be considered as pseudo noise with many good
properties, including
small power variation in both time and frequency domains, an attractive property to
achieve a low peak-to-average power ratio
(PAR);
very good auto-correlation property in both
time and frequency domain, an ideal property for synchronization.
C XC (n, u )
E 0 (u )E1 (n )
(23)
where
C XC (n, u ) =
1
W
W -1
s (m)r (n + m)
m =0
*
u
(24)
539
Figure 6. Auto-correlation of P-SCH signal with its periodically extended duplicate in frequency and
time domain (N=128, u=34)
E 0 (u ) =
1
W
W -1
E1 (n ) =
1
W
W -1
m =0
(m )
u
(25)
r (n + m)
m =0
(26)
n ,u
rXC (n, u ) =
C XC (n, u )
E 0 (u )E1 (n )
(27)
Similar to the P-SCH, S-SCH signal has 62 nonzero elements in frequency domain, with the DC
subcarrier and 10 subcarriers at the lower and upper
Figure 7. Reverse correlation of the P-SCH signal in frequency and time domain (N=128, u=34)
540
(28)
(2 )
d (2k ) = s (k + m 0 )c k + N ID
(1)
q = N ID
30
(33)
(1)
The relationship between N ID
and the index
pair m0 and m1 are also listed as a look-up table in the
specification (3GPP TS 36.211, 2008-09). It can be
verified that this ensures 0 m 0 < m1 30 for
(1)
0 N ID
167 . In cell identification procedure,
we usually need to first determine the shifts m0
(1)
, say, by a simple table
and m1, then obtain N ID
look-up.
The frequency domain S-SCH sequence in
subframe 5 can in fact be obtained by simply exchanging the shift index m0 with m1 in the above
equations, i.e.
(2 )
d (2k ) = s (k + m1 )c k + N ID
) (
d (2k + 1) = s (k + m1 ) z k + (m 0 mod 8) c k + N
(2 )
ID
+3
) (
(34)
(29)
where s (k ) , z (k ) , c (k ) {1} are the predefined length-31 binary m-sequences. For
the index k outside of the range 0 k 30, the
(
)
z (k ) = z (k mod 31) and c (k ) = c (k mod 31) .
(30)
with
(1)
m = N ID
+ q(q + 1) 2
(31)
N (1) + q (q + 1)
ID
q=
30
(32)
(2 )
+3
d (2k + 1) = s (k + m 0 ) z k + (m1 mod 8) c k + N ID
541
(2 )
d0 (k ) = d (2k ) c k + N ID
cyclic
cross-correlation
Compute
and
the
30
C0 (m ) = s (k + m )d0 (k )
for m =
k =0
) (
0 mod 8) c k + N
d1 (k ) = d (2k + 1)z k + (m
and
the
cyclic
(2 )
ID
+3
cross-correlation
30
2
1 = arg max C1 (m ) .
be estimated as m
m
0 < m
1 , the S-SCH is in subframe
When m
0
0. Otherwise, exchange the value of m
1 , and the S-SCH is in subframe 5.
and m
(1)
by
Determine the BOF and determine N ID
a table look-up.
542
(35)
E 0 (n )E1 (n )
with
C RC (n )
rRC (n ) =
C RC (n ) =
E 0 (n ) =
1
W
1
W
W -1
r (n + m)r (n + N - m)
*
m =1
W -1
r (n + m)
m =1
(36)
(37)
E1 (n ) =
1
W
W -1
r (n + N - m)
m =1
(38)
(39)
WE1 (n + 1) = WE1 (n ) - r (n ) + r (n +W )
(40)
We take the LTE system with 20 MHz bandwidth (30.72 MHz sampling rate) and W = N =
2408 as an example. Since E1(n) and |r(n) |2 have
been calculated in previous stages, a low computational complexity of 330.72106 93106 MAC
(multiply-accumulate) operations4 per second are
required, assuming for given n, 3 real MAC are
needed to compute E1(n+1) in (40).
For the cross-correlation approach, 4W real
MAC are required to compute CXC(n, u) for each
n and each u. Then, a total of 4W30.72106
252109 MAC per second are required for each u.
For three different u, the computational complex-
543
WC AC (n + 1) = WC AC (n ) - r * (n )r (n + P ) + r * (n +W )r (n + P +W )
Efficient data detection in OFDM systems operating in time and frequency selective channels
requires accurate channel estimation. In LTE,
like in many OFDM-based systems, in order to
facilitate channel estimation, known symbols,
called pilots, are inserted at specific locations in
the time-frequency grid and channel estimation
is performed by interpolation. The resulting twodimensional pilot pattern is typically irregular, as
shown in Figure 8, for the LTE case where the
pilot spacing in the frequency direction equals
six OFDM symbols, while in the time direction
two OFDM symbols per slot (referred to as reference symbols) containing pilots are available, at
a distance of 4 and 3 OFDM symbols from one
another 5(3GPP TS 36.211, 2008-09).
Several pilot-aided channel-estimation
schemes have been proposed for OFDM applications (Edfors, Sandell, van de Beek, Wilson
& Borjesson, 1998; Le, 2000; Li, Cimini Jr. &
Sollenberger, 1998; Negi & Cioffi, 1998; Rinne
& Renfors, 1996). Some low complexity approaches (Rinne & Renfors, 1996) resort to a
simple piecewise-constant and piecewise-linear
interpolations between pilots. They require no
a priori knowledge on the channel statistics but
need a large number of pilots to achieve good
performance. Other schemes are based on the
frequency-domain linear minimum mean squared
(41)
Notice that CAC(n) and r*(n)r(n+P) have been
calculated before, therefore, computation of
CAC(n+1) will need only a few MAC, say 6 real
MAC operations. For the 30.72 MHz sampling
rate, a total of 630.72106 184106 real MAC per
second are required. Such a complexity requirement can easily be fulfilled nowadays.
Our simulations have shown that among the
methods investigated, the P-SCH based crosscorrelation method, mainly because of the use
of noise-free replica in correlation, provides the
highest accuracy in timing estimation. By properly averaging over more symbols, the CP based
auto-correlation can not only estimate the CFO
with high accuracy, but also achieve a robust BOS
estimate. In practice, combined or mixed methods
based on auto- and cross-correlation are employed.
They usually provide a good compromise in terms
of performance and computational requirements.
In addition, over-sampling of the received data
may significantly improve the performance of
the synchronization and cell search, but demand
more computational efforts.
Notice that initial acquisition of timing/frequency offset, cell ID, etc. is mainly performed
during call setup, partly during handover, cell (re)
544
4. cHannel estimation
n = 0, 1, , N - 1,
l = 0, 1, , L - 1
(42)
where x(n,1), H(n,1) and z(n,1) denote the transmitted symbol with energy (per symbol) Es, the
channel transfer function sample and the additive
545
{n, l }
(43)
546
n F
(44)
Time direction:
wlT
= w l (0), , w l (N t - 1) = rlT Rt-1
t
t
t
t
l T
(45)
where ()T and ()-1 denote the vector transpose and
the matrix inverse operation, respectively. The elements of the cross- and auto-correlation matrices
in (44)-(45) are given by (assuming uniform and
symmetric Doppler and delay power spectra)
rn (i ) = si(2pt max DF (n - i ))
i = 0, 1, , N f - 1
N0
Es
d(i - j )
i, j = 0, 1, , N f - 1
(46)
rl (i ) = si(2p fmaxTsymb (l - i ))
t
i = 0, 1, , N t - 1
N0
Es
d(i - j )
i, j = 0, 1, , N t - 1
(47)
where si is the sinc function, while F and Tsymb
denote the subcarrier spacing and the symbol
duration, respectively.
Note that the indices n and l in equations (44)(47) account for the fact that 1D Wiener filtering
amounts to a window sliding operation along the
frequency or time axis. Also, F and T denote the
547
Table 1. Limit values for Doppler bandwidth and delay spread in a typical LTE system
LTE
1) Time
2) Frequency
1) Frequency
2) Time
Bd,lim = 1/(2DT)
1/(26Tsymb) = 1167 Hz
1/(24Tsymb) = 1745 Hz
Td,lim = 1/(DF)
1/(3f) = 22.2 s
1/(6f) = 11.1 s
Table 2. Limit values for Doppler bandwidth and delay spread in a typical DVB-T/H system (ETSI EN
300 744, v.1.5.1, 2004-11)
DVB-T/H
1) Time
2) Frequency
1) Frequency
2) Time
Bd,lim = 1/(2DT)
1/(24Tsymb) = 112 Hz
1/(2Tsymb) = 446 Hz
Td,lim = 1/(DF)
1/(3f) = 298 s
1/(12f) = 74 s
548
Figure 10. Uncoded BER performance of QPSK 2x2 system: robust and adaptive 2x1D Wiener channel
estimator
(48)
We note that Nt = 4 implies an initial buffering
(and delay) of at least 4 OFDM symbols. In other
terms, at each reference symbol, we first perform
frequency interpolation. Then, using the 4 most
recent frequency estimates, we interpolate the
previous 2 or 3 OFDM symbols, depending on
whether we are in the first or second reference
symbol of a slot.
Alternative choices are possible for the positioning and length of the Wiener filters. For
549
Figure 11. Throughput performance of QPSK 1x2 system: robust and adaptive 2x1D Wiener channel
estimator
550
mimo detection
In the following, we consider an NT NR MIMO
system, i.e., a system with NT transmit and NR
receive antennas, where NT NR. The input
signal to this system is given by the vector
T
s = s 0 , ..., sN -1 of NT symbols, each of
T
(49)
Pr (cn ,k = +1 | r)
Pr (cn ,k = -1 | r)
(50)
551
1/2
min
zz
c C n ,1k
Hs
- m in1
c C n ,k
1/2
zz
Hs
(51)
The LLRs in (51) can be approximately represented as the difference between the metrics
L cn+,k1 and L cn-,1k :
( )
( )
( ( )) -
L (cn ,k | r) min
L cn+,k1
+1
c C n ,k
( ( ))
min L cn-,1k
c C n-,1k
(52)
which correspond to the hypotheses that cn,k = +1
and cn,k = 1, respectively, are transmitted. Cn+1,k
N K -1
are the sets of 2 T
bit vectors c, havand C-1
n ,k
ing cn,k = +1 and cn,k = 1. Then, a so-called ML
hypothesis is defined as
sML
arg min
s S
1/2
zz
Hs
(53)
min
s S
1/2
zz
Hs
(54)
min
ML
s Sn ,k
1/2
zz
Hs
(55)
in SnML
is called counter
where the element cnML
,k
,k
hypothesis at symbol position n and bit position
552
n ,k
L (cn ,k ) ML
Ln ,k - LML ,
cnML
= +1
,k
cnML
= -1
,k
(56)
(57)
553
554
6. cHannel codinG
and decodinG
While the control channel data is protected by a
64-state tail-biting convolutional code, LTE (3GPP
TS 36.212, 2008-09) ultimately adopted the turbo
channel coding scheme from 3GPP HSPA (3GPP
TS 25.212, 2008-09) for the user data despite the
numerous proposals that voted for low density
parity check (LDPC) coding (Gallager, 1962)
555
556
The turbo code (Berrou, Glavieux & Thitimajshima, 1993) used for LTE (like HSPA)
is composed of a parallel concatenation of 2
identical binary rate-1/2 systematic recursive
8-state convolutional encoders whose inputs
are linked by an interleaver (the turbo-internal
interleaver). The systematic part, being the raw
data before encoding, is only transmitted once
thereby creating a rate-1/3 turbo mother code
(before rate-matching). The decoding of turbo
codes is done in an iterative fashion as indicated
in Figure 18. Each of the two received data sequences (L(X k ), L(Yk1 )) and (L(I (X k )), L(Yk2 ))
is decoded with a probabilistic soft-in-soft-out
(SISO) decoder operating on the respective decoding trellis where L(X k ), L(Yk1 ), L(Yk2 ), denote
the LLR of the k-th systematic, parity1, and parity2 bit, respectively, and where I(.) denotes the
turbo internal interleaver function. A particular
part of the output of the SISO module, the socalled extrinsic information Lext(Xk), is passed on
to the next SISO module as a priori information.
After a certain number of iterations decoding is
stopped and a hard decision is made on the final
soft output.
Optimally, the SISO decoding module computes the maximum a posteriori (MAP) LLR
Lapp(Xk) = logP(Xk = 0|R)/P(Xk = 0|R) for each
systematic bit given the entire received code
sequence R. The trellis based implementation
in the form of the well-known BCJR-algorithm
(Bahl, Cocke, Jelink & Raviv 1974) operating in
the log-domain (log-MAP or forward-backward
algorithm) (Robertson, Hoeher & Villebrun, 1997)
makes use of forward and backward recursions
running on the decoding trellis and storage of
partial path metrics associated with trellis states.
One particularly important approximation thereof
is known as the max-log-MAP algorithm (Rob-
Figure 17. Virtual buffer rate matching and incremental redundancy. Limited buffer rate matching
left.
557
max
o
S
k 1
(sk 1 )
k 1
(sk 1 )
k
(sk 1, sk )
(sk 1, sk )
k
(sk )
(sk )
(58)
(59)
(60)
(61)
with X k meaning logical inversion of the bit. One
nice property of the max-log-MAP algorithm is
that all operations to be done are just additions
and comparisons without the need for further table
lookups as would be for the log-MAP. Moreover,
the performance gap when using max-log-MAP
instead of log-MAP can be closed up to a very
few tenth of dBs if the extrinsic information is
properly scaled down before passing it on to the
subsequent SISO module (Claussen, Karimi &
Mulgrew, 2005).
An exact implementation of (58) would be
unsuitable for a practical implementation for two
reasons. Firstly, the forward (or backward) recursion had to be stored over the entire decoding trellis
and, secondly the decoding throughput would be
limited by 1 decoded bit per 2 time units9. This
is due to the fact that the exact implementation
of (58) requires to execute a forward recursion
from the very left corner of the trellis until the
558
Figure 19. (a) Exact implementation of (max-)log-MAP algorithm, (b) implementation with M subblock
decoders
I (i + jW ) / W I (i + kW ) / W ,
(62)
I (k ) = ak + bk 2 mod K
(63)
559
10296
10276
250368
51024
51024
1237248
102048
75376
1237248
150752
75376
1827072
302752
151376
3667200
560
7. conclusion
In this chapter, we overviewed the current 3GPP
LTE standard (release 8) and its evolution to the
LTE-A (release 9), and highlighted first implementation details in an LTE terminal. The sample
core functional algorithms of the baseband signal
processing, especially the synchronization, cell
search, channel estimation and equalization, and
the turbo channel decoder, were described and
analysed. Their performances, computational
efforts, memory requirements, and relevant implementation challenges were discussed. As a result,
LTE terminals can be implemented with currently
acKnowledGment
The authors wish to gratefully acknowledge the
support of their colleagues at Infineon Technologies. Parts of this work are carried out within the
scope of the EUREKA MEDEA+ project MIMOWA, which is partly funded by the German
Federal Ministry of Education and Research.
references
3GPP R1-091580 (2009). LTE-performance and
IMT-Advanced requirements. Presented at 3GPP
TSG RAN WG1 Meeting #56bis, Seoul, Korea.
3GPP RP-080756. (2008). 3GPP presentation on
the LTE-Advanced as an IMT-Advanced technology solution. Presented at the 41st 3GPP TSG RAN
meeting, Kobe, Japan.
3GPP RP-090067. (2009). Status report for WI to
TSG RAN; work item name: further advancements
for E-UTRA (LTE-Advanced). Presented at the 43rd
3GPP TSG RAN meeting, Biarritz, France.
3GPP TR 25.913, V7.3.0 (2006-03). Requirements
for evolved UTRA (E-UTRA) and evolved UTRAN
(E-UTRAN) (release 7). Technical report, 3GPP,
TSG RAN.
3GPP TR 36.814, V0.4.1 (2009-02). Further advancements for E-UTRA; physical Layer aspects
(release 9). Technical report, 3GPP TSG RAN.
3GPP TR 36.913, V8.0.1 (2009-03). Requirements for Further Advancements for E-UTRA
(LTE-Advanced) (release 8). Technical report,
3GPP TSG RAN.
3GPP TS 25.212, V8.4.0 (2008-12). Multiplexing
and channel coding (FDD) (release 8). Technical
specification, 3GPP, TSG RAN.
3GPP TS 36.101, V8.3.0 (2008-09). Evolved
universal terrestrial radio access (E-UTRA);
user equipment (UE) radio transmission and
reception (release 8). Technical specification,
3GPP, TSG RAN.
3GPP TS 36.211, V8.4.0 (2008-09). Evolved universal terrestrial radio access (E-UTRA); physical
channels and modulation (release 8). Technical
specification, 3GPP, TSG RAN.
3GPP TS 36.212, V8.4.0 (2008-09). Evolved
universal terrestrial radio access (E-UTRA);
multiplexing and channel coding (release 8).
Technical specification, 3GPP, TSG RAN.
3GPP TS 36.213, V8.4.0 (2008-09). Evolved
universal terrestrial radio access (E-UTRA);
physical layer procedures (release 8). Technical
specification, 3GPP, TSG RAN.
3GPP TS 36.300, V8.6.0 (2008-09). Evolved
universal terrestrial radio access (E-UTRA) and
evolved universal terrestrial radio access network
(E-UTRAN); overall description; stage 2 (release
8). Technical specification, 3GPP, TSG RAN.
3GPP TS 36.306, V8.2.0 (2008-05). Evolved
universal terrestrial radio access (E-UTRA); user
equipment (UE) radio access capabilities (release
8). Technical specification, 3GPP, TSG RAN.
Alamouti, S. M. (1998). A simple transmit diversity technique for wireless communications. IEEE
Journal on Selected Areas in Communications,
16(8), 14511458. doi:10.1109/49.730453
561
562
563
endnotes
564