Professional Documents
Culture Documents
4, MAY 2007
Abstract— Video broadcast and Mobile TV have received over WiMAX, many challenging issues need to be addressed.
significant interests from both academia and industry recently. For example, WiMAX currently defines protocols only in
The emerging Mobile WiMAX (802.16e) is capable of providing physical layer (PHY) and media access control layer (MAC).
high data rate and flexible Quality of Service (QoS) mechanisms,
making the support of Mobile TV very attractive. However, Can a simple end-to-end video broadcast solution (such
how to efficiently deliver video broadcast over WiMAX is not as H.264/RTP/UDP/IP packet transmission over WiMAX
straightforward, especially in the multi-BS mode. The multi- MAC/PHY, the so-called IP data cast) work efficiently? Is the
BS mode requires multiple BSs to be synchronized in the video quality smooth during movement and handoff? If the
transmission of common multicast/broadcast data. In this paper, answer is no, what is the best solution? What are the key de-
we first identify the key design issues for video broadcast over
WiMAX in the multi-BS mode. Then, we present an end-to-end sign issues in such a solution? How many mobile TV channels
solution which fully addresses key issues such as synchronization, (e.g., H.264/AVC [14], 320 x 240 pixels, and 30fps) can be
energy efficiency and robust video quality. Moreover, we propose supported with the required quality? What about the coverage?
a methodology to optimize the coverage, the spectrum efficiency How is the energy efficiency? How long does it take to switch
and the video quality. Results show that our proposed scheme video channels? Can we support mobile TV broadcast, two-
can significantly improve the coverage and spectrum efficiency
while satisfying video quality requirements. way voice and data applications simultaneously? To answer
all these questions, we need to clearly understand the video
Index Terms— Mobile TV, WiMAX, macro-diversity, coverage, traffic (like H.264/AVC), the wireless link budget and wireless
spectrum efficiency, video quality
channel characteristics. We need to know the packet error rate
given packet size, modulation coding scheme and wireless
I. I NTRODUCTION channel condition. We also need to study the relationship
between the packet error rate and the effective video frame
M OBILE WiMAX has made major strides over the past
year and is rapidly proving itself as a leading solution
for broadband wireless services. The Mobile WiMAX Air
rate (the average number of video frames per second to be
correctly decoded).
Interface is based on the IEEE 802.16-2004 [1] Air Interface The purpose of this paper is to address these questions and
Standard and the IEEE 802.16e Mobile Amendment [2]. Mo- design a viable end-to-end solution, which may be the first
bile WiMAX Air Interface adopts the Orthogonal Frequency attempt. We also identify key design parameters and present a
Division Multiple Access (OFDMA) for improved multi-path methodology to optimize spectrum efficiency (in terms of the
performance in the non-line-of-sight (NLoS) environments. number of video channels provided), video quality (in terms
The inclusion of SIMO/MIMO antenna techniques along with of effective frame rate) and coverage area. To the best of our
flexible OFDMA sub-channelization schemes, advanced cod- knowledge, such a study has not been thoroughly conducted
ing and modulation all enable the Mobile WiMAX technology in the literature [5]–[11].
to support high data rates. In addition, OFDMA subchannel- The rest of this paper is organized as follows. In Section II,
ization and MAP (a header part of OFDMA frame used to we describe the preliminaries of WiMAX multicast/broadcast
indicate the control information of current frame, like the service (MBS) and the issues in a baseline system. In Section
allocation of subchannel and slot) based signaling schemes III, we present our proposed end-to-end solution for MBS
provide a flexible mechanism for optimal scheduling of space, over multi-BS (Multiple Base Stations). Then, we develop a
frequency and time resources over the air interface on a frame- new methodology to optimize key objectives such as spectrum
by-frame basis. efficiency, coverage and video quality in Section IV. Section
The high data rate and QoS assurance provided by WiMAX V presents our results to compare the performance of our
make it commercially viable to support multimedia applica- proposed scheme with that of the baseline schemes. Finally,
tions, such as video telephony, video gaming, and mobile we conclude the paper in Section VI.
TV broadcasting. However, to provide mobile TV services
II. W I MAX BASICS AND MBS S UPPORT [1]–[4]
Manuscript received May 15, 2006; revised December 1, 2006. This work
was supported in part by the NSF Career Award ANI-0093241 and under A. OFDMA frame structure
NSF grant DBI-0529012.
Janfeng Wang is with Philips Research North America (e-mail: jian- The IEEE 802.16e PHY supports TDD, FDD, and Half-
feng.wang@philips.com). Duplex FDD operation. Fig. 1 illustrates the OFDM frame
Muthaiah Venkatachalam is with Intel Corporation (e-mail: mutha- structure for a Time Division Duplex (TDD) implementation.
iah.venkatachalam@intel.com).
Yuguang Fang is with the University of Florida (e-mail: fang@ece.ufl.edu). Each frame is divided into DL and UL sub-frames separated
Digital Object Identifier 10.1109/JSAC.2007.070508. by Transmit/Receive and Receive/Transmit Transition Gaps
0733-8716/07/$25.00
c 2007 IEEE
WANG et al.: SYSTEM ARCHITECTURE AND CROSS-LAYER OPTIMIZATION OF VIDEO BROADCAST OVER WIMAX 713
OFDM s ym bo ls
MBS Contents
F CH Controlle
r
P AM S BM
Server
MB S OFDM A zo ne AAA
s l e nn ah cbu S l a c ig o L
P AM - L D
DL B ur st 4
Core Network
el bm a e r P
DL B ur st
1 ...
DL B ur s t 5
DL B ur s t 6
P AM - L U
DL B ur s t 2
H.264/AVC Compliant Communication Entity to connect Wired PHY Medium Wireless PHY Medium
encoder/transcoder MBS Controller and BSs
H.264/AVC Compliant Communication Entity to connect
encoder/transcoder MBS Controller and BSs
Fig. 3. Baseline MBS system over WiMAX
Fig. 4. Proposed end-to-end MBS solution over WiMAX
Padding
Encrypted SDU M-1
Encrypted SDU 0
Encrypted SDU 1
Parity Section 0
Parity Section 1
MBS_MAC_PDU
Construction Ö Ö
WiMAX RS-Section
MBS_DATA_IE n
PHY/MAC 5
...
Burst Scheduling
Subchannels
MBS_MAC_PDU n
OFDMA MAC
OFDMA Region Allocation Region RS-Section CRC 5
Header
Allocation
MBS MAP
6
Frame
No.
MBS_DATA_IE MBS_MAC_PDU 6
Buffering
MBS-enhanced transport-sublayer PDU
Note: AU represents access unit [13]. The decoding of each access unit results in one decoded picture.
MBS Region
Fig. 5. OFDMA region mapping for MBS MAC PDU and MBS DATA IE H.264/AVC Standard Decoder
C. Burst scheduling and OFDMA region allocation instance, (I0 P2 p1 P4 p3 P6 p5 P8 p7 p9 ). Multiple access units are
Burst-based transmission is proposed in our system. The encapsulated into one RTP packet. By using interleaved mode
scheduling of burst transmission among multiple video chan- and multi-time aggregation defined in RFC3984 [15], we allow
nels follows the round-robin algorithm, as shown in Fig. 8. one RTP packet contains the access units of the whole base-
The burst size is chosen to fully utilize frequency diversity layer GOP subsequence and another RTP packet contains the
and time diversity while balancing energy efficiency and video access units of the enhanced-layer GOP subsequence.
channel switch delay. To be explained later, the base layer GOP subsequence
Once a burst is scheduled for transmission, OFDMA is applied with more robust FEC coding (RS outer coding
frame number and OFDMA data region to transmit each and CTC inner coding) than that of the enhanced layer
MBS MAC PDU is assigned. The OFDMA data region is GOP subsequence. In this way, most of users can correctly
specified by OFDMA symbol offset, subchannel offset, the decode the base-layer all the time. In addition, access units
number of OFDMA symbols and the number of subchannels. in enhanced-layer may be dropped if the virtual buffer, which
One MBS DATA IE for each MBS MAC PDU is con- contains all the multiplexed MBS-enhanced transport-sublayer
structed to indicate its associated multi-cast CID, selected PDUs, is larger than certain threshold (in terms of transmission
MCS, and the OFDMA data region. MBS DATA IE also time). The dropping policy is set up in such way that video
includes the redirection to the next MBS MAP which contains quality can be kept smooth over time and fair across video
MBS DATA IE(s) belonging to the same Multicast CID. Via channels as much as possible.
redirection, WiMAX PHY/MAC at MSS side knows the frame In our system, we target H.264/AVC for its technical matu-
number of next OFDMA frame containing MBS MAC PDUs rity and commercial availability. However, there is no reason
which it intends to decode. Since the MBS client knows the we exclude other video coding schemes in which scalability
end of burst and the beginning of the next burst, MBS can (temporal scalability, spatial scalability, or SNR scalability)
save power significantly by putting transceiver into sleep/idle can be exploited. For example, Multiple Description Coding
mode before next burst starts. (MDC) scheme could be applied here in a way that one
description entity is treated as the base layer and another
description entity is treated as the enhanced layer.
D. Channel switching
When MBS client intends to switch to another video F. RS coding and decoding
channel, it first gets the corresponding new Multicast CID
As we assume, one RTP packet contains multiple access
via mapping from video channel to multicast CID. Then
units, i.e., encoding video pictures. After encryption, we apply
WiMAX PHY/MAC checks the MBS MAP in the new
RS outer coding to each RTP packet to increase the reliability
OFDMA frame, locates the MBS DATA IE containing the
of transmission over WiMAX. More robust RS FEC is used
new multicast CID and starts decoding the corresponding
for RTP packet containing base-layer video subsequence than
MBS MAC PDU. In the meantime, WiMAX PHY/MAC stops
RTP packet containing enhanced-layer video subsequence.
decoding the MBS MAC PDUs belonging to the old video
As shown in Fig. 6, each RTP payload is fragmented into
channel. Via MBS MAP redirection, WiMAX PHY/MAC
fixed-size SDUs. Each SDU is encrypted thereafter. Assume
knows the frame number of next frame containing interested
the number of encrypted SDUs is M. N-M parity sections are
MBS MAC PDUs.
appended. For convenience, we represent the super RS block
Since MBS client can locate and decode only the packets on
as RS(N, M) , where N is RS block size. N and M may
the video channel currently being watched, no power will be
vary GOP by GOP. By adjusting N, we can apply unequal
wasted for decoding unwanted video packets. This is another
protection to base-layer GOP subsequences and enhanced-
feature for optimizing energy efficiency in our system.
layer GOP subsequences. (N, M) represents RS code book
Define Ti as the transmission time of one GOP sequence
index, which will be transmitted with RS super block for
for video
Kchannel i. The average channel switching time is decoding.
Tcs ≈ i=1 Ti /2, where K is the total number of supporting
To be discussed next, both SDUs and parity sections are
video channels.
called RS section. Cyclic redundancy check (CRC) is gener-
ated for each RS section such that the RS decoder at MBS
E. GOP (Group of Pictures) structure and RTP multi-time client can tell whether the RS-section has error, thus increasing
aggregation error coding efficiency. Note that if 2s+r < N −M (s errors, r
We exploit temporal scalability available in H.264/AVC. erasures), then the original transmitted code word will always
We divide each GOP (e.g., I0 p1 P2 p3 P4 p5 P6 p7 P8 p9 1 ) into be recovered. An erasure occurs when the position of an erred
one base-layer GOP sub-sequence (e.g., I0 P2 P4 P6 P8 ) and symbol is known. A decoder can correct up to (N − M )/2
one enhanced-layer GOP subsequence (e.g., p1 p3 p5 p7 p9 ). The errors or up to N − M erasures.
encoding of one video picture (or frame) results in one
access unit. Correspondingly, the decoding of one access unit G. MBS MAC PDU construction
results in one complete picture. The decoding sequence is, for Each MBS MAC PDU contains 10-Byte MAC header, L-
1 Picture I, i.e., instantaneous decoding refresh (IDR) picture, is coded using
Byte RS-section and 4-Byte CRC. By checking CRC, the
intra coding. Pictures P and p are non IDR pictures using inter prediction MBS client can determine which RS-section has error. MAC
coding. header contains compressed RTP header (which includes RTP
WANG et al.: SYSTEM ARCHITECTURE AND CROSS-LAYER OPTIMIZATION OF VIDEO BROADCAST OVER WIMAX 717
sequence number and RTP timestamp), the type of RS-section OFDMA region
(data or parity), the sequence number of RS-section, the size allocation
H. Synchronization across multi-BS and macro-diversity registered MBS service via one BS in the MBS geograph-
ical zone, it should correctly receive video broadcast ser-
As we have known so far, we allocate schedule-to- vice throughout the same MBS geographical zone without
transmit OFDMA frame number and OFDMA region to each doing re-configuration and regular handoff when the MSS
MBS MAC DPU at MBS server side. WiMAX PHY/MAC at moves. Moreover, multicast and broadcast service flows are
BSs side just follows the same schedule made by MBS server maintained regardless which mode the MSS is operating at.
and transmits MBS MAC DPU copies at pre-determined By running idle mode and/or putting MSS into scheduled
OFDMA frame and data region with the same PHY channel sleep mode (during off-burst period), MSS can greatly reduce
coding. In other words, same encrypted video content can be the power consumption by removing unnecessary control
transmitted across multiple BSs using the same transmission signaling and not decoding the unwanted video content.
mechanism (symbol, subchannel, modulation, and etc.) at the In case MSS migrates to BS advertising another
same time. In this way, macro-diversity can be achieved and MBS ZONE ID, i.e., to another MBS geographical zone, it is
receiving performance can be enhanced. expected to register at that BS for further reception of MBS
We notice that the delays to transmit video PDUs from MBS content.
server to BSs can vary from BS to another. So the copies of
the same PDU may arrive at BSs in different time. However, IV. J OINT O PTIMIZATION OF V IDEO Q UALITY, C OVERAGE
this should not be a problem because we allow long enough AND S PECTRAL E FFICIENCY
delay guard such that all the PDU copies can arrive at BSs well
In this section, we identify the key design parameters and
before schedule-to-transmit time. In case some PDU copy does develop a methodology to jointly optimize video quality,
not meet the schedule-to-transmit time, it should be dropped
coverage and spectral efficiency, which conflicts with each
at BS.
other in nature.
I. MBS registration, handoff and low-power mode operation A. Optimization problem formulation
We use the effective frame rate (EFR) to represent video
When an MSS is being registered at BS for receiving
quality. The effective frame rate is the correctly decoding
multicast and broadcast services, it shall initiate dynamic
frame rate per video channel at the application layer. Denote
service addition (DSA) procedure with respect to multicast
Fb as the base-layer video frame rate and Fe as the enhanced-
and broadcast connections to inform the BS that the MSS
layer video frame rate (after shaping). Fe is adjusted to meet
is a consumer of certain multicast/broadcast services. Such
the total bandwidth requirements. Denote GERb and GERe
knowledge may be used to initiate bi-directional upper layers
as the base layer GOP subsequence decoding error rate and
communication between the MSS and the network for the
the enhanced layer GOP subsequence decoding error rate,
purpose of configuration of multicast/broadcast service.
respectively. The effective frame rate can be lower bounded
Mapping video channels to multicast CIDs shall be known
by
to all BSs belonging to the same MBS geographical zone.
In addition, the same security mechanism is applied through- EF R ≥ Fb (1 − GERb ) + Fe (1 − GERb ) (1 − GERe ) .
out the same MBS geographical zone. Once an MSS have (1)
718 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 25, NO. 4, MAY 2007
GERb and GERe depend on reference distance (d), refer- where Lb and Le are RS section size for base layer and
ence speed (s), channel coding scheme (RS outer coding and enhanced layer, respectively. Suppose that the RS coding rate
CTC inner coding). The channel coding scheme, represented for base layer is ρb and the RS coding rate for enhanced layer
by Ψ = {Lb , Le , ρb , ρe , M CSb , M CSe }, is a parameter set is ρe . Then the number of RS sections for base layer GOP
which consists of RS section size L, RS coding rate ρ, and subsequence, Nb , equals Mb /ρb . The number of RS sections
MCS scheme for base layer and enhanced layer, respectively. for enhanced layer GOP subsequence, Ne , equals Me /ρe .
In the following, we use EF R (d, s, Ψ, fb , fe ) to represent the Now we calculate the bandwidth requirement to trans-
dependence of EFR on various factors. mit one GOP subsequence after encryption and RS coding.
To meet a certain EF R (d, s, Ψ, fb , fe ), minimal bandwidth Denote Lpdu as the MAC PDU size and Lmap ie as the
(in terms of OFDMA slot rate) per video channel is required. MAP DATA IE size. Denote BP S (M CS) as the number
Denote the mean bandwidth requirement (in terms of OFDMA of non-duplicate information bytes which can be transmitted
slot rate) for each channel as R (Ψ, fb , fe ). The total band- in one OFDMA slot under certain MCS. The MCS used to
width required to support K video channels can be represented transmit MAP DATA IE should be more robust than that for
as K (1 + δK ) R (Ψ, fb , fe ), where δK is a guard factor (in MAC PDU. Then, the total number of OFDMA slots required
terms of percentage of mean) provided to absorb video traffic to transmit one GOP subsequence is
variation. With the consideration of the multiplexing gain, δK
becomes smaller as K increases. −S (N, M CSpdu ) =
SGOP
Lpdu Lmap ie (5)
Since the worst video quality is normally experienced at the N BP S(MCSpdu ) + BP S(MCSmap ) ,
cell edge, we can guarantee sufficient video quality throughout
where the MAC PDU size is calculated by
the cell if we can satisfy video quality at the cell edge. Let
d denote the cell radius. Suppose the effective frame rate Lpdu = M AC Header + CRC + L. (6)
requirement throughout the cell is EF Rmin .
Understanding that the supported number of video chan- Note MCS for base layer and MCS for enhanced layer are
nels is a good indicator of the spectral efficiency for video different, set as M CSb and M CSe , respectively. Then the
transmission, we formulate the joint optimization of spectral total number of OFDMA slot required to transmit one GOP
efficiency and video quality as sequence is
MaximizeK SGOP = SGOP −S (Nb , M CSb ) + SGOP −S (Ne , M CSe ) .
s.t. (2) (7)
EF R (d, s, ψ ∗ , fb , fe∗ ) > EF Rmin , It can be further represented as
where ψ ∗ and fe∗ are derived by SGOP (ψ, Γb , Γe ) = SGOP −S (1+β)GS b (Γb )
, M CS b
ρb Lb
{Ψ∗ , fe∗ } = argmax EF R (d, s, Ψ, fb , fe ) (1+β)GSe (Γe )
{Ψ,fe } (3) + SGOP −S ρe Le , M CSe .
s.t.K (1 + δK ) R (Ψ, fb , fe ) ≤ , (8)
Finally, the OFDMA slot rate required to support one video
where is the total bandwidth (in terms of OFDMA slot rate)
channel can be derived as
available for MBS.
In the following two subsections, we derive the bandwidth fb fe
R (ψ, fb , fe ) = λSGOP ψ, , . (9)
requirement R (Ψ, fb , fe ) and RS block decoding error rate λ λ
GER used for the calculation of EF R (d, s, Ψ, fb , fe )) in
detail. C. RS block decoding error rate (GER)
In this subsection, we compute the RS block decoding error
B. Bandwidth requirement (R (Ψ, fb , fe )) rate as a function of such factors as distance d, speed s and
We first calculate the RS block size of one GOP subse- channel coding scheme.
quence. Suppose the number of video frame in the base-layer For the theoretical analysis, we assume a stationary mem-
GOP subsequence is Γb (Γb = Fb /λ , where λ is the GOP oryless channel model. This premise is justified because the
arrival rate) and the number of video frame in the enhanced- shadowing margin and the temporal fading margin are applied
layer GOP subsequence after shaping is Γe (Γe = Fe /λ). and several randomization and interleaving procedures are
By following video traffic model (e.g., [12], [13]), we can used.
derive the base-layer GOP subsequence size, represented as As we discussed in the previous sections, we use RS outer
GSb (Γb ), and the enhanced-layer GOP subsequence size, coding with CRC insertion. Assuming the reliability of CRC
represented as GSe (Γe ). Assume β percentage of encryption for each RS section, the normal RS decoding turns out to
overhead is added to video traffic. Then we can calculate the be erasure RS decoding. Recall that N is the RS block size
number of data RS sections for base layer GOP subsequence, and M is the number of data section. The RS block (one
Mb , and the number of data RS sections for enhanced layer GOP subsequence) decoding error probability for erasure RS
GOP subsequence, Me , as decoding, denoted as GER, is
Mb = (1+β)GS b (Γb ) N
N N −i
Lb (4) GER ≈ Psi (1 − Ps ) , (10)
Me = (1+β)GSLe
e (Γe )
, i=N −M
i
WANG et al.: SYSTEM ARCHITECTURE AND CROSS-LAYER OPTIMIZATION OF VIDEO BROADCAST OVER WIMAX 719
250 50
45
Relative Coverage Gain (%)
Jianfeng Wang received the B.E. and M.E. degrees Yuguang Fang received a Ph.D. degree in Systems
in electrical engineering from Huazhong University Engineering from Case Western Reserve University
of Science and Technology, China, in 1999 and in January 1994 and a Ph.D degree in Electrical
2002, respectively, and the Ph.D. degree in electrical Engineering from Boston University in May 1997.
engineering from University of Florida in 2006. He was an assistant professor in the Department of
From January 2006 to July 2006, he was a research Electrical and Computer Engineering at New Jersey
intern in the Wireless Standards and Advanced Institute of Technology from July 1998 to May 2000.
Technology group at Intel Corporation. In October He then joined the Department of Electrical and
2006, he joined Philips Research North America as Computer Engineering at University of Florida in
a senior member research staff in the Department of May 2000 as an assistant professor and got an early
Wireless Communications and Networking. He is a promotion to an associate professor with tenure in
member of the IEEE. He can be reached at jianfeng.wang@philips.com. August 2003 and to a full professor in August 2005. He has published over
200 papers in refereed professional journals and conferences. He received the
Muthaiah Venkatachalam is the lead E2E architect National Science Foundation Faculty Early Career Award in 2001 and the
in the Wireless Standards and Advanced Technology Office of Naval Research Young Investigator Award in 2002. He has served
group at Intel Corporation. He is currently heading on several editorial boards of technical journals including IEEE Transactions
the MAC layer definition, design and specification on Communications, IEEE Transactions on Wireless Communications, IEEE
for the next generation mobile WiMAX. He has Transactions on Mobile Computing and ACM Wireless Networks. He have
played a significant role in the evolution of broad- also been actively participating in professional conference organizations such
band wireless industry by actively participating and as serving as The Steering Committee Co-Chair for QShine, the Technical Pro-
contributing to standards development at IEEE and gram Vice-Chair for IEEE INFOCOM’2005, Technical Program Symposium
WiMAX Forum. He is the chair of the ”Idle mode Co-Chair for IEEE Globecom’2004, and a member of Technical Program
and Paging” subteam in the WiMAX Forum, which Committee for IEEE INFOCOM (1998, 2000, 2003-2007). He is a senior
is a key component of the E2E architecture for member of the IEEE.
WiMAX. He serves as an editor for Elsevier Journal of “Computer Networks”
and has served as an editorial board member for “Special Issue on Media
and Stream Processing” in the International Journal of Embedded Systems.
He has served as an organizing committee member for 5th, 6th and 7th
Workshops on Media and Streaming Processors and is currently an organizing
committee member for the 8th workshop on Media and Streaming Processors.
He has several publications and over 35 patents pending. Previously at Intel,
he has led the efforts on developing network processor based IP and ATM
traffic management solutions; processing architectures for Intel’s IXP23xx
Network processor family; and system architectures for broadband access,
wireless access and metropolitan optical networking platforms. He has a
bachelor’s degree from the Indian Institute of Technology at Kharagpur and
a graduate degree from the University of Texas at Austin. He can be reached
at muthaiah.venkatachalam@intel.com.