You are on page 1of 7

International Journal of Advanced Computer Science, Vol. 2, No. 6, Pp. 222-228, Jun., 2012.

Manuscript
Received:
5, Jul., 2011
Revised:
28,Nov., 2011
Accepted:
3, May, 2012
Published:
15, Jul., 2012

Keywords
distributed
video coding,
side
information
creation,
mode
decision,
gray code,
motion
compensated
frame
interpolation,
quality
enhancement

Abstract A novel technique to create
side information at the block level with skip
mode, Wyner-Ziv mode and Intra/WZ mode
is presented for distributed video coding
framework and the principle of its operation
is discussed. Some novel tools such as coding
mode decision, gray mapping and lots of
advanced techniques such as side
information creation, LDPCA channel code
and correlation noise modeling are all
adopted in this framework. Simulation
results demonstrate that the proposed
scheme can achieve up to 2.4dB
improvement in rate-distortion performance
when compared to the state-of-the-art DVC
codec, especially for high motion video
sequences and long group of pictures sizes.


1. Introduction
Conventional hybrid predictive video compression
standards, such as MPEG-4 and H.264/AVC [1], that has
been much improved over the last decades. In this video
coding architecture, the correlation between (temporal) and
within (spatial) the video frames is exploited at the encoder,
using motion estimation/compensation, leading to the
encoder is typically 5 to 10 times more complex than the
decoder. However, such a heavy encoder with light decoder
video coding system is usually suitable for video
broadcasting or video streaming applications (e.g., video on
demand system) where video is encoded once and decoded
many times. Notwithstanding the huge success of predictive
video coding, this approach is less adequate for emerging
application scenarios such as wireless PC cameras, mobile
camera phones requiring low power and low computational
complexity at the video encoder side or at least a flexible
distribution of the codec complexity, robustness to channel
losses, high compression efficiency and low latency as
well. Driven by these new applications and associated


This work was supported by the National Physical Science Major Fund
Project (project number: U0735004).
D. Liu and M. H. Du are with South China University of Technology,
School of Electronic and Information Engineering
(liu-dong@mail.scut.edu.cn)
requirements, the beginning of this decade has seen the
emergence of a novel video coding paradigm, well known
as distributed video coding (DVC) [2]. DVC is based on
two major information theory results: the Slepian-Wolf [3]
and the Wyner-Ziv [4] theorems. The Slepian-Wolf theorem
states that the optimal rate achieved when performing joint
lossless encoding and decoding of two correlated sources
can theoretically also be reached by doing separate
encoding and joint decoding with a vanish error probability.
The Wyner-Ziv theorem extends this result to lossy coding
by stating that the same rate can be achieved by lossy
compression when the side information (SI) is made
available at the decoder in the case of Gaussian memoryless
sources and mean squared error distortion. Together, the
Slepian-Wolf (SW) and the Wyner-Ziv (WZ) theorems
suggest that it is possible to compress two statistically
dependent signals in a distributed way (separate encoding
and joint decoding) approaching the coding efficiency of
traditional predictive coding schemes (joint encoding and
decoding). WZ coding based architectures may provide the
following functional benefits which are rather important for
many emerging applications: 1) flexible allocation of the
overall video codec complexity; 2) improved error
resilience due to its intrinsic joint source-channel coding
framework; 3) codec independent scalability; 4)
exploitation of multiview correlation without encoders
communicating among them.
The SI creation process at the decoder is rather complex
and central for the WZ codec rate-distortion (RD)
performance since it determines the amount of errors that
have to be corrected through the SW decoder. Thus, the
efficiency of the SI creation techniques significantly
influences the RD performance of DVC. Indeed, the higher
quality of SI, the higher the correlation between the SI and
the current WZ frame is, the better the estimation and the
fewer are the parity bits necessary to be sent from the
encoder to the decoder to achieve a certain target quality. In
DVC, the SI can be generated in several ways: 1) average
of the two nearest key frames [5]. 2) Motion Compensated
Frame Interpolation (MCFI) has been used in almost the all
DVC codecs to create SI by interpolating the current frame
from the key frames [6]. Spatial motion vector smoothing,
hierarchical block size technique combined with an
adaptive search range strategy has late been proposed to
improve the performance of MCFI in [7]. 3) SI is created or
refined using some learning tools [8]-[13]. 4) SI generation
with the help of some auxiliary information, or hints, sent
by the encoder, such as the cyclic redundancy check (CRC)
[14] and hash bits [15]-[16].
Distributed Video Coding with Novel Coding
Mode Decision and Side Information Creation
Dong Liu & Ming Hui Du
Liu et al.: Distributed Video Coding with Novel Coding Mode Decision and Side Information Creation.
International Journal Publishers Group (IJPG)


223
Motion-compensated frame interpolation (MCFI) has
been used in almost the all DVC codecs to generate SI by
interpolating the current frame from previous and next
reference frames. The scheme holds as long as the block
has constant velocity. However, when high motion or
strong illumination changes occur, MCFI often fails to
create a high quality SI. To overcome this drawback, a
novel coding framework to combine MCFI technique with
encoder sent additional information is presented; it can be
achieved by intra encoding some blocks of the current WZ
frame with the help of auxiliary information, such as hash
bits, which sent from the encoder to the decoder. These
intra blocks in WZ frames have a lower quality, and are
used by the decoder to generate better SI for regions where
MCFI technique fails. The novel contributions of this paper
are: 1) block based encoding mode decision algorithm and
2) advanced SI creation framework based on a motion
compensated quality enhancement (MCQE) technique
which generate SI for the intra blocks [17]. In order to
access the impact and benefits of the proposed
contributions, the proposed DVC codec is evaluated in term
of RD performance, according to the DISCOVER [18]-[19]
test conditions. The evaluation shows RD performance
gains up to 2.1 dB, especially for high motion and long
Group of pictures (GOP) sizes.
This paper is organized as follows. In section II, the
proposed DVC codec framework is described. Next, in
section III, the proposed encoder mode decision algorithm
is presented in detail. In section IV an advanced SI
generation framework integrating all the proposed
techniques is presented. And section V presents the test
conditions and the RD performance of the novel DVC
codec. The last section finally concludes the paper.

2. The Proposed DVC Codec
Framework
The DVC codec adopted in this paper follows the
coding architecture proposed by DISCOVER and with
some modifications. The channel encoder uses the
rate-compatible LDPC Accumulate (LDPCA) codes for
distributed source coding introduced in [20]. Fig.1
illustrates the architecture of the proposed DVC encoder
with three block-based coding modes: the skip mode, the
usual WZ mode and the novel Intra/WZ mode. In a
nutshell, the adopted DVC encoder works as follows:
At the encoder:
1) Frame classification: first, the coding process
constructs a Group of Picture (GOP) by dividing the
video frames into one key frame and the remaining WZ
frames. The key frames are coded using the
H.264/AVC Intra coding scheme while WZ frames are
coded using a DVC approach.
2) Transform: over each WZ frame, an 4 4 integer DCT
is applied. Then, the resulting DCT coefficients of the
entire WZ frame are grouped into DCT coefficients
bands, according to the position occupied by each DCT
coefficient within each 4 4 block.
3) Quantization and Gray Mapping: each DCT
coefficients band is uniformly quantized with
2
k
M
levels. Quantization bins of the DCT coefficients
are here mapped to binary codewords using Gray
representation [21] since this allows improving the
correlation between the WZ and SI data and, therefore,
to enhance the overall RD performance [22]. Over the
resulting Gray mapped quantized symbol stream,
bitplane extraction is performed. For a given DCT
band, the quantized symbols bits of the same
significance (e.g. the most significant bit) are grouped
together, creating the corresponding bitplane array
which is then independently LDPCA encoded.
4) Encoder mode decision and ternary decision map
coding will be described in detail in section III.
5) LDPCA encoding: the LDPCA encoding process for
the DCT coefficients band starts with the MSB array,
corresponding to the most significant bits of the band
quantized symbols. The parity information created by
the LDPCA encoder for each bitplane of each
quantized DCT band is then stored in the buffer and
sent in chunks upon decoder request, through the
feedback channel.
6)
DCT
Uniform
Quantizer
LDPCA
Encoder
Buffer
Coding
Mode
Decision
H.264/AVC
Block
Encoder
H.264/AVC
Intra Encoder
Ternary
Decision
Map Encoer
.
.
.
DVC Encoder
Feedback Channel
WZ
Bitstream
Header
Bitstream
Intra
Block
Bitstream
Keyframe
Bitstream
Key Frames
Bitplane 1
Bitplane Mk
Gray
Mapping
WZ
Frames

Fig. 1 Proposed DVC encoder framework.

Fig. 2 shows the proposed DVC decoder framework.
First, MCFI module generates an estimate of WZ frame
i
X ,
based on the decoded reference frames
'
b
X and
'
f
X . It is
also necessary to generate SI based on the Intra decoded
blocks in WZ frames (Intra/WZ mode), thus, the Intra
blocks in the WZ frames and the ternary decision map are
decoded and used by the MCQE module along with the
MCFI. The SI is constructed by multiplexing the MCFI
estimated blocks (WZ mode) with the MCQE estimated SI
blocks (Intra/WZ mode). Then the DCT is applied to the SI,
the SI DCT bands are not used directly by the LDPCA
decoder, but they are first converted into soft-input
information, through a correlation noise modeling (CNM)
by using a Laplacian distribution due to the good trade-off
between model accuracy and complexity that expresses the
statistical dependency between
i
X and
i
Y . The Laplacian
distribution parameter is calculated by online estimation
[23]-[24]. A novel soft reconstruction function is used,
which can be seen as a generation of the optimal MMSE
reconstruction [25] and leads to better WZ decoded PSNR
International Journal of Advanced Computer Science, Vol. 2, No. 6, Pp. 222-228, Jun., 2012.
International Journal Publishers Group (IJPG)


224
than the optimal reconstruction function MMSE. Finally,
the IDCT is applied to the reconstructed DCT coefficients
and the decoded frame
'
i
X is obtained. To obtain the
decoded video sequence, the decoded key frames and WZ
frames are conveniently mixed.

LDPCA
Decoder
Soft
Reconstruction
IDCT
Ternary
Decision Map
Decoder
Motion
Compensated
Quality
Enhancement
Frame
Buffer
Motion
compensated
Frame
Interpolation
H.264/AVC
Block Decoder
H.264/AVC Intra
Decoder
Correlation
Noise Modeling
DCT
Feedback Channel
DVC Decoder
WZ
Bitstream
Header
Bitstream
Intra
Block
Bitstream
Keyframe
Bitstream
X
b

X
f

X
b

X
f

Yi
Decoded
WZ
Bitstream
Decoded
Key
Bitstream
Inverse Gray
Mapping
MUX

Fig. 2 Proposed DVC decoder framework.

3. Coding Mode Decision and
Ternary Decision Map Coding
The references frames (
b
X and
f
X ) are stored in a
buffer at the encoder to help to decide whether each block
in WZ frame is to be encoded in the skip mode, the WZ
coding mode or the Intra/WZ mode. The benefit of
introducing skip mode is triple folds. First, it reduces
significant the bit rate to be sent to the decoder and will
contribute substantially to the RD performance. Second, the
skip mode leads to reduced data for LDPC encoding at the
encoder, which is very much desired for DVC. Third, it also
leads to reduced data for LDPC decoding and will help
speed up the decoding [22]. Due to encoder complexity
constraints, it is proposed here to use MAD (Mean Absolute
Differences) as an indication of the temporal coherence
between the block k, in the current WZ frame
i
X , and the
co-located blocks in the average frame (
ave
X ) of both
reference frames, using (Equ. 1) and (Equ. 2).
( ) 2
ave b f
X X X = + (Equ. 1)
( , )
1
( ) ( , ) ( , )
i ave
x y k
MAD k X x y X x y
M e
= (Equ. 2)
In the mode decision process for block k, the block
mode m(k) is determined based on two preset
threshold
1 2
T T < , according to (Equ. 3).
2
1 2
1
1 ( )
( ) 0 ( )
1 ( )
if MAD k T
m k if T MAD k T
if MAD k T
>
= s <
<

(Equ. 3)
When
2
MAD(k) T > , it indicates that the correlation between
the source and the MCFI frames is weak, therefore,
intra/WZ mode coding should be performed, i.e. ( ) m k 1 = .
When
1
( ) MAD k T < it means the correlation between the
source and the MCFI frames is high, therefore the block
should be classified as skip, i.e. ( ) m k 1 = .
If
1 2
( ) T MAD k T s < , it should be encoded in the WZ mode,
i.e. ( ) m k 0 = . Following coding mode decision, the
ternary decision map was encoded by using RLE and
UVLC and then transmitted from the encoder to the
decoder for each WZ frame block.

4. Side Information Creation
A. Motion Compensated Frame Interpolation
To create the SI for the WZ mode blocks, the proposed
DVC codec uses the advanced block-based MCFI
framework [12]. Its basic architecture is presented in Fig.3.
MCFI process is performed in the following steps.
- Both references frames are first low-pass-filtered to
improve the motion estimation. A modified
block-matching criterion is used to estimate the
motion vectors between the past and future
references frames.
- Then, bidirectional motion estimation is applied to
refine the forward motion vectors obtained in step 1
above by combining a hierarchical block size
technique with an adaptive search range.
- Next, a spatial motion smoothing algorithm based
on weighted vector median filter is applied to make
the final motion field smoother.
- Finally, the MCFI interpolated frame can be
constructed by using a bidirectional motion
compensation technique.

Frame
Buffer
Frame
Structure
Definition
Forward
Motion
Estimation
BiDirectional
Motion
Estimation
Spatial
Motion
Smoothing
BiDirectional
Motion
Compensation
Xb

Xf

Xb
Xf
Fig. 3 MCFI block diagram.

B. Motion Compensated Quality Enhancement
When high motion or strong illumination changes
occur, the MCFI technique often fails, leading to a poor SI
quality. The idea of the proposed SI generation framework
is to improve the SI quality by using the received low
quality Intra coded blocks which is sent by encoder,
through a novel MCQE module, in Fig.4. The MCQE
module [17] works as follows:
-
i
T Block estimation: first, bidirectional motion
estimation is performed in order to obtain a pair of
symmetric motion vectors. Then, the residual error
is inspected through the SSD (Sum of Square
Difference) metric
b f
SSD SSD + s , (Equ. 4)
With:
Liu et al.: Distributed Video Coding with Novel Coding Mode Decision and Side Information Creation.
International Journal Publishers Group (IJPG)


225
( ) | | ( )

=
A =
1
0
1
0
2
,
N
x
N
y
b b
y x X E SSD

( ) | | ( )

=
A =
1
0
1
0
2
,
N
x
N
y
f f
y x X E SSD

=
|
|
.
|

\
|
=
1
0
1
0
2 2
12
2
12
8
N
x
N
y
K I
| |

(Equ. 5)
Where,
b
X A and
f
X A correspond to the residual
errors in the motion estimation process.
I
| and
K
| are the quantization step size used for the Intra
blocks
i
I and the key frames, respectively. If
(Equ. 5) is satisfied, this symmetric motion vectors
together with
'
b
X and
'
f
X , are used to obtain two
motion compensated versions of the same
i
I block.
Finally, the two motion compensated blocks and the
Intra decoded block are averaged and the
i
T
estimated blocks are obtained.
-
i
U Block estimation:
i
U estimated blocks can be
obtained by combining the remaining blocks from
i
I with the corresponding MCFI estimated
i
S
blocks using a weighted average algorithm, and the
weights e are proportional to the estimated quality
of each
i
S or
i
I block.
( ) ( ) ( ) ( ) y x S y x I y x U
i i i
, 1 , , e e + = (Equ. 6)
- SI
i
Y generation: the SI
i
Y can be constructed by
multiplexing the blocks
i
T with blocks
i
U
and
i
S .
B
i
D
i
r
e
c
t
i
o
n
a
l

M
o
t
i
o
n

E
s
t
i
m
a
t
i
o
n
Motion
Compensation
Average
X
b

X
f

I
i
Motion
Compensation
SSD
b
+SSD
f s
T
i
X
b
MC
X
f
MC
I
i
MUX
Weighted
Average
I
i
S
i
S
i
Y
i
U
i

Fig. 4 MCQE block diagram.

5. Simulation Results
In order to evaluate the performance of the advanced
DVC codec framework, it is essential to define first the test
conditions used. After, the experimental results are
presented and discussed.
A. Test Conditions
All the results here presented were taken with the
following test conditions.
- Test sequences: Hall Monitor, Foreman and Soccer,
since their contents are getting more and more
complex, Soccer is the most complex sequence in
terms of motion content. All frames for each
sequence are used.
- Temporal and spatial resolution: QCIF at 15Hz.
- GOP sizes: 2, 4 and 8.
- WZ frames RD points: Eight RD points (
i
Q ) are
defined in the RD performance evaluation
corresponding to a different number of WZ coded
bitplanes which are shown in Fig.5; while RD
points 1 (
1
Q ) and 8 (
8
Q ) correspond to the lowest
and the highest decoded quality respectively.
- Key frame coding: Key frames are Intra coded with
the H.264/AVC Main profile with a quantization
parameter (QP) depending on the number of coded
WZ bitplanes. The QP values for the key frames, as
defined in Table 1, are adapted for each video
sequence in order to obtain a rather constant
decoded video quality for the key frames and WZ
frames.
16 8 0 0
8 0 0 0
0 0 0 0
0 0 0 0
Q
1
32 8 0 0
8 0 0 0
0 0 0 0
0 0 0 0
Q
2
32 8 4 0
8 4 0 0
4 0 0 0
0 0 0 0
Q
3
32 16 8 4
16 8 4 0
8 4 0 0
4 0 0 0
Q
4
32 16 8 4
16 8 4 4
8 4 4 0
4 4 0 0
Q
5
64 16 8 4
16 8 4 4
8 4 4 4
4 4 4 0
Q
6
64 32 16 8
32 16 8 4
16 8 4 4
8 4 4 0
Q
7
128 64 32 16
64 32 16 8
32 16 8 4
16 8 4 0
Q
8

Fig. 5 Eight quantization matrices associated to eight different RD
performance points.

TABLE 1
QUANTIZATION PARAMETERS FOR THE KEY FRAMES ASSOCIATED TO
DIFFERENT RD POINTS
QP
I
(15Hz)
Q
1
Q
2
Q
3
Q
4
Q
5
Q
6
Q
7
Q
8
Hall Monitor
Foreman
Soccer
37 36 36 33 33 31 29 24
40 39 38 34 34 32 29 25
44 43 41 36 36 34 31 25


DISCOVER video codec will be used as a benchmark
to evaluate the proposed DVC codec performance.

B. Rate-distortion performance and analysis
This section presents the RD performance of the
proposed DVC codec, notably in comparison with the
benchmark mentioned above and with the DVC codec
without the proposed SI generation algorithm.
International Journal of Advanced Computer Science, Vol. 2, No. 6, Pp. 222-228, Jun., 2012.
International Journal Publishers Group (IJPG)


226
50 100 150 200 250 300 350
31
32
33
34
35
36
37
38
39
40
41
Hall Monitor(GOPsize:2):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
B
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
a
50 100 150 200 250 300 350 400 450 500
28
30
32
34
36
38
40
Foreman(GOPsize:2):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
b
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
b
50 100 150 200 250 300 350 400 450 500
26
28
30
32
34
36
38
40
Soccer(GOPsize:2):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
b
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
c
Fig. 6 RD performance for (a) Hall Monitor, (b) Foreman, (c) Soccer
video sequences (QCIF @15 Hz, GOP size 2).
50 100 150 200 250 300
31
32
33
34
35
36
37
38
39
40
41
Hall Monitor(GOPsize:4):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
B
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
a
50 100 150 200 250 300 350 400 450 500 550
26
28
30
32
34
36
38
40
Foreman(GOPsize:4):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
b
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
b
50 100 150 200 250 300 350 400 450 500 550 600
26
28
30
32
34
36
38
40
Soccer(GOPsize:4):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
b
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
c
Fig. 7 RD performance for (a) Hall Monitor, (b) Foreman, (c) Soccer
video sequences (QCIF @15 Hz, GOP size 4).
Liu et al.: Distributed Video Coding with Novel Coding Mode Decision and Side Information Creation.
International Journal Publishers Group (IJPG)


227
50 100 150 200 250 300
31
32
33
34
35
36
37
38
39
40
41
Hall Monitor(GOPsize:8):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
B
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
a
50 100 150 200 250 300 350 400 450 500 550
26
28
30
32
34
36
38
Foreman(GOPsize:8):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
b
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
b
100 200 300 400 500 600
26
28
30
32
34
36
38
Soccer(GOPsize:8):(QCIF@15HZ)
Bitrate(kbps)
P
S
N
R
(
d
b
)


DISCOVER
with skip and Intra/WZ coding modes
without skip and Intra/WZ coding modes
c
Fig. 8 RD performance for (a) Hall Monitor, (b) Foreman, (c) Soccer
video sequences (QCIF @15 Hz, GOP size 8).

Fig.6 represents the RD performance obtained for the
Hall Monitor, Foreman and Soccer test video sequences
considering a GOP size of 2. It can be clearly concluded
that the proposed DVC codec with and without coding
mode decision provides better RD performance than the
DISCOVER video codec. As Fig.6 illustrates, the proposed
DVC codec outperforms the DVC scheme without coding
mode decision for all sequences and RD points. As
expected, the higher the motion content of the video
sequence, the higher is the coding gain of the proposed with
coding mode decision. For GOP size 2, coding gains up to
0.89 dB are achieved with the proposed technique when
compared to a DVC without coding mode decision, notably
for the Soccer sequence.
Fig.7 and Fig.8 illustrate the RD performance obtained
for the Hall Monitor, Foreman and Soccer test video
sequences for GOP size of 4 and 8, respectively. As it can
be seen from Fig.6, Fig.7 and Fig.8, the RD gains increase
with the GOP size for all sequences and RD points; coding
gains up to 1.47 dB and 2.4 dB are achieved for the Soccer
sequence for GOP sizes of 4 and 8, respectively. Since
MCFI SI quality tends to decrease with the GOP size and
the motion content of the sequences, the proposed scheme
with coding mode decision can benefit of improving the
quality of SI as it can exploitation some Intra coded data.
And moreover, introducing skip mode can reduce
significant the bit rate to be sent to the decoder and will
contribute substantially to the RD performance.

6. Conclusion
This paper proposes some advanced techniques such as
encoder mode decision, Gray mapping and an advanced SI
creation framework and evaluates its RD performance in
the context of a transform domain LDPCA based WZ video
codec with feedback channel. The proposed DVC codec
improves the RD performance with compared to the
DISCOVER codec. Simulation results show a significant
improvement of RD performance typically for complex
video sequences and long GOP sizes. For the future works,
there is still much work remaining to be done in DVC to
achieve a better RD performance and we believe that more
advanced SI creation algorithms and some other issues such
as encoder rate control in DVC should be worked on.
7. References
[1] T. Wiegand, G. J. Sullivan, G.Bjontegaard, & A. Luthra,
Overview of The H.264/AVC Video Coding Standard,
(2003) IEEE Transactions on Circuits and System for Video
Technology, vol. 13, no. 7, pp. 560-576.
[2] B. Girod, A. Aaron, S. Rane, & D. Rebollo-Monedero,
Distributed Video Coding, (2005) Proceeding of the IEEE,
vol. 93, pp.71-83.
[3] J. Slepian & J. Wolf, Noiseless Coding of Correlated
Information Sources, (1973) IEEE Trans. Information
Theory, vol. 19, pp. 471-480.
[4] A. Wyner & J. Ziv, The Rate-distortion Function for Source
Coding with Side Information at The Decoder, (1976)
IEEE Trans. Information Theory, vol. 22, pp.1-10.
[5] A. Aaron , R. Zhang, & B. Girod, Wyner-Ziv Coding of
International Journal of Advanced Computer Science, Vol. 2, No. 6, Pp. 222-228, Jun., 2012.
International Journal Publishers Group (IJPG)


228
Motion Video, (2002) Conference Record of the
Thirty-Sixth Asilomar Conference on Signals, Systems and
Computers, Pacific Grove, CA, USA, vol.1, PP. 240-244.
[6] J. Ascenso, C. Brites, & F. Pereira, Improving Frame
Interpolation With Spatial Motion Smoothing for Pixel
Domain Distributed Video Coding, (2005) Process 5th
EURASIP Conference Focused on Speech Image Processing,
Multimedia Communications and Services. Smolenice,
Slovak Republic.
[7] J. Ascenso & F. Pereira, Advanced Side Information
Creation Techniques and Framework for Wyner-Ziv Video
Coding, (2008) Journal of Visual Communication and
Image Representation, vol. 19, no. 8, pp. 600-613.
[8] J. Ascenso, C. Brites, & F. Pereira, Motion Compensated
Refinement for Low Complexity Pixel Based Distributed
Video Coding, (2005) IEEE International Conf. Advanced
Video and Signal based Surveillance, Como, Italy, pp.
593-598.
[9] W. Weerakkody, W. Fernando, J. Martinez, P. Cuenca, & F.
Quiles, An Iterative Refinement Technique for Side
Information Generation in DVC, (2007) Proc. IEEE ICME,
Binjing, China, pp.164-167.
[10] D. Varodayan , D. Chen , M. Flierl, & B. Girod. Wyner-Ziv
Coding of Video with Unsupervised Motion Vector
Learning, (2008) Signal Processing: Image
Communication, vol. 23, no. 5, pp. 369-378.
[11] S. Ye, M. Ouaret, F. Dufaux, & T. Ebrahimi. Improved Side
Information Generation for Distributed Video Coding by
Exploiting Spatial and Temporal Correlations, (2009)
Journal on Image and Video Processing, vol. 2009, pp. 1-15.
[12] R. Martins, C. Brites, J. Ascenso, & F. Pereira, Refining
Side Information for Improved Transform Domain
Wyner-Ziv Video Coding, (2008) IEEE Trans. Circuits Syst.
Video Technol., vol. 19, pp. 1327-1340.
[13] R. Martins, C. Brites, J. Ascenso, & F. Pereira, Statistical
Motion Learning for Improved Transform Domain
Wyner-Ziv Video Coding, (2010) Image Processing, IET,
vol. 4, no. 1, pp. 28-41.
[14] R. Puri, A. Majumdar, & K. Ramchandran, PRISM: A
Video Coding Paradigm with Motion Estimation at the
Decoder, (2007) IEEE Trans. on Image Processing, pp.
2436-2448.
[15] A. Aaron, S. Rane, & B. Girod, Wyner-Ziv Video Coding
with Hash-based Motion Compensation at the Receiver,
(2004) Proc. IEEE International Conf. on Image Proc.,
Singapore, pp. 24-27.
[16] J, Ascenso & F, Pereira, Adaptive Hash-based Side
Information Exploitation for Efficient Wyner-Ziv Video
Coding, (2007) IEEE International Conference on Image
Processing, vol. 3, pp. 29-32.
[17] J. Ascenso, C. Brites, & F. Pereira, A Flexible Side
Information Generation Framework for Distributed Video
Coding, (2010) Multimedia Tools and Applications, vol. 48,
no. 3, pp. 381-409.
[18] X. Artigas, J. Ascenso, & M. Dalai, The Discover Codec:
Architecture, Techniques and Evaluation, (2007) Picture
Coding System.
[19] http://www.discoverdvc.org.
[20] D. Varodayan, A. Aaron, & B. Girod, Rate-adaptive Codes
for Distributed Source Coding, (2006) EURASIP Signal
Processing Journal, Special Section on Distributed Source
Coding, vol. 86, no. 11, pp. 3123-3130.
[21] J.R.Bitner, G.Ehrlich, & E.M.Reingold,Efficient Generation
of the Binary Reflected Gray Code and Its Applications,
(1976) Commun. ACM, vol. 19, no. 9, pp. 517-521.
[22] G.Hua & C.W.Chen, Distributed Video Coding with Zero
Motion Skip and Efficient DCT Coefficient Encoding,
(2008) IEEE international conference on multimedia and
expo, Hannover, Germany, pp. 777-780.
[23] C. Brites & F. Pereira, Correlation Noise Modeling for
Efficient Pixel and Transform Domain WynerZiv Video
Coding, (2008) IEEE Transactions on Circuits and Systems
for Video Technology, vol.18, no. 9, pp. 1177-1190.
[24] J. Skorupa, J. Slowack, S. Mys, N. Deligiannis, J. De Cock,
P. Lambert, A. Munteanu, & R. Van de Walle, Exploiting
Quantization and Spatial Correlation in Virtual-Noise
Modeling for Distributed Video Coding, (2010) Signal
Processing: Image Communication, vol. 25, no. 9, pp.
674-686.
[25] D. Kubasov, J. Nayak, & C. Guillemot, Optimal
Reconstruction in Wyner-Ziv Video Coding with Multiple
Side Information, (2007) IEEE 9th Workshop on
Multimedia Signal Processing, Chania, Greece, PP.
183-186.

Dong Liu was born in Shan Dong, China,
in 1986. He received the B.Sc. degree
from Guangxi University in 2008. He is
currently pursuing his Ph.D. degree in
South China University of Technology,
School of Electronic and Information
Engineering. His current research
interests include distributed video coding
and image and signal processing.

Ming Hui Du was born in Guangdong,
China, in 1964. He received the B.Sc.
degree in wireless electronics from Fudan
University of electronic engineering in
1985, and obtained the Ph.D. degree from
South China University of Technology
(SCUT) in 1991. He is currently
Professor at the School of Electronic and
Information Engineering at SCUT. He is
a member of IEEE, and a senior member of the Chinese Institute
of Electronics. He also is a vice chairman of the Institute of Image
and Graph of Guangdong. Research topics include image and
signal processing, wireless communication.

You might also like