Professional Documents
Culture Documents
=
|
|
\ .
This methodology represents a real contribution scenario
with high fidelity, where original 10-bit sequences are
previously downscaled to 8-bits to be 8-bit depth encoded
(422@8b), stored, and transmitted, and finally up-scaled to
10-bits on the decoder side.
PSNR
/SSIM
PSNR/SSI
M
Sequence
10b
H.264
Hi422@10b
Round,
clip, >> 2
Sequence
8b
H.264
Hi422@8b
<< 2
Fig. 1. Experimental architecture for 422@10b and 422@8b
The aim of this experiment is to evaluate the coding
performance of Hi422 profile of H.264, for 10-bit and 8-bit
pixel depth sequences, and not to evaluate custom
implementations [14] [15]. For this reason, all of the
simulations have been run using the AVC/H.264 reference
software JM, version 18.0, available from [4].
IV. EXPERIMENTAL RESULTS
Five original 422@10b high definition sequences, have
been used in this experiment, with a 10 second length each.
Four of them, CrowdRun, Ducks, IntoTree, and ParkJoy are
supplied by SVT [16], and the last one, Dancer, is available at
[17], supplied by the European Broadcasting Union (EBU).
Fig. 2. First frame of Test sequences used in experiments (422@10b)
To run the simulations, the authors followed the Common
Simulation Conditions, recommendations made by ITU-T in
[18], for the analysis of H.264 coding of high resolution
sequences, fitting them to the current professional scenario for
live transmissions that use fix GOP. As such, Hi422 profile
and GOP length 32 were used. Other relevant parameters used
in this work are shown in Table I.
10 N = , (1)
120
TABLE I
COMMON SIMULATION CONDITIONS FOR H.264
Parameter Value
CABAC On
Number of B frames 7
Pyramid Coding Levels 3
Explicit Pyramid Format (JM defined) b3r0b1r1b0e2b2e2b5r1b4e2b6e2
Rate Distortion Optimization 1
Motion Estiamtion Search Range 64
Use Fast Motion Estimation 3
Number of Reference Frames 4
QP for I Slices 22, 27, 32, 37
QP for P Slices 23, 28, 33, 38
QP for B Slices 24, 29, 34, 39
To compute the objective difference between 422@10b and
422@8b simulations, we used the Bjntegaard Delta
methodology defined by ITU in [19], which computes the
BD-Rate, BD-PSNR, and BD-SSIM.
This method calculates the average difference between two
Rate-Distortion curves (Rate vs. PSNR or Rate vs. SSIM) a
and b, fitting each of these curves through four data points
obtained for each QP (22, 27, 32, and 37). The convention
used in this work implies that a negative BD-PSNR or BD-
SSIM value means a lower quality performance of Hi422@8b
regarding Hi422@10b, for the same bit rate, and therefore a
positive value means a higher 8-bit coding performance.
Concerning BD-Rate, a negative value means that, for the
same quality, a lower bit rate is obtained from Hi422@8b
coding regarding Hi422@10b. Consequently, 8-bit depth gets
a bit rate saving for the same 10-bit quality. BD-Rate is
measured as a percent of bit rate between bit rates of curves
a and b. Table II shows the BD-PSNR and BD-Rate
simulation results for 720p format.
TABLE II
BD-PSNR (720P50)
PSNR Y U V
720p50
BD-Rate
(%)
BD-PSNR
(dB)
BD-Rate
(%)
BD-PSNR
(dB)
BD-Rate
(%)
BD-PSNR
(dB)
CrowdRun -0.0918 0.0020 4.0450 -0.0916 4.0188 -0.0935
Ducks -1.3139 0.0516 1.1551 -0.0282 3.5450 -0.0325
IntoTree -0.0225 0.0014 5.4909 -0.0629 8.4008 -0.0801
ParkJoy -0.9126 0.0386 0.3849 -0.0400 2.0398 -0.0586
Dancer 0.1798 -0.0070 4.7290 -0.1099 7.2565 -0.1665
TOTAL 0.43 0.02 3.16 0.07 5.05 0.09
Can be observed as the luminance component (Y), for the
first four sequences, shows a slight PSNR improvement for 8-
bit coding, and hence a slight bit rate saving. Only the
Dancer sequence shows an insignificant luminance
improvement for 10-bit encoding. The average BD-PSNR
performance for the luminance component is negligible
(0.02dB), with a meager 0.43% bit rate saving for 422@8b
encoding.
On the other hand, the results for U and V chroma
components obtain a slightly better performance for 10-bit
encoding, with a negligible PSNR improvement (0.02 and
0.07dB) and a rate saving of around 5%. Considering that U
and V color components spend statistically fewer bits than the
luminance component, no straight conclusions can be derived
from U and V bit rate savings (3% and 5%, respectively).
In order to appreciate the small differences between both
bit depths coding, Fig. 3 depicts an enlargement of PSNR
curves for the Y component of the ParkJoy sequence, from
20Mbps to 35Mbps, which are common bit rates used for
HDTV contribution services. Figure 4 shows the global
PSNR-Rate simulation results obtained from the U component
for the ParkJoy sequence, where unnoticeable differences
can be observed.
32.0
32.5
33.0
33.5
34.0
34.5
35.0
35.5
36.0
20 22 24 26 28 30 32 34
P
S
N
R
(
d
B
)
Bitrate (Mb)
PSNR-Y
ParkJoy 720p50
Hi422@10
b
Hi422@8b
Fig. 3. PSNR-Y, ParkJoy@720p50, zoom from 20-35Mbps
29
31
33
35
37
39
41
0 10 20 30 40 50 60
P
S
N
R
(
d
B
)
Bitrate (Mb)
PSNR-U
ParkJoy 720p50
Hi422@10
b
Hi422@8b
Fig. 4. PSNR-U, ParkJoy@720p50
In an analogous way, the 720p SSIM results are depicted in
Table III, and a similar trend can be derived. No noticeable
quality differences are shown (lower than 0.003) between both
422@10b and 422@8b encoding profiles, and only a slight bit
rate saving (0.27% for luminance and 4% and 5% for color
components) can be obtained from 10-bit depth coding.
Regarding the 1080i format, Table IV depicts the BD-
PSNR results, where again the luminance component obtains
for 422@8b encoding a slight PSNR improvement (0.4dB)
and bit rate saving (0.0186%). The U and V color components
achieve a negligible PSNR improvement, with a meager data
rate saving (3.3% and 6.13%). Table V shows the 1080i25
SSIM-Rate results obtained for the Y, U, and V, components,
in which no noticeable SSIM difference can be observed, and
only a slight bit rate saving similar to 720p50 format is
present.
121
122
TABLE III
BD-SSIM RESULTS (720P50)
SSIM Y U V
720p50
BD-Rate
(%)
SSIM BD-Rate
(%)
SSIM BD-Rate
(%)
SSIM
CrowdRun 0.3779 -0.0001 6.7057 -0.0056 6.9034 -0.0047
Ducks -0.2068 0.0004 1.9941 -0.0015 3.5571 -0.0010
IntoTree 0.1029 0.0001 5.6341 -0.0017 8.8491 -0.0013
ParkJoy 0.6824 -0.0002 2.8037 -0.0019 4.2846 -0.0016
Dancer 0.4334 -0.0002 3.2694 -0.0011 4.6582 -0.0013
TOTAL 0.2780 0.0000 4.0814 -0.0024 5.6505 -0.0020
TABLE IV
BD-PSNR RESULTS (1080I25)
PSNR Y U V
1080i25
BD-Rate
(%)
BD-PSNR
(dB)
BD-Rate
(%)
BD-PSNR
(dB)
BD-Rate
(%)
BD-PSNR
(dB)
CrowdRun -0.2208 0.0065 3.4337 -0.0712 3.0257 -0.0632
Ducks 0.2301 -0.0082 4.2152 -0.0762 10.9798 -0.0832
IntoTree 0.6380 -0.0149 8.3053 -0.0801 12.3458 -0.0986
ParkJoy -0.3607 0.0105 2.7265 -0.0624 5.7560 -0.0642
Dancer -2.3228 0.0989 -2.1784 0.0517 -1.4362 0.0359
TOTAL -0.4072 0.0186 3.3005 -0.0476 6.1342 -0.0547
TABLE V
BD-SSIM RESULTS (1080I25)
SSIM Y U V
1080i25
BD-Rate
(%)
SSIM BD-Rate
(%)
SSIM BD-Rate
(%)
SSIM
CrowdRun 0.0975 -0.0063 -0.5927 0.0007 -4.1455 0.0023
Ducks 0.0927 0.0002 2.7838 -0.0018 7.7800 -0.0015
IntoTree 0.9010 -0.0006 9.2103 -0.0019 15.2666 -0.0015
ParkJoy 0.2084 0.0000 3.3945 -0.0020 6.0989 -0.0016
Dancer 0.5702 -0.0002 4.0392 -0.0013 6.1847 -0.0014
TOTAL 2.3048 -0.0014 3.7670 -0.0013 6.2369 -0.0007
V. CONCLUSIONS
The experiment results reveal that the hypothetical
efficiency improvement obtained using the 10-bit Hi422
profile against 8-bit is reduced to an unnoticeable gain in
terms of bit rate saving, but exclusively for color video
components, around 5%, but not for the luminance
component. In most of the sequences, especially for the 720p
format, the luminance component achieves better PSNR and
bit rate saving figures. It is well known that the human visual
perception system is more sensitive to luminance information
that color.
Both the PSNR and SSIM video quality metrics used in the
experiment show the same trend, and it does not allow to
confirm that 10-bit H.264 coding offers a better perceptual
quality than 8-bit sample depth coding, under these specific
test conditions.
ACKNOWLEDGEMENT
This work has been jointly supported by the MINECO and
European Commission (FEDER funds) under the project
TIN2012-38341-C04-04.
REFERENCES
[1] Draft ITU-T Recommendation and Final Draft International
Standard of Joint Video Specification (ITU-T Rec. H.264 |
ISO/IEC 14496-10 AVC) Joint Video Team (JVT), Mar.2003,
Doc. JVT-G050.
[2] G.J. Sullivan, P. Topiwala, and A. Luthra, The H.264/AVC
Advanced Video Coding Standard: Overview and Introduction
to the Fidelity Range Extensions (FRExt), SPIE Conference on
Applications of Digital Image Processing XXVII, August 2004.
[3] B. Bross, W.-J. Han, J.-R. Ohm, G.J. Sullivan, and T. Wiegand.
High efficiency video coding (HEVC) text specification draft 6.
Document JCTVCH1003 of JCT-VC, February 2012.
[4] ftp://standards.polycom.com/reference_software/
[5] Z. Wang, A..C. Bovik, H.R. Sheikh, and E.P. Simoncelli,
Image quality assessment: From error visibility to structural
similarity, IEEE Trans. Image Process., vol. 13, no. 4, pp.
600612, Apr. 2004.
[6] ITU-T and ISO/IEC JTC 1, Generic coding of moving pictures
and associated audio information Part 2:Video, ITU-T Rec.
H.262 and ISO/IEC 13818-2 (MPEG-2), Nov. 1994.
[7] E. Dumic, M. Mustra, S. Grgic, G. Gvozden, "Image quality of
4:2:2 and 4:2:0 chroma subsampling formats," ELMAR '09
International Symposium, pp.19-24, 28-30 Sept. 2009.
[8] ISO/IEC JTC 1, Coding of audio-visual objects Part 2:
Visual, ISO/IEC 14496-2 (MPEG-4 Part 2), Jan.1999.
[9] D. Marpe, T. Wiegand, S. Gordon, "H.264/MPEG4-AVC
fidelity range extensions: tools, profiles, performance, and
application areas", IEEE International Conference on Image
Processing ICIP 2005, vol.1, pp. I-593-6, 11-14 Sept.2005.
[10] T. Wedi and Y. Kashiwagi, Subjective quality evaluation of
H.264/AVC FRExt for HD movie content, Joint Video Team
document JVT-L033, July, 2004.
[11] T. Chujoh, R. Noda, Internal bit depth increase for coding
efficiency, ITU-T SG16 Q.6 Document, VCEG-AE13,
Marrakech, Jan. 2007.
[12] Joint Call for Proposals on Video Compression Technology,
ITU-T SG16 Q6 document VCEG-AM91 and ISO/IEC
JTC1/SC29/WG11, Kyoto, Japan, Jan. 2010.
[13] W. Gish, H. Yu, " Extended Sample Depth: Implementation
and Characterization," ISO/IEC JTC1/SC29/WG11 and ITU-T
SG16 Q.6 Document JVT-H016, Geneva, Switzerland, May,
2003.
[14] P. Larbier, Using 10-bits AVC/H.264 Encoding with 4:2:@ for
Broadcast Contribution, ATEME, Bievres, France
[15] M. Compton, 10 bit high quality MPEG-4 AVC video
compression, Tandberg Television, Southampton, UK.
[16] ftp://vqeg.its.bldrdoc.gov/HDTV/SVT_MultiFormat/
[17] http://www.ebu.ch/fr/technical/hdtv/test_sequences.php
[18] T. Tan, G. Sullivan, T. Wedi, "Recommended Simulation
Common Conditions for Coding Efficiency Experiments," ITU-
T SG16 Q.6 Document, VCEG-AA10, Nice, France, October,
2005.
[19] G. Bjntegaard, "Calculation of average PSNR differences
between RD-curves", ITU-T SG16 Q.6 Document, VCEG-M33,
Austin, US, April 2001.