Comparative study of Intra Frame Coding efficiency in HEVC and VP9 By: Shwetha Chandrakant Kodpadi 1001051972 [email protected]Spring 2014 EE5359 Multimedia Processing Interim Report Under the guidance of Dr.K.R.Rao University of Texas at Arlington Dept. of Electrical Engineering
34
Embed
Comparative study of Intra Frame Coding efficiency in HEVC and VP9 By: Shwetha Chandrakant Kodpadi 1001051972 [email protected] Spring.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Comparative study of Intra Frame Coding efficiency in HEVC and VP9
University of Texas at ArlingtonDept. of Electrical Engineering
Contents• Objective• General Compression Dataflow• HEVC Coding Standard• Google VP9• Intra Prediction Modes in HEVC and VP9• Performance Metrics• Test sequences and Implementation• Implementation Results• Conclusions• Acronyms and abbreviations• References
Objective
This project aims at studying and comparing intra frame coding efficiency between video coding technologies HEVC and VP9.Overview of different coding tools in HEVC and VP9 such as Prediction, Transforms, Entropy Coding and Post-processingDetailed analysis and implementation of HEVC and VP9 Intra Frame coding [1][3] and compare the performances by using various performance metrics.PSNR, MSE, BD-PSNR and BD-BR [14] are used as comparison metrics.
General compression dataflow
Figure 1: Hybrid block-based codec dataflow [9]
Both HEVC and VP9 video compression standards are hybrid block-based codecs relying on spatial transformations [9]. General compression data flow of hybrid block-based encoders is illustrated in Figure 1.
General compression dataflow (Continued..)• The input video frame is initially partitioned into blocks of the same
size called macroblocks• A macroblock is subpartitioned into smaller blocks to perform
prediction (intra and inter predictions)• The aim of the prediction process is to reduce data redundancy and
therefore, not store excessive information in coded bitstream• Intra-prediction works within a current video frame and is based upon
the compressed and decoded data available for the block being predicted
• Predicted frame is subtracted from the original data to get residuals which are subjected to transforms such as forward discrete Fourier transform and discrete cosine transform [22]
• The transform coefficients are further quantized and subjected to Entropy Coding which makes it possible to get compressed bit-stream.
High Efficiency Video Coding
• High Efficiency Video Coding (HEVC) is the latest Video Coding format [4].
• It challenges the state-of-the-art H.264/AVC [21] Video Coding standard which is in current use in the industry by being able to reduce the bit rate by 50%, retaining the same video quality [4].
• Also HEVC supports increased use of parallel processing architectures [1].
Figure 2: Encoder block diagram for HEVC [4]
Figure 3: Decoder block diagram for HEVC [17]
VP9
• VP9 is an open and royalty free video compression standard being developed by Google [2][3].
• One of the goals for VP9 is to reduce the bit rate by 50% compared to VP8 while having the same video quality [7]
• VP9 expands techniques used in AVC and VP8 and is very likely to replace H.264/AVC at least in the YouTube video service [9].
Figure 4: Encoder block diagram for VP9 [19]
Figure 5: Decoder block diagram for VP9 [19]
HEVC and VP9 Coding Tools
• Macroblock concept and Prediction block sizes• Prediction Modes (Intra and Inter)• Transform and Quantization • Entropy Coding• Post Processing
Intra Prediction modes in HEVC•HEVC has 35 luma intra prediction modes (Figure 6)•Intra prediction can be done at different block sizes, ranging from 4 X 4 to 64 X 64 (whatever size the PU has) (Figure 7)•HEVC also includes a planar and DC intra prediction modes
Figure 6: Modes and directional orientations for intra picture prediction for HEVC [1]
Figure 7: Luma intra prediction modes for different PU sizes in HEVC [8]
VP9 techniques
• Recursive variable block size support (64x64 to 4x4, square and rectangular)• Finer motion vector precision (16th-pel) and adaptive motion filters (sharp, lowpass, regular) Figure 8: Example partitioning of a 64x64
Super-block
• Multi-level past and future reference (alt-ref) frames• larger dct transforms (32x32,16x16, 8x8, 4x4) and adst
transforms (16x16, 8x8, 4x4)
Intra Prediction modes in VP9
•There are total of ten intra-prediction modes in VP9 [9]: DC, True Motion (TM), and eight angular modes (H, V, D207, D153, D135, D117, D63 and D45 on Figure 8)
Figure 9: VP9 angular intra-prediction modes [9]
Performance Metrics
Criteria to evaluate the compression qualityTwo types of quality measures Objective quality measure- PSNR, MSE Structural quality measure- SSIM [10]MSE and PSNR for a NxM pixel image are defined as
where O is the original image and R is the reconstructed image. M and N are the width and height of an image and ‘L’ is the maximum pixel value in the NxM pixel image.
M
m
N
n
nmRnmONM
MSE1 1
2,,*
1
dBMSE
LPSNR ,log10
2
10
Performance Metrics(Continued..)
Bjøntegaard-Delta Bit-Rate and peak signal-to-noise ratio Measurements
•The Bjøntegaard delta peak signal-to-noise ratio (dB) [14][20] and the Bjøntegaard delta bit-rate (%) will be used compare performance of HEVC and VP9 Codecs.•As rate-distortion (R-D) performance assessment [14], Bjøntegaard-Delta bit-rate measurement method is used for calculating average bit-rate differences between R-D curves for the same objective quality (e.g., for the same PSNR-YUV values), where negative BD-BR values Indicate actual bit-rate savings.•Encoding time is used to compare the implementation complexity
Implementation
• For comparison purposes open-source implementations of the reviewed codecs will be used. HEVC compression efficiency will be measured with the HM Test Model [12].
• Evaluation of VP9 compression performance will be carried out with the VPX encoder from The WebM Project [13].
• Since HEVC has more Intra Prediction modes and few other features better than VP9, both the codecs are configured to establish a fair comparison.
Implementation(Continued..)• The test video sequence set that will be used for the comparison
• HEVC provides better compression rates than VP9, but VP9 is patent-free and can be used without licensing expenses.
• For Intra frame coding, HEVC gives 13% more bitrate savings than VP9. And the encoding time taken by VP9 is marginally less than HEVC.
List of Acronyms
• ADST - Asymmetric Discrete Sine Transform
• AVC – Advanced Video Coding• BD-BR- Bjøntegaard-Delta Bit-Rate
Measurements• BD-PSNR - Bjøntegaard-Delta Peak signal to
noise ratio• CU- Coding unit• CTU- Coding tree unit• DBF- Deblocking Filter• DFT – Discrete Fourier Transform• DCT – Discrete Cosine Transform• DST – Discrete Sine Transform• DPB - Decoded Picture Buffer• DC – Direct Current• HD- High definition • HEVC-High Efficiency Video Coding
• ITU-T - International Telecommunication Union (Telecommunication Standardization Sector)
• JPEG - Joint photographic experts group• JCT-VC- Joint collaborative team on video
coding• MSE-Mean square error• MPEG-Moving picture experts group• NGOV- Next Geneneration Open Video• PU- Prediction unit • PSNR-Peak signal to noise ratio• PU – Prediction Unit• RD – Rate Distortion• SAO - Sample Adaptive Offset• SSIM- Structural similarity index• TM- True Motion • TU-Transform units • VCEG – Video Coding Experts Group
References
• [1] G.J. Sullivan et al, “Overview of the high efficiency video coding (HEVC) standard”, IEEE Trans. circuits and systems for video technology, vol. 22, no.12, pp. 1649 – 1668, Dec 2012.
• [2] D. Grois et al, “Performance Comparison of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders” 30th Picture Coding Symposium 2013 (PCS 2013), San José, CA, USA, Dec 8-11, 2013
• [3] D. Mukherjee et al, “The latest open-source video codec VP9–An overview and preliminary results”, Google Inc., United States
• [4] G.J. Sullivan et al, "Standardized Extensions of High Efficiency Video Coding (HEVC)", IEEE Journal of Selected Topics in Signal Processing, vol.7, no.6, pp.1001-1016, Dec. 2013
• [5]Article on HEVC - http://en.wikipedia.org/wiki/High_Efficiency_Video_Coding• [6] Q. Cai et al, “Lossy and lossless intra coding performance evaluation: HEVC, H.264/AVC,
JPEG 2000 and JPEG LS”, Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific, vol.9, no.12, pp.1-9, Dec 2012.
• [7] "VP-Next Overview and Progress Update" (PDF). WebM Project (Google). Retrieved 2012-12-29. Available on : http://downloads.webmproject.org/ngov2012/pdf/04-ngov-project-update.pdf
• [8]M.T. Pourazad et al, “HEVC:The new gold standard for video compression”, IEEE consumer electronics magazine ,vol.1, no.7, pp.36-46, July 2012.
References(Continued..)• [9] M.P. Sharabayko et al, "Intra Compression Efficiency in VP9 and HEVC" Applied
Mathematical Sciences, Vol. 7, no. 137, pp.6803 – 6824, Hikari Ltd, 2013• [10] Z. Wang et al, “Image quality assessment: From error visibility to structural similarity,”
IEEE Trans. on Image Processing, vol. 13, no. 4, pp. 600-612, Apr. 2004. • [11] H. Jain, “Comparative performance analysis of HEVC and H.264 Intra frame coding
and JPEG2000”, EE5359, UTA, spring 2013.• http://www-ee.uta.edu/Dip/Courses/EE5359/index.html.• [12] HM Reference Software- https://hevc.hhi.fraunhofer.de/HM-doc/• [13] Chromium® open-source browser project, VP9 source code,
• [14] G. Bjøntegaard, “Calculation of average PSNR differences between RD-curves”, ITU-T Q.6/SG16 VCEG 13th Meeting, Document VCEG-M33, Austin, USA, Apr. 2001.
• [15] S. Jeong et al., High efficiency video coding for entertainment quality. ETRI J. vol.33, pp.145–154, 2011.
• [16] JVT Draft ITU-T recommendation and final draft international standard of joint video specification (ITU-T Rec. H.264-ISO/IEC 14496-10 AVC), March 2003, JVT-G050-http://ip.hhi.de/imagecom_G1/assets/pdfs/JVT-G050.pdf
References(Continued..)
• [17] White paper on PSNR-NI - http://www.ni.com/white-paper/13306/en/• [18] HEVC tutorial by I.E.G. Richardson: http://www.vcodex.com/h265.html• [19] J. Padia, “Complexity reduction for VP6 to H.264 transcoder using motion vector
reuse,” M.S. Thesis, EE Dept., UTA, Arlington, TX, 2010. Available on : http://www-ee.uta.edu/Dip/Courses/EE5359/index.html
• [20] G. Bjøntegaard , “Improvements of the BD-PSNR model” ITU-T SG16 Q.6, Doc. VCEG-AI11, Berlin, Germany, July 16-18, 2008
• [21] T. Wiegand et al, “Overview of the H.264/AVC Video Coding Standard”, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 13, No. 7, pp. 560-576, Jul. 2003.
• [22] N. Ahmed , T. Natarajan, K.R. Rao, “Discrete Cosine Transform”, IEEE Transactions on Computers, Vol. C-23, pp. 90-93, Jan. 1974.
• [23] I.E.G. Richardson, “The H.264 advanced video compression standard”, 2nd Edition, Hoboken, NJ, Wiley, 2010.
• [24] I.E.G. Richardson, “Video Codec Design: Developing Image and Video Compression Systems”, Wiley, 2002.
• [25] K.R. Rao, D.N. Kim and J.J. Hwang, “Video Coding Standards: AVS China, H.264/MPEG-4 Part 10, HEVC, VP6, DIRAC and VC-1”, Springer, 2014.
• [26] B. Bross et al, “High Efficiency Video Coding (HEVC) Text Specification Draft 10”, Document JCTVC-L1003, ITU-T/ISO/IEC Joint Collaborative Team on Video Coding (JCT-VC), Mar. 2013 available on http://phenix.itsudparis.eu/jct/doc_end_user/current_document.php?id=7243