Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding Iole Moccagatta, PhD Multimedia Group, IMEC, Kapeldreef 75, B-3001, Leuven, Belgium [email protected]ESTIMedia 2006 October 26th, 2006
45
Embed
Recent Developments in Video Compression Standards and their …peace.snu.ac.kr/ESTIMedia/pdf/keynote_B.pdf · 2017-01-31 · Recent Developments in Video Compression Standards and
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video CodingIole Moccagatta, PhDMultimedia Group, IMEC, Kapeldreef 75, B-3001, Leuven, [email protected]
ESTIMedia 2006October 26th, 2006
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding
• The MPEG-4 video standards• H.264/MPEG-4 AVC• Scalability in H.264 Ann. G/MPEG-4 SVC • MPEG-4 MVC: context, motivation, and coding
principles• FVV system and application scenarios• H.264/MPEG-4 AVC complexity• H.264 Ann. G/MPEG-4 SVC and MVC complexity• Impact on embedded platforms• Conclusions
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding
Fig. 7: (a) – (e) Comparison of R-D curves for MPEG-2 (MP2), MPEG-4 Part 2 ASP (MP4 ASP) and H.264/AVC (MP4 AVC). I frames were inserted every 15 frames (N=15) and two non-reference B frames per reference I or P frame were used (M=3)
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding
• Three techniques:– Inter-layer Intra Texture Prediction
• un-constrained (multiple loop decoding at target layer multiple mot. comp.) and constrained (single loop decoding)
– Inter-layer Motion Prediction
• macroblock partitioning, scaled motion vectors and reference indices of base layer are used in enhancement layer (base layer mode)
• for each motion vector a quarter-sample motion vector refinement is additionally transmitted and added to the derived motion vectors (1/4pel refinement mode)
– Inter-layer Residual Prediction
• only code the difference between current layer residual information and previous layer (up-sampled) residual information
• Some concepts already existed in MPEG-2/4 for spatial scalability
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding
H.264/AVC BP Encoder: Complexity Estimation from Profiling
• Profiled on a reduced instruction set computing (RISC) platform (one PE, 1GHz Ultra Sparc II CPU and 8 Gbytes RAM)
• Disclaimer: non-optimized SW, no algorithm optimization (ex: integer-pel full-search ME)– Note: complexity analysis based on MPEG ref. SW (JM) typically overstates the actual
complexity of the H.264/AVC encoder by an order of magnitude, and that of the decoder by a factor of 2 to 3 [Shafer, EBU Tech Review, Jan ‘03]
• Requirements– memory transfer req. = 460 GB/s
– computational req. = 300 GIPS
[Chie
n,
IEEE C
om
m.
Mag
., A
ug.
05]
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding
• Inter-viewpoint prediction + inter-viewpoint & temporal prediction– introduced to enhance compression of the simultaneous and multiple video streams by exploiting
inter-viewpoint interpolation
– add-ups to “classical” temporal interpolation
– stress memory bandwidth requirements: think H.264/AVC B-frames, but coming from neighboring view as wall
• memory hierarchy is very important!
• Intermediate view synthesis– required by FVV system that use MPEG-4 MVC as key coding technology
– against traditional encoder vs. decoder complexity distribution
– new technology for consumer market apps. where cost is key factor for success
• new solutions are needed!
• Simultaneous decoding of multiple frames to enable “Matrix-like bullet time” visual effects
• Efforts to reduce complexity are under way– algorithm’s complexity reduction
• ex: simplified prediction structures to reduce the number of reference candidates– speed-ups approaches
• ex: speed-up of block-level illumination compensation
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding
• How to address joint throughput and computational power requirements:– can not satisfy these requirements with Task Level Parallelism (TLP, i.e. functional
split of the pipeline) alone
• Data Level Parallelism (DLP, such as inner loop-level parallelism) and/or Instruction Level Parallelism (ILP, i.e. VLIW) is a must have
– DLP ex: multimedia instruction set extensions (e.g., Intel’s MMX and SSE)• H/W acceleration is a maybe
– ex1: Application Specific Instruction Set Processor (ASIP)– ex2: H/W for CABAC, CA-VLC, etc.
– multi-core architectures to beat energy/power consumption
• load balancing is key issue (see clock islands in H/W)– need clever partition
» make full use of all the resources» switch-off what is not needed when is not needed
– design vs. run time approach vs. combined approach» design time: TL and DL parallelism» run time: RTOS with multi-threading
"Von Neumann is a poor use of scaling — all the energy is going on the communication between the processor and the memory. It’s much better to use 20
microprocessors running at 100MHz than one at 2GHz"[Hugo De Man]
Recent Developments in Video Compression Standards and their Impact on Embedded Platforms: from Scalable to Multi-view Video Coding
• H.264/MPEG-4 Part 10 AVC• SVC extension of H.264/MPEG-4 AVC (FDIS Jan ’07)• MVC extension of H.264/MPEG-4 AVC (FDIS Jan ’08)
• Memory access complexity significantly increased due to frame/layer/view-point prediction– need to minimize data transfer between processing components as well as
storage requirements
• Computational complexity increased due to combination of high spatial resolution and algorithmic complexity of codec’s tools– DLP and ILP are a must have