Top Banner
Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : [email protected] Website: http://www-ee.uta.edu/dip
78

Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : [email protected] Website: .

Dec 27, 2015

Download

Documents

Gillian Ramsey
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Multimedia Processing LabNH 140

Advisor : Dr. K.R. Rao

Phone : (817) 272-3478 Email : [email protected]

Website: http://www-ee.uta.edu/dip

Page 2: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Multimedia Network

Page 3: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Home Media Ecosystem

A case for importance of research in multimedia

Page 4: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Video Redundancy – An Example

Page 5: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

The need for video compression

Video signal : Sequence of frames (images) related among temporal dimension

TV video quality: 704x576 pixels per frame, 12 bpp, 25 frames per second - > 121 Mbps

Too much data for video transmission or storage

Increasing importance of multimedia communication

t

NEED FOR VIDEO COMPRESION

Page 6: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Research Focus Areas

Coding Efficiency

Network awareness

+

implem

entatio

n?20052005

20102010

19991999

19941994

MPEG4MPEG4

H.264H.264

19921992MPEG1MPEG1

Video Conferencing

H.263H.263

20032003

Mobile Phone

Hand PC

Mobile TV

SVCHDTV

Year

MPEG2MPEG2

H.265(?)H.265(?)

mobile

Blue ray DVD

2009

MVC HVC(?)HVC(?)

PC

Page 7: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Research : Image, Video, Audio

Image Video AudioJPEG, JPEG-LS, LOCO, CALIC

MPEG 1,2,4,7, 21 Dolby True HD

JPEG 2000 H.264, H.265(?),HVC HD-AAC

JPEG XR–AIC VP6, VP7, VP8 MP3, MP3 Pro

JBIG1,2 VC–1 (WMV–9) AAC–SBR

PNG Wyner Ziv HE–AC3

GIF AVS China part 2 AVS China part 3

Dirac,Dirac Pro(BBC) ATSC (E-AC3)

Real Networks-RV10 WMA

DTS-HD Audio

Page 8: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Video Compression Standards

Standard Main Applications Year

JPEG, JPEG2000 Image 1992-1999, 2000

JBIG, JBIG2 Fax 1995-2000

H.261 Video Conferencing 1990

H.262, H.262+ DTV, SDTV, HDTV 1995, 2000

H.263, H.263++ Videophone 1998, 2000

MPEG-1 Video CD 1992

MPEG-2 DTV, SDTV, HDTV, DVD 1995

MPEG-4 Part 2 Interactive video 2000

MPEG-7 Multimedia Content description 2001

MPEG-21 Multimedia Framework 2002

H.264/MPEG-4 part 10 Advanced Video Coding 2003

Page 9: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Latest Video CodecsStandard Main Applications Year

Dirac (B.B.C.) Internet streaming to Ultra-high definition TV 2008

Dirac pro/VC-2 Studio and professional use 2009

VC-1 (SMPTE/Microsoft) Internet streaming to High definition TV 2006

VC-3 Compositing, mastering, and multi-generational use 2006

VP6 (On2 technologies) Broadcasting 2003

VP7 Broadcasting 2005

VP8 Broadcasting 2008

RV10 (Real Networks) Internet streaming 2008

AVS China IP TV , Terrestrial digital TV, Satellite broadcast, Video surveillance

2005

H.264 Fidelity Range Extensions

Studio editing, Post processing, Digital cinema 2004

H.264 SVC, MVC Scalable video coding, panaromic video 2006-2009

HVC High Efficiency Video Coding 2010 ?

Page 10: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Advanced Television Systems Committee (ATSC) 

Advanced Television Systems Committee (ATSC) www.atsc.org

A/53B ATSC Standard: Digital television standard Revision B with amendment (Video: MPEG-2, Audio: AAC), 2007

A/153 Digital TV Mobile and handheld specifications 2009 (Video: H.264) (Audio HE AACv2, ISO/ IEC 14496-3)

Digital TV in North America

Page 11: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Advanced Television Systems Committee (ATSC)…….continued

ATSC Mobile DTV includes a highly robust transmission system based on vestigial sideband (VSB) modulation coupled with a flexible and extensible IP based transport, efficient MPEG AVC (H.264) video and HE AAC v2 audio (ISO/IEC 14496-3) coding.

The Candidate Standard consists of eight parts:

• Part 1 – Mobile/Handheld Digital Television System• Part 2 – RF/Transmission System Characteristics• Part 3 – Service Multiplex and Transport Subsystem Characteristics• Part 4 – Announcement• Part 5 – Presentation Framework• Part 6 – Service Protection• Part 7 – Video System Characteristics• Part 8 – Audio System Characteristics

Page 12: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

ATSC Broadcast System

Fig 1. ATSC Broadcast system with TS main and M/H Services [1]

Page 13: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Comparison of various video compression standards

Algorithmic Element

MPEG-2 Video

(H.262)

MPEG-4 AVC(H.264)

SMPTE VC-1(Windows

Media Video 9)

Dirac(BBC)

DiracPRO(BBC)

AVS Part 2China

AVS Part 7China

Intra Prediction

None: MB encoded DC predictors

4x4 spatial16x16 spatial

I-PCM

Frequency domain

coefficient

4x4 spatial 4x4 Spatial

(forward, backward)

8×8 block based Intra Prediction

Intra_4x4 (4x4 spatial).Direct Intra Prediction

Picture coding type

FrameField

Picture AFF

FrameField

Picture AFFMB AFF

FrameField

Picture AFFMB AFF

Frame Intra – Frame,Field

(Interlace, Progressive)

Frame Frame

Motion compensation block size

16×16, 16×8, 8×16

16×16, 16×8, 8×16, 8×8,

8×4, 4×8, 4×4

16×16, 8×8 4×4 N/A 16×16, 16×8, 8×16, 8×8

16×16, 16×8, 8×16, 8×8, 8×4, 4×8

Motion vector

Precision

Full pelHalf pel

Full pelHalf pel

Quarter pel

Full pelHalf pel

Quarter pel

1/8 pel N/A 1/4 pel 1/4 pel

Page 14: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Comparison of various video compression standards

Algorithmic Element

MPEG-2 Video(H.262)

MPEG-4 AVC(H.264)

SMPTE VC-1(Windows

Media Video 9)Dirac DiracPRO

AVS Part 2

AVS Part 7

P frame typeSingle

reference

Single referenceMultiple

reference

Single reference,Intensity

compensation

Single reference,Multiple

reference

No P frames

Single and multiple

reference (maximum of 2

reference frames)

Single and multiple

reference (maximum of 2

reference frames)

B frame typeOne reference

each way

One reference each way,Multiple

reference,Direct & spatial direct weighted

prediction.

One reference each way

One reference each way,Multiple

reference

No B frames

One reference each way, Multiple

reference.Direct and

symmetrical mode.

No B frames.

In loop filters None De-blockingDe-blocking

Overlap transform

None NoneDe-blocking

filter.De-blocking

filter.

Page 15: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Comparison of various video compression standards

Algorithmic Element

MPEG-2 Video

(H.262)

MPEG-4 AVC(H.264)

SMPTE VC-1(Windows

Media Video 9)

Dirac DiracPRO AVS Part 2

AVS Part 7

Entropy coding VLC CAVLC,CABAC Adaptive VLC Arithmetic coding

Context based adaptive

binary arithmetic

coding,Exp-Golomb

coding.

2D variable length coding.

Context based adaptive 2D

variable length coding.

Transform 8×8 DCT 4×4 integer DCT8×8 integer DCT

4×4 integer DCT8×8 integer DCT

8×4 & 4×8 integer DCT

4×4 wavelet transform

4×4 wavelet

transform

8×8 DCT 4×4 DCT

Other Quantization scaling

matrices.

Quantization scaling

matrices.

Range reduction.

Instream-post processing

control

Quantization scaling matrices.

Quantization scaling

matrices.

Quantization scaling

matrices.

Quantization scaling

matrices.

Page 16: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Standards Comparison

Standard Main Compression Technologies Main Target Applications

H.264/MPEG-4 Part 10

Standardization bodyJVT (ISO/IEC & ITU-T)Main Target Bitrate8 kb/s up to about 150 Mb/s

– Integer DCT– Adaptive quantization– Zigzag reordering– Alternate Scan ordering– Predictive motion compensation– Bi-directional motion compensation– Variable block size motion compensation with small block sizes– Quarter pixel motion compensation– Motion vector over picture boundaries– Multiple reference picture motion compensation– Adaptive intra directional prediction– In-loop deblocking filter– Arithmetic coding– Variable length coding– Error resilient coding

– Broadcast over cable, terrestrial and satellite– Interactive or serial storage on optical and magnetic devices, DVD, etc– Conversational services– Video on demand– MMS over ISDN, DSL, Ethernet, LAN, wireless and mobile networks– HDTV– Digital camera

Page 17: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Standards Comparison

AVS Part 2 Standardization bodyAVS workgroupMain Target Bitrate1 Mb/s up to about 20 Mb/s

– Interlace handling: Picture-level adaptive frame/field coding (PAFF)– Macroblock-level adaptive frame/field coding (MBAFF)– Intra prediction: 5 modes for luma and 4 modes for chroma– Motion compensation: 16×16, 16×8, 8×16, 8×8 block size– Resolution of MV: 1/4-pel, 4-tap interpolation filter– Transform: 16 bit-implemented 8×8 integer cosine transform– Quantization and scaling: scaling only in encoder– Entropy coding: 2D-VLC and Arithmetic Coding– In-loop deblocking filter– Motion vector prediction–Adaptive scan

– HD broadcasting– High density storage media– Video surveillances– Video on demand

Page 18: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Standards ComparisonAVS Part 7 Standardization body

AVS workgroupMain Target Bitrate1 Mb/s up to about 20 Mb/s

– Intra prediction: 9 modes for luma and 3 modes for chroma– Motion compensation: 16×16, 16×8, 8×16, 8×8, 8×4, 4×8 block size– Resolution of MV: 1/4-pel– Transform: 16 bit-implemented 4×4 integer cosine transform– Quantization and scaling: scaling only in encoder– Entropy coding: Context based adaptive 2D variable length coding– In-loop deblocking filter

– Record and local playback on mobile devices– Multimedia Message Service (MMS)– Streaming and broadcasting– Real-time video conversation

Dirac Standardization bodyBBC R&DMozilla Public License (MPL)Main Target BitrateFew hundred kbps up to about 15 Mbps

– 4×4 wavelet transform– Dead-zone quantization and scaling – Entropy coding: Arithmetic coding– Hierarchical motion estimation– Intra, Inter prediction– Single and multiple reference P, B frames– 1/8 pel motion vector precision– 4×4 overlapped block based motion compensation (OBMC)– Daubechies wavelet filters

– Broadcasting – Live streaming video – Pod casting – Peer to peer transfers – HDTV with SD (standard definition) simulcast capability – Desktop production – News links – Archive storage – PVRs (personal video recorders)– Multilevel Mezzanine coding

Page 19: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Standards Comparison

DiracPRO (SMPTE VC-

2)

Standardization bodyBBC R&DSMPTE Main Target BitrateLossless HD to < 50 Mb/s Compression ratio 20:1

– 4×4 wavelet transform– Dead-zone quantization and scaling – Entropy coding: Context based adaptive binary arithmetic coding (CABAC), exponential Golomb coding– Intra-frame only (forward, backward prediction modes also available) – Frame, Field coding (Interlaced and progressive)– Daubechies wavelet filters

– Professional (high quality, low latency) applications (not for end user distribution)– Lossless or visually lossless compression for archives– Mezzanine compression for re-use of existing equipment– Low delay compression for live video links

SMPTE VC-1 (WMV-9)

Standardization bodySMPTE 421M Main Target Bitrate10 kbps – 8 Mbps

– Integer DCT– Adaptive block size transform: (8×8), (8×4), (4×8) and (4×4)– Motion estimation for (16×16) and (8×8) blocks– ½ pixel and ¼ pixel motion vector resolution– Dead zone and uniform quantization– Multiple VLCs– In-loop deblock filtering, fading compensation

– Media delivery over the Internet– Broadcast TV– HD DVD– Digital projection in theaters, mobile phones– DVB-T, DVB-S

Page 20: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Performance comparison of various video coding standards

Page 21: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Audio Compression StandardsStandard Main Applications Year

Dolby True HD Lossless audio, Blu-ray Disc players, A/V receivers, and home-theater

2006

HD-AAC Soundtrack applications 1997

MP3 Handheld devices 1991

MP3 Pro Handheld devices 2001

AAC–SBR DAB – High quality audio 2003

HE–AC3 Satellite or terrestrial audio broadcasting 2005

AVS China part 3 Handheld and broadcasting 2004

AC3 Pro Satellite or terrestrial audio broadcasting 2006

E-AC3 Enhanced AC-3 or Dolby Digital Plus (Multiple program streams, multi channel signals beyond 5.1)

2007

DTS – Digital Theater Systems

DTS – High Definition Audio 2008

Page 22: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Current Research Activities of MPL

Mobile Applications Development of virtual lab platform for

mobile software application

Developing a low complexity video

codec for mobile application

Complexity reduction Complexity reduction in existing video

codecs

Complexity reduction in existing audio

codecs

Quality Improvement Optimizing existing video codecs using

perceptual coding techniques

Improve Robustness Error Resilience of video streams in a

Lossy Wireless Environment

Error concealment techniques for

wireless video transmission

Transcoders Video transcoders : VP6 to H.264, H.264

to VC-1, Wyner Ziv to H.264, H.264-to-

AVS China, H.264 to DIRAC transcoders

Video/Audio Integration AVS China – Audio/Video codec –

Multiplex/demultiplex and lip sync

DIRAC video codec and AAC -

Multiplex/demultiplex and lip sync

Page 23: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Virtual lab. Platforms for Mobile SW Applications

Page 24: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Low complexity Codec Applications

SensorCamPillCamWearableCamDisposable cam.ScanCam

Pill Cam

Wearable Cam

Disposable Cam

Page 25: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Transcoding Applications

Low complexity Encoder

Low complexity

Decoder

The transcoding platforms handle the high complexity decoding on one side and high complexity encoding on the other (right) side

Page 26: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

An application scenario for transcoding

Page 27: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Error Concealment in Lossy Wireless Environment

Typical situation of 3G/4G cellular telephony

Source

Information lost due to lossy wireless network

Destination

Reconstruct lost information

Original Information

Page 28: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVSEncoder

Audio Source

AVSEncoder

VideoSource

Multiplexer

Encoded Stream

AVSDecoder

AVSDecoder

DemultiplexerLip Synch

Video

Audio

Compressed Video

Compressed Audio

Compressed Video

Compressed Audio

AVS – Audio Video Standard of China

Multiplexing of Audio/Video And Lip Sync

Page 29: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

A quick view on H.264 - Encoder

Page 30: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Profiles in H.264

4:2:2, 4:4:4, upto 12 bit depth

Page 31: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Intra Adaptive Directional Prediction 4x4 in H.264

Page 32: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Intra Adaptive Directional Prediction 8x8 in H.264

Page 33: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Intra Adaptive Directional Prediction 16x16 in H.264

Page 34: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Motion Estimation/Compensation Sizes (H.264)

Page 35: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Sub pixel accuracy for ME/MC (H.264)

bb

a cE F I JG

h

d

n

H

m

A

C

B

D

R

T

S

U

M s NK L P Q

fe g

ji k

qp r

aa

b

cc dd ee ff

hh

gg

Page 36: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Scanning of transform coefficients (H.264)

0 1 5 6

2 4 7 12

3 8 11 13

9 10 14 15

a b

0 2 8 12

1 5 9 13

3 6 10 14

4 7 11 15

Zig-zag scan Alternate scan

Page 37: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

SVC Extensions (H.264)

Page 38: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Future Standards Activities – Bit depth Scalability

LCD dynamic range – 500:1 HDR displays: Sharp “Mega-contrast”, LG.Philips - 1,000,000:1, Dolby – 250,000:1

+ + =

Tone Mapping

HDRrange

8-bitrange

Bit Depth

Scalable Coder

HDR video input10, 12, 14 bits/pixel

HDR video output(HDR storage/display)

LDR video output(conventional display)

Page 39: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Future Standards Activities – 3D Video

Consumer Electronics auto-stereoscopic display,

10+ views requiredDigital Cinema

polarized glasses, 2 views sufficient

3D Video (3DV)/Free View-Point Video (FVV) effort initiated in MPEG. Similar concept to MPEG-C. Any number of views can be recreated using depth map in the decoder.

2D video data + depth

Page 40: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Future Standards Activities – 3D Video

Paramount Pictures' Beowulf is benefiting from theaters utilizing next-generation 3D technology (grossed approximately $23.4 million of a total domestic gross over 79.4 million.” “U2 3D, the first live-action movie to be shot, produced, and screened exclusively with digital 3-D technology DreamWorks Animation is joining the digital 3-D wave

Studio plans to release all its pics in 3-D starting in 2009.”

Page 41: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Original and compressed Lena image with different methods

(a) Original Lena (51251224)

(b) AIC: 0.22bpp, PSNR=28.84dB

(c) JPEG2000: 0.22bpp, PSNR=29.57dB

Page 42: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Compressed Lena image with different methods(contd.)

(d) M-AIC: 0.22bpp, PSNR=29.02dB (e) JPEG: 0.22bpp, PSNR=24.29dB

Page 43: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVS

AVS is a set of integrity standard system-system, video, audio and media copyright management.

AVS-M is the seventh part of the video coding standard developed by AVS work group of China which aims for mobile systems and devices.

In AVS-M,a Jiben Profile has been defined which has 9 different levels.

AVS follows a layered structure for the data and this representation is seen in the coded bitstream.

Sequence layer provides an entry point into the coded video. It consists of a set of mandatory and optional downloadable parameters.

Page 44: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVS-M ENCODER

Block Diagram of AVS-M encoder [34] Block Diagram of AVS-M encoder [34]

Page 45: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVS-M DECODER

Block Diagram of AVS-M Decoder [34] Block Diagram of AVS-M Decoder [34]

Page 46: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVS-M Analysis

Page 47: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVS-M Analysis

Page 48: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVS-M Analysis

OriginalOriginal

Decoded sequenceDecoded sequence

Page 49: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

AVS-M Analysis

Page 50: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac features Direct support of multiple picture formats 4K e-cinema through to quarter common intermediate

format (QCIF) Supports I-frame only up to long group of picture (GOP)

structures Direct support of multiple chroma formats e.g.

4:4:4/4:2:2/4:2:0 Direct support of multiple bit depths e.g. 8 bit to 16 bit Direct support of interlace via metadata Direct support of multiple frame rates from 23.97 fps to

60fps Definable pixel aspect ratios Multiple color spaces with metadata Definable wavelet depth

Page 51: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Encoder

Dirac encoder architecture [1]Dirac encoder architecture [1]

Page 52: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Decoder

Dirac decoder architecture [8]Dirac decoder architecture [8]

Page 53: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Results for Miss America

Page 54: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Results for Miss America

Page 55: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Results for Miss America

Page 56: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Results for Miss America

Page 57: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Results for Miss America

Page 58: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Dirac Results for Miss America

Page 59: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Current Interns & Alumni Network

Nikshep Patil – intern @ Datamatics

Radhika Veerla (Aug 08) – job@RIM  

Theju Jacob (Aug 08) – Ph.D. student

Pooja Agawane (Aug 08) – job@Intel

Leena Agarwal (Dec 07) – job@Intel

Rahul Panchal (May 07) – job@Qualcomm

Harishankar Murugan (May 07)- job@NVidia

Sreejana Sharma (May 07)- job@Intel

Hitesh Yadav (August 06)- job@Intel

Basavaraj S. M. (May 06)- Job@Fast VDO

Rochelle Pereira (Dec 05)- job@NVidia

Sandya Sheshadri (Dec 05) – job@Microsoft

Tarun Bhatia (Dec 05)- job@wirelessventures

Vidhya Vijaykumar job@TI

Current & Recent Grads:

Jay R Padia (M.S) (May 2010) - Job @ Intel

Att Kruafak (Ph.D) – job @ Engineer CAT, Thailand

Sangseok Park (Dec 2008) (Ph.D) – job @ DiaLogic

Aruna Ravi Subramanya

Sahana Devaraju

Tejaswini Purushottam

job@microchipKrishnan -intern@ FastVDOSwaroop Suchethan - job@EricssonJennie Abraham - job@Ericsson

Page 60: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Current Interns & Alumni Network

Pragnesh Ramolia- job @ Tactel US

Nikshep Patil –job @ Marvell Semiconductors

Sreya – Intern @ RIM Shreyanka – Intern @ Intel Amruta –Intern @ RIM Tejas –Intern @ RIM Sadaf –Inern @ Ericsson

Anuradha (Dec 04) –job @ Qualcomm

Shubha Kumbadkone (Dec 04) –job @ Intel

Nandakishore (Aug 04) –job @ Qualcomm

Phani (May 04) – job @ Qualcomm

Ravi Kumar (May 04) –job @ Qualcomm

Page 61: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

ReferencesDIRAC1. T. Borer, and T. Davies, “Dirac video compression using open technology”, BBC EBU

Technical Review, July 20052. BBC Research on Dirac: http://www.bbc.co.uk/rd/projects/dirac/index.shtml3. The Dirac web page: http://dirac.sourceforge.net4. T. Davies, “The Dirac Algorithm”: http://dirac.sourceforge.net/documentation/algorithm/,

2005.5. Dirac developer support: Overlapped block-based motion compensation:

http://dirac.sourceforge.net/documentation/algorithm/algorithm/toc.htm6. “Dirac Pro to bolster BBC HD links”:

http://www.broadcastnow.co.uk/news/multi-platform/news/dirac-pro-to-bolster-bbc-hd-links/1732462.article

7. Dirac software and source code: http://diracvideo.org/download/dirac-research/8. Dirac video codec - A programmer's guide:

http://dirac.sourceforge.net/documentation/code/programmers_guide/toc.htm9. Daubechies wavelet: http://en.wikipedia.org/wiki/Daubechies_wavelet10. Daubechies wavelet filter design: http://cnx.org/content/m11159/latest/11. Dirac developer support: Wavelet transform:

http://dirac.sourceforge.net/documentation/algorithm/algorithm/wlt_transform.xht12. Dirac developer support: RDO motion estimation metric:

http://dirac.sourceforge.net/documentation/algorithm/algorithm/rdo_mot_est.xht13. A. Ravi and K.R. Rao, “Performance analysis and comparison of the DIRAC video codec

with H.264/MPEG-4 part 10 AVC”, IJWMIP , vol.4, pp. 635-654, 2011.

Page 62: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

H.2641. T.Wiegand, et al “Overview of the H.264/AVC video coding standard”, IEEE Trans. on Circuit

and Systems for Video Technology, vol.13, pp. 560-576, July 2003.2. T. Wiegand and G. J. Sullivan, “The H.264 video coding standard”, IEEE Signal Processing

Magazine, vol. 24, pp. 148-153, March 2007.3. D. Marpe, T. Wiegand and G. J. Sullivan, “The H.264/MPEG-4 AVC standard and its

applications”, IEEE Communications Magazine, vol. 44, pp. 134-143, Aug. 2006.4. S.K.Kwon, A.Tamhankar and K.R.Rao, “Overview of H.264 / MPEG-4 Part 10” J. Visual

Communication and Image Representation, vol. 17, pp.186-216, April 2006. 5. A. Puri, X. Chen and A. Luthra, “Video coding using the H.264/MPEG-4 AVC compression

standard”, Signal Processing: Image Communication, vol. 19, pp. 793-849, Oct. 2004.6. H.264 AVC JM software: http://iphome.hhi.de/suehring/tml/7. [19] H.264/MPEG-4 AVC: http://en.wikipedia.org/wiki/H.2648. M.Fieldler, “Implementation of basic H.264/AVC decoder”, seminar paper at Chemnitz

University of Technology, June 2004.9. H.264 encoder and decoder: http://www.adalta.it/Pages/407/266881_266881.jpg10. R. Schäfer, T. Wiegand and H. Schwarz, “The emerging H.264/AVC standard”, EBU

Technical Review, Jan. 2003.11. H.264 reference software download : http://iphome.hhi.de/suehring/tml/12. D. Marpe, T. Wiegand, and S. Gordon, "H.264/MPEG4-avc fidelity range extensions: tools,

profiles, performance, and application areas," in, IEEE International Conference on Image Processing, vol. 1, pp. I-593-6, 2005.

13. S. Saponara et al, "The JVT advanced video coding standard: complexity and performance analysis on a tool-by-tool basis," in Packet Video Workshop, Nantes, France, April 2003.

References

Page 63: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

VC-11. VC-1 technical overview -

http://www.microsoft.com/windows/windowsmedia/howto/articles/vc1techoverview.aspx2. Microsoft Windows Media: http://www.microsoft.com/windows/windowsmedia3. http://en.wikipedia.org/wiki/VC-14. S. Srinivasan, et al, “Windows Media Video 9: overview and applications”, Signal Processing:

Image Communication, vol .19, Issue 9, pp. 851-875, Oct. 2004 5. S. Srinivasan and S. L. Regunathan, “An overview of VC-1”, SPIE / VCIP, vol. 5960, pp. 720-

728, July 2005.AVS1. AVS Video Expert Group, “Information technology – Advanced coding of audio and video – Part 2:

Video (AVS1-P2 JQP FCD 1.0),” Audio Video Coding Standard Group of China (AVS), Doc. AVS-N1538, Sept. 2008.

2. AVS Video Expert Group, “Information technology – Advanced coding of audio and video – Part 3: Audio,” Audio Video Coding Standard Group of China (AVS), Doc. AVS-N1551, Sept. 2008.

3. L Yu et al., “Overview of AVS-Video: Tools, performance and complexity,” SPIE VCIP, vol. 5960, pp. 596021-1~ 596021-12, Beijing, China, July 2005.

4. L. Fan, S. Ma and F. Wu, “Overview of AVS video standard,” IEEE Int’l Conf. on Multimedia and Expo, ICME '04, vol. 1, pp. 423–426, Taipei, Taiwan, June 2004.

5. W. Gao et al., “AVS – The Chinese next-generation video coding standard,” National Association of Broadcasters, Las Vegas, 2004.

6. Special issue on 'AVS and its Applications' Signal Processing: Image Communication, vol. 24,pp. 245-344, April 2009..

7. AVS China software : ftp://159.226.42.57/public/avs_doc/avs_software (need password)

References

Page 64: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

References

8. AVS working group official website, http://www.avs.org.cn

9. http://www-ee.uta.edu/dip/Courses/EE5351/ISPACSAVS.pdf

10. W.Gao et al., “AVS–the Chinese next-generation video coding standard,” National Association of Broadcasters, Las Vegas, 2004.

11. L.Fan, “Mobile Multimedia Broadcasting Standards”, ISBN: 978-0-387-78263-8, Springer US, 2009

12. F.Yi et al., “Low-Complexity Tools in AVS Part 7”, J. Comput. Sci. Technol, vol.21, pp. 345-353, May. 2006

13. L.YU, S.Chen and J.Wang, “Overview of AVS-video coding standards”, Signal Process: Image Commun, vol. 24, Issue 4, pp 247-262, April 2009

14. W.Gao, “AVS–A project towards to an open and cost efficient Chinese national standard”, ITU-T VICA workshop, ITU Headquarters, Geneva, 22-23 July 2005.

15. Z. Zhang et al., “Improved intra prediction mode-decision method”, Proc. of SPIE ,Vol. 5960, pp. 59601W-1~ 59601W-9, Beijing, China, July 2005.

16. Z.. Ma et al., “Intra coding of AVS Part 7 video coding standard”, J. Comput. Sci. Technol,vol.21, Feb.2006

Page 65: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

References

17. W.Gao and T.Huang “AVS Standard -Status and Future Plan”, Workshop on Multimedia New Technologies and Application, Shenzhen, China, Oct. 2007.

18. Y.Cheng et al., “Analysis and application of error concealment tools in AVS-M decoder”, Journal of Zhejiang University –Science A, vol. 7, pp. 54-58, Jan 2006.

19. M.Liu and Z.Wei, “A fast mode decision algorithm for intra prediction in AVS-M video coding” Vol 1, ICWAPR 07, Issue, 2-4, pp.326 –331, Nov. 2007.

20. Q.Wang et al., “Context-Based 2D-VLC for Video Coding”, IEEE Int’l Conf. on Multimedia and Expo (ICME), vol.1, pp. 89-92, June. 2004.

21. http://vspc.ee.cuhk.edu.hk/~ele5431/AVS.pdf

22. W.Gao, K.N. Ngan and L.Yu, “Special issue on AVS and its applications: Guest editorial”, Signal Process: Image Commun, vol. 24, Issue 4, pp. 245-344, April 2009.

23. S.W.Ma and W.Gao, “Low Complexity Integer Transform and Adaptive Quantization Optimization”, J. Comput. Sci. Technol, vol.21, pp.354-359, May 2006.

24. S.Hu, X.Zhang and Z.Yang, “Efficient Implementation of Interpolation for AVS”, Image and Signal Processing, 2008. Congress, vol. 3, Issue, 27-30, pp.133 –138, May 2008.

25. R. Schafer and T. Sikora, “Digital video coding standards and their role in video communications”, Proc. of the IEEE, vol. 83, pp. 907-924, June 1995.

26. A. K. Jain, “Image data compression: A review”, Proc. IEEE, vol. 69, pp. 349-384, March 1981.

Page 66: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

References

JPEG, JPEG-2000, JPEG-XR (XR Extended range)

1. AIC website: http://www.bilsen.com/aic/

2. T. Wiegand et.al, “Overview of the H.264/AVC Video Coding Standard,” IEEE Trans. on Circuits and Systems for Video Technology, vol. 13, pp.560-576, July 2003.

3. G. Sullivan, P. Topiwala and A. Luthra, “The H.264/AVC Advanced Video Coding Standard: Overview and Introduction to the Fidelity Range Extensions,” SPIE Conference on Applications of Digital Image Processing XXVII, vol. 5558, pp. 53-74, Aug. 2004.

4. I. Richardson, H.264 and MPEG-4 Video Compression: Video Coding for Next-Generation Multimedia, Hoboken, NJ: Wiley, 2003.

5. P. Topiwala, “Comparative study of JPEG2000 and H.264/AVC FRExt I-frame coding on high definition video sequences,” Proc. SPIE Int’l Symposium, Digital Image Processing, vol. , pp. San Diego, Aug. 2005.

6. P. Topiwala, T. Tran and W.Dai, “Performance comparison of JPEG2000 and H.264/AVC high profile intra-frame coding on HD video sequences,” Proc. SPIE Int’l Symposium, Digital Image Processing, applications of digital image processing XXIX, vol. 6321, pp. , San Diego, Aug. 2006.

Page 67: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

References

7. T. Tran, L.Liu and P. Topiwala, “Performance comparison of leading image codecs: H.264/AVC intra, JPEG 2000, and Microsoft HD photo,” Proc. SPIE Int’l Symposium, Digital Image Processing, vol. , pp. ,San Diego, Sept. 2007.

8. G. J. Sullivan, “ ISO/IEC 29199-2 (JpegDI part 2 JPEG XR image coding – Specification),” ISO/IEC JTC 1/SC 29/WG1 N 4492, Dec. 2007

9. D. Marpe, T.Weigand and G. Sullivan, “The H.264/MPEG4 advanced video coding standards and its applications”, IEEE Communications Magazine, vol. 44, pp.134-143, Aug. 2006.

10. A. Skodras, C. Christopoulus and T. Ebrahimi, “The JPEG2000 still image compression standard,” IEEE Signal Processing Magazine, vol. 18, pp. 36-58, Sept. 2001.

11. D.S. Taubman and M.W. Marcellin, JPEG 2000: Image compression fundamentals, standards and practice, Kluwer academic publishers, 2001.

12. W.B. Pennebaker and J.L. Mitchell, JPEG: Still image data compression standard, Kluwer academic publishers, 2003.

13. D. Marpe, V. George, and T.Weigand, “Performance comparison of intra-only H.264/AVC HP and JPEG 2000 for a set of monochrome ISO/IEC test images”, JVT-M014, pp.18-22, Oct. 2004

14. D. Marpe et al, “Performance evaluation of motion JPEG2000 in comparison with H.264 / operated in intra-coding mode”, Proc. SPIE, vol. 5266, pp. 129-137, Feb. 2004.

15. Z. Xiong et al, “A comparative study of DCT- and wavelet-based image coding,” IEEE Trans. on Circuits and Systems for Video Tech., vol.9, pp. 692-695, Aug. 1999.

Page 68: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

References

16. H.264/AVC reference software (JM 13.2) Website: http://iphome.hhi.de/suehring/tml/download/17. JPEG reference software website: ftp://ftp.simtel.net/pub/simtelnet/msdos/graphics/jpegsr6.zip18. Microsoft HD photo specification: http://www.microsoft.com/whdc/xps/wmphotoeula.mspx19. JPEG2000 latest reference software (Jasper Version 1.900.0) Website:

http://www.ece.ubc.ca/mdadams/jasper20. JPEG-LS reference software website http://www.hpl.hp.com/loco/ 21. M.D. Adams, “JasPer software reference manual (Version 1.900.0),” ISO/IEC JTC 1/SC 29/WG 1

N 2415, Dec. 2007.22. M.D. Adams and F. Kossentini, “Jasper: A software-based JPEG-2000 codec implementation,” in

Proc. of IEEE Int. Conf. Image Processing, vol.2, pp 53-56, Vancouver, BC, Canada, Oct. 2000.23. M. J. Weinberger, G. Seroussi, and G. Sapiro, “LOCO-I: A low complexity, context-based, lossless

image compression algorithm”, Hewlett-Packard Laboratories, Palo Alto, CA.24. M.J. Weinberger, G. Seroussi and G. Sapiro, “The LOCO-I lossless image compression algorithm:

principles and standardization into JPEG-LS”, IEEE Trans. Image Processing, vol. 9, pp. 1309-1324, Aug.2000.

25. Ibid, “LOCO-I A low complexity context-based, lossless image compression algorithm”, Proc. 1996 DCC, pp.140-149, Snowbird, Utah, Mar. 1996.

26. K. Sayood, “Introduction to Data Compression”, Third Edition, Morgan Kaufmann Publishers, 2006.

27. M.Ghanbari, “Standard Codecs: Image Compression to Advanced Video Coding”. IEE, London, UK, 2003.

28. Z. Wang and A. C. Bovik, “Modern image quality assessment”, Morgan and Claypool Publishers, 2006.

Page 69: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

References

29. Special Issue on JPEG-2000, Signal Processing: Image Communication, vol. 17, pp. 1-144, Jan 2002.30. A. Stoica, C. Vertan, and C. Fernandez-Maloigne, “Objective and subjective color image quality evaluation for

JPEG 2000- compressed images,” IEEE Int’l Symposium on Signals, Circuits and Systems, vol. 1, pp. 137 – 140, July 2003.

31. J. J. Hwang and S. G. Cho, “Proposal for objective distortion metrics for AIC standardization”, ISO/IEC JTC 1/SC 29/WG 1 N4548, Mar 2008.

32. H. R. Wu and K. R. Rao, “Digital video image quality and perceptual coding,” Boca Raton, FL: Taylor and Francis, 2006.

33. I. H. Witten, R. M. Neal, and J. G. Cleary, “Arithmetic coding for data compression,” Communications of the ACM, vol. 30, pp. 520-540, June 1987.

34. Z. Zhang, R. Veerla and K. R. Rao, “A modified advanced image coding”, Proceedings of CANS’ 2008, Romania, Nov. 8-10, 2008.

35. X. Shang, “Structural similarity based image quality assessment: pooling strategies and applications to image compression and digit recognition,” M.S. Thesis, EE Department, The University of Texas at Arlington, Aug. 2006.

36. A. M. Eskicioglu and P. S. Fisher, “Image quality measures and their performance,” IEEE Signal Processing Letters, vol. 43, pp. 2959-2965, Dec. 1995.

37. Test images found in: http://www.hlevkin.com/default.html#testimages38. Information collected for various topics included in the material: www-ee.uta.edu/dip39. Y-L. Lee and K-H. Han, “Complexity of the proposed lossless intra for 4:4:4”, (ISO/IEC JTC1/SC29/WG11 and

ITU-T SG 16 Q.6) document JVT-Q035, 17-21 Oct. 2005.40. M. Ouaret F. Dufaux and T. Ebrahimi, “ On comparing JPEG 2000 and intraframe AVC”’,SPIE, Applications of

digital image processing XXIX, vol.6312, pp. ,Aug. 2006.41. S-T. Hsiang, “ A new subband/wavelet framework for AVC/H.264 intraframe coding and performance comparison

with motion-JPEG 2000”, VCIP, Proc of SPIE-IS& T Electronic Imaging, SPIE vol. 6822, pp. 68220P-1 thru 68220P-12, Jan. 2008.

42. S. Srinivasan et al, “An introduction to the HD photo technical design” , JPEG document wg1n4183, April 2007.

Page 70: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

References

Books

1. I. Richardson “The H.264 advanced video compression standard” Hoboken, NJ: Wiley, 2010.

Page 71: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

4x4 INTDCT in H.264

Vcodex white paper on 4x4 transform and quantization in H.264 http://www.vcodex.com/files/H264_4x4_transform_whitepaper_Apr09.pdf

  The description of the normative inverse quantization and transform process is found in the latest standard specification:

http://www.itu.int/rec/T-REC-H.264 Last, the following papers and standardization contributions contain valuable

information and insight on the transform and quantization design of H.264/MPEG-4 Part 10 AVC:

1) H. S. Malvar, A. Hallapuro, M. Karczewicz, and L. Kerofsky, “Low-Complexity Transform and Quantization in H.264/AVC”, IEEE Trans. on Circ. Sys. on Video Tech., vol. 13, pp. 598-603, July 2003, 

2) A. Hallapuro, M. Karczewicz, and H. Malvar, “Low Complexity Transform and Quantization – Part I: Basic Implementation”, JVT of ISO/IEC MPEG and ITU-T VCEG, JVT-B038, Feb. 2002. 

3) A. Hallapuro, M. Karczewicz, and H. Malvar, “Low Complexity Transform and Quantization – Part II: Extensions”, Joint Video Team of ISO/IEC MPEG and ITU-T VCEG, JVT-B039, Feb. 2002.

Page 72: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

LARGE SIZE TRANSFORMS

W.K. Cham, “Simple order-16 integer transform for video coding” IEEE ICIP 2010, Hong Kong, Sept.2010.

  R. Joshi, Y.A. Reznik and M. Karczewicz, “ Efficient large size transforms for high-

performance video coding”, SPIE 0ptics + Photonics, vol. 7798, paper 7798-31, San Diego, CA, Aug. 2010.

A.T. Hinds, “ Design of high- performance fixed-point transforms using the common factor method”, SPIE 0ptics + Photonics, vol. 7798, paper 7798-29, San Diego, CA, Aug. 2010.

G.J. Sullivan, “ Standardization of IDCT approximation behavior for video compression: the history and the new MPEG-C parts 1 and 2 standards”, SPIE vol. 6696, paper 35, Aug.2007.

I. E. Richardson , “The H.264 Advanced Video Compression Standard”, 2nd Edition, Wiley publications, 2010.

Page 73: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

High efficiency video coding (HEVC) http://www.h265.net/ has info on developments in HEVC NGVC – Next generation video

coding. Some of the tools contributing to the gain are:

(1) RD Picture Decision

(2) RDO_Q (from Qualcomm)

(3) MDDT (from Qualcomm)

(4) New Offset (from Qualcomm)

(5) Adaptive Interpolation Filter (from Qualcomm & Nokia)

(6) Block Adaptive Loop Filter (BALF) (from Toshiba)

(7) Bigger Blocks and Bigger transform (32x32 and 64x64) (Qualcomm)

(8) Motion Vector Competition (France Telecomm)

(9) Template matching JVT KTA reference software (KTA: key technical areas) 

http://iphome.hhi.de/suehring/tml/download/KTA/ G.J. Sullivan and J.-R. Ohm,“Recent developements in standardization of high

efficiency video coding“, Proc. SPIE, vol. 7798, pp. 77980V-1 thru V-7, San diego, CA Aug. 2010.

Page 74: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

NEW GENERATION VIDEO CODING (NGVC)

VCEG MPEG

(ITU-T) (ISO/IEC)

Joint collaborative team on video coding (JCT-VC) (15-23 April 2010- first meeting) Table. 1 [1]

Page 75: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Technical assessment first JCT-VC, Dresden, Germany 15-23 April 2010

All proposed algorithms are based on the traditional MC hybrid (transform-DPCM)coding approach.

Random Access Low Delay TMUC ( test Model Under Consideration) Coding Units (CU) Prediction Units (PU) Transform Units (TU)

Page 76: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Coding Units Intra prediction – upto 28 angular directions ME/MC

Inter prediction ( Multiple ref. pictures, bi-prediction, weighted prediction)

New MV competition Transform unit block size 4X4 to 64X64 ( Mode dependent directional transform MDDT and rotational transforms)

Page 77: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

ADAPTIVE LOOP FILTER

JCT- VC : Developing a well validated design called TM leading to HEVC standardization by 2011.

First version of HEVC is probably expected by end of 2012 or early 2013.

Page 78: Multimedia Processing Lab NH 140 Advisor : Dr. K.R. Rao Phone : (817) 272-3478 Email : rao@uta.edu Website: .

Explore the field of multimedia processing in

MPL @

- Dr. K.R. Rao

(817) 272-3478 [email protected]

NH 140

http://www-ee.uta.edu/dip