Lecture 5: Video Compression Standards (Part2) Tutorial 3 : Introduction to Histogram

Similar documents
Video Compression Standards (II) A/Prof. Jian Zhang

Lecture 4: Video Compression Standards (Part1) Tutorial 2 : Image/video Coding Techniques. Basic Transform coding Tutorial 2

Lecture 5: Error Resilience & Scalability

MPEG-4: Simple Profile (SP)

The Scope of Picture and Video Coding Standardization

Lecture 3: Image & Video Coding Techniques (II) & Standards (I) A/Prof. Jian Zhang

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

Thanks for slides preparation of Dr. Shawmin Lei, Sharp Labs of America And, Mei-Yun Hsu February Material Sources

Video Compression MPEG-4. Market s requirements for Video compression standard

Digital Video Processing

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

Multimedia Standards

Georgios Tziritas Computer Science Department

Video Compression. Learning Objectives. Contents (Cont.) Contents. Dr. Y. H. Chan. Standards : Background & History

Video Coding Standards

Video Codecs. National Chiao Tung University Chun-Jen Tsai 1/5/2015

In the name of Allah. the compassionate, the merciful

Video Compression An Introduction

Introduction to Video Coding

Introduction to Video Compression

PREFACE...XIII ACKNOWLEDGEMENTS...XV

CMPT 365 Multimedia Systems. Media Compression - Video Coding Standards

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 10 ZHU Yongxin, Winson

Standard Codecs. Image compression to advanced video coding. Mohammed Ghanbari. 3rd Edition. The Institution of Engineering and Technology

Overview : Image/Video Coding Techniques

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

Video Coding Standards: H.261, H.263 and H.26L

TECHNICAL RESEARCH REPORT

VIDEO COMPRESSION STANDARDS

10.2 Video Compression with Motion Compensation 10.4 H H.263

Overview : Image/Video Coding Techniques

Video coding. Concepts and notations.

MPEG-2. ISO/IEC (or ITU-T H.262)

4G WIRELESS VIDEO COMMUNICATIONS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

Advanced Video Coding: The new H.264 video compression standard

Video Transcoding Architectures and Techniques: An Overview. IEEE Signal Processing Magazine March 2003 Present by Chen-hsiu Huang

New Techniques for Improved Video Coding

The Basics of Video Compression

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21

IEE 5037 Multimedia Communications Lecture 12: MPEG-4

JPEG 2000 vs. JPEG in MPEG Encoding

The VC-1 and H.264 Video Compression Standards for Broadband Video Services

IST MPEG-4 Video Compliant Framework

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

MPEG-4 departs from its predecessors in adopting a new object-based coding:

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

Recommended Readings

Using animation to motivate motion

ECE 634: Digital Video Systems Scalable coding: 3/23/17

Image and Video Watermarking

Wavelet-Based Video Compression Using Long-Term Memory Motion-Compensated Prediction and Context-Based Adaptive Arithmetic Coding

5LSE0 - Mod 10 Part 1. MPEG Motion Compensation and Video Coding. MPEG Video / Temporal Prediction (1)

Lecture 3 Image and Video (MPEG) Coding

Fernando Pereira. Instituto Superior Técnico

Chapter 10. Basic Video Compression Techniques Introduction to Video Compression 10.2 Video Compression with Motion Compensation

Video Coding in H.26L

Module 6 STILL IMAGE COMPRESSION STANDARDS

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

THE H.264 ADVANCED VIDEO COMPRESSION STANDARD

MPEG-2. And Scalability Support. Nimrod Peleg Update: July.2004

Cross Layer Protocol Design

LECTURE VIII: BASIC VIDEO COMPRESSION TECHNIQUE DR. OUIEM BCHIR

MPEG-4: Overview. Multimedia Naresuan University

Ch. 4: Video Compression Multimedia Systems

Parallel Implementation of Arbitrary-Shaped MPEG-4 Decoder for Multiprocessor Systems

ITU-T DRAFT H.263 VIDEO CODING FOR LOW BITRATE COMMUNICATION LINE TRANSMISSION OF NON-TELEPHONE SIGNALS. DRAFT ITU-T Recommendation H.

CMPT 365 Multimedia Systems. Media Compression - Video

Introduction of Video Codec

Lecture 6: Compression II. This Week s Schedule

MULTIDIMENSIONAL SIGNAL, IMAGE, AND VIDEO PROCESSING AND CODING

微电子学院 School of Microelectronics. Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 12 ZHU Yongxin, Winson

Lecture 13 Video Coding H.264 / MPEG4 AVC

Lecture 1: Introduction & Image and Video Coding Techniques (I)

Optimal Estimation for Error Concealment in Scalable Video Coding

A Hybrid Temporal-SNR Fine-Granular Scalability for Internet Video

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Emerging H.26L Standard:

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

Digital video coding systems MPEG-1/2 Video

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

The new Hybrid approach to protect MPEG-2 video header

Compressed-Domain Video Processing and Transcoding

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

INTERNATIONAL TELECOMMUNICATION UNION ITU-T H.263 (03/96) TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU TRANSMISSION OF NON-TELEPHONE SIGNALS VIDEO

Digital Video Compression and H.261 Recommendation

MULTIMEDIA COMMUNICATION

Introduction to Video Encoding

Introduction to LAN/WAN. Application Layer 4

COMP 249 Advanced Distributed Systems Multimedia Networking. The Video Data Type Coding & Compression Basics

TRANSCODING OF H264 BITSTREAM TO MPEG 2 BITSTREAM. Dr. K.R.Rao Supervising Professor. Dr. Zhou Wang. Dr. Soontorn Oraintara

Transcription:

Lecture 5: Video Compression Standards (Part) Tutorial 3 : Dr. Jian Zhang Conjoint Associate Professor NICTA & CSE UNSW COMP9519 Multimedia Systems S 006 jzhang@cse.unsw.edu.au Introduction to Histogram Image Histogram 4. Digital Video Coding (DVC) Structure Hybrid MC/DPCM/DCT --Tutorial diff input MC Encoder COMP9519 Multimedia Systems Lecture 5 Slide 3 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 4 J Zhang Rec

Block Based Estimation Tutorial Block base search Block Based Estimation Tutorial Block base search Reconstructed Frame Reconstructed Frame Compensated Frame Vector W=Search Range 16 Vector W=Search Range 16 16 W 16 W W W 16 W 16 W Search Window Position of Current Block 16x16 -- Macroblock Search Window Position of Current Block Compensated MB 16x16 -- Macroblock COMP9519 Multimedia Systems Lecture 5 Slide 5 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 6 J Zhang Information Entropy Tutorial Histogram & Entropy--Tutorial Information Entropy: The Entropy is defined as the average information content per symbol of the source. The Entropy, H, can be expressed as follows: Look at a source that contains m possible symbols: {si, i=1,..m}. The occurrence probabilities: {Pi, i=1,..m}, So The info. content of a symbol si; = i( s ) = log p bits I i i The entropy of an information source is a function of occurrence probabilities. The entropy reaches the Max. when all symbols in the set are equally probable. H = m i = 1 p i log p i i bits Entropy = 7.63 bits/pixel Calculate: P1, P..Pm. H = m i = 1 p i log p i COMP9519 Multimedia Systems Lecture 5 Slide 7 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 8 J Zhang

Key operations in image/video coding -- Tutorial A -dimentional transform can be extended T T according to :[C] = [C ][T] =[T] [F][T] Key operations in image/video coding -- Tutorial Discrete Cosine Transform For a -D input block U, the transform coefficients can be found as T Y = CUC T The inverse transform can be found as U = C YC The NxN discrete cosine transform matrix C=c(k,n) is defined as: 93% of energy now in one term that is in position (0,0) c( k, n) = 1 N for k = 0 and 0 n N 1, π (n + 1) k cos for1 k N 1and 0 n N 1. N N [1] Do you get any hint to do DCT transform for a 8x8 block? COMP9519 Multimedia Systems Lecture 5 Slide 9 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 10 J Zhang Key operations in image/video coding --Tutorial Quantization weighting matrix (QWM) Diff quantizers for diff. DCT Coef. Default Qant. QWM C ij = 16 * C Recommended MPEG quantization matrix 8 16 19 6 7 9 34 16 16 4 7 9 34 37 19 6 7 9 34 34 38 6 7 9 34 37 40 6 7 9 3 35 40 48 6 7 9 3 35 40 48 58 6 7 9 34 38 46 56 69 7 9 35 38 46 56 69 83 [] Quantization results of the DCT coefficients, you can choose an appropriate quantization step-size Q {4,3}. W-Matrix. You need to tell the Q value and the QWM (left or right?) COMP9519 Multimedia Systems Lecture 5 Slide 11 J Zhang Quantized Setpsize Coef.ij ij //( Q * W weighting 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 ij ) Key operations in image/video coding -- Tutorial The distribution of -D DCT Coefficients after the quantization 51 [3] Run-length (Zig-zag) results COMP9519 Multimedia Systems Lecture 5 Slide 1 J Zhang Ref: H. Wu 68 3 5 0 0 0 10 0 4 3 0 0 0 0 9 3 0 0 0 0 0 3 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Zig-Zag scan

Key operations in image/video coding -- Tutorial DC coefficient is coded differentially as (size, amplitude). There are 1 categories of size Key operations in image/video coding -- Tutorial Zig-Zag scan 11 Note: We have block0; block1; block; block3. The predicated DC value for block0 is 0 The predicated DC value for block1 is the DC value of block0 COMP9519 Multimedia Systems Lecture 5 Slide 13 J Zhang [4] VLC of the Zig-zag scan run-length pairs for the four 8x8 sub-image blocks COMP9519 Multimedia Systems Lecture 5 Slide 14 J Zhang Digital Video Coding (DVC) Structure Video Block Structure Tutorial Typical MC/DPCM/DCT video coding architectures use the 4::0 (YCbCr) format as block data structure The architecture consists of the following layered data structure for each picture: Picture layer Slice or group of blocks (GOB) layer Macroblock (MB) layer: 16x61 pixels 4Y,Cb & Cr Block size layer: 8x8 pixels.. MC is applied to the MBs. Ref: H. Wu Digital Video Coding (DVC) Tutorial Standards MPEG-1 (ISO/IEC 1117) - -1 0 1 3 4 5 6 7 8 9 10 11 1 Coding order: 0,3,1,,6,4,5,9,7,8 Intra coded picture (I-Picture): Coded on their own (all MBs are intra) and server as random access. Predicted picture (P-Picture): Coded with reference (MC predictions) to the previous anchor I or P picture. Bidirectionally predicted picture (B-Picture): Coded with reference to the previous and/or future anchor I or P pictures (forward or backward MC prediction and/or linear interpolation). COMP9519 Multimedia Systems Lecture 5 Slide 15 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 16 J Zhang

Digital Video Coding (DVC) Tutorial Standards MPEG- Scalability Digital Video Coding (DVC) Tutorial Standards MPEG- Scalability SNR Scalability (mode encode) 16x16 16x16 8x8 layer spatially scalable coder COMP9519 Multimedia Systems Lecture 5 Slide 17 J Zhang Spatiotemporal weighted Prediction in Spa-Scal.+ Pred COMP9519 Multimedia Systems Lecture 5 Slide 18 J Zhang Drift will not be to the enhanced layer This is because that the diff. refinement coef. do feed back into the lower MC Pred loop at the encoder However, the drift will be introduced once the error or packet loss is occurred in the enhanced layer. Standards MPEG- Error Resilience Most digital storage media and communication channel are not error free. Channel coding is beyond the scope of the MPEG specification The compression scheme needs to robust to residual errors The error resilience schemes can be applied in both sides Encoder/decoder The slice structure of MPEG allows resync. after an error to limit the bit errors within a certain area Decoders may also attempt to conceal the effect of errors Five major parts form the prevision of error resilience Error Detection: It is desirable to detect quickly that an error has occurred and immediately take steps to prevent catastrophic degradation in video quality. Error detection can be based on a syntactic or a semantic basis such as illegal VLC, more 64 DCT coef. etc. Resynchronization: When synchronization is lost with the arriving bitstream, the decoder will normally go looking for a unique resync. codeword The resync. word is followed by sufficient data to allow the decoder to continue the decoding process from that point recovery: It explores to find the boundary of corrupted bitstream and then determine if the regain for decoding is utilized COMP9519 Multimedia Systems Lecture 5 Slide 19 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 0 J Zhang

Temporal/Special Localization: It prevents the error propagation by inserting regular intra coded picture, slides or macroblocks due to any transmission error. Two type of error propagations: temporal error propagation errors are propagated into next frames spatial error propagation errors are propagated within a video picture Concealment: The impact of any errors can be concealed using correctly decoded information from the current or previous pictures Conclusion: While all of the previous parts are important, the use of effective error concealment will lead to the greatest improvement in subjective quality of decoded for error corrupted bitstreams The impact of errors within decoded picture Original picture Error damaged picture COMP9519 Multimedia Systems Lecture 5 Slide 1 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide J Zhang Concealment Techniques Spatial concealment: Concealment Techniques Spatial concealment: Simple interpolation with above/below slices This technique is best suited to little spatial activity but is far less successful in areas where there is significant spatial details. Error damaged picture Spatial concealed picture COMP9519 Multimedia Systems Lecture 5 Slide 3 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 4 J Zhang

Concealment Techniques Simple temporal concealment The impact of errors within decoded picture Previous frame Current frame This technique is effective in the relatively stationary area but much less effective in the fast moving background Error damaged picture Temporal concealed picture COMP9519 Multimedia Systems Lecture 5 Slide 5 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 6 J Zhang Concealment Techniques compensated concealment It combines both temporal replacement and motion estimation High correlation among nearby MVs in a picture The assumption is the linear changes on MVs from the below to above MBs. It has been proven that this scheme can improve significantly error concealment in moving areas of the picture Concealment Techniques compensated concealment Encoder has option to transmit the concealment MVs with Intra-coded MB if they are above/below a corrupted MB Lost MB COMP9519 Multimedia Systems Lecture 5 Slide 7 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 8 J Zhang

Temporal Localization Cyclic intra-coded pictures Temporal Localization Cyclic intra-coded slices I Extra intra-coded I-pictures can be sent to replace B or P- pictures. Error propagation can be reduced at the cost of extra overhead in the bitstream This scheme will reduce the coding efficiency Extra intra-coded I-slides can be used to periodically refresh the frame from the top to the bottom over a number of frames The disadvantage is that the partial updating of the screen in a frame period will produce a noticeable windscreen wiper effect COMP9519 Multimedia Systems Lecture 5 Slide 9 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 30 J Zhang Temporal Localization Cyclic intra-coded marcoblock (MB) refreshment A fixed number of frames Spatial Localization Small Slices Mode (slice mode in MPEG-4 or H.63) The 11MBs or MBs per slice can reduce the damage to decoded picture after a corrupted MB is detected.. Extra intra-coded MBs can be inserted to refresh the frame periodically over a fixed number of frames This scheme has been widely used in current MPEG-4 codec COMP9519 Multimedia Systems Lecture 5 Slide 31 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 3 J Zhang

Spatial Localization Small Slices Mode (slice mode in MPEG-4 or H.63) Some significant factors impact on the number of bits required to code a slice in a picture (coding mode, picture activities etc.) Spatial Localization Fixed Bits Payload (Adaptive Slice Mode) The disadvantage of the small slice mode does not take into account of the packing structure to accommodate one slice per packet The fixed bits payload adaptive slice mode addresses the weak point caused by small slice Bad Case Good Case COMP9519 Multimedia Systems Lecture 5 Slide 33 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 34 J Zhang The Combination of Error Resilience Schemes Spatial interpolation Simple marcoblock temporal replacement Small slice mode Adaptive slice mode Small slice mode + compensated concealment Advanced error concealment J. Zhang, M.R. Frater and J.F. Arnold A Cell Loss Concealment Technique for MPEG Coded Video, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 4, pp. 659-665, JUNE 000. Layered Coding for error concealment Base layer important data/information High layer less important Layered Coder High Priority Low Priority Low Loss Channel High Loss Channel X X It applies to Spatial Scalability, SNR Scalability and Partitioning Scalability & Frequency Scalability COMP9519 Multimedia Systems Lecture 5 Slide 35 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 36 J Zhang

5.1 Demo for Error Resilience Demo 5. Introduction to MPEG-4 Standard New Functionalities & Associated Application for MPEG-4 Content-based Multimedia base searching Video Games 3-D movies Content-based Interactivity Improved Coding Efficiency Video telephone Video streaming Video aurthoring Mobility access Remote Monitoring Wireless LAN Universal Access COMP9519 Multimedia Systems Lecture 5 Slide 37 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 38 J Zhang 5. Introduction to MPEG-4 Standard 1. Systems Defines the architecture, multiplex and syntax of the MPEG-4 standard. Syntax component for configuring decoders with multiple coding tools Defining the compositing operation permitted by decoders for user interaction. Video Define the video coding algorithms and tools (normative parts) specifies the coding and animation of synthetic and natural hybrid video (SNHC) 3. Audio 4. Conformance Testing 5. Technical Report Software codec specification 6. DSM-CC Multimedia integration framework 5.3 MPEG-4 visual standard Video Coding and Communication MPEG-4 standard: video part -- content based video coding scheme To enable all these content-based functionalities, MPEG-4 relies on a revolutionary, content based representation of audiovisual objects. As opposed to classical rectangular video (eg: MPEG1/), MPEG-4 treats a scene as a composition of several objects that are separately encoded and decoded The scalability at the object or content level enables to distribute the available bit-rate among the objects in the scene Visually, more important objects are allocated more bits. Encoded once and automatically played out at different rates with acceptable quality for the communication environment and bandwidth at hand. COMP9519 Multimedia Systems Lecture 5 Slide 39 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 40 J Zhang

Major Functionalities in MPEG-4 Visual Standard Compression: Improved Coding Efficiency Coding of Multiple Concurrent Streams Content-Based Interactivity: Content-based Manipulation and Bitstream Editing Hybrid Natural & Synthetic Coding Improved Temp. Random Access COMP9519 Multimedia Systems Lecture 5 Slide 41 J Zhang Universal Access: Content-based Scalability in Textures, Image and Video Complexity scalability for en/de Spatial scalability Temporal scalability Quality scalability Fine grain scalability Shape & Alpha Channel Coding Robustness in Error-Prone Environments Access and manipulation of arbitrarily shaped images Ref: Thomas Sikora Object Based MPEG-4 Video Verification Model 1. In MPEG-4, scenes are composed of different objects to enable contentbased functionalities.. Flexible coding of video objects 3. Coding of a Video Object Plane (VOP) Layer COMP9519 Multimedia Systems Lecture 5 Slide 4 J Zhang Video Object Planes (VOP s) Ref: Thomas Sikora Decomposition into VOP s Ref: Thomas Sikora Original Binary Segmentation Mask Background Layer VOP Foreground Layer VOP The binary segmentation Mask is to extract the back/fore-ground layers The overlapping VOP s brining the opportunity to do the manipulation of Scene content Ref: MPEG-4 AKIYO testing video sequence COMP9519 Multimedia Systems Lecture 5 Slide 43 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 44 J Zhang

Video Object Plane layered coding Arbitrary VOP Shape MPEG-4 VOP-coder (MV) Ref: Thomas Sikora Texture DCT bitstream DCT-Based Approach for Coding VOP s Ref: Thomas Sikora Similar to H.63 Rectangular VOP (MV) Similar to H.63 Texture DCT bitstream Block diagram of the basic MPEG-4 hybrid DPCM/transform codec structure COMP9519 Multimedia Systems Lecture 5 Slide 45 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 46 J Zhang Coding of a Video Object Plane Background Padding for Compensation Padded background Ref: Thomas Sikora Ref: Thomas Sikora Previous Frame Current Frame COMP9519 Multimedia Systems Lecture 5 Slide 47 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 48 J Zhang

One Typical Example -- Sprite Coding Scalability of Video Content Spatial Scalability Coder/Decoder 1. A non-changing background only has to be transmitted once. Only foreground objects transmitted and re- Inserted at the decoder 3. Object are much smaller than full video Coder/Decoder Coder/Decoder COMP9519 Multimedia Systems Lecture 5 Slide 49 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 50 J Zhang Scalability of Video Content Temporal Scalability 60 HZ Coder/Decoder 60 HZ Coder/Decoder 30 HZ Coder/Decoder 15 HZ The MPEG-4 Video Verification Model (VM) Coding of arbitrarily shaped images (VOP s) Coding of shape and Transparency information for each VOP I,P and B frames (VOP s) Block-based hybrid MC/DCT coding approach based on ITU H.63 and MPEG techniques Temporal and Spatial Scalability of VOP s COMP9519 Multimedia Systems Lecture 5 Slide 51 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 5 J Zhang

5.4 Introduction to H.64 Video Coding Standard It started from the ITU-T H.6L Project (Long term) It aims to improve the coding efficiency up to 50% compared to MPEG-4 video coding standard In Dec. 001, MPEG and ITU-T experts set up joint video team (JVT) to focus on this new standard. The final version of the standard has been approved by ITU-T 003. H.64 video coding standard or MPEG-4 Part 10. The new technical approaches: An Adaptive deblocking loop filter to remove the artifacts Multiple frame for ME/MC Predication in Intra mode Integer transform Optimized rate control strategy (my opinion) 5.4.1 Video Codec Structure of H.64 MB of Input Image Signal - Intra/Inter Decoder 0 Coder Control Transform/ Quantizer Intra_Frame Prediction Comp. Predication Estimator Deq./Inv. Transform Deblocking Filter Control Quant. Transf. coeffs Entropy Coding Bitstream Output COMP9519 Multimedia Systems Lecture 5 Slide 53 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 54 J Zhang 5.4. Video Codec Structure of H.64 (H.6L TML-8 Design Part 1 of 4) Hybrid of DPCM/MC/Trans coding as in Prior standards. Common elements include: 16x16 macroblocks Conventional sampling of chrominance and association of luminance and chrominance data Block motion displacement vectors over picture boundaries Variable block-size motion Block transforms (not DCT, wavelets or fractals) Scalar quantization (weighted) 5.4.3 H.64: Compensation Accuracy MB of Input Image Signal - Intra/Inter Decoder 0 Coder Control Transform/ Quantizer Intra_Frame Prediction Comp. Predication Deq./Inv. Transform Deblocking Filter Control Quant. Transf. coeffs Mode 1 0 Mode 0 1 Entropy Coding Mode 3 0 1 Bitstream Output Mode 4 0 1 3 Estimator Mode 5 0 1 3 4 5 6 7 Mode 6 0 1 3 4 5 6 7 Mode 7 0 1 3 4 5 6 7 8 9 10 11 1 13 14 15 1/4 (QCIF) or 1/8 (CIF) pel COMP9519 Multimedia Systems Lecture 5 Slide 55 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 56 J Zhang

5.4.4 H.64: Multiple Reference Frames MB of Input Image Signal - Intra/Inter Decoder 0 Coder Control Transform/ Quantizer Intra_Frame Prediction Comp. Predication Deq./Inv. Transform Deblocking Filter Control Quant. Transf. coeffs Entropy Coding Bitstream Output 5.4.4 H.64: Multiple Reference Frames Compensation: Multiple reference pictures (per H.63++ Annex U) B picture prediction weighting New SP transition pictures for sequence switching Various block sizes and shapes for motion compensation (7 segmentations of the macroblock: 16x16, 16x8, 8x16, 8x8, 8x4, 4x8, 4x4) 1/4 sample (sort of per MPEG-4) and 1/8 sample accuracy motion Estimator Multiple Reference Frames for Compensation COMP9519 Multimedia Systems Lecture 5 Slide 57 J Zhang COMP9519 Multimedia Systems Lecture 5 Slide 58 J Zhang