Video Coding. Video Coding (esp. ITU & ISO/IEC Standards) Standardization Organizations. The Scope of Picture and Video Coding Standardization

Similar documents
The Scope of Picture and Video Coding Standardization

EE Low Complexity H.264 encoder for mobile applications

Video Compression Standards (II) A/Prof. Jian Zhang

Advanced Video Coding: The new H.264 video compression standard

Digital Video Processing

Video Coding Standards: H.261, H.263 and H.26L

Introduction to Video Coding

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

Video Coding Standards

Video Codecs. National Chiao Tung University Chun-Jen Tsai 1/5/2015

Laboratoire d'informatique, de Robotique et de Microélectronique de Montpellier Montpellier Cedex 5 France

H.264 / AVC (Advanced Video Coding)

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

Standard Codecs. Image compression to advanced video coding. Mohammed Ghanbari. 3rd Edition. The Institution of Engineering and Technology

Lecture 5: Video Compression Standards (Part2) Tutorial 3 : Introduction to Histogram

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Video coding. Concepts and notations.

RECOMMENDATION ITU-R BT

Lecture 6: Compression II. This Week s Schedule

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

High Efficiency Video Coding: The Next Gen Codec. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

HEVC The Next Generation Video Coding. 1 ELEG5502 Video Coding Technology

An Efficient Table Prediction Scheme for CAVLC

Lecture 4: Video Compression Standards (Part1) Tutorial 2 : Image/video Coding Techniques. Basic Transform coding Tutorial 2

MPEG-4: Simple Profile (SP)

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

NEW CAVLC ENCODING ALGORITHM FOR LOSSLESS INTRA CODING IN H.264/AVC. Jin Heo, Seung-Hwan Kim, and Yo-Sung Ho

Editorial Manager(tm) for Journal of Real-Time Image Processing Manuscript Draft

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

An Improved H.26L Coder Using Lagrangian Coder Control. Summary

Mark Kogan CTO Video Delivery Technologies Bluebird TV

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

THE H.264 ADVANCED VIDEO COMPRESSION STANDARD

Emerging H.26L Standard:

VHDL Implementation of H.264 Video Coding Standard

Updates in MPEG-4 AVC (H.264) Standard to Improve Picture Quality and Usability

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

VIDEO COMPRESSION STANDARDS

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012

Scalable Video Coding in H.264/AVC

Fraunhofer Institute for Telecommunications - Heinrich Hertz Institute (HHI)

The Basics of Video Compression

Performance Analysis of DIRAC PRO with H.264 Intra frame coding

Transcoding from H.264/AVC to High Efficiency Video Coding (HEVC)

White paper: Video Coding A Timeline

Video Compression MPEG-4. Market s requirements for Video compression standard

4G WIRELESS VIDEO COMMUNICATIONS

MPEG-4 Part 10 AVC (H.264) Video Encoding

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Overview, implementation and comparison of Audio Video Standard (AVS) China and H.264/MPEG -4 part 10 or Advanced Video Coding Standard

Ch. 4: Video Compression Multimedia Systems

PREFACE...XIII ACKNOWLEDGEMENTS...XV

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 10 ZHU Yongxin, Winson

Reduced Frame Quantization in Video Coding

TRANSCODING OF H264 BITSTREAM TO MPEG 2 BITSTREAM. Dr. K.R.Rao Supervising Professor. Dr. Zhou Wang. Dr. Soontorn Oraintara

Video Compression An Introduction

Introduction of Video Codec

Multimedia Standards

COMPARATIVE ANALYSIS OF DIRAC PRO-VC-2, H.264 AVC AND AVS CHINA-P7

Video Compression. Learning Objectives. Contents (Cont.) Contents. Dr. Y. H. Chan. Standards : Background & History

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

Thanks for slides preparation of Dr. Shawmin Lei, Sharp Labs of America And, Mei-Yun Hsu February Material Sources

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Deblocking Filter Algorithm with Low Complexity for H.264 Video Coding

Digital Image Representation Image Compression

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

Implementation and analysis of Directional DCT in H.264

New Techniques for Improved Video Coding

The VC-1 and H.264 Video Compression Standards for Broadband Video Services

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

Recent Developments in Video Compression Standardization CVPR CLIC Workshop, Salt Lake City,

Encoding Video for the Highest Quality and Performance

Recent, Current and Future Developments in Video Coding

10.2 Video Compression with Motion Compensation 10.4 H H.263

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

Digital video coding systems MPEG-1/2 Video

High Efficiency Video Coding (HEVC) test model HM vs. HM- 16.6: objective and subjective performance analysis

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21

High Efficiency Video Coding. Li Li 2016/10/18

Video Coding in H.26L

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

CMPT 365 Multimedia Systems. Media Compression - Video Coding Standards

Introduction to Video Encoding

Lecture 3: Image & Video Coding Techniques (II) & Standards (I) A/Prof. Jian Zhang

MPEG-2. And Scalability Support. Nimrod Peleg Update: July.2004

In the name of Allah. the compassionate, the merciful

H.264 to MPEG-4 Transcoding Using Block Type Information

VIDEO CODING STANDARDS FOR MULTIMEDIA COMMUNICATION 1. Video Coding Standards for Multimedia Communication: H.261, H.

Selected coding methods in H.265/HEVC

Comparative and performance analysis of HEVC and H.264 Intra frame coding and JPEG2000

Lecture 13 Video Coding H.264 / MPEG4 AVC

COMPLEXITY REDUCTION FOR VP6 TO H.264 TRANSCODER USING MOTION VECTOR REUSE JAY R PADIA. Presented to the Faculty of the Graduate School of

Audio and video compression

IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.264 FOR BASELINE PROFILE SHREYANKA SUBBARAYAPPA

EE 5359 H.264 to VC 1 Transcoding

Transcription:

(esp. ITU & ISO/IEC Standards) Gary J. Sullivan, Ph.D. ITU-T T VCEG Rapporteur Chair ISO/IEC MPEG Rapporteur Co-Chair Chair ITU/ISO/IEC JVT Rapporteur Co-Chair Chair IEEE Fellow (2006) Microsoft Corporation Architect March 2006 Standardization Organizations Two organizations have historically dominated general-purpose video compression standardization: ITU-T Experts Group (VCEG) International Telecommunications Union Telecommunications Standardization Sector (ITU-T, a United Nations Organization, formerly CCITT), Study Group 16, Question 6 ISO/IEC Moving Picture Experts Group (MPEG) International Standardization Organization and International Electrotechnical Commission, Joint Technical Committee Number 1, Subcommittee 29, Working Group 11 Recently, the Society for Picture and Television Engineers (SMPTE) has also entered with VC-1, based on Microsoft s WMV 9 but this talk covers only the ITU and ISO/IEC work. Standards Overview March 06 Gary J. Sullivan 1 H.120 (194-19) ISO/IEC ITU-T H.261 (1990+) Chronology of International Standards MPEG-1 (1993) H.263 (1995-2000+) H.262 // MPEG-2 (1994/95-199+) MPEG-4 Visual (199-2001+) 1990 1992 1994 1996 199 2000 2002 H.264 // MPEG-4 AVC (2003-2006) Standards Overview March 06 Gary J. Sullivan 2 2004 The Scope of Picture and Standardization Only the Syntax and are standardized: Permits optimization beyond the obvious Permits complexity reduction for implementability Provides no guarantees of Quality Source Destination Pre-Processing Post-Processing & Error Recovery Encoding Decoding Scope of Standard Standards Overview March 06 Gary J. Sullivan 3 Predictive and DPCM Separate quantization of each sample is known as pulse-code modulation (PCM) Predictive or Differential PCM: Generate an estimate for the value of the input data, and encode only the remaining difference. + - Predictor (e.g. weighted sum) Quantizer Delay (Memory) + (Channel) Decoded Samples H.120 : The First Digital Standard ITU-T (ex-ccitt) Rec. H.120: The first digital video coding standard (194) v1 (194) had conditional replenishment, DPCM, scalar quantization, variable-length coding, switch for quincunx sampling v2 (19) added motion compensation and background prediction Operated at 1544 (NTSC) and 204 (PAL) kbps Few units made, essentially not in use today (Decoding Process Shown Shaded) Standards Overview March 06 Gary J. Sullivan 4 Standards Overview March 06 Gary J. Sullivan 5

Intra Picture by DCT Basic intra image representation: Discrete Cosine (DCT) (early 70s, ITU+ISO JPEG approved 92): Analyze x blocks of image according to DCT frequency content (images tend to be smooth) Find magnitude of each discrete frequency within the block Round off ( quantize ) the amounts to scaled integer values ( 50s, 60s,...) Send integer approximations to decoder using Huffman variable-length codes (VLC, early 50s) The Discrete Cosine The DCT (unitary type II DCT): Fm M 1 N 1 2 u v c M c N f m M x nn y x u y v, ) = u v (, ) cos ( ) cos ( ) 2 2 + 1 π + + M 2 + 1 π 2 2 N, n ( x = 0 y = 0 The Inverse DCT (unitary type III DCT): M 1 N 1 2 f ( mm x, nn y ) c (, ) cos ( ) cos ( ), M c N F u v x u y v + + = u v m n 2 2 + 1 π M 2 + 1 π 2 2 N u = 0 v = 0 Definition of Constants cu = 1 / 2 for u = 0, o therw ise 1. cv = 1 / 2 for v = 0, otherwise 1. M = in current visual standards N = in current visual standards Standards Overview March 06 Gary J. Sullivan 6 Standards Overview March 06 Gary J. Sullivan 7 Coefficient Scan Order: The Zig-Zag Zag Scan 0 1 2 3 4 5 6 7 9 10 11 0 0 0 0 16 17 1 19 20 21 22 23 24 25 26 27 2 29 30 31 32 33 34 35 36 37 3 39 40 41 42 43 44 45 46 47 4 49 50 51 52 53 54 55 56 57 5 59 60 61 62 63 Interframe Large areas of images stay the same from frame to frame, changing mostly due to motion Conditional Replenishment: Can signal to leave a block area of the image unchanged, or replace it with new data Interframe Difference : Could encode a refinement to the value of an area Displaced Frame Difference : Can predict an image area by copying some nearby part of the previous image (motion compensation) and optionally adding some refinement Standards Overview March 06 Gary J. Sullivan Standards Overview March 06 Gary J. Sullivan 9 P-Picture Picture Predictive I P P P P H.261: The Basis of Modern Compression ITU-T (ex-ccitt) Rec. H.261: The first widespread practical success First design (late 90) embodying typical structure dominating today: 16x16 macroblock motion compensation, x DCT, scalar quantization, zig-zag scan, and run-length variable-length coding Key aspects later dropped by other standards: loop filter, integer motion comp., 2-D VLC, header overhead v2 (early 93) added a backward-compatible high-resolution graphics trick mode Operated at 64-204 kbps Still in use, although mostly as a backward-compatibility feature overtaken by H.263 Standards Overview March 06 Gary J. Sullivan 10 Standards Overview March 06 Gary J. Sullivan 11

Blocks and The luma and chroma planes are divided into blocks. Luma blocks are associated with Cb and Cr blocks to create a macroblock. Y macroblock x sample blocks Standards Overview March 06 Gary J. Sullivan 12 Cb Cr H.261&3 Macroblock Structure = luma sample = chroma sample (two chroma fields) Decisions: 16x16 macroblock DCT of x blocks H.261: 16x16 1-pel motion H.263: 16x16 1/2-pel motion or (AP mode) x 1/2-pel motion with overlapping Standards Overview March 06 Gary J. Sullivan 13 Basic Hybrid Structure of H.261, etc. (late 90) - Quantizer 0 - Compensated Predictor Deq./Inv. Predictive with (old-fashioned) B Pictures Estimator I B P B P Standards Overview March 06 Gary J. Sullivan 14 Standards Overview March 06 Gary J. Sullivan 15 MPEG-1: Practical at Higher Rates than H.261 Formally ISO/IEC 11172-2 ( 93), developed by ISO/IEC JTC1 SC29 WG11 (MPEG) use is fairly widespread (esp. CD in Asia), but mostly overtaken by MPEG-2 Superior quality to H.261 when operated a higher bit rates ( 1 Mbps for CIF 352x2 resolution) Can provide approximately VHS quality between 1-2 Mbps using SIF 352x240/2 resolution Technical features inherited from H.261 16x16 macroblocks 16x16 motion compensation, x DCT, scalar quantization, zig-zag scan, and run-length variable-length coding Technical features added: Bi-directional motion prediction Half-pixel motion Slice-structured coding DC-only D pictures Quantization weighting matrices Standards Overview March 06 Gary J. Sullivan 16 Interlaced (Welcome to the 1940 Analog World) Vertical Vertical Horizontal Temporal Standards Overview March 06 Gary J. Sullivan 17

MPEG-2/H.262: Even Higher Bit Rates and Interlace Formally ISO/IEC 131-2 & ITU-T H.262, developed ( 94) jointly by ITU-T and ISO/IEC SC29 WG11 (MPEG) Now in very wide use for DVD and standard and high-definition DTV (the most commonly used video coding standard) Primary new technical features: Support for interlaced-scan pictures Increased DC quantization precision Also Various forms of scalability (SNR, Spatial, breakpoint) I-picture concealment motion vectors Essentially the same as MPEG-1 for progressive-scan pictures, and MPEG-1 forward compatibility required Not especially useful below 2-3 Mbps (range of use normally 2-5 Mbps SDTV broadcast, 6- DVD, 20 HDTV) Essentially fixed frame rate H.263: The Next Generation ITU-T Rec. H.263 (v1: 1995): The next generation of video coding performance, developed by ITU-T the current premier ITU-T video standard (has overtaken H.261 as dominant videoconferencing codec) Superior quality to prior standards at all bit rates (except perhaps for interlaced video) Better by a factor of two at very low rates Versions 2 (late 1997/early 199) & v3 (2000) later developed with a large number of new features Profiles defined early 2001 A somewhat tangled relationship with MPEG-4 Standards Overview March 06 Gary J. Sullivan 1 Standards Overview March 06 Gary J. Sullivan 19 What Was in H.263 Version 1? Baseline Algorithm Features beat H.261 Half-pel motion compensation (also in MPEG-1) 3-D variable length coding of DCT coefficients Median motion vector prediction More efficient coding pattern signaling (?) Deletable GOB header overhead (also in MPEG-1, but not 2?) Optional Enhanced Modes Increased motion vector range with picture extrapolation Variable-size, overlapped motion with picture extrapolation PB-frames (bi-directional prediction) Arithmetic entropy coding Continuous-presence multipoint / video mux H.263+ Feature Categories Error resilience Improved compression efficiency (e.g., 15-25% overall improvement over H.263v1) Custom and Flexible Formats Scalability for resilience and multipoint Supplemental enhancement information Standards Overview March 06 Gary J. Sullivan 20 Standards Overview March 06 Gary J. Sullivan 21 H.263++ Version 3 Features Annex U: Fidelity enhancement by macroblock and block-level reference picture selection a significant improvement in picture quality Annex V: Packet Loss & Error Resilience using data partitioning with reversible VLCs (roughly similar to MPEG-4 data partitioning, but improved by using reversible coding of motion vectors rather than coefficients) Annex W:Additional Supplemental Enhancement Information IDCT Mismatch Elimination (specific fixed-point fast IDCT) Arbitrary binary user data Text messages (arbitrary, copyright, caption, video description, and URI) Error Resilience: Picture header repetition (current, previous, next+tr, next-tr) Spare reference pictures for error concealment Interlaced field indications (top & bottom) MPEG-4 4 Visual Visual : : Baseline H.263 and Many Creative Extras MPEG-4 part 2 (v1: early 1999), formally ISO/IEC 14496-2 Contains the H.263 baseline design coding efficiency enhancements (esp. at low rates) Adds many creative new extras: more coding efficiency enhancements error resilience / packet loss enhancements segmented coding of shapes zero-tree wavelet coding of still textures coding of synthetic and semi-synthetic content, 10 & 12-bit sampling, more v2 (early 2000) & v3 (early 2001) & Standards Overview March 06 Gary J. Sullivan 22 Standards Overview March 06 Gary J. Sullivan 23

MPEG-4 Visual Focus: Simple Profile The most basic video coding profile of MPEG-4 No shape coding Progressive-scan video only Most popular in low cost / low rate / low resolution apps (e.g., mobile) top bit rate & resolution limited Basic contents H.263 baseline vectors over picture boundaries Variable block-size motion compensation Intra DCT coefficient prediction Handling of four streams in most levels Error / packet-loss features data partitioning, RVLC Standards Overview March 06 Gary J. Sullivan 24 MPEG-4 Visual Focus: Advanced Simple Profile Target goal: General rectangular video with improved coding efficiency Progressive-scan and interlaced video support Up to SDTV resolution Basic contents All of Simple profile B pictures Global motion compensation Quarter-sample motion compensation Interlace handling Standards Overview March 06 Gary J. Sullivan 25 MPEG-4 Visual Focus: Studio Profile Target goal: studio & professional use Progressive-scan and interlaced video support Up to very high resolution and bit rate Basic contents Enhanced-accuracy IDCT B pictures 10 & 12 bit sample accuracy 4:2:2 & 4:4:4 chroma sampling structures The H.264/MPEG-4 Advanced (AVC) Standard Gary J. Sullivan, Ph.D. ITU-T T VCEG Rapporteur Chair ISO/IEC MPEG Rapporteur Co-Chair Chair ITU/ISO/IEC JVT Rapporteur Co-Chair Chair Microsoft Corporation Architect Standards Overview March 06 Gary J. Sullivan 26 March 2006 The Advanced Project AVC = ITU-T T H.264 / MPEG-4 4 part 10 History: ITU-T Q.6/SG16 (VCEG - Experts Group) H.26L standardization activity (where the L stood for long-term ) Aug 1999: 1 st test model (TML-1) July 2001: MPEG open call for technology: H.26L demo ed Dec 2001: Formation of the Joint Team (JVT) between VCEG and MPEG to finalize H.26L as a new joint project (similar to MPEG-2/H.262) July 2002: Final Committee Draft status in MPEG Dec 02 technical freeze, FCD ballot approved May 03 completed in both orgs July 04 Fidelity Range Extensions (FRExt) completed Jan 05 Scalable launched H.264/MPEG-4 4 AVC Objectives Primary technical objectives: Significant improvement in coding efficiency High loss/error robustness Network Friendliness (carry it well on MPEG-2 or RTP or H.32x or in MPEG-4 file format or MPEG-4 systems or ) Low latency capability (better quality for higher latency) Exact match decoding Additional version 2 objectives (in FRExt): Professional applications (more than bits per sample, 4:4:4 color sampling, etc.) Higher-quality high-resolution video Alpha plane support (a degree of object functionality) Standards Overview March 06 Gary J. Sullivan 2 Standards Overview March 06 Gary J. Sullivan 29

Relating to Other ITU & MPEG Standards Same design to be approved in both ITU-T VCEG and ISO/IEC MPEG In ITU-T VCEG this is a new & separate standard ITU-T Recommendation H.264 ITU-T Systems (H.32x) support it In ISO/IEC MPEG this is a new part in the MPEG-4 suite Separate codec design from prior MPEG-4 visual New part 10 called Advanced (AVC similar to AAC position in MPEG-2 as separate codec) Not backward or forward compatible with prior standards (incl. the prior MPEG-4 visual spec. core technology is different) MPEG-4 Systems / File Format supports it H.222.0 MPEG-2 Systems also supports it Standards Overview March 06 Gary J. Sullivan 30 A Comparison of Performance Test of different standards (ICIP 2002 study) Using same rate-distortion optimization techniques for all codecs Streaming test: High-latency (included B frames) Four QCIF sequences coded at 10 Hz and 15 Hz (Foreman, Container, News, Tempete) and Four CIF sequences coded at 15 Hz and 30 Hz (Bus, Flower Garden, Mobile and Calendar, and Tempete) Real-time conversation test: No B frames Four QCIF sequences encoded at 10Hz and 15Hz (Akiyo, Foreman, Mother and Daughter, and Silent Voice) Four CIF sequences encoded at 15Hz and 30Hz (Carphone, Foreman, Paris, and Sean) Compare four codecs using PSNR measure: MPEG-2 (in high-latency/streaming test only) H.263 (high-latency profile, conversational high-compression profile, baseline profile) MPEG-4 Visual (simple and advanced simple profiles with & without B pictures) H.264/AVC (with & without B pictures) Note: These test results are from a private study and are not an endorsed report of the JVT, VCEG or MPEG organizations. Standards Overview March 06 Gary J. Sullivan 31 Comparison to MPEG-2, H.263, MPEG-4p2 Quality Y-PSNR [db] 39 3 37 36 35 34 33 32 31 30 29 2 27 Foreman QCIF 10Hz 0 50 100 150 200 250 Bit-rate [kbit/s] JVT/H.264/AVC MPEG-4 Visual MPEG-2 H.263 Standards Overview March 06 Gary J. Sullivan 32 Comparison to MPEG-2, H.263, MPEG-4p2 Quality Y-PSNR [db] 3 37 36 35 34 33 32 31 30 29 2 27 26 25 Tempete CIF 30Hz 0 500 1000 1500 2000 2500 3000 3500 Bit-rate [kbit/s] JVT/H.264/AVC MPEG-4 Visual MPEG-2 H.263 Standards Overview March 06 Gary J. Sullivan 33 Caution: Your Mileage Will Vary This encoding software may not represent implementation quality These tests only up to CIF (quarter-standard-definition) resolution Interlace, SDTV, and HDTV not tested in this test Test sequences may not be representative of the variety of content encountered by applications These tests so far not aligned with profile designs This study reports PSNR, but perceptual quality is what matters Computing Resources for the New Design New design includes relaxation of traditional bounds on computing resources rough guess 2-3x the MIPS, ROM & RAM requirements of MPEG-2 for decoding, 3-4x for encoding Particularly an issue for low-power (e.g., mobile) devices Problem areas: Smaller block sizes for motion compensation (cache access issues) Longer filters for motion compensation (more memory access) Multi-frame motion compensation (more memory for reference frame storage) In-loop deblocking filter (more processing & memory access) More segmentations of macroblock to choose from (more searching in the encoder) More methods of predicting intra data (more searching) Arithmetic coding (adaptivity, computation on output bits) Standards Overview March 06 Gary J. Sullivan 34 Standards Overview March 06 Gary J. Sullivan 35

H.264/MPEG-4 4 AVC Structure - Scal./ - Deblocking Standards Overview March 06 Gary J. Sullivan 36 Accuracy - Scal./ - De-blocking 16x16 MB Types 0 x Types 16x 0 x16 0 1 1 x x4 4x 0 0 1 0 1 vector accuracy 1/4 sample (6-tap filter) x 0 1 2 3 4x4 0 1 2 3 Standards Overview March 06 Gary J. Sullivan 37 Multiple Reference Frames Intra - Scal./ - De-blocking Multiple Reference Frames Generalized B Frames - Scal./ - Directional spatial prediction (9 types for 4x4 luma pred, 4 types for 16x16 luma pred, 4 types for x chroma pred) M A B C D E F G H I a b c d J e f g h K i j k l L m n o p De-blocking 3 4 7 0 5 e.g., Mode 4: diagonal down/right prediction a, f, k, p are predicted by (A + 2M + I + 2) >> 2 6 1 Standards Overview March 06 Gary J. Sullivan 3 Standards Overview March 06 Gary J. Sullivan 39 Deblocking - Scal./ 4x4 Block Integer 1 1 1 1 2 1 1 2 H = 1 1 1 1 1 2 2 1 Repeated transform of DC coeffs for x chroma and 16x16 - Intra luma blocks De-blocking - Scal./ - Deblocking Standards Overview March 06 Gary J. Sullivan 40 Standards Overview March 06 Gary J. Sullivan 41

- Scal./ - Deblocking UVLC/Exp-Golomb + CABAC or CAVLC AVC Version 1 Profiles Three profiles in version 1: Baseline, Main, and Extended Baseline (esp. conferencing & Wireless) I and P progressive-scan picture coding (not B) In-loop deblocking filter 1/4-sample motion compensation Tree-structured motion segmentation down to 4x4 block size VLC-based entropy coding Some enhanced error resilience features Flexible macroblock ordering/arbitrary slice ordering Redundant slices Standards Overview March 06 Gary J. Sullivan 42 Standards Overview March 06 Gary J. Sullivan 43 Non-Baseline AVC Version 1 Profiles Amendment 1: Fidelity-Range Extensions Main Profile (esp. Broadcast) All Baseline features except enhanced error resilience features Interlaced video handling Generalized B pictures Adaptive weighting for B and P picture prediction CABAC (arithmetic entropy coding) Extended Profile (esp. Streaming) All Baseline features Interlaced video handling Generalized B pictures Adaptive weighting for B and P picture prediction More error resilience: partitioning SP/SI switching pictures AVC standard finished 2003 ITU-T/H.264 finalized May, 2003 MPEG-4 AVC finalized July, 2003 (same thing) Only corrigenda (bug fixes) since then Fidelity-Range Extensions (FRExt) New work item initiated in July 2003 More than bits, color other than 4:2:0 Alpha coding More coding efficiency capability Also new supplemental information Standards Overview March 06 Gary J. Sullivan 44 Standards Overview March 06 Gary J. Sullivan 45 FRExt Finished July 04 New Things in FRExt Part 1 Project initiated July 2003 Motivations Higher quality, higher rates 4:4:4, 4:2:2, and also 4:2:0, 10, or 12 bits (14 bits considered and not included) Lossless Stereo Finished in one year! (July 04) Larger transforms x transform (again!) Drop 4x, x4, or larger, 16-point ed intra prediction modes for x block size Quantization matrix 4x4, x, intra, inter trans. coefficients weighted differently Old idea, dating to JPEG and before (circa 196?) Full capabilities not yet explored (visual weighting) in various color spaces 4:4:4, 4:2:2, 4:2:0, Monochrome, with/without Alpha New integer color transform (a VUI-message item) Standards Overview March 06 Gary J. Sullivan 46 Standards Overview March 06 Gary J. Sullivan 47

New Things in FRExt Part 2 x 16-Bit (Bossen( Bossen) ) Efficient lossless interframe coding Film grain characterization for analysis/synthesis representation Stereo-view video support Deblocking filter display preference 12 10 6 4 3 10 4 3 12 6 6 4 12 3 10 3 6 10 4 12 3 6 10 4 12 6 4 12 3 10 10 4 3 12 6 12 10 6 4 3 Standards Overview March 06 Gary J. Sullivan 4 Standards Overview March 06 Gary J. Sullivan 49 x Advantage (JVT-K02, IBBP coding, prog.. scan) Quantization Matrix Sequence Movie 1 Movie 2 Movie 3 Movie 4 Movie 5 Crawford Riverbed Average % BD bit-rate reduction 11.59 12.71 12.01 11.06 13.46 10.93 15.65 12.4 Similar concept to MPEG-2 design Vary step size based on frequency Adapted to modified transform structure More efficient representation of weights Eight downloadable matrices (at least 4:2:0) Intra 4x4 Y, Cb, Cr Intra x Y Inter 4x4 Y, Cb, Cr Inter x Y Standards Overview March 06 Gary J. Sullivan 50 Standards Overview March 06 Gary J. Sullivan 51 New Profiles Created by FRExt Some Notes on Quality Testing 4:2:0, -bit: High (HP) 4:2:0, 10-bit: High 10 (Hi10) 4:2:2, 10-bit: High 4:2:2 (Hi422) 4:4:4, 12-bit: High 4:4:4 (Hi444) Effectively the same tools, but acting on different input data (with a couple of exceptions in the 4:4:4 profile) Standards Overview March 06 Gary J. Sullivan 52 Use appropriate High profile (incl. adaptive transform) If testing for PSNR, use flat quant matrices Otherwise, use non-flat quant matrices Use more than 1 or 2 reference pictures Use hierarchical reference frames coding structure Use CABAC entropy coding If testing high-quality PSNR, use adaptive quantization* Use rate-distortion optimization in encoder Use large-range good-quality motion search Use bi-predictive search optimization (see JVT-N014) * = See G. Sullivan & S. Sun, On Dead-Zone, VCIP 2005/JVT-N011 Standards Overview March 06 Gary J. Sullivan 53

A Performance Test for High Profile (from JVT-L033 - Panasonic) Subjective tests by Blu-Ray Disk Founders of FRExt HP 4:2:0/ (HP) 1920x100x24p (100p), 3 clips. Nominal 3:1 advantage to MPEG-2 Mbps HP scored better than 24 Mbps MPEG-2 Apparent transparency at 16 Mbps Mean Opinion Score 5 4.5 4 3.5 3 2.5 Figure 1: Results of subjective test with studio participants (Blu-Ray Disk Founders) 3.65 3.71 4.00 2 H.264/AVC H.264/AVC H.264/AVC H.264/AVC Original MPEG2 24 Mbs, FRExt FRExt FRExt FRExt DVHS Standards Mbps Overview 12Mbps March 0616Mbps 20Mbps Gary J. Sullivan 54 emulation 3.90 4.03 3.59 5: Perfect 4: Good 3: Fair (OK for DVD) 2: Poor 1: Very Poor For Further Information JVT, MPEG, and VCEG management team members: Gary Sullivan (garysull@microsoft.com) Jens Ohm (ohm@ient.rwth-aachen.de) Ajay Luthra (aluthra@motorola.com) Thomas Wiegand (wiegand@hhi.de) IEEE Transactions on Circuits and Systems for Technology Special Issue on H.264/AVC (July 2003) Paper in Proceedings of IEEE January 2005 (Sullivan & Wiegand) I. Richardson, H.264 and MPEG-4 Compression, 2003 Overview incl. FRExt: SPIE Aug 2004 (Sullivan, Topiwala, & Luthra) Paper at VCIP 2005: Meta-overview and deployment Wikipedia H.264 page Standards Overview March 06 Gary J. Sullivan 55