Joint Impact of MPEG-2 Encoding Rate and ATM Cell Losses on Video Quality

Similar documents
MPEG-2 Video Services over Packet Networks: Joint Effect of Encoding Rate and Data Loss on User-Oriented QoS

Perceptual Quality Metric as a Performance Tool for ATM. Olivier Verscheure and Xavier Garcia Adanez. Swiss Federal Institute of Technology

Reliable Transmission of MPEG-2 VBR Video Streams over New. Network Adaptation and ATM Adaptation Layers

UBR Congestion controlled Video Transmission over ATM Eltayeb Omer Eltayeb, Saudi Telecom Company

QoE Characterization for Video-On-Demand Services in 4G WiMAX Networks

Advanced Video Coding: The new H.264 video compression standard

VIDEO COMPRESSION STANDARDS

The Scope of Picture and Video Coding Standardization

Optimal Estimation for Error Concealment in Scalable Video Coding

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

4G WIRELESS VIDEO COMMUNICATIONS

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

MULTIMEDIA COMMUNICATION

MPEG-2. ISO/IEC (or ITU-T H.262)

Video Quality Analysis for H.264 Based on Human Visual System

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

PREFACE...XIII ACKNOWLEDGEMENTS...XV

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

ADAPTIVE PICTURE SLICING FOR DISTORTION-BASED CLASSIFICATION OF VIDEO PACKETS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

Multimedia Standards

The new Hybrid approach to protect MPEG-2 video header

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

No-reference perceptual quality metric for H.264/AVC encoded video. Maria Paula Queluz

Digital Video Processing

The VC-1 and H.264 Video Compression Standards for Broadband Video Services

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

Error Concealment Used for P-Frame on Video Stream over the Internet

JPEG 2000 vs. JPEG in MPEG Encoding

Cross Layer Protocol Design

RECOMMENDATION ITU-R BT.1720 *

MPEG-l.MPEG-2, MPEG-4

Comparison of Shaping and Buffering for Video Transmission

Homogeneous Transcoding of HEVC for bit rate reduction

IST MPEG-4 Video Compliant Framework

Robust MPEG-2 SNR Scalable Coding Using Variable End-of-Block

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

EE Low Complexity H.264 encoder for mobile applications

Video-Aware Wireless Networks (VAWN) Final Meeting January 23, 2014

5LSE0 - Mod 10 Part 1. MPEG Motion Compensation and Video Coding. MPEG Video / Temporal Prediction (1)

Video Communication with QoS Guarantees over HIPERLAN/2

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21

Audio and video compression

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

Zonal MPEG-2. Cheng-Hsiung Hsieh *, Chen-Wei Fu and Wei-Lung Hung

Advanced Encoding Features of the Sencore TXS Transcoder

Parameter Equipment Motivation Monitoring method. Smooth play-out Test stream

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

Tutorial T5. Video Over IP. Magda El-Zarki (University of California at Irvine) Monday, 23 April, Morning

White paper: Video Coding A Timeline

Digital video coding systems MPEG-1/2 Video

Multimedia Signals and Systems Motion Picture Compression - MPEG

MAXIMIZING BANDWIDTH EFFICIENCY

Video over ATM Networks. Table of Contents. Gregory A. Coffey,

Compressed-Domain Video Processing and Transcoding

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

Deduction of Video Quality Degradation in Transport Networks

SURVEILLANCE VIDEO FOR MOBILE DEVICES

Ch. 4: Video Compression Multimedia Systems

MPEG-4: Simple Profile (SP)

Lecture 3 Image and Video (MPEG) Coding

Mark Kogan CTO Video Delivery Technologies Bluebird TV

MPEG-2 Patent Portfolio License Illustrative Cross-Reference Chart Ctry. Patent No. Claims Category Description Standard Sections

Investigation of the GoP Structure for H.26L Video Streams

Perceptual Quality Measurement and Control: Definition, Application and Performance

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

CSCD 443/533 Advanced Networks Fall 2017

The Performance of MANET Routing Protocols for Scalable Video Communication

STATISTICAL PROPERTIES OF MPEG STREAMS AND ISSUES FOR TRAMSMISSION OF VIDEO INFORMATION IN HIGH SPEED NETWORKS

Video-Aware Link Adaption

IPTV 1

Scalable Extension of HEVC 한종기

Frequency Band Coding Mode Selection for Key Frames of Wyner-Ziv Video Coding

MODELING AND SIMULATION OF MPEG-2 VIDEO TRANSPORT OVER ATM NETWOR.KS CONSIDERING THE JITTER EFFECT

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

AUDIOVISUAL COMMUNICATION

Unit-level Optimization for SVC Extractor

CS 260: Seminar in Computer Science: Multimedia Networking

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012

CMPT 365 Multimedia Systems. Media Compression - Video Coding Standards

over the Internet Tihao Chiang { Ya-Qin Zhang k enormous interests from both industry and academia.

Experiments Involving the Transmission of Layered Video over a Local ATM Network

A real-time SNR scalable transcoder for MPEG-2 video streams

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

MITIGATING THE EFFECT OF PACKET LOSSES ON REAL-TIME VIDEO STREAMING USING PSNR AS VIDEO QUALITY ASSESSMENT METRIC ABSTRACT

Experimental Evaluation of H.264/Multiview Video Coding over IP Networks

Module 6 STILL IMAGE COMPRESSION STANDARDS

1480 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 5, OCTOBER 2012

Chapter 2 MPEG Video Compression Basics

Networking Applications

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

Video coding. Concepts and notations.

Mesh Based Interpolative Coding (MBIC)

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

Combined Copyright Protection and Error Detection Scheme for H.264/AVC

Transcription:

Published in GLOBECOM 98, Sidney, November 998 Joint Impact of MPEG- Encoding Rate and ATM Cell Losses on Video Quality Olivier Verscheure, Pascal Frossard and Maher Hamdi Institute for computer Communications and Applications Swiss Federal Institute of Technology Email: Olivier.Verscheure@epfl.ch URL: http://icawww.epfl.ch/ Abstract This work addresses the analysis of the end-user perception of CBR MPEG- video services transmitted over ATM/AAL networks. We first independently study how the perceptual video quality behaves with the constant encoding bit rate (CBR) and the network cell loss ratio (CLR). We then jointly analyze the impact of these two components onto the user-oriented QoS. All the quality assessments are based on a perceptual quality metric, MPQM, which proved to behave consistently with human judgements []. We show that, when considering video transmission over lossy networks, there is an optimal bit rate to be determined that maximizes the end-user perception of the service under some given network conditions. Introduction The Asynchronous Transfer Mode (ATM) technology is reaching a certain level of maturity that permits its deployment in local as well as in wide area networks. One of the key points of such a technology is that it may provide statistical guarantees on performance. In other words, each user may agree with the network on a set of performance parameters (i.e. Quality of Service (QoS) parameters) which will have to be met. now with the Signal Processing Laboratory (LTS) Audiovisual applications (e.g. video conferencing, video on demand, teleteaching, etc.) are foreseen as one of the major users of such broadband networks. At the heart of this revolution is the digital compression of audio and video signals. The biggest advantage of compression resides in data rate reduction which results in reduction of transmission costs. The choice of the compression algorithm mostly depends on the available bandwidth or storage capacity and the features required by the application. The MPEG- standard [], a truly integrated audio-visual standard developed by the International Organization for Standards (ISO), is capable of compressing NTSC or PAL video into an average bit rate of to 6 Mbits/s with a quality comparable to analog CATV []. However, a lot of work remains to be done to optimize these audiovisual applications so that they can be offered at attractive prices. In other words, the user expects an adequate audiovisual quality at the lowest possible cost. In the case of video transmission over packet networks, the end-to-end User-oriented Quality of Service (U-QoS) results both from the video encoding quality and the degradations due to data loss, delay and delay jitter during the transmission. This work focuses on the analysis of the quality perceived by the end-user in the case of CBR MPEG- video transmission over ATM networks. The paper is organized as follows: Section first introduces the MPEG- video and system standards and then briefly describes the impact of data loss onto the reconstructed video sequence. The analysis of the user-oriented QoS is subject of the Sec.. This section first starts with some preliminary studies as well as the description of the experimental setup and then presents some impor-

tant results concerning the impact of both data loss and encoding bit rate on the user perception of the service. Concluding remarks are given in Sec.. MPEG- over Packet Networks. MPEG- Backgrounder An MPEG- video stream is highly hierarchically structured as illustrated in Fig.. The stream consists of a sequence composed of several pictures. The MPEG- video standard defines three different types of pictures : intra-coded (I-), predicted (P-) and bidirectional (B-) pictures. The use of these three picture types allows MPEG- to be robust (I-pictures provide error propagation reset points) and efficient (B- and P- pictures allow a good overall compression ratio). Each picture is composed of slices which are, by definition, a series of macroblocks. Each macroblock (6 6 pixels) contains blocks (8 8 pixels) of luminance and, or 8 blocks of chrominance depending on the chroma format. the following information elements: sequence, Group of Pictures (GoP), picture, slice, TS and PES. In general, when a header is damaged, the underlying information is lost.. MPEG- Sensitivity to Data Loss In an MPEG- video stream, data loss reduces quality depending strongly on the type of the lost information. Losses in syntactic data, such as headers and system information, affect the quality differently than losses of semantic data such as pure video information (e.g. motion vectors, DCT coefficients, etc.). Furthermore, the quality reduction depends also on the location of the lost semantic data due, for instance, to the predictive structure of MPEG- video coded streams. Video Sequence Temporal Loss Time Temporal Loss P or B Picture P or BPicture Spatial Loss Cell Losses Spatial Loss I Picture Figure. Data loss propagation in MPEG- video streams. Figure. MPEG- video structure Before being transmitted, such a video stream goes through the MPEG- Transport Stream (TS) layer. Basically, the stream is first segmented into variablelength Packetized Elementary Stream packets and then subdivided into fixed-length TS packets of 88 bytes. Moreover, it is worth noting that a non-encoded header (i.e. syntactic information) is inserted before each of Figure shows how network losses map onto visual information losses in different types of pictures. Indeed, data loss spreads within a single picture up to the next resynchronization point (e.g. picture or slice headers) mainly due to the use of differential coding, run-length coding and variable length coding. This is referred to as spatial propagation and, it may damage any type of picture. When loss occurs in a reference picture (intra-coded or predictive frame), the damaged macroblocks will affect the non intra-coded macroblocks in subsequent frame(s) which reference the errored macroblocks. This is known as temporal propagation and is due to inter-frame predictions. Error concealment algorithms have already shown

. 0... Ski Basketball PSNR 0 Ski Basketball 0 0 0 0 0 0 0 Figure. MPQM versus encoding bit rate. Figure. PSNR versus encoding bit rate. that it is possible to reduce the impact of data loss on the visual information [,, 6]. These error concealment algorithms include, for example, spatial interpolation, temporal interpolation and early resynchronization techniques. Early resynchronization decoding techniques limit the spatial propagation of errors by decoding some semantic information that is normally discarded from the damaged MPEG- video streams. All these techniques only allow to increase the range of acceptable cell loss ratios. Data loss may still involve very annoying perturbation in the decoded video, especially when frames are lost. User-oriented QoS Analysis As stated before, the quality of a video service perceived by an end-user results both from the video encoding quality and the degradations due to data loss, delay and delay jitter during the transmission. In this work, we analyze the impact of video encoding and data loss only on the end-to-end U-QoS in the case of CBR MPEG- video services delivery.. Encoding Quality We first analyse how the video quality behaves with the constant encoding bit rate. Several studies have shown that a good estimation of subjective video quality has to incorporate some modeling of the Human Visual System [7]. In this work, we used a computational quality metric for moving pictures, namely the MPQM metric, which proved to behave consistently with human judgements []. The resulting quality rating is a value between and according to the ITU-R 00. standard [8] (the higher the value, the better the quality). In our simulations, two 8 frame-long sequences conforming to the ITU-T 60 format have been used (i.e. Basketball and Ski). These sequences have been encoded with a constant bit rate (CBR) as interlaced video, with a structure of images per GOP, B- pictures between every reference picture and one slice per line. For that purpose, the TM MPEG- software encoder has been used [9]. Figure shows how the video quality behaves according to the constant encoding bit rate for the Ski and BasketBall sequences. An important result that can be extracted from the graph is that the perceptual quality saturates at high bit rates. Increasing the bit rate may thus result, at some point, in a waste of bandwidth since the end user does not perceive an improvement in quality anymore. Additional compressed video sequences show results with the same behavior in []. Such a quality saturation is however not captured by the PSNR metric as illustrated in Fig.. Many rate control algorithms are using the PSNR as PSNR stands for Peak Signal-to-Noise Ratio

the performance evaluation tool. Both figures show that even a gain of several dbs in PSNR may not reflect any change from the user viewpoint.. Cell Loss Impact on Quality Up to this point, we did not consider any data loss in the bitstream. We now study the practical case of CBR MPEG- transmission over ATM/AAL networks [0]. Application These techniques have been used for different reasons. The first is to be consistent with real implementations and, the second, to be able to perform the perceptual measurements. Indeed, the human visual system models currently developed and the derived metrics have been tested for errors below the so-called suprathreshold of vision. The problem is that in general, the errors due to cell losses generate highly visible artifacts (i.e. holes) in the reconstructed sequence. By using concealment techniques, most of the artifacts may be considered as being below the supra-threshold of vision, making the perceptual measure accurate. Common Part Convergence Sublayer 88 Octets 76 Octets 8 Octets CPCS_PDU CPCS_PDU Trailer. SAR Sublayer ATM Layer Octets 8 Octets.. 8 ATM Cells Figure. AAL Segmentation Mechanism.. Ski Basketball Before being transmitted, the MPEG- video bitstream is encapsulated into 8800-bytes length Packetized Elementary Stream (PES) packets and divided into fixed length Transport Stream (TS) packets by the MPEG- system encoder. The transmission of video over AAL is based on the approved ATM Forum Video on Demand specification []. This scheme packetizes two TS packets into a single AAL-SDU. The AAL adds its 8-byte trailer and the resulting AAL-PDU is segmented into 8 ATM cells without any padding (see Fig. ). The simulation framework is composed of two multimedia workstations and one ATM switch. The switching module is implemented as a multiplexer with limited buffer size. To generate cell losses in the multimedia streams, we load the multiplexing stage with background traffic provided by several On-Off sources. This type of source model is widely used to simulate a multiplex of traffic such as the one that could be found at the entrance of an ATM switch. The decoder provides the error concealment techniques briefly described in Sec.. and tested in []. 0 6 0 0 0 0 Network CLR Figure 6. MPQM versus Network CLR for a fixed CBR=6 MBits/s. Figure 6 presents how the video quality behaves with the Cell Loss Ratio (CLR) (i.e. background load). It is shown that, on a semi-logarithmic scale and for a given average bit rate, first the video quality remains constant with the CLR. This constant value corresponds to the encoding quality. Then, beyond a certain CLR, the perceptual quality quickly drops.. End-User Perception of the Service In the last two subsections, we independently analyzed the impact of both the encoding bit rate and the network CLR on the end-user perception of the video service. We will show now that these two components Above the suprathreshold of vision, the MPQM metric partly looses its accuracy

..8.6 0 0 Network CLR 0 0 0 0 0.. 0 0 0 Figure 7. MPQM versus encoding bit rate and network CLR. Figure 8. MPQM versus encoding bit rate for a fixed CLR=0?. are actually not independent resulting in a complex relationship gathering all the parameters together. For this purpose, we used the same simulation setup with the Ski sequence only. Figure 7 shows how the user-oriented video quality varies with both the encoding bit rate and the network cell loss ratio. One of the most interesting feature to be extracted from the graph is the evolution of the quality with the encoding bit rate for a non-zero network CLR. Indeed, the video quality first increases up to a certain bit rate and then smoothly decreases. This behavior is better illustrated on Fig. 8 which has been extracted from the previous graph. Such an important behavior may be explained by the following: since the CLR is equal to the ratio between the cell loss rate (number of lost cells per second) and the cell rate (i.e. bit rate), then the lower the bit rate, the lower the cell loss rate for a fixed CLR. Also, the lower the cell loss rate, the lower the number of lost cells per frame on average. Therefore, the lower the bit rate, the higher the CLR for an equivalent perceived degradation (see Fig. 9). Therefore, at a given threshold, a bit rate increment involves a smaller encoding quality improvement than a quality degradation due to video data loss. Finally, when considering video transmission over lossy networks, not only it is bandwidth consuming to increase the encoding bit rate above a certain threshold (which varies according to the scene complexity), it is also quality consuming. In other words, when the user-oriented QoS is not high enough, an increase of the encoding bit rate at a constant CLR may even degrade the quality depending on the position of the working point on the D graph presented herebefore. There is actually an optimal bit rate to be determined that maximizes the end-user perception of the service under some given network conditions (i.e. network impairments). It has to be noticed that such a conclusion is general enough to be applied to a different encoding system and/or networking technology. It may be argued that the packet loss ratio increases along with the transmission rate in some packet networks. However, such an observation enforces our analysis. Indeed, the higher the PLR, the higher the number of loss hits in average. Conclusion and Future Works In this paper, we jointly analyzed the impact of MPEG- encoding rate and ATM cell losses on the user-oriented video quality. It appeared that the encoding bit rate has to be carefully chosen as soon as cell losses occur. The video quality may indeed decrase

.... CBR Mbit/s CBR 7 Mbit/s CBR Mbit/s 0 6 0 0 0 0 Network CLR Figure 9. MPQM versus network CLR for CBRs of, 7 and MBits/s. when increasing the encoding bit rate under the same network conditions. This conclusion still holds for a different encoding system and networking technology. Further works are being carried out. We are currently working on an analytical analysis of this joint impact on the video quality []. We are also investigating the effect of higher order statistics of the loss process (i.e. variance, correlations) on the perceived quality. Finally, a similar set of experiments is being applied to H.6 over IP networks. References [] C. J. van den Branden Lambrecht and O. Verscheure, Perceptual Quality Measure using a Spatio-Temporal Model of the Human Visual System, in Proceedings of the SPIE, vol. 668, pp. 0 6, San Jose, CA, January 8 - February 996. [] ISO/IEC JTC, Information Technology - Generic Coding of Moving Pictures and Associated Audio Information - Part, and, ISO/IEC JTC, 996. [] C. Lopez Fernandez, A. Basso and J.-P. Hubaux, Error Concealment and Early Resynchronization Techniques for MPEG- Video Streams Damaged by Transmission over ATM Networks, in SPIE, San Jose, USA, February 996. [] S. Lee, J. Youn and S. Jang, Transmission error detection, resynchronization, and error concealment for mpeg- video decoder, in SPIE, 99. [6] S. Aign and K. Fazel, Temporal and spatial error concealment techniques for hierarchical mpeg- video coded, in ICC 9, 99. [7] C. J. van den Branden Lambrecht, Perceptual Models and Architectures for Video Coding Applications, PhD thesis, Swiss Federal Institute of Technology, CH 0 Lausanne, Switzerland, 996. [8] ITU-R Commity, ITU-R 00. Recommendations and Reports, in ITU, vol. XI. 986. [9] C. Fogg, mpegencode/mpegdecode, in MPEG Software Simulation Group, 996. [0] X. Garcia Adanez, O. Verscheure and J.-P. Hubaux, New Network and ATM Adaptation Layers for Real-Time Multimedia Applications: A Performance Study Based on Psychophysics, in COST-7 workshop, Barcelona, Spain, November 996. [] The ATM Forum, Video on Demand Specification.0, Jan. 996. [] O. Verscheure, P. Frossard and M. Hamdi, MPEG- Video Services over Packet Networks: Joint Effect of Encoding Rate and Data Loss on User-Oriented QoS, in 8th International Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV), Cambridge, UK, July 8-0 998. [] B. G. Haskell, A. Puri and A. N. Netravali, Digital Video: an Introduction to MPEG-, Digital Multimedia Standards Series. Chapman and Hall, 997. 6