over the Internet Tihao Chiang { Ya-Qin Zhang k enormous interests from both industry and academia.

Similar documents
Optimal Mode Selection in ICnt ernet Video Communication: An End-to-Erid Approach

MODELING AND SIMULATION OF MPEG-2 VIDEO TRANSPORT OVER ATM NETWOR.KS CONSIDERING THE JITTER EFFECT

Error Concealment Used for P-Frame on Video Stream over the Internet

Lecture 6: Internet Streaming Media

To address these challenges, extensive research has been conducted and have introduced six key areas of streaming video, namely: video compression,

Transporting Real-Time Video over the Internet: Challenges and Approaches

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

Improving Interactive Video in Wireless Networks Using Path Diversity 1

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

2 RTP Encapsulation and its Application in NS-2 Simulation

Recommended Readings

Prioritisation of Data Partitioned MPEG-4 Video over Mobile Networks Λ

Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2007

Video Streaming in Wireless Environments

Delay Constrained ARQ Mechanism for MPEG Media Transport Protocol Based Video Streaming over Internet

The RTP Encapsulation based on Frame Type Method for AVS Video

A Hybrid Temporal-SNR Fine-Granular Scalability for Internet Video

ADAPTIVE PICTURE SLICING FOR DISTORTION-BASED CLASSIFICATION OF VIDEO PACKETS

An Adaptive Multimedia Transmission Protocol for Distributed Multimedia Applications

Optimal Estimation for Error Concealment in Scalable Video Coding

Extensions to RTP to support Mobile Networking: Brown, Singh 2 within the cell. In our proposed architecture [3], we add a third level to this hierarc

Network-Adaptive Video Coding and Transmission

Multi-path Transport of FGS Video

RECOMMENDATION ITU-R BT.1720 *

MITIGATING THE EFFECT OF PACKET LOSSES ON REAL-TIME VIDEO STREAMING USING PSNR AS VIDEO QUALITY ASSESSMENT METRIC ABSTRACT

Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2006

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

Internet Streaming Media

CS 218 F Nov 3 lecture: Streaming video/audio Adaptive encoding (eg, layered encoding) TCP friendliness. References:

Layered Self-Identifiable and Scalable Video Codec for Delivery to Heterogeneous Receivers

Advanced Video Coding: The new H.264 video compression standard

Quality of Service (QoS) Whitepaper

Comparison of Shaping and Buffering for Video Transmission

Partial Reliable TCP

MaVIS: Media-aware Video Streaming Mechanism

SMART: An Efficient, Scalable and Robust Streaming Video System

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

A Transport Infrastructure Supporting Real Time Interactive MPEG-4 Client-Server Applications over IP Networks

Scalable video coding with robust mode selection

Digital Asset Management 5. Streaming multimedia

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

R-D points Predicted R-D. R-D points Predicted R-D. Distortion (MSE) Distortion (MSE)

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

Very Low Bit Rate Color Video

Error Control Techniques for Interactive Low-bit Rate Video Transmission over the Internet.

Channel-Adaptive Error Protection for Scalable Audio Streaming over Wireless Internet

Adaptive Multi-level Streaming Service using Fuzzy Similarity in Wireless Mobile Networks

Rate-distortion Optimized Streaming of Compressed Light Fields with Multiple Representations

VIDEO TRANSMISSION OVER UMTS NETWORKS USING UDP/IP

Multi-path Forward Error Correction Control Scheme with Path Interleaving

Networking Applications

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

Digital Image Stabilization and Its Integration with Video Encoder

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Improving Interactive Video in Ad-hoc Networks Using Path Diversity

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

1480 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 5, OCTOBER 2012

CS 5520/ECE 5590NA: Network Architecture I Spring Lecture 13: UDP and TCP

In the name of Allah. the compassionate, the merciful

Internet Streaming Media

PAPER Optimal Quantization Parameter Set for MPEG-4 Bit-Rate Control

Real-Time Protocol (RTP)

(a) Figure 1: Inter-packet gaps between data packets. (b)

MISB EG Motion Imagery Standards Board Engineering Guideline. 24 April Delivery of Low Bandwidth Motion Imagery. 1 Scope.

FRACTAL COMPRESSION USAGE FOR I FRAMES IN MPEG4 I MPEG4

Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA

VoIP transmission mechanism based on TCP

VIDEO COMPRESSION STANDARDS

Outline. QoS routing in ad-hoc networks. Real-time traffic support. Classification of QoS approaches. QoS design choices

Streaming H.264/AVC Video over the Internet

Real-Time Course. Video Streaming Over network. June Peter van der TU/e Computer Science, System Architecture and Networking

COMP 249 Advanced Distributed Systems Multimedia Networking. Performance of Multimedia Delivery on the Internet Today

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

Secure Scalable Streaming and Secure Transcoding with JPEG-2000

TCP PERFORMANCE FOR FUTURE IP-BASED WIRELESS NETWORKS

A MULTIPOINT VIDEOCONFERENCE RECEIVER BASED ON MPEG-4 OBJECT VIDEO. Chih-Kai Chien, Chen-Yu Tsai, and David W. Lin

Circuit Breakers for Multimedia Congestion Control

Streaming MPEG-4 Audio-Visual Objects with Quality Adaptation

RTP Profile for TCP Friendly Rate Control draft-ietf-avt-tfrc-profile-03.txt

Parameter Equipment Motivation Monitoring method. Smooth play-out Test stream

RTP: A Transport Protocol for Real-Time Applications

JPEG 2000 vs. JPEG in MPEG Encoding

An Adaptive Rate-Control Streaming Mechanism with Optimal Buffer Utilization

signal-to-noise ratio (PSNR), 2

Streaming (Multi)media

Video Redundancy Coding in H.263+ Stephan Wenger Technische Universität Berlin

CSCD 433/533 Advanced Networks Fall Lecture 14 RTSP and Transport Protocols/ RTP

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

Improving the Eectiveness of ATM Trac Control over Hybrid. Fiber-Coax Networks. Polytechnic University

For layered video encoding, video sequence is encoded into a base layer bitstream and one (or more) enhancement layer bit-stream(s).

Time. Objective 1: end to end delay below 100 ms. Shaping & Packetization. Dec. Enc. Network Adaptation. Frame Grabber. Video Adaptor.

RTP. Prof. C. Noronha RTP. Real-Time Transport Protocol RFC 1889

Real-time and smooth scalable video streaming system with bitstream extractor intellectual property implementation

QoE Characterization for Video-On-Demand Services in 4G WiMAX Networks

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

Cross-layer Feedback Control for Video Communications via Mobile Ad-hoc Networks

Transport Protocols and TCP

Congestion Feedback in RTCP

Receiver-based adaptation mechanisms for real-time media delivery. Outline

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Transcription:

An End-to-End Architecture for MPEG-4 Video Streaming over the Internet Y. Thomas Hou Dapeng Wu y Wenwu Zhu z Hung-Ju Lee x Tihao Chiang { Ya-Qin Zhang k Abstract It is a challenging problem to design an ecient MPEG-4 video delivery system that can maximize the perceptual quality while achieving high resource utilization. This paper addresses this problem by presenting an architecture of transporting MPEG-4 video over the Internet, which includes an end-to-end feedback control algorithm and a source encoding rate control algorithm. Our feedback control algorithm is capable of estimating the available bandwidth in the network based on the feedback information from the receiver, while our source encoding rate control algorithm is able to adjust the encoding rate of MPEG-4 video to the desired rate. Simulation results demonstrate that our architecture achieves good perceptual picture quality under low bit-rate and varying network conditions while eciently utilizing network resources. 1 Introduction With the success of Internet and the emerging of multimedia communication era, the new international standard, MPEG-4, is poised to become the enabling technology for multimedia communications in the next millennium [3]. MPEG-4 builds on elements from several successful technologies such as digital video, computer graphics, and the World Wide Web with the aim of providing powerful tools in the production, distribution, and display of multimedia contents with unprecedented new features and functions. With the extreme exibility and eciency provided by coding a new form of visual data called visual object (VO), it is foreseen that MPEG-4 will be capable of addressing emerging truly interactive content-based video services as well as conventional storage and transmission of video. Y. T. Hou is with Fujitsu Laboratories of America, Sunnyvale, CA, USA. y D. Wu is with Polytechnic University, Brooklyn, NY, USA. z W. Zhu is with Bell Labs, LucentTechnologies, Murray Hill, NJ, USA. x H.-J. Lee is with Sarno Corporation, Princeton, NJ, USA. { T. Chiang is with Sarno Corporation, Princeton, NJ, USA. k Y.-Q. Zhang is with Microsoft Research, Beijing, China. Recently, Internet video streaming has brought enormous interests from both industry and academia. Video streaming typically has delay and loss requirements. 1 However, delay and loss in the current Internet are drastically varying from time to time, which isvery detrimental to video streaming. Thus, it is a major challenge to design an ecient video delivery system that can maximize users' perceived quality of service (QoS) while achieving ecient network resource utilization. This paper presents an end-to-end transport architecture for MPEG-4 video over the Internet. The objective of our architecture is to achieve good perceptual quality at the application level while being adaptive to network condition and utilizing network resource eciently. More specically, our framework consists of two key components: (1) an end-to-end feedback control algorithm; and (2) an adaptive video encoding rate control algorithm. Simulation results show that our MPEG-4 transport architecture is capable of transporting MPEG-4 video over the network with good perceptual quality under low bit-rate and varying network conditions and utilize network resource eciently. Previous work on feedback control for Internet video include [6]. Since this algorithm employs multiplicative rate increase, there is frequent and large rate uctuation, which leads to large packet loss ratio. This paper extends the feedback control technique in [6] for MPEG-4 video. In particular, we design an endto-end feedback control algorithm for MPEG-4 video by employing the RTCP feedback mechanism and using packet loss as congestion indication. Our source encoding rate control in this paper is based on the following concepts and techniques: (1) an accurate second-order rate distortion model for the target bit-rate estimation, (2) a sliding window method for smoothing the impact of scene change, (3) an adaptive data-point selection criterion for model 1 Streaming video implies that the content need not be downloaded in full before it begins playing, but is played out even as it is being received and decoded.

Raw Video Feedback Control Protocol Rate Adaptive MPEG-4 Encoder RTP/UDP/IP Module Internet Feedback Control Protocol QoS Monitor RTP/UDP/IP Module MPEG-4 Decoder Figure 1: An architecture of transporting MPEG-4 video over the Internet. updating process, (4) an adaptive threshold shape control for optimal use of bit budget, and (5) a dynamic bit-rate allocation among video objects with dierent coding complexities. This algorithm has been accepted in the international standard for MPEG-4 [3]. The remainder of this paper is organized as follows. Section 2 gives an overview of our MPEG-4 transport architecture. In Section 3, we design an end-to-end feedback control algorithm for MPEG-4 video streaming. Section 4 describes the video encoding rate control algorithm. In Section 5, we use simulation results to demonstrate the performance of MPEG-4 video streaming under our transport architecture. Section 6 concludes this paper. 2 Transporting MPEG-4 Video Streams over the Internet Figure 1 gives an overview of our architecture for transporting MPEG-4 video over the Internet. In essence, the MPEG-4 video encoder is controlled by a feedback control algorithm and performs adaptive encoding based on feedback so that it can respond gracefully to network congestion. Under our architecture, we design an end system (sender and receiver) based feedback control for MPEG-4 video using the RTCP ow control mechanism. At the receiver side, the receiver monitors the packet loss ratio of the arriving packet stream and periodically conveys packet loss information to the source with RTCP feedback control packets. At the source side, there is a rate calculation algorithm that estimates the the available network bandwidth based on packet loss information in the returning RTCP packets. Section 3 presents the details of our feedback control algorithm. Based on the estimated rate by the feedback control algorithm, the source encoder performs compression so that the output rate matches the estimated rate. A good source encoding rate control should not only be able to produce the desired output rate, but also maximize the perceptual quality. Under MPEG-4 environment, adaptive encoding may be achieved by alteration of both the encoder's quantization parameters and the video frame rate. In Section 4, we will describe our source encoding rate control algorithm for MPEG-4. Our algorithm is capable of achieving the desired output rate with excellent perceptual quality and has been accepted in the international standard for MPEG-4 [3]. 3 A Feedback Control Algorithm Under our architecture, we let the MPEG-4 video source gradually increase its transmission rate to probe available network bandwidth. Such rate increase will rst have the source's rate reach the available network bandwidth. Then the source rate will overshoot the available network bandwidth and fall into the congestion region. Congestion is detected by the receiver through packet loss in the received packets. The receiver sends feedback RTCP packets to the source to inform it about congestion status. Once source receives such feedback, it decreases it transmission rate. In consistent with the RTP/RTCP standard [4], we let the source periodically send one RTCP control packet for every N s MPEG-4 video packets. The receiver sends a feedback RTCP control packet to the source upon receiving N r packets or at least once every 5 seconds. The returning RTCP packet contains the packet loss ratio P loss the receiver observed during the N r packet time interval since the previous feedback RTCP packet. Algorithm 1 Sender Behavior Feedback Control Algorithm The sender starts to transmit at the output rate r := InitRate, which is greater than or equal to its minimum rate MinRate; For every N s transmitted RTP data packets, the sender sends a forward RTCP control packet; Upon the receipt of a backward RTCP with the packet loss ratio P loss from the receiver, the output rate r at the source is adjusted according to the following rule: if (P loss P threshold ) r := minf(r + AddRate); MaxRateg; else r := maxf( r); MinRateg Receiver Behavior

INITIALIZATION Set output rate Initialize buffer size & fullness execution flow data flow PRE-ENCODING STAGE quantization parameter calculation. 3. Encoding stage: the major tasks the encoder has to complete include Frame-level target bitrate estimation Delay Actual bitcounts Buffer occupancy Multiple video objects? No Quantization calculation Encoding frame/ object/macroblock R-D model update Frame skipping control Shape threshold control Yes POST-ENCODING STAGE Macroblock rate control ENCODING STAGE Object-level target bitrate estimation Figure 2: Source encoding rate control. Delay MAD calculation Shape & Motion bits The receiver keeps track of the sequence number in the RTP header of the arriving packets; Upon receiving N r packets or at most 5 seconds, the receiver sends a feedback RTCP packet to the source containing packet loss rate P loss it observes during this time interval. 2 4 Source Encoding Rate Control A good source encoding rate control should not only be able to produce the desired output rate, but also maximize the perceptual quality. Figure 2 is a block diagram of our source encoding rate control algorithm. Our source encoding rate control is performed in four stages as follows. 1. Initialization stage: the major tasks the encoder has to complete include initializing the buer size based on the latency requirement subtracting the bit counts for the rst I- frame from the total bit counts initializing the buer occupancy in the middle level. 2. Pre-encoding stage: the tasks of the rate control scheme include estimation of the target bits further adjustment of the target bits based on the buer status for each VO encoding the video frame (object) and recording all actual bit-rate activating the macroblock-layer rate control if desired. 4. Post-encoding stage: the encoder needs to complete the following tasks updating the corresponding quadratic ratedistortion model for the entire frame or an individual VO performing the shape threshold control to balance the bit usage between shape information and texture information performing the frame skipping control to prevent the potential buer overow or underow. Our source encoding rate control achieves more accurate target bit rate allocation under the constraints of low latency and limited buer size [1, 2]. In addition to the frame-level rate control, the encoding scheme is also applicable to the macro-block-level rate control for ner bit allocation and buer control, and multiple video objects (VOs) rate control for better video object presentation when feedback indicates more network bandwidth is available. 5 Simulation Results In this section, we implement our proposed architecture and algorithms on our network simulator and perform a simulation study on MPEG-4 video transport over the network. The purpose of this section is to demonstrate that our architecture and algorithms can (1) transport MPEG-4 video streams over the network with good perceptual picture quality under both low bit rate and varying network conditions, and (2) adapt to available network bandwidth and utilize it eciently. We employ a parking lot network conguration (Fig. 3), where path G1 consists of multiple ows and traverse from the rst switch (SW1) to the last switch (SW5), path G2 starts from SW2 and terminates at the last switch (SW5), and so forth. Clearly, Link45 is the potential bottleneck link for all ows. In our simulations, path G1 consists of one MPEG- 4 source and ve TCP connections while paths G2,

G1.... G1 Link12 Link23 Link34 Link45 G2. SW1 SW2 SW3 SW4 SW5 G3 G4 G2 G3 G4 Figure 3: A parking lot network. Table 1: Simulation parameters. MaxPL 48 bits IR 1 Kbps AIR.5 Kbps PR Kbps End MR 5 Kbps System.95 (MPEG-4) N s 79 N r 25 P threshold 5% Buer size 1Mbytes End Packet size 576 bytes System Default timeout 5 ms (TCP) TCP version Reno Buer size 1 Kbytes Switch Packet processing delay 4 s Buer management Tail Dropping End system Speed 1 Mbps Link to switch Distance 1km Inter-switch Speed Kbps Distance 1 km G3, and G4 all consist of ve TCP connections, respectively. All the TCP sources are persistent during the whole simulation run. At the source side, we use the standard raw video sequence \Akiyo" in QCIF format for the MPEG-4 video encoder. The encoder performs MPEG-4 compression and adaptively adjusts its rate under our feedback control algorithm (Algorithm 1). The encoded bit stream is packetized with RTP/UDP/IP protocol overhead and sent to the network. Packets may be dropped due to congestion in the network. For arriving packets, the receiver extracts the packet content to form the bit stream for the MPEG-4 decoder. For a lost packet, the video object plane (VOP) associated with the lost packet will be discarded and a previous VOP will be copied over. Table 1 lists the parameters used in our simulation. We use 576 bytes for the path MTU. Therefore, the Figure 4: Video objects VO1 (left) and VO2 (right) in \Akiyo" MPEG-4 video sequence. maximum payload length, MaxPL, is 526 bytes (576 bytes minus 5 bytes of overhead) [5]. We run our simulation for 45 seconds. Since there is only 3 continuous frames in \Akiyo" sequence available, we repeat the video sequence cyclically during the 45-second simulation run. In the simulations, we identify two VOs as VO1 (background) and VO2 (foreground) (see Fig. 4) and encode them separately. Figure 5 shows source rate behavior during the 45 second simulation run. We nd that the source is able to adjust its rate to keep track of the varying network available bandwidth. Figure 6 shows the link utilization and packet loss ratio during the same simulation run. The oscillation in source rate (Fig. 5) and network utilization (Fig. 6) are due to the propagation delay of the links and the binary nature of our feedback control algorithm. The source performs additive rate increase until it reaches the available link bandwidth. After that the source rate overshoots and results in congestion and packet loss. Packet loss is detected at the receiver and such information is conveyed to the source. Upon receiving such feedback, the source decrease its rate. Despite the oscillations, the bottleneck link (Link45) is 1% utilized. Furthermore, we nd that the average packet loss ratio is only 1.6%, which demonstrates the eectiveness of our feedback control algorithm. Figure 7 shows the PSNR for the Y component of each VOs in the MPEG-4 video sequence at the receiver for the same simulation run. To examine the perceptual quality of the MPEG-4 video, we play out the decoded video sequence at the receiver. We observe that the reconstructed video have good perceptual quality. Figure 8 shows a sample video frame at the receiver. In summary, based on the simulation results in this section, we conclude that our architecture and algorithms can (1) transport MPEG-4 video streams over the network with good perceptual picture quality un-

Rate (kbps) 6 5 3 1 Figure 5: source. Link Utilization & Packet Loss (%) 5 1 15 25 3 35 45 1 1 8 6 The output rate of the MPEG-4 video Link34 Utilization Link23 Utilization Link12 Utilization Link45 Utilization Packet Loss Ratio 5 1 15 25 3 35 45 Figure 6: Link utilization and packet loss ratio. 5 VO1 der both low bit rate and varying network conditions; and (2) adapt to available network bandwidth and utilize it eciently. 6 Concluding Remarks This paper presents an end-to-end transport architecture for MPEG-4 video over the Internet. The proposed end-to-end transport architecture includes an end-to-end feedback control algorithm and a source encoding rate control algorithm. Simulation results show that our MPEG-4 transport architecture is capable of transporting MPEG-4 video over the network with good perceptual quality under low bit-rate and varying network conditions and utilize network resource eciently. References [1] T. Chiang and Y.-Q. Zhang, \A new rate control scheme using quadratic rate distortion model," IEEE Trans. on Circuits and Systems for VideoTechnology, vol. 7, no. 1, pp. 246{25, Feb. 1997. [2] H.-J. Lee, T. Chiang and Y.-Q. Zhang, \Scalable rate control for very low bit rate (VLBR) video," Proc. IEEE ICIP'97, pp. 768{771, Oct. 1997. [3] ISO/IEC JTC 1/SC 29/WG 11, \Information technology - coding of audio-visual objects, part 1: systems, part 2: visual, part 3: audio," FCD 14496, Dec. 1998. [4] H. Schulzrinne, S. Casner, R. Frederick and V. Jacobson, \RTP: a transport protocol for real-time applications," RFC 1889, Internet Engineering Task Force, Jan. 1996. PSNR_Y (db) 3 1 VO2 [5] H. Schulzrinne, D. Homan, M. Speer, R. Civanlar, A. Basso, V. Balabanian and C. Herpel, \RTP payload format for MPEG-4 elementary streams," Internet Draft, Internet Engineering Task Force, March 1998. 5 1 15 25 3 35 45 Figure 7: PSNR of each VO of MPEG-4 video at the receiver. [6] T. Turletti and C. Huitema, \Videoconferencing on the Internet," IEEE/ACM Trans. on Networking, vol. 4, no. 3, pp. 3{351, June 1996. Figure 8: A sample frame during the simulation run.