Channel-Adaptive Error Protection for Scalable Audio Streaming over Wireless Internet

Similar documents
WITH the emerging third-generation (3G) wireless technology,

Error Resilient Image Transmission over Wireless Fading Channels

Multi-path Forward Error Correction Control Scheme with Path Interleaving

Recommended Readings

Wireless Video Transmission: A Single Layer Distortion Optimal Approach

MOBILE VIDEO COMMUNICATIONS IN WIRELESS ENVIRONMENTS. Jozsef Vass Shelley Zhuang Jia Yao Xinhua Zhuang. University of Missouri-Columbia

Modified SPIHT Image Coder For Wireless Communication

LIVE VIDEO STREAMING OVER PACKET NETWORKS AND WIRELESS CHANNELS

ADAPTIVE PICTURE SLICING FOR DISTORTION-BASED CLASSIFICATION OF VIDEO PACKETS

Quality of Service (QoS) Whitepaper

Coding for the Network: Scalable and Multiple description coding Marco Cagnazzo

Adaptive Forward Error Correction for Real-time Internet Video

Delay Constrained ARQ Mechanism for MPEG Media Transport Protocol Based Video Streaming over Internet

Module 6 STILL IMAGE COMPRESSION STANDARDS

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P

SNR Scalable Transcoding for Video over Wireless Channels

Multidimensional Layered Forward Error Correction using Rateless Codes

ADAPTIVE JOINT H.263-CHANNEL CODING FOR MEMORYLESS BINARY CHANNELS

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

SMART: An Efficient, Scalable and Robust Streaming Video System

Scalable Video Coding

A Hybrid Temporal-SNR Fine-Granular Scalability for Internet Video

Cost-based cache replacement and server selection for multimedia proxy across wireless Internet

Bandwidth Aggregation with Path Interleaving Forward Error Correction Mechanism for Delay-Sensitive Video Streaming in Wireless Multipath Environments

SMART: An Efficient, Scalable, and Robust Streaming Video System

A NOVEL PACKET LOSS RECOVERY TECHNIQUE FOR MULTIMEDIA COMMUNICATION

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

Optimal Estimation for Error Concealment in Scalable Video Coding

ESTIMATION OF THE UTILITIES OF THE NAL UNITS IN H.264/AVC SCALABLE VIDEO BITSTREAMS. Bin Zhang, Mathias Wien and Jens-Rainer Ohm

An Optimal UEP Scheme of Audio Transmission over MIMO Wireless Links

Survey on Concurrent Multipath Scheduling for Real Time Video Streaming in Wireless Network

A New Configuration of Adaptive Arithmetic Model for Video Coding with 3D SPIHT

REGION-BASED SPIHT CODING AND MULTIRESOLUTION DECODING OF IMAGE SEQUENCES

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

40 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 15, NO. 1, JANUARY 2006

MULTICAST is a promising technique for cost effectively. Sender-Adaptive and Receiver-Driven Layered Multicast for Scalable Video Over the Internet

Scalable Video Transport over Wireless IP Networks. Dr. Dapeng Wu University of Florida Department of Electrical and Computer Engineering

THE design of robust systems for the transmission of images

Unequal Packet Loss Protected Transmission for FGS Video. Lianji Cheng; Li Song; Songyu Yu

Packet-Level Forward Error Correction in Video Transmission

Comparison of Shaping and Buffering for Video Transmission

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

Multi-path Transport of FGS Video

Scalable Video/Image Transmission using Rate Compatible PUM Turbo Codes

TCP PERFORMANCE FOR FUTURE IP-BASED WIRELESS NETWORKS

Xiaoqing Zhu, Sangeun Han and Bernd Girod Information Systems Laboratory Department of Electrical Engineering Stanford University

Distributed Video Coding

Rate Distortion Optimized Joint ARQ-FEC Scheme for Real-Time Wireless Multimedia

A 3-D Virtual SPIHT for Scalable Very Low Bit-Rate Embedded Video Compression

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

JPEG 2000 vs. JPEG in MPEG Encoding

MODELING AND SIMULATION OF MPEG-2 VIDEO TRANSPORT OVER ATM NETWOR.KS CONSIDERING THE JITTER EFFECT

Overview. A Survey of Packet-Loss Recovery Techniques. Outline. Overview. Mbone Loss Characteristics. IP Multicast Characteristics

Systematic Lossy Error Protection for Video Transmission over Wireless Ad Hoc Networks

Advances in Efficient Resource Allocation for Packet-Based Real-Time Video Transmission

Layered Multicast with Forward Error Correction (FEC) for Internet Video

ELEC 691X/498X Broadcast Signal Transmission Winter 2018

A Model for Streaming 3D Meshes and Its Applications

Effect of RED and different packet sizes on Multimedia performance over wireless networks

Robust Video Coding. Heechan Park. Signal and Image Processing Group Computer Science Department University of Warwick. for CS403

Layered Self-Identifiable and Scalable Video Codec for Delivery to Heterogeneous Receivers

ARCHITECTURES OF INCORPORATING MPEG-4 AVC INTO THREE-DIMENSIONAL WAVELET VIDEO CODING

Cross-Layer Techniques for Adaptive Video Streaming over Wireless Networks

Improving the quality of H.264 video transmission using the Intra-Frame FEC over IEEE e networks

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

International Journal of Research Available at

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

Final Report. ARO grant DAAD (1 October December 2003) Streaming Video Compression for Heterogeneous Networks

FORWARD ERROR CORRECTION CODING TECHNIQUES FOR RELIABLE COMMUNICATION SYSTEMS

Multimedia Data Transmission over Mobile Internet using Packet-Loss Punctured (PaLoP) Codes

CS 641 Project Report Error resilient video transmission over wireless networks. December Gang Ding

An Unequal Packet Loss Protection Scheme for H.264/AVC Video Transmission

Unit-level Optimization for SVC Extractor

Jimin Xiao, Tammam Tillo, Senior Member, IEEE, Yao Zhao, Senior Member, IEEE

signal-to-noise ratio (PSNR), 2

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

Receiver-based adaptation mechanisms for real-time media delivery. Outline

CC-SCTP: Chunk Checksum of SCTP for Enhancement of Throughput in Wireless Network Environments

Video Streaming in Wireless Environments

Improving TCP-Friendly Rate Control in Wired and Wireless Networks By a Scheme Based on Wireless Signal Strength

Proxy-based TCP-friendly streaming over mobile networks

over the Internet Tihao Chiang { Ya-Qin Zhang k enormous interests from both industry and academia.

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

MITIGATING THE EFFECT OF PACKET LOSSES ON REAL-TIME VIDEO STREAMING USING PSNR AS VIDEO QUALITY ASSESSMENT METRIC ABSTRACT

Optimal Audio Transmission over Wireless Tandem Channels

Achieving Reliable Digital Data Communication through Mathematical Algebraic Coding Techniques

WITH THE rapid advance of wireless local area network

CSE 123: Computer Networks

Bayesian Piggyback Control for Improving Real-Time Communication Quality

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

EXPLORING ON STEGANOGRAPHY FOR LOW BIT RATE WAVELET BASED CODER IN IMAGE RETRIEVAL SYSTEM

A Crypto Signature Scheme for Image Authentication over Wireless Channel

Request for Comments: 5109 December 2007 Obsoletes: 2733, 3009 Category: Standards Track. RTP Payload Format for Generic Forward Error Correction

The Performance of MANET Routing Protocols for Scalable Video Communication

Enhanced Parity Packet Transmission for Video Multicast using R-DSTC

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

4G WIRELESS VIDEO COMMUNICATIONS

Error Control for Receiver-Driven Layered Multicast of Audio and Video

Adaptive Cross-Layer Protection Strategies for Robust Scalable Video Transmission Over WLANs

Transcription:

Channel-Adaptive Error Protection for Scalable Audio Streaming over Wireless Internet GuiJin Wang Qian Zhang Wenwu Zhu Jianping Zhou Department of Electronic Engineering, Tsinghua University, Beijing, P. R. China Microsoft Research, China, No. 49, Zhichun Road Haidian District, Beijing, P. R. China ABSTRACT ü Streaming high-fidelity audio over networks with both bit errors and packet erasures is becoming increasingly important due to the emerging of wireless Internet. In this paper, we propose an end-to-end architecture for scalable audio streaming over wireless Internet. Considering the characteristic of scalable audio, a novel layered product code is presented to handle bit errors and packet losses simultaneously. Specifically, unequal row channel code and unequal column code are adopted for different layers of scalable audio based on their quality impacts. Moreover, Rate-Distortion based bit allocation is proposed to determine each channel-coding rate and the source-coding rate so as to minimize the expected end-to-end distortion. The simulation results demonstrated effectiveness of our proposed error protection scheme. I. INTRODUCTION With the explosive growth of the Internet and the emerging of the third generation (3G) wireless technology, it is becoming increasingly important to streaming high-fidelity audio in the handheld devices across wireless Internet. However, transmission of audio across wireless Internet still remains a very challenging task due to the limited and varying network bandwidth as well as the different transmission errors occurred on the Internet and wireless links. In the heterogeneous network, the data packets can experience packet losses caused by congestion at routers and bit errors resulting from fading and shadowing effects on the wireless link. Both these errors usually have severe adverse effects on the decompression of the received stream, and can even make the decoder crash. Therefore, dynamic network condition monitoring and efficient error controlling are necessary to ensure high-fidelity audio transmission. Several error protection schemes had been studied for audio streaming over the Internet or wireless channels [, 2, 3]. -based error control schemes are presented for audio streaming in the Internet [2, 3]. Yung et. al. proposed a fixed Unequal Error Protection (UEP) scheme for MPEG audio over wireless channel []. However, neither of these schemes considers time-varying channel conditions or handles packet loss and bit errors simultaneously. To address the error protection for both bit errors and packet losses, Sachs et al. proposed a multiple-description This work is performed while he is a visiting student at Microsoft Research China. product code scheme for the SPHIT image transmission [4]. In [4], a concatenated channel code is used, which includes a row code based on RCPC codes with CRC error detection and a source-channel column code consisting of the scalable SPIHT image coder and an UEP RS erasure-correction codes. However, how to distribute the varying network resources among RCPC rate, RS protection level, and image source rate had not been discussed. Moreover, this protection scheme assumes no bit error after the RCPC channel decoding, which may not be true in practical wireless networks. A hybrid coder is presented for embedded image transmission over channels with bit errors and packet erasures [5]. This hybrid coder combines the RCPC/CRC concatenated channel coder that handles the bit errors, with packetized zerotree wavelet coder, which provides robustness to packet loss by producing the datastream with independently decodable packets. However, how to determine the RCPC rate with varying network condition, and how to recover the lost packets have not been discussed. Note that all the above protection schemes are proposed for image transmission. Considering the different characteristics between the audio and the image, such as the D-R relation and the delay constraints, applying them to audio streaming is not straightforward. To the best of our knowledge, there is no report for scalable audio streaming over wireless Internet. In this work, we propose a novel end-to-end architecture to deliver the scalable audio over such channels with bit errors and packet erasures. More specifically, based on dynamic channel estimation from the feedback control information, channel-adaptive layered product code is adopted by combining unequal row (inner) channel code with unequal column (outer) channel code. Moreover, an optimal bit allocation is presented to allocate bits between the source and product code, which results in minimal expected end-to-end distortion for audio transmission. The rest of this paper is organized as follows. The architecture for scalable audio streaming over wireless Internet is described in Section 2. Section 3 presents the optimal bit allocation scheme, where the novel layered product code is also introduced. Simulation results are given in section 4. Finally, Section 5 provides the concluding remarks. II. AN END-TO-END ARCHITECTURE FOR SCALABLE AUDIO STREAMING OVER WIRELESS INTERNET

As mentioned above, to transmit scalable audio over wireless Internet, bit errors will confront on wireless links. In addition, packet losses occur in both wired and wireless network. To efficiently deliver scalable audio over wireless Internet, the applications should be aware of the network conditions. In the meanwhile, appropriate error protection scheme should be used. Figure depicts our proposed end-to-end architecture for scalable audio streaming over channels with both packet losses and bit errors. On the sender side, the audio signal is encoded into several quality layers. According to network conditions and audio characteristic, each layer is divided into blocks and unequal protections are applied both in the block and across the blocks. The layered product-code is used to handle packet losses and bit errors simultaneously. In the product code, the row (inner) channel code is used to deal with the bit errors, and the column (outer) channel code, on the other hand, is used to deal with the packet losses. On the receiver side, after the process of channel decoding, the reconstructed packets are directed to the source decoder. In the mean time, some related network parameters, such as packet loss rate, bit error rate and transmission delay, are collected in the network monitor module and fed back to the sender. Based on those feedbacks, network status estimation module in the sender dynamically estimates the network status and available bandwidth. Once the available bandwidth is obtained, the target bit allocation module on the sender side allocates the bits among the row (inner) channel code, the column (outer) channel code and the source coding so as to minimize the expected end-to-end distortion in the receiver. In our scheme, a scalable audio with error resilience is used as an example, where the error resilient tools, like Data Partition (DP) and Reversible Variable Length Codes (RVLC), are adopted. Bit-plane coding is used to generate an embedded stream, which can be truncated anywhere. Therefore, layers of the same frame are correlated. To be specific, the higher layer information relies on the corresponding ones in the lower layers. On the receiver side, if any residual error occurs in the lower layers, the corresponding information in the higher layers will become useless no matter whether they are correct or not. The key components of our proposed end-to-end delivery architecture are channel adaptive layered product encoder and target bit allocation. We will present details for those modules in the following sections. III. CHANNEL-ADAPTIVE LAYERED PRODUCT CODE AND OPTIMAL BIT ALLOCATION 3. A Novel Product Channel Code In this work Reed-Solomon (RS) code [7] is used for forward error correction both in the row (inner) channel code and in the column (outer) channel code. An RS code defined by RS(n, k) is a length-n code which contains k source symbols and n-k protection symbols. RS (n, k) can generally correct t = ( n k) / 2 symbol errors. With the knowledge of the error position, it can correct up to t = n k symbol errors. Most commonly, n = 2 m, where m is the symbol length in bits (in our scheme we use m=8, because it is convenient to access information in byte). Figure 2 depicts the illustrative diagram of layered product code for our proposed scheme. The main idea is to divide each layer into blocks, add the column RS code first across the blocks, and then apply the row RS code to each block. The blocks in all layers are concatenated into one packet, which contains only one block each layer. As stated above, the row RS code is used to deal with the bit errors, and the column RS code is mainly used to deal with the packet losses. When the packet is lost caused by the network congestion or the large-scale path loss and fading, the column RS code can be used for recovery. If the error position is known, RS code can correct twice as many rows compared to a decoder without such knowledge. On the other hand, the row RS code can also prevent the block from being corrupted by the bit error. If a row is indicated to fail by the row RS, it will be marked as packet erasure. This erased packet, which marked by row RS code, can still be recovered by the column RS code. Furthermore, note that not all errors occur in the same column among the corrupted blocks, and the number of the recovered packets can be more than the maximum rows that the column RS can correct. Since the information in the lower layer is more important, the higher product channel rate is assigned to the lower layer. The protection degrees of the row channel code and the column code in a layer are determined according to the network conditions and its impact on the overall quality. 3.2 Optimal Bit Allocation for Product Code It is known that under a given channel condition, the additional increases the error robustness, but on the other hand, reduces the available rate for source coding. Thus there is a trade-off between source rate and rate. In this section we will discuss how to allocate bits between the source coding and the channel coding based on the rate-distortion relation. First we should determine two channel-code rates for packet loss protection and bit-error protection. Secondly, according to the characteristic of scalable audio, the higher layer information relies on the corresponding ones in the lower layers, and unequal error protection is necessarily adopted for two directional error protections. As discussed in [7], the end-to-end distortion, D(R), is composed of the source distortion and the channel distortion. The source distortion, D s (R s ), is caused by media rate controls; while the channel distortion, D c (R c ), results from noisy channel. Thus, assuming that D s and D c are uncorrelated, the end-to-end distortion can be represented as D ( R) = Ds ) + Dc ), () where R, R s, and R c represent the available rate, source rate,

and error correction rate, respectively. As mentioned in Section 2, we adopt scalable audio coder. Therefore, the source distortion can be obtained through the source rate 2Rs distortion function Ds ) = A2, with A being a constant. When calculating D c (R c ), we assume that each block of the same quality layer is equally important and the distortion resulting from the corrupted block is equal to that resulting from the lost one. Note that in the wireless Internet scenario, even a packet is received, some blocks in this packet may not be used by the source decoder due to the bit error. Thus we divide the channel distortion into two parts. One is the distortion caused by packet losses, and the other is the distortion caused by the bit errors in the received packets. The latter one can be also further divided into two cases. The first case is that the lost block, which is marked by row RS, can be recovered by the column channel code. The other is the distortion caused by which the column code is too weak to recover the loss blocks. Suppose the column RS code for each layer is represented as (n, k ), (n 2, k 2 ),..., (n L, k L ), where (n -k ) (n 2 -k 2 )... (n L -k L ). The following is the analysis of the channel distortion: The number of lost packets varies from n -k + to n. That is, even the lowest layer cannot be fully recovered in the column direction. In this case, the distortion can be represented as n L D c base = ( Pcolumn( t, n ) ( { wi t = n k + i= m i [ P dep( i, Prow( D( ]})), (2) j= where L is the number of layers that need to be transmitted, m i is the number of blocks in the i th layer, D( represents the distortion caused by the j th block in the layer P column (l, m) is the probability of l lost packets within m packets, P row ( is the corrupted probability of the j th block in the layer and P dep (i-, is the dependent probability of the j th block in the layer i. The number of lost packets varies from n 2 -k 2 + to n -k. That is, the parity blocks in the lowest layer can be used to recover the blocks corrupted by the bit errors, but other higher layers cannot. In this case, the distortion can be denoted as n2 L D c enh = ( Pcolumn( t, n2) ( { wi t= n2 k2 + i= m i [ P dep( i, Prow( D( ]})). (3) j= Similar analysis can be performed for other packet loss cases. The channel distortion can be presented as Dc ) = Dc base + Dc enh +... + Dc enhl. (4) Note that for the dependent probability and corrupted probability of a certain block in the above equations, different calculation method should be used according to the packet loss situation. After the above discussions, the challenging task is to find the optimal row channel rate, column channel rate and source rate for each layer. This can be formulated as follows: min [ Ds ) + Dc _ row, Rc _ column)] R,( n, k ) s i s.t. i Rs + Rc _ row + Rc _ column R, ( ni ki) ( n j k j ) for i j. (5) IV. SIMULATION RESULTS This simulation is to demonstrate effectiveness of our proposed channel-adaptive error protection scheme for scalable audio streaming over wireless Internet. In this simulation we tested three schemes: () Fixed column channel code only (20% for the first layer and 0% for the rest layers); (2) Fixed product code (20% for the first layer and 0% for the other layers both in the row and column channel-code); (3) Our proposed channel-adaptive layered product code. In all the tested cases, the MPEG-4 standard audio clip, horn_23_2, is used for testing. The scalable audio coder encodes the audio signals at a coding rate of 64 kbps. All the simulations are performed at the rate of 48kbps, and the packet loss ratio of the channel varies from 8% to 5%. We test these schemes on the channels with different bit error rates. More specifically, we classify these channels into two types: A. channel with packet rate from 8% to 5% and high BER from 0-3 to 5x0-3 ; B. channel with packet loss rate from 8% to 5% and low BER from 0-4 to 5x0-4. The total available bandwidth is the same for all the protection schemes in all the cases. In this simulation, we illustrate the quality of the decoded audio through the Noise-Mask-Ratio (NMR), where a lower value of the NMR shows a better-reconstructed audio. Figure 3 shows the average NMR results over 30 Monte Carlo simulations. It can be seen from Figure 3 that our proposed novel channel-adaptive layered product code obtains better results than the other two tested schemes in the noisy channel with both bit errors and packet losses. In the type A channel, the average NMR results of the three schemes are 6.84 db, 6.37 db, and 5.70 db, respectively. In the type B channel, the average NMR results of the three schemes are 6.50 db, 6.74 db, and 5.36 db, respectively. We can see that our scheme achieves more than db gains in both types channels. It also can be seen from Figure 3 that under type A channel, scheme 2 is better than the scheme

with the column channel code only; while for type B channel, scheme outperforms scheme 2. V. CONCLUSIONS In this paper, we present an end-to-end architecture for scalable audio streaming over wireless Internet in which a novel channel adaptive layered product-code is proposed to efficiently handle bit errors and packet erasures simultaneously. Considering the characteristic of scalable audio, unequal row channel code and unequal column code are adopted for different layers of scalable audio. Moreover, Rate-Distortion based bit allocation is proposed to determine each channel-coding rate and source-coding rate so as to minimize the expected end-to-end distortion. Simulation results show that our proposed approach can achieve more than db improvement under different channel conditions. REFERENCES [] C. Yung, H. Fu, C. Tsu R. S. Cheng, and D. George, Unequal error protection for wireless transmission of mpeg audio. Circuits and Systems, 999. ISCAS '99. Proceedings of the 999 IEEE International Symposium on, Volume: 6, 999. [2] J. Bolot, and A. Vega-Garcia, The case for -Based error control for packet audio in the Internet. ACM Multimedia Systems 97. [3] C. Perkins, O. Hodson, and V. Hardman, A Survey of Packet-Loss recovered techniques for streaming audio. IEEE Trans. Network, vol. 2, no. 5, Sept. 998, pp. 40-48. [4] D. G. Sachs, R. Anand, and K. Ramchandran, Wireless image transmission using multiple-description based concatenated codes. Proceedings DCC 2000. Snowbird, UT, USA. [5] P. Cosman, J. Rogers, P. G. Sherwood, and K. Zeger "Image Transmission over Channels with Bit Errors and Packet Erasures" 32ST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS Monterey, California, November 998. [6] R.E.Blahut, Digital Transmission of Information Reading, MA:Addison-Wesley, 990. [7] Q. Zhang, W. Zhu, and Y.-Q. Zhang, "Network-adaptive rate control with TCP-friendly protocol for multiple video objects", IEEE International Conference on Multimedia and Expo (ICME), July, 2000, New York. scalable audio encoder channel-adaptive layered product encoder buffer Internet Server target bit allocation network status estimation Gateway Data path audio decoder layered product decoder buffer Control path Display network monitor (packet loss rate, bit error rate, etc.) feedback Client Figure. End-to-end architecture for scalable audio streaming over wireless Internet.

Layer Layer 2 Layer L ' ' ' ' ' ' ' ' ' k n k k 2 n2 k2 k L nl k L Packet block Packet n k n -k symbol Data Unit Column Row k 2 Column Row k L packets Column R o w F E C P klen Figure 2. The illustrative product code diagram of our proposed scheme. NMR 6 4 2 0 8 Scheme Scheme 2 Our Scheme NMR 4 2 0 8 Scheme Scheme 2 Our Scheme 6 6 4 4 2 0 20 40 60 80 00 Frame Number 2 0 20 40 60 80 00 Frame Number ( a ) ( b ) Figure 3. The NMR comparison results for horn_23_2 using different protection schemes at 48kbps. (a) High bit error rate. (b) Low bit error rate.