Convention Paper Presented at the 121st Convention, 2006 October 5-8, San Francisco, CA, USA


Audio Engineering Society Convention Paper, presented at the 121st Convention, 2006 October 5-8, San Francisco, CA, USA

This convention paper has been reproduced from the author's advance manuscript, without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York, USA. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society.

Error-Robust Frame Splitting For Audio Streaming Over the Lossy Packet Network

Jong Kyu Kim 1, Hwan Sik Yun 1, Jung Su Kim 1, Joon-Hyuk Chang 2, and Nam Soo Kim 1

1 School of Electrical Engineering, Seoul National University, Seoul, Korea
2 School of Electronic and Electrical Engineering, Inha University, Incheon, Korea

Correspondence should be addressed to Chong Kyu Kim-Author (ckkim@hi.snu.ac.kr)

ABSTRACT

In this paper, we propose a novel audio streaming scheme for perceptual audio coders over packet-switching networks. Each frame is split into several subframes, sized according to the specified packet size, which can be decoded independently for robust error concealment. We further improve the subframe splitting technique by allocating the spectral lines to each subframe adaptively. An informal listening test indicates that our approach improves the decoded audio quality under lossy packet network conditions.

1. INTRODUCTION

Audio streaming has become one of the most popular data services in mobile communications. Most audio streaming services are based on the packet-switching network, where messages are divided into packets and each packet is transmitted individually. In the mobile packet-switching network, one of the most typical types of error is packet loss.
Packet loss may arise in many different forms on the Internet or wireless networks [1]. Under such packet loss conditions, it is crucial to guarantee the user-perceived quality of service (QoS). There are several practical techniques for audio streaming, such as the error resilience (ER) and error protection (EP) tools in the MPEG Advanced Audio Coding (AAC) standard [2]. These tools can be applied to cope with the bit errors caused by packet losses. When packet loss occurs, error concealment is usually applied to substitute suitable data for the lost part. Error concealment algorithms are implemented at the receiver of the audio streams and usually do not require any side information from the transmitter [3]. The major objective of packet error concealment is to regenerate the lost data so that it is perceptually indistinguishable from the original.

There have also been several proposals on packetization schemes for error-robust audio streaming over packet-switching networks. The RTP payload format [4] defines a general and configurable payload structure to transport MPEG-4 elementary streams, which includes detection of the loss of crucial information in the bitstream, optional interleaving of audio frames, and retransmission or forward error correction with due consideration to congestion control. A more specific strategy for the packetization of the audio bitstream, which takes the internal structure of the encoded audio frame into account, has been proposed in [5]. This strategy arranges MPEG-AAC frames in different packets according to the proportional priority, considering the tradeoff between redundancy overhead and retransmission delay.

In this paper, we propose a novel frame splitting scheme for robust audio streaming over packet-switching networks. The proposed scheme can be applied to the cases in which a single audio frame should be split into a number of separate packets. Such a situation happens when the size of the packet is smaller than that of the audio frame or when the audio frame should be segmented into subblocks and interleaved. The proposed technique is found effective in enhancing the audio quality that may be degraded by packet losses and by the mismatch between the audio frame and packet sizes specified in the network and audio codec configurations.

The rest of this paper is organized as follows. In Section 2, we address the general structure of a perceptual audio coder and its bitstream, together with its shortcomings when applied to transmission over a packet-switching network. We then describe the proposed frame splitting scheme in Section 3 and present an adaptive frame splitting technique which prevents deterioration of coding efficiency in Section 4. Following the experimental results in Section 5, we conclude this paper in Section 6.

2. STRUCTURE OF AUDIO BITSTREAM

Generally, perceptual audio coders are developed with little attention to transmission errors.
In this section, we consider the problem of streaming compressed audio data over the lossy packet network. In a conventional audio coding algorithm, each block of audio samples is converted to a frame of bitstream which is independently decoded. A block of input samples is transformed into a set of spectral lines in the frequency domain through a time-frequency transformation. Perceptual audio coding algorithms achieve a high coding gain by exploiting both the perceptual irrelevancies and the statistical redundancies in the spectral domain [6]. Perceptually irrelevant components are removed by adjusting the quantization step sizes depending on the masking level computed from the psychoacoustic model. Statistical redundancies, on the other hand, are removed with the use of an entropy coding technique such as Huffman coding or DPCM. Consequently, the compressed audio bitstream consists of the entropy-encoded spectral information, the side information used to decode each spectral line, and the header information which conveys the configuration, e.g., sampling rate, number of channels and so on. In general, these pieces of information are written sequentially as shown in Fig. 1.

As mentioned above, the audio frame is the smallest unit that can be decoded independently in a perceptual audio coder. For the transmission of the compressed audio data over a packet-switching network, the bitstream should be segmented into several packets of appropriate size. If the audio encoder has been developed without knowing the network specification, there is usually some mismatch between the audio frame and packet sizes. If the frame border does not coincide with the packet border, the information in a frame spans two adjacent packets. When either of these two packets is missing at the receiver, the frame cannot be decoded perfectly; more precisely, part of the frame can be decoded, depending on which part of the frame was lost.
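To make the frame/packet mismatch concrete, the following Python sketch maps each frame of a serialized bitstream onto fixed-size packets and lists the frames touched by a lost packet. The function names, the frame sizes, and the rule that a frame counts as damaged whenever any packet carrying part of it is lost are illustrative assumptions, not part of the original coder.

```python
def frame_packet_spans(frame_sizes_bits, packet_size_bits):
    """For each frame, return the range of packet indices that carry its bits.

    Frames are assumed to be written back to back into the bitstream,
    which is then cut into packets of packet_size_bits bits each.
    """
    spans = []
    start = 0  # bit offset of the current frame in the bitstream
    for size in frame_sizes_bits:
        end = start + size  # exclusive end offset
        first_packet = start // packet_size_bits
        last_packet = (end - 1) // packet_size_bits
        spans.append(range(first_packet, last_packet + 1))
        start = end
    return spans


def damaged_frames(frame_sizes_bits, packet_size_bits, lost_packets):
    """Indices of frames that lose at least one of their packets (assumed rule)."""
    lost = set(lost_packets)
    spans = frame_packet_spans(frame_sizes_bits, packet_size_bits)
    return [i for i, span in enumerate(spans) if any(p in lost for p in span)]


# Hypothetical frame sizes in bits, cut into 168-bit packets.
sizes = [400, 300, 500]
print(damaged_frames(sizes, 168, lost_packets=[2]))  # -> [0, 1]
```

In this example packet 2 carries the tail of frame 0 and the head of frame 1, so a single lost packet damages two frames.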
In practice, however, such partial decoding is rarely possible, since the audio bitstream is highly vulnerable to the consecutive bit errors caused by a missing packet. An even worse case arises when a single frame is segmented into several packets. This happens when the packet size should be smaller than the frame size or when a frame is segmented into partitions and interleaved over several packets. In this case, the loss of a single packet results in the loss of the whole audio frame even though all the other packets are received successfully.

Fig. 1: Structure of the bitstream in the conventional audio coding algorithms

3. FRAME SPLITTING

In order to cope with the mismatch between the audio frame and packet sizes and to achieve efficient audio data streaming which is robust to packet losses, we modify the conventional audio encoding technique. Our approach splits each audio frame into several subframes such that the size of a subframe matches the size of a packet and each subframe can be decoded independently. Even though the basic idea can be applied to various perceptual audio coders, we focus on the modification of MPEG-AAC in this work.

3.1. Splitting into Subframes

Every time a frame is encoded, the number of available packets is given so as to maintain time synchrony. This number comes from the network bandwidth configuration. As a result, the audio coder operates in a variable rate mode. An example of assigning packets to the corresponding audio frames is shown in Fig. 2, where we can see that the number of packets varies from frame to frame. One drawback of this scheme is that a frame which is split into fewer packets is given fewer bits for audio compression, irrespective of its spectral content. This effect becomes weaker as the packet size gets smaller.

Fig. 2: Synchronization between audio frames and packets

In conventional perceptual audio coders, spectral lines are grouped into frequency bands which are referred to as scalefactor bands (as in MPEG-AAC). The spectral lines in each scalefactor band are entropy coded, with side information added to the bitstream. Since the spectral lines in each scalefactor band are coded jointly, it is desirable to split the audio frame by treating each scalefactor band as the basic unit.

3.2. Scalefactor Band Allocation Rule

The rule according to which the scalefactor bands are allocated to each packet can be chosen arbitrarily, provided that both the encoder and the decoder know it exactly. It is easy to devise two simple rules: the sequential and the interleaving splitting schemes. In the sequential splitting scheme, adjacent scalefactor bands are allocated to the same packet as shown in Fig. 3. A major shortcoming of this scheme is that a missing packet creates a large spectral gap in the frequency domain. To alleviate this deterioration, the interleaving scheme interleaves the order of the scalefactor bands before sequentially assigning them to each packet as shown in Fig. 4. Since the interleaving operation disperses the effect of packet loss, the missing spectral lines which appear as one large spectral gap in the sequential splitting scheme are replaced by multiple small gaps, which are perceptually preferred. This also helps error concealment because missing spectral lines can be predicted based on their correlation with the neighboring spectral lines.

Fig. 3: Sequential Splitting

Fig. 4: Interleaving Splitting

A disadvantage of the interleaving scheme is that it decreases the coding efficiency, since spectral lines collected over a wide frequency range have low redundancy. This results in a deterioration of the perceived audio quality at the same bitrate. Consequently, an optimal splitting rule should be designed based on a tradeoff between coding efficiency and error concealment.

In the splitting rule, the number of scalefactor bands allocated to each subframe is an important factor that determines the audio quality. This is because spectral content is distributed unequally over the whole frequency range. For instance, there is usually more spectral content in low frequency bands than in high frequency bands. If every packet is assigned an equal number of scalefactor bands, the low frequency bands are likely to be coded with fewer bits than required. Adjusting the splitting rule according to the overall statistical distribution of audio signals can alleviate this effect to some degree. However, the distribution of spectral information varies rapidly over time, and a fixed rule may not guarantee a proper splitting for some frames, leading to a degradation of the audio quality. A suboptimal splitting rule will be discussed in the following section.

3.3. Encoding

After the allocation of scalefactor bands according to the splitting rule mentioned above, each subframe is independently encoded. Encoding is executed based on the general audio coding algorithm with a slight modification. First, the number of available bits is given as a parameter for the separate encoding of each subframe. This parameter is used for rate control. Since each subframe should be quantized independently, it is unavoidable to modify the conventional audio coding algorithm.
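The two allocation rules of Section 3.2 can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, the sequential rule is assumed to divide the bands as evenly as possible, and the interleaving rule is assumed to be a simple round-robin assignment, which is only one of many possible interleaving orders.

```python
def sequential_split(num_bands, num_subframes):
    """Assign contiguous runs of scalefactor bands to each subframe."""
    base, extra = divmod(num_bands, num_subframes)
    subframes, start = [], 0
    for k in range(num_subframes):
        count = base + (1 if k < extra else 0)  # spread the remainder
        subframes.append(list(range(start, start + count)))
        start += count
    return subframes


def interleaved_split(num_bands, num_subframes):
    """Assign bands round-robin, so neighbours land in different subframes."""
    return [list(range(k, num_bands, num_subframes))
            for k in range(num_subframes)]


def longest_gap(lost_bands):
    """Length of the longest run of consecutive band indices that were lost."""
    longest = run = 0
    prev = None
    for band in sorted(lost_bands):
        run = run + 1 if prev is not None and band == prev + 1 else 1
        longest = max(longest, run)
        prev = band
    return longest


# Losing the packet that carries subframe 1, with 12 bands and 3 subframes:
print(longest_gap(sequential_split(12, 3)[1]))   # -> 4
print(longest_gap(interleaved_split(12, 3)[1]))  # -> 1
```

With the sequential rule the lost packet removes four adjacent bands, whereas with the round-robin rule every missing band still has received neighbours on both sides, which is the property that error concealment relies on.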
Instead of computing a single global gain over all the scalefactor bands, a separate global gain is obtained for each subframe, considering only the scalefactor bands that belong to it. Once the global gain is obtained, each scalefactor is fed into the rate control loop, which iteratively determines the quantization levels of the scalefactor bands according to the bitrate constraint [2]. The rate control loop is almost the same as that of the conventional audio coder. The only difference is that in our approach the scalefactor bands that belong to the same subframe are considered simultaneously.

After the bit allocation, the encoded data of each subframe is separately written as a bitstream for later packetization. The bitstream structure of each subframe is not much different from that of the normal audio frame specified in conventional audio coding. For decoding that is robust to packet loss, the header is added to the data of every subframe. Even though this may be considered an overhead on the limited network bandwidth, the header data is usually much smaller than the data that describes the audio content.

4. ADAPTIVE SPLITTING

What remains is how to optimally split the audio frame into a finite number of subframes. Splitting here means a mapping that allocates each scalefactor band to a specific subframe or packet. As mentioned in the previous section, a fixed allocation is not desirable for achieving high audio quality, despite its advantage that it does not require any side information to be delivered to the receiver. A more promising approach is to split the audio frame such that all the subframes are encoded with an equal level of coding efficiency, so that no specific subframe causes low audio quality. To measure the coding efficiency of each subframe, we apply the noise-to-mask ratio (NMR), which represents the ratio of the quantization noise to the masking threshold [2]. Let $R_i$ denote the NMR for the $i$-th scalefactor band.
Then,

$$R_i = \frac{N_i}{M_i} \qquad (1)$$

where $N_i$ is the power of the quantization noise and $M_i$ is the masking threshold computed from the psychoacoustic model for the $i$-th scalefactor band. Creating a constant NMR over all scalefactor bands is an objective of the rate-distortion control module of MPEG-AAC when the number of available bits differs from the number of required bits [2]. Analogous to this method, we aim to allocate scalefactor bands to subframes such that all the subframes have almost the same level of NMR.

For the adaptive frame splitting, we propose an algorithm that operates in an iterative manner. A flowchart of the overall algorithm is shown in Fig. 5. In the initial phase, scalefactor bands are allocated to each subframe with a default splitting rule, and then each subframe is encoded. After encoding, the NMR of each subframe is calculated and it is checked whether the NMRs are equally distributed. If the NMRs are found to be unbalanced, the scalefactor bands are reallocated by decreasing the number of scalefactor bands in the subframe with the maximum NMR while increasing the number of scalefactor bands in the subframe with the minimum NMR, so that the subframe with the most audible quantization noise spends its fixed bit budget on fewer bands. Then the encoding and NMR computation are executed again. As this iteration continues, the number of scalefactor bands allocated to each subframe converges, as shown in Fig. 6, in which the number of scalefactor bands in each subframe is plotted. The iteration stops when the frame splitting no longer changes.

Information on the adaptive splitting should be included in the bitstream of each subframe, and the decoder arranges the decoded scalefactor bands according to it. For independent decoding, each packet should carry the location index as well as the number of scalefactor bands of its subframe. In our implementation, we assign 5 bits for the starting location index and another 5 bits for the number of scalefactor bands. An example of frame splitting with the relevant information to be coded is given in Table 1.

Table 1: Representation of Allocation Information. Fields listed for each subframe: Subframe Index, First Scalefactor Band Index, Last Scalefactor Band Index, Starting Location Index, Number of Scalefactor Bands.

5. TEST RESULTS

To evaluate the performance of the proposed scheme, we implemented the frame splitting module on the MPEG-AAC platform. For simplicity, we made several modifications to the original MPEG-AAC algorithm.
First, we did not apply the block switching technique, so the audio analysis was performed on long blocks only. Second, additional encoding tools such as temporal noise shaping (TNS) and gain control were not applied. Finally, the bit reservoir was not used.

Fig. 5: A flowchart of the overall frame splitting algorithm

Table 2: Test audio coder specifications.
Sampling Rate          11,025 Hz
Frame Size             ms
Input File Length      40 s
Number of Channels     Mono

Table 3: Packet network specifications
Bitrate                    8.4 kbps
Packet Length              20 ms
Packet Size                168 bits
PER (Packet Error Rate)    over 20%

Specifications for the audio frame, packet size and network condition in the experiments are shown in Tables 2 and 3. These specifications were derived from an audio streaming application where the input signal is compressed by MPEG-AAC and then transmitted over the Code Division Multiple Access (CDMA) packet-switching network.

When a packet loss occurred, an error concealment algorithm was applied to reconstruct the missing spectral lines. For the error concealment, we took a simple repetition strategy in which the lost spectral lines were substituted with the spectral components that had been successfully received most recently. If packet losses occurred consecutively (burst packet loss), the corresponding spectral lines were faded out exponentially and muted after a certain number of consecutive packet losses. The same error concealment scheme was applied to both the original audio coder and the proposed one.

A comparison of waveforms decoded from an audio bitstream damaged by packet losses is given in Fig. 7. The first plot shows the original input waveform and the second plot is the waveform obtained from the conventional MPEG-AAC decoder. The third plot displays the waveform obtained from the proposed frame splitting algorithm. The graph at the bottom illustrates the applied error sequence, where 0 represents no packet error and 1 indicates a packet loss. At the locations where packet losses occurred, the original algorithm could not decode the received frames and faded out the waveform. In contrast, the proposed algorithm recovered the partly lost audio frames, and the lost spectral lines could be concealed more faithfully. This example demonstrates the advantage of our frame splitting technique in lossy packet environments.

Fig. 6: An iteration to find the numbers of scalefactor bands assigned to the subframes

Fig. 7: Decoded Waveform

For a further evaluation of the performance, an informal listening test was carried out with ten listeners. All ten subjects reported that the audio decoded with the proposed approach had far fewer interruptions caused by packet losses than that from the original MPEG-AAC.

6. CONCLUSIONS

In this paper, we have proposed a frame splitting scheme for perceptual audio coding algorithms. Each subframe is independently encoded such that it fits the specified packet size. Received packets are independently decoded without being affected by other missing packets. An informal subjective listening evaluation has shown that the suggested scheme dramatically improves the audio streaming quality under the lossy packet network environment.

7. ACKNOWLEDGEMENT

This work was supported by SK Telecom, and the authors would like to thank Dr. D. H. Lee, Dr. S. S. Park and D. S. Woo at SK Telecom for their helpful discussions.

8. REFERENCES

[1] Y. Wang, A. Ahmaniemi, D. Isherwood and W. Huang, Content-based UEP: A new scheme for packet loss recovery in music streaming, ACM Multimedia Conference, Berkeley, CA, USA, Nov.

[2] ISO, Information Technology - Coding of Audio-Visual Objects, ISO/IEC JTC1/SC29 WG11, ISO/IEC IS (Part-3, Audio).

[3] B. W. Wah, X. Su and D. Lin, A survey of error concealment schemes for real-time audio and video transmissions over the internet, IEEE International Symposium on Multimedia Software Engineering, Taipei, Taiwan, Dec.

[4] J. van der Meer, D. Mackie, V. Swaminathan, D. Singer, P. Gentric, RTP payload format for transport of MPEG-4 elementary streams, IETF RFC 3640.

[5] J. Korhonen, Y. Wang and D. Isherwood, Toward bandwidth-efficient and error-robust audio streaming over lossy packet networks, Multimedia Systems Journal (MMSJ).

[6] T. Painter, A. Spanias, Perceptual coding of digital audio, Proc. IEEE, April.
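As a supplementary illustration of the adaptive splitting idea in Section 4, the following Python sketch rebalances contiguous subframes iteratively. It is a toy model under stated assumptions rather than the authors' encoder: each scalefactor band is given a hypothetical bit demand, every subframe has the same fixed bit budget, and the ratio of demanded bits to budget stands in for the subframe NMR. The move direction (one band fewer for the most loaded subframe, one band more for the least loaded) and the stopping rule (stop when a move no longer narrows the spread) are also assumptions of this sketch.

```python
def subframe_loads(demand, counts, budget_bits):
    """Proxy NMR per subframe: demanded bits divided by the fixed budget."""
    loads, start = [], 0
    for count in counts:
        loads.append(sum(demand[start:start + count]) / budget_bits)
        start += count
    return loads


def adaptive_split(demand, num_subframes, budget_bits, max_iter=100):
    """Return the number of contiguous bands assigned to each subframe."""
    base, extra = divmod(len(demand), num_subframes)
    counts = [base + (1 if k < extra else 0) for k in range(num_subframes)]
    for _ in range(max_iter):
        loads = subframe_loads(demand, counts, budget_bits)
        worst = loads.index(max(loads))
        best = loads.index(min(loads))
        if worst == best or counts[worst] <= 1:
            break
        trial = list(counts)
        trial[worst] -= 1  # most loaded subframe gives up one band
        trial[best] += 1   # least loaded subframe takes one band
        new_loads = subframe_loads(demand, trial, budget_bits)
        if max(new_loads) - min(new_loads) >= max(loads) - min(loads):
            break  # no improvement: the splitting stays unchanged
        counts = trial
    return counts


def allocation_table(counts):
    """(starting band index, number of bands) pairs, one per subframe."""
    table, start = [], 0
    for count in counts:
        table.append((start, count))
        start += count
    return table


# Hypothetical per-band bit demands, heavier at low frequencies.
demand = [40, 30, 20, 10, 5, 5, 5, 5]
counts = adaptive_split(demand, num_subframes=2, budget_bits=168)
print(counts)                    # -> [2, 6]
print(allocation_table(counts))  # -> [(0, 2), (2, 6)]
```

Because the subframes stay contiguous, changing two counts simply shifts the boundaries of any subframes in between, and the resulting (start, count) pairs correspond to the starting location index and number of scalefactor bands that the paper transmits as side information.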


More information

Wireless Video Transmission: A Single Layer Distortion Optimal Approach

Wireless Video Transmission: A Single Layer Distortion Optimal Approach 2009 Data Compression Conference Wireless Video Transmission: A Single Layer Distortion Optimal Approach Negar Nejati Homayoun Yousefi zadeh Hamid Jafarkhani Department of EECS University of California,

More information

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06 Fundamentals of Perceptual Audio Encoding Craig Lewiston HST.723 Lab II 3/23/06 Goals of Lab Introduction to fundamental principles of digital audio & perceptual audio encoding Learn the basics of psychoacoustic

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

1480 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 5, OCTOBER 2012

1480 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 5, OCTOBER 2012 1480 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 5, OCTOBER 2012 Wireless H.264 Video Quality Enhancement Through Optimal Prioritized Packet Fragmentation Kashyap K. R. Kambhatla, Student Member, IEEE,

More information

over the Internet Tihao Chiang { Ya-Qin Zhang k enormous interests from both industry and academia.

over the Internet Tihao Chiang { Ya-Qin Zhang k enormous interests from both industry and academia. An End-to-End Architecture for MPEG-4 Video Streaming over the Internet Y. Thomas Hou Dapeng Wu y Wenwu Zhu z Hung-Ju Lee x Tihao Chiang { Ya-Qin Zhang k Abstract It is a challenging problem to design

More information

ADAPTIVE JOINT H.263-CHANNEL CODING FOR MEMORYLESS BINARY CHANNELS

ADAPTIVE JOINT H.263-CHANNEL CODING FOR MEMORYLESS BINARY CHANNELS ADAPTIVE JOINT H.263-CHANNEL ING FOR MEMORYLESS BINARY CHANNELS A. Navarro, J. Tavares Aveiro University - Telecommunications Institute, 38 Aveiro, Portugal, navarro@av.it.pt Abstract - The main purpose

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 213 http://acousticalsociety.org/ ICA 213 Montreal Montreal, Canada 2-7 June 213 Engineering Acoustics Session 2pEAb: Controlling Sound Quality 2pEAb1. Subjective

More information

Streaming (Multi)media

Streaming (Multi)media Streaming (Multi)media Overview POTS, IN SIP, H.323 Circuit Switched Networks Packet Switched Networks 1 POTS, IN SIP, H.323 Circuit Switched Networks Packet Switched Networks Circuit Switching Connection-oriented

More information

Partial Reliable TCP

Partial Reliable TCP Partial Reliable TCP Yao-Nan Lien and Ming-Han Wu Computer Science Department,National Chengchi University, Taipei, Taiwan, R.O.C. lien@cs.nccu.edu.tw ABSTRACT-Some new information services over IPbased

More information

14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP

14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP TRADEOFF BETWEEN COMPLEXITY AND MEMORY SIZE IN THE 3GPP ENHANCED PLUS DECODER: SPEED-CONSCIOUS AND MEMORY- CONSCIOUS DECODERS ON A 16-BIT FIXED-POINT DSP Osamu Shimada, Toshiyuki Nomura, Akihiko Sugiyama

More information

Multiple Description Coding for Video Using Motion Compensated Prediction *

Multiple Description Coding for Video Using Motion Compensated Prediction * Multiple Description Coding for Video Using Motion Compensated Prediction * Amy R. Reibman Yao Wang Michael T. Orchard Rohit Puri and Polytechnic Univ. Princeton Univ. Univ. Illinois Hamid Jafarkhani Brooklyn,

More information

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

International Journal of Emerging Technology and Advanced Engineering Website:   (ISSN , Volume 2, Issue 4, April 2012) A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and

More information

CC-SCTP: Chunk Checksum of SCTP for Enhancement of Throughput in Wireless Network Environments

CC-SCTP: Chunk Checksum of SCTP for Enhancement of Throughput in Wireless Network Environments CC-SCTP: Chunk Checksum of SCTP for Enhancement of Throughput in Wireless Network Environments Stream Control Transmission Protocol (SCTP) uses the 32-bit checksum in the common header, by which a corrupted

More information

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding INTERNATIONAL STANDARD This is a preview - click here to buy the full publication ISO/IEC 23003-3 First edition 2012-04-01 Information technology MPEG audio technologies Part 3: Unified speech and audio

More information

Error Concealment Used for P-Frame on Video Stream over the Internet

Error Concealment Used for P-Frame on Video Stream over the Internet Error Concealment Used for P-Frame on Video Stream over the Internet MA RAN, ZHANG ZHAO-YANG, AN PING Key Laboratory of Advanced Displays and System Application, Ministry of Education School of Communication

More information

IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 5, No 2, September 2012 ISSN (Online):

IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 5, No 2, September 2012 ISSN (Online): www.ijcsi.org 146 Enhancing acket-level Forward Error Correction for Streaming Video in Wireless Networks Chi-Huang Shih Department of Computer Science and Information Engineering, HungKuang University

More information

Modified SPIHT Image Coder For Wireless Communication

Modified SPIHT Image Coder For Wireless Communication Modified SPIHT Image Coder For Wireless Communication M. B. I. REAZ, M. AKTER, F. MOHD-YASIN Faculty of Engineering Multimedia University 63100 Cyberjaya, Selangor Malaysia Abstract: - The Set Partitioning

More information

An Unequal Packet Loss Protection Scheme for H.264/AVC Video Transmission

An Unequal Packet Loss Protection Scheme for H.264/AVC Video Transmission An Unequal Packet Loss Protection Scheme for H.4/AVC Video Transmission Xingjun Zhang, Xiaohong Peng School of Engineering and Applied Science, Aston University Aston Triangle, Birmingham, B4 7ET, UK {x.zhang,

More information

Request for Comments: 5109 December 2007 Obsoletes: 2733, 3009 Category: Standards Track. RTP Payload Format for Generic Forward Error Correction

Request for Comments: 5109 December 2007 Obsoletes: 2733, 3009 Category: Standards Track. RTP Payload Format for Generic Forward Error Correction Network Working Group A. Li, Ed. Request for Comments: 5109 December 2007 Obsoletes: 2733, 3009 Category: Standards Track RTP Payload Format for Generic Forward Error Correction Status of This Memo This

More information

Nokia Q. Xie Motorola April 2007

Nokia Q. Xie Motorola April 2007 Network Working Group Request for Comments: 4867 Obsoletes: 3267 Category: Standards Track J. Sjoberg M. Westerlund Ericsson A. Lakaniemi Nokia Q. Xie Motorola April 2007 RTP Payload Format and File Storage

More information

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201 Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Why do we need to compress speech signals Basic components in a source coding

More information

Internet Streaming Media Alliance Ultravox Provisional Specification Version 1.0 November 2007

Internet Streaming Media Alliance Ultravox Provisional Specification Version 1.0 November 2007 Internet Streaming Media Alliance Ultravox Provisional Specification Version 1.0 November 2007 ISMA Ultravox Part 1: Introduction Page 1 of 6 November 2007 ISMA SPECIFICATION LIMITATIONS AND CONDITIONS

More information

Lecture 5: Error Resilience & Scalability

Lecture 5: Error Resilience & Scalability Lecture 5: Error Resilience & Scalability Dr Reji Mathew A/Prof. Jian Zhang NICTA & CSE UNSW COMP9519 Multimedia Systems S 010 jzhang@cse.unsw.edu.au Outline Error Resilience Scalability Including slides

More information

Introduction to LAN/WAN. Application Layer 4

Introduction to LAN/WAN. Application Layer 4 Introduction to LAN/WAN Application Layer 4 Multimedia Multimedia: Audio + video Human ear: 20Hz 20kHz, Dogs hear higher freqs DAC converts audio waves to digital E.g PCM uses 8-bit samples 8000 times

More information

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec

/ / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec / / _ / _ / _ / / / / /_/ _/_/ _/_/ _/_/ _\ / All-American-Advanced-Audio-Codec () **Z ** **=Z ** **= ==== == **= ==== \"\" === ==== \"\"\" ==== \"\"\"\" Tim O Brien Colin Sullivan Jennifer Hsu Mayank

More information

MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.

MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan. MODIFIED IMDCT-DECODER BASED MP3 MULTICHANNEL AUDIO DECODING SYSTEM Shanmuga Raju.S 1, Karthik.R 2, Sai Pradeep.K.P 3, Varadharajan.E 4 Assistant Professor, Dept. of ECE, Dr.NGP Institute of Technology,

More information

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error. ON VIDEO SNR SCALABILITY Lisimachos P. Kondi, Faisal Ishtiaq and Aggelos K. Katsaggelos Northwestern University Dept. of Electrical and Computer Engineering 2145 Sheridan Road Evanston, IL 60208 E-Mail:

More information

The new Hybrid approach to protect MPEG-2 video header

The new Hybrid approach to protect MPEG-2 video header The new Hybrid approach to protect MPEG-2 video header *YUK YING CHUNG, *XIANG ZHANG, *XIAOMING CHEN, *MOHD AFIZI MOHD SHUKRAN, **CHANGSEOK BAE *School of Information Technologies, University of Sydney,

More information

Multimedia Data Transmission over Mobile Internet using Packet-Loss Punctured (PaLoP) Codes

Multimedia Data Transmission over Mobile Internet using Packet-Loss Punctured (PaLoP) Codes Multimedia Data Transmission over Mobile Internet using Packet-Loss Punctured PaLoP Codes Markus Kaindl and Joachim Hagenauer Institute for Communications Engineering Munich University of Technology 9

More information

Image Error Concealment Based on Watermarking

Image Error Concealment Based on Watermarking Image Error Concealment Based on Watermarking Shinfeng D. Lin, Shih-Chieh Shie and Jie-Wei Chen Department of Computer Science and Information Engineering,National Dong Hwa Universuty, Hualien, Taiwan,

More information

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala Tampere University of Technology Korkeakoulunkatu 1, 720 Tampere, Finland ABSTRACT In

More information

Transporting audio-video. over the Internet

Transporting audio-video. over the Internet Transporting audio-video over the Internet Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP

More information

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS Ye-Kui Wang 1, Miska M. Hannuksela 2 and Moncef Gabbouj 3 1 Tampere International Center for Signal Processing (TICSP), Tampere,

More information

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Week 14. Video Compression. Ref: Fundamentals of Multimedia Week 14 Video Compression Ref: Fundamentals of Multimedia Last lecture review Prediction from the previous frame is called forward prediction Prediction from the next frame is called forward prediction

More information

Perceptually-Based Joint-Program Audio Coding

Perceptually-Based Joint-Program Audio Coding Audio Engineering Society Convention Paper Presented at the 113th Convention 2002 October 5 8 Los Angeles, CA, USA This convention paper has been reproduced from the author s advance manuscript, without

More information

MPEG-4 aacplus - Audio coding for today s digital media world

MPEG-4 aacplus - Audio coding for today s digital media world MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers

More information

2 Framework of The Proposed Voice Quality Assessment System

2 Framework of The Proposed Voice Quality Assessment System 3rd International Conference on Multimedia Technology(ICMT 2013) A Packet-layer Quality Assessment System for VoIP Liangliang Jiang 1 and Fuzheng Yang 2 Abstract. A packet-layer quality assessment system

More information

Optimal Estimation for Error Concealment in Scalable Video Coding

Optimal Estimation for Error Concealment in Scalable Video Coding Optimal Estimation for Error Concealment in Scalable Video Coding Rui Zhang, Shankar L. Regunathan and Kenneth Rose Department of Electrical and Computer Engineering University of California Santa Barbara,

More information

Interactive Progressive Encoding System For Transmission of Complex Images

Interactive Progressive Encoding System For Transmission of Complex Images Interactive Progressive Encoding System For Transmission of Complex Images Borko Furht 1, Yingli Wang 1, and Joe Celli 2 1 NSF Multimedia Laboratory Florida Atlantic University, Boca Raton, Florida 33431

More information

Active Concealment for Internet Speech Transmission

Active Concealment for Internet Speech Transmission Active Concealment for Internet Speech Transmission Long Le 1, Henning Sanneck 1, Georg Carle 1, and Tohru Hoshi 2 1 GMD Fokus, Kaiserin-Augusta-Allee 31, D-10589 Berlin, Germany {le,sanneck,carle}@fokus.gmd.de

More information

Video-Aware Wireless Networks (VAWN) Final Meeting January 23, 2014

Video-Aware Wireless Networks (VAWN) Final Meeting January 23, 2014 Video-Aware Wireless Networks (VAWN) Final Meeting January 23, 2014 1/26 ! Real-time Video Transmission! Challenges and Opportunities! Lessons Learned for Real-time Video! Mitigating Losses in Scalable

More information

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio: Audio Compression Audio Compression CD quality audio: Sampling rate = 44 KHz, Quantization = 16 bits/sample Bit-rate = ~700 Kb/s (1.41 Mb/s if 2 channel stereo) Telephone-quality speech Sampling rate =

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N15071 February 2015, Geneva,

More information

The Steganography In Inactive Frames Of Voip

The Steganography In Inactive Frames Of Voip The Steganography In Inactive Frames Of Voip This paper describes a novel high-capacity steganography algorithm for embedding data in the inactive frames of low bit rate audio streams encoded by G.723.1

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework System Modeling and Implementation of MPEG-4 Encoder under Fine-Granular-Scalability Framework Literature Survey Embedded Software Systems Prof. B. L. Evans by Wei Li and Zhenxun Xiao March 25, 2002 Abstract

More information

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Project Title: Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Midterm Report CS 584 Multimedia Communications Submitted by: Syed Jawwad Bukhari 2004-03-0028 About

More information

Motion Estimation for Video Coding Standards

Motion Estimation for Video Coding Standards Motion Estimation for Video Coding Standards Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Introduction of Motion Estimation The goal of video compression

More information

CS 335 Graphics and Multimedia. Image Compression

CS 335 Graphics and Multimedia. Image Compression CS 335 Graphics and Multimedia Image Compression CCITT Image Storage and Compression Group 3: Huffman-type encoding for binary (bilevel) data: FAX Group 4: Entropy encoding without error checks of group

More information

Compression transparent low-level description of audio signals

Compression transparent low-level description of audio signals University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 25 Compression transparent low-level description of audio signals Jason

More information

Error-Resilient Transmission of 3D Models

Error-Resilient Transmission of 3D Models Error-Resilient Transmission of 3D Models Ghassan Al-Regib 1, Yucel Altunbasak 1, and Jarek Rossignac 2 1 Center for Signal and Image Processing Georgia Institute of Technology Atlanta, Georgia, 30332-0250

More information

Networking Applications

Networking Applications Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services

More information

Systematic Lossy Error Protection for Video Transmission over Wireless Ad Hoc Networks

Systematic Lossy Error Protection for Video Transmission over Wireless Ad Hoc Networks Systematic Lossy Error Protection for Transmission over Wireless Ad Hoc Networks Xiaoqing Zhu, Shantanu Rane and Bernd Girod Information Systems Laboratory, Stanford University, Stanford, CA 94305 ABSTRACT

More information

Video-Aware Link Adaption

Video-Aware Link Adaption WHITE PAPER Video-Aware Link Adaption Enhances real-time video over LTE October 2012 InterDigital, Inc. All rights reserved. Contents 1 Executive Summary...2 2 InterDigital s Solution...4 2.1 Adaptation

More information

Video Compression An Introduction

Video Compression An Introduction Video Compression An Introduction The increasing demand to incorporate video data into telecommunications services, the corporate environment, the entertainment industry, and even at home has made digital

More information

RECOMMENDATION ITU-R BT.1720 *

RECOMMENDATION ITU-R BT.1720 * Rec. ITU-R BT.1720 1 RECOMMENDATION ITU-R BT.1720 * Quality of service ranking and measurement methods for digital video broadcasting services delivered over broadband Internet protocol networks (Question

More information

Improving the quality of H.264 video transmission using the Intra-Frame FEC over IEEE e networks

Improving the quality of H.264 video transmission using the Intra-Frame FEC over IEEE e networks Improving the quality of H.264 video transmission using the Intra-Frame FEC over IEEE 802.11e networks Seung-Seok Kang 1,1, Yejin Sohn 1, and Eunji Moon 1 1Department of Computer Science, Seoul Women s

More information