Missing Frame Recovery Method for G Based on Neural Networks

Size: px
Start display at page:

Download "Missing Frame Recovery Method for G Based on Neural Networks"

Transcription

1 Missing Frame Recovery Method for G7231 Based on Neural Networks JARI TURUNEN & PEKKA LOULA Information Technology, Pori Tampere University of Technology Pohjoisranta 11, POBox 300, FIN Pori FINLAND Abstract: The quality of speech is an essential issue in the context of low bit rate video conferencing and voice over IP networking applications In this paper we present a lost frame recovery mechanism for the ITU- T G7231 recommendation The basic idea was to find a mechanism that can recover the missing frames from the past while preserving the speech quality as high as possible without delays The results show that the proposed multilayer perceptron neural network system can recover the missing speech frames from the previous frames accurately enough without having to wait for the next good frame as in the case of linear interpolation Key-Words: Frame estimation, multilayer perceptron, speech coding, error concealment, simulation 1 Introduction Most videoconferencing systems are based on International Telecommunication Union (ITU) H323 standard [2], which specifies that the audio and video stream packets are transferred over the Internet by using a real-time protocol (RTP), which is only intended to report packet losses and speed to both sending and receiving ends The H323 standard also utilizes the ordinary Internet protocol, Transport Control Protocol (TCP), for signaling and non-real time data transmission, for example document transfer Although the network technology and information transfer mechanisms are being developed all the time, speech transferring with several transform mechanisms suffers packet losses in low-bit rate communications According to [1] the global average packet loss in the Internet is 3% In some network servers the packet loss can be much higher than that in rush-hour times The rushhour traffic optimization algorithms in servers may themselves cause, for example, packet discards to the packet transfer TCP will take care of missed packets, but the packet loss indication, re-sending request and packet re-transmission will take too much time in real-time applications, and it can be seen as jammed video stream and heard as silence, if used instead of RTP, which is much more uncomfortable for the conference or call participant than RTP packet loss The network packet loss is harmful in real-time voice communications and video-conferencing systems even when using the RTP protocol The H323 standard based videoconferencing system uses ITU-T speech and audio coders G711, G722, G7231, G728, G729 and MPEG audio layer 1 We selected and examined the G7231 dual rate speech coder that is described in [3] It can operate at a maximum capacity with 63 kbit/s producing good quality sound, and if silence compression is included the bitrate varies between kbit/s This codec is especially suitable for future mobile multimedia systems that are very sensitive to the bandwidth capacity When considering the packet losses in G7231 there are some recovering mechanisms available, but in this paper a proposed neural network based frame estimator is described The G7231 decoding mechanism itself is equipped with a bursty recovery mechanism that estimates the missing LPC parameters and the residual vector by using information from the previous frame If the next two frames are lost the estimation continues and the output signal level is attenuated 25 db After three consecutive losses the output is muted completely An improvement regarding the G7231 recovery mechanism was made in [4] where the lost frame was interpolated from the preceding and following frames This improvement removes the high-energy spikes and metallic sound artifacts and can recover

2 up to 15 % of the lost frame rate Unfortunately this is made at the cost of increased algorithmic delay Linear interpolation is a common recovery mechanism in many speech coders, as for example in [5], and in their improvements However, the nonlinear processing of speech has proved to be a good mechanism, for example in [6] where the CELP-coder parameters are estimated more efficiently by using radial-basis neural networks than by using a linear predictor Another experiment where the nonlinear estimators worked well was made in [7] In this experiment the G728 residual information was estimated by using different techniques and the tests showed that the neural network based estimator produced the most intelligible sounding packets when compared to linear estimation methods In this paper we have concentrated on estimating the missing G7231 packets from the past data so that the recovery delay will not increase, as it will when using the linear interpolators The encoder and decoder have been left unchanged 2 Methods The background idea of using a neural network to estimate the lost frame information was derived from the multilayer perceptron s nonlinear properties and its possible ability to adapt speech properties better than other methods Linear interpolation is a good and simple way to estimate lost packets but there are two reasons why neural networks may be better First, if the neural network can learn and estimate the basic trends of the speech, then it can predict the lost packet In this way the lost packet delay and the resulted jittering problems are avoided or at least reduced Secondly, when compared with the efficiency of the linear estimator, the linear estimator will smooth out the missing packet too much by calculating the interpolation from previous and next arriving packets, so the output sound is mild It is impossible to predict speech parameters accurately because it would be practically predicting the future But if the neural network estimator can predict the speech parameters accurately enough then it has fulfilled the demands set on it We designed a packet recovery mechanism that can be implemented in the current G7231 coder/decoder mechanism in such a way that the base encoding/decoding system does not need any modifications A schematic diagram of the system is presented in Fig 1 x[n] G7231 coder Recovery mechanism Transmission chael Fig 1: The recovery structure G7231 decoder G7231 encoded speech stream frames consist of 189 bits of encoded speech in 63 kbit/s and 158 bits in 53 kbit/s If the silence compression is switched on the silent frames are transferred to the destination in 32 bits corresponding roughly to 10 kbit/s The encoder processes frames of 30 milliseconds or 240 samples of speech at once and calculates 10 Linear Predictive Code (LPC) parameters from the whole frame The LPC information is compressed with the Line Spectrum Pair (LSP) mechanism and quantized with three 8-bit codebooks The final values of the LPC are then sent by using only 24 bits [3] The frame is divided into 4 subframes, with 60 samples each Then the parameters common to both coding rates (the adaptive codebook lags and gain values) are extracted The pulse position and pulse sign values are encoder dependent values Finally the grid index values are added to the frame The structure of the G7231 codec is presented in Fig 2 Input speech LPC analyzer Frame division Adaptive codebook lags & gain values Pulse position & sign values Grid indices ^ x[n] Fig 2: The simplified diagram of G7231 codec Encoded frame The total number of bits is then either 189 for the 63 kbit/s or 158 for the 53 kbit/s Six previous frames of information 3 hidden layers with 9 x 6 x 3 rows Output layer (Estimated frame values) Fig 3: Neural network input/output structure The recovery mechanism presented in Fig 2 consists of a missing packet-monitoring/decision mechanism, a memory that will maintain essential information of six previous packets for current

3 packet estimation, and a multilayer perceptron neural network estimator for the indicated missing packet The base system was built from an ITU-T G7231 distributed source code The neural network was designed and converted to the C programming language by using University of Stuttgart neural network simulator software (SNNSv41) ruing on the Linux operating system The multilayer perceptron neural network is designed for predicting the estimates for the critical speech frame parameters (LSP parameters, amplitude, gain and pulse positions) from six previous frames The rest of the missing parameters, which are not so critical for the estimation process, were directly copied from the previous frame A diagram of the neural network is presented in Fig 3 As a reference, a linear interpolation method that will interpolate the missing information to the lost frame from the previous and following frames has been taken to the experiment 3 Experiments Approximately 10 minutes of spoken Fiish and English by several male, female and children speakers were collected to serve as training data so that everyone pronounced a sentence The spoken sentences were sampled at a 8000 Hz sampling frequency A minority of the sentences were spoken in an environment with background noise and distortion The training data was then encoded with G7231 codec and it consisted of frames of MLP Neural Network coded speech The coded frames were organized as training data so that six consecutive frames were organized for input data and the succeeding seventh frame for the output data The whole training set was introduced in a random order to the neural network estimator Our goal in the experiment was to find a system that would predict the missing packet from the past information, so that there is no need to wait for the next future value for interpolation This system must contain an adequate amount of information from the history for estimating the future transition The proposed experiment system will estimate the next missing frame from the past six frames The size of the neural network was obtained by using the network growing method described in [8] The initial network was selected to be a small network New neurons were added to the layers, and eventually new hidden layers were added to the network The number of nodes in the layers was selected experimentally The selection criterion for the final network size was to make the overall prediction error as small as possible The adding of new nodes was stopped in a situation where the bigger networks did not yield considerably better results The feed forward network size was chosen to be (1 input layer with 156 input nodes (6 x 26 values), 3 hidden layers with hidden nodes (9 x 26, 6 x 26 and 3 x 26 values respectively) and 1 output layer with 26 output nodes), with a nonlinear sigmoidal function The past six frames LSP parameters (6x10 nodes) Gains and amplitude (6x12 nodes) Pulse position (6x4 nodes) Input layer Hidden layers 1, 2 and 3 Output layer predicted frame (1x26 nodes) LSP parameters Gain and amplitudes Pulse positions Fig 4: Schematic diagram of the neural network topology

4 implemented to all layers After preliminary tests the network was divided into three parts to reduce the number of weights and thus the training time: the first part is dedicated to the LSP parameters (the input layer is 6 x 10 nodes), the second part (6 x 12 nodes) for gains and amplitudes and the third part for pulse position (6 x 4 nodes) Each unit is fully coected between the unit layers The structure of the multilayer perceptron network is presented in Fig 4 The selected network was trained by using backpropagation together with a momentum term The learning rate parameter η was kept as 02 In the simulation experiments the neural network was implemented directly in the G7231 decoder The speech data stream was manipulated with a random data loss module with predefined average loss value After the lost packet place is found in the stream the monitor module will request an estimate from the neural network based on the six previous values In order to ensure the quality of the prediction output in the test phase the network output was smoothed by averaging the output with the previous frame value After these operations the predicted and enhanced frame was decoded as a part of speech stream Three tests were conducted to study random packet losses: 5 %, 10 % and 15 % of the data In all cases the lost frame recovery was done by using neural networks and linear interpolation 5 % average loss 10 % average loss 15 % average loss 50 % loss (every second frame) 50 % average loss Lost Frame Arrived frame Fig 5: Speech stream loss experiments As a case study the 50 % frame loss situations were tested so that in the first study every second frame was lost Another experiment was done with randomly selected 1-2 consecutive frame losses in every second, third, or fourth frame The places were also randomly selected We then tried to recover these damages with neural networks and linear extrapolation/ interpolation This means that the lost and recovered frames will be a part of the next lost frames input information, so there will be a cumulative effect in the recovery process An example of the speech stream loss simulation experiments is presented in Fig 5 4 Results Ten people aged participated in the Mean Opinion Score (MOS) test described in [9] The listeners were first introduced to the reference signal and then they could listen to the test signals in the order they desired This test was done at Tampere University of Technology, Pori unit, by using the Intranet The test persons could listen to these signals in their own environments and judge all the signals by using the Pori unit s web server The objective quality method of the study was the segmental Signal-to-Noise ratio (SEG SNR ): mj 2 s ( n) 1 1 M n= mj N+ 1 SNR = 10log (1) SEG 10 mj M j= 0 2 ( ) ˆ( ) s n s n = + n mj N 1 where the s(n) is the original speech sample at time n, and (n) is the encoded/decoded speech sample at time n, M is the number of segments, m j is the end of the current segment and N is the segment length In Tables 1 and 2 the MOS test results and SEG SNR results are presented The segment length to be compared between the reference and test signals is 1000 samples, and the silence sections are taken out when the energy level is close to zero The segmental SNR values are averaged and presented in decibels In Tables 1 and 2 the Ref is an abbreviation for reference signal In Table 1 the 5 % means an average 5 % frame loss which is recovered with neural networks The 10 % lin is for a 10 % average frame loss recovered with linear interpolation Table 1: The MOS and SNR results for 5 %, 10 % and 15 % frame loss recovery Ref 5 % 5 % lin 10 % 10 % lin 15 % 15 % lin MOS SNR

5 In Table 2 the 50 % fr means the 50 % frame loss in the case study where every second packet is lost and recovered with neural networks The 50 % rand linear means that randomly selected 1-2 consecutive frames are lost and interpolated/extrapolated with linear methods Table 2: The MOS and SNR values for 50 % frame loss experiments Ref 50% fr 50% fr linear 50% rand 50 % rand linear MOS SNR Discussion The neural network results are comparable with linear predicted results and the neural network preserves the intelligibility as well as or even better than the linear prediction in 5%, 10% and 15% frame losses The test showed that the neural network could estimate the parameters accurately enough in real-time situations It should also be noted that there is only past information available for the neural network estimator while the linear interpolator can utilize the future information For every lost frame the total enhancement or recovery saving is 30 milliseconds with the neural network This is especially important in real-time applications and videoconferencing systems when the quality of speech is in concern Another thing that should be taken into account is that the original coder and decoder can be kept unchanged, which is extremely important in systems that are built to work respecting certain standards, such as H323 The losses near 50 % gave similar results in both recovery methods The sound quality was poor in the 50 % experiment because every second frame was lost, but the neural network reconstructed speech was still more understandable than the linearly predicted speech This is obvious because the linear prediction will average the missing packet to join together the previous and the next frame, while the neural network tries to predict what will come next The SNR emphasizes neural networks in the 50 % frame loss case more than linear interpolation and supports the understandability On the other hand the MOS scale gave a slightly better result in to the linear interpolation than to neural networks in every second frame loss This is obvious because the human ear likes more smoother and steady sounds more than coarse sounds containing a little bit more information The packet loss limit is % of the stream where the understandability starts to weaken radically Only parts of the sentences are understandable without enhancement and nonredundant information is totally lost These facts raised an idea to test the ability of neural networks to solve the prediction problem in a very difficult situation However, as mentioned before, a neural network caot predict the next frame precisely because that is basically artificial prediction of the future The total reconstruction delay is smaller with neural networks than with linear interpolation, due to the designed structure This is caused by the fact that the neural network will predict the future solely based on past information and there is no need to wait for the next good frame to come The algorithmic calculation time is longer in the neural network approach (21650 multiplications and 504 summations / predicted frame) than with the linear interpolation (26 divisions and 26 summations / interpolated frame) but the results are better and there is no waiting time for the next good frame The network topology is also our concern What makes the feedforward multilayer perceptron network better than other topologies in our case? When compared with for example recurrent or timedelayed networks the difference is that a basic recurrent network must be updated continuously with the stream information In our experiment the idea was to call the neural network when necessary and give it all the information needed for estimation Self Organizing Maps are designed for classification and they are suitable only if the output is limited to a finite number of possibilities Radial-Basis functions can provide the right type of output estimate but the full non-linearity of the feedforward multilayer perceptron network was preferred more in the experiment Speech has certain long-term trends that a neural network can track and estimate However, the optimal size of the network for frame recovery is very difficult to determine as is the correct number of training data samples The speech samples that are compressed to a finite G7231 parameter form still have a huge number of possible combinations when it comes to the recovery process and tracking This raised a further vision of a finite number of solutions where the network will learn only a certain number of output possibilities that have been generalized from the training data with vector quantization methods This approach will reduce the network size and thus the number of calculations

6 The other topologies of the neural networks must then be taken into account The adaptive codebook causes the worst problems in the recovery because it is impossible to estimate the missing adaptive codebook index The only solution was to copy the information from the previous frame adaptive codebook index References: [1] [2] ITU-T H323 Packet based multimedia communication systems 2/ pages [3] ITU-T G7231 Aex A: Dual rate speech coder for multimedia communications transmitting at 53 kbit/s and 63 kbit/s 11/96 27 pages [4] Grant Ho, Suat Yeldener & Marion Baraniecki Improved Lost Frame Recovery Techniques for ITU-T G7231 Speech Coding System, Proceedings of EUSIPCO98, Greece, pp [5] Aamir Husain & Vladimir Cuperman: Reconstruction of missing packets for celpbased speech coders Proceedings of IEEE ICAASP, Detroit 1995 volume 1 pp [6] Fernando Díaz-de-Maria & Aníbal R Figueiras-Vidal Radial Basis Functions for Nonlinear Prediction of speech in Analysis-by- Synthesis Coders IEEE ICASSP Detroit 1995 Topics in speech analysis, pp 788 [7] Jari Turunen & Pekka Loula: Assessment of Various Frame Recovery Mechanisms for G728 Proceedings of ISCOM99 Conference, Kaohsiung, Taiwan, 1999 [8] Simon Haykin Neural Networks Prentice Hall International, Inc 1999 [9] IEEE recommended practise for speech quality measurements IEEE Transactions on Audio and Electroacoustics, September 1969

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding Perceptual Pre-weighting and Post-inverse weighting for Speech Coding Niranjan Shetty and Jerry D. Gibson Department of Electrical and Computer Engineering University of California, Santa Barbara, CA,

More information

The Steganography In Inactive Frames Of Voip

The Steganography In Inactive Frames Of Voip The Steganography In Inactive Frames Of Voip This paper describes a novel high-capacity steganography algorithm for embedding data in the inactive frames of low bit rate audio streams encoded by G.723.1

More information

AN EFFICIENT TRANSCODING SCHEME FOR G.729 AND G SPEECH CODECS: INTEROPERABILITY OVER THE INTERNET. Received July 2010; revised October 2011

AN EFFICIENT TRANSCODING SCHEME FOR G.729 AND G SPEECH CODECS: INTEROPERABILITY OVER THE INTERNET. Received July 2010; revised October 2011 International Journal of Innovative Computing, Information and Control ICIC International c 2012 ISSN 1349-4198 Volume 8, Number 7(A), July 2012 pp. 4635 4660 AN EFFICIENT TRANSCODING SCHEME FOR G.729

More information

Synopsis of Basic VoIP Concepts

Synopsis of Basic VoIP Concepts APPENDIX B The Catalyst 4224 Access Gateway Switch (Catalyst 4224) provides Voice over IP (VoIP) gateway applications for a micro branch office. This chapter introduces some basic VoIP concepts. This chapter

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2700/INFSCI 1072 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

Voice Quality Assessment for Mobile to SIP Call over Live 3G Network

Voice Quality Assessment for Mobile to SIP Call over Live 3G Network Abstract 132 Voice Quality Assessment for Mobile to SIP Call over Live 3G Network G.Venkatakrishnan, I-H.Mkwawa and L.Sun Signal Processing and Multimedia Communications, University of Plymouth, Plymouth,

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

A MULTI-RATE SPEECH AND CHANNEL CODEC: A GSM AMR HALF-RATE CANDIDATE

A MULTI-RATE SPEECH AND CHANNEL CODEC: A GSM AMR HALF-RATE CANDIDATE A MULTI-RATE SPEECH AND CHANNEL CODEC: A GSM AMR HALF-RATE CANDIDATE S.Villette, M.Stefanovic, A.Kondoz Centre for Communication Systems Research University of Surrey, Guildford GU2 5XH, Surrey, United

More information

Speech-Coding Techniques. Chapter 3

Speech-Coding Techniques. Chapter 3 Speech-Coding Techniques Chapter 3 Introduction Efficient speech-coding techniques Advantages for VoIP Digital streams of ones and zeros The lower the bandwidth, the lower the quality RTP payload types

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

Evaluation of VoIP Speech Quality Using Neural Network

Evaluation of VoIP Speech Quality Using Neural Network Journal of Communication and Computer 12 (2015) 237-243 doi: 10.17265/1548-7709/2015.05.003 D DAVID PUBLISHING Evaluation of VoIP Speech Quality Using Neural Network Angel Garabitov and Aleksandar Tsenov

More information

Real-time Audio Quality Evaluation for Adaptive Multimedia Protocols

Real-time Audio Quality Evaluation for Adaptive Multimedia Protocols Real-time Audio Quality Evaluation for Adaptive Multimedia Protocols Lopamudra Roychoudhuri and Ehab S. Al-Shaer School of Computer Science, Telecommunications and Information Systems, DePaul University,

More information

Digital Speech Interpolation Advantage of Statistical Time Division Multiplexer

Digital Speech Interpolation Advantage of Statistical Time Division Multiplexer PJETS Volume 1, No 1, 2011, 9-13 ISSN: 2222-9930 print Digital Speech Interpolation Advantage of Statistical Time Division Multiplexer Asadullah Shah *, Krishan Lal Khatri ** Abstract This paper discusses

More information

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201 Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Why do we need to compress speech signals Basic components in a source coding

More information

MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING

MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING Pravin Ramadas, Ying-Yi Li, and Jerry D. Gibson Department of Electrical and Computer Engineering, University of California,

More information

Investigation of Algorithms for VoIP Signaling

Investigation of Algorithms for VoIP Signaling Journal of Electrical Engineering 4 (2016) 203-207 doi: 10.17265/2328-2223/2016.04.007 D DAVID PUBLISHING Todorka Georgieva 1, Ekaterina Dimitrova 2 and Slava Yordanova 3 1. Telecommunication Department,

More information

Alcatel OmniPCX Enterprise

Alcatel OmniPCX Enterprise Alcatel OmniPCX Enterprise QoS for VoIP Overview 1 OBJECTIVE: Describe the essential parameters for QoS The QoS parameters regarding the data network IP Packet Transfer Delay (IPTD): Time for the packet

More information

ABSTRACT. that it avoids the tolls charged by ordinary telephone service

ABSTRACT. that it avoids the tolls charged by ordinary telephone service ABSTRACT VoIP (voice over IP - that is, voice delivered using the Internet Protocol) is a term used in IP telephony for a set of facilities for managing the delivery of voice information using the Internet

More information

Advanced Video Coding: The new H.264 video compression standard

Advanced Video Coding: The new H.264 video compression standard Advanced Video Coding: The new H.264 video compression standard August 2003 1. Introduction Video compression ( video coding ), the process of compressing moving images to save storage space and transmission

More information

On Improving the Performance of an ACELP Speech Coder

On Improving the Performance of an ACELP Speech Coder On Improving the Performance of an ACELP Speech Coder ARI HEIKKINEN, SAMULI PIETILÄ, VESA T. RUOPPILA, AND SAKARI HIMANEN Nokia Research Center, Speech and Audio Systems Laboratory P.O. Box, FIN-337 Tampere,

More information

5: Music Compression. Music Coding. Mark Handley

5: Music Compression. Music Coding. Mark Handley 5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the

More information

ON-LINE SIMULATION MODULES FOR TEACHING SPEECH AND AUDIO COMPRESSION TECHNIQUES

ON-LINE SIMULATION MODULES FOR TEACHING SPEECH AND AUDIO COMPRESSION TECHNIQUES ON-LINE SIMULATION MODULES FOR TEACHING SPEECH AND AUDIO COMPRESSION TECHNIQUES Venkatraman Atti 1 and Andreas Spanias 1 Abstract In this paper, we present a collection of software educational tools for

More information

Impact of Voice Coding in Performance of VoIP

Impact of Voice Coding in Performance of VoIP Impact of Voice Coding in Performance of VoIP Batoul Alia Baker Koko 1, Dr. Mohammed Abaker 2 1, 2 Department of Communication Engineering, Al-Neelain University Abstract: Voice over Internet Protocol

More information

The MPEG-4 General Audio Coder

The MPEG-4 General Audio Coder The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution

More information

Transporting audio-video. over the Internet

Transporting audio-video. over the Internet Transporting audio-video over the Internet Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP

More information

MOHAMMAD ZAKI BIN NORANI THESIS SUBMITTED IN FULFILMENT OF THE DEGREE OF COMPUTER SCIENCE (COMPUTER SYSTEM AND NETWORKING)

MOHAMMAD ZAKI BIN NORANI THESIS SUBMITTED IN FULFILMENT OF THE DEGREE OF COMPUTER SCIENCE (COMPUTER SYSTEM AND NETWORKING) PERFORMANCE ANALYSIS OF 8KBPS VOICE CODEC (G.729, G.711 ALAW, G.711 ULAW) FOR VOIP OVER WIRELESS LOCAL AREA NETWORK WITH RESPECTIVE SIGNAL-TO- NOISE RATIO MOHAMMAD ZAKI BIN NORANI THESIS SUBMITTED IN FULFILMENT

More information

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala Tampere University of Technology Korkeakoulunkatu 1, 720 Tampere, Finland ABSTRACT In

More information

Multi-Pulse Based Code Excited Linear Predictive Speech Coder with Fine Granularity Scalability for Tonal Language

Multi-Pulse Based Code Excited Linear Predictive Speech Coder with Fine Granularity Scalability for Tonal Language Journal of Computer Science 6 (11): 1288-1292, 2010 ISSN 1549-3636 2010 Science Publications Multi-Pulse Based Code Excited Linear Predictive Speech Coder with Fine Granularity Scalability for Tonal Language

More information

Predicting the Quality Level of a VoIP Communication through Intelligent Learning Techniques

Predicting the Quality Level of a VoIP Communication through Intelligent Learning Techniques Predicting the Quality Level of a VoIP Communication through Intelligent Learning Techniques Demóstenes Zegarra Rodríguez, Renata Lopes Rosa, Graça Bressan University of São Paulo, São Paulo, Brazil {demostenes,

More information

Measurement QoS Parameters of VoIP Codecs as a Function of the Network Traffic Level

Measurement QoS Parameters of VoIP Codecs as a Function of the Network Traffic Level Measurement QoS Parameters of VoIP Codecs as a Function of the Network Traffic Level Jugoslav Jocić *, Zoran Veličković ** * Telekom Srbija AD, Prokuplje, Serbia ** College of Applied Technical Sciences,

More information

Overview. A Survey of Packet-Loss Recovery Techniques. Outline. Overview. Mbone Loss Characteristics. IP Multicast Characteristics

Overview. A Survey of Packet-Loss Recovery Techniques. Outline. Overview. Mbone Loss Characteristics. IP Multicast Characteristics A Survey of Packet-Loss Recovery Techniques Overview Colin Perkins, Orion Hodson and Vicky Hardman Department of Computer Science University College London (UCL) London, UK IEEE Network Magazine Sep/Oct,

More information

RTP: A Transport Protocol for Real-Time Applications

RTP: A Transport Protocol for Real-Time Applications RTP: A Transport Protocol for Real-Time Applications Provides end-to-end delivery services for data with real-time characteristics, such as interactive audio and video. Those services include payload type

More information

ITNP80: Multimedia! Sound-II!

ITNP80: Multimedia! Sound-II! Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent than for video data rate for CD-quality audio is much less than

More information

Data Compression. Audio compression

Data Compression. Audio compression 1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic

More information

Implementation of G.729E Speech Coding Algorithm based on TMS320VC5416 YANG Xiaojin 1, a, PAN Jinjin 2,b

Implementation of G.729E Speech Coding Algorithm based on TMS320VC5416 YANG Xiaojin 1, a, PAN Jinjin 2,b International Conference on Materials Engineering and Information Technology Applications (MEITA 2015) Implementation of G.729E Speech Coding Algorithm based on TMS320VC5416 YANG Xiaojin 1, a, PAN Jinjin

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

How to achieve low latency audio/video streaming over IP network?

How to achieve low latency audio/video streaming over IP network? February 2018 How to achieve low latency audio/video streaming over IP network? Jean-Marie Cloquet, Video Division Director, Silex Inside Gregory Baudet, Marketing Manager, Silex Inside Standard audio

More information

RECOMMENDATION ITU-R BT.1720 *

RECOMMENDATION ITU-R BT.1720 * Rec. ITU-R BT.1720 1 RECOMMENDATION ITU-R BT.1720 * Quality of service ranking and measurement methods for digital video broadcasting services delivered over broadband Internet protocol networks (Question

More information

ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland

ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland 2015-12-16 1 OUTLINE Very short introduction to EVS Robustness EVS LSF robustness features

More information

Audio and video compression

Audio and video compression Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and

More information

An application of neural networks to adaptive playout delay in VoIP

An application of neural networks to adaptive playout delay in VoIP An application of neural networks to adaptive playout delay in VoIP Ying Zhang 1, Damien Fay 1, Liam Kilmartin 2 1 Department of Mathematics, National University of Ireland, Galway damien.fay@nuigalway.ie

More information

ADAPTIVE PICTURE SLICING FOR DISTORTION-BASED CLASSIFICATION OF VIDEO PACKETS

ADAPTIVE PICTURE SLICING FOR DISTORTION-BASED CLASSIFICATION OF VIDEO PACKETS ADAPTIVE PICTURE SLICING FOR DISTORTION-BASED CLASSIFICATION OF VIDEO PACKETS E. Masala, D. Quaglia, J.C. De Martin Λ Dipartimento di Automatica e Informatica/ Λ IRITI-CNR Politecnico di Torino, Italy

More information

A Synchronization Scheme for Hiding Information in Encoded Bitstream of Inactive Speech Signal

A Synchronization Scheme for Hiding Information in Encoded Bitstream of Inactive Speech Signal Journal of Information Hiding and Multimedia Signal Processing c 2016 ISSN 2073-4212 Ubiquitous International Volume 7, Number 5, September 2016 A Synchronization Scheme for Hiding Information in Encoded

More information

Assessing Call Quality of VoIP and Data Traffic over Wireless LAN

Assessing Call Quality of VoIP and Data Traffic over Wireless LAN Assessing Call Quality of VoIP and Data Traffic over Wireless LAN Wen-Tzu Chen and Chih-Yuan Lee Institute of Telecommunications Management, National Cheng Kung University, No. 1 University Road, Tainan

More information

2 Framework of The Proposed Voice Quality Assessment System

2 Framework of The Proposed Voice Quality Assessment System 3rd International Conference on Multimedia Technology(ICMT 2013) A Packet-layer Quality Assessment System for VoIP Liangliang Jiang 1 and Fuzheng Yang 2 Abstract. A packet-layer quality assessment system

More information

Location Based Advanced Phone Dialer. A mobile client solution to perform voice calls over internet protocol. Jorge Duda de Matos

Location Based Advanced Phone Dialer. A mobile client solution to perform voice calls over internet protocol. Jorge Duda de Matos Location Based Advanced Phone Dialer A mobile client solution to perform voice calls over internet protocol Jorge Duda de Matos Superior Institute of Technology (IST) Lisbon, Portugal Abstract Mobile communication

More information

Chapter 14 MPEG Audio Compression

Chapter 14 MPEG Audio Compression Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1

More information

15: OS Scheduling and Buffering

15: OS Scheduling and Buffering 15: OS Scheduling and ing Mark Handley Typical Audio Pipeline (sender) Sending Host Audio Device Application A->D Device Kernel App Compress Encode for net RTP ed pending DMA to host (~10ms according to

More information

ET4254 Communications and Networking 1

ET4254 Communications and Networking 1 Topic 2 Aims:- Communications System Model and Concepts Protocols and Architecture Analog and Digital Signal Concepts Frequency Spectrum and Bandwidth 1 A Communications Model 2 Communications Tasks Transmission

More information

Error Concealment Used for P-Frame on Video Stream over the Internet

Error Concealment Used for P-Frame on Video Stream over the Internet Error Concealment Used for P-Frame on Video Stream over the Internet MA RAN, ZHANG ZHAO-YANG, AN PING Key Laboratory of Advanced Displays and System Application, Ministry of Education School of Communication

More information

Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec

Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec Improved Context-Based Adaptive Binary Arithmetic Coding in MPEG-4 AVC/H.264 Video Codec Abstract. An improved Context-based Adaptive Binary Arithmetic Coding (CABAC) is presented for application in compression

More information

Lecture 7: Audio Compression & Coding

Lecture 7: Audio Compression & Coding EE E682: Speech & Audio Processing & Recognition Lecture 7: Audio Compression & Coding 1 2 3 Information, compression & quantization Speech coding Wide bandwidth audio coding Dan Ellis

More information

AUDIO. Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015

AUDIO. Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015 AUDIO Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015 Key objectives How do humans generate and process sound? How does digital sound work? How fast do I have to sample audio?

More information

GSM Network and Services

GSM Network and Services GSM Network and Services Voice coding 1 From voice to radio waves voice/source coding channel coding block coding convolutional coding interleaving encryption burst building modulation diff encoding symbol

More information

Network-Adaptive Video Coding and Transmission

Network-Adaptive Video Coding and Transmission Header for SPIE use Network-Adaptive Video Coding and Transmission Kay Sripanidkulchai and Tsuhan Chen Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA 15213

More information

MATLAB representation of neural network Outline Neural network with single-layer of neurons. Neural network with multiple-layer of neurons.

MATLAB representation of neural network Outline Neural network with single-layer of neurons. Neural network with multiple-layer of neurons. MATLAB representation of neural network Outline Neural network with single-layer of neurons. Neural network with multiple-layer of neurons. Introduction: Neural Network topologies (Typical Architectures)

More information

Packet Switching Networks Traffic Prediction Based on Radial Basis Function Neural Network

Packet Switching Networks Traffic Prediction Based on Radial Basis Function Neural Network JOURNAL OF APPLIED COMPUTER SCIENCE Vol. 18 No. 2 (2010), pp. 91-101 Packet Switching Networks Traffic Prediction Based on Radial Basis Function Neural Network Arkadiusz Zaleski, Tomasz Kacprzak Institute

More information

Mahdi Amiri. February Sharif University of Technology

Mahdi Amiri. February Sharif University of Technology Course Presentation Multimedia Systems Speech II Mahdi Amiri February 2014 Sharif University of Technology Speech Compression Road Map Based on Time Domain analysis Differential Pulse-Code Modulation (DPCM)

More information

To address these challenges, extensive research has been conducted and have introduced six key areas of streaming video, namely: video compression,

To address these challenges, extensive research has been conducted and have introduced six key areas of streaming video, namely: video compression, Design of an Application Layer Congestion Control for Reducing network load and Receiver based Buffering Technique for packet synchronization in Video Streaming over the Internet Protocol Mushfeq-Us-Saleheen

More information

On TCP friendliness of VOIP traffic

On TCP friendliness of VOIP traffic On TCP friendliness of VOIP traffic By Rashmi Parthasarathy WSU ID # 10975537 A report submitted in partial fulfillment of the requirements of CptS 555 Electrical Engineering and Computer Science Department

More information

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

Perceptual Quality Measurement and Control: Definition, Application and Performance

Perceptual Quality Measurement and Control: Definition, Application and Performance Perceptual Quality Measurement and Control: Definition, Application and Performance A. R. Prasad, R. Esmailzadeh, S. Winkler, T. Ihara, B. Rohani, B. Pinguet and M. Capel Genista Corporation Tokyo, Japan

More information

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law

More information

Presents 2006 IMTC Forum ITU-T T Workshop

Presents 2006 IMTC Forum ITU-T T Workshop Presents 2006 IMTC Forum ITU-T T Workshop G.729EV: An 8-32 kbit/s scalable wideband speech and audio coder bitstream interoperable with G.729 Presented by Christophe Beaugeant On behalf of ETRI, France

More information

White Paper Voice Quality Sound design is an art form at Snom and is at the core of our development utilising some of the world's most advance voice

White Paper Voice Quality Sound design is an art form at Snom and is at the core of our development utilising some of the world's most advance voice White Paper Voice Quality Sound design is an art form at and is at the core of our development utilising some of the world's most advance voice quality engineering tools White Paper - Audio Quality Table

More information

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7) EE799 -- Multimedia Signal Processing Multimedia Signal Compression VI (MPEG-4, 7) References: 1. http://www.mpeg.org 2. http://drogo.cselt.stet.it/mpeg/ 3. T. Berahimi and M.Kunt, Visual data compression

More information

Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 2015 Sharif University of Technology

Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 2015 Sharif University of Technology Course Presentation Multimedia Systems Speech II Hmid R. Rabiee Mahdi Amiri February 25 Sharif University of Technology Speech Compression Road Map Based on Time Domain analysis Differential Pulse-Code

More information

Technical White Paper for Huawei's Videoconferencing Network Diagnostics Tool (NLog)

Technical White Paper for Huawei's Videoconferencing Network Diagnostics Tool (NLog) Technical White Paper for Huawei's Videoconferencing Network Diagnostics Tool (NLog) Huawei Technologies Co., Ltd. All rights reserved. Contents 1 Videoconferencing Network Conditions... 3 2 Overall Description...

More information

Multimedia Communications

Multimedia Communications Multimedia Communications Directions and Innovations Introduction István Beszteri istvan.beszteri@hut.fi Multimedia Communications: Source Representations, Networks and Applications! Introduction! Networks

More information

Receiver-based adaptation mechanisms for real-time media delivery. Outline

Receiver-based adaptation mechanisms for real-time media delivery. Outline Receiver-based adaptation mechanisms for real-time media delivery Prof. Dr.-Ing. Eckehard Steinbach Institute of Communication Networks Media Technology Group Technische Universität München Steinbach@ei.tum.de

More information

The BroadVoice Speech Coding Algorithm. Juin-Hwey (Raymond) Chen, Ph.D. Senior Technical Director Broadcom Corporation March 22, 2010

The BroadVoice Speech Coding Algorithm. Juin-Hwey (Raymond) Chen, Ph.D. Senior Technical Director Broadcom Corporation March 22, 2010 The BroadVoice Speech Coding Algorithm Juin-Hwey (Raymond) Chen, Ph.D. Senior Technical Director Broadcom Corporation March 22, 2010 Outline 1. Introduction 2. Basic Codec Structures 3. Short-Term Prediction

More information

1 Introduction. 2 Speech Compression

1 Introduction. 2 Speech Compression Abstract In this paper, the effect of MPEG audio compression on HMM-based speech synthesis is studied. Speech signals are encoded with various compression rates and analyzed using the GlottHMM vocoder.

More information

Character Recognition Using Convolutional Neural Networks

Character Recognition Using Convolutional Neural Networks Character Recognition Using Convolutional Neural Networks David Bouchain Seminar Statistical Learning Theory University of Ulm, Germany Institute for Neural Information Processing Winter 2006/2007 Abstract

More information

Chapter 11: Understanding the H.323 Standard

Chapter 11: Understanding the H.323 Standard Página 1 de 7 Chapter 11: Understanding the H.323 Standard This chapter contains information about the H.323 standard and its architecture, and discusses how Microsoft Windows NetMeeting supports H.323

More information

SIMULATION FRAMEWORK MODELING

SIMULATION FRAMEWORK MODELING CHAPTER 5 SIMULATION FRAMEWORK MODELING 5.1 INTRODUCTION This chapter starts with the design and development of the universal mobile communication system network and implementation of the TCP congestion

More information

Maximising Bandwdith Efficency of Statistical Multiplexer Architecture using Frame Droping Methods

Maximising Bandwdith Efficency of Statistical Multiplexer Architecture using Frame Droping Methods Pak. j. eng. technol. sci. Volume 3, No 1, 2013, 11-22 ISSN: 2222-9930 print ISSN: 2224-2333 online Maximising Bandwdith Efficency of Statistical Multiplexer Architecture using Frame Droping Methods Asadullah

More information

Recommended Readings

Recommended Readings Lecture 11: Media Adaptation Scalable Coding, Dealing with Errors Some slides, images were from http://ip.hhi.de/imagecom_g1/savce/index.htm and John G. Apostolopoulos http://www.mit.edu/~6.344/spring2004

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

Key words: B- Spline filters, filter banks, sub band coding, Pre processing, Image Averaging IJSER

Key words: B- Spline filters, filter banks, sub band coding, Pre processing, Image Averaging IJSER International Journal of Scientific & Engineering Research, Volume 7, Issue 9, September-2016 470 Analyzing Low Bit Rate Image Compression Using Filters and Pre Filtering PNV ABHISHEK 1, U VINOD KUMAR

More information

A LOW-COMPLEXITY MULTIPLE DESCRIPTION VIDEO CODER BASED ON 3D-TRANSFORMS

A LOW-COMPLEXITY MULTIPLE DESCRIPTION VIDEO CODER BASED ON 3D-TRANSFORMS A LOW-COMPLEXITY MULTIPLE DESCRIPTION VIDEO CODER BASED ON 3D-TRANSFORMS Andrey Norkin, Atanas Gotchev, Karen Egiazarian, Jaakko Astola Institute of Signal Processing, Tampere University of Technology

More information

Troubleshooting Packet Loss. Steven van Houttum

Troubleshooting Packet Loss. Steven van Houttum Troubleshooting Packet Loss Steven van Houttum INTRODUCTION Steven van Houttum Consultant/Trainer MVP Office Server & Services (Skype for Business) TROUBLESHOOTING PACKET LOSS, IS IT REALLY THE NETWORK?

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

JOURNAL OF HUMANITIE COLLEGE No ISSN:

JOURNAL OF HUMANITIE COLLEGE No ISSN: JOURNAL OF HUMANITIE COLLEGE No. 7 2017 ISSN: 2226-0714 1 Journal of University College of Humanity Studies A Refereed Scientiefic Journal Pubilshed by The University College of Humanity Studies Republic

More information

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing. SAOC and USAC Spatial Audio Object Coding / Unified Speech and Audio Coding Lecture Audio Coding WS 2013/14 Dr.-Ing. Andreas Franck Fraunhofer Institute for Digital Media Technology IDMT, Germany SAOC

More information

An Effective Calibration of VOIP Internet Telephony Performance using VPN between PAC and PNS

An Effective Calibration of VOIP Internet Telephony Performance using VPN between PAC and PNS An Effective Calibration of VOIP Internet Telephony Performance using VPN between PAC and PNS Hyung Moo Kim, and Jae Soo Yoo, Member, IEEE Abstract In this paper, we have created VoIP terminals that use

More information

_APP B_549_10/31/06. Appendix B. Producing for Multimedia and the Web

_APP B_549_10/31/06. Appendix B. Producing for Multimedia and the Web 1-59863-307-4_APP B_549_10/31/06 Appendix B Producing for Multimedia and the Web In addition to enabling regular music production, SONAR includes a number of features to help you create music for multimedia

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

Speech and audio coding

Speech and audio coding Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 SUBJECTIVE AND OBJECTIVE QUALITY EVALUATION FOR AUDIO WATERMARKING BASED ON SINUSOIDAL AMPLITUDE MODULATION PACS: 43.10.Pr, 43.60.Ek

More information

PfR Voice Traffic Optimization Using Active Probes

PfR Voice Traffic Optimization Using Active Probes PfR Voice Traffic Optimization Using Active Probes This module documents a Performance Routing (PfR) solution that supports outbound optimization of voice traffic based on the voice metrics, jitter and

More information

Scalable Video Coding

Scalable Video Coding Introduction to Multimedia Computing Scalable Video Coding 1 Topics Video On Demand Requirements Video Transcoding Scalable Video Coding Spatial Scalability Temporal Scalability Signal to Noise Scalability

More information

VoP, Real-Time, Linux and RTLinux

VoP, Real-Time, Linux and RTLinux VoP, Real-Time, Linux and RTLinux By Vidyasagaran P All Rights Reserved COPYRIGHT: This document is a property of MultiTech Software Systems India Pvt. Ltd. No part of this document may be copied or reproduced

More information

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Perceptual Coding Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding Part II wrap up 6.082 Fall 2006 Perceptual Coding, Slide 1 Lossless vs.

More information

Single sided Real time PESQ Score Estimation

Single sided Real time PESQ Score Estimation Single sided Real time PESQ Score Estimation Sebastián Basterrech, Gerardo Rubino INRIA/Rennes Bretagne Atlantique, Rennes, France Martín Varela Converging Networks Laboratory VTT Technical Research Centre

More information

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR 2011-2012 / ODD SEMESTER QUESTION BANK SUB.CODE / NAME YEAR / SEM : IT1301 INFORMATION CODING TECHNIQUES : III / V UNIT -

More information

Introduction to Quality of Service

Introduction to Quality of Service Introduction to Quality of Service The use of IP as a foundation for converged networks has raised several issues for both enterprise IT departments and ISPs. IP and Ethernet are connectionless technologies

More information

Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology

Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology Course Presentation Multimedia Systems Speech II Mahdi Amiri February 2012 Sharif University of Technology Homework Original Sound Speech Quantization Companding parameter (µ) Compander Quantization bit

More information

Building Residential VoIP Gateways: A Tutorial Part Three: Voice Quality Assurance For VoIP Networks

Building Residential VoIP Gateways: A Tutorial Part Three: Voice Quality Assurance For VoIP Networks Building Residential VoIP Gateways: A Tutorial Part Three: Voice Quality Assurance For VoIP Networks by David Jarrett and Keith Buchanan, Senior Broadband Applications and VoIP Gateway Product Manager,

More information

Maintenance. Basic Reset. Basic Reset, page 1 Perform User Network Configuration Reset, page 3

Maintenance. Basic Reset. Basic Reset, page 1 Perform User Network Configuration Reset, page 3 Basic Reset, page 1 Perform Network Configuration Reset, page 3 Perform User Network Configuration Reset, page 3 Remove CTL File, page 4 Quality Report Tool, page 4 Voice Quality Monitoring, page 4 Cisco

More information

Networking Applications

Networking Applications Networking Dr. Ayman A. Abdel-Hamid College of Computing and Information Technology Arab Academy for Science & Technology and Maritime Transport Multimedia Multimedia 1 Outline Audio and Video Services

More information