ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland

Size: px
Start display at page:

Download "ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland"

Transcription

1 ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland

2 OUTLINE Very short introduction to EVS Robustness EVS LSF robustness features Listening test results More results Summary Questions? 2

3 INTRODUCTION TO EVS EVS stands for Enhanced Voice Services Latest generation voice and audio codec for 3GPP and VoIP networks Introduces SWB and FB at low bitrates of 9.6 and 16.4 kbit/s Also supports legacy narrowband and wideband bandwidths Supports internal resampling between all supported sampling frequencies: 8, 16, 32 and 48 khz. Bitrates from 5.9 to 128 kbit/s. State-of-the-art quality with both speech and generic audio Communications codec with delay less than 32 ms Very robust against frame loss DTX available for all bitrates and bandwidths 3

4 ROBUSTNESS Robustness is needed in real communication networks Whenever frame is lost in communication channel it has to be replaced in real time in the decoder with best possible approximation If nothing is done, there would be either silent gaps or on the other extreme loud bangs, when the current signal model is not stable. EVS has several novel methods in several different domains to enhance robustness This paper discusses spectral modelling robustness features related to LSF quantization Listening test results account for all of the EVS robustness increasing methods 4

5 LSF ROBUSTNESS FEATURES. 5 Mode NB WB at bitrates <9.6kbps Inactive MA MA MA Unvoiced MA MA MA WB at bitrates 9.6kbps Voiced SN/AR SN/AR SN/AR Generic SN/AR SN/AR MA Transitio n SN SN SN Audio SN/AR SN/AR MA

6 LSF ROBUSTNESS FEATURES.. The purely predictive quantizer uses a moving average (MA) predictor. The auto-regressive (AR) prediction has higher coding gain but also higher recovery time after a frame loss. In order to limit sensitivity to frame losses, the AR predictive quantizer is used in conjunction with the safety net. Transition mode always uses the non-predictive quantizer, due to signal being by definition highly changing. Unvoiced and inactive modes always use MA-predictive coding Voiced, audio and generic modes use switched non-predictive/arpredictive LSF coding at low bitrates. For higher bitrates the MApredictor is used for generic and audio mode. 6

7 LSF ROBUSTNESS FEATURES... In case of switched coding the predictor usage is selected in closed loop, based on several criteria: - If non-predictive is good enough (SD <~1.0) use it. - If prediction helps only very little use non-predictive. - If there is already a very long streak of predictive frames prefer non-predictive frame time-to-time. In practice this means that for stable signal segments predictive coding is used quite often (over 85%), but when the signal is more unstable the quantizer automatically inserts non-predictive LSF codebook entries. 7

8 LSF OBJECTIVE RESULTS Even with high frame erasure rate of 10%, there are less than 5% frames with Spectral Distortion larger than 4dB. 8

9 LISTENING TESTING AMR, AMR-WB and EVS were compared against each other Tested 0%, 3%, 6%, 10% and 15% frame erasure rates. Listening test consisted of two tests: clean speech (DTX enabled) and noisy speech (DTX disabled) ACR9 test methodology was used: 1 (very bad) to 9 (excellent) scale without reference i.e. MOS test. Tested bitrates: Around for all AMR, AMR-WB and EVS. Additional test points at around 24 kbit/s (comparison to AMR-WB). EVS also tested 8, 9.6, 16.4, 32 and 48 kbit/s at various bandwidths. 24 naïve listeners in both tests; Finnish language; Sennheiser HD-650 headphones, diotical listening. Noise types were: street, cafeteria, car, and classical music at -15dB. 9

10 CLEAN SPEECH RESULTS. 10

11 CLEAN SPEECH RESULTS.. 11

12 CLEAN SPEECH RESULTS EVS is significantly more robust than either AMR or AMR-WB at all bitrates Especially impressive is that EVS-WB and EVS-SWB at 13.2 kbit/s with 15 % frame erasure rate provides approximately the same quality as AMR 12.2 at 3 % FER and AMR-WB at 6 % FER. Also worth noting is that EVS-FB 48 kbit/s provides better than direct NB voice quality even in maximum tested FER rate of 15 %. 12

13 NOISY SPEECH RESULTS. 13

14 NOISY SPEECH RESULTS.. 14

15 NOISY SPEECH RESULTS Noisy speech results are very similar to the clean speech results. For some reason 10 % FER rate EVS seems to work somewhat better with noisy speech compared to clean speech. Background noise likely masks some audible effects that are audible in clean speech. Overall the quality drops very linearly with the increasing frame erasure rate. 15

16 COMBINED RESULTS AT LOW RATES 16

17 RESULTS AT LOW BITRATES.. As can be seen EVS-SWB 13.2k with 6 % FER rate provides better than any clean channel AMR / AMR-WB coding mode. Overall it could be estimated that EVS provides additional 5-6 percentage points of additional FER robustness margin compared to AMR-WB and about 10 percentage points more robustness compared to AMR 12.2 kbit/s. Thus EVS provides the same voice quality than earlier generation voice codec, at the same bitrate, although the channel contains significantly more channel errors. 17

18 COMBINED RESULTS HIGH RATES 18

19 RESULTS AT HIGH BITRATES AMR-WB kbit/s is at least 1.2 MOS point worse than EVS at 16.4 kbit/s over all FER rates. EVS-FB 48 kbit/s provides statistically equivalent quality to direct FB signal at 0% FER. Even with extremely high FER rate of 15 % EVS-FB 48 kbit/s is better than direct narrowband signal or AMR-WB kbit/s at 6 % FER rate. 19

20 DEMOSAMPLES AMR % AMR-WB % EVS 13.2 no FER AMR % AMR-WB % EVS % AMR % EVS 48 0% EVS 48 10% 20

21 SUMMARY EVS is extremely robust against frame erasures In clean channel performance it is transparent to original (FB 48kbit/s) 21

22 QUESTIONS? 22

23 23

24 BACKUP SLIDES Combined results in full screen by FER rate Combined results in full screen by bitrate 24

25 COMBINED CURVES 25

26 COMBINED CURVES 26

ROBUST SPEECH CODING WITH EVS. Nokia Technologies, Tampere, Finland

ROBUST SPEECH CODING WITH EVS. Nokia Technologies, Tampere, Finland ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Technologies, Tampere, Finland anssi.ramo@nokia.com, adriana.vasilache@nokia.com, henri.toukomaa@nokia.com ABSTRACT

More information

Date. Next Generation in Speech Quality ETSI STQ Workshop, Nov 2012 Dr. Imre Varga Qualcomm Inc.

Date. Next Generation in Speech Quality ETSI STQ Workshop, Nov 2012 Dr. Imre Varga Qualcomm Inc. Date Enhanced Voice Services Next Generation in Speech Quality ETSI STQ Workshop, Nov 2012 Dr. Imre Varga Qualcomm Inc. Next Gen 3GPP Speech Coding for Improved User Experience AMR AMR-WB 4.75 kbps 12.2

More information

EVS Channel Aware Mode Robustness to Frame Erasures

EVS Channel Aware Mode Robustness to Frame Erasures INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA EVS Channel Aware Mode Robustness to Frame Erasures Anssi Rämö 1, Antti Kurittu 2, Henri Toukomaa 1 1 Nokia Technologies 2 Nokia Networks anssi.ramo@nokia.com,

More information

MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING

MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING Pravin Ramadas, Ying-Yi Li, and Jerry D. Gibson Department of Electrical and Computer Engineering, University of California,

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2700/INFSCI 1072 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

Presents 2006 IMTC Forum ITU-T T Workshop

Presents 2006 IMTC Forum ITU-T T Workshop Presents 2006 IMTC Forum ITU-T T Workshop G.729EV: An 8-32 kbit/s scalable wideband speech and audio coder bitstream interoperable with G.729 Presented by Christophe Beaugeant On behalf of ETRI, France

More information

A MULTI-RATE SPEECH AND CHANNEL CODEC: A GSM AMR HALF-RATE CANDIDATE

A MULTI-RATE SPEECH AND CHANNEL CODEC: A GSM AMR HALF-RATE CANDIDATE A MULTI-RATE SPEECH AND CHANNEL CODEC: A GSM AMR HALF-RATE CANDIDATE S.Villette, M.Stefanovic, A.Kondoz Centre for Communication Systems Research University of Surrey, Guildford GU2 5XH, Surrey, United

More information

The BroadVoice Speech Coding Algorithm. Juin-Hwey (Raymond) Chen, Ph.D. Senior Technical Director Broadcom Corporation March 22, 2010

The BroadVoice Speech Coding Algorithm. Juin-Hwey (Raymond) Chen, Ph.D. Senior Technical Director Broadcom Corporation March 22, 2010 The BroadVoice Speech Coding Algorithm Juin-Hwey (Raymond) Chen, Ph.D. Senior Technical Director Broadcom Corporation March 22, 2010 Outline 1. Introduction 2. Basic Codec Structures 3. Short-Term Prediction

More information

Speech-Coding Techniques. Chapter 3

Speech-Coding Techniques. Chapter 3 Speech-Coding Techniques Chapter 3 Introduction Efficient speech-coding techniques Advantages for VoIP Digital streams of ones and zeros The lower the bandwidth, the lower the quality RTP payload types

More information

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS TECHNICAL PAPER Enhanced Voice Services (EVS) Codec Until now, telephone services have generally failed to offer a high-quality audio experience due to limitations such as very low audio bandwidth and

More information

Speech and audio coding

Speech and audio coding Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples

More information

Real-time Audio Quality Evaluation for Adaptive Multimedia Protocols

Real-time Audio Quality Evaluation for Adaptive Multimedia Protocols Real-time Audio Quality Evaluation for Adaptive Multimedia Protocols Lopamudra Roychoudhuri and Ehab S. Al-Shaer School of Computer Science, Telecommunications and Information Systems, DePaul University,

More information

The MPEG-4 General Audio Coder

The MPEG-4 General Audio Coder The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution

More information

ETSI TS V ( )

ETSI TS V ( ) TS 126 441 V12.0.0 (2014-10) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; EVS Codec General Overview (3GPP TS 26.441 version 12.0.0 Release 12) 1 TS 126 441 V12.0.0 (2014-10)

More information

ETSI TS V ( )

ETSI TS V ( ) TS 126 446 V12.0.0 (2014-10) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; EVS Codec AMR-WB Backward Compatible Functions (3GPP TS 26.446 version 12.0.0 Release 12) 1

More information

5: Music Compression. Music Coding. Mark Handley

5: Music Compression. Music Coding. Mark Handley 5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the

More information

Data Compression. Audio compression

Data Compression. Audio compression 1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic

More information

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing. SAOC and USAC Spatial Audio Object Coding / Unified Speech and Audio Coding Lecture Audio Coding WS 2013/14 Dr.-Ing. Andreas Franck Fraunhofer Institute for Digital Media Technology IDMT, Germany SAOC

More information

INTERNATIONAL TELECOMMUNICATION UNION

INTERNATIONAL TELECOMMUNICATION UNION INTERNATIONAL TELECOMMUNICATION UNION TELECOMMUNICATION STANDARDIZATION SECTOR STUDY PERIOD 2001-2004 English only Original: English Question(s): 9/16 Geneva, 20-30 May 2003 LIAISON STATEMENT Source: ITU-T

More information

the Audio Engineering Society. Convention Paper Presented at the 120th Convention 2006 May Paris, France

the Audio Engineering Society. Convention Paper Presented at the 120th Convention 2006 May Paris, France Audio Engineering Society Convention Paper Presented at the 120th Convention 2006 May 20 23 Paris, France This convention paper has been reproduced from the author s advance manuscript, without editing,

More information

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

New Results in Low Bit Rate Speech Coding and Bandwidth Extension Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

More information

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding Perceptual Pre-weighting and Post-inverse weighting for Speech Coding Niranjan Shetty and Jerry D. Gibson Department of Electrical and Computer Engineering University of California, Santa Barbara, CA,

More information

Bandwidth Planning in your Cisco Webex Meetings Environment

Bandwidth Planning in your Cisco Webex Meetings Environment White Paper Bandwidth Planning in your Cisco Webex Meetings Environment White Paper 2018 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public Information. Page 1 of 16 Contents

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

Research Article Wideband Speech Recovery Using Psychoacoustic Criteria

Research Article Wideband Speech Recovery Using Psychoacoustic Criteria Hindawi Publishing Corporation URASIP Journal on Audio, Speech, and Music Processing Volume 7, Article ID 16816, 18 pages doi:1.1155/7/16816 Research Article Wideband Speech Recovery Using Psychoacoustic

More information

Opus, a free, high-quality speech and audio codec

Opus, a free, high-quality speech and audio codec Opus, a free, high-quality speech and audio codec Jean-Marc Valin, Koen Vos, Timothy B. Terriberry, Gregory Maxwell 29 January 2014 What is Opus? New highly-flexible speech and audio codec Works for most

More information

Dusseldorf, Germany Agenda item: th -20 th June, Status Report of SMG11 at SMG#32

Dusseldorf, Germany Agenda item: th -20 th June, Status Report of SMG11 at SMG#32 ETSI TC SMG#32 Tdoc SMG P-00-269 Dusseldorf, Germany Agenda item: 6.10 19 th -20 th June, 2000 Source: Chairman, SMG11 * Status Report of SMG11 at SMG#32 Executive Summary This document provides an overview

More information

Abstract. 1. Introduction

Abstract. 1. Introduction Wideband Speech Coding Standards and Applications Abstract Increasing the bandwidth of sound signals from the telephone bandwidth of 200-3400 Hz to the wider bandwidth of 50-7000 Hz results in increased

More information

dimensions are comparable to existing ACQUAlab front ends. Numerous important interfaces are already available in the basic unit, such as:

dimensions are comparable to existing ACQUAlab front ends. Numerous important interfaces are already available in the basic unit, such as: Data Sheet labcore (Code 7700) ACQUAlab modular multi-channel front end for speech and audio quality testing Overview labcore front view (with optional modules) DESCRIPTION labcore is the new front end

More information

Meeting #29 Agenda items: rd 25 th June, 1999, Miami. Adaptive Multi-Rate Wideband (AMR-WB) Feasibility study report. Version 1.0.

Meeting #29 Agenda items: rd 25 th June, 1999, Miami. Adaptive Multi-Rate Wideband (AMR-WB) Feasibility study report. Version 1.0. ETSI TC SMG Tdoc SMG P-99-429 Meeting #29 Agenda items: 6.10 23 rd 25 th June, 1999, Miami Source: SMG11 Adaptive Multi-Rate Wideband (AMR-WB) Feasibility study report Version 1.0.0 Page 2 Table of Contents

More information

System Identification Related Problems at SMN

System Identification Related Problems at SMN Ericsson research SeRvices, MulTimedia and Networks System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2015-04-28 Page 1 Outline Research

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573 Japan. E-mail: kitawaki@cs.tsukuba.ac.jp

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Audio Processing and Coding The objective of this lab session is to get the students familiar with audio processing and coding, notably psychoacoustic analysis

More information

On Improving the Performance of an ACELP Speech Coder

On Improving the Performance of an ACELP Speech Coder On Improving the Performance of an ACELP Speech Coder ARI HEIKKINEN, SAMULI PIETILÄ, VESA T. RUOPPILA, AND SAKARI HIMANEN Nokia Research Center, Speech and Audio Systems Laboratory P.O. Box, FIN-337 Tampere,

More information

ETSI TR V ( )

ETSI TR V ( ) TR 126 976 V14.0.0 (2017-04) TECHNICAL REPORT Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Performance characterization of the Adaptive

More information

MPEG-4 General Audio Coding

MPEG-4 General Audio Coding MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1 General Audio Coding Solid state players, Internet audio, terrestrial and

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

End-to-end speech and audio quality evaluation of networks using AQuA - competitive alternative for PESQ (P.862) Endre Domiczi Sevana Oy

End-to-end speech and audio quality evaluation of networks using AQuA - competitive alternative for PESQ (P.862) Endre Domiczi Sevana Oy End-to-end speech and audio quality evaluation of networks using AQuA - competitive alternative for PESQ (P.862) Endre Domiczi Sevana Oy Overview Significance of speech and audio quality Problems with

More information

(12) Patent Application Publication (10) Pub. No.: US 2012/ A1

(12) Patent Application Publication (10) Pub. No.: US 2012/ A1 (19) United States US 20120265523A1 (12) Patent Application Publication (10) Pub. No.: US 2012/0265523 A1 GREER et al. (43) Pub. Date: (54) (75) (73) (21) (22) (60) (51) FRAMIE ERASURE CONCEALMENT FOR

More information

Voice Quality Assessment for Mobile to SIP Call over Live 3G Network

Voice Quality Assessment for Mobile to SIP Call over Live 3G Network Abstract 132 Voice Quality Assessment for Mobile to SIP Call over Live 3G Network G.Venkatakrishnan, I-H.Mkwawa and L.Sun Signal Processing and Multimedia Communications, University of Plymouth, Plymouth,

More information

ETSI TS V (201

ETSI TS V (201 TS 126 442 V12.5.0 (201 16-01) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); ANSI C code (fixed-point) (3GPP TS 26.442 version

More information

System Identification Related Problems at

System Identification Related Problems at media Technologies @ Ericsson research (New organization Taking Form) System Identification Related Problems at MT@ER Erlendur Karlsson, PhD 1 Outline Ericsson Publications and Blogs System Identification

More information

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS Technical PapER Extended HE-AAC Bridging the gap between speech and audio coding One codec taking the place of two; one unified system bridging a troublesome gap. The fifth generation MPEG audio codec

More information

On the Importance of a VoIP Packet

On the Importance of a VoIP Packet On the Importance of a VoIP Packet Christian Hoene, Berthold Rathke, Adam Wolisz Technical University of Berlin hoene@ee.tu-berlin.de Abstract If highly compressed multimedia streams are transported over

More information

ETSI TS V (201

ETSI TS V (201 TS 126 443 V12.7.0 (201 16-10) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); ANSI C code (floating-point) (3GPP TS 26.443 version

More information

System Identification Related Problems at SMN

System Identification Related Problems at SMN Ericsson research SeRvices, MulTimedia and Network Features System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2016-05-09 Page 1 Outline Research

More information

Voice Over LTE (VoLTE) Technology. July 23, 2018 Tim Burke

Voice Over LTE (VoLTE) Technology. July 23, 2018 Tim Burke Voice Over LTE (VoLTE) Technology July 23, 2018 Tim Burke Range of Frequencies Humans Can Hear 20,000 Hz 20 Hz Human Hearing 8,000 Hz 10,000 Hz 14,000 Hz 12,000 Hz Range of Frequencies Designed For Entertainment

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics

More information

AUDIO. Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015

AUDIO. Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015 AUDIO Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015 Key objectives How do humans generate and process sound? How does digital sound work? How fast do I have to sample audio?

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,

More information

VoLTE to VoLTE Call Flow Originating Network LTE LTE IP Network IMS Terminating Network LTE Calling Called SIP ----SIP INVITE message ( UE --> IMS )---->

More information

Open AMR Initiative. Technical Documentation. Version 1.0 Revision

Open AMR Initiative. Technical Documentation. Version 1.0 Revision VoiceAge Corporation 750 Chemin Lucerne, Suite 250 Ville Mont-Royal (Quebec) H3R 2H6 Canada (514) 737-4940 Fax (514) 908-2037 www.voiceage.com Open AMR Initiative Technical Documentation Version 1.0 Revision

More information

A New Technique for Transceiver Location Data Over LTE Voice Channels

A New Technique for Transceiver Location Data Over LTE Voice Channels International Journal of Research in Engineering and Science (IJRES) ISSN (Online): 2320-9364, ISSN (Print): 2320-9356 Volume 4 Issue 10 ǁ October. 2016 ǁ PP.15-19 A New Technique for Transceiver Location

More information

VS1063 ENCODER DEMONSTRATION

VS1063 ENCODER DEMONSTRATION PRELIMINARY DOCUMENT VS1063 ENCODER DEMONSTRATION VLSI Solution Audio Decoder Project Code: Project Name: All information in this document is provided as-is without warranty. Features are subject to change

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

* Answer/end call requires EHS cable for desk phone

* Answer/end call requires EHS cable for desk phone SD Pro 2 ML SD PRO 2 ML SD Pro 2 ML is a double-sided premium wireless DECT headset for qualityconscious business professionals demanding exceptional performance and supreme comfort. Certified for Skype

More information

Microsoft Lync compatibility. Sennheiser Communications solutions overview

Microsoft Lync compatibility. Sennheiser Communications solutions overview Microsoft Lync compatibility Sennheiser Communications solutions overview Sennheiser Communications in brief + = Sennheiser Communications A/S is a joint venture between the highly successful electro-acoustics

More information

Optical Storage Technology. MPEG Data Compression

Optical Storage Technology. MPEG Data Compression Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the

More information

High comfort wearing styles with choice of headband and ear hook with leatherette

High comfort wearing styles with choice of headband and ear hook with leatherette SD Office SD OFFICE SD Office is a single-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme comfort. Designed for all-day use,

More information

GSM Network and Services

GSM Network and Services GSM Network and Services Voice coding 1 From voice to radio waves voice/source coding channel coding block coding convolutional coding interleaving encryption burst building modulation diff encoding symbol

More information

Chapter 14 MPEG Audio Compression

Chapter 14 MPEG Audio Compression Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1

More information

Quality of Service and Quality of T-Labs Berlin

Quality of Service and Quality of T-Labs Berlin Quality of Service and Quality of Experience @ T-Labs Berlin Sebastian Möller, Alexander Raake and Marcel Wältermann Quality and Usability Lab Deutsche Telekom Laboratories TU Berlin {sebastian.moeller,

More information

Rich Recording Technology Technical overall description

Rich Recording Technology Technical overall description Rich Recording Technology Technical overall description Ari Koski Nokia with Windows Phones Product Engineering/Technology Multimedia/Audio/Audio technology management 1 Nokia s Rich Recording technology

More information

* Answer/end call requires EHS cable for desk phone and Sennheiser software for certain softphones

* Answer/end call requires EHS cable for desk phone and Sennheiser software for certain softphones SD Pro 1 SD PRO 1 SD Pro 1 is a single-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme comfort. Designed for all-day use,

More information

Implementation of G.729E Speech Coding Algorithm based on TMS320VC5416 YANG Xiaojin 1, a, PAN Jinjin 2,b

Implementation of G.729E Speech Coding Algorithm based on TMS320VC5416 YANG Xiaojin 1, a, PAN Jinjin 2,b International Conference on Materials Engineering and Information Technology Applications (MEITA 2015) Implementation of G.729E Speech Coding Algorithm based on TMS320VC5416 YANG Xiaojin 1, a, PAN Jinjin

More information

User focus SD Office is designed to maximize productivity and flexibility in busy offices with its long distance wireless range up to 590 ft

User focus SD Office is designed to maximize productivity and flexibility in busy offices with its long distance wireless range up to 590 ft SD Office Note: Neckband available as accessory SD OFFICE SD Office is a single-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme

More information

VoLTE Performance Analysis and Evaluation in Real Networks

VoLTE Performance Analysis and Evaluation in Real Networks VoLTE Performance Analysis and Evaluation in Real Networks Bujar Krasniqi Faculty of Electrical and Computer Engineering University of Prishtina Prishtina, Kosovo bujar.krasniqi@uni-pr.edu Gentian Bytyqi

More information

Avaya compatibility. Sennheiser Communications solution overview

Avaya compatibility. Sennheiser Communications solution overview Avaya compatibility Sennheiser Communications solution overview Sennheiser Communications in brief + = Sennheiser Communications A/S is a joint venture between the highly successful electro-acoustics specialist

More information

Missing Frame Recovery Method for G Based on Neural Networks

Missing Frame Recovery Method for G Based on Neural Networks Missing Frame Recovery Method for G7231 Based on Neural Networks JARI TURUNEN & PEKKA LOULA Information Technology, Pori Tampere University of Technology Pohjoisranta 11, POBox 300, FIN-28101 Pori FINLAND

More information

MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding

MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding MPEG-4 Version 2 Audio Workshop: HILN - Parametric Audio Coding Heiko Purnhagen Laboratorium für Informationstechnologie University of Hannover, Germany Outline Introduction What is "Parametric Audio Coding"?

More information

Pan-European ecall employing AMR-WB and LTE CSFB Ralf Weber

Pan-European ecall employing AMR-WB and LTE CSFB Ralf Weber 19. ITG/VDE Fachtagung Mobilkommunikation, Osnabrück, Germany, May 21-22, 2014 Pan-European ecall employing AMR-WB and LTE CSFB Ralf Weber Disclaimer Not to be used, copied, reproduced in whole or in part,

More information

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS

ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS ERROR-ROBUST INTER/INTRA MACROBLOCK MODE SELECTION USING ISOLATED REGIONS Ye-Kui Wang 1, Miska M. Hannuksela 2 and Moncef Gabbouj 3 1 Tampere International Center for Signal Processing (TICSP), Tampere,

More information

14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP

14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyright by EURASIP TRADEOFF BETWEEN COMPLEXITY AND MEMORY SIZE IN THE 3GPP ENHANCED PLUS DECODER: SPEED-CONSCIOUS AND MEMORY- CONSCIOUS DECODERS ON A 16-BIT FIXED-POINT DSP Osamu Shimada, Toshiyuki Nomura, Akihiko Sugiyama

More information

Call me back on Skype

Call me back on Skype WHITEPAPER 2017 Call me back on Skype Special Edition for the International Telecoms Week, 14-17 May, Chicago For years international wholesale 600 500 400 300 200 100 0 International Traffic (billion

More information

Cisco Unified IP Phone 7942G and Cisco Unified IP Phone 7962G

Cisco Unified IP Phone 7942G and Cisco Unified IP Phone 7962G Phone 7942G and Phone 7962G General Questions Q. What are the Cisco Unified IP Phone 7962G and Phone 7942G? A. The Phone 7962G and Phone 7942G are part of a new suite of evolutionary Phone 7900 Series

More information

HD Voice and Wideband Codecs (HD-02) Panel Discussion (ITEXPO West 2009) September 02, 2009 Los Angeles, CA

HD Voice and Wideband Codecs (HD-02) Panel Discussion (ITEXPO West 2009) September 02, 2009 Los Angeles, CA A World Leader and Innovator In Wireless Technologies HD Voice and Wideband Codecs (HD-02) Panel Discussion (ITEXPO West 2009) September 02, 2009 Los Angeles, CA A. Ryan Heidari Director, Technology Marketing

More information

User focus DW Office ML is designed to maximize productivity and flexibility in busy offices. Note: Neckband available as accessory

User focus DW Office ML is designed to maximize productivity and flexibility in busy offices. Note: Neckband available as accessory DW Office ML Note: Neckband available as accessory DW OFFICE ML DW Office ML is a single-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance

More information

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law

More information

Aastra Telecom compatibility. Sennheiser Communications solution overview

Aastra Telecom compatibility. Sennheiser Communications solution overview Aastra Telecom compatibility Sennheiser Communications solution overview Sennheiser Communications in brief + = Sennheiser Communications A/S is a joint venture between the highly successful electro-acoustics

More information

VoIP Forgery Detection

VoIP Forgery Detection VoIP Forgery Detection Satish Tummala, Yanxin Liu and Qingzhong Liu Department of Computer Science Sam Houston State University Huntsville, TX, USA Emails: sct137@shsu.edu; yanxin@shsu.edu; liu@shsu.edu

More information

MPEG-4 aacplus - Audio coding for today s digital media world

MPEG-4 aacplus - Audio coding for today s digital media world MPEG-4 aacplus - Audio coding for today s digital media world Whitepaper by: Gerald Moser, Coding Technologies November 2005-1 - 1. Introduction Delivering high quality digital broadcast content to consumers

More information

Designed for all-day use, the DW Office connects directly to desk phone and softphone/pc to deliver excellent sound quality

Designed for all-day use, the DW Office connects directly to desk phone and softphone/pc to deliver excellent sound quality DW Office Note: Neckband available as accessory DW OFFICE DW Office is a single-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme

More information

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201 Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Why do we need to compress speech signals Basic components in a source coding

More information

Troubleshooting the 792xG Series Wireless IP Phone

Troubleshooting the 792xG Series Wireless IP Phone CHAPTER 3 Troubleshooting the 792xG Series Wireless IP Phone Understanding the 792xG Series Wireless IP Phone The Cisco Unified Wireless IP Phone 792xG Series are 802.11 dual-band wireless devices that

More information

(A simplified version of this document is available for applicants who had applied in previous years.)

(A simplified version of this document is available for applicants who had applied in previous years.) NYO Canada 2019 Auditions ( en français ) (A simplified version of this document is available for applicants who had applied in previous years.) Digital Audition Process Since 2013 NYO Canada has implemented

More information

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL

SPREAD SPECTRUM AUDIO WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL SPREAD SPECTRUM WATERMARKING SCHEME BASED ON PSYCHOACOUSTIC MODEL 1 Yüksel Tokur 2 Ergun Erçelebi e-mail: tokur@gantep.edu.tr e-mail: ercelebi@gantep.edu.tr 1 Gaziantep University, MYO, 27310, Gaziantep,

More information

Investigation of Algorithms for VoIP Signaling

Investigation of Algorithms for VoIP Signaling Journal of Electrical Engineering 4 (2016) 203-207 doi: 10.17265/2328-2223/2016.04.007 D DAVID PUBLISHING Todorka Georgieva 1, Ekaterina Dimitrova 2 and Slava Yordanova 3 1. Telecommunication Department,

More information

* Answer/end call requires EHS cable for desk phone

* Answer/end call requires EHS cable for desk phone DW Pro 2 ML dw PRO 2 ML DW Pro 2 ML is a double-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme comfort. Certified for Skype

More information

Brilliant. comfort. sound. exceptional. Mobile Business Series MB Pro 1

Brilliant. comfort. sound. exceptional. Mobile Business Series MB Pro 1 Mobile Business Series MB Pro 1 Brilliant sound exceptional comfort MB PRO 1 Sennheiser MB Pro 1 is a premium Bluetooth single-sided headset for business professionals who demand wireless communication

More information

User focus DW Pro 2 is designed to maximize productivity and flexibility in busy offices with its

User focus DW Pro 2 is designed to maximize productivity and flexibility in busy offices with its DW Pro 2 DW PRO 2 DW Pro 2 is a double-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme comfort. Designed for all-day use,

More information

3GPP TS V ( )

3GPP TS V ( ) TS 26.179 V13.1.0 (2016-06) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Mission Critical Push To Talk (MCPTT); Codecs and media

More information

Convention Paper 7215

Convention Paper 7215 Audio Engineering Society Convention Paper 7215 Presented at the 123rd Convention 2007 October 5 8 New York, NY, USA The papers at this Convention have been selected on the basis of a submitted abstract

More information

Determination of Bit-Rate Adaptation Thresholds for the Opus Codec for VoIP Services

Determination of Bit-Rate Adaptation Thresholds for the Opus Codec for VoIP Services Determination of Bit-Rate Adaptation Thresholds for the Opus Codec for VoIP Services Yi Han, Damien Magoni, Patrick McDonagh and Liam Murphy School of Computer Science and Informatics, University College

More information

AN EFFICIENT TRANSCODING SCHEME FOR G.729 AND G SPEECH CODECS: INTEROPERABILITY OVER THE INTERNET. Received July 2010; revised October 2011

AN EFFICIENT TRANSCODING SCHEME FOR G.729 AND G SPEECH CODECS: INTEROPERABILITY OVER THE INTERNET. Received July 2010; revised October 2011 International Journal of Innovative Computing, Information and Control ICIC International c 2012 ISSN 1349-4198 Volume 8, Number 7(A), July 2012 pp. 4635 4660 AN EFFICIENT TRANSCODING SCHEME FOR G.729

More information

around the office. The iconic design of the DW Pro 1 puts it in a class of its own.

around the office. The iconic design of the DW Pro 1 puts it in a class of its own. DW Pro 1 DW PRO 1 DW Pro 1 is a single-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme comfort. Designed for all-day use,

More information

RTP implemented in Abacus

RTP implemented in Abacus Spirent Abacus RTP implemented in Abacus 编号版本修改时间说明 1 1. Codec that Abacus supports. G.711u law G.711A law G.726 G.726 ITU G.723.1 G.729 AB (when VAD is YES, it is G.729AB, when No, it is G.729A) G.729

More information

MOHAMMAD ZAKI BIN NORANI THESIS SUBMITTED IN FULFILMENT OF THE DEGREE OF COMPUTER SCIENCE (COMPUTER SYSTEM AND NETWORKING)

MOHAMMAD ZAKI BIN NORANI THESIS SUBMITTED IN FULFILMENT OF THE DEGREE OF COMPUTER SCIENCE (COMPUTER SYSTEM AND NETWORKING) PERFORMANCE ANALYSIS OF 8KBPS VOICE CODEC (G.729, G.711 ALAW, G.711 ULAW) FOR VOIP OVER WIRELESS LOCAL AREA NETWORK WITH RESPECTIVE SIGNAL-TO- NOISE RATIO MOHAMMAD ZAKI BIN NORANI THESIS SUBMITTED IN FULFILMENT

More information

* Answer/end call requires EHS cable for desk phone and Sennheiser software for certain softphones

* Answer/end call requires EHS cable for desk phone and Sennheiser software for certain softphones DW Office DW OFFICE DW Office is a single-sided premium wireless DECT headset for quality-conscious business professionals demanding exceptional performance and supreme comfort. Designed for all-day use,

More information

Audiovisual QoS for communication over IP networks

Audiovisual QoS for communication over IP networks Audiovisual QoS for communication over IP networks Trond Ulseth Telenor R&I E-mail: trond.ulseth@telenor.com Effect of transmission performance on Multimedia Quality of Service, The path towards the Next

More information