System Identification Related Problems at SMN

Size: px
Start display at page:

Download "System Identification Related Problems at SMN"

Transcription

1 Ericsson research SeRvices, MulTimedia and Networks System Identification Related Problems at SMN Erlendur Karlsson SysId Related ER/SMN Ericsson External Page 1

2 Outline Research Ericsson Research System Identification related applications at SMN Important issues when dealing with real-world problems SysId Related ER/SMN Ericsson External Page 2

3 Research Ericsson Research Ericsson Research Blogg 5G Cloud Context Aware Communication Data and Knowledge Internet of Things LTE Media Coding SDN Security Sevice Systems Smart Cities SysId Related ER/SMN Ericsson External Page 3

4 Contextual Communiction Excavator MWC 2015 Excavator Excavators from Volvo CE Powerful Linux PC Python application with custom signaling built on top of OpenWebRTC Control Rig Simulator from Oryx and Volvo CE Mac OS X computer OS X Cocoa application with custom signaling built on top of OpenWebRTC Signaling Server SysId Related ER/SMN Ericsson External Page 4

5 Remote Excavation Technologies Spatial scene capture, both video and audio Spatial scene rendering, both video and audio Low latency real time communication Low latency remote control SysId Related ER/SMN Ericsson External Page 5

6 Media Processing Architecture Camera Microphone Array Excavator Audio Video Data OpenWebRTC Audio / Video Data Channel Audio Video Data Network Mobile, fixed, Network Audio Video Mobile, fixed, Data OpenWebRTC Audio / Video Data Channel Audio Video Data Oculus Rift DK2 Headphones Control Rig SysId Related ER/SMN Ericsson External Page 6

7 System Identification Related Applications at MMT Audio and Speech Coding Audio Mining (ASR) Audio Media Processing Acoustic Echo Cancellation Noise Suppression Voice Activity Detection Spatial Audio Capture Spatial Audio Rendering Video Coding (2D and 3D) Objective Quality Estimation of Encoded Audio and Video Congestion Control in IP Networks SysId Related ER/SMN Ericsson External Page 7

8 Audio and Speech Coding Clean speech signals can be modeled very efficiently with Code-Excited Linear Prediction (CELP) encoders (Based on ARX model of the speech signal) Music signals are better encoded with transform encoding methods (Subband filter banks, MDCT) Signal classification and hybrid encoding used to obtain efficient encoding of audio signals of varying content EVS (Enhanced Voice System) just standardized in 3GPP standardization Special EVS session at ICASSP 2015 in Australia SysId Related ER/SMN Ericsson External Page 8

9 CELP Speech Model SysId Related ER/SMN Ericsson External Page 9

10 Bitstream Bitstream EVS Speech/Audio Codec prototype HL structure Mode TD TD- BWE Improved AMR-WB technology Parametric high band Technology Linear Pred. + ACELP FCB variable sf. Linear prediction, energy/gain FD G.719-like Transform (LD-MDCT), block switching input VAD Mode Dec. TD (+TD-BWE) FD CNG wb WB SWB FB bandwidths TD AMRWB-like TD-BWE FD G.719 like FD-coding parametric 4 ~ Audio BW [khz] SysId Related ER/SMN Ericsson External Page 10

11 Acoustic Echo Cancellation Long echo impulse reponses: msec At 48 khz sampling : 14,400 24,000 samples SysId Related ER/SMN Ericsson External Page 11

12 Spatial Audio Capture Microphone arrays Filter design in the spatial and frequency domains Beamforming techniques Adaptive tracking of the most active speakers in a room SysId Related ER/SMN Ericsson External Page 12

13 Spatial Audio Rendering Spatial hearing 3D binaural rendering through Head Related Filtering (HRF) Very useful in 3D gaming and evolved communication solutions Spatial audio rendering onto any loudspeaker configuration SysId Related ER/SMN Ericsson External Page 13

14 Spatial Hearing SysId Related ER/SMN Ericsson External Page 14

15 Acoustic Wave Reception The listeners median plane Sound wave Left Head Related Filter (HRF) Contralateral ear Listener Right Head Related Filter (HRF) Ipsilateral ear Length L ITD = L/c where c=speed of sound SysId Related ER/SMN Ericsson External Page 15

16 ASR System Main Components Training Data Acoustic Models Applying Lexical Models Constraints Language Models Speech Signal Representation Feature Vector Search Recognized Words Speech recognition is the problem of deciding on How to represent the signal How to model the constraints How to search for the most optimal answer SysId Related ER/SMN Ericsson External Page 16

17 ASR System Solution Components Acoustic- Phonetic Modeling Pattern Recognition Finite-State Transducers Language Models Adaptation Acoustic Models Lexical Models Language Models Speech Signal Representation Search Recognized Words Speech Signal Representation Search Algorithms Vector Quantization & Clustering Hidden Markov Modeling Graphical Models Segmental Models GMMs Neural Networks SysId Related ER/SMN Ericsson External Page 17

18 Important issues when dealing with real-world problems Understand the strengths and weaknesses of the different identification methods Preprocessing the data before the optimization can be crucial Choose the minimization criterion with care and adapt it to the problem at hand Different type of regularization components in the criterion can make the difference between success and failure Some times a criterion having components in both the time and frequency domains will work, when single domain criterions fail. SysId Related ER/SMN Ericsson External Page 18

19 Important issues when dealing with real-world problems Some applications require classification based modelling, where the current model used depends on signal classification of some signals Many systems have to deal with spurious events This will require the detection of such events and special model updates when they are detected Monitoring of system model Hypothesis testing and estimation SysId Related ER/SMN Ericsson External Page 19

20 Erlendur Karlsson,

System Identification Related Problems at SMN

System Identification Related Problems at SMN Ericsson research SeRvices, MulTimedia and Network Features System Identification Related Problems at SMN Erlendur Karlsson SysId Related Problems @ ER/SMN Ericsson External 2016-05-09 Page 1 Outline Research

More information

System Identification Related Problems at

System Identification Related Problems at media Technologies @ Ericsson research (New organization Taking Form) System Identification Related Problems at MT@ER Erlendur Karlsson, PhD 1 Outline Ericsson Publications and Blogs System Identification

More information

Surrounded by High-Definition Sound

Surrounded by High-Definition Sound Surrounded by High-Definition Sound Dr. ChingShun Lin CSIE, NCU May 6th, 009 Introduction What is noise? Uncertain filters Introduction (Cont.) How loud is loud? (Audible: 0Hz - 0kHz) Introduction (Cont.)

More information

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing. SAOC and USAC Spatial Audio Object Coding / Unified Speech and Audio Coding Lecture Audio Coding WS 2013/14 Dr.-Ing. Andreas Franck Fraunhofer Institute for Digital Media Technology IDMT, Germany SAOC

More information

EE482: Digital Signal Processing Applications

EE482: Digital Signal Processing Applications Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu EE482: Digital Signal Processing Applications Spring 2014 TTh 14:30-15:45 CBC C222 Lecture 13 Audio Signal Processing 14/04/01 http://www.ee.unlv.edu/~b1morris/ee482/

More information

Keyword Recognition Performance with Alango Voice Enhancement Package (VEP) DSP software solution for multi-microphone voice-controlled devices

Keyword Recognition Performance with Alango Voice Enhancement Package (VEP) DSP software solution for multi-microphone voice-controlled devices Keyword Recognition Performance with Alango Voice Enhancement Package (VEP) DSP software solution for multi-microphone voice-controlled devices V1.19, 2018-12-25 Alango Technologies 1 Executive Summary

More information

REAL-TIME DIGITAL SIGNAL PROCESSING

REAL-TIME DIGITAL SIGNAL PROCESSING REAL-TIME DIGITAL SIGNAL PROCESSING FUNDAMENTALS, IMPLEMENTATIONS AND APPLICATIONS Third Edition Sen M. Kuo Northern Illinois University, USA Bob H. Lee Ittiam Systems, Inc., USA Wenshun Tian Sonus Networks,

More information

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

New Results in Low Bit Rate Speech Coding and Bandwidth Extension Audio Engineering Society Convention Paper Presented at the 121st Convention 2006 October 5 8 San Francisco, CA, USA This convention paper has been reproduced from the author's advance manuscript, without

More information

ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland

ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland ROBUST SPEECH CODING WITH EVS Anssi Rämö, Adriana Vasilache and Henri Toukomaa Nokia Techonologies, Tampere, Finland 2015-12-16 1 OUTLINE Very short introduction to EVS Robustness EVS LSF robustness features

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual coding Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal. Perceptual encoders, however, have been designed for the compression of general

More information

How to Change the Default Playback & Recording Audio Device. How to Change the Default Playback Device

How to Change the Default Playback & Recording Audio Device. How to Change the Default Playback Device How to Change the Default Playback & Recording Audio Device Sound is a very important part of our computing experience. We listen to music, do voice chat, watch movies, play games, record sound, etc. In

More information

Audio Coding Standards

Audio Coding Standards Audio Standards Kari Pihkala 13.2.2002 Tik-111.590 Multimedia Outline Architectural Overview MPEG-1 MPEG-2 MPEG-4 Philips PASC (DCC cassette) Sony ATRAC (MiniDisc) Dolby AC-3 Conclusions 2 Architectural

More information

Distributed Signal Processing for Binaural Hearing Aids

Distributed Signal Processing for Binaural Hearing Aids Distributed Signal Processing for Binaural Hearing Aids Olivier Roy LCAV - I&C - EPFL Joint work with Martin Vetterli July 24, 2008 Outline 1 Motivations 2 Information-theoretic Analysis 3 Example: Distributed

More information

SSL for Circular Arrays of Mics

SSL for Circular Arrays of Mics SSL for Circular Arrays of Mics Yong Rui, Dinei Florêncio, Warren Lam, and Jinyan Su Microsoft Research ABSTRACT Circular arrays are of particular interest for a number of scenarios, particularly because

More information

Speech-Coding Techniques. Chapter 3

Speech-Coding Techniques. Chapter 3 Speech-Coding Techniques Chapter 3 Introduction Efficient speech-coding techniques Advantages for VoIP Digital streams of ones and zeros The lower the bandwidth, the lower the quality RTP payload types

More information

USER MANUAL DUET PCS USB DESKTOP SPEAKERPHONE

USER MANUAL DUET PCS USB DESKTOP SPEAKERPHONE USER MANUAL DUET PCS USB DESKTOP SPEAKERPHONE DUET OVERVIEW Control Panel Buttons Connector Panel Loudspeaker Microphone THE DUET IS A HIGH-PERFORMANCE SPEAKERPHONE for desktop use that can cover small

More information

Speech and audio coding

Speech and audio coding Institut Mines-Telecom Speech and audio coding Marco Cagnazzo, cagnazzo@telecom-paristech.fr MN910 Advanced compression Outline Introduction Introduction Speech signal Music signal Masking Codeurs simples

More information

Nahimic Troubleshooting Instructions and Q&A The document applies to all MSI Notebook and Vortex product which supports Nahimic.

Nahimic Troubleshooting Instructions and Q&A The document applies to all MSI Notebook and Vortex product which supports Nahimic. Nahimic Troubleshooting Instructions and Q&A The document applies to all MSI Notebook and Vortex product which supports Nahimic. To know whether the product supports Nahimic or not, please visit MSI website

More information

Principles of Audio Coding

Principles of Audio Coding Principles of Audio Coding Topics today Introduction VOCODERS Psychoacoustics Equal-Loudness Curve Frequency Masking Temporal Masking (CSIT 410) 2 Introduction Speech compression algorithm focuses on exploiting

More information

Before starting the troubleshooting, make sure you have installed the latest version of audio driver and Nahimic on your notebook.

Before starting the troubleshooting, make sure you have installed the latest version of audio driver and Nahimic on your notebook. Nahimic Troubleshooting Instructions and Q&A Please refer to the Troubleshooting Instructions to resolve the problem, if you encounter any audio problem listed below. Audio playback: Low volume, weak,

More information

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS TECHNICAL PAPER Enhanced Voice Services (EVS) Codec Until now, telephone services have generally failed to offer a high-quality audio experience due to limitations such as very low audio bandwidth and

More information

AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS

AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS AUDIO SIGNAL PROCESSING FOR NEXT- GENERATION MULTIMEDIA COMMUNI CATION SYSTEMS Edited by YITENG (ARDEN) HUANG Bell Laboratories, Lucent Technologies JACOB BENESTY Universite du Quebec, INRS-EMT Kluwer

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd Overview Audio Signal Processing Applications @ Dolby Audio Signal Processing Basics

More information

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd Introducing Audio Signal Processing & Audio Coding 2013 Dolby Laboratories,

More information

Making an on-device personal assistant a reality

Making an on-device personal assistant a reality June 2018 @qualcomm_tech Making an on-device personal assistant a reality Qualcomm Technologies, Inc. AI brings human-like understanding and behaviors to the machines Perception Hear, see, and observe

More information

ETSI TS V ( )

ETSI TS V ( ) TS 126 441 V12.0.0 (2014-10) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; EVS Codec General Overview (3GPP TS 26.441 version 12.0.0 Release 12) 1 TS 126 441 V12.0.0 (2014-10)

More information

Optical Storage Technology. MPEG Data Compression

Optical Storage Technology. MPEG Data Compression Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the

More information

Embedded Audio & Robotic Ear

Embedded Audio & Robotic Ear Embedded Audio & Robotic Ear Marc HERVIEU IoT Marketing Manager Marc.Hervieu@st.com Voice Communication: key driver of innovation since 1800 s 2 IoT Evolution of Voice Automation: the IoT Voice Assistant

More information

Digital Speech Coding

Digital Speech Coding Digital Speech Processing David Tipper Associate Professor Graduate Program of Telecommunications and Networking University of Pittsburgh Telcom 2700/INFSCI 1072 Slides 7 http://www.sis.pitt.edu/~dtipper/tipper.html

More information

5: Music Compression. Music Coding. Mark Handley

5: Music Compression. Music Coding. Mark Handley 5: Music Compression Mark Handley Music Coding LPC-based codecs model the sound source to achieve good compression. Works well for voice. Terrible for music. What if you can t model the source? Model the

More information

Audio-coding standards

Audio-coding standards Audio-coding standards The goal is to provide CD-quality audio over telecommunications networks. Almost all CD audio coders are based on the so-called psychoacoustic model of the human auditory system.

More information

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS

Technical PapER. between speech and audio coding. Fraunhofer Institute for Integrated Circuits IIS Technical PapER Extended HE-AAC Bridging the gap between speech and audio coding One codec taking the place of two; one unified system bridging a troublesome gap. The fifth generation MPEG audio codec

More information

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends Contents List of Figures List of Tables Contributing Authors xiii xxi xxiii Introduction Karlheinz Brandenburg and Mark Kahrs xxix 1 Audio quality determination based on perceptual measurement techniques

More information

Chapter 14 MPEG Audio Compression

Chapter 14 MPEG Audio Compression Chapter 14 MPEG Audio Compression 14.1 Psychoacoustics 14.2 MPEG Audio 14.3 Other Commercial Audio Codecs 14.4 The Future: MPEG-7 and MPEG-21 14.5 Further Exploration 1 Li & Drew c Prentice Hall 2003 14.1

More information

xcore VocalFusion Speaker Evaluation Kit Quick Start Guide

xcore VocalFusion Speaker Evaluation Kit Quick Start Guide xcore VocalFusion Speaker Evaluation Kit Quick Start Guide IN THIS DOCUMENT Before you start Load XVF3100 firmware Setup Evaluation Voice Activity Detector Keyword detection Direction of Arrival indication

More information

User Manual. Please read this manual carefully before using the Phoenix Octopus

User Manual. Please read this manual carefully before using the Phoenix Octopus User Manual Please read this manual carefully before using the Phoenix Octopus For additional help and updates, refer to our website To contact Phoenix Audio for support, please send a detailed e-mail

More information

Synopsis of Basic VoIP Concepts

Synopsis of Basic VoIP Concepts APPENDIX B The Catalyst 4224 Access Gateway Switch (Catalyst 4224) provides Voice over IP (VoIP) gateway applications for a micro branch office. This chapter introduces some basic VoIP concepts. This chapter

More information

Parametric Coding of Spatial Audio

Parametric Coding of Spatial Audio Parametric Coding of Spatial Audio Ph.D. Thesis Christof Faller, September 24, 2004 Thesis advisor: Prof. Martin Vetterli Audiovisual Communications Laboratory, EPFL Lausanne Parametric Coding of Spatial

More information

ISO/IEC Information technology High efficiency coding and media delivery in heterogeneous environments. Part 3: 3D audio

ISO/IEC Information technology High efficiency coding and media delivery in heterogeneous environments. Part 3: 3D audio INTERNATIONAL STANDARD ISO/IEC 23008-3 First edition 2015-10-15 Corrected version 2016-03-01 Information technology High efficiency coding and media delivery in heterogeneous environments Part 3: 3D audio

More information

Data Compression. Audio compression

Data Compression. Audio compression 1 Data Compression Audio compression Outline Basics of Digital Audio 2 Introduction What is sound? Signal-to-Noise Ratio (SNR) Digitization Filtering Sampling and Nyquist Theorem Quantization Synthetic

More information

Speech User Interface for Information Retrieval

Speech User Interface for Information Retrieval Speech User Interface for Information Retrieval Urmila Shrawankar Dept. of Information Technology Govt. Polytechnic Institute, Nagpur Sadar, Nagpur 440001 (INDIA) urmilas@rediffmail.com Cell : +919422803996

More information

Inverse Filter Design for Crosstalk Cancellation in Portable Devices with Stereo Loudspeakers

Inverse Filter Design for Crosstalk Cancellation in Portable Devices with Stereo Loudspeakers Inverse Filter Design for Crosstalk Cancellation in Portable Devices with Stereo Loudspeakers Sung Dong Jo 1 and Seung o Choi 2,* 1 School of Information and Communications Gwangju Institute of Science

More information

Presents 2006 IMTC Forum ITU-T T Workshop

Presents 2006 IMTC Forum ITU-T T Workshop Presents 2006 IMTC Forum ITU-T T Workshop G.729EV: An 8-32 kbit/s scalable wideband speech and audio coder bitstream interoperable with G.729 Presented by Christophe Beaugeant On behalf of ETRI, France

More information

The MPEG-4 General Audio Coder

The MPEG-4 General Audio Coder The MPEG-4 General Audio Coder Bernhard Grill Fraunhofer Institute for Integrated Circuits (IIS) grl 6/98 page 1 Outline MPEG-2 Advanced Audio Coding (AAC) MPEG-4 Extensions: Perceptual Noise Substitution

More information

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding INTERNATIONAL STANDARD This is a preview - click here to buy the full publication ISO/IEC 23003-3 First edition 2012-04-01 Information technology MPEG audio technologies Part 3: Unified speech and audio

More information

2.4 Audio Compression

2.4 Audio Compression 2.4 Audio Compression 2.4.1 Pulse Code Modulation Audio signals are analog waves. The acoustic perception is determined by the frequency (pitch) and the amplitude (loudness). For storage, processing and

More information

DATA SHEET DEVIO DEVIO CR-1 CONFERENCE ROOM DEVICE

DATA SHEET DEVIO DEVIO CR-1 CONFERENCE ROOM DEVICE DATA SHEET DEVIO DEVIO CR-1 CONFERENCE ROOM DEVICE The Devio CR-1 is the hub that creates a desktop-like experience away from the desk, allowing you to connect to the AV technology in a meeting room simply

More information

Transporting audio-video. over the Internet

Transporting audio-video. over the Internet Transporting audio-video over the Internet Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP

More information

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony Nobuhiko Kitawaki University of Tsukuba 1-1-1, Tennoudai, Tsukuba-shi, 305-8573 Japan. E-mail: kitawaki@cs.tsukuba.ac.jp

More information

Audio Coding and MP3

Audio Coding and MP3 Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 11 Audio Engineering: Perceptual coding Coding and decoding Signal (analog) Encoder Code (Digital) Code (Digital) Decoder Signal (analog)

More information

Issues in Voice and Video Coding

Issues in Voice and Video Coding Issues in Voice and Video Coding Presented at ICME Panel I: Advances in Audio/Video Coding Technologies, July 12, 2011 Jerry D. Gibson Department of Electrical and Computer Engineering gibson@ece.ucsb.edu

More information

RTP implemented in Abacus

RTP implemented in Abacus Spirent Abacus RTP implemented in Abacus 编号版本修改时间说明 1 1. Codec that Abacus supports. G.711u law G.711A law G.726 G.726 ITU G.723.1 G.729 AB (when VAD is YES, it is G.729AB, when No, it is G.729A) G.729

More information

FEATURES. Ergonomic design. Premium stereo sound. LED backlight. Noise canceller and flexible Microphone

FEATURES. Ergonomic design. Premium stereo sound. LED backlight. Noise canceller and flexible Microphone FEATURES Ergonomic design Design for long gaming sessions, fully configurable by the player. Premium stereo sound "HIRAKEN" has premium stereo sound, further of an incredible virtual surround 7.1. Outstanding

More information

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved. Compressed Audio Demystified Why Music Producers Need to Care About Compressed Audio Files Download Sales Up CD Sales Down High-Definition hasn t caught on yet Consumers don t seem to care about high fidelity

More information

LIVE MUSIC PERFORMANCES OVER HIGH- SPEED IP NETWORKS

LIVE MUSIC PERFORMANCES OVER HIGH- SPEED IP NETWORKS LIVE MUSIC PERFORMANCES OVER HIGH- SPEED IP NETWORKS Stefan Karapetkov Polycom, Inc. e-mail: Stefan.Karapetkov@polycom.com ABSTRACT High-speed IP networks are creating opportunities for new kinds of real-time

More information

Best-in-class audio recording

Best-in-class audio recording Best-in-class audio recording Philips Voice Tracer range 2013 New Philips Voice Tracer range Best-in-class audio recording Only the perfect combination of audio quality & ease of use delivers the best

More information

Transforming. Noise. Introduction. Simulation. pairs. If they. can also - 1 -

Transforming. Noise. Introduction. Simulation. pairs. If they. can also - 1 - Transforming Sound into Knowledge Background Noise Simulation to ETSI ES Standardd 202 396-1 Introduction This sequencee calibrates a 4.1 speaker array in accordance to thee ETSI ES 202 396-1 standard

More information

AUDIO. Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015

AUDIO. Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015 AUDIO Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015 Key objectives How do humans generate and process sound? How does digital sound work? How fast do I have to sample audio?

More information

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 N15071 February 2015, Geneva,

More information

Mpeg 1 layer 3 (mp3) general overview

Mpeg 1 layer 3 (mp3) general overview Mpeg 1 layer 3 (mp3) general overview 1 Digital Audio! CD Audio:! 16 bit encoding! 2 Channels (Stereo)! 44.1 khz sampling rate 2 * 44.1 khz * 16 bits = 1.41 Mb/s + Overhead (synchronization, error correction,

More information

Realtek Audio Utility User Guide

Realtek Audio Utility User Guide Realtek Audio Utility User Guide DE118 Rev. 3 The Realtek audio CODEC provides 8-channel audio capability to deliver the ultimate audio experience on your computer. The sofftware provides Jack-sensing

More information

Yealink VC Microphone Profolio. Video Conferencing Phone VCP41. Key Features and Benefits

Yealink VC Microphone Profolio. Video Conferencing Phone VCP41. Key Features and Benefits Video Conferencing Phone VCP41 The Yealink Video Conferencing Phone VCP41 is a perfect choice for Video conferencing systems that delivers superior performance for small to medium-sized meeting rooms.

More information

Briefing. Briefing 100 People. Keep everyone s attention with the presenter front and center. C 2015 Cisco and/or its affiliates. All rights reserved.

Briefing. Briefing 100 People. Keep everyone s attention with the presenter front and center. C 2015 Cisco and/or its affiliates. All rights reserved. Briefing 100 People Keep everyone s attention with the presenter front and center. 2 1 Product ID Product CTS-SX80-IP60-K9 Cisco TelePresence Codec SX80 1 Included in CTS-SX80-IP60-K9 Cisco TelePresence

More information

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts ijdsp Interactive Illustrations of Speech/Audio Processing Concepts NSF Phase 3 Workshop, UCy Presentation of an Independent Study By Girish Kalyanasundaram, MS by Thesis in EE Advisor: Dr. Andreas Spanias,

More information

Parametric Coding of High-Quality Audio

Parametric Coding of High-Quality Audio Parametric Coding of High-Quality Audio Prof. Dr. Gerald Schuller Fraunhofer IDMT & Ilmenau Technical University Ilmenau, Germany 1 Waveform vs Parametric Waveform Filter-bank approach Mainly exploits

More information

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Audio Fundamentals, Compression Techniques & Standards Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011 Outlines Audio Fundamentals Sampling, digitization, quantization μ-law

More information

ETSI TS V (201

ETSI TS V (201 TS 126 442 V12.5.0 (201 16-01) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); ANSI C code (fixed-point) (3GPP TS 26.442 version

More information

Speech Technology Using in Wechat

Speech Technology Using in Wechat Speech Technology Using in Wechat FENG RAO Powered by WeChat Outline Introduce Algorithm of Speech Recognition Acoustic Model Language Model Decoder Speech Technology Open Platform Framework of Speech

More information

Audio Engineering Society. Convention Paper. Presented at the 126th Convention 2009 May 7 10 Munich, Germany

Audio Engineering Society. Convention Paper. Presented at the 126th Convention 2009 May 7 10 Munich, Germany Audio Engineering Society Convention Paper Presented at the 126th Convention 2009 May 7 10 Munich, Germany 7712 The papers at this Convention have been selected on the basis of a submitted abstract and

More information

Date. Next Generation in Speech Quality ETSI STQ Workshop, Nov 2012 Dr. Imre Varga Qualcomm Inc.

Date. Next Generation in Speech Quality ETSI STQ Workshop, Nov 2012 Dr. Imre Varga Qualcomm Inc. Date Enhanced Voice Services Next Generation in Speech Quality ETSI STQ Workshop, Nov 2012 Dr. Imre Varga Qualcomm Inc. Next Gen 3GPP Speech Coding for Improved User Experience AMR AMR-WB 4.75 kbps 12.2

More information

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio Dr. Jürgen Herre 11/07 Page 1 Jürgen Herre für (IIS) Erlangen, Germany Introduction: Sound Images? Humans

More information

HuddlePod Air Big Audio

HuddlePod Air Big Audio HuddlePod Air Big Audio WIRELESS AUDIO POD and EXTERNAL AUDIO SYSTEM ADAPTER INSTALLATION & OPERATION MANUAL Please check HUDDLECAMHD.com for the most up to date version of this document Product Overview.

More information

Modeling of Pinna Related Transfer Functions (PRTF) Using the Finite Element Method (FEM)

Modeling of Pinna Related Transfer Functions (PRTF) Using the Finite Element Method (FEM) Modeling of Pinna Related Transfer Functions (PRTF) Using the Finite Element Method (FEM) Manan Joshi *1, Navarun Gupta 1, and Lawrence V. Hmurcik 1 1 University of Bridgeport, Bridgeport, CT *Corresponding

More information

VoIP Forgery Detection

VoIP Forgery Detection VoIP Forgery Detection Satish Tummala, Yanxin Liu and Qingzhong Liu Department of Computer Science Sam Houston State University Huntsville, TX, USA Emails: sct137@shsu.edu; yanxin@shsu.edu; liu@shsu.edu

More information

White Paper Voice Quality Sound design is an art form at Snom and is at the core of our development utilising some of the world's most advance voice

White Paper Voice Quality Sound design is an art form at Snom and is at the core of our development utilising some of the world's most advance voice White Paper Voice Quality Sound design is an art form at and is at the core of our development utilising some of the world's most advance voice quality engineering tools White Paper - Audio Quality Table

More information

Corporate R&D: Excellence in Wireless Innovation. September

Corporate R&D: Excellence in Wireless Innovation. September Corporate R&D: Excellence in Wireless Innovation September 2011 www.qualcomm.com/research State of the Art Capabilities Fostering Innovation Human Resources Complete Development Labs 30% of engineers with

More information

Product Information NANO

Product Information NANO Product Information NANO Nano is a hearing aid, suitable for users with a wide range of hearing losses. Nano features the Audina proprietary Audio Efficiency technology that incorporates first-class features

More information

MPEG-4 General Audio Coding

MPEG-4 General Audio Coding MPEG-4 General Audio Coding Jürgen Herre Fraunhofer Institute for Integrated Circuits (IIS) Dr. Jürgen Herre, hrr@iis.fhg.de 1 General Audio Coding Solid state players, Internet audio, terrestrial and

More information

EASE Seminar Entry Level & Advanced Level

EASE Seminar Entry Level & Advanced Level EASE Seminar Entry Level & Advanced Level This is a general overview of our regular EASE Trainings. Please be aware that this document contains information on both levels we offer. Make sure which one

More information

PJP-EC200 Setup Procedure Yamaha Corporation

PJP-EC200 Setup Procedure Yamaha Corporation Yamaha Corporation Contents Components... 3... 4 Installations and Connections... 4 Setup... 4 Adjustment... 7... 9 Installations and Connections... 9 Setup... 9 Adjustment... 12 2 Components This document

More information

Polycom RealPresence Trio

Polycom RealPresence Trio FREQUENTLY ASKED QUESTIONS Polycom RealPresence Trio The Polycom RealPresence Trio 8800 is the first smart hub for group collaboration that transforms the iconic three-point conference phone into a voice,

More information

Avonic AV-MIC44. USB 2.0 Video Conferencing Table Speakerphone

Avonic AV-MIC44. USB 2.0 Video Conferencing Table Speakerphone Avonic AV-MIC44 USB 2.0 Video Conferencing Table Speakerphone User Manual Version 1.0 Update notes: Join Avonic linkedin.com/company/avonic twitter.com/avonic1 facebook.com/avonic www.avonic.eu 1 Contents

More information

ETSI TS V (201

ETSI TS V (201 TS 126 443 V12.7.0 (201 16-10) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); ANSI C code (floating-point) (3GPP TS 26.443 version

More information

Chapter 5.5 Audio Programming

Chapter 5.5 Audio Programming Chapter 5.5 Audio Programming Audio Programming Audio in games is more important than ever before 2 Programming Basic Audio Most gaming hardware has similar capabilities (on similar platforms) Mostly programming

More information

Appendix 4. Audio coding algorithms

Appendix 4. Audio coding algorithms Appendix 4. Audio coding algorithms 1 Introduction The main application of audio compression systems is to obtain compact digital representations of high-quality (CD-quality) wideband audio signals. Typically

More information

ETSI TS V ( )

ETSI TS V ( ) TS 126 446 V12.0.0 (2014-10) TECHNICAL SPECIFICATION Universal Mobile Telecommunications System (UMTS); LTE; EVS Codec AMR-WB Backward Compatible Functions (3GPP TS 26.446 version 12.0.0 Release 12) 1

More information

2014, IJARCSSE All Rights Reserved Page 461

2014, IJARCSSE All Rights Reserved Page 461 Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Real Time Speech

More information

External Loudspeaker Guidelines and Recommendations for Smart Speaker Applications

External Loudspeaker Guidelines and Recommendations for Smart Speaker Applications Contents Overview... 1 Introduction... 2 Minimum Recommended Requirements... 3 Examples of Evaluated Speakers... 5 Additional Considerations... 7 External Loudspeaker Guidelines and Recommendations for

More information

Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM)

Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM) Modeling of Pinna Related Transfer Functions (PRTF) using the Finite Element Method (FEM) Manan Joshi Navarun Gupta, Ph. D. Lawrence Hmurcik, Ph. D. University of Bridgeport, Bridgeport, CT Objective Measure

More information

GSM Network and Services

GSM Network and Services GSM Network and Services Voice coding 1 From voice to radio waves voice/source coding channel coding block coding convolutional coding interleaving encryption burst building modulation diff encoding symbol

More information

ETSI TS V (201

ETSI TS V (201 TS 126 179 V13.0.0 (201 16-05) TECHNICAL SPECIFICATION LTE; Mission Critical Push To Talk (MCPTT); Codecs and media handling (3GPP TS 26.179 version 13.0.0 Release 13) 1 TS 126 179 V13.0.0 (2016-05) Reference

More information

AFMG. EASE Seminar September 17 th to 21 st 2018, Berlin, Germany. Agenda. Software-Engineering Research Development

AFMG. EASE Seminar September 17 th to 21 st 2018, Berlin, Germany. Agenda. Software-Engineering Research Development EASE Seminar September 17 th to 21 st 2018, Berlin, Instructors: Emad Yacoub Hanna Language: English Hours: 09:00-17:00 (please be there at 08:45) EASE Seminars are split into two levels with Level 1 (entry

More information

ENSEA conference Small acoustics. Jeremie Huscenot January 8, 2000

ENSEA conference Small acoustics. Jeremie Huscenot January 8, 2000 ENSEA conference Small acoustics Jeremie Huscenot January 8, 2000 Introduction Listening to loudspeakers The difference which comes when comparing how headphones and loudspeaker produce sound is With loudspeaker,

More information

An Introduction to Pattern Recognition

An Introduction to Pattern Recognition An Introduction to Pattern Recognition Speaker : Wei lun Chao Advisor : Prof. Jian-jiun Ding DISP Lab Graduate Institute of Communication Engineering 1 Abstract Not a new research field Wide range included

More information

5G the next major wireless standard

5G the next major wireless standard 5G the next major wireless standard Klaus Doppler Director, Radio Communications Nokia Technologies, LABS DREAMS Seminar, Jan. 13, 2015 1 Nokia 2015 International activities on 5G Strong academic & government

More information

Introduction to HRTFs

Introduction to HRTFs Introduction to HRTFs http://www.umiacs.umd.edu/users/ramani ramani@umiacs.umd.edu How do we perceive sound location? Initial idea: Measure attributes of received sound at the two ears Compare sound received

More information

Application of Linux Audio in Hearing Aid Research

Application of Linux Audio in Hearing Aid Research Application of Linux Audio in Hearing Aid Research Giso Grimm 1 Tobias Herzke 2 Volker Hohmann 2 1 Universität Oldenburg, Oldenburg, Germany 2 HörTech ggmbh, Oldenburg, Germany Linux Audio Conference,

More information