System Identification Related Problems at SMN

Ericsson Research: Services, Multimedia and Network Features
System Identification Related Problems at SMN
Erlendur Karlsson
SysId Related Problems @ ER/SMN, Ericsson External, 2016-05-09

Outline
- Research Topics @ Ericsson Research
- System Identification related applications at SMN
- Important issues when dealing with real-world problems
- Trends and opportunities

Research Topics @ Ericsson Research
Ericsson Research Blog: http://www.ericsson.com/research-blog
- 5G
- Cloud
- Context Aware Communication
- Data and Knowledge
- Garage
- Internet of Things
- LTE
- Media Coding
- SDN
- Security
- Service Systems
- Smart Cities

Cloud Robotics for 5G Enabled Manufacturing
- Traditional robots, programmed to carry out specific functions, are replaced by new robots connected to the cloud
- These new robots only include low-level controls, sensors and actuators
- Their intelligence is moved to the cloud, where they have access to almost unlimited computing power

Telehaptic Control
- Drone inspection of wind turbines
- Friction and vibration feedback in the haptic interface is used to signal unsafe movement of the drone around the wind turbine
- An application example for 5G technology

System Identification Related Applications at MMT
- Audio and Speech Coding
- Audio Mining (ASR)
- Audio Media Processing
- Acoustic Echo Cancellation
- Noise Suppression
- Voice Activity Detection
- Spatial Audio Capture
- Spatial Audio Rendering
- Video Coding (2D and 3D)
- Objective Quality Estimation of Encoded Audio and Video
- Congestion Control in IP Networks

Audio and Speech Coding
- Clean speech signals can be modeled very efficiently with Code-Excited Linear Prediction (CELP) encoders (based on an ARX model of the speech signal)
- Music signals are better encoded with transform encoding methods (subband filter banks, MDCT)
- Signal classification and hybrid encoding are used to obtain efficient encoding of audio signals of varying content
- EVS (Enhanced Voice Services) was recently standardized in 3GPP
- Special EVS session at ICASSP 2015 in Australia
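The linear prediction step that CELP coders build on can be illustrated with a small sketch. It estimates short-term predictor coefficients from the autocorrelation of a signal using the Levinson-Durbin recursion. This is a textbook illustration, not the EVS or any Ericsson implementation; the function names, the synthetic AR(2) test signal and its coefficients are assumptions made for the example.

```python
import random


def autocorr(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of a sequence."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the autocorrelation normal equations recursively.

    Convention: x[n] is predicted as sum_k a[k] * x[n - k], k = 1..order.
    Returns (a, err) where a holds a[1..order] and err is the final
    prediction error energy.
    """
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for stage i
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


# Hypothetical test signal: a stable AR(2) process driven by white noise,
# standing in for one frame of speech.
random.seed(0)
true_a = [1.3, -0.6]
x = [0.0, 0.0]
for _ in range(20000):
    x.append(true_a[0] * x[-1] + true_a[1] * x[-2] + random.gauss(0.0, 1.0))

r = autocorr(x, 2)
coeffs, residual_energy = levinson_durbin(r, 2)
print("estimated predictor coefficients:", coeffs)
```

In a CELP coder the prediction residual would then be approximated by a codebook excitation; that stage is left out of this sketch.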

CELP Speech Model
[Figure: CELP speech model]

EVS Speech/Audio Codec prototype HL structure
[Figure labels: input; VAD; Mode Dec.; TD (+TD-BWE); FD; CNG; Bitstream]
Modes and technology:
- TD: improved AMR-WB technology; linear prediction + ACELP, FCB variable sf.
- TD-BWE: parametric high band; linear prediction, energy/gain
- FD: G.719-like; transform (LD-MDCT), block switching
[Figure: coding modes (TD AMR-WB-like, TD-BWE parametric, FD G.719-like FD-coding) versus audio bandwidth in kHz, axis ticks 4, ~6, 8, 16, 20, for the WB, SWB and FB bandwidths]

Acoustic Echo Cancellation
- Long echo impulse responses: 300-500 ms
- At 48 kHz sampling: 14,400 to 24,000 samples
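To make the numbers above concrete, the sketch below first converts an echo duration into a filter length and then runs a normalized LMS (NLMS) adaptive filter, one common way of identifying an echo path. The slides do not say which algorithm is used, so NLMS is an assumption here, as are the function names, the step size and the toy 4-tap echo path (a real canceller at 48 kHz would need the thousands of taps computed by `taps_needed`).

```python
import random


def taps_needed(fs_hz, echo_ms):
    """Number of FIR taps needed to span an echo of echo_ms milliseconds."""
    return fs_hz * echo_ms // 1000


def nlms_echo_canceller(far_end, mic, num_taps, mu=0.5, eps=1e-8):
    """Identify the echo path with NLMS and return (weights, residual)."""
    w = [0.0] * num_taps
    buf = [0.0] * num_taps  # most recent far-end samples, newest first
    residual = []
    for x, d in zip(far_end, mic):
        buf = [x] + buf[:-1]
        echo_estimate = sum(wi * bi for wi, bi in zip(w, buf))
        e = d - echo_estimate  # echo-cancelled output sample
        norm = sum(b * b for b in buf) + eps
        gain = mu * e / norm
        w = [wi + gain * bi for wi, bi in zip(w, buf)]
        residual.append(e)
    return w, residual


print(taps_needed(48000, 300), taps_needed(48000, 500))

# Toy example: hypothetical 4-tap echo path, white far-end signal,
# no near-end speech and no noise.
random.seed(1)
echo_path = [0.5, -0.3, 0.2, 0.1]
far = [random.gauss(0.0, 1.0) for _ in range(3000)]
mic = [
    sum(echo_path[k] * far[n - k] for k in range(len(echo_path)) if n - k >= 0)
    for n in range(len(far))
]
w, residual = nlms_echo_canceller(far, mic, num_taps=len(echo_path))
print("estimated echo path:", w)
```

The per-sample cost of this direct form grows linearly with the number of taps, which is one reason long echo paths are often handled with block or frequency-domain variants in practice.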

Ericsson In-Game Communication
Entropia Universe / MindArk

EIGC System Concept
[Figure: EIGC system concept]

Spatial Audio Capture
- Microphone arrays
- Filter design in the spatial and frequency domains
- Beamforming techniques
- Adaptive tracking of the most active speakers in a room
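As a minimal illustration of the beamforming item, the following sketch implements a delay-and-sum beamformer for a uniform linear array. It is the simplest member of the family and not necessarily what is used at Ericsson. The array geometry (4 microphones, 5 cm spacing), the 30 degree arrival angle, the integer-sample delays and all names are assumptions chosen for the example.

```python
import math
import random


def steering_delays(num_mics, spacing_m, angle_deg, fs_hz, c=343.0):
    """Integer sample delays of a far-field plane wave at each microphone.

    Assumes a uniform linear array and an arrival angle between 0 and 90
    degrees from broadside, so all delays are non-negative.
    """
    tau = spacing_m * math.sin(math.radians(angle_deg)) / c
    return [round(m * tau * fs_hz) for m in range(num_mics)]


def delay_and_sum(mic_signals, delays):
    """Advance each channel by its delay and average the aligned channels."""
    n = min(len(sig) - d for sig, d in zip(mic_signals, delays))
    return [
        sum(sig[i + d] for sig, d in zip(mic_signals, delays)) / len(mic_signals)
        for i in range(n)
    ]


# Simulated capture: each microphone sees the delayed source plus
# independent sensor noise.
random.seed(2)
fs = 48000
delays = steering_delays(4, 0.05, 30.0, fs)
source = [math.sin(2 * math.pi * 440 * n / fs) for n in range(2000)]
mics = [[0.0] * d + [s + random.gauss(0.0, 0.5) for s in source] for d in delays]
out = delay_and_sum(mics, delays)
print("steering delays in samples:", delays)
```

Averaging M aligned channels leaves the source untouched while reducing uncorrelated noise power by roughly a factor of M; fractional delays and frequency-dependent weights would be needed for finer steering.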

Spatial Audio Rendering
- Spatial hearing
- 3D binaural rendering through Head Related Filtering (HRF)
- Very useful in 3D gaming and evolved communication solutions
- Spatial audio rendering onto any loudspeaker configuration

Spatial Hearing
[Figure: spatial hearing]

Acoustic Wave Reception
[Figure labels: sound wave; listener; the listener's median plane; left Head Related Filter (HRF), contralateral ear; right Head Related Filter (HRF), ipsilateral ear; length L]
Interaural time difference: ITD = L/c, where c is the speed of sound
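The relation ITD = L/c is easy to evaluate numerically. The short sketch below does so in seconds and in samples; the value c = 343 m/s (air at about 20 degrees C), the example length of 0.2 m and the 48 kHz sampling rate are assumptions, not values from the slides.

```python
SPEED_OF_SOUND_M_S = 343.0  # assumed: dry air at about 20 degrees C


def itd_seconds(length_m, c=SPEED_OF_SOUND_M_S):
    """Interaural time difference for an extra path length L, ITD = L / c."""
    return length_m / c


def itd_samples(length_m, fs_hz, c=SPEED_OF_SOUND_M_S):
    """The same ITD expressed in samples at sampling rate fs_hz."""
    return itd_seconds(length_m, c) * fs_hz


# Hypothetical extra path length of 0.2 m to the contralateral ear
L = 0.2
print(f"ITD = {itd_seconds(L) * 1e6:.0f} microseconds, "
      f"{itd_samples(L, 48000):.1f} samples at 48 kHz")
```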

ASR System Main Components
[Figure labels: Training Data; Acoustic Models; Lexical Models; Language Models; Applying Constraints; Speech Signal; Representation; Feature Vector; Search; Recognized Words]
Speech recognition is the problem of deciding on:
- How to represent the signal
- How to model the constraints
- How to search for the optimal answer

ASR System Solution Components
[Figure labels: Acoustic Models; Lexical Models; Language Models; Speech Signal; Representation; Search; Recognized Words]
Techniques shown around the components:
- Acoustic-Phonetic Modeling
- Pattern Recognition
- Finite-State Transducers
- Language Models
- Adaptation
- Speech Signal Representation
- Search Algorithms
- Vector Quantization & Clustering
- Hidden Markov Modeling
- Graphical Models
- Segmental Models
- GMMs
- Neural Networks

Important issues when dealing with real-world problems
- Understand the strengths and weaknesses of the different identification methods
- Preprocessing the data before the optimization can be crucial
- Choose the minimization criterion with care and adapt it to the problem at hand
- Different types of regularization components in the criterion can make the difference between success and failure
- Sometimes a criterion with components in both the time and frequency domains will work when single-domain criteria fail
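One way to picture a regularization component in the criterion is Tikhonov (ridge) regularization of a least-squares FIR identification problem, where the criterion becomes the sum of squared prediction errors plus lam times the squared norm of the coefficients. The sketch below is an illustration under that assumption only; the slides do not name a specific regularizer, and the function names, the true impulse response and the value of lam are made up for the example.

```python
import random


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [a - f * p for a, p in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def ridge_fir(u, y, num_taps, lam):
    """Minimise sum_n (y[n] - sum_k h[k] u[n-k])^2 + lam * sum_k h[k]^2."""
    R = [[0.0] * num_taps for _ in range(num_taps)]
    p = [0.0] * num_taps
    for n in range(num_taps - 1, len(u)):
        phi = [u[n - k] for k in range(num_taps)]  # regressor vector
        for i in range(num_taps):
            p[i] += phi[i] * y[n]
            for j in range(num_taps):
                R[i][j] += phi[i] * phi[j]
    for i in range(num_taps):
        R[i][i] += lam  # the regularization component of the criterion
    return solve(R, p)


# Hypothetical system and noisy measurements
random.seed(3)
true_h = [1.0, 0.5, -0.2, 0.1]
u = [random.gauss(0.0, 1.0) for _ in range(200)]
y = [
    sum(true_h[k] * u[n - k] for k in range(4) if n - k >= 0) + random.gauss(0.0, 0.1)
    for n in range(200)
]
h_ls = ridge_fir(u, y, 4, lam=0.0)      # plain least squares
h_ridge = ridge_fir(u, y, 4, lam=10.0)  # shrinks the estimate toward zero
print("least squares:", h_ls)
print("ridge:        ", h_ridge)
```

With plenty of well-conditioned data the two estimates are close; the regularized form matters most when the data record is short or the input excites the system poorly, at the cost of some bias.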

Important issues when dealing with real-world problems (2)
- Some applications require classification-based modelling, where the model currently in use depends on a classification of one or more signals
- Many systems have to deal with spurious events
  - This requires detecting such events and applying special model updates when they are detected
- Monitoring of the system model
  - Hypothesis testing and estimation

Strong Trends
- Cloudification: more and more services are being moved to the cloud
- Digitalization: the amount of data becoming available in digital form over the internet is rapidly increasing
- Analytics: due to advances in machine learning technology, analytics on large data sets is being explored at a rapidly increasing rate
- Internet of Things (IoT): everything that will benefit from a network connection will have one

Opportunities
- These trends are paving the way for great opportunities for technological development in all application areas
- Where do you see opportunities?

Erlendur Karlsson, email: erlendur.karlsson@ericsson.com