Voice Analysis for Mobile Networks

Similar documents
Building Residential VoIP Gateways: A Tutorial Part Three: Voice Quality Assurance For VoIP Networks

MultiDSLA. Measuring Network Performance. Malden Electronics Ltd

Does your Voice Quality Monitoring Measure Up?

Managing the Performance of Enterprise and Residential VoIP Services. Alan Clark Telchemy

End-to-end speech and audio quality evaluation of networks using AQuA - competitive alternative for PESQ (P.862) Endre Domiczi Sevana Oy

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

VoIP Performance Management. Anthony Caiozzo

Setting up a National Transmission Plan & QoS for International Connections. Senior Engineer, TIS ETSI STQ Vice Chair Joachim POMY

Introduction to Quality of Service

Application Notes. Introduction. Performance Management & Cable Telephony. Contents

Investigation of Algorithms for VoIP Signaling

ETSI TS V2.1.3 ( )

Evaluation of VoIP Speech Quality Using Neural Network

A New Approach for Testing Voice Quality. sqlear Q&A

PfR Voice Traffic Optimization Using Active Probes

ETSI ES V1.1.3 ( ) ETSI Standard

Performance Management: Key to IP Telephony Success

Mobile Multimedia Application over DOrA How to Ensure Positive End-user Experience?

Interconnected Systems Modern Telecommunication Systems and their Impact on QoS and Q oe

White Paper Voice Quality Sound design is an art form at Snom and is at the core of our development utilising some of the world's most advance voice

Artisan Technology Group is your source for quality new and certified-used/pre-owned equipment

Quality of Service and Quality of T-Labs Berlin

A Real-Time Network Simulation Application for Multimedia over IP

ETSI ES V1.2.1 ( )

3G Services Present New Challenges For Network Performance Evaluation

Synopsis of Basic VoIP Concepts

ITU-T G Reference guide to quality of experience assessment methodologies

INTERNATIONAL INTERCONNECTION FORUM FOR SERVICES OVER IP. (i3 FORUM) Interoperability Test Plan for International Voice services

Voice Quality Assessment for Mobile to SIP Call over Live 3G Network

RECOMMENDATION ITU-R BT.1720 *

2 Framework of The Proposed Voice Quality Assessment System

IxChariot Roaming Test Plan

Quantifying Skype User Satisfaction

A Practical Guide to Troubleshooting poor speech quality End-to-End & OEM

On improving the quality of VoIP connections

Troubleshooting Packet Loss. Steven van Houttum

Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications

ABSTRACT. that it avoids the tolls charged by ordinary telephone service

Final draft ETSI EG V1.1.1 ( )

Actively Managing Multimedia Telchemy Actively Managing Multimedia

Ai-Chun Pang, Office Number: 417. Homework x 3 30% One mid-term exam (5/14) 40% One term project (proposal: 5/7) 30%

Call me back on Skype

Performance analysis of voip over wimax

Impact of Voice Coding in Performance of VoIP

Designing Software for Mobile VoIP and Video. Fred Wydler VP VoIP Products SPIRIT DSP

Synthesizing Adaptive Protocols by Selective Enumeration (SYNAPSE)

Assessing Call Quality of VoIP and Data Traffic over Wireless LAN

4 rd class Department of Network College of IT- University of Babylon

Networking Applications

Real-time Audio Quality Evaluation for Adaptive Multimedia Protocols

VoLTE Performance Analysis and Evaluation in Real Networks

Spirent Avalanche. Video Quality Analyzer (VQA) & Voice Quality Analysis System. High Performance IPTV Emulation. Applications

Overcoming Barriers to High-Quality Voice over IP Deployments

PTC 208. Specification of the PTC requirements for Headsets for use in Telephony applications. Access Standards. Spark Limited. Wellington NEW ZEALAND

QOS MEASUREMENT AND EVALUATION IN PRIVATE NETWORK OF SPP PRIOR TO VOIP IMPLEMENTATION

Alcatel OmniPCX Enterprise

ESG Lab Review RingCentral Mobile Voice Quality Assurance

Study on Appropriate Voice Data Length of IP Packets for VoIP Network Adjustment

Per-segment based Full Passive Measurement of QoS for the FMC Environment

Presents 2006 IMTC Forum ITU-T T Workshop

Introduction on ETSI TC STQ Work

AT&T Collaborate TM. Network Assessment Tool

Enabling Quality Architectures in the Infrastructure. John Warner Strategic Product Marketing Texas Instruments

A passion for performance. The E500 - Delivering Network Performance and Capacity

Eight Essentials to Implementing Backhaul over Satellite for Mobile Operators. June 2009

AT&T Collaborate TM. Network Assessment Tool

TEMS PRODUCTS THE ONLY BENCHMARKING SOLUTION YOU LL EVER NEED

Perceptual Quality Measurement and Control: Definition, Application and Performance

Key Performance Metrics

Why Next-Generation Voice Services Need Next-Generation KPIs

Effective Network Quality Control Mechanism for QoS/QoE Assurance

Experimental Evaluation of Jitter Buffer Algorithms on Voice over IP Networks

Audio and Video Channel Impact on Perceived Audio-visual Quality in Different Interactive Contexts

Speech-Coding Techniques. Chapter 3

Audiovisual QoS for communication over IP networks

Voice, Video and Data Convergence:

ETSI TR V3.1.2 ( )

Voice over IP (VoIP)

Theoretical and Practical Aspects of Triple Play

ETSF10 Internet Protocols Transport Layer Protocols

Application Note PEAQ Audio Objective Testing in ClearView

IP-to-IP Gateway Test Suite

SIP Flex Test Suite. Highlights. IMS and VoIP Network Element and Service Testing

Report. Voipfuture Qrystal. VoLTE & VoWifi Demo-Report

Acoustic Verification System (AVS) Software User Manual. Spirent Communications 5280 Corporate Drive, Frederick, MD Document Version: 3.

Agilent Technologies IP Telephony Reporter J5422A

A Real-world Demonstration of NetSocket Cloud Experience Manager for Microsoft Lync

Lab Testing Summary Report

Measurement QoS Parameters of VoIP Codecs as a Function of the Network Traffic Level

Hybrid Coax-Wireless Multimedia Home Networks Using Technology. Noam Geri, Strategic Marketing Manager

QoS Targets for IP Networks & Services: Challenges and Opportunities

Single sided Real time PESQ Score Estimation

An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google Talk, and MSN Messenger

Application Notes for Tone Software Streamline Network Assessment Service with Avaya Communication Manager Issue 1.1

Modeling, Characterizing, and Enhancing User Experience in Cloud Mobile Rendering

PTC 208. Specification of the PTC requirements for Headsets for use in Telephony applications. Draft for public comment.

Quality Assurance for Broadcast Audio Contribution over IP

Enterprise Voice over IP Management. Alan Clark Telchemy. Telchemy. Actively Managing Multimedia. Actively Managing Multimedia

TR V1.2.5 ( )

INTERNATIONAL TELECOMMUNICATION UNION

Transcription:

White Paper VIAVI Solutions Voice Analysis for Mobile Networks Audio Quality Scoring Principals for Voice Quality of experience analysis for voice... 3 Correlating MOS ratings to network quality of service... 5 Detailed comparison of quality assessment method... 6 Subjective method...6 Comparison based methods... 7 Signal based methods... 7 Parameter based methods... 7 TeraVM quality assessment implementation for mobile... 9 Recommendations for meaningful VoLTE capacity testing... 10 Conclusion... 11

Quality of experience analysis for voice Defining user quality of experience for voice calls over network infrastructure has been around for many decades, from its simplest origin in which people would sit in a room listening to the calls and scoring the quality of the network subjectively to today s compute intensive assessment algorithms. The key commonality over the years with all of the assessment techniques is the delivery of a quality metric or score commonly known as a Mean Opinion Score (MOS). Generally, a MOS rating of between 4 and 5 represents a good quality of experience for that voice call. quality assessment subjective methods objective methods intrusive methods non-intrusive methods comparison-based methods single-based methods parameter-based methods Figure 1: Voice quality assessment techniques With the move from circuit to packet switched networking a correlation is needed between the voice quality scoring mechanism and the influence that the network packet transport has on voice quality performance. Quantitative measurements for network performance are collectively called Quality of Service (QoS) metrics, QoS enables user s express network performance objectively. As a result the subjective method is replaced with an objective method. Figure 1 above outlines the techniques used to assess voice quality, with the focus of this paper on voice quality analysis for mobility, the aim is to select a suitable measurement technique from the objective methods. Objective assessment can be categorized as Intrusive or Non-Intrusive. Intrusive methods are based on comparing a transmitted voice sample with the received voice sample (comparison based methods). The most widely recognized of these techniques is called Perceptual Evaluation of Speech Quality (PESQ) or its successor which has been adapted for mobility Perceptual Objective Listening Quality Assessment (POLQA). The comparison methodology means analysis is done offline or when the call has been completed. As the technique involves comparing file samples it is compute intensive. This makes intrusive assessment of voice somewhat limited in terms of the number of calls that can be assessed. The alternative is to use Non-Intrusive techniques to determine voice quality, the methods are based on inspecting the audio signal in isolation, either by analyzing the audio signal (signal based methods) or by analyzing the IP network packets transport layer parameters (parameter based methods). The parameter based assessment methods enables live assessment of voice quality and indeed is far more scalable in terms of the number of calls that can be assessed for voice quality. 3 Voice Analysis for Mobile Networks

A key take-away about all of the objective assessment methods is that they are based on computer algorithms that produce the MOS score. During their development and validation they are all subjected to a calibration process during which the algorithm-generated MOS scores are compared to subjectively produced MOS scores (real people listening and scoring the same audio samples) to align the results. Under ideal conditions, this results in a POLQA MOS score of between 4 and 5 correlating to a MOS sampling score of 4 and 5 for non-intrusive voice quality assessment. Ultimately, the decision to choose one technique over the other comes down to what it is that s under test and cost. For example, if a user is looking to determine how a new mobile phone behaves on a network they most likely use POLQA, or if its assessment of networking elements then a user would opt for the lower cost option and use non-intrusive analysis. Correlating MOS ratings to network quality of service The most relevant method of measuring voice quality in order to evaluate the experience that the subscriber is having is to use a listening-only like MOS score. Highlighted below are the different cases used to distinguish voice quality depending on the assessment method which is in use. MOS-LQS Subjective tests carried out according to ITU-T Recommendations P.830, P.835 and P.840 give results in terms of MOS-LQS. MOS-LQO MOS-LQO (electrical) MOS-LQO (acoustic) MOS-LQO (acoustic) The score is calculated by means of an objective model which aims at predicting the quality for a listening-only test situation. This kind of measurement is performed at electrical interfaces of the terminal only. In order to predict the listening quality as perceived by the user, assumptions for the terminals are made in terms of IRS or corrected IRS frequency response; this implicitly includes the assumption of a sealed condition between the handset receiver and the user s ear. ITU-T Recommendation P.862 falls into this category. This kind of measurement is performed at acoustical interfaces of the terminal only. In order to predict the listening quality as perceived by the user, this measurement includes the actual telephone set products provided by the manufacturer or vendor. In combination with the choice of the acoustical receiver in the lab test ( artificial ear ), there will be a more or less leaky condition between the handset s receiver and the artificial ear. Consequently, for more realistic test scenarios, there may be a degradation of the measured MOS value, while for more artificial test scenarios there may be a negligible difference. ITU-T Recommendation P.863 partially falls into this category. This kind of measurement is performed at transport interfaces. The model is based on a scalar transmission rating value, R, which is then converted to MOS-LQO value. ITU-T Recommendation G.107 falls into this category. Detailed comparison of quality assessment methods As highlighted in the previous chapter there is a number of assessment techniques used to derive voice quality which are dependent on the location of the listening point. The following figure outlines an architecture in which each of the unique measurement techniques is used to derive a reliable and repeatable voice quality score. This figure is also useful to outline the associated benefits and cost associated with each MOS score technique. 3 Voice Analysis for Mobile Networks

parameters speech in system speech out auditory test library equipment impairment factor subjective test objective measure appendix I to G.113 G.107 P.834 R P.833 MOS MOS-LQS MOS-LQO E-model rating Figure 2: Voice quality analysis algorithm architecture Subjective method The limitation of subjectively based methods is that they are time-consuming, and so evaluating various different scenarios especially at the scale required for evaluating wireless telephony is prohibitively expensive. These include the ITU-T Recommendations P.800, P.830, P.831, P.832, P.835 and P.840. Comparison based methods Comparison based methods give the best accuracy, as they analyze the difference between the audio signals transmitted from one caller and then received by the other caller. They require that the transmitted signal is known, so they are the most computing-intensive method. This makes them expensive for assessing the quality of thousands of simultaneous calls. Beside the effect of the impairments of the transmission network, comparison based methods also capture the effects of transcoding, echo cancelation and all other type of audio signal impairments. Audio signal impairments are not relevant when testing and evaluating the effect of the transport layer network. For Example ITU-T Recommendation P.863 (POLQA) Signal based methods Signal based methods give good accuracy, but are better suited to laboratory-testing scenarios as there is an inherent need to extract the actual audio stream in some form and then subject this to the analysis algorithm. With these methods, it is also possible to scale the covered number of scenarios significantly; but they require specialized solutions and are computing intensive. Another benefit is that these methods do not require comparison samples as do comparison based methods, so they scale better for the quality assessment of thousands of simultaneous calls. For Example ITU-T Recommendations P.862.1, P.862.2 (PESQ) and P.863 (POLQA) 4 Voice Analysis for Mobile Networks

Parameter based methods Parameter based methods give good accuracy as well as scaling efficiently for assessing the quality of thousands of simultaneous calls. They do not require the comparison sample and do not need the full extraction of the audio stream, as they are based on analysis of the transport-layer parameters. For example ITU-T Recommendation G.107 and G.107.1 (ITU E-Model) TeraVM s MOS scoring is based on the algorithms outlined here. Unlike POLQA, the TeraVM voice MOS score is lightweight, meaning it does not require any other dedicated hardware for analysis purposes. As already indicated under ideal conditions, when there are no impairments, both MOS scores produce toll quality values between 4-5! TeraVM quality assessment implementation for mobile In testing enb capacity, the voice-quality impairments are derived purely from the delay, jitter, out-of-sequence and lost-packet phenomena. The TeraVM audio- and video-quality assessment algorithm therefore expands the basic parameter based method by taking into consideration not only the transport-layer metrics, but also the metrics of the higher-layer audio transport protocol, and enhances these results by modeling the behavior of the UE (user equipment) by buffering the audio/video stream before starting the analysis. While the analysis is being conducted the UE model also takes into account the Jitter buffer that all UEs use to mitigate the effects of the wireless environment and the delays and jitters inherent in it. The Markov Model is designed to measure the distribution of lost and discarded packets, it is designed to detect two primary states within calls: the burst state, in which the rate of packet loss and discard is high enough to cause noticeable degradation in quality, and the gap state. UDP IP RTP jitter buffer Markov model passive analysis Figure 3: TeraVM MOS analysis algorithm architecture Recommendations for meaningful VoLTE capacity testing For evaluating the enb capacity performance over the RF or CPRI (common public radio interface) interface, the recommendation is to always establish a baseline MOS score against which the performance is compared. This score can then be used to evaluate the enb performance when: y The number of simultaneous telephony calls is increased y The calls are subjected to varying fading and mobility environments y Different air-interface technologies are used to alleviate the effect of the above. The MOS evaluation which is run with codecs operating at different audio bandwidths e.g. an ITU-T G.711 voice codec, condition will often yield a score above 4.0 in a narrow-band (300-3700 Hz) test; whereas it is more likely to yield a score in the range of 3.5-3.7 in a wideband (50-7000 Hz) test, due to the presence of the higher quality wideband samples. 5 Voice Analysis for Mobile Networks

These will quantify the performance of the enb when evaluated using the MOS score and the transport layer parameters, which give an insight into the reasons for performance deviations on the air interface. Conclusion Understanding the performance of mobile networks can easily be assessed using voice quality measurement techniques. The key is to select the correct voice quality measurement technique as a wrong decision can be costly. The first step is to reflect on what it is that needs to be assessed i.e. is it the network transport layer or higher up at a device level? The two most common technologies for voice quality assessment are non-intrusive MOS scoring (as delivered in TeraVM) and POLQA, both of which can be used to characterize a voice transmission system. The major differences between the two is price, scalability and complexity of the solutions. A non-intrusive MOS score technique uses network packet parameters, making it ideal for network transport layer assessment. Plus non-intrusive MOS is less costly, scales well and has least complexity to implement. On the opposite side is POLQA, which compares the received audio signal with the expected signal making it ideal for device level assessment. POLQA MOS scoring is highly accurate as it takes into account transmission network impairments, the effects of transcoding, echo cancelation, and any other type of audio signal alteration. As the POLQA algorithm is computationally intensive, it is not practical to use it for testing the speech quality with more than a handful of handsets. Further comparison of the two voice quality measurement techniques reveals that under ideal conditions i.e. no impairments presence, that both techniques will deliver a MOS score between 4 and 5. Therefore, when it comes characterizing the performance of the mobile backhaul network, non-intrusive MOS scoring as delivered in TeraVM offers greatest savings, with maximum number of voice call quality measurement points with the least complexity. To reach the VIAVI office nearest you, visit viavisolutions.com/contact 2018 VIAVI Solutions Inc. Product specifications and descriptions in this document are subject to change without notice. voiceanalysis-wp-wir-nse-ae 30187527 900 1018 viavisolutions.com/wirelessvalidation