Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications

Similar documents
Application Note PEAQ Audio Objective Testing in ClearView

Proceedings of Meetings on Acoustics

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

Voice Analysis for Mobile Networks

I D I A P R E S E A R C H R E P O R T. October submitted for publication

Perceived Audio Quality of Realistic FM and DAB+ Radio Broadcasting Systems

End-to-end speech and audio quality evaluation of networks using AQuA - competitive alternative for PESQ (P.862) Endre Domiczi Sevana Oy

Quality Aspects in Digital Broadcasting and Webcasting Systems: Bitrate versus Loudness

3G Services Present New Challenges For Network Performance Evaluation

Experimental Evaluation of Jitter Buffer Algorithms on Voice over IP Networks

DRA AUDIO CODING STANDARD

Perceptual Quality Measurement and Control: Definition, Application and Performance

BeaqleJS: HTML5 and JavaScript based Framework for the Subjective Evaluation of Audio Quality

Perceptually motivated Sub-band Decomposition for FDLP Audio Coding

No-reference perceptual quality metric for H.264/AVC encoded video. Maria Paula Queluz

Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony

Performance Analysis of Discrete Wavelet Transform based Audio Watermarking on Indian Classical Songs

Voice Quality Assessment for Mobile to SIP Call over Live 3G Network

Squeeze Play: The State of Ady0 Cmprshn. Scott Selfon Senior Development Lead Xbox Advanced Technology Group Microsoft

Technical Recommendation S. 10/07: Source Encoding of High Definition Mobile TV Services

SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STREAMED OVER THE NETWORK

The following bit rates are recommended for broadcast contribution employing the most commonly used audio coding schemes:

Does your Voice Quality Monitoring Measure Up?

MAXIMIZING AUDIOVISUAL QUALITY AT LOW BITRATES

Parametric Coding of Spatial Audio

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

RECOMMENDATION ITU-R BS Procedure for the performance test of automated query-by-humming systems

RECOMMENDATION ITU-R BT.1720 *

Audio and Video Channel Impact on Perceived Audio-visual Quality in Different Interactive Contexts

Audio Processing on ARM Cortex -M4 for Automotive Applications

Impact of Voice Coding in Performance of VoIP

Principles of Audio Coding

Efficient Signal Adaptive Perceptual Audio Coding

Lecture 16 Perceptual Audio Coding

1 Audio quality determination based on perceptual measurement techniques 1 John G. Beerends

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION

Introduction on ETSI TC STQ Work

Evaluation of Different AAC Codec Realizations for Audio Mobile Device based on ARM Architecture

Performance analysis of voip over wimax

MPEG-4 Structured Audio Systems

CHARTING THE FUTURE OF INNOVATION # ERICSSON TECHNOLOGY. LEVERAGING STANDARDS FOR VIDEO QoE

Simple Watermark for Stereo Audio Signals with Modulated High-Frequency Band Delay

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

Subjective Evaluation of State=of=the-Art 2-Channel Audio Codecs

RC-1590 Stereo Preamplifier PRELIMINARY

Evaluation of VoIP Speech Quality Using Neural Network

The MPEG-4 General Audio Coder

Multimedia Quality in a Conversational Video-conferencing Environment

Chapter 14 MPEG Audio Compression

MULTIMODE TREE CODING OF SPEECH WITH PERCEPTUAL PRE-WEIGHTING AND POST-WEIGHTING

Audio-coding standards

5: Music Compression. Music Coding. Mark Handley

Presents 2006 IMTC Forum ITU-T T Workshop

Investigation of Algorithms for VoIP Signaling

Audio-coding standards

2 Framework of The Proposed Voice Quality Assessment System

QOS MEASUREMENT AND EVALUATION IN PRIVATE NETWORK OF SPP PRIOR TO VOIP IMPLEMENTATION

A Novel Audio Watermarking Algorithm Based On Reduced Singular Value Decomposition

Interconnected Systems Modern Telecommunication Systems and their Impact on QoS and Q oe

FINE-GRAIN SCALABLE AUDIO CODING BASED ON ENVELOPE RESTORATION AND THE SPIHT ALGORITHM

NCTU-VT: a freeware for wireless VoIP performance measurement

SAOC and USAC. Spatial Audio Object Coding / Unified Speech and Audio Coding. Lecture Audio Coding WS 2013/14. Dr.-Ing.

Audio Watermarking Based on PCM Technique

Impact of Background Traffic on Speech Quality in VoWLAN

Predicting the Quality Level of a VoIP Communication through Intelligent Learning Techniques

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

ETSI TR V ( )

Perceptual Pre-weighting and Post-inverse weighting for Speech Coding

Single sided Real time PESQ Score Estimation

A Detailed look of Audio Steganography Techniques using LSB and Genetic Algorithm Approach

MultiDSLA. Measuring Network Performance. Malden Electronics Ltd

PCM30U-ROK 2 048/256 kbps Broadcast Codec Brief Overview

QUALITY is the main factor driving research in video and

Theoretical and Practical Aspects of Triple Play

Streamer Max MKII. Bidirectional Encoder/Decoder for streaming audio over IP MAIN FEATURES

Mobile Voice and Data Experience in Estonia

The Steganography In Inactive Frames Of Voip

QUALITY is the main factor driving research in video

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Quality of Service and Quality of T-Labs Berlin

ISO/IEC INTERNATIONAL STANDARD. Information technology MPEG audio technologies Part 3: Unified speech and audio coding

Robustness and independence of voice timbre features under live performance acoustic degradations

Modeling of an MPEG Audio Layer-3 Encoder in Ptolemy

Research Article Quality of Experience Assessment of Video Quality in Social Clouds

MPEG-4 aacplus - Audio coding for today s digital media world

Digital Image Processing

Lossless Compression of Speech and Audio Signals, and Its Application ( 音声 音響信号の可逆符号化とその応用に関する研究 )

Research Article Impairment-Factor-Based Audiovisual Quality Model for IPTV: Influence of Video Resolution, Degradation Type, and Content Type

Speech-Coding Techniques. Chapter 3

Speech and audio coding

Call me back on Skype

MAXIMIZING BANDWIDTH EFFICIENCY

A MULTI-RATE SPEECH AND CHANNEL CODEC: A GSM AMR HALF-RATE CANDIDATE

INTERNATIONAL TELECOMMUNICATION UNION

An Improved Performance of Watermarking In DWT Domain Using SVD

Perception of Utility in Autonomic VoIP Systems

Export Audio Mixdown

Determination of Bit-Rate Adaptation Thresholds for the Opus Codec for VoIP Services

Chapter 28. Multimedia

Transcription:

Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications Peter Počta {pocta@fel.uniza.sk} Department of Telecommunications and Multimedia Faculty of Electrical Engineering University of Žilina, Slovakia John Beerends TNO, The Hague The Netherlands

Outline Aim of this study Subjective test (test description, results) Objective test (models investigated, results) Conclusions 2

Aim of this study 3 Threefold: 1. We would like to know to what extent degradations introduced by the lossy audio codecs deployed in current digital audio broadcasting systems and web-casting applications have an impact on the audio quality perceived by the end user 2. We would like to compare the performance of the popular lossy codecs for typical boundary bit rates deployed in current digital audio broadcasting systems and web-casting applications on the same critical signals from the perceived quality perspective 3. We would like to see whether the PEAQ and POLQA Music models are able to provide valid predictions of perceived audio quality for the given application domain

Subjective test (test description) A subjective test carried out in accordance with ITU-R Rec. BS.1116-1 In all experiments one listener was seated in a small listening room (acoustically treated) with a background noise below 20 db SPL (A) All together, 23 listeners (15 male, 8 female, 16 46 years, mean 27.48 years) with an appropriate expertise, as defined in ITU-R Rec. BS.1116-1, participated in the test All involved subjects pre-screened for detecting small impairments in a training phase of the test as advised by the particular ITU-R recommendation 4

Subjective test (test description) 5 Grades from 21 listeners (14 male, 7 female, 16 46 years, mean 27.33 years) were used for the analysis As 2 subjects (one male and one female) were rejected during a post-screening phase due to an incorrect identification of the hidden reference in the test (a high quality test case) and the associated scores were removed from the data set The subjects were remunerated for their efforts Presentation level: 79 db SPL (A) A sampling rate: 48 khz - commonly used in the current digital audio distribution systems and applications A goal of this test was to evaluate the performance of the codecs under worst-case conditions from a signal perspective

Subjective test (test description) 6 Table 1: Audio test signals used in the subjective test Table 2: Codecs and lowest and highest bit rates used in the subjective test 6 critical audio test signals listed in Table 1 encoded by the codecs and bit rates listed in Table 2 to yield to 72 test items 72 test items divided to 2 test sessions balanced from a size and degradation perspective to avoid subject fatigue as advised in ITU-R Rec. BS.1116-1

Subjective test (results) 7 The codecs operating at the lowest bit rate have a serious impact on the perceived audio quality the codecs are ranked downwardly from a SDG obtained for the lowest investigated bit rate perspective As clearly expected, the performance of each codec improves with a higher bit rate The SDG values reported for the highest bit rate oscillate around -0.19 for all the investigated codecs all the investigated codecs provide very good, almost excellent, quality for the highest investigated bit rate and a quality difference between them is negligible in this case Figure 1: Effect of codecs and bit rates on average SDG values. The vertical bars show 95 % CI computed over 126 SDG values (21 subjective scores per audio signal for 6 audio signals).

Subjective test (results) Similarly as in [Berg et al.], HE-AACv2 codec operating at 128 kbps was unable to attain a perceptually transparent quality represented by the imperceptible section of the scale for all the critical items used in this study The best quality attained for the lowest bit rate is provided by: Ogg Vorbis at 64 kbps representing web-casting applications Note: the bit rate of 32 kbps was used in this study for Opus and AAC- LC codecs dominantly representing web-casting applications HE-AACv2 codec at 24 kbps characterizing broadcasting systems The ANOVA test revealed that subjects were more sensitive to the bit rates than to all the investigated codecs and signals, and highly statistically significant interactions between all the investigated factors were found 8

Objective test (models) 9 Models involved in the objective test: PEAQ intrusive signal-based audio quality predictor standardized in ITU, as ITU-R Rec. BS.1387-1, Basic version Advanced version more accurate but computationally more demanding POLQA Music intrusive signal-based audio quality predictor based on a speech quality prediction model POLQA standardized in ITU, as ITU-T Rec. P.863: the major differences between POLQA and POLQA Music are listed in [Pocta et al.] 2 versions investigated: V1: trained on the same audio material as PAQM V2: re-trained on all available data including the data of the current experiment

Objective test (results) 10 Figure 2: Correlation between the subjective results (SDG values) and PEAQ Basic Version predictions (ODG values) Figure 3: Correlation between the subjective results (SDG values) and PEAQ Advanced Version predictions (ODG values)

Objective test (results) 11 Figure 4: Correlation between the subjective results (SDG values) and POLQA Music predictions (POLQA Music OMOS values) Figure 5: Correlation between the subjective results (SDG values) and POLQA MusicV2 predictions (POLQA MusicV2 OMOS values)

Objective test (results) 12 Figure 6: Correlation between the subjective results (SDG values) and ODG values per codec predicted by POLQA MusicV2 All the differences between reported correlations for all the investigated models are statistically not significant It means that all the investigated models produce statistically equivalent results A ranking provided by POLQA MusicV2 best matches a ranking obtained from the subjective test POLQA MusicV2 is capable of predicting the codec performance very accurately when scores are averaged over a small set of musical fragments except for the low bit rate version of the HE-AACv2 codec

Conclusions 13 the impact of degradations introduced by 6 lossy audio codecs typically deployed in current DAB systems and web-casting applications on the audio quality perceived by the end user investigated from two perspectives: subjective (scores obtained from the test run according to ITU-R Rec.BS.1116-1) objective (predictions provided by PEAQ and POLQA Music models) Study conclusions: the degradations introduced by the lossy audio codecs typically deployed in current digital audio broadcasting systems and web-casting applications operating at the lowest investigated bit rate have a serious impact on the perceived audio quality best quality provided by Ogg Vorbis codec at 64 kbps (web-casting applications) very closely followed by HE-AACv2 codec at 24 kbps (broadcasting systems) all the investigated codecs provide a near transparent audio quality at the highest investigated bit rate a retrained version of POLQA Music allows reasonable accurate predictions of the perceived audio quality of the codecs typically deployed in these systems and applications, especially when scores are averaged over a small set of musical fragments

Thank you for your attention! Questions? 14

References [Soulodre et al.] G.A. Soulodre, T. Grusec, M. Lavoie and L. Thibault, Subjective Evaluation of State-of-the-Art Two-Channel Audio Codecs, Journal of Audio Engineering Society, vol. 46, No.3, pp. 164-177, 1998,ISSN 1549-4950. [Berg et al.] J. Berg, Ch. Bustad, L. Jonsson, L. Mossberg and D. Nyberg, Perceived Audio Quality of Realistic FM and DAB+ Radio Broadcasting Systems, Journal of Audio Engineering Society, vol. 61, No.10, pp. 755-777, 2013, ISSN 1549-4950. [Pocta et al.] P. Pocta, J.G. Beerends, Subjective and Objective Assessment of Perceived Audio Quality of Current Digital Audio Broadcasting Systems and Web-Casting Applications, IEEE Transactions on Broadcasting, vol. 61, No.3, September 2015, pp. 407-415, ISSN 0018-9316. 15