Introducing the MPEG-H Audio Alliance's New Interactive and Immersive TV Audio System


Presentation at the AES 137th Convention, Los Angeles
Robert Bleidt, October 2014
Based on the new MPEG-H Audio open international standard, ISO/IEC 23008-3
© 2014 Fraunhofer IIS

MPEG-H Audio improves the listening experience

The newest audio codec from MPEG expands the features of today's codecs:
- Personalize the sound to what you want to hear
- Make surround sound more realistic
- Tailor reproduction to your listening environment and hardware

And the technical stuff:
- One stream to every device
- Combine multiple streams for hybrid delivery
- Audio I-Frames for easy DASH segmentation
- Unsurpassed coding efficiency

© 2014 Fraunhofer IIS, 10/9/2014, www.iis.fraunhofer.de/tvaudio

MPEG-H Audio improves the listening experience: Personalize the sound to what you want to hear

Personalize the sound to what you want to hear

- In addition to traditional channel-based audio, MPEG-H includes audio objects: sound elements that are transmitted separately to the decoder
- This allows the listener to adjust the elements to his liking

Examples:
- Simple case: adjust the dialogue or commentary level in relation to the other audio elements, such as ambient sound, music, or sound effects
  - Precursor: the 2011 Wimbledon "Grunt Controller" trials with the BBC showed that most listeners preferred either a higher or a lower dialogue level
  - Helpful for listeners with hearing loss, or for those listening in a non-primary language or dialect
- Additional language support becomes a simple matter of an additional 20-40 kb/s object
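As a rough illustration of the simple case above, the sketch below mixes separately delivered elements with a per-element gain chosen by the listener. The function name, the element names, and the sample values are all hypothetical; this is not the MPEG-H renderer, only a minimal model of "adjust the dialogue level relative to the other elements".

```python
from typing import Dict, List


def mix_objects(objects: Dict[str, List[float]],
                gains_db: Dict[str, float]) -> List[float]:
    """Sum equal-length mono elements after applying per-element gains in dB.

    Elements without an entry in gains_db are mixed at 0 dB (unchanged).
    """
    length = len(next(iter(objects.values())))
    mix = [0.0] * length
    for name, samples in objects.items():
        # Convert the listener's dB setting to a linear amplitude factor.
        gain = 10.0 ** (gains_db.get(name, 0.0) / 20.0)
        for i, sample in enumerate(samples):
            mix[i] += gain * sample
    return mix


# Hypothetical elements: a dialogue object and an ambience bed (mono for brevity).
elements = {
    "dialogue": [0.1, 0.2, -0.1],
    "ambience": [0.05, 0.05, 0.05],
}
default_mix = mix_objects(elements, {})                  # broadcaster's default mix
boosted_mix = mix_objects(elements, {"dialogue": 6.0})   # listener raises dialogue by 6 dB
```

Because the gain is applied in the receiver, one transmitted signal can serve both the default mix and the personalized one.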

First practical use of objects: Dialogue Enhancement

Allows viewers to set their own dialogue levels

User benefit:
- Dialogue/announcer can be adjusted up or down
- Helps hearing-impaired viewers understand dialogue
- Helps when listening in a noisy environment

Broadcaster benefit:
- One signal broadcast to all viewers
- Default mix played by existing receivers

Public test by the BBC during the Tennis Grand Slam Championships 2011 in Wimbledon
Now undergoing standardization in DVB

This document is restricted to use within ATSC

Personalize the sound to what you want to hear

More complex examples:
- 2014 field test with a leading U.S. sports network at a winter sports event
  - Pick your announcer and language
  - Want to hear more sound effects? More audience sounds?
- Demo content from a NASCAR race
  - Pick your announcer and language
  - Mix in the driver's radio
  - Hear car sounds/transducers
- Real-time decoding and rendering
- User selects the audio mix from on-screen display sliders or preset buttons

How We Used Objects in Our Field Test

Mix (Nuendo tracks) -> transmitted bitstream:
- 11.1 bus with audience and ambience (7.1 middle layer, 4.0 height layer) -> channel bed, 11.1 3D
- 5.0 bus (L, C, R, UL, UR) with sound effects -> sound effects object, 5.0
- Mono track, network announcer -> network commentary object, 1.0
- Mono track, venue announcer -> venue commentary object, 1.0
- Mono track, Norwegian announcer -> Norwegian commentary object, 1.0

Total bitrate: 448 kb/s

Viewer's decoder: rendering to 7.1 + 4H speakers, controlled from the viewer's remote control

Here we chose to use a 5.0 object since we wanted console automation support for panning. Otherwise we could have used a mono or stereo dynamic object, saving 100 kb/s.

Summary: Personalization

Audio objects allow listeners to:
- Adjust elements in the sound mix to their liking
- Boost dialogue for hearing-impaired or second-language listeners
- Listen to alternate audio content such as:
  - Home vs. away team commentary
  - Driver-to-crew race audio
  - Alternate languages, simultaneous translation, or audio description
- Many other future creative and revenue opportunities

MPEG-H Audio improves the listening experience: Making surround sound more realistic

Making surround sound more realistic

- 3D audio systems add height information (more channels or audio objects) to the sound field to improve realism
- It becomes difficult for the listener to perceive that he is hearing a recording
- Our studies show that adding four height speakers provides a substantial improvement without the 22 speakers of classical 3D systems

[Chart: Overall Sound Quality (scale 0 to 100) for Stereo, 5.1 Surround, 5.1 + 4H, 5.1 + 4H Active Dmx, and 22.2 3D]

Making surround sound more realistic

- MPEG-H currently has defined speaker configurations from 1.0 (mono) to 22.2
- Bitstream support for up to 128 channels and 128 objects
- Our recommendation for 3D sound:
  - 7.1 Blu-ray surround configuration
  - 4 height speakers above the corner speakers, as shown
- Other configurations are possible, for example:
  - 5.1 + 4H
  - Using the front wide 7.1 configuration, or 9.1
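The layout labels used on these slides ("5.1", "7.1 + 4H", "22.2") can be read as main channels, low-frequency channels, and optional height channels. The helper below is a small sketch of that reading; the function name and the accepted label syntax are assumptions made for illustration and are not part of the MPEG-H standard.

```python
import re


def channel_count(layout: str) -> int:
    """Count loudspeaker channels in a label such as '7.1 + 4H' or '22.2'.

    Assumed label syntax: '<main>.<lfe>' optionally followed by '+ <n>H'
    for n height speakers.
    """
    match = re.fullmatch(r"\s*(\d+)\.(\d+)\s*(?:\+\s*(\d+)\s*H)?\s*", layout)
    if match is None:
        raise ValueError(f"unrecognized layout label: {layout!r}")
    main, lfe, height = match.groups()
    return int(main) + int(lfe) + int(height or 0)


# The recommended 7.1 + 4H layout carries as many channels as an 11.1 bed.
recommended = channel_count("7.1 + 4H")
full_3d = channel_count("22.2")
```

Under this reading the recommended layout needs half the loudspeaker channels of 22.2, and both are well within the 128-channel bitstream limit quoted above.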

Making surround sound more realistic

- MPEG-H includes sound field capture technology as an alternative means of capturing and reproducing 3D sound
- Based on Higher-Order Ambisonics techniques
- Transmits predominant sounds and ambience in separate tracks, independent of speaker format
- Allows one-microphone capture of the sound field or ambience
- As with a channel-based system, may be combined with audio objects for interactivity or personalization

Recommended Vertical Viewing Angles Increase with 4K

System                                          1920 x 1080   3840 x 2160
Viewing distance (relative to picture height)   3.2           1.6
Viewing angle, horizontal (degrees)             31            58
Viewing angle, vertical (degrees)               18            35

- 1.6 PH viewing is recommended for 4K (BT.2022), though consumers may initially be able to afford or accept only 2 PH
- Setup note: 2 PH ~= screen diagonal, e.g. 65" away from a 65" TV

[Figure: viewer seated 7.2 feet (2.2 m) from a 110" 2160p display and a 55" 1080p display; angles labeled in the figure: 37°, 19°, 61°, 33°]
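The angles in the table follow from simple geometry. As a sketch, assuming a flat 16:9 screen with the viewer centered on it (the aspect ratio is an assumption, since the slide does not state it), the angle subtended by a screen dimension is twice the arctangent of half that dimension divided by the viewing distance:

```python
import math


def viewing_angles(distance_ph: float, aspect: float = 16 / 9):
    """Return (horizontal, vertical) viewing angles in degrees.

    distance_ph is the viewing distance in picture heights (PH); the
    picture height is 1 PH and the picture width is `aspect` PH.
    """
    horizontal = 2 * math.degrees(math.atan((aspect / 2) / distance_ph))
    vertical = 2 * math.degrees(math.atan(0.5 / distance_ph))
    return horizontal, vertical


hd_h, hd_v = viewing_angles(3.2)    # 1920 x 1080 at 3.2 PH
uhd_h, uhd_v = viewing_angles(1.6)  # 3840 x 2160 at 1.6 PH

# Diagonal of a 16:9 screen in picture heights, for the "2 PH ~= diagonal" note.
diagonal_ph = math.hypot(1.0, 16 / 9)
```

Rounded to whole degrees these reproduce the table values (31 and 18 degrees at 3.2 PH, 58 and 35 degrees at 1.6 PH), and the diagonal comes out at roughly 2.04 PH, which is where the setup note's rule of thumb comes from.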

"The average consumer will never hear 3D, they listen to stereo today"

- It is true: today's consumers listen mainly on stereo in-TV speakers or with simple 2.0 or 2.1 soundbars
- 5.1 or 7.1 AVR + speaker systems aren't seen as the logical match for that new sleek, slim flat-panel TV
- So why do 3D audio if most of the audience won't hear it?
- The same reason we do surround today: the high-value viewers will have 3D in their home theaters. Right?

Fraunhofer demonstrates future 3D listening for mainstream consumers with prototype 3D Sound Bar

- Concept demonstration of our vision of mass consumer delivery of 3D audio
- Décor-friendly: no external wires or speakers
- No confusing AVR setup menus: unbox, hang on wall, enjoy
- Could be integrated into the TV or offered as a separate frame
- Dramatic improvement over today's stereo soundbars
- Home theater enthusiasts will still want 11.1 speakers (or more) for ultimate sound quality
- Shown at the Fraunhofer IIS booth at NAB 2014 and IBC 2014

Summary: 3D Audio

- 3D provides a substantial increase in sound quality beyond surround sound
- Four height speakers provide a good tradeoff between cost and sound quality
- 3D sound bar technology could be built into TVs or soundbars to enable a dramatic improvement in sound quality and immersive sound for the mainstream consumer
- Higher Order Ambisonics offers an alternative method of capturing and transmitting 3D sound

MPEG-H Audio improves the listening experience: Tailor reproduction to your listening environment and hardware

- Multi-platform loudness control
  - Loudness measurement built into the encoder
  - Or use external values
  - Uses existing Dolby metadata in legacy mode
- Adjustment of dynamic range to match the listening environment and device capabilities
- Examples:
  - Home listening on good speakers
  - Listening on earbuds at the airport on a mobile phone
  - Listening at home on a tablet

Tailor reproduction to your listening environment and hardware

- Speaker-foolproof rendering for legacy speakers
  - May be able to correct for misplaced speakers in the consumer's home
  - HDMI 2.0 will allow communication of the consumer's installed speaker locations
  - May offer a limited impression of height on existing 5.1 or 2.0 consumer speakers
- Binaural rendering for headphone reproduction
  - Listener perceives the sound stage in front of him, instead of between his ears
  - May be extended to tablet speakers with Fraunhofer Cingo technology

MPEG-H Audio improves the listening experience: The Technical Stuff

A brief introduction to some selected MPEG-H concepts and features

MPEG-H offers the best coding efficiency available today

Includes a highly efficient new audio codec to enable true 3D audio transmission in broadcasting and streaming:

Bitrates in kb/s for:                  Good   Recommended   Transparent
22.2 channels                          256    512           1200
7.1 + 4 height channels + 4 objects    200    384           800
5.1 channels                           96     160           256
2.0 channels                           32     56            160

Includes the unified Fraunhofer/VoiceAge improved speech codec and new spatial coding techniques:
- Better speech quality at low bitrates
- Improved stereo and multichannel imaging
- Improved coding efficiency

Combining streams in the decoder

- In MPEG-H, objects may be delivered over disparate networks and supplied to the decoder separately
- Enables, for example, broadcast TV delivery of the main content, with alternative or auxiliary content delivered over the Internet
- Effectively, late binding in the MPEG-H decoder
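The late-binding idea above can be pictured as the receiver building one list of selectable elements from two delivery paths. The sketch below is purely illustrative: the `AudioElement` type, the `bind_streams` function, and the rule that the broadcast copy wins when both paths carry the same element are assumptions, not behavior taken from the MPEG-H specification.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class AudioElement:
    name: str
    source: str  # "broadcast" or "broadband" (hypothetical labels)


def bind_streams(broadcast: List[AudioElement],
                 broadband: List[AudioElement]) -> List[AudioElement]:
    """Combine elements from both paths into one list, sorted by name.

    If both paths carry an element with the same name, the broadcast
    copy is kept (an assumed policy for this sketch).
    """
    merged = {element.name: element for element in broadband}
    merged.update({element.name: element for element in broadcast})
    return sorted(merged.values(), key=lambda element: element.name)


# Main content over the air, an extra language over the Internet.
over_the_air = [AudioElement("bed", "broadcast"),
                AudioElement("english commentary", "broadcast")]
over_the_internet = [AudioElement("norwegian commentary", "broadband")]
available = bind_streams(over_the_air, over_the_internet)
```

From the viewer's point of view the Norwegian commentary is just one more choice in the menu, regardless of which network delivered it.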

Implementing MPEG-H in TV Broadcasting

An easy four-stage process that individual broadcasters can follow as they desire

Developing a TV Audio System Based on MPEG-H

- The MPEG-H Audio Alliance is developing a practical TV audio system based on the MPEG-H standard
- Includes not just audio coding, but a complete end-to-end system for content production, transmission, and consumer delivery
- We propose a four-stage process for implementation by broadcasters:
  - From the start, consumer decoders will support all features of our system
  - Broadcasters can choose what features to use and when to implement them

[Diagram: end-to-end broadcast signal chain.
Remote truck: audio console (PCM audio, panning data), MPEG-H monitoring unit in monitor mode (integrated loudness, speakers), video, and contribution encoder with control data settings (loudness, channels, configuration), feeding a satellite or IP link.
Network operations: contribution IRD, ingest/QC, master control switcher, playout server (SDI and file transfer), MPEG-H monitoring unit, distribution encoder, and Internet encoder producing adaptive streaming segments.
Local affiliate (U.S. broadcast networks): distribution IRD, master control switcher, playout server, emission encoder, and modulator/transmitter for the OTA transport stream.
Cable/satellite operator: transport stream processing, commercial server, and cable/satellite headend.]

The contribution link may use PCM or an MPEG-H bitstream, depending on the bandwidth available.

Stage 1: Today's surround with current practices

- Replace AC-3 encoders with MPEG-H encoders (likely simultaneous with replacing MPEG-2 video encoders with HEVC or SHVC)
- Benefits:
  - Same audio quality at half the bitrate of AC-3
  - Similar operational practices, using the traditional loudness dialnorm and compressor profiles of AC-3
  - Automatic loudness profiles generated for mobile or tablet delivery

Stage 2: Add objects for consumer choice of audio elements

- Primary language dialogue (for Dialogue Enhancement: consumer adjustment of dialogue level)
- Second or third language dialogue
- Audio Description / Visually Impaired
- Home vs. away team announcer
- Driver/pit crew radio
- Athlete's playlist?
- Sound effects

Mono objects require 20-40 kb/s each
Objects can be programmed individually or in presets or groups
Requires additional channels in the TV plant for objects
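The 20-40 kb/s figure makes the cost of a set of objects easy to estimate. The snippet below is back-of-the-envelope arithmetic only, using the per-object range quoted on the slide; the function and constant names are made up for this example.

```python
# Per-object bitrate range quoted on the slide, in kb/s.
MONO_OBJECT_KBPS = (20, 40)


def object_budget_kbps(num_mono_objects: int):
    """Return the (low, high) extra bitrate in kb/s for the given number of mono objects."""
    low, high = MONO_OBJECT_KBPS
    return num_mono_objects * low, num_mono_objects * high


# Example: primary dialogue, a second language, and audio description.
low_kbps, high_kbps = object_budget_kbps(3)
```

Three mono objects therefore add somewhere between 60 and 120 kb/s on top of the channel bed.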

Stage 3: Adding Immersive Sound

- Add practical immersive audio, perhaps 5.1 or 7.1 + 4 height channels
- Increases the realism of the audio image to create the "you're in the scene" or "at the event" experience
- May use a channel-based or HOA-based immersive sound bed + audio objects
- Requires four additional audio channels in the TV plant for the height channels
- System supports audio up to 128 channels; configurations up to 22.2 are defined in the standard

Stage 4: Adding Dynamic Objects

- Dynamic objects change position over time, as specified by control data in the bitstream
- Interesting not only for feature film or episodic drama, but also for effects in sports broadcasting and other genres
- Requires transmission of control data through the broadcast plant to the final emission encoder
- We'll have a future paper on that
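To make "position over time as specified by control data" concrete, here is a minimal sketch that treats the control data as a list of timed position keyframes and interpolates linearly between them. The keyframe format (time in seconds, azimuth and elevation in degrees), the linear interpolation, and the function name are all assumptions for illustration; the slides do not describe the actual MPEG-H metadata format.

```python
from bisect import bisect_right
from typing import List, Tuple

# (time_s, azimuth_deg, elevation_deg): a hypothetical keyframe format.
Keyframe = Tuple[float, float, float]


def position_at(keyframes: List[Keyframe], t: float) -> Tuple[float, float]:
    """Return (azimuth, elevation) at time t by linear interpolation.

    keyframes must be sorted by time. Before the first keyframe or after
    the last one, the nearest keyframe's position is held. Azimuth
    wrap-around at +/-180 degrees is not handled in this sketch.
    """
    times = [k[0] for k in keyframes]
    if t <= times[0]:
        return keyframes[0][1:]
    if t >= times[-1]:
        return keyframes[-1][1:]
    i = bisect_right(times, t)           # first keyframe strictly after t
    t0, az0, el0 = keyframes[i - 1]
    t1, az1, el1 = keyframes[i]
    w = (t - t0) / (t1 - t0)             # 0 at the earlier keyframe, 1 at the later
    return az0 + w * (az1 - az0), el0 + w * (el1 - el0)


# A sound effect panning from front-left to front-right while rising slightly.
path = [(0.0, -30.0, 0.0), (2.0, 30.0, 10.0)]
midpoint = position_at(path, 1.0)
```

A renderer would evaluate something like this for every block of audio and pan the object to the resulting position on whatever loudspeaker layout the viewer has.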

Progress with MPEG-H continues

- MPEG standard at DIS stage
- Final International Standard expected in January
- Being considered for the upcoming ATSC 3.0 and DVB TV standards
- Pioneering experiments in sports broadcasting and other media to test production with MPEG-H's new features
- Visit Fraunhofer's AES listening room on the show floor to learn more: Booth 1509
- Or go to www.iis.fraunhofer.de/tvaudio or www.mpeghaa.com

Contact
USA: codecs@dmt.fraunhofer.org
EU: amm-info@iis.fraunhofer.de
