A 120 fps High Frame Rate Real-time Video Encoder

Similar documents
3 September, 2015 Nippon Telegraph and Telephone Corporation NTT Advanced Technology Corporation

Functions of PC Communicator: PC-to- FOMA IP Videophone Technology

The i-visto Gateway XG Uncompressed HDTV Multiple Transmission Technology for 10-Gbit/s Networks

Multipoint Streaming Technology for 4K Super-high-definition Motion Pictures

HSTP-IPTV-GUIDE.1 IPTV

A Study on Transmission System for Realistic Media Effect Representation

White paper: Video Coding A Timeline

Development of Transport Systems for Dedicated Service Provision Using Packet Transport Technologies

International Standardization of IPTV at ITU-T IPTV-GSI

NOT FOR DISTRIBUTION OR REPRODUCTION

VIDEO COMPRESSION STANDARDS

Next-generation IPTV Service

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

Performance Estimation Techniques for Browser-based Applications

ITU-T SG16 Sapporo Meeting Report

ILE Implementation: CYBER TELEPORTATION TOKYO at SXSW

Standardization Trend for Super 3G (LTE)

HIGH PERFORMANCE VIDEO CODECS

Next-Generation Switching Systems

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

Testing HEVC model HM on objective and subjective way

HDTV: concatenated compression EBU test results Credits to the EBU projects P/HDTP (M.Visca, RAI) and D/HDC (R.Schaefer, IRT) and their Members

Audio and video compression

Electromagnetic Compatibility Standards and Their Application in NTT Group

Standardization Status towards the Introduction of 5G in 2020

Цепочка доставки UHD. Boris Yurin Account manager TV and Media

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

IPv6 Network Construction Support Solution: Application to the IPv6 Experimental System of IPv6 Promotion Council

Video Communication Ecosystems. Research Challenges for Immersive. over Future Internet. Converged Networks & Services (CONES) Research Group

Network Processing Technology for Terminals Enabling High-quality Services

Image and Video Coding I: Fundamentals

Digital video coding systems MPEG-1/2 Video

Picture Quality Analysis of the SDTV MPEG-2 Encoder Ericsson EN8100. Dagmar Driesnack, IRT V1.1

IP Video Network Gateway Solutions

Dolby Vision. Profiles and levels V1.2.9

EE 5359 Low Complexity H.264 encoder for mobile applications. Thejaswini Purushotham Student I.D.: Date: February 18,2010

High Efficiency Video Coding: The Next Gen Codec. Matthew Goldman Senior Vice President TV Compression Technology Ericsson

High-Throughput Parallel Architecture for H.265/HEVC Deblocking Filter *

Successful High-precision QoS Measurement in Widely Deployed 10-Gbit/s Networks Based on General-purpose Personal Computers

Development of System for Simultaneously Present Multiple Videos That Enables Search by Absolute Time

IP-9610 V02L007 Enhancement List

Tech Note - 05 Surveillance Systems that Work! Calculating Recorded Volume Disk Space

Framework for Supporting Metadata Services

ARIB STD-T53-C.S Circuit-Switched Video Conferencing Services

Standardization Trends of the Next Generation Network in ETSI TISPAN

About MPEG Compression. More About Long-GOP Video

SJTU 4K Video Subjective Quality Dataset for Content Adaptive Bit Rate Estimation without Encoding

R 118 TIERING OF CAMERAS FOR USE IN TELEVISION PRODUCTION

H.264 High Profile: Codec for Broadcast & Professional Video Application

Module 7 VIDEO CODING AND MOTION ESTIMATION

AVP v22

EFFICIENT DEISGN OF LOW AREA BASED H.264 COMPRESSOR AND DECOMPRESSOR WITH H.264 INTEGER TRANSFORM

Triveni Digital Inc. MPEG Technology Series. MPEG 101 (MPEG 2 with a dash of MPEG 4 thrown in) Copyright 2011 Triveni Digital, Inc.

Cisco D9054 HDTV Advanced Compression Encoder

Standardization Activities in International Electrotechnical Commission Technical Committee 86 (Fiber Optics)

MPEG: It s Need, Evolution and Processing Methods

Multi-Source Analyzer (MSA) Series

DASH IN ATSC 3.0: BRIDGING THE GAP BETWEEN OTT AND BROADCAST

JPEG 2000 vs. JPEG in MPEG Encoding

H.264/AVC und MPEG-4 SVC - die nächsten Generationen der Videokompression

Building an Area-optimized Multi-format Video Encoder IP. Tomi Jalonen VP Sales

Standardization Activities for the Optical Transport Network

High-Performance VLSI Architecture of H.264/AVC CAVLD by Parallel Run_before Estimation Algorithm *

Video Codec Design Developing Image and Video Compression Systems

New Service Merging Communications and Broadcasting NOTTV. software platform and techniques for. implementation.

Introduction to Video Encoding

ITU-T J.288. Encapsulation of type length value (TLV) packet for cable transmission systems

Parallel Processing Deblocking Filter Hardware for High Efficiency Video Coding

Georgios Tziritas Computer Science Department

Multicast Technology for Broadcast-type Data Delivery Services

Performance analysis of AAC audio codec and comparison of Dirac Video Codec with AVS-china. Under guidance of Dr.K.R.Rao Submitted By, ASHWINI S URS

Report on ITU-T FG IPTV International Standardization Activities

Cisco D9036 Modular Encoding Platform

RECOMMENDATION ITU-R BT.1720 *

An efficient access control method for composite multimedia content

Professor, CSE Department, Nirma University, Ahmedabad, India

Service Activation System: IP Service Activator Saburo Seto, Kenichi Yamane, and Souhei Majima

Upcoming Video Standards. Madhukar Budagavi, Ph.D. DSPS R&D Center, Dallas Texas Instruments Inc.

EE Low Complexity H.264 encoder for mobile applications

Implementation of A Optimized Systolic Array Architecture for FSBMA using FPGA for Real-time Applications

MediaTek High Efficiency Video Coding

Digital Television: MPEG-1, MPEG-2 And Principles Of The DVB System READ ONLINE

A Perspective on Video and Image Compression in Cable Networks

How to achieve low latency audio/video streaming over IP network?

PAPER Performance Comparison of Subjective Assessment Methods for Stereoscopic 3D Video Quality

Car Information Systems for ITS

Improving User Capacity and Disaster Recovery Time in IP Telephone Service Systems

2020 Town Web Design Converter for Providing Guidance Assistance to Individuals in Cities

EE 5359 H.264 to VC 1 Transcoding

Scalable Video Coding

AVP 3000 Voyager Configuration Packs

MPEG-l.MPEG-2, MPEG-4

Model: LT-122-PCIE For PCI Express

Global IP Network System Large-Scale, Guaranteed, Carrier-Grade

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

ENCODERS: YESTERDAY AND TOMORROW

Survey of the State of P2P File Sharing Applications

Real-time and smooth scalable video streaming system with bitstream extractor intellectual property implementation

8 th November 2016 Making Content Delivery Work by Adding Application Layer Technologies. Simon Jones Chief IPTV Architect BT

Advanced Video Coding: The new H.264 video compression standard

Transcription:

: Creating Immersive UX Services for Beyond 2020 A High Frame Rate Real-time Video Encoder Yuya Omori, Takayuki Onishi, Hiroe Iwasaki, and Atsushi Shimizu Abstract This article describes a real-time HEVC (High Efficiency Video Coding) operating at 120 frames per second (fps) that is designed to achieve higher frame rate video services. The achieves 4K/ video encoding in real time through the synchronized operation of multiple 2K/120 fps s working in parallel. This also makes it possible to achieve temporal scalable coding and transmission with upward compatibility for existing based systems. This temporal scalability is expected to contribute to rapid expansion of the high frame rate video service field. The proposed systems will also open the door to next-generation high frame rate ultra-high definition television services. Keywords: high frame rate,, hardware 1. Introduction The latest video coding standard, H.265/HEVC (High Efficiency Video Coding), achieves double the coding efficiency of H.264/AVC (Advanced Video Coding), making it possible to provide higher definition video services economically. In recent years, 4K video broadcasting and distribution have become increasingly widespread, and realistic image representations are becoming increasingly popular. Such representations demand not only high spatial resolution but also many other factors (Fig. 1). A high frame rate (HFR) improves the moving picture quality and is essential for creating more realistic image representations [1]. The next-generation television system specified in Recommendation ITU-R *1 BT.2020 Parameter values for ultra-high definition television (UHDTV) systems for production and international programme exchange supports HFR formats up to. For the spread of HFR video services, temporal scalable coding with upward compatibility for current based video services is also necessary for s. Several HEVC real-time s have been developed to enable HEVC encoding of over-4k images. However, they are only capable of encoding images up to due to the increase in computational complexity and the need for temporal scalable functionality. This article presents our new 4K/ HEVC, which achieves HFR real-time encoding and 120/ temporal scalable functionality by exploiting our 4K/ HEVC codec largescale integrated circuit (LSI) called NARA, which stands for Next-generation Encoder Architecture for Real-time HEVC Applications [2]. When the flexible customizable software architecture of the NARA LSI is utilized, merely modifying the custom functions of the NARA software architecture makes it possible to achieve temporal scalability such as upward compatible reference picture structures and dual-stream bit-rate control functions. An equipped with one NARA LSI has encoding capability, and synchronized operation of multiple s working in parallel also achieves scalability to larger 4K/ input images. These scalable functionalities of the proposed will contribute to the development of *1 ITU-R: International Telecommunication Union - Radiocommunication Sector 1 NTT Technical Review

Highest quality standard UHDTV Spatial resolution 8K 4K satellite broadcasting in Japan Terrestrial digital broadcasting in Japan Color space Color gamut 4:4:4 4:2:2 4:2:0 HD Wide SDR 4K 2K Frame rate 30 fps 8 bits 10 bits 12 bits Bit depth HDR Dynamic range HD: high definition HDR: high dynamic range SDR: standard dynamic range UHDTV: ultra-high definition television Fig. 1. Factors required for high quality video. Uncompressed images HFR temporal scalable Enhancement Enhancement video receiver decoder 8.3 ms 16.6 ms base Base video receiver decoder Fig. 2. HFR temporal scalable video distribution. new broadcasting and distribution systems for the upcoming UHDTV services. 2. Encoder system architecture for HFR temporal scalability The Association of Radio Industries and Businesses (ARIB), an incorporated association promoting the practical application and dissemination of radio systems in Japan, regulates 120/ temporal scalable formats as an HFR scalable coding standard based on HEVC [3]. The ARIB standard specifies that HFR s must have temporal scalability, and that encoded picture data are distributed into a base stream and an enhancement stream, as illustrated in Fig. 2. Dual-stream bit-rate control must be performed for both the base and enhancement streams to ensure constant bit-rate encoding and distribution for both and decoders. In addition, base images and enhancement images should be periodically received and decoded one by one alternately to prevent deviation of the decoding time for decoders. This limitation of the decoding time leads to changes in the Vol. 15 No. 12 Dec. 2017 2

4K/ input images 8.3 ms 16.6 ms Base (4 2K/) Enhancement (4 2K/) 4K/ HFR 1/4 2/4 3/4 4/4 ( 4) ( 3) ( 2) ( 1) Fig. 3. System configuration. reference structure of inter-frame prediction. The temporal scalable encoding function is added to the existing NARA LSI by utilizing the flexible customizable software architecture of a NARA LSI with large motion search capability. The LSI s software architecture consists of three hierarchical s; the top function is the software for handling fundamental HEVC functions and user functions. This software hierarchy not only solves the complexity of tediously controlling the hardware using a lowlevel interface and the difficulty of handling HEVC common basic functions, but also provides a simple programming interface as custom functions of the top. Temporal scalability essentially requires complicated modifications of the LSI s encoding method. However, because of this software architecture, the dual-stream bit-rate control and the reference structure modification are achieved in some of the custom functions, and thus, they can be easily customized with higher level programming. 3. System configuration The system configuration of the 4K/ we have developed is shown in Fig. 3. It consists of four s. Each includes one NARA LSI and handles the encoding of one of the 2K images, which is squarely divided from the original 4K input images. The receives a input video as two sets of 2K/ video sequences, which correspond to the base and the enhancement, by using a multi-channel input functionality of the NARA LSI. The rearranges the two 2K/ sequences to form one image, and all s operate cooperatively by exchanging synchronization signals in order to share the common clocking and time stamp values to form one 4K/120 fps. The output stream is constructed in the MPEG-2 Transport Stream (TS) *2 format so that the base stream and the enhancement stream have different video packet identifiers. Here, the NARA LSI s multi-channel video output function is effectively utilized. Four transport streams can be transmitted in parallel, and they can also be transmitted with one multiplexed stream by using the s multi-channel multiplex and output functionality with cascaded transport stream input/output connections, as illustrated in Fig. 3. 4. Implementation A photograph of the overall 4K/ with four devices is shown in Fig. 4, and the specifications are listed in Table 1. This inputs a 4K/ video with eight 3G-SDIs (third-generation serial digital interfaces), and outputs an HEVC temporal scalable stream in the MPEG-2 TS format by using a DVB-ASI *2 MPEG-2 Transport Stream: A Moving Picture Experts Group standard digital container format for transmission and storage of audio, video, and Program and System Information Protocol (PSIP) data. 3 NTT Technical Review

Fig. 4. Photograph of. Table 1. System specifications. Video format 3840 2160 pixels / Video coding HEVC Main 10 profile Container format MPEG-2 Transport Stream Input 3G-SDI 8 Output DVB-ASI or 1000BASE-T (digital video broadcasting - asynchronous serial interface) or by using Internet protocol (IP) connections. We observed that the 4K/ s encoded by our were successfully decoded and played by other HFR systems. A photograph taken during a demonstration of 4K/ real-time transmission with our HFR is shown in Fig. 5. Uncompressed 4K/120 fps images were encoded at a constant bit rate of 80 Mbit/s, where the bit rate of the base was 60 Mbit/s and that of the enhancement was 20 Mbit/s and images were distributed over an IP connection. The other real-time 4K/ software decoder received the compressed, which was demultiplexed from MPEG-2 TS and decoded in real time. Then the 4K/ decoded images were displayed on the screen by the 4K/-enabled projector. No video services have been available to date due to the enormous video data size and the requirement for compatibility with legacy systems. This has made it difficult to achieve video Fig. 5. Real-time encoding demonstration. distribution. Our device solves this problem and provides advanced high quality HFR video by combining it with other existing 4K/-enabled devices such as a camera, decoder, and projector. 5. Conclusion We presented a new real-time HEVC for higher frame rate video encoding and transmission exploiting the existing HEVC LSI. This makes it possible to achieve temporal scalable HEVC coding and transmission with upward compatibility for. We plan to continue developing video coding devices to contribute to the provision of new services. References [1] M. Emoto, Y. Kusakabe, and M. Sugawara, High-frame-rate Motion Picture Quality and Its Independence of Viewing Distance, J. Disp. Technol., Vol. 10, No. 8, pp. 635 641, 2014. [2] T. Onishi, T. Sano, Y. Nishida, K. Yokohari, J. Su, K. Nakamura, K. Nitta, K. Kawashima, J. Okamoto, N. Ono, R. Kusaba, A. Sagata, H. Iwasaki, M. Ikeda, and A. Shimizu, Single-chip 4K 4:2:2 HEVC Video Encoder LSI with 8K Scalability, Proc. of 2015 Symposium on VLSI Circuits (VLSI Circuits), C54-C55, Kyoto, Japan, June 2015. [3] ARIB STD-B32: Video Coding, Audio Coding, and Multiplexing Specifications for Digital Broadcasting, Version 3.8, Sept. 2016. Vol. 15 No. 12 Dec. 2017 4

Feature Articles Yuya Omori Researcher, Visual Media Coding Group, Visual Media Project, NTT Media Intelligence Laboratories. He received a B.E. and M.E. in information and communication engineering from the University of Tokyo in 2012 and 2014. He joined NTT Media Intelligence Laboratories in 2014 and has been engaged in research and development (R&D) of parallel processing architecture and a high efficiency video codec algorithm. He is a member of the Institute of Electrical and Electronics Engineers (IEEE) and the Institute of Electronics, Information and Communication Engineers (IEICE). Takayuki Onishi Senior Research Engineer, Visual Media Coding Group, Visual Media Project, NTT Media Intelligence Laboratories. He received a B.E. and M.E. in information and communication engineering from the University of Tokyo in 1997 and 1999. He joined NTT Cyber Space Laboratories (now, NTT Media Intelligence Laboratories) in 1999 and has been engaged in R&D of high quality image coding and transmission. He is a member of IEEE and IEICE. 5 Hiroe Iwasaki Senior Research Engineer, Supervisor, Visual Media Coding Group, Visual Media Project, NTT Media Intelligence Laboratories. She received a B.S. and Ph.D. in information science from Tsukuba University, Ibaraki, in 1992 and 2006. She joined NTT Switching System Laboratories in 1992. She has been involved in R&D of an architecture and real-time operating system for multimedia embedded system LSIs. She moved to NTT Cyber Space Laboratories (now, NTT Media Intelligence Laboratories) in 1997. She is a member of IEEE, IEICE, and the Information Processing Society of Japan. Atsushi Shimizu Senior Research Engineer, Supervisor, Visual Media Coding Group, Visual Media Project, NTT Media Intelligence Laboratories. He received a B.E. and M.E. in electronic engineering from Nihon University, Tokyo, in 1990 and 1992. Since joining NTT in 1992, he has been working on video compression algorithm and software development. He is a member of IEICE, the Institute of Image Electronics Engineers of Japan, and the Institute of Image Information and Television Engineers. NTT Technical Review