Slow Motion Video. MediaTek Technology White Paper

Similar documents
Quality Audio Software Pipeline. Presenters Subhranil Choudhury, Rajendra C Turakani

Parallel Processing SIMD, Vector and GPU s cont.

Picture-in-Picture. Video-in-Video

Creative Media User Guide.

Live 7 Audio Fact Sheet

Media Commons Workshop: Final Cut Pro X

Performance Tuning VTune Performance Analyzer

Skill Area 214: Use a Multimedia Software. Software Application (SWA)

WIFI VS BLUETOOTH Bluetooth. Wi-Fi.

I/O Buffering and Streaming

Performance Analysis of the Intertwined Effects between Network Layers for g Transmissions

How to Make a Movie in imovie How to Make a Movie Trailer in imovie How to Color Correct a Movie in imovie

NVMS User Manual. Version 2.1.0

Chapter 7 Multimedia Operating Systems

Video recorders Series DH

How to edit audio tracks

H.264 Network DVR. Quick Start. GUI Display with USB Mouse Control 336Z

Presented by: Murad Kaplan

ECSE 425 Lecture 25: Mul1- threading

Creating an im ovie project Adjust Project Settings

For today, choose Format: NTSC Rate: Use: DV-NTSC Anamorphic (North American, widescreen)

Completing the Multimedia Architecture

RAID SEMINAR REPORT /09/2004 Asha.P.M NO: 612 S7 ECE

imovie for ipad CREATING A PROJECT

Fast distribution of Data in Wireless Sensor Network using Concurrency Operation

Integrating Concurrency Control and Energy Management in Device Drivers

Lecture 16. Today: Start looking into memory hierarchy Cache$! Yay!

Module 10 MULTIMEDIA SYNCHRONIZATION

Milestone Solution Partner IT Infrastructure Components Certification Report

Digital Video Surveillance Center Management Software

Product Evaluation Guide. for CMOS Megapixel IP Cameras. Version 1.0

COMP : Practical 11 Video

ECE519 Advanced Operating Systems

Plot SIZE. How will execution time grow with SIZE? Actual Data. int array[size]; int A = 0;

1617 APPDC Lesson 2 Worksheet Review sheet for Lesson 2 (This assignment is worth 100 POINTS!) KEY

Input/Output Management

OSMO MOBILE 2. User Manual. v

Features: (no need for QR Code)

Quick Start Guide (V1.03) UD.6L0201B1064A01

Z-Wave FHD IP Camera

Local Playback Software User Manual

Guarding Expert (Android) Mobile Client Software User Manual (V3.2)

9/8/2016. Characteristics of multimedia Various media types

360fly 4K PRO User s Guide

Disks and I/O Hakan Uraz - File Organization 1

How to change clip speed and direction

TDT Coarse-Grained Multithreading. Review on ILP. Multi-threaded execution. Contents. Fine-Grained Multithreading

DiViS Net (Integration)

Hardware-Based Speculation

W Box VMS BOX T E C H N O L O G I E S.

Adobe Premiere Pro CC 2018

Guarding Expert Mobile Client Software

VSPlayer Software User Manual

Design guidelines for embedded real time face detection application

It s All About Speed! Customizing DS and Learning How to Edit Faster Saturday, April 17 th, 10:15 11:30am

I/O Systems. Amir H. Payberah. Amirkabir University of Technology (Tehran Polytechnic)

Multimedia Systems 2011/2012

Creating a short stop motion animation with Adobe Photoshop CC. Open Adobe Photoshop CC. A note about embedding

PROFESSIONAL PROTECTION INCORPORATED

EECS150 - Digital Design Lecture 09 - Parallelism

Cache introduction. April 16, Howard Huang 1

Data Communication/MIDI. Juan P Bello

USER MANUAL. Mac Version

H.264 Network DVR. Quick Start

Get to Know Premiere Elements 2018 What s what and what it does

Impact of transmission errors on TCP performance. Outline. Random Errors

Welcome to Part 3: Memory Systems and I/O

Z-Wave Pan/Tilt IP Camera

Ultra-Low Latency Down to Microseconds SSDs Make It. Possible

NVR Client system. User Guide

imovie: Digital Storytelling

Digital Documentaries. Students as directors of their own learning

RECentral User s Manual Revised on: 12/25/2017

Multiprocessors and Thread-Level Parallelism. Department of Electrical & Electronics Engineering, Amrita School of Engineering

FLAT DATACENTER STORAGE. Paper-3 Presenter-Pratik Bhatt fx6568

Because of the good performance of vocoder and the potential


NVMS User Manual

Software-Controlled Multithreading Using Informing Memory Operations

twisted wave twisted wave [an introduction]

Multiprocessor scheduling

Table of Contents. Tops Digital Solutions DVS 2000 Series Users Manual Ver.2.0 2

Click New Project from the Premiere Pro Welcome screen, or select File > New Project.

Milestone Solution Partner IT Infrastructure Components Certification Report

SunEyes V Series IP Camera P2P CMS user manual

How You Use the Timeline

ADOBE 9A Adobe Premiere Pro CS4 ACE. Download Full Version :

Module 11: I/O Systems

Data Transmission Definition Data Transmission Analog Transmission Digital Transmission

Video Management Software User Manual

APP MANUAL APP MANUAL V2

MULTIMEDIA PROCESSING ON MANY-CORE TECHNOLOGIES USING DISTRIBUTED MULTIMEDIA MIDDLEWARE

Get to Know Premiere Elements 13 What s what and what it does

User Manual Smartwatch SW15

Security Management System Infinova T-series Cameras SD Card Video Synchronization

ASYNCHRONOUS SHADERS WHITE PAPER 0

Reviewer s Guide of HUAWEI EnVizion 360 Camera

All RTS Sentry digital surveillance servers provide username/password authentication for access and configuration.

DKZ-201S KOMFY SWITCH WITH CAMERA

CAN protocol enhancement

Transcription:

MediaTek Technology White Paper May 2015

Introducing Slow Motion Video MediaTek s Slow Motion Video technology allows the device user to easily capture, replay, and share a fast moving object any time. Slow Motion Video can record video at a high frame rate such as 120fps, 180fps or higher, playback in slower speed such as 1/2x,1/4x, and 1/16x. And, the user can easily share this slow motion video. High frame rate recording allow users to capture more details and wonderful moments for a fast moving object. Compared with normal camera recording, which only can capture 24fps 30fps, high frame-rate recording can capture as much as 120,180fps, which allows for more fluent recording. Mediatek provides a whole solution for high frame-rate recording, so that business partners can easily combine their high frame rate camera sensors with Mediatek ICs to provide many new wonderful features. Now VGA@120fps for MT6752, HD@180fps for MT6595 and MT6795 are supported, and higher, such as HD@240fps, will be available on later MediaTek octa-core ICs. Slow Motion Video playback allows user to slow down the playback speed so as to enjoy a fully playback experience. Playing the high frame rate recorded video will bring much more fluent watching experience. But as we all know, the human video processing capability is limited, so humans cannot capture details of a fast moving object if replayed in 1x speed. By using the slow motion playback, the user can not only distinguish each amazing moment but also can enjoy interesting viewing experience. Now Mediatek can support fluent slow motion playback with slowest speed of 1/16x. What s more, Mediatek Slow Motion Video not only supports slowing down the playback speed but can also adjust the effect interval. The user can just slow down the interested interval while playing other intervals at normal speed. The Slow Motion Video Share feature allows the user to create the slow motion effect on video and share the effected video to friends. The Sharing feature allows distributing the interesting video by social media like Youtube, Facebook, email, or radio transmission technology such as Bluetooth. With Mediatek Slow Motion Share function the user can share exactly what she got during playback to friends. The user can adjust the slow motion effect section and set the speed before sharing. Not all devices can support high frame-rate video playback fluently. To ensure that all sharing friends using different terminals can enjoy the same user-created experience, and to increase the sharing 2015 MediaTek Page 2 of 10

speed, the Mediatek Slow Motion Share function will downgrade the video with slow motion effect to normal frame rate of 30fps before sharing. Technology Problems Implementing all the above Slow Motion Video functions presents many technology challenges, as listed here below. High Frame Rate Recording High frame rate recording not only requires powerful hardware, but also needs little SW overhead to raise the HW utilization, which requires many optimizations and tune efforts For video encoding, the HW usage could be divided into 3 parts: Image Signal Processor (ISP), Multimedia Data Path (MDP), and Video Encoder (VENC). The HW block diagram is shown in the following Figure 1. Figure 1. Video Recording HW Block Diagram Requiring high SW overhead in each HW stage will obviously bring disaster for high frame rate recording. To achieve high frame rate recording, the first and the most important technology problem is the SW overhead penalty for each frame. Traditional video HW encoder architecture imposes a pre-processing time and post-processing time SW overhead penalty for each frame. Pre-Processing: before triggering HW encoding a frame, the encoder driver needs to provide useful information to HW (e.g. I frame period, CBR/VBR ). Post-Processing: after HW encoding, useful information such as key frame information, encoded bi-stream byte count is stored in the file writer. According to our experience (MT6795), we need at least 1.5ms for pre-processing and post-processing for each frame. 2015 MediaTek Page 3 of 10

In normal frame rate recording, the SW overhead is insignificant, as shown below. e.g. 4K2K@30fps, SW/HW = 1.5/31.8 = 4.7%. Figure 2. SW Overhead Penalty for Each Frame of 30fps Recording But in high frame rate recording, the SW overhead is obvious, as shown in Figure 3. e.g. HD@180fps, SW/HW = 1.5/4 = 37.5%. The SW overhead will degrade the HW utilization rate and become the main bottleneck to achieve higher than HD@180fps. Minimizing SW overhead is very important and challenging. Figure 3. SW Overhead Penalty for Each Frame of 180fps Recording 2015 MediaTek Page 4 of 10

Slow Motion Video Playback Slow Motion Video Playback requires slowing down the video track and the audio track. The problem is how to maintain the audio quality, without changing the timbre. To change the audio playback speed, the audio duration needs to be extended or shortened. The simplest way to modify audio duration is to resample it. Resample involves rebuilding a continuous waveform from its original samples at different rate. Thus, new samples will be added if an audio is to be extended, and vice versa, samples will be cut if an audio is to be shortened. When resampled audio is played at the original sample rate, it sounds faster or slower. However slowing down the resampled audio will lower the pitch, and speeding up it will raise the pitch. Resampling is a simple way to modify audio duration, but real audio information is lost. The MediaTek Solution & Algorithm A series of optimizations and algorithms are adopted by Mediatek for the Slow Motion Video Feature. High Frame Rate Recording Solution To fully utilize HW capability, Mediatek has applied series of optimizations on the video recording path, including VENC and MDP. Finally the SW overhead of the recording path is minimized to nearly 0. To fully utilize the VENC HW, Mediatek pipelines the SW and HW between MDP and VENC, shown as Figure 4. The VENC and MDP could be pipelined via Command Queue. This design raises the HW utilization and reduces VENC SW overhead. With this Technology the VENC HW register can be configured when the MDP HW is working. 2015 MediaTek Page 5 of 10

Figure 4. New Pipeline Design between MDP and VENC Figure 5 and Figure 6 show the different results between before and after applying SW/HW pipeline between MDP and VENC. Figure 5. VENC without Using Command Queue 2015 MediaTek Page 6 of 10

Figure 6. SW/HW Pipeline between MDP and VENC In Figure 5 VENC has a SW overhead penalty after MDP for each frame. Figure 6 shows, on the other hand, that VENC can start immediately after MDP processing with adopting pipeline between MDP and VENC. What is more, in MediaTek s video HW encoder architecture, the encoding task is split into three threads: HW thread, SW1 thread, SW2 thread. In order to hide SW overhead into HW processing time, SW1 and SW2 phases need to be parallel with HW processing, as shown in Figure 7 below: Prepare HW needed information of next frames early Collect HW output information even if HW is processing Figure 7. SW Overhead & HW Processing Parallel Solution 2015 MediaTek Page 7 of 10

As a result, the VENC SW overhead can be hidden, achieving the goal with of 0 SW overhead. To fully utilize the MDP HW, asynchronous MDP interface is introduced, as shown below in Figure 8. To increase hardware throughput, the MDP driver applies a new design for software and hardware pipeline optimization. The new design allows MDP SW driver configuration to perform asynchronously for every frame. In other words, the SW driver could configure the next frame while HW still operates the last frame. It reduces the user waiting time, and increases the HW utilization. Figure 8. Asynchronous MDP Interface Audio Time Stretch Algorithm To maintain the audio quality while slowing down the audio playback speed, Mediatek has adopted an Audio Time Stretch algorithm instead of simple resampling. Audio Time Stretch technology can retain pitch while prolonging or shortening audio duration. Among the Audio Time Stretch algorithms, Mediatek s exploits the well known time stretch algorithm Pitch Synchronous-OverLap-Add (PSOLA) for its high efficiency and simplicity, and improves PSOLA algorithm with our enhancement. 2015 MediaTek Page 8 of 10

The PSOLA algorithm captures audio waveform in time domain. It tries to find a wave form that has the same pitch and overlap, and adds it to prolong or shorten the duration. The whole process can be divided into Pitch Detection and Cross Fade process. The Pitch Detection process finds similar waveforms with same pitch and overlaps, and adds up new wave form to modify the duration. Figure 9 shows how audio sound is prolonged in PSOLA method. Here waveform 2 is similar to waveform 1, and these 2 sequence is considered as having the same pitch. Concatenating waveform 2 with waveform 1 and overlapping the intersection, waveform 1 is then extended with the same pitch. Figure 9. PSOLA Method Procedure to Prolong Audio The traditional pitch detection algorithm used in PSOLA detects the maximum point of signal s auto correlation. The pitch detection process captures sections of waveforms from orignal audio clips and calculates each waveform s auto correlation. The two sections of waveform with the maximum correlation are considered as having the same pitch. Overlapping them with different overlapping length changes the duration of the original audio, and maintains the same pitch. PSOLA s power consumption is extremely high in a real-time application such as slow motion video. Mediatek simplifies the pitch detection algorithm to achieve balance between CPU loading and audio quality. To reduce calculation, we only perform auto correlation every 8 samples, and after finding the maximum correlation point in this coarse process, we do another round of auto correlation calculation for every 8 samples around the first maximum correlation point. In this way we can reduce CPU loading and still have a good enough quality. Adding our time stretch technology along with Mediatek s slow motion video generates a slow motion video with audible sound, which is sync with video and has correct pitch. 2015 MediaTek Page 9 of 10

Conclusion With our powerful HW and optimized SW, Mediatek s Slow Motion Video technology provides a wonderful feature with excellent performance. Slow Motion Video makes the recording become more interesting and fun for the user. What s much more important, our business partners can create many more innovative features by using our solution. 2015 MediaTek Page 10 of 10