Module 10 MULTIMEDIA SYNCHRONIZATION

Similar documents
Continuity and Synchronization in MPEG. P. Venkat Rangan, Srihari SampathKumar, and Sreerang Rajan. Multimedia Laboratory. La Jolla, CA

ISO/IEC INTERNATIONAL STANDARD

TransMu x. Users Manual. Version 3. Copyright PixelTools Corporation

Transport protocols Introduction

Triveni Digital Inc. MPEG Technology Series. MPEG 101 (MPEG 2 with a dash of MPEG 4 thrown in) Copyright 2011 Triveni Digital, Inc.

INFORMATION TECHNOLOGY - GENERIC CODING OF MOVING PICTURES AND ASSOCIATED AUDIO: SYSTEMS Recommendation H.222.0

Latest Technology for Video-Streaming Gateway of M-stage V Live

CODING METHOD FOR EMBEDDING AUDIO IN VIDEO STREAM. Harri Sorokin, Jari Koivusaari, Moncef Gabbouj, and Jarmo Takala

MISB ST STANDARD. MPEG-2 Transport Stream for Class 1/Class 2 Motion Imagery, Audio and Metadata. 27 October Scope.

MISB ST STANDARD. MPEG-2 Transport Stream for Class 1/Class 2 Motion Imagery, Audio and Metadata. 25 February Scope.

Request for Comments: 4425 Category: Standards Track February 2006

Module 6 STILL IMAGE COMPRESSION STANDARDS

RECOMMENDATION ITU-R BS.776 * Format for user data channel of the digital audio interface **

Internet Streaming Media

MODELING AND SIMULATION OF MPEG-2 VIDEO TRANSPORT OVER ATM NETWOR.KS CONSIDERING THE JITTER EFFECT

Video Streaming Over Multi-hop Wireless Networks

MMC-2 CD MODEL. Document Number R0 File name MM2-05r0.doc. Content: Clause 5 of SFF CD Model

Internet Streaming Media

A NEW SCHEDULING ALGORITHM FOR MULTIMEDIA COMMUNICATION THESIS. Presented to the Graduate Council of the. University of North Texas in Partial

INTERNATIONAL TELECOMMUNICATION UNION

Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2007

Telemetry Standard RCC Document , Chapter 4, September 2007 CHAPTER 4 PULSE CODE MODULATION STANDARDS TABLE OF CONTENTS LIST OF FIGURES

A Proposed Time-Stamped Delay Factor (TS-DF) algorithm for measuring Network Jitter on RTP Streams

MPEG TS. MAYAH Communications Application Note 34. MAYAH Communications GmbH Application Note 34 MPEG TS

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

ES623 Networked Embedded Systems

Distributed Video Systems Chapter 3 Storage Technologies

Configure Video and Audio Settings

VIDEO COMPRESSION STANDARDS

Shown below is a diagram of a very simple playback graph.

Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2006

[1] ISO/IEC :2007 Information technology - Generic coding of moving pictures and associated audio information: Systems

Efficient support for interactive operations in multi-resolution video servers

Network-Adaptive Video Coding and Transmission

AMERICAN NATIONAL STANDARD

MP2TSME v8.0 Release Notes

CHAPTER 4 PULSE CODE MODULATION STANDARDS TABLE OF CONTENTS

Overview Computer Networking What is QoS? Queuing discipline and scheduling. Traffic Enforcement. Integrated services

DRAFT. Encapsulation of Dirac Video content and time code markers in ISO/IEC Transport Streams

CSCD 433/533 Advanced Networks Fall Lecture 14 RTSP and Transport Protocols/ RTP

9/8/2016. Characteristics of multimedia Various media types

INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO

DTC-350. VISUALmpeg PRO MPEG Analyser Software.

Chapter 7 Multimedia Operating Systems

Software Versions Used in this Tutorial: Final Cut Pro: Compressor: 3.0.3

Networking Applications

Module objectives. Integrated services. Support for real-time applications. Real-time flows and the current Internet protocols

The War Between Mice and Elephants

Introduction to Networked Multimedia An Introduction to RTP p. 3 A Brief History of Audio/Video Networking p. 4 Early Packet Voice and Video

TSD. Discussion about the AVTP timestamp for H.264 video transmission described in 1722_D12 chapter 9.5. Christian Sörensen

Bits and Bit Patterns

ISO/IEC INTERNATIONAL STANDARD. Information technology Multimedia application format (MPEG-A) Part 4: Musical slide show application format

MPEG-2 Transport Stream

The VHDL Based Design of the MIDA MPEG1 Audio Decoder

ELEC 691X/498X Broadcast Signal Transmission Winter 2018

RECOMMENDATION ITU-R BT.1720 *

UBR Congestion controlled Video Transmission over ATM Eltayeb Omer Eltayeb, Saudi Telecom Company

Quality Audio Software Pipeline. Presenters Subhranil Choudhury, Rajendra C Turakani

WRED Explicit Congestion Notification

DVS-100P Configuration Guide

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

UNIT-II OVERVIEW OF PHYSICAL LAYER SWITCHING & MULTIPLEXING

Choice 1: audio, a simple audio client server system

Annex (informative) to A001 Rev. 6: Guidelines for the Implementation of DTS Coded Audio in DVB Compliant Transport Streams

Performance of UMTS Radio Link Control

PROBABILISTIC SCHEDULING MICHAEL ROITZSCH

IST MPEG-4 Video Compliant Framework

Introduction to LAN/WAN. Application Layer 4

Optical Storage Technology. MPEG Data Compression

Congestion in Data Networks. Congestion in Data Networks

CN1047 INTRODUCTION TO COMPUTER NETWORKING CHAPTER 6 OSI MODEL TRANSPORT LAYER

Comparison and Evaluation of Packing Schemes for MPEG-2 over ATM using AAL5

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Streaming (Multi)media

CS244a: An Introduction to Computer Networks

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

Multimedia Storage Servers

Lecture 4 Wide Area Networks - Congestion in Data Networks

What Is Congestion? Computer Networks. Ideal Network Utilization. Interaction of Queues

A look at the ROUTE forward ROUTE ~ Real-time Object-delivery over Unidirectional Transport

Maximizing the Number of Users in an Interactive Video-on-Demand System

Compression and File Formats

Youngkwon Lim. Chair, MPEG Systems Samsung

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

Correct Bluetooth EDR FEC Performance with SEC-DAEC Decoding

CS 218 F Nov 3 lecture: Streaming video/audio Adaptive encoding (eg, layered encoding) TCP friendliness. References:

MPEG-2. ISO/IEC (or ITU-T H.262)

ARM MPEG-4 AAC LC Decoder Technical Specification

Interlaken Look-Aside Protocol Definition

Multimedia Networking ECE 599

CS 344/444 Computer Network Fundamentals Final Exam Solutions Spring 2007

Quality of Service (QoS)

The RTP Encapsulation based on Frame Type Method for AVS Video

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

Characterizing Netflix Bandwidth Consumption

PROPOSED SMPTE STANDARD for Television Material Exchange Format (MXF) Operational pattern 1A (Single Item, Single Package)

PREFACE...XIII ACKNOWLEDGEMENTS...XV

Real-Time Protocol (RTP)

MULTIPLEXER / DEMULTIPLEXER IMPLEMENTATION USING A CCSDS FORMAT

Transcription:

Module 10 MULTIMEDIA SYNCHRONIZATION

Lesson 36 Packet architectures and audio-video interleaving

Instructional objectives At the end of this lesson, the students should be able to: 1. Show the packet architecture 2. Define the elements of the packet header 3. State and significance of presentation time stamp (PTS) and decoding time stamps (DTS) 4. Show how audio and video packets can be interleaved 5. State the design considerations of pack size 36.0 Introduction In the previous lesson, we had discussed pack architecture in details. Packs are composed of packets, which may be audio, video or any other media type. In this lesson, we are going to discuss in details about the packet architecture that is composed of packet header and the audio/ video data in the encoded form. We are also going to present how the audio and the video packets are interleaved. 36.1 Packet architecture Fig.36.1 presents the composition of packets. Every packet consists of its individual header, followed by data. Packets can be composed of only one media type and multiple media types cannot be mixed in a packet. Packets are of fixed size, typically 188 Bytes. In contrast, the information content of audio and video frames may vary from one frame to the other and hence frame boundaries need not coincide with the packet boundaries. Each packet may include multiple number of frames, some partial and a frame may have to be accommodated in multiple packets, as shown in fig.36.1. Table-36.1 shows the elements of packet and pack headers.

Packet header Bits Pack header Bits Start code 24 Start code 32 Steam ID 08 ------- Packet Length 16 SCR 33 STD Buffer scale 01 STD Buffer size 13 Mux rate 22 PTS 33 DTS 33 36.2 Presentation Time stamp (PTS) and Decoding Time Stamp (DTS) The packet header may contain a) A decoding time stamp (DTS), defined as the absolute time at which the decoding of the media unit is to be begun b) A presentation time stamp (PTS), defined as the absolute time at which the first media unit present in the packet is to be presented. In an idealized decoder, the decoding process is assumed to be instantaneous, and hence, the PTS and DTS may not differ at all.

In real decoder, the difference between PTS and DTS represents the decoding computational overhead. The time stamps are represented as ticks counted on the 90 khz system lock (modulo 2 33 ) and therefore, can time video playbacks of greater than 24 h in length. In fact, the longest duration of a playback can be 2 33 ------------------------------------ = 26 hours 30 mins 90 x 10 3 x 3600 A PTS need not be provided for each media presentation unit, but the separation between successive PTS s should not exceed 0.7 seconds. The packet header also contains STD buffer size and scale fields, the product of which defines the max input buffer size needed at the STD for the packet. The input buffering delay cannot exceed one second ( ) ( ) bytes i presentation unit p: t p t p 1 second d a where, t d (p) is the decoding time of presentation unit p. t a (i) is the arrival time of bytes i. An encoder must construct its multiplexed stream so that the input buffers at STD neither overflow nor underflow. ISO 11172 multiplexed stream specifications. The stream s buffer requirements at STD do not exceed 46 x 1024 bytes for video and 4096 bytes for audio. The stream conforms to the following spaces : Packet rate 5 300 packets/s if mux rate 10 mux rate 300 packets/s otherwise 5 10

36.3 Audio and Video interleaving The data part of a packet contains a variable number of contiguous bytes from an individual media stream. Packets need not start at media presentation unit boundaries. MPEG Stream Examples Video: 1.2 Mb/s Audio: 192 Kb/s (encoded CD quality stereophonic sound) Length of audio and video packets = 188 bytes (as per MPEG-2) As Video bit rate: Audio bit rate = 6.25: 1, one audio packet is interleaved with every 6 or 7 video packets to match the bit rate ratio. The interleaving may be achieved as: One audio packet for every 6 video packets- Repeated thrice One audio packet for every 7 video packets- Repeated once. 36.4 Choice of pack size Smaller pack size means frequent transmission of system headers. Too small a pack size would impose unnecessary additional packing overheads. The upper bound on the pack size is the 0.7 seconds of max separation between successive packs. If the pack size is 8 packets, we have 10 ms separation between successive packs. At this pack size, the pack separation is 10 ms. Video presentation unit : One frame Audio presentation unit : 1584 Bytes For example, the first pack has an SCR equal to 4008. The first video frame has a DTS equal to 22012. 22012 4008 The video decoder setup time = seconds = 200 ms. 90, 000 Thus, the first I picture is decoded 200 ms after arrival. The PTS of the first video frame is 24000, which is extra 20 ms after decoding, but before displaying the first I frame. This represents the time needed to initialize the video display buffer.

36.5 Conclusion In this lesson, we have presented the composition of a packet and showed how packets of different media may be interleaved. We have also studies the requirements for presentation time stamp and decoding time stamp. Decoding early and presenting later require buffering, which is also necessary for frames reordering in MPEG bit stream. Further, buffering ensures continuity in media playback. This aspect will be discussed in the next lesson.