Bluray (

Similar documents
Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Principles of Audio Coding

Georgios Tziritas Computer Science Department

Chapter 14 MPEG Audio Compression

CMPT 365 Multimedia Systems. Media Compression - Video Coding Standards

MPEG-4 departs from its predecessors in adopting a new object-based coding:

Data Compression. Audio compression

Audio and video compression

Introduction to LAN/WAN. Application Layer 4

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

2.4 Audio Compression

5: Music Compression. Music Coding. Mark Handley

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Video Compression MPEG-4. Market s requirements for Video compression standard

EE Multimedia Signal Processing. Scope & Features. Scope & Features. Multimedia Signal Compression VI (MPEG-4, 7)

Thanks for slides preparation of Dr. Shawmin Lei, Sharp Labs of America And, Mei-Yun Hsu February Material Sources

微电子学院 School of Microelectronics. Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Chapter 12 ZHU Yongxin, Winson

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Figure 1. Generic Encoder. Window. Spectral Analysis. Psychoacoustic Model. Quantize. Pack Data into Frames. Additional Coding.

Video Compression Standards (II) A/Prof. Jian Zhang

Video coding. Concepts and notations.

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

ELL 788 Computational Perception & Cognition July November 2015

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21

Video Coding Standards

Mpeg 1 layer 3 (mp3) general overview

AUDIOVISUAL COMMUNICATION

Optical Storage Technology. MPEG Data Compression

CISC 7610 Lecture 3 Multimedia data and data formats

Module 9 AUDIO CODING. Version 2 ECE IIT, Kharagpur

Multimedia Communications. Audio coding

AUDIOVISUAL COMMUNICATION

CSCD 443/533 Advanced Networks Fall 2017

Compression; Error detection & correction

ZEN / ZEN Vision Series Video Encoding Guidelines

Audio-coding standards

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

MPEG-4. Today we'll talk about...

MPEG-4: Overview. Multimedia Naresuan University

Lecture 3 Image and Video (MPEG) Coding

Week 14. Video Compression. Ref: Fundamentals of Multimedia

6MPEG-4 audio coding tools

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

Compression; Error detection & correction

Audio-coding standards

Networking Applications

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

MPEG-l.MPEG-2, MPEG-4

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

MULTIMEDIA SYSTEMS

Lecture 16 Perceptual Audio Coding

Compression and File Formats

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Introduction to Video Coding

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

Digital video coding systems MPEG-1/2 Video

EE 5359 H.264 to VC 1 Transcoding

Multimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology

ijdsp Interactive Illustrations of Speech/Audio Processing Concepts

ITNP80: Multimedia! Sound-II!

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression

HIKVISION H.265+ Encoding Technology. Halve Your Bandwidth and Storage Enjoy the Ultra HD and Fluency

Advanced Video Coding: The new H.264 video compression standard

The MPEG-4 General Audio Coder

GSM Network and Services

Ch. 4: Video Compression Multimedia Systems

Appendix 4. Audio coding algorithms

The Gullibility of Human Senses

VIDEO COMPRESSION STANDARDS

IST MPEG-4 Video Compliant Framework

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

Efficient Representation of Sound Images: Recent Developments in Parametric Coding of Spatial Audio

EE482: Digital Signal Processing Applications

The Implement of MPEG-4 Video Encoding Based on NiosII Embedded Platform

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig

Ch. 5: Audio Compression Multimedia Systems

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

VIDEO COMPRESSION. Image Compression. Multimedia File Formats. Lossy Compression. Multimedia File Formats. October 8, 2009

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Image, video and audio coding concepts. Roadmap. Rationale. Stefan Alfredsson. (based on material by Johan Garcia)

Digital Image Stabilization and Its Integration with Video Encoder

CHAPTER 10: SOUND AND VIDEO EDITING

White paper: Video Coding A Timeline

IEE 5037 Multimedia Communications Lecture 12: MPEG-4

Parametric Coding of High-Quality Audio

New Results in Low Bit Rate Speech Coding and Bandwidth Extension

MPEG-4 General Audio Coding

White Paper: WiseStream II Technology. hanwhasecurity.com

Digital Media. Daniel Fuller ITEC 2110

The Scope of Picture and Video Coding Standardization

JPEG 2000 vs. JPEG in MPEG Encoding

MPEG-4 Structured Audio Systems

CT516 Advanced Digital Communications Lecture 7: Speech Encoder

Fundamentals of Video Compression. Video Compression

MPEG-4 - Twice as clever?

Transcription:

Bluray (http://www.blu-ray.com/faq) MPEG-2 - enhanced for HD, also used for playback of DVDs and HDTV recordings MPEG-4 AVC - part of the MPEG-4 standard also known as H.264 (High Profile and Main Profile) SMPTE VC-1 - standard based on Microsoft's Windows Media Video (WMV) technology Video bitrate - 40.0Mbps (vs ~10 Mbps for DVD) page 1

MPEG-4 MPEG-4 adopts a object-based coding: Offering higher compression ratio, also beneficial for digital video composition, manipulation, indexing, and retrieval The bit-rate for MPEG-4 video now covers a large range between 5 kbps to 10 Mbps More interactive than MPEG-1 and MPEG-2 page 2

Comparison between Block-based Coding and Object-based Coding page 3

Composition and manipulation of object page 4

Overview of MPEG-4 1. Video-object Sequence (VS) delivers the complete MPEG-4 visual scene, which may contain 2-D or 3-D natural or synthetic objects 2. Video Object (VO) a object in the scene, which can be of arbitrary shape corresponding to an object or background of the scene 3. Video Object Layer (VOL) facilitates a way to support (multi-layered) scalable coding. A VO can have multiple VOLs under scalable coding, or have a single VOL under non-scalable coding 4. Group of Video Object Planes (GOV) groups Video Object Planes together (optional level) 5. Video Object Plane (VOP) a snapshot of a VO at a particular moment page 5

Object oriented VOP I-VOP, B-VOP, P-VOP Objects can be arbitrary shape need to encode the shape and the texture (object) Need to treat MB inside object different than boundary blocks (padding, different DCT etc) page 6

Sprite Coding A sprite is a graphic image that can freely move around within a larger graphic image or a set of images To separate the foreground object from the background, we introduce the notion of a sprite panorama: a still image that describes the static background over a sequence of video frames The large sprite panoramic image can be encoded and sent to the decoder only once at the beginning of the video sequence When the decoder receives separately coded foreground objects and parameters describing the camera movements thus far, it can reconstruct the scene in an efficient manner page 7

page 8

Global Motion Compensation (GMC) Global overall change due to camera motions (pan, tilt, rotation and zoom) Without GMC this will cause a large number of significant motion vectors There are four major components within the GMC algorithm: Global motion estimation Warping and blending Motion trajectory coding Choice of LMC (Local Motion Compensation) or GMC. page 9

page 10

MPEG-7 The main objective of MPEG-7 is to serve the need of audio-visual content-based retrieval (or audiovisual object retrieval) in applications such as digital libraries page 11

MPEG-7 video segment page 12

A video summary page 13

Chapter 13: VOCODER Voice only coder use aspects of human hearing E.g. Formant Vocoder - voice is not equal represented in all frequencies because of vocal cord They can produce good quality sound in 1,000 bps concerned with modeling speech so that the salient features are captured in as few bits as possible use either a model of the speech waveform in time (LPC (Linear Predictive Coding) vocoding break down the signal into frequency components and model these (channel vocoders and formant vocoders) Simulations are improving but still recognizable (automated phone calls) page 14

Phase insensitivity A complete reconstituting of speech waveform is really unnecessary, perceptually: all that is needed is for the amount of energy at any time to be about right, and the signal will sound about right. Solid line: Superposition of two cosines, with a phase shift. Dashed line: No phase shift. wave is different, yet the sound is the same, perceptually page 15

Linear Predictive Coding (LPC) LPC vocoders extract salient features of speech directly from the waveform, rather than transforming the signal to the frequency domain LPC Features: uses a time-varying model of vocal tract sound generated from a given excitation transmits only a set of parameters modeling the shape and excitation of the vocal tract, not actual signals or differences small bit-rate page 16

Chapter 14: MPEG Audio Psychoacoustics Frequency: Remove audio that are masked anyway A lower tone can effectively mask (make us unable to hear) a higher tone The reverse is not true a higher tone does not mask a lower tone well The greater the power in the masking tone, the wider is its influence the broader the range of frequencies it can mask As a consequence, if two tones are widely separated in frequency then little masking occurs Temporal Phenomenon: any loud tone will cause the hearing receptors in the inner ear to become saturated and require time to recover page 17

14.2 MPEG Audio MPEG audio compression takes advantage of psychoacoustic models, constructing a large multidimensional lookup table to transmit masked frequency components using fewer bits Applies a filter bank to the input to break it into its frequency components In parallel, a psychoacoustic model is applied to the data for bit allocation block The number of bits allocated are used to quantize the info from the filter bank providing the compression page 18

MPEG Layers Each succeeding layer offering more complexity in the psychoacoustic model and better compression for a given level of audio quality Layer 1 quality can be quite good provided a comparatively high bit-rate is available Digital Audio Tape typically uses Layer 1 at around 192 kbps Layer 2 has more complexity; was proposed for use in Digital Audio Broadcasting Layer 3 (MP3) is most complex, and was originally aimed at audio transmission over ISDN lines Most of the complexity increase is at the encoder, not the decoder accounting for the popularity of MP3 players page 19

Summary Apply different set of heuristics (than video), yet achieve the goal of attaining good compression by removing components that the human ear is not good at distinguishing More complexity at the encoding phase leads to better compression Humans are lot more sensitive to dropped frames in audio than in video. Audio should also be well synchronized - otherwise distracting Humans also like audio better than video, TVs/ DVDs send higher fidelity audio page 20