What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

Similar documents
Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Interchange formats. Introduction Application areas Requirements Track and object model Real-time transfer Different interchange formats Comparison

CS 335 Graphics and Multimedia. Image Compression

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Image, video and audio coding concepts. Roadmap. Rationale. Stefan Alfredsson. (based on material by Johan Garcia)

Digital Image Representation Image Compression

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

5: Music Compression. Music Coding. Mark Handley

Optical Storage Technology. MPEG Data Compression

Index. 1. Motivation 2. Background 3. JPEG Compression The Discrete Cosine Transformation Quantization Coding 4. MPEG 5.

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Fundamentals of Video Compression. Video Compression

Audio and video compression

Audio-coding standards

Lecture 3 Image and Video (MPEG) Coding

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

CMPT 365 Multimedia Systems. Media Compression - Image

CISC 7610 Lecture 3 Multimedia data and data formats

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

Audio-coding standards

Mpeg 1 layer 3 (mp3) general overview

Lossy compression. CSCI 470: Web Science Keith Vertanen

Video Compression MPEG-4. Market s requirements for Video compression standard

End-to-End Data. Presentation Formatting. Difficulties. Outline Formatting Compression

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013

Compression II: Images (JPEG)

Georgios Tziritas Computer Science Department

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

Lecture 8 JPEG Compression (Part 3)

Principles of Audio Coding

CSCD 443/533 Advanced Networks Fall 2017

EXCELLENT ACADEMY OF ENGINEERING. Telephone: /

Digital Video Processing

Multimedia Communications. Transform Coding

Chapter 14 MPEG Audio Compression

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Lesson 5: Multimedia on the Web

Appendix 4. Audio coding algorithms

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

JPEG. Wikipedia: Felis_silvestris_silvestris.jpg, Michael Gäbler CC BY 3.0

Lecture 5. Digital Media Components Markup and Scripting Languages Multimedia Tools Facilities Provided by the School Suggested Reading

G64PMM - Lecture 3.2. Analogue vs Digital. Analogue Media. Graphics & Still Image Representation

Robert Matthew Buckley. Nova Southeastern University. Dr. Laszlo. MCIS625 On Line. Module 2 Graphics File Format Essay

Elementary Computing CSC 100. M. Cheng, Computer Science

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

Principles of MPEG audio compression

Networking Applications

Video Compression An Introduction

JPEG: An Image Compression System. Nimrod Peleg update: Nov. 2003

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology

2.4 Audio Compression

Lecture 8 JPEG Compression (Part 3)

Compression; Error detection & correction

Introduction to LAN/WAN. Application Layer 4

Image coding and compression

Audio Coding and MP3

CS 074 The Digital World. Digital Audio

Multimedia Communications. Audio coding

Compression; Error detection & correction

Lesson 5: Multimedia on the Web

Video Codec Design Developing Image and Video Compression Systems

Audio Coding Standards

Introduction to Video Coding

EEC-484/584 Computer Networks

JPEG Modes of Operation. Nimrod Peleg Dec. 2005

MULTIMEDIA AND CODING

Multimedia on the Web

Lecture Coding Theory. Source Coding. Image and Video Compression. Images: Wikipedia

Wireless Communication

VC 12/13 T16 Video Compression

ESE532: System-on-a-Chip Architecture. Today. Message. Project. Expect. Why MPEG Encode? MPEG Encoding Project Motion Estimation DCT Entropy Encoding

Part 1 of 4. MARCH

MULTIMEDIA COMMUNICATION

Part III: Survey of Internet technologies

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

EE482: Digital Signal Processing Applications

Fundamentals of Perceptual Audio Encoding. Craig Lewiston HST.723 Lab II 3/23/06

MPEG-1. Overview of MPEG-1 1 Standard. Introduction to perceptual and entropy codings

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

JPEG: An Image Compression System

7.5 Dictionary-based Coding

Introduction ti to JPEG

The Gullibility of Human Senses

2.2: Images and Graphics Digital image representation Image formats and color models JPEG, JPEG2000 Image synthesis and graphics systems

Digital Image Representation. Image Representation. Color Models

Image Coding and Compression

ELL 788 Computational Perception & Cognition July November 2015

Introduction to Computer Science (I1100) Data Storage

IMAGE COMPRESSION USING HYBRID QUANTIZATION METHOD IN JPEG

Parametric Coding of High-Quality Audio

AUDIO AND VIDEO COMMUNICATION MEEC EXERCISES. (with abbreviated solutions) Fernando Pereira

JPEG. Table of Contents. Page 1 of 4

Lecture 16 Perceptual Audio Coding

AUDIOVISUAL COMMUNICATION

ITNP80: Multimedia! Sound-II!

Transcription:

Multimedia What is multimedia? Media types +Text + Graphics + Audio +Image +Video Interchange formats What is multimedia? Multimedia = many media User interaction = interactivity Script = time 1 2 Most common media types Text Graphics Audio Image Video Continuous media Animations (virtual reality) Audio Video What is the difference between normal and continuous media? 3 4 Continuous media processing Interactivity Capture Play D/Aconversion Postprocessing Decompression Preprocessing A/Dconversions Compression Transfer User can control the progress of the presentation Level of interaction + user interface, application, server, content producer Amount of interaction + www page, www application, video, game, virtual reality 5 6 1

Time Media elements are placed in time dimension Different components are synchronized Presentation system takes cares of synchronization (orchestration) Applications Multimedia presentations + PowerPoint Multimedia communications + video conference Multimedia services + video on demand Integrated hybrid applications + record store 7 8 Text ASCII Text documents + Microsoft Word, Adobe Acrobat Structured documents + SGML, HTML, XML Hypertext + HyperCard Graphics Bitmap graphics + paintings + Microsoft Paint Vector graphics + drawings + OpenGL + Postscript 9 10 Physical properties of sound Amplitude +db = 20 log 10 (A/B) + hearing threshold 0 db, pain limit 100-120 db Period / frequency +Hz = 1/s + Hearing band 20 Hz - 20 khz Pulse code modulation Samples are taken at sample frequency Sample frequency has to be at least twice of maximum frequency (so called Nyquist frequency) Common sample frequencies are 8, 44.1, and 48 khz 11 12 2

Pulse code modulation (cont.) Signal amplitude at sample moment is converted into numerical value + Pulse Code Modulation (PCM) Sampling causes a quantization error Frequency transformation Natural sounds are composed of base frequency and its harmonic frequencies Thus, sound can be presented also in frequency dimension 13 14 Clarinet sound Frequency transformation 15 16 Frequency transformation (cont.) The Fourier transformation coefficients present the sound in frequency dimension Stationary signals can be presented accurately with Fourier transformation With dynamic signals discrete Fourier transformation has to be used Usually, fast Fast Fourier Transformation (FFT) algorithm is used Psycho acoustics The properties of human hearing should be considered in coding of sound Sound should be processed in frequency dimension For example, hearing threshold depends on frequency Ear is sensitive to the valleys and hills of spectrum + for example, detection of vowels 17 18 3

Threshold of hearing Psycho acoustics (cont.) Amplitude So called masking effect A loud sound at certain frequency raises the threshold of hearing at wider frequency area Frequency 19 20 Compression methods Begin For example, masking effect can be used in sound coding Signal is divided into frequency bands, which are coded separately (subband coding) For example, Mini Disc records (Sony), DCC Cassettes (Philips), and MPEG audio Subband analysis Scale factor calculation Coding of scale factors FFT analysis Calculation of masking and required bit allocation Determination of nontransmitted subbands Adjustment to fixed bit-rate Quantization of samples Coding of samples Coding of bit allocation Formatting and transmission End 21 Image and video coding Methods are either lossy or lossless Most common lossy method is Discrete Cosine Transformation (DCT) For example, Huffmann method is lossless Coding methods Image coding + JPEG (Joint Photographic Expert Group) Video coding + H.261, H.263 + MPEG (Motion Picture Expert Group) Coding methods utilize usually several coding techniques 23 24 4

JPEG Goals Compression ratio vs. image quality can be selected Works with all kinds of images Both software and hardware implementation Four modes: + sequential coding (original order) + progressive coding (multiphase coding) + lossless coding (perfect copy) + hierarchical coding (multiple resolutions) 25 JPEG Architectures Lossy modes use DCT for 8 x 8 pixel blocks Sequential mode outputs the DCT-coefficients block by block Progressive mode outputs the DCT-coefficient in groups Hierarchical mode encodes several resolutions at the same time 26 Sequential JPEG Progressive JPEG 27 28 Hierarchical JPEG Lossless JPEG 29 30 5

DCT and quantization DCT coding The DCT-coefficients can be represented as a matrix The quantization is done according to a quantization table The coefficients are put in Zig-Zag order This places the zero coefficients in the end of the run Finally Run-Length coding eliminates the zeros 31 32 Statistical coding Uses either Huffman or arithmetic coding Huffman coding requires a separate table Arithmetic coding does not require a table, but need more computation In addition, the compression ratio of arithmetic coding is 5-10 % better Lossless encoding Lossless encoding utilizes prediction Seven different alternatives + how many and which pixels are used Predictive encoding can reach compression ratio of 2:1 33 34 Efficiency 0,25-0,5 bpp: reasonable - good quality 0,5-0,75 bpp: good - very good quality 0,75-1,5 bpp: very good quality 1,5-2,00 bpp: same as original MPEG Resembles JPEG method In addition, properties of moving images have been used + following images are very similar + images have few moving objects + image series change seldom Implementation is more complicated Requires usually special hardware 35 36 6

Motion estimation Moving objects are searched from following images Motion estimation are calculated for the changing blocks Estimations are called motion vectors and they are transmitted as part of the coded information 37 Difference images MPEG image sequence Difference images are calculated between the original and predicted images Only changed (non-zero) areas are transmitted MPEG utilizes both forward an backward prediction: + I = Intra - original image + P = forward Prediction + B = Bidirection prediction 39 Interchange formats 40 Application areas The co-operation and transfer of multimedia applications requires common interchange formats Interchange format defines: A common interchange format can be used for + storage format (e.g., Macromedia Director) + transfer format (e.g., CD-ROM) + real-time transfer format (e.g., digital TV) + content transfer between applications (e.g., group work) + time, place, structure, and operation (procedures) Without an interchange format content produced with one application cannot be read and used with other application Conversion tools are not a good solution 41 42 7

Requirements Data model: + time, synchronization, different formats, pointers, links, interactivity Scripts: + programming language or graphical programming Capacity: + definitions should not take much space Retrieval time: + decompression has to be fast, progressive resolution, etc. Interdependency: + hardware and platform neutral Scalability: + new formats, attributes, etc. MHEG ISO working group Object based multimedia and hypermedia interchange format Supports interactivity and real-time transfer Defines final presentation format 43 44 Properties Defines a set of platform independent components for interactive applications Two ways to implement interactivity: + components are linked to events + scripting language is used Events are caused by timers or user s actions Properties (cont.) Defines both temporal and spatial location Macros allow the definition of complex objects based on models Contains also support for real-time transfer 45 46 MHEG class hierarchy HTML MH-object Behavior Action Link Script Component Content Interaction Selection Modification Composite Descriptor Macro Macro Definition Macro Use HTML is also an interchange format Limited features HTML = structural document, links + DOM model is coming ECMAScript (JavaScript) = interactivity Style sheets (Cascading Style Sheets) = placement Synchronization is still missing + Synchronized Multimedia Integration Language (SMIL) 47 48 8