Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression

Similar documents
Data Representation. Data is more than just numbers! Representations of Text ASCII. Interpreting bits to give them meaning.

Elementary Computing CSC 100. M. Cheng, Computer Science

Digital Media. Daniel Fuller ITEC 2110

Compression; Error detection & correction

Compression; Error detection & correction

Digital Media. Daniel Fuller ITEC 2110

Lossy compression. CSCI 470: Web Science Keith Vertanen

Fundamental of Digital Media Design. Introduction to Audio

Image and Video Coding I: Fundamentals

计算原理导论. Introduction to Computing Principles 智能与计算学部刘志磊

Image and Video Coding I: Fundamentals

Computing in the Modern World

CSCD 443/533 Advanced Networks Fall 2017

Introduction to LAN/WAN. Application Layer 4

CHAPTER 10: SOUND AND VIDEO EDITING

The Gullibility of Human Senses

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Networking Applications

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

UNDERSTANDING MUSIC & VIDEO FORMATS

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

Lecture 16 Perceptual Audio Coding

CHAPTER 6 Audio compression in practice

ITNP80: Multimedia! Sound-II!

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

Audio and video compression

AET 1380 Digital Audio Formats

CISC 7610 Lecture 3 Multimedia data and data formats

CS 528 Mobile and Ubiquitous Computing Lecture 4a: Playing Sound and Video Emmanuel Agu

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

A Digital Audio Primer

Data Representation. Reminders. Data is more than just numbers! Interpreting bits to give them meaning. Part 3: Media - Text and Pictures

Digital Media. Lecture 2: SemesterOverview. Georgia Gwinnett College School of Science and Technology Dr. Mark Iken

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

Lecture 19 Media Formats

COS 116 The Computational Universe Laboratory 4: Digital Sound and Music

CS 074 The Digital World. Digital Audio

Video Compression MPEG-4. Market s requirements for Video compression standard

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Lossy compression CSCI 470: Web Science Keith Vertanen Copyright 2013

Principles of Audio Coding

3.01C Multimedia Elements and Guidelines Explore multimedia systems, elements and presentations.

So, what is data compression, and why do we need it?

Bluray (

Digital Audio Basics

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Snr Staff Eng., Team Lead (Applied Research) Dolby Australia Pty Ltd

MPEG-l.MPEG-2, MPEG-4

Digital Audio. Amplitude Analogue signal

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

ESE532: System-on-a-Chip Architecture. Today. Message. Project. Expect. Why MPEG Encode? MPEG Encoding Project Motion Estimation DCT Entropy Encoding

Introducing Audio Signal Processing & Audio Coding. Dr Michael Mason Senior Manager, CE Technology Dolby Australia Pty Ltd

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

AAMS Auto Audio Mastering System V3 Manual

Standard File Formats

EE482: Digital Signal Processing Applications

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

MULTIMEDIA AND CODING

EEC-484/584 Computer Networks

Mpeg 1 layer 3 (mp3) general overview

Multimedia on the Web

Data Compression. Audio compression

About MPEG Compression. More About Long-GOP Video

Technical University of Munich - FTP Site Statistics. Top 20 Directories Sorted by Disk Space

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Digital video coding systems MPEG-1/2 Video

Robin Dittwald Institut für Linguistik Universität zu Köln June Subtitles with ELAN An Easy Guide

Image and video processing

Data Representation. Interpreting bits to give them meaning. Part 1: Numbers, Bases, and Binary

Audio for Everybody. OCPUG/PATACS 21 January Tom Gutnick. Copyright by Tom Gutnick. All rights reserved.

Audio-coding standards

Data Compression. Media Signal Processing, Presentation 2. Presented By: Jahanzeb Farooq Michael Osadebey

Tech Note - 05 Surveillance Systems that Work! Calculating Recorded Volume Disk Space

Bits and Bit Patterns

JPEG 2000 compression

TKT-2431 SoC design. Introduction to exercises

Lecture Information Multimedia Video Coding & Architectures

Optimizing Audio for Mobile Development. Ben Houge Berklee College of Music Music Technology Innovation Valencia, Spain

All RTS Sentry digital surveillance servers provide username/password authentication for access and configuration.

Lecture #3: Digital Music and Sound

MULTIMEDIA SYSTEMS

Multimedia Systems Speech I Mahdi Amiri February 2011 Sharif University of Technology

Skill Area 214: Use a Multimedia Software. Software Application (SWA)

3 Sound / Audio. CS 5513 Multimedia Systems Spring 2009 LECTURE. Imran Ihsan Principal Design Consultant

Audio-coding standards

AUDIOVISUAL COMMUNICATION

Gerald's Column by Gerald Fitton

Compression and File Formats

Megapixel Networking 101. Why Megapixel?

Amazing Audacity: Session 1

Multimedia applications

2.4 Audio Compression

TKT-2431 SoC design. Introduction to exercises. SoC design / September 10

AUDIOVISUAL COMMUNICATION

University of Pennsylvania Department of Electrical and Systems Engineering Digital Audio Basics

Lecture Information. Mod 01 Part 1: The Need for Compression. Why Digital Signal Coding? (1)

Transcription:

Data Representation Interpreting bits to give them meaning Part 4: Media -, Video, Compression Notes for CSC 100 - The Beauty and Joy of Computing The University of North Carolina at Greensboro Reminders Big thing for this week: Project Proposal Presentations: This Friday Homework 3 Hopefully mostly complete! Due this Wednesday Blown to Bits reading: Contribute to Chapter 4 online discussion by Wednesday What is sound? is just rapid fluctuations in air pressure, detected by the (somewhat delicate!) organs in our ears

waveforms We can plot changes in pressure over time: Main components: Intensity (how much pressure changes): We perceive this as "loudness" and in graph would be reflected in larger fluctuations Frequency: How rapid are the fluctuations? waveforms - Zooming in! "Pure" tone is a sine wave (real world sounds are generally not pure!) One cycle here is approximately 0.150 seconds to 0.157 seconds: Period is 0.007 seconds Frequency is 1/0.007 = 142.857... Hz (for "Hertz") For reference, "middle C" is around 261.626 Hz An octave doubles/halves frequency, so this note is a probably something like a "D below middle C" (which is 146.8 Hz) Question: How do we make this digital? waveforms - Zooming in even more! Answer: We sample the waveform many times per second. This is zoomed in enough where you can see actual samples: Quality of sound reproduction depends on sample rate (samples per second): In this example, 22 samples between 0.1890 and 0.1900 So 22/(0.190-0.189) = 22,000 samples per second CD sound: 44,100 samples/second Typical DVD sound: 48,000 samples/second Nyquist Theorem: Perfect reconstruction of signals with frequency <= F if you sample at (2/F) samples/second

waveforms - Zooming in even more! Answer: We sample the waveform many times per second. This is zoomed in Question: enough where What you is maximum can see actual frequency samples: that can be reconstructed from a CD? From a DVD? For comparison: Human hearing range is typically 20 Hz to around 20,000 Hz Quality of sound reproduction depends on sample rate (samples per second): In this example, 22 samples between 0.1890 and 0.1900 So 22/(0.190-0.189) = 22,000 samples per second CD sound: 44,100 samples/second Typical DVD sound: 48,000 samples/second Nyquist Theorem: Perfect reconstruction of signals with frequency <= F if you sample at (2/F) samples/second Video Basics Can be viewed as a series of still images 24 frames per second (fps) in movies 30 fps in US television Motion-JPEG (M-JPEG) is exactly this: JPEG image for each frame Benefit: Very simple format to work with and edit Drawback: Doesn't take advantage of temporal similarities between frames MPEG (DVD format) includes motion estimation: Video A few more details... Frames are no longer independent! MPEG has three frame types: I-frames (intra-coded - independent) P-frames (predicted) B-frames (bi-predictive) Must buffer B-frames until the next P-frame Can only "enter" a video stream at an I-frame (or you see very blocky artifacts). Video editors need to be very careful about this (splicing at non-i frames can be tricky!)

Video and A movie typically has multiple "streams" multiplexed together: Video stream Audio stream (maybe multiple for multi-language) Subtitles Rendering software must synchronize streams - otherwise sound and video may be off (probably everyone has seen this happen!) Compression Taking advantage of redundancies and other structure to give smaller file sizes. Two main types: Lossless: Allows perfect reconstruction of original data Zip, RAR, FLAC,... (JPEG has a lossless mode too!) Lossy: Reconstruction is an approximation of original Most media formats: JPEG, MPEG, MP3,... Can usually trade off quality for compression Note that digital sampling/capture is already a lossy process (Remember taking advantage of human color vision?) Compression Examples, and what you can expect Text: "Pride and Prejudice" Original (uncompressed) 685 kb Zip 250 kb GZip 250 kb RAR 217 kb 7Zip 204 kb BZip 176 kb Audio: "London Calling" (3:19 long) CD audio (uncompressed) 35.2 MB Zip (lossless, general) 33.9 MB FLAC (lossless, audio) 25.4 MB MP3 (lossy, 128 kbps) 3.2 MB Ogg (lossy, quality 3) 3.1 MB Notes: Zip is not designed for audio Both MP3 and Ogg sound good at this rate MP3 plays on almost all players MP3 encoding (using LAME) took 11.2 sec Ogg encoding took 6.1 sec

Compression Examples, and what you can expect - cont'd Picture: 3648 x 2736 (9.98 MPixel) Raw Zip (lossless) 29.9 MB 17.0 MB Video: "Wizard of Oz" (1:41:42) 480x720 @30fps BZip (lossless) 10.9 MB Raw 190 GB PNG (lossless) 9.8 MB HQ DVD 3.6 GB JPEG (lossy - Q=95) JPEG (lossy - Q=85) 2.1 MB 1.1 MB Notes: DVD compression is over 50:1 DiVX / MP4 can give 200:1 or more Summary There's a lot more we could talk about Logarithmic scale of human perception (intensities, frequencies, etc.) Image formats: bitmapped vs vector formats Compression techniques Other imagery formats (multispectral images)... Explore this if it interests you! Following your curiosity is a great way to learn...