Transporting audio-video. over the Internet

Similar documents
Networking Applications

Module objectives. Integrated services. Support for real-time applications. Real-time flows and the current Internet protocols

Introduction to Networked Multimedia An Introduction to RTP p. 3 A Brief History of Audio/Video Networking p. 4 Early Packet Voice and Video

Streaming (Multi)media

2.4 Audio Compression

Digital Asset Management 5. Streaming multimedia

Multimedia Networking

Introduction to LAN/WAN. Application Layer 4

Multimedia Communications

CS640: Introduction to Computer Networks. Application Classes. Application Classes (more) 11/20/2007

CS 457 Multimedia Applications. Fall 2014

Chapter 9. Multimedia Networking. Computer Networking: A Top Down Approach

Audio and video compression

Data Communication Prof.A. Pal Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur Lecture 36 Multimedia Networks

CS519: Computer Networks. Lecture 9: May 03, 2004 Media over Internet

Chapter 7 Multimedia Networking

Multimedia Networking

CSC 4900 Computer Networks: Multimedia Applications

Transporting Voice by Using IP

A common issue that affects the QoS of packetized audio is jitter. Voice data requires a constant packet interarrival rate at receivers to convert

Outline. QoS routing in ad-hoc networks. Real-time traffic support. Classification of QoS approaches. QoS design choices

Lecture 14: Multimedia Communications

Multimedia Networking

Principles of Audio Coding

Multimedia. Multimedia Networks and Applications

ITEC310 Computer Networks II

Speech-Coding Techniques. Chapter 3

Mohammad Hossein Manshaei 1393

Audio Coding and MP3

Chapter 28. Multimedia

Provides port number addressing, so that the correct destination application can receive the packet

Perceptual coding. A psychoacoustic model is used to identify those signals that are influenced by both these effects.

Synopsis of Basic VoIP Concepts

ETSF10 Internet Protocols Transport Layer Protocols

Multimedia Networking

MULTIMEDIA I CSC 249 APRIL 26, Multimedia Classes of Applications Services Evolution of protocols

CSCD 433/533 Advanced Networks Fall Lecture 14 RTSP and Transport Protocols/ RTP

Chapter 5 VoIP. Computer Networking: A Top Down Approach. 6 th edition Jim Kurose, Keith Ross Addison-Wesley March Multmedia Networking

Both LPC and CELP are used primarily for telephony applications and hence the compression of a speech signal.

Transporting Voice by Using IP

Introduction to Quality of Service

CS 218 F Nov 3 lecture: Streaming video/audio Adaptive encoding (eg, layered encoding) TCP friendliness. References:

Multimedia networking: outline

Latency and Loss Requirements! Receiver-side Buffering! Dealing with Loss! Loss Recovery!

Multimedia Networking

Lecture 9: Media over IP

Ch 4: Multimedia. Fig.4.1 Internet Audio/Video

Real-Time Protocol (RTP)

Multimedia Protocols. Foreleser: Carsten Griwodz Mai INF-3190: Multimedia Protocols

ABSTRACT. that it avoids the tolls charged by ordinary telephone service

Mpeg 1 layer 3 (mp3) general overview

Theoretical and Practical Aspects of Triple Play

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

ITNP80: Multimedia! Sound-II!

Multimedia networked applications: standards, protocols and research trends

Kommunikationssysteme [KS]

RECOMMENDATION ITU-R BT.1720 *

Multimedia Applications over Packet Networks

Quality of Service. Qos Mechanisms. EECS 122: Lecture 15

Today. March 7, 2006 EECS122 Lecture 15 (AKP) 4. D(t) Scheduling Discipline. March 7, 2006 EECS122 Lecture 15 (AKP) 5

RTP/RTCP protocols. Introduction: What are RTP and RTCP?

Multimedia Applications. Classification of Applications. Transport and Network Layer

RTP: A Transport Protocol for Real-Time Applications

Internet Streaming Media. Reji Mathew NICTA & CSE UNSW COMP9519 Multimedia Systems S2 2007

Transport protocols Introduction

Introduction to Voice over IP. Introduction to technologies for transmitting voice over IP

RSVP Support for RTP Header Compression, Phase 1

Multimedia Networking and Quality of Service

Multimedia

Course Syllabus. Website Multimedia Systems, Overview

Distributed Multimedia Systems. Introduction

Multimedia Applications. Internet Technologies and Applications

Tema 0: Transmisión de Datos Multimedia

RTP. Prof. C. Noronha RTP. Real-Time Transport Protocol RFC 1889

Mohammad Hossein Manshaei 1393

Improving QoS of VoIP over Wireless Networks (IQ-VW)

4 rd class Department of Network College of IT- University of Babylon

Chapter 9. Multimedia Networking. Computer Networking: A Top Down Approach

Overview. A Survey of Packet-Loss Recovery Techniques. Outline. Overview. Mbone Loss Characteristics. IP Multicast Characteristics

Multimedia in the Internet

AUDIO. Henning Schulzrinne Dept. of Computer Science Columbia University Spring 2015

Lecture 3: Modulation & Layering"

EEC-484/584 Computer Networks

in the Internet Andrea Bianco Telecommunication Network Group Application taxonomy

Mahdi Amiri. February Sharif University of Technology

Service/company landscape include 1-1

Computer Networks. Wenzhong Li. Nanjing University

Real-time Services BUPT/QMUL

VoIP Basics. 2005, NETSETRA Corporation Ltd. All rights reserved.

IEEE 802 Executive Committee Study Group on Mobile Broadband Wireless Access < Implication of End-user.

VoIP. ALLPPT.com _ Free PowerPoint Templates, Diagrams and Charts

Real-Time Control Protocol (RTCP)

Real-Time Transport Protocol (RTP)

Mul$media Streaming. Digital Audio and Video Data. Digital Audio Sampling the analog signal. Challenges for Media Streaming.

Multimedia! 23/03/18. Part 3: Lecture 3! Content and multimedia! Internet traffic!

Part 3: Lecture 3! Content and multimedia!

MEDIA TRANSPORT USING RTP

Popular protocols for serving media

Internet Streaming Media

White Paper Voice Quality Sound design is an art form at Snom and is at the core of our development utilising some of the world's most advance voice

Transcription:

Transporting audio-video over the Internet

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

Audio/video network requirements Key requirements Bit rates Transit delay variation Multicasting capabilities (for distribution) Other requirements Transit delay Error rate 3/ 3

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

call Basics window

Nyquist theorem To faithfully represent an analog signal if maximum frequency f sampling rate at least 2f Sampling at f : impossible to reconstruct Application to audio if sampling rate is 8 khz bandwidth is 3.4 khz Sampling at 2f : easier to reconstruct 3/ 6

Sound, Audio, Speech,... Sound: vibration of matter Audio: audible sound (by humans) human audible spectrum: 20 Hz - 20kHz Speech: a particular type of sound we hear better than we talk speech spectrum: 50-10 khz Music: a particular case of non-speech sound Music Infra/Ultra sound Speech Sound Other non-speech Audio sound Nonspeech Noise 3/ 7

Basics window return

Types of applications Traditional real-time applications e.g. PABXs constant bit rate (CBR) Traditional bulk data applications e.g. file transfer, email available bit rate (ABR) Modern real-time applications e.g compressed audio, video variable bit rate (VBR) 3/ 9

Quality of Service and bit rate CBR applications User satisfaction ABR applications bit rate VBR applications From S.Shenker, Fundamental Design Issues for the Future Internet, 1995 3/ 10

Principle (or platitude) The grass is always greener on the other side of the hill...

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

Audio bit rate requirements Quality Technique or Kbps Compr. standard Telephone quality Standard G.711 PCM 64 Standard G.721 ADPCM 32 Y Lower G.728 LD-CELP 16 Y Lower GSM 13 Y Standard- G.729 LD-CELP 8 Y Lower+ CELP 5-7 Y CD Quality Consumer CD-audio CD-DA 1441 (stereo) Consumer CD-audio MPEG with FFT 192-256 Y Sound studio quality MPEG with FFT 384 Y Consumer CD-audio MPEG2.5 Layer III 128 (stereo) Y (MP3) Reproduced from "Understanding Networked Multimedia" by, Prentice Hall 1995 3/ 13

Which bit rate is actually needed? Network overheads incl.: RTP header (12 bytes) Transport Protocol header (usually UDP, 8 bytes) IP header (20 bytes) Example: raw G.711 64 Kbps requires from 68 to 80 Kbps However, speech contains silence raw digitized sound stream transport data unit stream packet stream 3/ 14

Monologue typically 15% silence Silences in speech Bi-party telephone conversation 20% silence for overall conversation 60% silence for each party S silence ~ 15% If silence suppressed, required bit rate is in effect <40% of nominal raw bit rate S silence ~ 60% 3/ 15

Observations, Trends Audio does not eat bandwidth Voice packets will swim in an ocean of data packets

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

Video bit rate requirements Quality Technique or Mbps Compr. standard Video conf. quality H.261 0.1 Y VCR quality MPEG-1 1.2. Y Broadcast quality MPEG-2 2-4 (1) Y Studio-quality digital TV Uncompressed ITU-R 601 166 Compressed MPEG-2 3 to 6 (2) Y HDTV Uncompressed CD-DA 2000 Compressed MPEG-2 25 to 34 Y (1): future; current implementations: 4 to 7 (2): future; current implementations: 6 to 10 Reproduced from "Understanding Networked Multimedia" by, Prentice Hall 1995 3/ 18

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

Network Transit Delay Telephone conversation: Net Round-trip delay < 400 ms for natural conversation 400 ms Virtual reality Round-trip delay < 100 ms for impression of immersion 100 ms Net VR server 3/ 20

Transit delay variation (Jitter) Receiver to wait a delay offset before playout Playout process Called delay equalization Digital to analog converter Increases overall end-toend latency to be regularly fed 3/ 21

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

Inter-media synchronization Called orchestration Particular case: lip synchronization Net A skew of 80-100 ms is generally tolerated < 100 ms 3/ 23

Audio/video relative priorities The ear behaves as a differentiator The eye behaves as an integrator Toleration of transmission errors affecting sound much lower than for video If audio and video compete for network bandwidth, audio should have priority See UNM book, p. 347-348 3/ 24

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

call In-depth window

Audio-compression techniques Encoding techniques DPCM, Delta ADPCM Source compression techniques Based on psycho-acoustic model Transform encoding (all sounds) Source modeling/synthesis coding (for speech) 3/ 27

Psycho-acoustic and Masking Response of ear to frequency: ear most sensitive between 2-5 khz Masking: ear does not register energy in some frequencies band, when there is more energy in a nearby band 60 40 20 0 20 100 1000 10000 actual frequencies 60 40 20 0 perceived frequencies 20 100 1000 10000 f f 3/ 28

Physiology and masking Ear membrane vibrates as a function of frequency low freq. Ear Membrane high freq. High frequencies: at one end Low frequencies: at opposite end Vibration of a area forces close areas to vibrate at the same frequency, and not at their own High amplitude vibrating area Masked areas 3/ 29

Voice modeling techniques Human Vocal system model relies on a set of cylinders of differing diameters (e.g. 10 in LPC-10) excited by a signal at a certain frequency Operates over 20 ms, on standard PCM samples 3/ 30

In-depth window return

Principle (or platitude) This is what we perceive that count, not what the physical reality is! or The Reality is what we perceive

Information rate, bit rate, entropy Information rate is different from bit rate Information content or entropy of a sample: a function of how different it is from the predicted value Shannon s theory: any signal which is totally predictable: carries no information (e.g. a sine wave) noise is completely unpredictable: high entropy 3/ 33

Real, contentless signals, noise Real signals lie somewhere between the two extremes total predictability (no information) Real signal total unpredictability (noise) 3/ 34

The Effect of compression Compression removes redundancy... but 3/ 35

Principle (or platitude) Redundancy is essential for resistance to errors

The Effect of compression Compression removes redundancy Redundancy essential for resistance to errors Compressed data more sensitive to errors 3/ 37

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

call Basics window

TCP behaviour Slow start S TCP slow start R TCP TCP increases pkt rate Sender aware of packets dropped packets dropped feed back Sender decreases bit rate when packet dropped t TCP reduces pkt rate 3/ 40

UDP behaviour UDP sends blindly to a receiver S UDP full speed start R UDP less data to send No feedback from the receiver more data, packets dropped NO feed back Sender unaware whether packets are dropped/lost UDP maintain packet rate t 3/ 41

Basics window return

Protocols for real-time audio and video Audio/video applications cannot operate over TCP They use UDP which has no timestamp, feedback,... Data applications (FTP, HTTP) Real Time audio or video application RTP TCP UDP All applications use RTP (Real-Time Transport Protocol) time-stamp packet loss detection IP 3/ 43

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

call In-depth window

Real-Time Protocol RTP: an Internet IETF standard Supports timing reconstruction: timestamp (4 bytes) loss detection: sequence number (2 bytes) Lighter than TCP no retransmission, no flow control TCP header: 20 bytes; RTP header: 12 bytes 3/ 46

Real-Time Protocol services Two parts in RTP RTP per se: for carrying data RTCP: to identify participants, monitor the quality of the service Session control (RTCP) Receivers send periodically reports Reports indicate how well the reception is 3/ 47

In-depth window return

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

IP+UDP+RTP headers = 40 bytes On header overhead At 64 Kbps PCM 20 ms = 160 Bytes overall rate = 80 Kbps IP RTP UDP 40 audio data 160 At 8 Kbps (e.g. with G.729) (e.g. over modem lines) 20 ms = 20 Bytes IP RTP UDP 40 data 20 3/ 50

IP, UDP, RTP compression IP/UDP/RTP compression specified by Robust Header Compression (ROHC) IETF draft Can reduce to 1 byte (best case) Operates on a link-by-link basis 3/ 51

Basic principles Fixed fields removal parts of the headers remain unchanged between pkts Differential encoding some fields vary in a predictive, monotonic way Re-coding combinations of fields some fields may be combined and hash coded 3/ 52

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

Low-bit rate redundancy Compression aims at removing redundancies... but redundancies improve resistance to data errors Primary coding re-code each packet at lower resolution Pkt 3 Redundancy coding R3 Pkt 2 R2 Pkt 1 R1 insert re-coded packet into one subsequent pkt(s) Pkt 3 R2 Pkt 2 R1 Redundant pkt stream 3/ 54

Example of redundancy: RAT (UCL) Primary coding = 64 Kbps 160 bytes Pkt 2 Pkt 1 Redundancy coding = 4.8 Kbps 12 bytes R2 R1 Experiment shows reasonable repair with high loss rate (40%) 3/ 55

Key requirements Bit rate requirements Audio requirements Video requirements Delay requirements Jitter Inter-media synchronization On compression... TCP, UDP basics RTP Needs and Principles Header overhead End-systems improvements Redundancy coding Error concealment

Error concealment (audio example) Replace missing packet with silence OK if pkt<16ms, loss rate<1%; beyond, clipping effect (1) white noise (better than silence) (1) OK means tolerable; does not mean unnoticed 3/ 57

Phonemic Restoration brain uses phonemic restoration: the ability of the brain to subconsciously repair a missing segment of speech with the correct sound phonemic restoration occurs better when missing segment replaced by noise instead of silence 3/ 58

End of Part 3 Transporting audio-video over the Internet

call In-depth window

In-depth window return

Principle (or platitude) Systems with no reservation (e.g. connectionless networks) scale well, but are poor at QoS guarantees Too bad for IP, Ethernet Systems with reservations (e.g. connection-oriented networks) are good at QoS guarantees and poor at scaling Too bad for RSVP, ATM