Embedded lossless audio coding using linear prediction and cascade coding

Similar documents
MPEG-4 General Audio Coding

The MPEG-4 General Audio Coder

5: Music Compression. Music Coding. Mark Handley

CS 335 Graphics and Multimedia. Image Compression

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

Multimedia Communications. Audio coding

Parametric Coding of High-Quality Audio

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

CMPT 365 Multimedia Systems. Media Compression - Image

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Scalable Perceptual and Lossless Audio Coding based on MPEG-4 AAC

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Audio Compression. Audio Compression. Absolute Threshold. CD quality audio:

Audio-coding standards

Audio-coding standards

JPEG 2000 vs. JPEG in MPEG Encoding

JPEG Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features

Video Codec Design Developing Image and Video Compression Systems

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

Compressed Audio Demystified by Hendrik Gideonse and Connor Smith. All Rights Reserved.

An Efficient Context-Based BPGC Scalable Image Coder Rong Zhang, Qibin Sun, and Wai-Choong Wong

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Audio Fundamentals, Compression Techniques & Standards. Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Video Coding in H.26L

JPEG 2000 A versatile image coding system for multimedia applications

Introduction to Video Coding

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Wireless Communication

Optical Storage Technology. MPEG Data Compression

Perceptual Coding. Lossless vs. lossy compression Perceptual models Selecting info to eliminate Quantization and entropy encoding

Chapter 14 MPEG Audio Compression

Fundamentals of Multimedia

Preface. I Introduction and Multimedia Data Representations 1

CSEP 521 Applied Algorithms Spring Lossy Image Compression

Introduction to Video Encoding

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

Multimedia Networking ECE 599

The adaptive serializable snapshot isolation protocol for managing database transactions

Convention Paper Presented at the 118th Convention 2005 May Barcelona, Spain

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

Fundamentals of Video Compression. Video Compression

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University

MPEG-4 Audio Lossless Coding

06/12/2017. Image compression. Image compression. Image compression. Image compression. Coding redundancy: image 1 has four gray levels

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

Modified SPIHT Image Coder For Wireless Communication

Contributions to image encryption and authentication

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

Lecture 5: Compression I. This Week s Schedule

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

Standard Codecs. Image compression to advanced video coding. Mohammed Ghanbari. 3rd Edition. The Institution of Engineering and Technology

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

A review of lossless audio compression standards and algorithms

A Public Domain Tool for Wavelet Image Coding for Remote Sensing and GIS Applications

Digital Image Representation Image Compression

Digital Image Processing

MULTIDIMENSIONAL SIGNAL, IMAGE, AND VIDEO PROCESSING AND CODING

Compression II: Images (JPEG)

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Audio Coding Standards

LIST OF TABLES. Table 5.1 Specification of mapping of idx to cij for zig-zag scan 46. Table 5.2 Macroblock types 46

Lecture 13 Video Coding H.264 / MPEG4 AVC

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P

EXPLORING ON STEGANOGRAPHY FOR LOW BIT RATE WAVELET BASED CODER IN IMAGE RETRIEVAL SYSTEM

Compression; Error detection & correction

JPEG Modes of Operation. Nimrod Peleg Dec. 2005

MPEG-4 Advanced Audio Coding

MPEG-4 BSAC Technology

Video coding. Concepts and notations.

Mesh Based Interpolative Coding (MBIC)

Application of wavelet filtering to image compression

Error Protection of Wavelet Coded Images Using Residual Source Redundancy

Lecture 7: Audio Compression & Coding

Wavelet Transform (WT) & JPEG-2000

DRA AUDIO CODING STANDARD

Scalable to lossless audio compression based on perceptual set partitioning in hierarchical trees (PSPIHT)

Huffman Code Application. Lecture7: Huffman Code. A simple application of Huffman coding of image compression which would be :

STUDY AND IMPLEMENTATION OF VIDEO COMPRESSION STANDARDS (H.264/AVC, DIRAC)

Video Compression An Introduction

The Standardization process

Chapter 4: Audio Coding

Multimedia Communications. Transform Coding

CHAPTER 6 Audio compression in practice

JPEG 2000 compression

CHAPTER 2 LITERATURE REVIEW

THE TRANSFORM AND DATA COMPRESSION HANDBOOK

DCT Based, Lossy Still Image Compression

Data Compression. Audio compression

Part 1 of 4. MARCH

Video Compression MPEG-4. Market s requirements for Video compression standard

Introduction to Video Compression

Lecture 16 Perceptual Audio Coding

ISO/IEC INTERNATIONAL STANDARD

Principles of Audio Coding

Knowledge libraries and information space

JPEG Descrizione ed applicazioni. Arcangelo Bruna. Advanced System Technology

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

Engineering Mathematics II Lecture 16 Compression

Transcription:

University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2005 Embedded lossless audio coding using linear prediction and cascade coding Kevin Adistambha University of Wollongong Recommended Citation Adistambha, Kevin, Embedded lossless audio coding using linear prediction and cascade coding, M.Eng. thesis, School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, 2005. http://ro.uow.edu.au/theses/498 Research Online is the open access institutional repository for the University of Wollongong. For further information contact the UOW Library: research-pubs@uow.edu.au

Embedded Lossless Audio Coding using Linear Prediction and Cascade coding A thesis submitted in fulfillment of the requirements for the award of the degree Master of Engineering Research From The University of Wollongong By Kevin Adistambha Bachelor of Science, Master of Internet Technology School of Electrical, Computer and Telecommunications Engineering 2005

Abstract This thesis studies the techniques and feasibility of embedding a perceptual audio coder within a lossless compression scheme. The goal is to provide for two step scalability in the resulting bitstream, where both a perceptual version of the audio signal and a lossless version of the same signal are provided in the one bitstream. The focus of this thesis is the selection of the perceptual coder to be used as the perceptual base layer and the techniques to be used to compress the lossless layer by using backward linear prediction followed by entropy coding. The perceptual base layer used is MPEG-4 AAC, chosen based on entropy measurements of the residual signal. Results of the work in this thesis show that the embedded lossless coding scheme could achieve an average compression ratio of only 6% larger compared to lossless only coding. Performing decorrelation on the AAC residual signal by means of backward linear predictive coding and measuring the entropy of the resulting LPC residual signal of various orders revealed that an 8% decrease in coding rate is achievable using 15 th order prediction. Furthermore, this thesis also investigates an entropy coding technique known as cascade coding which is originally designed to compress hydroacoustic image data and is modified to compress audio data. Cascade coding is an entropy coding technique that uses multiple cascaded stages where each stage codes a specific range of integers and is used to perform entropy coding of the backward linear prediction residual signal. The cascade coding technique explored in this thesis includes using a frame based approach and trained codebooks. 2

Statement of Originality This is to certify that the work described in this thesis is entirely my own, except where due reference is made in the text. No work in this thesis has been submitted for a degree to any other university or institution. Signed Kevin Adistambha 6 December 2005 3

Acknowledgements I would like to thank my supervisors Dr. Christian Ritz, Dr. Jason Lukasiak and Assoc. Prof. Ian Burnett. Their support and patience made this work possible. My never ending questions and problems don t seem to deter them to help me finish this thesis. I would also like to thank my family and friends for their morale support and providing relief when the stress or boredom (or both) is too much for me to handle. Especially my parents, whose patience seems impossible to exhaust. Last but not least, thanks to Ibanez guitars, Marshall amplification and Line 6 for providing me with much needed distractions. 4

Table of Contents 1 Introduction...14 1.1 Digital representation of audio data...15 1.2 Perceptual coding...15 1.3 Lossless coding...16 1.3.1 Scalable lossless coding...16 1.3.2 Approaches to scalable to lossless audio coding...17 1.4 Motivation...19 1.5 Thesis outline...20 1.6 Contributions of this Thesis...20 2 Entropy and decorrelation of audio signals...22 2.1 Introduction...22 2.2 Signal entropy and coding theory...22 2.2.1 Entropy theory...22 2.2.2 Entropy coding methods...23 2.2.3 Entropy coding techniques comparison...32 2.3 Scalable entropy coding techniques...33 2.3.1 Fine grain scalability via bit plane coding...34 5

2.3.2 Embedded coding...35 2.4 Decorrelation techniques in lossless audio coding...36 2.4.1 Linear predictive coding...36 2.4.2 SHORTEN linear predictor...38 2.4.3 Non-linear prediction...39 2.4.4 Lossless audio decorrelation techniques comparison...40 2.5 Chapter summary...40 3 Perceptual base layer...42 3.1 Overview of the embedded lossless coder...42 3.2 Overview of the perceptual coder...43 3.2.1 MPEG-1 Layer III...44 3.2.2 MPEG-4 AAC...44 3.2.3 Ogg Vorbis...45 3.3 Description of the test procedure...46 3.4 Residual entropy, compression and SNR results...50 3.4.1 Entropy and compression of the residual...50 3.4.2 Signal to noise ratio...51 3.4.3 Total Lossless bitrate comparison...51 3.5 Comparison of embedded lossless coding methods and lossless only coding.54 3.6 Chapter summary...55 6

4 Decorrelation and entropy coding...56 4.1 Decorrelation of the AAC residual...56 4.1.1 Overview of decorrelation methods in lossless audio coding...56 4.1.2 Decorrelation results...61 4.1.3 Statistical characteristics of the AAC residual...62 4.2 Entropy coding...63 4.2.1 Overview of entropy coding methods...63 4.2.2 Entropy coding via Rice coding...63 4.2.3 Entropy coding via cascade coding...64 4.3 Cascade Codebook Design and Testing...70 4.3.1 Cascade coding codebook training algorithm...70 4.3.2 Training database...71 4.3.3 Cascade coder codebook training results...72 4.3.4 Test database coding result...75 4.4 Cascade Codebook Performance and Discussion...76 4.5 Chapter summary...79 5 Conclusion...81 5.1 Limitations of techniques...82 5.2 Future work...82 7

List of Figures Figure 1.1. Example of perceptual coding vs. objective coding SNR function....18 Figure 1.2. Block diagram of an embedded lossless coder....19 Figure 2.1. Example of Huffman coding...24 Figure 2.2. The n th stage of a cascade coder...31 Figure 2.3. Example of a cascade coder...32 Figure 2.4. General operation of the bit plane coding scheme...33 Figure 3.1. Block diagram of the embedded lossless coder....42 Figure 3.2. General block diagram of a perceptual coder....43 Figure 3.3. Entropy and compression of the residual signal using gzip and Monkey's Audio. Line A represents residual compressed with gzip, line B represents residual entropy and line C represents the residual compressed with Monkey s Audio....48 Figure 3.4. Signal-to-noise ratio and entropy of the signal....49 Figure 3.5. Total bitrate (perceptual layer + lossless layer ) comparison...53 Figure 4.1. Block diagram of the embedded lossless coder....56 Figure 4.2. The windowing function used in the backward LPC process...59 Figure 4.3. LPC gain plot for backward LPC of order 1-40 using Hamming window and Hybrid windows....60 8

Figure 4.4. SFM measurements for the LPC residual signal of order 1-40 using Hamming window and Hybrid window...60 Figure 4.5. One frame of an AAC residual signal histogram before and after order 15 LPC processing, compared to Laplacian p.d.f....62 Figure 4.6. 95% confidence interval plot for cascade coding, Rice coding and entropy values, using no LPC and LPC order 15 on the residual signal...67 Figure 4.7. Histogram plot of the absolute values of samples in the codebook 8 frames...74 Figure 4.8. 95% confidence interval plot of test database coding. Based on the data shown in Table 4.7....76 9

List of Tables Table 2.1. Example of LZ77 encoding...26 Table 2.2. Example of symbol probabilities and interval values for arithmetic coding...27 Table 2.3. Example of arithmetic coding....28 Table 3.1. Genre breakdown of test signals...46 Table 3.2. Comparison of total lossless bitrate and residual signal-to-noise ratio with residual compressed using Monkey's Audio...51 Table 3.3. Percentage of filesize increase of two embedded lossless coding methods....54 Table 4.1. Bits/sample comparison of cascade coding with and without frame adaptive method....65 Table 4.2. The 15 entry codebook for cascade coding, based on [24]....66 Table 4.3. Results of cascade and Rice coding with the entropy of the AAC residual signal, with and without LPC. Figures are in bits/sample...69 Table 4.4. Proposed cascade coder eight-entries codebook....71 Table 4.5. Result of the new proposed codebook training and testing...72 Table 4.6. Result of the codebook entry 8 training using the test portion of the Training database....72 Table 4.7. Test database coding result, arranged by signal...75 10

Table 4.8. Test database coding result, arranged by codebook entry...75 11

List of Abbreviations AAC AAZ BPGC BSAC EBCOT EZW IID JPEG LPC LSB LTP MAC MDCT MGE MPEG MSB PCM Advanced Audio Coding Advanced Audio Zip Bit Plane Golomb Code Bit Sliced Arithmetic Coding Embedded Block Coding with Optimize Truncation Embedded Zerotree Wavelet Independent and Identically Distributed Joint Photographic Experts Group Linear Predictive Coding Least Significant Bit Long Term Prediction Monkey s Audio Coder Modified Discrete Cosine Transform Multigrid Embedding Motion Picture Experts Group Most Significant Bit Pulse Code Modulation 12

PDF PNS SFM SMR SNR SPIHT TWINVQ Probability Density Function Perceptual Noise Substitution Spectral Flatness Measure Signal to Mask Ratio Signal to Noise Ratio Set Partitioning in Hierarchical Tree Transform-domain Weighted Interleave Vector Quantization 13