Module 6 STILL IMAGE COMPRESSION STANDARDS

Similar documents
Journal of World s Electrical Engineering and Technology J. World. Elect. Eng. Tech. 1(1): 12-16, 2012

A Hybrid DWT-SVD Image-Coding System (HDWTSVD) for Color Images

Topic -3 Image Enhancement

A Memory Efficient Array Architecture for Real-Time Motion Estimation

Image Enhancement in the Spatial Domain. Spatial Domain

Towards Adaptive Information Merging Using Selected XML Fragments

A Novel Automatic White Balance Method For Digital Still Cameras

Color Correction Using 3D Multiview Geometry

A New and Efficient 2D Collision Detection Method Based on Contact Theory Xiaolong CHENG, Jun XIAO a, Ying WANG, Qinghai MIAO, Jian XUE

IP Multicast Simulation in OPNET

Detection and Recognition of Alert Traffic Signs

(a, b) x y r. For this problem, is a point in the - coordinate plane and is a positive number.

Optical Flow for Large Motion Using Gradient Technique

An Unsupervised Segmentation Framework For Texture Image Queries

View Synthesis using Depth Map for 3D Video

Multiview plus depth video coding with temporal prediction view synthesis

Hybrid Fractal Video Coding With Neighbourhood Vector Quantisation

A VECTOR PERTURBATION APPROACH TO THE GENERALIZED AIRCRAFT SPARE PARTS GROUPING PROBLEM

A Two-stage and Parameter-free Binarization Method for Degraded Document Images

Hierarchically Clustered P2P Streaming System

AN ANALYSIS OF COORDINATED AND NON-COORDINATED MEDIUM ACCESS CONTROL PROTOCOLS UNDER CHANNEL NOISE

An Extension to the Local Binary Patterns for Image Retrieval

Segmentation of Casting Defects in X-Ray Images Based on Fractal Dimension

A ROI Focusing Mechanism for Digital Cameras

Lecture # 04. Image Enhancement in Spatial Domain

Positioning of a robot based on binocular vision for hand / foot fusion Long Han

The International Conference in Knowledge Management (CIKM'94), Gaithersburg, MD, November 1994.

All lengths in meters. E = = 7800 kg/m 3

Controlled Information Maximization for SOM Knowledge Induced Learning

Communication vs Distributed Computation: an alternative trade-off curve

Cryptanalysis of Hwang-Chang s a Time-Stamp Protocol for Digital Watermarking

Accelerating Storage with RDMA Max Gurtovoy Mellanox Technologies

A Shape-preserving Affine Takagi-Sugeno Model Based on a Piecewise Constant Nonuniform Fuzzification Transform

A Novel Image-Based Rendering System With A Longitudinally Aligned Camera Array

Conversion Functions for Symmetric Key Ciphers

SCALABLE ENERGY EFFICIENT AD-HOC ON DEMAND DISTANCE VECTOR (SEE-AODV) ROUTING PROTOCOL IN WIRELESS MESH NETWORKS

A Recommender System for Online Personalization in the WUM Applications

5 4 THE BERNOULLI EQUATION

XFVHDL: A Tool for the Synthesis of Fuzzy Logic Controllers

High Performance Computing on GPU for Electromagnetic Logging

Input Layer f = 2 f = 0 f = f = 3 1,16 1,1 1,2 1,3 2, ,2 3,3 3,16. f = 1. f = Output Layer

Point-Biserial Correlation Analysis of Fuzzy Attributes

An Improved Resource Reservation Protocol

Modelling, simulation, and performance analysis of a CAN FD system with SAE benchmark based message set

Improvement of First-order Takagi-Sugeno Models Using Local Uniform B-splines 1

JPEG 2000 Wireless Image Transmission System using Encryption Domain Authentication

A New Finite Word-length Optimization Method Design for LDPC Decoder

COSC 6385 Computer Architecture. - Pipelining

Cardiac C-Arm CT. SNR Enhancement by Combining Multiple Retrospectively Motion Corrected FDK-Like Reconstructions

HISTOGRAMS are an important statistic reflecting the

WIRELESS sensor networks (WSNs), which are capable

On the Forwarding Area of Contention-Based Geographic Forwarding for Ad Hoc and Sensor Networks

Clustering Interval-valued Data Using an Overlapped Interval Divergence

Multi-azimuth Prestack Time Migration for General Anisotropic, Weakly Heterogeneous Media - Field Data Examples

Lifetime and Energy Hole Evolution Analysis in Data-Gathering Wireless Sensor Networks

Introduction to Medical Imaging. Cone-Beam CT. Introduction. Available cone-beam reconstruction methods: Our discussion:

An Efficient Handover Mechanism Using the General Switch Management Protocol on a Multi-Protocol Label Switching Network

Parallel processing model for XML parsing

COMPARISON OF CHIRP SCALING AND WAVENUMBER DOMAIN ALGORITHMS FOR AIRBORNE LOW FREQUENCY SAR DATA PROCESSING

A Full-mode FME VLSI Architecture Based on 8x8/4x4 Adaptive Hadamard Transform For QFHD H.264/AVC Encoder

A Haptic-Assisted Guidance System for Navigating Volumetric Data Sets

Color Interpolation for Single CCD Color Camera

Binary Dissection: Variants & Applications

Introduction To Pipelining. Chapter Pipelining1 1

High performance CUDA based CNN image processor

On using circuit-switched networks for file transfers

dc - Linux Command Dc may be invoked with the following command-line options: -V --version Print out the version of dc

TESSELLATIONS. This is a sample (draft) chapter from: MATHEMATICAL OUTPOURINGS. Newsletters and Musings from the St. Mark s Institute of Mathematics

Configuring RSVP-ATM QoS Interworking

Dynamic Processor Scheduling with Client Resources for Fast Multi-resolution WWW Image Browsing

Multidimensional Testing

Prioritized Traffic Recovery over GMPLS Networks

Computer Architecture. Pipelining and Instruction Level Parallelism An Introduction. Outline of This Lecture

THE THETA BLOCKCHAIN

a Not yet implemented in current version SPARK: Research Kit Pointer Analysis Parameters Soot Pointer analysis. Objectives

ANALYTIC PERFORMANCE MODELS FOR SINGLE CLASS AND MULTIPLE CLASS MULTITHREADED SOFTWARE SERVERS

t [ Background removed

UCB CS61C : Machine Structures

Quality-of-Content (QoC)-Driven Rate Allocation for Video Analysis in Mobile Surveillance Networks

Combinatorial Mobile IP: A New Efficient Mobility Management Using Minimized Paging and Local Registration in Mobile IP Environments

Frequency Domain Approach for Face Recognition Using Optical Vanderlugt Filters

i-pcgrid Workshop 2016 April 1 st 2016 San Francisco, CA

Embeddings into Crossed Cubes

= dv 3V (r + a 1) 3 r 3 f(r) = 1. = ( (r + r 2

Augmented Reality. Integrating Computer Graphics with Computer Vision Mihran Tuceryan. August 16, 1998 ICPR 98 1

And Ph.D. Candidate of Computer Science, University of Putra Malaysia 2 Faculty of Computer Science and Information Technology,

Dense pointclouds from combined nadir and oblique imagery by object-based semi-global multi-image matching

Prof. Feng Liu. Fall /17/2016

Two-Dimensional Coding for Advanced Recording

Lecture 8 Introduction to Pipelines Adapated from slides by David Patterson

A Minutiae-based Fingerprint Matching Algorithm Using Phase Correlation

A General Characterization of Representing and Determining Fuzzy Spatial Relations

Shortest Paths for a Two-Robot Rendez-Vous

10/29/2010. Rendering techniques. Global Illumination. Local Illumination methods. Today : Global Illumination Modules and Methods

Lecture 27: Voronoi Diagrams

Quality Aware Privacy Protection for Location-based Services

Fast quality-guided flood-fill phase unwrapping algorithm for three-dimensional fringe pattern profilometry

Annales UMCS Informatica AI 2 (2004) UMCS

Assessment of Track Sequence Optimization based on Recorded Field Operations

EE 6900: Interconnection Networks for HPC Systems Fall 2016

Transcription:

Module 6 STILL IMAE COMPRESSION STANDARDS

Lesson 17 JPE-2000 Achitectue and Featues

Instuctional Objectives At the end of this lesson, the students should be able to: 1. State the shotcomings of JPE standad. 2. State the scope and objectives of JPE-2000. 3. Name the main application aeas of JPE-2000. 4. List the equiements of its majo applications. 5. Explain the majo building blocks of JPE-2000. 6. Define tiling and its significance. 7. Distinguish between evesible and ievesible wavelet tansfoms. 8. Pefom peiodic extension of signals. 9. Distinguish between convolution based filteing and lifting based filteing. 10. Show the elations fo evesible and ievesible component tansfomations. 11. Define pecincts and packets fo JPE-2000 bit steam. 17.0 Intoduction In lesson-16, we had discussed in details about the JPE standad. This was in fact the fist eve intenational standad on continuous tone still image compession and coding. Despite its vey impessive pefomance and the massive populaity it gained in stoage of still images and tansmission though intenet, some shotcomings wee felt as the technology futhe matued. Fist, JPE encoded images show sevee blocking atifacts at vey low bit ate. This poblem basically oiginates fom the use of block-based DCTs, which we had discussed ealie. Moeove, JPE povides insufficient suppot fo spatial and SNR (quality) scalability. JPE lacks object-based and egion-based epesentation and has no suppot fo eo esiliency. Subsequent to the acceptance of JPE as a standad, the technology petaining to image compession and coding matued futhe, especially in the field of wavelet tansfoms and subband coding, which we had discussed in lessons-10 to 15. The shotcomings of JPE, matuity of wavelet tansfoms and subband coding and polifeation of new application aeas pompted the intenational community to wok on a moe advanced still image coding standad, known as JPE-2000, whose wok was fist initiated in 1996.

In this lesson, we ae going to pesent the majo application aeas of JPE-2000 and thei equiements, the achitectue of the coding engine and the encoding concepts. JPE-2000 encoding is based on EBCOT wavelet coding technique, which we have aleady discussed in lesson-15. Hence, the focus of this lesson will not be on the coding algoithm, but moe on its achitectual aspects. Because of seveal added featues not found in ealie coding standads, impoved capabilities and vast futue potentials, we shall devote two moe lessons, viz. lesson-18 and lesson-19, in addition to the pesent one to discuss vaious aspects of JPE-2000 coding. 17.1 Scope and objectives of JPE-2000 The JPE-2000 standads wee intended fo use in diffeent types of still images, such as bi-level, gay-scale, colo (o multi-component) with diffeent chaacteistics, such as natual images, scientific, medical, emote sensing, text, endeed gaphics etc and on diffeent imaging models, such as client/seve, eal-time tansmission, image libay achival, limited buffe and bandwidth esouces etc all within a unified famewok. The objective was to make it 30% moe efficient than the baseline JPE. The standadization committee developed the Pat-I of JPE-2000 standad to make it oyalty fee, so as to pomote its wide usage. 17.2 Applications of JPE-2000 and thei equiements The JPE-2000 standads have seveal applications and moe ae emeging with time. Some of the majo application aeas ae: Intenet Colo facsimile Pinting Scanning Digital photogaphy Remote Sensing Mobile Medical imagey Digital libaies and achives E-commece Each application aea has some equiements which the standad should fulfill. The equiements fom diffeent application aeas may be summaized as follows:

Impoved low bit-ate pefomance: It should give acceptable quality below 0.25 bpp. Netwoked image delivey and emote sensing applications have this equiements. Lossless and lossy compession: Lossless compession should be included in pogessive decoding. It should suppot embedded bit-steam that allow pogessive lossy to lossless build-up. Pogessive tansmission: The standad should allow pogessive tansmission that allows images to be econstucted with inceasing pixel accuacy and esolution. Region of Inteest Coding: It should pefeentially allocate moe bits to the egions of inteest (ROIs) as compaed to the non-roi ones. Random code steam access: The use defined ROIs in an image should be andomly accessible. Eo esilience: The standad should povide eo esiliency, especially fo wieless communication channels. Open achitectue: It should suppot open achitectue to optimize the system fo diffeent image types and applications. The decode should have only the coe tool set and a pase that undestands the coe steam. If necessay, the unknown tools could be equested by the decode. Content based desciption: Finding the desied image fom a lage achive of images is a challenging task. This has applications in medical images, foensic, digital libaies etc. These issues ae being addessed by MPE-7. Image Secuity: This is anothe majo equiement to potect the intellectual popety ights of the images. Digital images can be potected using watemaking, labeling, stamping, encyption etc. Continuous-tone and bi-level image compession: Peviously, we had diffeent standads fo bi-level images (JBI standads) and continuoustome images (JPE standad). A equiement was felt that a single standad should addess both these domains, including the colo images and encompass applications involving compound images with ovelaying texts. Sequential one-pass decoding: The decoding should be done in a single sequential pass and suppot inteleaved and non-inteleaved mode.

Side-channel suppot: The standad should suppot alpha planes and tanspaency planes. All these equiements, oiginating fom diffeent application domains wee addessed and incopoated in the JPE-2000 standad. 17.3 Achitectue of JPE-2000 Fig. 17.1: Block Diagam of JPE-2000 encode. Fig.17.1 shows the block-diagam of JPE-2000 encode and fig.17.2 shows its coesponding decode. Fig. 17.2: Block Diagam of JPE-2000 decode The block diagams ae not significantly diffeent fom those of JPE, except fo the fact that the tansfom used is Discete Wavelet Tansfom (DWT), athe than the DCT. Befoe applying the DWT, the souce image is divided into components (colos) and each component is divided into tiles (to be descibed in Section-17.4) which ae compessed independently, as though they ae independent images and all the samples in the tile ae DC level shifted. The DWT coefficients in diffeent subbands ae quantized and then composed into an embedded bit-steam following the EBCOT algoithm, pesented in lesson-15. The embedded bit-steam is composed of quality layes so as to offe both esolution and SNR scalability. The entopy coding is based on a contextadaptive aithmetic code, known as MQ code, which has eo coection capability. The encoding pocess may be summaized as follows:

The souce image is decomposed into components, such as RB o YUV. The image and its components ae divided into ectangula tiles, which fom the basic unit to be encoded. DWT at diffeent esolution levels is applied on each tile to compose the subbands. The DWT may be evesible o ievesible, to be descibed in Section-17.5. The subband coefficients ae quantized and collected into ectangula aays of code blocks (descibed in lesson-15, Section-15.3). The bit-planes of the code-blocks ae aithmetic coded in multiple passes with factional bit-plane concept of EBCOT algoithm (efe lesson-15). The encoding is done in such a way that the Regions of Inteest (ROI) get highe quality, as compaed to the othe aeas (to be discussed in lesson- 18). Makes ae intoduced in the bit-steam fo eo esilience (to be discussed in lesson-19). 17.4 Tiling and its significance Befoe applying the DWT, the image and its components ae divided into smalle non-ovelapping blocks, known as tiles, which can be coded independently, as if each tile is an independent image. All opeations, such as component mixing, DWT, quantization and entopy coding ae theefoe done independently fo each tile. Tiling has the advantage of educing memoy equiements fo DWT and its pocessing and is amenable to paallelization. Moeove, tiles may be independently accessed and used fo decoding specific pats of the image, athe than the complete one. The tile may be as lage as the entie image size (that is, single tile) o of smalle patitions, such as 256x256, 128x128 etc. In tems of PSNR, tiling degades the pefomance, as compaed to no tiling and smalle tile sizes lead to tiling atifacts. 17.5 Wavelet Filtes The JPE-2000 standad suppots lossy as well as lossless encoding. Two types of wavelet filtes ae included in Pat-I of the standad

(a) Ievesible, whee exact econstuction will not be possible at the decode and is used fo lossy encoding. This is implemented using a 9/7 Daubechies filte, whose analysis and synthesis filte coefficients ae shown in Table-17.1. (b) Revesible, whee exact econstuction at the decode is possible and is theefoe included fo lossless JPE-2000. This is implemented using a 5/3 filte, whose analysis and synthesis filte coefficients ae shown in Table-17.2. Analysis Filte Coefficients i Low-pass filte h l ( i) High-pass filte h u ( i) 0 0.6029490182363579 1.115087052456994 ± 1 0.2668641184428723-0.5912717631142470 ± 2-0.07822326652898785-0.05754352622849957 ± 3-0.01686411844287495 0.09127176311424948 ± 4 0.02674875741080976 i Synthesis Filte Coefficients i i Low-pass filte h l ( ) High-pass filte ( i) 0 1.115087052456994 0.6029490182363579 ± 1 0.5912717631142470-0.2668641184428723 ± 2-0.05754352622849957-0.07822326652898785 ± 3-0.09127176311424948 0.01686411844287495 ± 4 0.02674875741080976 Table-17.1 Daubechies 9/7 analysis and synthesis filte coefficients Analysis Filte Coefficients Low-pass filte i h l () High-pass filte i h u ( ) g l ( ) h u Synthesis Filte Coefficients Low-pass filte High-pass filte i i 0 6/8 1 1 6/8 ± 1 2/8-1/2 1/2-2/8 ± 2-1/8-1/8 g u () Table-17.2 5/3 Analysis and synthesis filte coefficients

17.5.1 Peiodic Extension of Signals: Two filteing appoaches, namely convolution and lifting based (to be discussed in Section-17.5.1) filtes ae used in JPE-2000 standad. Both these appoaches fail at the signal boundaies, since coesponding signal samples ae not available fo all the filte coefficients. One can conside the non-existent samples as zeos, but this leads to unsatisfactoy esults because of signal discontinuities. A bette appoach is to extend the signal at both the boundaies by including the mio images of the samples about the boundaies. Fig. 17.3: Peiodic symmetic extension of 1-d signal S(0),S(1) S(8). Fig. 17.3 illustates the signal extension fo a one-dimensional example of nine samples: s( 0), s( 1 ), s( 2),, s( 8), whee s( 0) and s ( 8) seve as bounday samples. s 0 and to the ight of s 8 as The samples ae extended to the left of ( ) s s () i = s( i), i = 1, 2, ( 8 + i) = s( 8 i) i = 1,2,3, The extent to which the samples should be extended depends on the filte length. 17.5.2 Convolution and Lifting Based filtes: The convolution based filteing pefoms a dot poduct between the two filte masks (fo low-pass and high-pass filtes) and the extended 1-D signal. In lifting based filteing, the samples ae split into odd and even goups. The odd samples ae updated with a weighted sum of even sample values and the even samples ae updated with a weighted sum of odd sample values. Fo the evesible filtes (lossless case), the esults ae ounded to intege values. The lifting based filteing fo the 5/3 analysis filte is achieved as follows: y y ( 2n + 1) = x ( 2n + 1) ( 2n) = x ( 2n) ext ext + ( 2n) + x ( 2n 2) xext ext + 2 ( 2n 1) + y( 2n + 1) y + 4 2 ()

whee is the extended input signal and y is the output signal. x ext 17.6 Revesible and Ievesible Component Tansfomations JPE-2000 suppots two diffeent component tansfomations fo multicomponent images: Ievesible Component Tansfomation (ICT) used fo lossy coding only. It may only be used with the 9/7 ievesible wavelet tansfom. Revesible Component Tansfomation (RCT) used fo lossless as well as fo lossy coding. It may only be used with the 5/3 evesible wavelet tansfom. The fowad (RB to YUV) and the invese ICT (YUV to RB) tansfomations ae given by = B R V U Y 0.08131 0.41869 0.5 0.5 0.33126 0.16875 0.114 0.587 0.299 = V U Y B R 0 1.772 1.0 0.71414 0.34413 1.0 1.402 0 1.0 The fowad and the invese RCT tansfomations ae given by + + = B R B R V U Y 4 2 + + + = V U V U Y B R 4

The pefomance compaisons between lossless compession (RCT + 5/3 evesible filte) and lossy compession (ICT + 9/7 ievesible filte) at compaable bit ate indicates significantly bette pefomance fo the latte, as compaed to the fome. It may be noted that the sub-sampling of the chominance components, as shown in Section-16.4 of lesson-16 is not ecommended fo JPE-2000, since a 2:1 sub-sampling in hoizontal and vetical diections may be obtained by simply discading the HL, LH and HH subbands of a component s wavelet decomposition and etaining all othe subbands. 17.7 Pecincts and packets fo JPE-2000 bit-steam The aithmetically coded quantized wavelet coefficients ae aanged in packets and the packet patition locations ae efeed to as pecincts. Afte quantization, each subband is divided into some non-ovelapping ectangles. Thee spatially consistent ectangles - one fom each subband at each esolution level compise a packet patition location o pecinct, as illustated in fig.17.4. Each pecinct is futhe divided into code-blocks (discussed in Section-15.3 of lesson-15), which fom the input to the aithmetic code. The data epesenting a specific tile, laye (efe to lesson-15), component, esolution and pecinct appeas in the code steam in a contiguous segment, called a packet. The data in a packet is odeed accoding to the subbands LL, HL, LH and HH and within each subband, the codeblocks ae aanged in aste scanning ode, confined within the bounds of the coesponding pecinct.