Fast Region-of-Interest Transcoding for JPEG 2000 Images

Similar documents
Scalable Video Streaming Based on JPEG2000 Transcoding with Adaptive Rate Control

Surveillance System with Mega-Pixel Scalable Transcoder

EFFICIENT METHODS FOR ENCODING REGIONS OF INTEREST IN THE UPCOMING JPEG2000 STILL IMAGE CODING STANDARD

A General Purpose Queue Architecture for an ATM Switch

SIGNAL COMPRESSION. 9. Lossy image compression: SPIHT and S+P

View Synthesis Prediction for Rate-Overhead Reduction in FTV

Multi-path Transport of FGS Video

Jyoti S. Pawadshetty*, Dr.J.W.Bakal** *(ME (IT)-II, PIIT New Panvel.) ** (Principal, SSJCOE Dombivali.)

JPEG Joint Photographic Experts Group ISO/IEC JTC1/SC29/WG1 Still image compression standard Features

Dynamic Window Decoding for LDPC Convolutional Codes in Low-Latency Optical Communications

Depth Estimation for View Synthesis in Multiview Video Coding

Fully Scalable Wavelet-Based Image Coding for Transmission Over Heterogeneous Networks

Secure Scalable Streaming and Secure Transcoding with JPEG-2000

signal-to-noise ratio (PSNR), 2

Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-domain

ERICSSON RESEARCH Media Lab. Signal Processing Laboratory Swiss Federal Institute of Technology, Lausanne

Next-Generation 3D Formats with Depth Map Support

Extensions of H.264/AVC for Multiview Video Compression

Point Cloud Attribute Compression using 3-D Intra Prediction and Shape-Adaptive Transforms

The Standardization process

Error resilience capabilities (cont d R=0.5 bit/pixel, ber=0.001

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /MFI.2006.

FPGA IMPLEMENTATION OF BIT PLANE ENTROPY ENCODER FOR 3 D DWT BASED VIDEO COMPRESSION

Toward Spatial Queries for Spatial Surveillance Tasks

EVALUATION OF EDCF MECHANISM FOR QoS IN IEEE WIRELESS NETWORKS

Fully Spatial and SNR Scalable, SPIHT-Based Image Coding for Transmission Over Heterogenous Networks

Efficient Dictionary Based Video Coding with Reduced Side Information

Digital Audio and Video in Industrial Systems

Module 6 STILL IMAGE COMPRESSION STANDARDS

Implication of variable code block size in JPEG 2000 and its VLSI implementation

Comparison of EBCOT Technique Using HAAR Wavelet and Hadamard Transform

Multi-Camera Calibration, Object Tracking and Query Generation

For layered video encoding, video sequence is encoded into a base layer bitstream and one (or more) enhancement layer bit-stream(s).

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

Modified SPIHT Image Coder For Wireless Communication

JPEG2000: The New Still Picture Compression Standard

Edge-Based Directional Fuzzy Filter for Compression Artifact Reduction in JPEG Images

JPEG 2000 A versatile image coding system for multimedia applications

Layered Self-Identifiable and Scalable Video Codec for Delivery to Heterogeneous Receivers

View Synthesis for Multiview Video Compression

Scalable Video Coding

Performance Comparison between DWT-based and DCT-based Encoders

Wavelet Transform (WT) & JPEG-2000

FAST AND EFFICIENT SPATIAL SCALABLE IMAGE COMPRESSION USING WAVELET LOWER TREES

Calculating the Distance Map for Binary Sampled Data

A Viewer for PostScript Documents

An embedded and efficient low-complexity hierarchical image coder

ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION

Low-Memory Packetized SPIHT Image Compression

Unit-level Optimization for SVC Extractor

COMPRESSED DOMAIN SPATIAL SCALING OF MPEG VIDEO BIT- STREAMS WITH COMPARATIVE EVALUATION

Performance Optimization of an MPEG-2 to MPEG-4 Video Transcoder

Progressive resolution coding of hyperspectral imagery featuring region of interest access

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Under My Finger: Human Factors in Pushing and Rotating Documents Across the Table

Exhaustive Generation and Visual Browsing for Radiation Patterns of Linear Array Antennas

JPEG 2000 compression

Browsing News and TAlk Video on a Consumer Electronics Platform Using face Detection

Coding the Wavelet Spatial Orientation Tree with Low Computational Complexity

JPIP Proxy Server for remote browsing of JPEG2000 images

Embedded Rate Scalable Wavelet-Based Image Coding Algorithm with RPSWS

REGION-BASED SPIHT CODING AND MULTIRESOLUTION DECODING OF IMAGE SEQUENCES

Proxy Caching for Video on Demand Systems in Multicasting Networks

Fully scalable texture coding of arbitrarily shaped video objects

ECE 634: Digital Video Systems Scalable coding: 3/23/17

The Best-Performance Digital Video Recorder JPEG2000 DVR V.S M-PEG & MPEG4(H.264)

JPEG 2000 Compression Standard-An Overview

Overview of Multiview Video Coding and Anti-Aliasing for 3D Displays

EXPLORING ON STEGANOGRAPHY FOR LOW BIT RATE WAVELET BASED CODER IN IMAGE RETRIEVAL SYSTEM

FRACTAL COMPRESSION USAGE FOR I FRAMES IN MPEG4 I MPEG4

Delivery Context in MPEG-21

Reduced Memory Multi-Layer Multi-Component Rate Allocation for JPEG2000

On Interactive Browsing of Large Images

Compression / Decompression Strategies for Large Volume Medical Imagery

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

JPEG 2000 vs. JPEG in MPEG Encoding

Adaptation of Scalable Video Coding to Packet Loss and its Performance Analysis

Decoding. Encoding. Recoding to sequential. Progressive parsing. Pixels DCT Coefficients Scans. JPEG Coded image. Recoded JPEG image. Start.

QueryLines: Approximate Query for Visual Browsing

MERL { A MITSUBISHI ELECTRIC RESEARCH LABORATORY. Empirical Testing of Algorithms for. Variable-Sized Label Placement.

A HYBRID DPCM-DCT AND RLE CODING FOR SATELLITE IMAGE COMPRESSION

Motion Estimation. Original. enhancement layers. Motion Compensation. Baselayer. Scan-Specific Entropy Coding. Prediction Error.

THE rapid development of remote sensing technology

BLIND MEASUREMENT OF BLOCKING ARTIFACTS IN IMAGES Zhou Wang, Alan C. Bovik, and Brian L. Evans. (

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

Low-Complexity, Near-Lossless Coding of Depth Maps from Kinect-Like Depth Cameras

Homogeneous Transcoding of HEVC for bit rate reduction

Compression of RADARSAT Data with Block Adaptive Wavelets Abstract: 1. Introduction

Module 1B: JPEG2000 Part 1. Standardization issues, Requirements, Comparisons. JPEG: Summary (1) Motivation new still image st dard (2)

System Modeling and Implementation of MPEG-4. Encoder under Fine-Granular-Scalability Framework

Flexible, highly scalable, object-based wavelet image compression algorithm for network applications

Real-time and smooth scalable video streaming system with bitstream extractor intellectual property implementation

AN MSE APPROACH FOR TRAINING AND CODING STEERED MIXTURES OF EXPERTS

Complexity Reduction Tools for MPEG-2 to H.264 Video Transcoding

Mixed Raster Content for Compound Image Compression

Blind Measurement of Blocking Artifact in Images

SURVEILLANCE VIDEO FOR MOBILE DEVICES

Multidimensional Transcoding for Adaptive Video Streaming

On the JPEG2000 Implementation on Different Computer Platforms

Transcription:

MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Fast Region-of-Interest Transcoding for JPEG 2000 Images Kong, H-S; Vetro, A.; Hata, T.; Kuwahara, N. TR2005-043 May 2005 Abstract This paper presents a newly developed region-on-interest () transcoding method for JPEG 2000 images, which can be used for many progressive image/video streaming applications. The proposed method takes advantage of the packet property provided by JPEG 2000 standard to elevate the priority of the. Baased on the user specified information and progressive parameters, the developed transcoder extracts the selected and background packets with the desired quality and suppresses the unselected packets to the low quality. Since the transcoder only performs re-organizing the packets in the original coded bitstream, it does not involve any re-encoding operation. Compared to other coding schemes in the literature, the proposed method has the advantages of lower complexity and faster processing. Therefore, it is very suitable for real-time application. IEEE International Symposium on Circuits and Systems (ISCAS) This work may not be copied or reproduced in whole or in part for any commercial purpose. Permission to copy in whole or in part without payment of fee is granted for nonprofit educational and research purposes provided that all such whole or partial copies include the following: a notice that such copying is by permission of Mitsubishi Electric Research Laboratories, Inc.; an acknowledgment of the authors and individual contributions to the work; and all applicable portions of the copyright notice. Copying, reproduction, or republishing for any other purpose shall require a license with payment of fee to Mitsubishi Electric Research Laboratories, Inc. All rights reserved. Copyright c Mitsubishi Electric Research Laboratories, Inc., 2005 201 Broadway, Cambridge, Massachusetts 02139

MERLCoverPageSide2

FAST REGION-OF-INTEREST TRANSCODING FOR JPEG 2000 IMAGES Hao-Song Kong 1, Anthony Vetro 1, Toshihiko Hata 2, Naoki Kuwahara 2 1 Mitsubishi Electric Research Labs 201 Broadway, Cambridge, MA 02139, USA hkong@merl.com avetro@merl.com 2 Mitsubishi Electric Corporation Advanced Technology R&D Center Amagasaki, Hyogo 661-8661, Japan Hata.Toshihiko@wrc.melco.co.jp Kuwahara.Naoki@wrc.melco.co.jp ABSTRACT This paper presents a newly developed region-of-interest () transcoding method for JPEG 2000 images, which can be used for many progressive image/video streaming applications. The proposed method takes advantage of the packet property provided by JPEG 2000 standard to elevate the priority of the. Based on the user specified information and progressive parameters, the developed transcoder extracts the selected and background packets with the desired quality and suppresses the unselected packets to the low quality. Since the transcoder only performs re-organizing the packets in the original coded bitstream, it does not involve any reencoding operation. Compared to other coding schemes in the literature, the proposed method has the advantages of lower complexity and faster processing. Therefore, it is very suitable for real-time applications. 1. INTRODUCTION With the growing popularity of mobile wireless communications, there are increasing demands for image and video transmission in various applications. Video surveillance is one of these applications, in which people want to ubiquitously access the stored contents on the server to retrieve important scenes and to remotely browse them quickly. However, due to the network bandwidth limitations, full frame rate and high resolution video transmission are often not allowed. Therefore, a region-ofinterest () transcoding scheme is proposed in the paper to achieve adaptive and scalable video transmission. This scheme is used to extract the important objects (s) from JPEG 2000 coded images, to keep the s with high quality and high resolution and to re-assign low bit rate to the rest of the image or even to truncate the rest of the image based on the bandwidth capacity or user needs. JPEG 2000 image coding standard [1] provides an excellent hierarchical bitstream structure. It guarantees that the image is compressed once and then the bitstream can be decompressed in many ways to randomly access and extract a sub-set of the bitstream in order to reconstruct the image with the required spatial resolution and quality. In this paper, a detailed method of extraction and bitstream re-organizing is presented. The method is based on the desired features supported in JPEG 2000 standard, such as spatial random access at varying degrees of granularity and degree of quality for each. Compared to other coding methods in the literature, the proposed method has the advantages of lower complexity and faster processing, as it only performs re-organization of the packets in the original coded bitstream without any reencoding. 2. REVIEW OF EXISTING CODING TECHIQUES Recently, much attention has been paid to the coding [2]-[6] since the functionality of is suitable for many applications in which certain parts of an image are more meaningful than the other parts of the image. A general method for coding [7] is to scale up the wavelet transformed coefficients of s so that the bits associated with s are more significant than the bits associated with the background (). During the image transmission, the s are streamed with higher priorities than the background in a progressive manner. However, depending on the scaling value, some bits of the coefficients may stay on the same bit planes of the background. Therefore, the shape information needs to be transmitted to the decoder to distinguish the s from the background. JPEG 2000 standard provides a max-shift algorithm [8] to support coding. The max-shift method separates

the from the background by means of scaling them into non-overlapping bit planes. A scaling value is chosen to be sufficiently large to ensure the minimum coefficient associated with the is larger than the maximum coefficient of the background. When the decoder receives the scaling value, it identifies the coefficient by their magnitude. Therefore, the advantage of the method is that it allows arbitrary shaped coding without explicitly transmitting the shape information of the to the decoder. However, this method increases coding overhead due to code blocks on the and background boundary being coded twice. The major drawback of the scaling based method, including max-shift method, is its static encoding. All s must be defined during the time of encoding. This is a serious problem in the following situations: 1. the information is not known until the time of decoding 2. the information is provided dynamically by other external sources. Thus, recently in [9] authors proposed a dynamic coding method, which is able to handle the dynamic information in the interactive environments. The principle of the method is to take advantage of the flexible Packet/Layer mechanism in the JPEG 2000 to rearrange the packet priority in each layer. Since the method follows the layer definition in the original bitstream, it is fully compatible with JPEG 2000 standard. The key technique in the method is its dynamic layer insert functionality. In each layer, the belonging packets are selected from the bitstream according to the information and remain in the same layer; all other packets from non- areas are up-shifted by one layer. In this way, each layer contains only image information from the preferred packets. The resulting non-staffed positions are filled with packets. Figure 1 illustrates the procedure of the dynamic layer insertion. However, the dynamic layer insertion causes the packet header to be re-encoded, since the up-shifted packet is missing the position tag. This, in turn, causes ratedistortion recalculation to generate the information required for the tag-tree encoding for the packet header, which is an undesirable feature for real-time image transmission applications. 3. PROPOSED METHOD In order to avoid re-encoding of the packet header and make the coding flexible and dynamic, we propose a packet delete (suppress) method in this paper. Instead of pushing the non- packets to the upper layers, the non- packets are deleted from the highest layer down to the Packet 1 Packet 2 Packet 3 Packet 4 Layer 2 Layer 1 Packet 1 Packet 2 Packet 3 Packet 4 Layer 3 Layer 2 Layer 1 Figure 1. (left) The original bitstream contains two layers, each layer has four packets; (right) on layer 1, packets 2 and 3 are selected as, all packets at position 1 and 4 are shifted up and the layer 3 is created for a new layer insertion, the resulting non-staffed positions are filled with packets. lowest layer according to the progressive parameters. The method makes full use of the packet property to effectively elevate the priority of the. This method is simple, easy to implement and performs the entire progressive streaming functionalities based on the information. Figure 2 is used as an example of an image coded with five quality layers to describe our proposed method. It assumes that there exists a selected within the image as shown in Figure 2a, without specifying priorities. The quality layer map for this image is illustrated in one-dimension (first X then Y) as shown in Figure 2b. The corresponding JPEG 2000 bitstream of the image is shown in Figure 2c with Layer-Resolution- Component-Position (LRCP) progression. Due to space constraints, only quality layer structure is shown, the resolution level and component structures are not depicted in the figure. This situation can be thought of as the image containing only one level and one component. O Y (a) JPEG 2000 bitstream Quality Layer X Layer 0 Layer 1 (c) (b) packet X or Y Packet order Figure 2. An image bitstream representation with LRCP order.

In the proposed method, the transcoding operation is configured with 2 progressive parameters, m and n. m is used to specify the quality of the background and n is used to specify the quality of. The two parameters must be set with the following restrictions: If no is selected 0 m the maximum quality else 0 m < n the maximum quality m=0 means the packet will be deleted or will be suppressed to the packet. The packet has a length of zero and only needs one byte for its packet header. The first bit in the packet header should be set to 0 to indicate the packet. According to the information and progressive parameters, the transcoder extracts the packet from the lowest quality layer to the quality layer set by n. The packets on the higher layers are suppressed to the packets. For the non- packets, if m is set to 0, all non- packets are deleted from the highest quality layer to the lowest quality layer. If m is set not equal to 0, the non- packets are suppressed from the highest quality layer to the lower quality layer set by m. The non- packets on the layers higher than the layer set by m are all deleted. An example for m=1 and n=3 for image in Figure 2 is illustrated in Figure 3. The packets are extracted from every layer except for layer 4. The non- packets from layer 0 and 1 are chosen to be rendered but all the packets from the upper layers are suppressed to the packets. Other combinations of the progressive parameters will be shown in the figures in the next section. From the example, it is known that the original bitstream can be tailored or re-organized arbitrarily based on different combinations of progressive parameters to meet the requirements of channel capacity and image quality. The major advantage of the proposed method is that the transcoding operation does not require any reencoding, even for the re-organized packet headers. All the packets keep their original tag positions during the transcoding, only those unselected packets are suppressed to the packets. Therefore, the proposed method performs very fast transcoding. It is very suitable for realtime applications. 4. TRANSCODING EXAMPLES The proposed method has been applied to many image sequences in a surveillance system. A typical test image Lena is used to present the progressive adaptation in this paper. It is assuming that the first requirement for transmission is (m,n) = (0,2). Figure 4a shows the extracted with low quality and no background information. Since most packets are suppressed to the packets, the image size is reduced from the original JPEG 2000 coded bitstream 64K to 4K. Figure 4b presents the with (m,n) = (0,5). The quality is improved greatly while the background is still suppressed for the sake of saving bits. In the Figure 4c, the with partial background information is shown. Figure 4d and 4e show the high quality with a gradually enhanced background. Quality Layer JPEG 2000 bitstream packet X or Y EPT EPT EPT EPT Layer 0 Layer 1 Layer 3 Layer 4 Layer 2 EPT EPT EPT EPT EPT EPT EPT EPT EPT Packet order Figure 3. An example of and background packet extraction and suppression during transcoding. 5. CONCLUSIONS In this paper, the existing coding methods are reviewed. Due to the complexities of the existing methods, a fast and simple transcoding method is proposed. The fast processing is achieved by re-organizing the original bitstream, keeping all the tag positions for every packet and avoiding any re-encoding. The transcoded image examples with various progressive parameters are given to demonstrate the effective progressive transmission. REFERENCES [1] ISO/IEC 15444-1, Information technology JPEG 2000 image coding system Part 1: Core coding system, 1 st Ed., 2000. [2] R. Grosbois, D. Santa-Cruz, and T. Ebrahimi, New approach to JPEG 2000 compliant region of interest coding, SPIE 46 th Annual Meeting, San Diego, July, 2001.

[3] A. Nguyen, V. Chandran, S. Sridharan, and R. Prandolini, interpretability performance assessment of JPE G 2000 and Part 1 compliant region of interest coding, IEEE Trans. Consumer Electronics., Vol. 49, No. 4, pp. 808-817, Nov. 2003. (d) m=1, n=5 (a) m=0, n=2 (e) m=2, n=5 Figure 4. Progressive transmission. (b) m=0, n=5 (c) m=0, partial m=2, n=5 [4] L. Liu, and G. Fan, A new JPEG 2000 region-of-interest image coding method: partial significant bitplanes shift, IEEE Signal Processing Letters., Vol. 10, No. 2, pp. 35-38, Feb. 2003. [5] K. Park, and h. Park, Region-of-interest coding based on set partitioning in hierarchical trees, IEEE Trans. Circuits Syst. Video Technol., Vol. 12, No. 2, pp. 106-113, Feb 2002. [6] Z. Wang, and A.C. Bovik, Bitplane-by-bitplane shift (BbBShift) A suggestion for JPEG 2000 region of interest coding, IEEE Signal Processing Letters, Vol. 9, pp. 160-162, May, 2002. [7] E. Atsumi and N. Farvardin, Lossy/lossless region-ofinterest image coding based on set partitioning in hierarchical trees, IEEE ICIP., pp. 87-91, Chicago, IL, Oct. 1998. [8] A. Skodras, C. Christopoulos, and T. Ebrahimi, The JPEG 2000 still image compression standard, IEEE Signal Processing Magazine, pp. 36-58, Sep, 2001. [9] R. Rosenbaum and H. Schumann, Flexible, dynamic and compliant region of interest coding in JPEG 2000, IEEE ICIP, pp. 101-104, Rochester, New York, Sep, 2002.