Local Predecimation with Range Index Communication Parallelization Strategy for Fractal Image Compression on a Cluster of Workstations

Similar documents
FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

Iterated Functions Systems and Fractal Coding

A Review of Image Compression Techniques

5.7. Fractal compression Overview

CHAPTER 4 FRACTAL IMAGE COMPRESSION

Roshni S. Khedgaonkar M.Tech Student Department of Computer Science and Engineering, YCCE, Nagpur, India

Genetic Algorithm based Fractal Image Compression

Contrast Prediction for Fractal Image Compression

Fractal Compression. Related Topic Report. Henry Xiao. Queen s University. Kingston, Ontario, Canada. April 2004

An Elevated Area Classification Scheme Situated on Regional Fractal Dimension Himanshu Tayagi Trinity College, Tublin, Ireland

Fractal Image Compression. Kyle Patel EENG510 Image Processing Final project

Fractal Image Coding (IFS) Nimrod Peleg Update: Mar. 2008

Fast Fractal Image Encoder

Performance Evaluations for Parallel Image Filter on Multi - Core Computer using Java Threads

Encoding Time in seconds. Encoding Time in seconds. PSNR in DB. Encoding Time for Mandrill Image. Encoding Time for Lena Image 70. Variance Partition

A New Approach to Fractal Image Compression Using DBSCAN

AN IMPROVED DOMAIN CLASSIFICATION SCHEME BASED ON LOCAL FRACTAL DIMENSION

Fractal Image Compression

Fast Fuzzy Clustering of Infrared Images. 2. brfcm

Image-Space-Parallel Direct Volume Rendering on a Cluster of PCs

A combined fractal and wavelet image compression approach

A Parallel Genetic Algorithm for Maximum Flow Problem

Fingerprint Image Compression

The performance of xed block size fractal coding schemes for this model were investigated by calculating the distortion for each member of an ensemble

Keywords hierarchic clustering, distance-determination, adaptation of quality threshold algorithm, depth-search, the best first search.

Extensions of One-Dimensional Gray-level Nonlinear Image Processing Filters to Three-Dimensional Color Space

Fractal Image Compression on a Pseudo Spiral Architecture

Mesh Based Interpolative Coding (MBIC)

Linear Quadtree Construction in Real Time *

DISTRIBUTED HIGH-SPEED COMPUTING OF MULTIMEDIA DATA

Fractal Coding. CS 6723 Image Processing Fall 2013

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

Image Segmentation Techniques

Code Transformation of DF-Expression between Bintree and Quadtree

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Image Segmentation. 1Jyoti Hazrati, 2Kavita Rawat, 3Khush Batra. Dronacharya College Of Engineering, Farrukhnagar, Haryana, India

Direction-Length Code (DLC) To Represent Binary Objects

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Hierarchical Minimum Spanning Trees for Lossy Image Set Compression

Scope and Sequence for the New Jersey Core Curriculum Content Standards

6.2 DATA DISTRIBUTION AND EXPERIMENT DETAILS

FAST FRACTAL IMAGE COMPRESSION

A COMPARISON OF WAVELET-BASED AND RIDGELET- BASED TEXTURE CLASSIFICATION OF TISSUES IN COMPUTED TOMOGRAPHY

Fractals and the Chaos Game

Scalability and Classifications

Faster Implementation of Canonical Minimum Redundancy Prefix Codes *

Parallel Architecture & Programing Models for Face Recognition

Distributed Computing: PVM, MPI, and MOSIX. Multiple Processor Systems. Dr. Shaaban. Judd E.N. Jenne

K-Means Clustering Using Localized Histogram Analysis

Optimization of Bit Rate in Medical Image Compression

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

TRAFFIC SIMULATION USING MULTI-CORE COMPUTERS. CMPE-655 Adelia Wong, Computer Engineering Dept Balaji Salunkhe, Electrical Engineering Dept

Optimizing the use of the Hard Disk in MapReduce Frameworks for Multi-core Architectures*

Parallel Algorithms for the Third Extension of the Sieve of Eratosthenes. Todd A. Whittaker Ohio State University

A Novel Field-source Reverse Transform for Image Structure Representation and Analysis

Efficient Cluster Based Data Collection Using Mobile Data Collector for Wireless Sensor Network

Human Body Shape Deformation from. Front and Side Images

Abstract A SCALABLE, PARALLEL, AND RECONFIGURABLE DATAPATH ARCHITECTURE

Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

Evaluation of Parallel Programs by Measurement of Its Granularity

Deduction and Logic Implementation of the Fractal Scan Algorithm

Edge Detection for Dental X-ray Image Segmentation using Neural Network approach

SIGGRAPH `92 Course Notes. Yuval Fisher. Visiting the Department of Mathematics. Technion Israel Institute of Technology. from

A Binary Redundant Scalar Point Multiplication in Secure Elliptic Curve Cryptosystems

Image Compression - An Overview Jagroop Singh 1

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Model-based segmentation and recognition from range data

Journal of Asian Scientific Research AN APPLICATION OF STEREO MATCHING ALGORITHM FOR WASTE BIN LEVEL ESTIMATION

Honours Project Proposal. Luke Ross Supervisor: Dr. Karen Bradshaw Department of Computer Science, Rhodes University

Fractals and L- Systems

Data Hiding in Video

Multimaterial Geometric Design Theories and their Applications

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

Codebook generation for Image Compression with Simple and Ordain GA

Hei nz-ottopeitgen. Hartmut Jürgens Dietmar Sau pe. Chaos and Fractals. New Frontiers of Science

Sobel Edge Detection Algorithm

Digital Signal Processing

Copyright Notice. Springer papers: Springer. Pre-prints are provided only for personal use. The final publication is available at link.springer.

Efficiency of k-means and K-Medoids Algorithms for Clustering Arbitrary Data Points

Fractal Dimension and the Cantor Set

International Journal of Wavelets, Multiresolution and Information Processing c World Scientific Publishing Company

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

A Parallel Evolutionary Algorithm for Discovery of Decision Rules

A Quantitative Approach for Textural Image Segmentation with Median Filter

Analyzing and Improving Load Balancing Algorithm of MooseFS

Applying Catastrophe Theory to Image Segmentation

SSD Garbage Collection Detection and Management with Machine Learning Algorithm 1

1 Introduction. 2 Iterative-Deepening A* 3 Test domains

Technical Report. Performance Analysis for Parallel R Programs: Towards Efficient Resource Utilization. Helena Kotthaus, Ingo Korb, Peter Marwedel

PERFECT FOLDING OF THE PLANE

A NEW PROOF OF THE SMOOTHNESS OF 4-POINT DESLAURIERS-DUBUC SCHEME

Hybrid image coding based on partial fractal mapping

Functional Fractal Image Compression

Optimized Progressive Coding of Stereo Images Using Discrete Wavelet Transform

SENSOR SELECTION SCHEME IN TEMPERATURE WIRELESS SENSOR NETWORK

FPGA Implementation of a Nonlinear Two Dimensional Fuzzy Filter

Hardware Architecture for Fractal Image Encoder with Quadtree Partitioning

A Comparative Study on Exact Triangle Counting Algorithms on the GPU

AN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING

RGB Digital Image Forgery Detection Using Singular Value Decomposition and One Dimensional Cellular Automata

Transcription:

The International Arab Journal of Information Technology, Vol. 6, No. 3, July 2009 293 Local Predecimation with Range Index Communication Parallelization Strategy for Fractal Image Compression on a Cluster of Workstations Syed Hussain 1, Kalim Qureshi 2, Mohammad Al-Mullah 2, and Haroon Rashid 1 1 Department of Computer Science, Comsats Institute of IT, Pakistan 2 Faculty of Math and Computer Science, Kuwait University, Kuwait Abstract: In this paper, we have implemented and evaluated the performance of local predecimation with range index communication parallelization strategy for fractal image compression on a beowulf cluster of workstations. The strategy effectively balances the load among workstations. We have evaluated the execution time of LPRI, varying the number of workstations and user-specified root mean square error. We have also reported the measured speedup and worker idle time of LPRI parallelization. Keywords: Load balancing, task partitioning, parallelization, fractal image compression. Received October 25, 2007; accepted March 5, 2008 1. Introduction A picture is worth hundred lines. Multimedia data, especially images contain much more information than traditional text files. With the emergence of novel applications such as telemedicine, teleconferencing and remote sensing, the need for efficient storage and fast communication of images is multifold. Image compression is an answer to this ever-increasing need. Images have much more room for compression because of two reasons namely; massive redundancy that they contain and susceptibility of human vision system to interpret images with minor data. Fractal Image Compression (FIC) [1, 2, 3, 7, and 10] is a lossy image encoding scheme, which uses the self-similarity existing in images to compress them. Fractal is a geometrical shape which contains self similarity; more formally, a set for which the Hausdorf dimension strictly exceeds the topological dimension [3]. Figure 1(a) shows a snowflake image, which contains self similarity. A fractal is a geometric shape that is divisible in parts, such that each part is a reduced copy of the original image. This property provides immense opportunity of reduction in data. Strictly speaking, a set of contractive transforms is repeatedly applied to any image to obtain the unique attractor formally known as Fixed Point Theorem [3]. Mathematically, this transform represented in equation 1 [6]. x a i wi = y ci bi x + d i y e f i i where a i, b i, c i, and d i represent scaling, skew and rotation while e i and f i represent the translation of the image. (a) Snow flake fractal. (b) The fractal transform. Figure 1. An example of fractal images. As shown in Figure 1(b), the transform does three things; it scales the image to half, generates three copies of it and translates them in a triangle. Applying this transform on any input results the Sierpinsky triangle [3]. A transform is contractive, if it brings the points of input image closer after any iteration [6]. FIC is an asymmetric compression technique, with encoding phase taking much more time then decoding phase. To reduce the encoding time, research has leapt forward in two directions; classification techniques [4] and use of high performance distributed computing. Given the independent ranges to be searched, the FIC is a suitable candidate as a data parallel application for parallelization [6, 9]. Historically there have been

294 The International Arab Journal of Information Technology, Vol. 6, No. 3, July 2009 efforts to parallelize FIC [5, 6, 9], but none of them focused on implementation of parallel FIC on low-cost We have implemented Local Predecimation with Range Index (LPRI) communication parallelization strategy for encoding phase of FIC on homogeneous cluster of workstations running Linux operating system with Message Passing Interface (MPI) [8]. We have evaluated the performance of LPRI parallelization strategy measuring execution time, speedup, and worker idle time cost. 2. Adaptive Quadtree Partitioning Algorithm 2.1. Serial Algorithm Grey-scale images are three dimensional matrices; two spatial dimensions and one dimension to represent the grey level intensity at each pixel specified by a pair of spatial coordinates. For these images, domain pixel intensities must also be transformed making the transform equation 1 becomes [6]: x ai wi = y = ci z 0 bi 0 x ei di 0 y + fi 0 s i z oi (2) where o i controls brightness and s i controls contrast of coded image. The input image is partitioned into non-overlapping range blocks by iteratively splitting an image into its four quadrants. These partitions are stored as ranges with depth 1 in the range pool. Now each depth n range from range pool is further divided into four depth n+1 ranges. A range is not further split when its depth reaches minimum-partition threshold. Now ranges are taken from range pool, to match it a suitable domain is searched within the entire image. The size of domain must be double the size of range to ensure that the transform is contractive. The domains may be overlapping and are selected on a k-distance lattice [3]. To map a domain to a range one of eight isometries [5] namely; four rotations, flip and four further rotations are tried. Suitable values of the transform parameters as shown in (equation 2) are also searched for each domain-range pair. Domain-Range mapping is judged by calculating Root Mean Square (RMS) error metric calculated as equation 3 [3] suggests: RMS = 1 2 N N N i= 1 j= 1 ( r( i, j ) d ( i, 2 j ) (3) where r(i,j) and d(i,j) are ranges and domains from the original N N image. If for a given range the domain with RMS values less than a predefined tolerance factor known as Collage Error (CE) [3], the domain is selected as the best match, the range is multi-computer cluster of workstations. removed from the range-pool and the calculated transform parameters are saved. If any domain is found for a range, it is said to be covered. If no domain can be found for a particular range, the range is further split into four quadrants; the depth of quadrants is set to be one more than the depth of original range. These quadrants are then added to the range-pool. The guard condition for stopping range split is maximumpartition threshold.we have parallelized the Adaptive Quadtree Algorithm [3, 5] for FIC, by applying LPRI strategy. 2.2. Local Predecimation with Range Index Communication LPRI Parallelization MPI task-farm paradigm [8] is used to implement our strategy. In task-farm approach, the master has a pool of tasks (called task-farm) and a pool of workers. In our strategy the master node does the initialization chores, splits the input image into ranges with depth equal to minimum partition. The master nodes then takes range from its range-pool one by one, assigns it to any worker from pool of workers. Worker encodes the range, returns the transform parameters to master and waits in worker pool for next range. Size of domains in fixed double the size of ranges in each spatial dimension. That is there are four domain pixels for one range pixel, hence the domain pixels need to be averaged before comparison to range pixels. This process is formally called as the predecimation [5]; this process is usually done before any domainrange comparisons so that each domain is predecimated once and only once. In our strategy, the predcimation of domains is done locally for each worker. Pseudocode of Figure 2, shows the detailed functionality of our strategy. 3. Experiment Results and Discussion We have implemented LPRI on Message Passing Interface [8] task-farm approach using ANSI C programming language.the platform of our experiment was homogenous cluster of workstation with 32 PII- 333 nodes having 96MB memory, running Linux 2.4, kernel connected with 100Mbps Fast Ethernet. We have conducted our experiment on 256 256 Lena image. Figure 3(a) shows the execution time of LPRI parallelization, changing user-specified RMS values. Lesser the RMS tolerance factor is, the algorithm looks for a closer domain-range match, which consequently increases the execution time. As RMS is increased, looser matches can also be used, and execution time decreases. The curve becomes steep as RMS is increased beyond 10.0 because domain-range matches are more or less same.speedup [8] a metric used to measure the performance gain of parallel

Local Predecimation with Range Index Communication Parallelization Strategy for Fractal 295 implementation as opposed to serial implementation calculated as: T faste speedup= (4) T pdp where T fastest is the execution time of serial algorithm on a single fastest machine, while T pdp is the execution time of LPRI parallelization strategy. Figure 3(b) reveals the speedup by increasing the workers. As the workers are increased beyond 12, the speedup curve is steep because number of return and reply communication messages are increased, consequently increasing the communication overhead. Overhead time O i, which includes communication data access to initiate a new task on worker i is calculated according to equation 5. O i = (Worker s new task starting time - Worker s old task completion time) (5) (5) Workers idle time cost = n W (6) i O j j Figure 3(c) shows worker idle time cost which calculated by using equation 6. 6.00 4.00 2.00 0.00 600 400 200 2 4 6 8 10 12 14 16 0 No of workers (b) LPRI speedup vs no workers. 2 4 6 8 10 12 14 16 (c) LPRI worker idle time cost vs no of workers. 4. Conclusion Figure 3. LPRI results. We have implemented and evaluated Local Predecimation with Range Index communication LPRI parallelization strategy for fractal image compression on a cluster of workstations. Our results reveal speedup gain up to a multiple of 5 on serial FIC. Worker idle time cost due to communication is on average 300 milliseconds. It can be reduced by applying an enhanced load balancing strategy in future. Acknowledgment I would like to thank COMSATS University Campus, Abbottabad, Pakistan for providing the excellent research environment. I am grateful to my MS supervisor Dr. Kalim Qureshi and co-supervisor Dr. Haroon for their guidelines and supports. I would like to thank Dr. Mohammand Al-Mullah for his suggestions. Figure 2. Communication LPRI. 300 200 100 0 2.0 4.0 6.0 8.0 10.0 12.0 14.0 16.0 (a) LPRI execution time vs user specified RMS. References [1] Chang-man X. and Zhao-yang Z., A Fast Fractal Image Compression Coding Method, Journal of Shanghai University (English Edition), vol. 5, no. 1, pp 57-59, 2002. [2] Dasgupta D., Hernandez G., and Niño F., An Evolutionary Algorithm for Fractal Coding of Binary Images, IEEE Computer Journal of Transactions on Evolutionary Computation, vol. 4, no. 2, pp. 172-181, 2004. [3] Fisher Y., Fractal Image Compression: Theory and Application, Springer-Verlag, NewYork, 2000.

296 The International Arab Journal of Information Technology, Vol. 6, No. 3, July 2009 [4] Fisher Y., Jacobs E., and Boss R., Fractal Image Compression Using Iterated Transforms, Technical Report 1408, NOSC, SanDiego, CA, 2000. [5] Hämmerle J. and Hl A., Fractal Image Compression on MIMD Architecture: Basic Algorithms, Computer Journal Parallel Algorithms and Applications, vol. 11, no. 34, pp. 187-204, 2004. [6] Jackson D. and Tinney G., Performance Analysis of Distributed Implementations of a Fractal Image Compression Algorithm, Computer Journal of Concurrency Practice and Experience, vol. 8, no. 5, pp 357-386, 2000. [7] Ong G., Chew C., and Cao Y., A Simple Partitioning Approach to Fractal Image Compression, in Proceedings of ACM Symposium on Applied Computing, New York, pp. 301-305, 2004. [8] Wilkinson B. and Michael Allen., Parallel Programming Techniques and Applications Using Networked Workstations and Parallel Computers, Prentice Hall, New Jersey, 2006. [9] Wu P., Distributed Fractal Image Compression on PVM for Million-Pixel Images, International Conference on Information Networking, Japan, pp. 393-398, 2004. [10] Zhao E. and Liu D., Fractal Image Compression Methods: A Review, in Proceedings of Information Technology and Applications, USA, pp. 756-759, 2006. [11] Hasan Y., Hassan M., and Ridley M., Incremental Transitivity Applied to Cluster Retrieval, in International Arab Journal of Information Technology, vol. 5, no. 3, pp. 311-319, 2008. Kalim Qureshi is a member of Math and Computer Science Department in Kuwait University. He is an approved supervisor for MS and PhD thesis by high education commission, Islamabad, Pakistan. His research interests include network parallel distributed computing, thread programming, concurrent algorithms designing, task scheduling, and performance measurement. He is a member of IEE Japan and IEEE Computer Society. Mohammad Al-Mullah is an associate professor in Math and Computer Science Department, Kuwait University. He did his MS and PhD from McGill University, Canada. His research interests include artificial Intelligence, programming languages, and parallel and concurrent systems. Haroon Rashid is director and faculty member of the Department of Computer Science at COMSATS Institute of Information Technology, Abbattabad Campus, Pakistan. His research interests include parallel computing, distributed systems, high speed networks, multimedia, and network performance optimization. Syed Hussain received his BS in computer science from COMSATS Institute of Information Technology CIIT, Abbottabad, Pakistan. He did his MS in Computer Science from CIIT. Currently, he is perusing his PhD at Carl Duisberg Cnetrun, Dortmund, Germany. His research interests include parallel and distributed computing.

Local Predecimation with Range Index Communication Parallelization Strategy for Fractal 297

298 The International Arab Journal of Information Technology, Vol. 6, No. 3, July 2009