Enhancing the Compression Ratio of the HCDC Text Compression Algorithm

Size: px
Start display at page:

Download "Enhancing the Compression Ratio of the HCDC Text Compression Algorithm"

Transcription

1 Enhancing the Compression Ratio of the HCDC Text Compression Algorithm Hussein Al-Bahadili and Ghassan F. Issa Faculty of Information Technology University of Petra Amman, Jordan Abstract The Hamming Codes based Data Compression (HCDC) algorithm is a lossless data compression algorithm based on the concept of the error-correction Hamming Codes. The HCDC algorithm has been used successfully in a number of text compression applications providing a competitive compression ratio to existing well-known text compression algorithms. This paper describes an enhanced version of the HCDC algorithm, which is referred to as E-HCDC algorithm. The E-HCDC utilizes a new encoding scheme called the m- encoding scheme, where m represents the maximum length for the pre-fix bits. The compression ratios achieved by the E- HCDC are calculated for a number of text files of various sizes from widely-used corpora, and it reveals an improvement of 15% to 20% over the original HCDC algorithm. Keywords- Text compression; Hamming Codes; HCDC algorithm; compression ratio. I. INTRODUCTION Data compression algorithms are developed to reduce the size of source data by substituting its original symbols with shorter symbols or by removing redundancy from the source data, so that it requires less disk space for storage, and less bandwidth and power consumption when transmitted over data communication channels [1]. Data compression consists of two main algorithms compression and decompression algorithms. The first comprises algorithmic transformations of a source data representation to produce compact representation, while the second comprises algorithmic transformations to restore an exact or an approximate representation of the source data [2]. As a result of that data compression algorithms are classified into two different styles based on the conformity of the decompressed data, these are lossless and lossy compression. Lossless algorithms restore an exact copy of the source data (i.e., the source and the decompressed files are identical), so they are used for text files, executable codes, database files compression. Lossless algorithms usually achieve a compression ratio ranging between 2 to 8. Examples of lossless algorithms include Huffman coding, Adaptive Huffman coding, Shannon-Fano Coding, Run Ahmed Imad Faculty of Information Technology Middle East University Amman, Jordan ahmed.emad@gmail.com Length Encoding, Arithmetic coding, and LZ77 and LZ78 families [1, 3]. Lossy algorithms cannot restore an exact copy of the source data and usually restore an approximate copy of the source data. This type of compression is used in applications where the source and the decompressed data are not necessary to be identical, such as multimedia streaming, video conferencing, VoIP applications, and MMS messaging. It achieves a data compression ratio up to 200, depending on the type of compressed data. A higher compression ratio can be achieved if more variations are allowed to be introduced in the decompressed data [1]. A novel lossless data compression algorithm based on the error correcting Hamming codes, namely, the Hamming Codes based Data Compression (HCDC) algorithm was developed by Al-Bahadili [4] and used successfully by a number of researchers for text and audio data compression applications [5-7]. In the HCDC algorithm, the source data is first converted to a binary sequence, which is divided into blocks of n-bit length. To utilize the Hamming codes, each block is considered as a Hamming codeword consisting of d data bits and p parity bits (where n=d+p, and p= ln(d)/ln(2) +1). According to Hamming codes, these codewords are categorized into 2 d valid codewords and 2 n -2 d non-valid codewords. For a valid codeword, only the d data bits preceded by 0 are written to the compressed file, while for a non-valid codeword all n bits preceded by 1 are written to the compressed file. These additional 0 and 1 bits are added to distinguish the valid and the non-valid blocks during the decompression process. For Standard English text compression, the HCDC algorithm was implemented as follows each character is treated as a 7-bit Hamming codeword having n=7, d=4, and p=3, the frequency of each character (f i, i=1 to N c, where N c is the number of character sets in the text file) is calculated and the characters are sorted from the most common character (MCC) to the least common character (LCC). Then, the 16 MCCs are considered as valid codewords and replaced with 5 bits instead of the original uncompressed 7- bit ( 0 followed by 4-bit representing the sequence number of the character 0 to 15), while the remaining characters are considered as a non-valid codewords replaced with b+1 bits 64

2 instead of the original uncompressed 7-bit ( 1 followed by b-bit representing the sequence number of the character 0 to N c-16, and b= ln(n c-16)/ln(2) ). It can be seen from the above discussion that the HCDC algorithm utilizes a simple encoding scheme for the pre-fix bit. This paper develops a new encoding scheme for the prefix bits, which is used to develop an enhanced version of the HCDC algorithm with higher compression ratio, namely, the E-HCDC algorithm. The new prefix encoding scheme is called the m-encoding scheme, where m represents the length of the longest pre-fix bits. The compression ratio of the E- HCDC algorithm is calculated and compared with the compression ratio of the original HCDC algorithm for a number of text files from Standard English corpora. This paper is divided into five sections. This section provides an introduction to the main theme of the paper, and also introduces the problems statement and the objectives of this research. The rest of this paper is organized as follows. Section II reviews some of the most recent and related work. The E-HCDC algorithm and its core component, the m- encoding scheme, are described in Section III. Section IV compares the compression ratios achieved by the HCDC and E-HCDC algorithms over a number of text files from Standard English Text Corpora. Finally, Section V presents the main conclusions of this research and also point-out future research direction. II. LITERATURE REVIEW A large number of data compression algorithms have been developed and used throughout the years. Some of which are of general use, i.e., can be used to compress files of different types (e.g., text files, image files, video files, etc.). Others are developed to compress efficiently particular types of files [1]. In this work, we concern with text compression, which has to be lossless so that exact form of the original data can be retrieved back. Al-Bahadili [5] developed a lossless bit-level data compression algorithm based on the error-correcting Hamming codes, and consequently it was referred to as the HCDC algorithm. The theoretical analysis of the algorithm indicates that the algorithm reserved a tremendous potential and as a result of that it has been used by many researchers in many applications. Al-Bahadili and Rababa a [5] investigated the performance of the HCDC algorithm for text compression application. They developed a scheme that utilizes the algorithm, which consists of six steps, some of which are applied repetitively to enhance the compression ratio of the algorithm, which is called the HCDC(k) scheme, where k refers to the number of repetition loops. The repetition loop continues until inflation is detected. The final compression ratio is a multiplication of the compression ratios of the individual loops. The results obtained for the HCDC(k) scheme demonstrated that the scheme has a higher compression ratio than most well-known text compression algorithms, and also exhibits a competitive performance with respect to many widely-used state-of-the-art compression tools. Al-Bahadili and Al-Saab used the HCDC algorithm in developing a novel Web search engine model, namely, the Compressed Index-Query (CIQ) model [6]. The model incorporates two compression layers both based on the HCDC algorithm and implemented at the back-end processor side of the Web search engine. The test results demonstrated that the new CIQ model achieves 100% agreement with current uncompressed Web search engine models, a compression ratio of more than 1.3, and a speed up factor of more than 1.3 providing a reduction in processing time of more than 24%. Amro et al. [7] used the HCDC algorithm for speech compression in Voice over Internet Protocol (VoIP) applications, in particular, they implemented the HCDC(k) algorithm with k=4 to achieve a significant compression ratio in the coefficient codebook without any scarification in the original Code-Excited Linear Prediction (CELP) signal quality since compression is lossless. Al-Bahadili & Hussain [8] developed a bit-level data compression algorithm in which the binary sequence is divided into blocks of n-bit length, which gives each block possible decimal values between 0 to 2 n -1. The decimal values of all blocks are calculated and if the number of different decimal values (d) is 256, then the binary sequence can be compressed using n-bit character wordlength. Otherwise, the process continues until finding a value of n that gives d 256. Thus, a compression ratio of approximately n/8 can be achieved. They referred to this algorithm as the Adaptive Character Wordlength (ACW) algorithm, since the compression ratio of the algorithm is a function of n, it is abbreviated as ACW(n) algorithm, which was used to develop a more efficient version called the ACW(n,s) algorithm, where s is the number of subsequences into which the original sequence is divided [9]. Khancome [10] proposed a full text compression algorithm, the source data is divided into suitable blocks; as well as, all characters in each block are assigned their positions. Afterwards, these positions are converted into the bit form. Empirically, the algorithm is efficient for the source data which takes several bytes per one character. The experimental results showed that the source data could be saved 11.50% in minimum; meanwhile, the maximum is 76.56%. Nicolae et al. [11] introduced a lossless nonreference-based compression algorithm named lossless FastQ compressor. They compared the algorithm with other state-of-the-art big data compression algorithms. The comparison reveals that the algorithm achieves better compression ratios. Nofal [12] proposed a bit-level files compression algorithm providing a maximum of 10% compression ratio. Jaradat et al. [13] proposed a file splitting technique for the reduction of the n th -order entropy of text files. Barr and Asanovic [14] studied the energy savings possible by lossless compressing data prior to transmission. Because 65

3 wireless transmission of a single bit can require over 1000 times more energy than a single 32-bit computation. The overall energy to send and receive data can be reduced by 11% compared with a well-chosen symmetric pair, or up to 57% over the default symmetric scheme. Irshid [15] proposed a very simple and efficient binary run-length compression algorithm, which is based on mapping the non-binary information source into an equivalent binary source using a new fixed-length code instead of the ASCII code. The codes are chosen such that the probability of one of the two binary symbols; say 0, at the output of the mapper is made as small as possible. Moreover, the "all 1 s" code is excluded from the code assignments table to ensure the presence of at least one 0 in each of the output codewords. Compression is achieved by encoding the number of 1 s between two consecutive 0 s using either a fixed-length or a variable-length code. When applying this simple encoding technique to English text files, they achieve a compression of 5.44 bpc and 4.6 bpc for the fixed-length code and the variable length (Huffman) code, respectively. III. THE ENHANCED HCDC ALGORITHM The main objective of this work is to increase the compression ratio of the HCDC algorithm [4]. To do so, a new pre-fix encoding scheme is developed, namely, the m- encoding scheme. In this scheme, different pre-fix bit combinations are used depending on the types of characters presented in the text file and frequencies of these characters. This is in replacement of the 0 / 1 pre-fix encoding scheme of the HCDC algorithm. The enhanced version of the HCDC algorithm is referred to as the Enhanced HCDC (E- HCDC) algorithm, which is expected to achieve higher compression than the original HCDC algorithm. A. The m-encoding Scheme In the m-encoding scheme, the characters frequencies are found first, and then the characters are sorted according to their frequencies from the MCC to the LCC. The characters are divided into groups of 16 characters each starting from the MCC. The last group may contain 16 or less characters depending on the number of characters sets in the source text. Since, the maximum number of characters for 7-bit character length is 128 characters; then the maximum number of groups is 8 numbered from G 1 to G 8. The characters for each group are numbered from 0 to 15 and assigned a binary code from 0000 to. For each group, we will assign different combination of pre-fix bits; e.g., the pre-fix for G 1 is 0 similar to the original HCDC. The pre-fix bit for the other groups are 10, 110, 1110, and so on until the last group which will have all 1 s. Thus, we will have 7 1 s for G 8. For each group we will add 4-bit representing the sequence number of the character within the group. The pre-fix bits and the numbers of bits for each group are summarized in Table I. It can be seen from Table I that we are reducing the number of bits written to the compressed data file to 6-bit and 7-bit for G 2 and G 3 in comparison to 8-bit for the HCDC algorithm. While it is kept unchanged with 8-bit for G 4, and start increasing by 1-bit for each of the remaining groups until it reaches 11 for G 8. The above discussion shows that more bits are required to represent the characters in the last four groups (G 5 to G 8), but, fortunately, the characters frequencies within these groups are small in comparison to that in the top 4 groups (G 1 to G 4). Furthermore, most of the time we have the characters span over not more than 6 groups, which means that the maximum number of bits required to represent the compressed characters is 9 bits, because the number of prefix bits for the last group is equal the number of bits for previous one but with pre-fix bits set to 1 s. This form of pre-fix encoding guarantees higher compression ratio than for the HCDC algorithm even for files with flat characters frequencies distribution. However, in this work, to improve the performance further, we introduce a modification to the above continuous encoding. TABLE I THE PRE-FIX BITS AND THE NUMBER OF BITS WRITTEN TO THE COMPRESSED FILE Group No. of Pre-fix No. of Compressed Bits Pre-fix bits (G) bits E-HCDC HCDC In the modified encoding, the number of pre-fix bits is limited to m-bit. Therefore, we call this pre-fix encoding as m-encoding. Before, we proceed with the description of this new encoding; let us define some parameters. The first one is the number of groups (G), which is calculated as G = N c / 2 d (1) Where G is the number of groups, N c is the number of characters sets, and d is the number of data bits in Hamming codeword. Second, the number of characters within the last group L g, which can be calculated as L g = N c 2 d (G-1) (2) Third, the number of bit (b) required to represent the characters within the last group, which can be calculated as (3) 66

4 When m is selected to be 4-bit, then the pre-fix combinations are 0 for G 1, 10 for G 2, 110 for G 3, 1110 for G 4, and for the remaining group(s). So that the number of bits for the last groups can also be calculated as L g = N c m 2 d (4) Table II shows the pre-fix bits combinations, the characters codes, and the compressed bits written to the compressed file for a file contains up to 96 characters. It is clear from the above discussion that the characters in G 5 and G 6 are merged together forming a group of 32 characters requiring 5-bit to represent each character from character 1 (00000) to character 31 (1). The pre-fix bits are 4-bit (), and then each character will be written as 9-bit character to the compressed file. However, if N c 80 and N c>64, then 1<L t 16, and in this case b t is 4, which means the number of compressed bits written to the compressed file is 8 TABLE II. THE PRE-FIX BITS AND THE NUMBER OF BITS WRITTEN TO THE COMPRESSED DATA FILE. Group (G) Character Pre-fix No. of (E) bits Pre-fix bits Character Code Compressed Bits No. of Compressed Bits E-HCDC HCDC One important feature of the m-encoding is that for m=1, it behaves exactly the same as the HCDC, where in this case the first 16 MCCs (G 1) are preceded by 0, while all other characters (groups) are preceded by 1. However, there is only one difference, which is in the HCDC, all 7-bit character is appended to the compressed file, while here, only b bits are appended, and b is calculated depending on the sequence number of the character excluding the first 16 MCC. In this case, we may need less than 7-bit to represent characters in the compressed file. This provides further saving and increasing the compression ratio. One other important feature of the m-encoding scheme is the optimum value of m can be calculated automatically depending on the characters frequencies or characters counts. Using the m-encoding scheme, the size of the compressed binary sequence can be calculated as Where Q b is length of the compressed binary sequence in bits, d is number of data bits in the Hamming codeword, T i is counts for character i (i=1 to N c), m is maximum length for the pre-fix bits, b t is number of bits representing the characters of the last group, and N c is number of characters sets in the text file. The code can calculate the values of Q b for all possible values of m, and select the value of m that gives minimum Q b to ensure highest possible compression ratio. B. The E-HCDC Algorithm (5) 67

5 This section presents the detailed description of the E- HCDC compression and decompression procedures for text compression. The E-HCDC compressor The E-HCDC compressor consists of the following steps 1) Read-in the input uncompressed text file. 2) Find the characters sets in the uncompressed file (N c), characters counts (T i), and characters frequencies (f i), where i=1 to N c. 3) Sort the characters set from the MCC to the LCC. 4) Determine the optimum value for m. 5) Initialize the compressed binary sequence to Null. 6) Start the compression process by reading in one character at a time until processing all characters; find the character sequence number, the group it s belonging to, and its sequence within the group. According to the value of m and the group its belonging to, append the pre-fix bits to the compressed binary sequence, find the character s equivalent binary representation based on the associated character wordlength (d or b depending on m and G), and then append the character binary representation to the compressed binary sequence. 7) In order to be able to convert the resultant compressed binary sequence (Q b) to 8-bit character length, and because Q b may not be a multiple of 8, then we append padding bits (say a 0 s) at the end of the compressed binary sequence, and the length of the padding bits (a) need to be stored at the compressed file header, so these bits can be discarded during the decompression phase. 8) Construct the compressed file header and append it to the compressed text sequence, which should contain all information necessary during the decompression process. 9) Convert the compressed binary sequence to characters by reading-in 8-bit at a time until converting all compressed binary sequence. For each 8-bit, find its equivalent ASCII code, and then append the equivalent character to the compressed text sequence. 10) Write the compressed text sequence to the compressed file. The E-HCDC decompressor The E-HCDC decompressor consists of the following steps 1) Read-in the input compressed file. 2) Extract the file header, and find-out N c, the uncompressed characters themselves (U i) (where i=1 to N c.), the maximum length of the pre-fix bits (m), and the number of padding bits (a). 3) Determine the number of groups (G) as G=m+1, the number of characters in the last group (L t), and the number of bits to represent the characters of the last group (b). 4) Initialize the compressed binary sequence (Q b) to Null. 5) Initialize the decompressed text sequence (D t) to Null. 6) Extract the compressed text sequence and read-in it one character at a time, convert each character to a binary representation according to its ASCII code, and append the resultant binary representation to the compressed binary sequence (Q b), until all characters are read-in. 7) Discard the appended bits (a). 8) Start the decompression process by reading in 1-bit, if this bit is 0, then the character belongs to G 1 (i.e., g=1), read-in the next 4-bit, find-out the equivalent decimal value (e) between 0 to 15, which represents the sequence number of the character (q) within G 1 minus 1 (q=e+1), determine the global sequence number (s) as s = q + 2 d (g - 1) = 1 + e + 2 d (g - 1) (6) Then, retrieve the character with sequence number s (U s) from the uncompressed characters lists and append it D t. If the read-in bit is 1 (count the number of 1 s (z)), then reads-in another bit until a 0 is readin or read-in a maximum of m 1 s. In this case, the group number (g) is equal to z+1 as long as z+1<m, otherwise it is equal to m. If g is not the last group (g<g), then read-in 4-bit, otherwise (g is the last group g=g) read-in b-bit, and perform the same steps above and append another character to D t. This process continues until reading all compressed binary sequence. 9) Write the decompressed text sequence (D t) to the uncompressed file. It can be clearly seen from the above discussion that the E-HCDC algorithm is 1) Bit-level. It processes the data at a binary level. 2) Lossless. An exact form of the source file is retrieved. 3) Adaptive. The character-binary coding depends on the characters frequencies. 4) Asymmetric. The compression time is higher than the decompression time. IV. RESULTS AND DISCUSSIONS The E-HCDC algorithm is implemented using VB.net programming language; however, it is important to indicate that the code is not optimized to determine the compression/decompression processing time, therefore, this paper only compares and shows the results for the compression ratio. It can be generally seen for the compression and the decompression procedures that the 68

6 algorithm can be classified as an asymmetric algorithm as the compression processing time looks longer than the time required for decompression. This section compares the compression ratio of the E- HCDC (C E-HCDC) against the compression ratio of the HCDC (C HCDC) for a number of text files from standard corpora, namely, Calgary Corpus, Canterbury Corpus, Artificial Corpus, and Large Corpus [16]. Furthermore, a new parameter is introduced, which is referred to as the compression enhancement ratio (E r) and it can be calculated as The results for C HCDC and C E-HCDC are listed in Table III, which clearly shows that the E-HCDC algorithm enhances the performance of the HCDC algorithm by 15% to 20% for various files from different corpora. This is achieved due to the implementation of the new m-encoding technique. Furthermore, Table III shows the values of G and m for each compressed file. (8) TABLE III. COMPARISON OF CHCDC AND CE-HCDC. Corpus File Name File Size (Byte) Nc CHCDC CE-HCDC G (m) m=g-1 Er (%) bib (5) 18 book (5) 19 book (5) 18 paper (5) 18 Calgary corpus paper (5) 19 paper (5) 19 paper (4) 19 paper (5) 18 paper (5) 18 alice29.txt (4) 19 Canterbury asyoulik.txt (4) 18 corpus lcet10.txt (5) 19 plrabn12.txt (5) 19 Artificial corpus alphabet.txt (1) 18 random.txt (3) 15 Large corpus bible.txt (3) 19 world192.txt (5) 18 V. CONCLUSIONS The main conclusions of this paper are (1) the E-HCDC algorithm enhances the compression ratio of HCDC algorithm by 15% to 20%. This is achieved due to the utilization of the newly developed m-encoding scheme for the pre-fix bits, (2) the analytical procedure for estimating the compression ratio provides a mean to automatically compute the optimum length for the pre-fix bits, and (3) the E-HCDC algorithm achieves a compression ratio that is high enough to be competent with the compression ratio achieved by many well-known algorithms of statistical and adaptive nature. The main recommendations for future work may include (1) utilize the m-encoding to develop a repetitive version of the E-HCDC algorithm (E-HCDC(k)), (2) use the E-HCDC algorithm as a post processing technique to increase the compression ratio of statistical lossless data compression algorithms, such as Shanon-Fano coding, Huffman coding, arithmetic coding, a combination of these algorithms, or any modified form of them, and (3) develop an optimized version of the code to compare its runtime with other compression algorithms and state-of-the-art software, and compare the compression/decompression processing runtimes. REFERENCES [1] K. Sayood (2012). Introduction to data compression. Fourth Edition, Morgan Kaufmann. [2] A. B. Sharma, L. Golubchik, R. Govindan, & M. J. Neely (2009). Dynamic data compression in multi-hop wireless networks. In Proceedings of the 11th International Joint Conference on Measurement and Modeling of Computer Systems, pp [3] L. G. Rueda & B. J. Oommen (2006). A fast and efficient nearlyoptimal adaptive Fano coding scheme. Journal of Information Sciences, Vol. 176, Issue 12, pp [4] H. Al-Bahadili (2008). A novel lossless data compression scheme based on the error correcting Hamming codes. Computers & Mathematics with Applications, Vol. 56, Issue 1, [5] H. Al-Bahadili & A. Rababa a (2010). A bit-level text compression scheme based on the HCDC algorithm. International Journal of Computers and Applications (IJCA), Vol. 32, Issue 3, pp [6] H. Al-Bahadili & S. Al-Saab (2011). Development of a novel compressed index-query Web search engine model. International Journal of Information Technology and Web Engineering (IJITWE), Vol. 6, No. 3, pp [7] I. Amro, R. Abu Zitar, & H. Al-Bahadili (2011). Speech compression exploiting linear prediction coefficients codebook and hamming 69

7 correction code algorithm. International Journal of Speech Technology, Vol. 14, No. 2, pp [8] H. Al-Bahadili & S. M. Hussain (2008). An adaptive character wordlength algorithm for data compression. Computers & Mathematics with Applications, Vol. 55, Issue 6, pp [9] H. Al-Bahadili & S. M. Hussain (2010). A bit-level text compression scheme based on ACW algorithm. The International Journal of Automation and Computing (IJAC), Vol. 7, No. 1, pp [10] C. Khancome (2011). Text compression algorithm using bits for character representation. International Journal of Advanced Computer Science, Vol. 1, No. 6, pp [11] M. Nicolae, S. Pathak, & S. Rajasekaran (2015). LFQC A lossless compression algorithm for FASTQ files. Journal of Bioinformatics, Vol. 31, No. 20, pp [12] S. Nofal (2007). Bit-level text compression. Proceedings of the 1st International Conference on Digital Communications and Computer Applications, pp [13] A. R. M. Jaradat, M. I. Irshid, & T. T. Nassar (2007). A file splitting technique for reducing the entropy of text files. International Journal of Computer, Information, Systems and Control Engineering, Vol. 1, No 7, pp [14] K. C. Barr & K. Asanovic (2006). Energy-aware lossless data compression. ACM Transaction on Computer System, Vol. 24, No. 3, pp [15] M. I. Irshid (2001). A simple binary run-length compression technique for non-binary sources based on source mapping. Active and Passive Electronic Components, Vol. 24, No. 4, pp [16] A. Ali (2013). Enhancing the compression ratio of the HCDC-based text compression scheme. M.Sc Thesis. Middle East University, Faculty of Information Technology, Amman-Jordan. 70

ADAPTIVE-QUALITY IMAGE COMPRESSION ALGORITHM

ADAPTIVE-QUALITY IMAGE COMPRESSION ALGORITHM ADAPTIVE-QUALITY IMAGE COMPRESSION ALGORITHM Abdel Rahman Alzoubaidi 1, Tamer Al-Sous 2 and Hussein Al-Bahadili 3 1 Department of Computer Engineering, Al Balqa Applied University, Salt, Jordan 2 Faculty

More information

A novel lossless data compression scheme based on the error correcting Hamming codes

A novel lossless data compression scheme based on the error correcting Hamming codes Computers and Mathematics with Applications 56 (2008) 143 150 www.elsevier.com/locate/camwa A novel lossless data compression scheme based on the error correcting Hamming codes Hussein Al-Bahadili Department

More information

arxiv: v2 [cs.it] 15 Jan 2011

arxiv: v2 [cs.it] 15 Jan 2011 Improving PPM Algorithm Using Dictionaries Yichuan Hu Department of Electrical and Systems Engineering University of Pennsylvania Email: yichuan@seas.upenn.edu Jianzhong (Charlie) Zhang, Farooq Khan and

More information

A Research Paper on Lossless Data Compression Techniques

A Research Paper on Lossless Data Compression Techniques IJIRST International Journal for Innovative Research in Science & Technology Volume 4 Issue 1 June 2017 ISSN (online): 2349-6010 A Research Paper on Lossless Data Compression Techniques Prof. Dipti Mathpal

More information

Lossless Compression Algorithms

Lossless Compression Algorithms Multimedia Data Compression Part I Chapter 7 Lossless Compression Algorithms 1 Chapter 7 Lossless Compression Algorithms 1. Introduction 2. Basics of Information Theory 3. Lossless Compression Algorithms

More information

So, what is data compression, and why do we need it?

So, what is data compression, and why do we need it? In the last decade we have been witnessing a revolution in the way we communicate 2 The major contributors in this revolution are: Internet; The explosive development of mobile communications; and The

More information

Keywords Data compression, Lossless data compression technique, Huffman Coding, Arithmetic coding etc.

Keywords Data compression, Lossless data compression technique, Huffman Coding, Arithmetic coding etc. Volume 6, Issue 2, February 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Comparative

More information

Data Compression Techniques

Data Compression Techniques Data Compression Techniques Part 1: Entropy Coding Lecture 1: Introduction and Huffman Coding Juha Kärkkäinen 31.10.2017 1 / 21 Introduction Data compression deals with encoding information in as few bits

More information

Dynamic with Dictionary Technique for Arabic Text Compression

Dynamic with Dictionary Technique for Arabic Text Compression International Journal of Computer Applications (975 8887) Volume 35 No.9, February 26 Dynamic with Dictionary Technique for Arabic Text Compression Fatima Thaher Ahmad Aburomman ABSTRACT In this research

More information

Data Compression. An overview of Compression. Multimedia Systems and Applications. Binary Image Compression. Binary Image Compression

Data Compression. An overview of Compression. Multimedia Systems and Applications. Binary Image Compression. Binary Image Compression An overview of Compression Multimedia Systems and Applications Data Compression Compression becomes necessary in multimedia because it requires large amounts of storage space and bandwidth Types of Compression

More information

S 1. Evaluation of Fast-LZ Compressors for Compacting High-Bandwidth but Redundant Streams from FPGA Data Sources

S 1. Evaluation of Fast-LZ Compressors for Compacting High-Bandwidth but Redundant Streams from FPGA Data Sources Evaluation of Fast-LZ Compressors for Compacting High-Bandwidth but Redundant Streams from FPGA Data Sources Author: Supervisor: Luhao Liu Dr. -Ing. Thomas B. Preußer Dr. -Ing. Steffen Köhler 09.10.2014

More information

IJSRD - International Journal for Scientific Research & Development Vol. 3, Issue 10, 2015 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 3, Issue 10, 2015 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol., Issue, ISSN (online): - Modified Golomb Code for Integer Representation Nelson Raja Joseph Jaganathan P Domnic Sandanam Department

More information

A Comparative Study of Entropy Encoding Techniques for Lossless Text Data Compression

A Comparative Study of Entropy Encoding Techniques for Lossless Text Data Compression A Comparative Study of Entropy Encoding Techniques for Lossless Text Data Compression P. RATNA TEJASWI 1 P. DEEPTHI 2 V.PALLAVI 3 D. GOLDIE VAL DIVYA 4 Abstract: Data compression is the art of reducing

More information

A New Compression Method Strictly for English Textual Data

A New Compression Method Strictly for English Textual Data A New Compression Method Strictly for English Textual Data Sabina Priyadarshini Department of Computer Science and Engineering Birla Institute of Technology Abstract - Data compression is a requirement

More information

Research Article Does an Arithmetic Coding Followed by Run-length Coding Enhance the Compression Ratio?

Research Article Does an Arithmetic Coding Followed by Run-length Coding Enhance the Compression Ratio? Research Journal of Applied Sciences, Engineering and Technology 10(7): 736-741, 2015 DOI:10.19026/rjaset.10.2425 ISSN: 2040-7459; e-issn: 2040-7467 2015 Maxwell Scientific Publication Corp. Submitted:

More information

Multimedia Communications ECE 728 (Data Compression)

Multimedia Communications ECE 728 (Data Compression) Multimedia Communications ECE 728 (Data Compression) Multimedia Communications Course number: ECE 728 Instructor: Dr. S. Shirani E-mail: shirani@mcmaster.ca Office: ITB-A320 Web page of the course: http://www.ece.mcmaster.ca/~shirani/multi08/multi08.htm

More information

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay Lecture - 29 Source Coding (Part-4) We have already had 3 classes on source coding

More information

Data Compression Scheme of Dynamic Huffman Code for Different Languages

Data Compression Scheme of Dynamic Huffman Code for Different Languages 2011 International Conference on Information and Network Technology IPCSIT vol.4 (2011) (2011) IACSIT Press, Singapore Data Compression Scheme of Dynamic Huffman Code for Different Languages Shivani Pathak

More information

CS 335 Graphics and Multimedia. Image Compression

CS 335 Graphics and Multimedia. Image Compression CS 335 Graphics and Multimedia Image Compression CCITT Image Storage and Compression Group 3: Huffman-type encoding for binary (bilevel) data: FAX Group 4: Entropy encoding without error checks of group

More information

Chapter 5 VARIABLE-LENGTH CODING Information Theory Results (II)

Chapter 5 VARIABLE-LENGTH CODING Information Theory Results (II) Chapter 5 VARIABLE-LENGTH CODING ---- Information Theory Results (II) 1 Some Fundamental Results Coding an Information Source Consider an information source, represented by a source alphabet S. S = { s,

More information

David Rappaport School of Computing Queen s University CANADA. Copyright, 1996 Dale Carnegie & Associates, Inc.

David Rappaport School of Computing Queen s University CANADA. Copyright, 1996 Dale Carnegie & Associates, Inc. David Rappaport School of Computing Queen s University CANADA Copyright, 1996 Dale Carnegie & Associates, Inc. Data Compression There are two broad categories of data compression: Lossless Compression

More information

ADVANCED LOSSLESS TEXT COMPRESSION ALGORITHM BASED ON SPLAY TREE ADAPTIVE METHODS

ADVANCED LOSSLESS TEXT COMPRESSION ALGORITHM BASED ON SPLAY TREE ADAPTIVE METHODS ADVANCED LOSSLESS TEXT COMPRESSION ALGORITHM BASED ON SPLAY TREE ADAPTIVE METHODS RADU RĂDESCU, ANDREEA HONCIUC *1 Key words: Data compression, Splay Tree, Prefix, ratio. This paper presents an original

More information

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Lecture 10 (Chapter 7) ZHU Yongxin, Winson

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Lecture 10 (Chapter 7) ZHU Yongxin, Winson Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Lecture 10 (Chapter 7) ZHU Yongxin, Winson zhuyongxin@sjtu.edu.cn 2 Lossless Compression Algorithms 7.1 Introduction 7.2 Basics of Information

More information

A Comparative Study Of Text Compression Algorithms

A Comparative Study Of Text Compression Algorithms International Journal of Wisdom Based Computing, Vol. 1 (3), December 2011 68 A Comparative Study Of Text Compression Algorithms Senthil Shanmugasundaram Department of Computer Science, Vidyasagar College

More information

FPGA based Data Compression using Dictionary based LZW Algorithm

FPGA based Data Compression using Dictionary based LZW Algorithm FPGA based Data Compression using Dictionary based LZW Algorithm Samish Kamble PG Student, E & TC Department, D.Y. Patil College of Engineering, Kolhapur, India Prof. S B Patil Asso.Professor, E & TC Department,

More information

A COMPRESSION TECHNIQUES IN DIGITAL IMAGE PROCESSING - REVIEW

A COMPRESSION TECHNIQUES IN DIGITAL IMAGE PROCESSING - REVIEW A COMPRESSION TECHNIQUES IN DIGITAL IMAGE PROCESSING - ABSTRACT: REVIEW M.JEYAPRATHA 1, B.POORNA VENNILA 2 Department of Computer Application, Nadar Saraswathi College of Arts and Science, Theni, Tamil

More information

IMAGE COMPRESSION TECHNIQUES

IMAGE COMPRESSION TECHNIQUES IMAGE COMPRESSION TECHNIQUES A.VASANTHAKUMARI, M.Sc., M.Phil., ASSISTANT PROFESSOR OF COMPUTER SCIENCE, JOSEPH ARTS AND SCIENCE COLLEGE, TIRUNAVALUR, VILLUPURAM (DT), TAMIL NADU, INDIA ABSTRACT A picture

More information

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay Lecture - 26 Source Coding (Part 1) Hello everyone, we will start a new module today

More information

Multimedia Data and Its Encoding

Multimedia Data and Its Encoding Lecture 13 Multimedia Data and Its Encoding M. Adnan Quaium Assistant Professor Department of Electrical and Electronic Engineering Ahsanullah University of Science and Technology Room 4A07 Email adnan.eee@aust.edu

More information

Comparative Study of Dictionary based Compression Algorithms on Text Data

Comparative Study of Dictionary based Compression Algorithms on Text Data 88 Comparative Study of Dictionary based Compression Algorithms on Text Data Amit Jain Kamaljit I. Lakhtaria Sir Padampat Singhania University, Udaipur (Raj.) 323601 India Abstract: With increasing amount

More information

EE-575 INFORMATION THEORY - SEM 092

EE-575 INFORMATION THEORY - SEM 092 EE-575 INFORMATION THEORY - SEM 092 Project Report on Lempel Ziv compression technique. Department of Electrical Engineering Prepared By: Mohammed Akber Ali Student ID # g200806120. ------------------------------------------------------------------------------------------------------------------------------------------

More information

Multimedia Networking ECE 599

Multimedia Networking ECE 599 Multimedia Networking ECE 599 Prof. Thinh Nguyen School of Electrical Engineering and Computer Science Based on B. Lee s lecture notes. 1 Outline Compression basics Entropy and information theory basics

More information

An Asymmetric, Semi-adaptive Text Compression Algorithm

An Asymmetric, Semi-adaptive Text Compression Algorithm An Asymmetric, Semi-adaptive Text Compression Algorithm Harry Plantinga Department of Computer Science University of Pittsburgh Pittsburgh, PA 15260 planting@cs.pitt.edu Abstract A new heuristic for text

More information

Journal of Computer Engineering and Technology (IJCET), ISSN (Print), International Journal of Computer Engineering

Journal of Computer Engineering and Technology (IJCET), ISSN (Print), International Journal of Computer Engineering Journal of Computer Engineering and Technology (IJCET), ISSN 0976 6367(Print), International Journal of Computer Engineering and Technology (IJCET), ISSN 0976 6367(Print) ISSN 0976 6375(Online) Volume

More information

IMAGE PROCESSING (RRY025) LECTURE 13 IMAGE COMPRESSION - I

IMAGE PROCESSING (RRY025) LECTURE 13 IMAGE COMPRESSION - I IMAGE PROCESSING (RRY025) LECTURE 13 IMAGE COMPRESSION - I 1 Need For Compression 2D data sets are much larger than 1D. TV and movie data sets are effectively 3D (2-space, 1-time). Need Compression for

More information

ENSC Multimedia Communications Engineering Topic 4: Huffman Coding 2

ENSC Multimedia Communications Engineering Topic 4: Huffman Coding 2 ENSC 424 - Multimedia Communications Engineering Topic 4: Huffman Coding 2 Jie Liang Engineering Science Simon Fraser University JieL@sfu.ca J. Liang: SFU ENSC 424 1 Outline Canonical Huffman code Huffman

More information

A Novel Image Compression Technique using Simple Arithmetic Addition

A Novel Image Compression Technique using Simple Arithmetic Addition Proc. of Int. Conf. on Recent Trends in Information, Telecommunication and Computing, ITC A Novel Image Compression Technique using Simple Arithmetic Addition Nadeem Akhtar, Gufran Siddiqui and Salman

More information

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION 15 Data Compression Data compression implies sending or storing a smaller number of bits. Although many methods are used for this purpose, in general these methods can be divided into two broad categories:

More information

Image coding and compression

Image coding and compression Chapter 2 Image coding and compression 2. Lossless and lossy compression We have seen that image files can be very large. It is thus important for reasons both of storage and file transfer to make these

More information

A Web Search Engine Model Based on Index-Query Bit-Level Compression

A Web Search Engine Model Based on Index-Query Bit-Level Compression A Web Search Engine Model Based on Index-Query Bit-Level Compression Hussein Al-Bahadili, Saif Saab, Reyadh Naoum The Arab Academy for Banking & Financial Sciences P.O. Box 13190 Amman 11942, Jordan 00962-6-5552900

More information

A Comprehensive Review of Data Compression Techniques

A Comprehensive Review of Data Compression Techniques Volume-6, Issue-2, March-April 2016 International Journal of Engineering and Management Research Page Number: 684-688 A Comprehensive Review of Data Compression Techniques Palwinder Singh 1, Amarbir Singh

More information

Data Compression Algorithms

Data Compression Algorithms Data Compression Algorithms Introduction Marcus Hutter (*1967) ANU Canberra 5310.2017 NSWI072-1 Data compression The process of converting an input data stream (the source stream, the original raw data)

More information

Ch. 2: Compression Basics Multimedia Systems

Ch. 2: Compression Basics Multimedia Systems Ch. 2: Compression Basics Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Outline Why compression? Classification Entropy and Information

More information

Volume 2, Issue 9, September 2014 ISSN

Volume 2, Issue 9, September 2014 ISSN Fingerprint Verification of the Digital Images by Using the Discrete Cosine Transformation, Run length Encoding, Fourier transformation and Correlation. Palvee Sharma 1, Dr. Rajeev Mahajan 2 1M.Tech Student

More information

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm International Journal of Engineering Research and General Science Volume 3, Issue 4, July-August, 15 ISSN 91-2730 A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

More information

Digital Image Processing

Digital Image Processing Lecture 9+10 Image Compression Lecturer: Ha Dai Duong Faculty of Information Technology 1. Introduction Image compression To Solve the problem of reduncing the amount of data required to represent a digital

More information

Engineering Mathematics II Lecture 16 Compression

Engineering Mathematics II Lecture 16 Compression 010.141 Engineering Mathematics II Lecture 16 Compression Bob McKay School of Computer Science and Engineering College of Engineering Seoul National University 1 Lossless Compression Outline Huffman &

More information

Text Data Compression and Decompression Using Modified Deflate Algorithm

Text Data Compression and Decompression Using Modified Deflate Algorithm Text Data Compression and Decompression Using Modified Deflate Algorithm R. Karthik, V. Ramesh, M. Siva B.E. Department of Computer Science and Engineering, SBM COLLEGE OF ENGINEERING AND TECHNOLOGY, Dindigul-624005.

More information

Chapter 1. Digital Data Representation and Communication. Part 2

Chapter 1. Digital Data Representation and Communication. Part 2 Chapter 1. Digital Data Representation and Communication Part 2 Compression Digital media files are usually very large, and they need to be made smaller compressed Without compression Won t have storage

More information

Fundamentals of Video Compression. Video Compression

Fundamentals of Video Compression. Video Compression Fundamentals of Video Compression Introduction to Digital Video Basic Compression Techniques Still Image Compression Techniques - JPEG Video Compression Introduction to Digital Video Video is a stream

More information

EE67I Multimedia Communication Systems Lecture 4

EE67I Multimedia Communication Systems Lecture 4 EE67I Multimedia Communication Systems Lecture 4 Lossless Compression Basics of Information Theory Compression is either lossless, in which no information is lost, or lossy in which information is lost.

More information

University of Mustansiriyah, Baghdad, Iraq

University of Mustansiriyah, Baghdad, Iraq Volume 5, Issue 9, September 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Audio Compression

More information

DEFLATE COMPRESSION ALGORITHM

DEFLATE COMPRESSION ALGORITHM DEFLATE COMPRESSION ALGORITHM Savan Oswal 1, Anjali Singh 2, Kirthi Kumari 3 B.E Student, Department of Information Technology, KJ'S Trinity College Of Engineering and Research, Pune, India 1,2.3 Abstract

More information

Experimental Evaluation of List Update Algorithms for Data Compression

Experimental Evaluation of List Update Algorithms for Data Compression Experimental Evaluation of List Update Algorithms for Data Compression Reza Dorrigiv 1, Alejandro López-Ortiz 1, and J. Ian Munro 1 Cheriton School of Computer Science, University of Waterloo, Waterloo,

More information

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression Digital Compression Page 8.1 DigiPoints Volume 1 Module 8 Digital Compression Summary This module describes the techniques by which digital signals are compressed in order to make it possible to carry

More information

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M. 322 FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING Moheb R. Girgis and Mohammed M. Talaat Abstract: Fractal image compression (FIC) is a

More information

A Method for the Construction of Minimum-Redundancy Codes*

A Method for the Construction of Minimum-Redundancy Codes* In 1951, David HuftinaD o.vdopod aa ajaoritlun for etbciently encoding the output of a source that produces a sequeaee of symbols, cacllof~jch h1saprobability of oecu.rtenee. This aljorithm, essentially

More information

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR / ODD SEMESTER QUESTION BANK KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY ACADEMIC YEAR 2011-2012 / ODD SEMESTER QUESTION BANK SUB.CODE / NAME YEAR / SEM : IT1301 INFORMATION CODING TECHNIQUES : III / V UNIT -

More information

Information Technology Department, PCCOE-Pimpri Chinchwad, College of Engineering, Pune, Maharashtra, India 2

Information Technology Department, PCCOE-Pimpri Chinchwad, College of Engineering, Pune, Maharashtra, India 2 Volume 5, Issue 5, May 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Adaptive Huffman

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY Rashmi Gadbail,, 2013; Volume 1(8): 783-791 INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK EFFECTIVE XML DATABASE COMPRESSION

More information

CS/COE 1501

CS/COE 1501 CS/COE 1501 www.cs.pitt.edu/~lipschultz/cs1501/ Compression What is compression? Represent the same data using less storage space Can get more use out a disk of a given size Can get more use out of memory

More information

An Advanced Text Encryption & Compression System Based on ASCII Values & Arithmetic Encoding to Improve Data Security

An Advanced Text Encryption & Compression System Based on ASCII Values & Arithmetic Encoding to Improve Data Security Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 10, October 2014,

More information

Data Compression. Guest lecture, SGDS Fall 2011

Data Compression. Guest lecture, SGDS Fall 2011 Data Compression Guest lecture, SGDS Fall 2011 1 Basics Lossy/lossless Alphabet compaction Compression is impossible Compression is possible RLE Variable-length codes Undecidable Pigeon-holes Patterns

More information

Optimization of Bit Rate in Medical Image Compression

Optimization of Bit Rate in Medical Image Compression Optimization of Bit Rate in Medical Image Compression Dr.J.Subash Chandra Bose 1, Mrs.Yamini.J 2, P.Pushparaj 3, P.Naveenkumar 4, Arunkumar.M 5, J.Vinothkumar 6 Professor and Head, Department of CSE, Professional

More information

Comparison of Image Compression Behaviour for WhatsApp, Viber and LINE

Comparison of Image Compression Behaviour for WhatsApp, Viber and LINE Comparison of Image Compression Behaviour for WhatsApp, Viber and LINE Moath Maharmeh University of Petra Al-Bunayyat, Amman, 11196, Jordan moath@vegalayer.com Keywords: WhatsApp compression, Viber compression,

More information

Multimedia Systems. Part 20. Mahdi Vasighi

Multimedia Systems. Part 20. Mahdi Vasighi Multimedia Systems Part 2 Mahdi Vasighi www.iasbs.ac.ir/~vasighi Department of Computer Science and Information Technology, Institute for dvanced Studies in asic Sciences, Zanjan, Iran rithmetic Coding

More information

ENSC Multimedia Communications Engineering Huffman Coding (1)

ENSC Multimedia Communications Engineering Huffman Coding (1) ENSC 424 - Multimedia Communications Engineering Huffman Coding () Jie Liang Engineering Science Simon Fraser University JieL@sfu.ca J. Liang: SFU ENSC 424 Outline Entropy Coding Prefix code Kraft-McMillan

More information

LZ UTF8. LZ UTF8 is a practical text compression library and stream format designed with the following objectives and properties:

LZ UTF8. LZ UTF8 is a practical text compression library and stream format designed with the following objectives and properties: LZ UTF8 LZ UTF8 is a practical text compression library and stream format designed with the following objectives and properties: 1. Compress UTF 8 and 7 bit ASCII strings only. No support for arbitrary

More information

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

International Journal of Emerging Technology and Advanced Engineering Website:   (ISSN , Volume 2, Issue 4, April 2012) A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and

More information

EFFICIENT DEISGN OF LOW AREA BASED H.264 COMPRESSOR AND DECOMPRESSOR WITH H.264 INTEGER TRANSFORM

EFFICIENT DEISGN OF LOW AREA BASED H.264 COMPRESSOR AND DECOMPRESSOR WITH H.264 INTEGER TRANSFORM EFFICIENT DEISGN OF LOW AREA BASED H.264 COMPRESSOR AND DECOMPRESSOR WITH H.264 INTEGER TRANSFORM 1 KALIKI SRI HARSHA REDDY, 2 R.SARAVANAN 1 M.Tech VLSI Design, SASTRA University, Thanjavur, Tamilnadu,

More information

CS/COE 1501

CS/COE 1501 CS/COE 1501 www.cs.pitt.edu/~nlf4/cs1501/ Compression What is compression? Represent the same data using less storage space Can get more use out a disk of a given size Can get more use out of memory E.g.,

More information

Error Resilient LZ 77 Data Compression

Error Resilient LZ 77 Data Compression Error Resilient LZ 77 Data Compression Stefano Lonardi Wojciech Szpankowski Mark Daniel Ward Presentation by Peter Macko Motivation Lempel-Ziv 77 lacks any form of error correction Introducing a single

More information

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106

CHAPTER 6. 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform. 6.3 Wavelet Transform based compression technique 106 CHAPTER 6 6 Huffman Coding Based Image Compression Using Complex Wavelet Transform Page No 6.1 Introduction 103 6.2 Compression Techniques 104 103 6.2.1 Lossless compression 105 6.2.2 Lossy compression

More information

Analysis of Parallelization Effects on Textual Data Compression

Analysis of Parallelization Effects on Textual Data Compression Analysis of Parallelization Effects on Textual Data GORAN MARTINOVIC, CASLAV LIVADA, DRAGO ZAGAR Faculty of Electrical Engineering Josip Juraj Strossmayer University of Osijek Kneza Trpimira 2b, 31000

More information

Figure-2.1. Information system with encoder/decoders.

Figure-2.1. Information system with encoder/decoders. 2. Entropy Coding In the section on Information Theory, information system is modeled as the generationtransmission-user triplet, as depicted in fig-1.1, to emphasize the information aspect of the system.

More information

Encoding. A thesis submitted to the Graduate School of University of Cincinnati in

Encoding. A thesis submitted to the Graduate School of University of Cincinnati in Lossless Data Compression for Security Purposes Using Huffman Encoding A thesis submitted to the Graduate School of University of Cincinnati in a partial fulfillment of requirements for the degree of Master

More information

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay

Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay Information Theory and Coding Prof. S. N. Merchant Department of Electrical Engineering Indian Institute of Technology, Bombay Lecture - 11 Coding Strategies and Introduction to Huffman Coding The Fundamental

More information

Intro. To Multimedia Engineering Lossless Compression

Intro. To Multimedia Engineering Lossless Compression Intro. To Multimedia Engineering Lossless Compression Kyoungro Yoon yoonk@konkuk.ac.kr 1/43 Contents Introduction Basics of Information Theory Run-Length Coding Variable-Length Coding (VLC) Dictionary-based

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Image Compression Caution: The PDF version of this presentation will appear to have errors due to heavy use of animations Material in this presentation is largely based on/derived

More information

Highly Secure Invertible Data Embedding Scheme Using Histogram Shifting Method

Highly Secure Invertible Data Embedding Scheme Using Histogram Shifting Method www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 3 Issue 8 August, 2014 Page No. 7932-7937 Highly Secure Invertible Data Embedding Scheme Using Histogram Shifting

More information

TEXT COMPRESSION ALGORITHMS - A COMPARATIVE STUDY

TEXT COMPRESSION ALGORITHMS - A COMPARATIVE STUDY S SENTHIL AND L ROBERT: TEXT COMPRESSION ALGORITHMS A COMPARATIVE STUDY DOI: 10.21917/ijct.2011.0062 TEXT COMPRESSION ALGORITHMS - A COMPARATIVE STUDY S. Senthil 1 and L. Robert 2 1 Department of Computer

More information

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression IMAGE COMPRESSION Image Compression Why? Reducing transportation times Reducing file size A two way event - compression and decompression 1 Compression categories Compression = Image coding Still-image

More information

Efficient VLSI Huffman encoder implementation and its application in high rate serial data encoding

Efficient VLSI Huffman encoder implementation and its application in high rate serial data encoding LETTER IEICE Electronics Express, Vol.14, No.21, 1 11 Efficient VLSI Huffman encoder implementation and its application in high rate serial data encoding Rongshan Wei a) and Xingang Zhang College of Physics

More information

Enhancing Text Compression Method Using Information Source Indexing

Enhancing Text Compression Method Using Information Source Indexing Enhancing Text Compression Method Using Information Source Indexing Abstract Ahmed Musa 1* and Ayman Al-Dmour 1. Department of Computer Engineering, Taif University, Taif-KSA. Sabbatical Leave: Al-Hussein

More information

Department of electronics and telecommunication, J.D.I.E.T.Yavatmal, India 2

Department of electronics and telecommunication, J.D.I.E.T.Yavatmal, India 2 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY LOSSLESS METHOD OF IMAGE COMPRESSION USING HUFFMAN CODING TECHNIQUES Trupti S Bobade *, Anushri S. sastikar 1 Department of electronics

More information

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201 Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Why do we need to compress speech signals Basic components in a source coding

More information

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University ECE 499/599 Data Compression & Information Theory Thinh Nguyen Oregon State University Adminstrivia Office Hours TTh: 2-3 PM Kelley Engineering Center 3115 Class homepage http://www.eecs.orst.edu/~thinhq/teaching/ece499/spring06/spring06.html

More information

CS106B Handout 34 Autumn 2012 November 12 th, 2012 Data Compression and Huffman Encoding

CS106B Handout 34 Autumn 2012 November 12 th, 2012 Data Compression and Huffman Encoding CS6B Handout 34 Autumn 22 November 2 th, 22 Data Compression and Huffman Encoding Handout written by Julie Zelenski. In the early 98s, personal computers had hard disks that were no larger than MB; today,

More information

REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ADVANTAGES OF IMAGE COMPRESSION

REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ADVANTAGES OF IMAGE COMPRESSION REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ABSTRACT ADVANTAGES OF IMAGE COMPRESSION Amanpreet Kaur 1, Dr. Jagroop Singh 2 1 Ph. D Scholar, Deptt. of Computer Applications, IK Gujral Punjab Technical University,

More information

HARDWARE IMPLEMENTATION OF LOSSLESS LZMA DATA COMPRESSION ALGORITHM

HARDWARE IMPLEMENTATION OF LOSSLESS LZMA DATA COMPRESSION ALGORITHM HARDWARE IMPLEMENTATION OF LOSSLESS LZMA DATA COMPRESSION ALGORITHM Parekar P. M. 1, Thakare S. S. 2 1,2 Department of Electronics and Telecommunication Engineering, Amravati University Government College

More information

ENHANCED DCT COMPRESSION TECHNIQUE USING VECTOR QUANTIZATION AND BAT ALGORITHM Er.Samiksha 1, Er. Anurag sharma 2

ENHANCED DCT COMPRESSION TECHNIQUE USING VECTOR QUANTIZATION AND BAT ALGORITHM Er.Samiksha 1, Er. Anurag sharma 2 ENHANCED DCT COMPRESSION TECHNIQUE USING VECTOR QUANTIZATION AND BAT ALGORITHM Er.Samiksha 1, Er. Anurag sharma 2 1 Research scholar (M-tech) ECE, CT Ninstitute of Technology and Recearch, Jalandhar, Punjab,

More information

LOSSLESS VERSUS LOSSY COMPRESSION: THE TRADE OFFS

LOSSLESS VERSUS LOSSY COMPRESSION: THE TRADE OFFS LOSSLESS VERSUS LOSSY COMPRESSION: THE TRADE OFFS 1 Ayush Raniwala Student, Computer Science Dept. SVKM s NMIMS Mukesh Patel School of Technology Management & Engineering,Mumbai, India ABSTRACT This paper

More information

Fundamentals of Multimedia. Lecture 5 Lossless Data Compression Variable Length Coding

Fundamentals of Multimedia. Lecture 5 Lossless Data Compression Variable Length Coding Fundamentals of Multimedia Lecture 5 Lossless Data Compression Variable Length Coding Mahmoud El-Gayyar elgayyar@ci.suez.edu.eg Mahmoud El-Gayyar / Fundamentals of Multimedia 1 Data Compression Compression

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

Index. 1. Motivation 2. Background 3. JPEG Compression The Discrete Cosine Transformation Quantization Coding 4. MPEG 5.

Index. 1. Motivation 2. Background 3. JPEG Compression The Discrete Cosine Transformation Quantization Coding 4. MPEG 5. Index 1. Motivation 2. Background 3. JPEG Compression The Discrete Cosine Transformation Quantization Coding 4. MPEG 5. Literature Lossy Compression Motivation To meet a given target bit-rate for storage

More information

Optical Storage Technology. MPEG Data Compression

Optical Storage Technology. MPEG Data Compression Optical Storage Technology MPEG Data Compression MPEG-1 1 Audio Standard Moving Pictures Expert Group (MPEG) was formed in 1988 to devise compression techniques for audio and video. It first devised the

More information

Cost Minimization by QR Code Compression

Cost Minimization by QR Code Compression Cost Minimization by QR Code Compression Sharu Goel #1, Ajay Kumar Singh *2 #1 M. Tech Student & CSE Deptt., Meerut Institute of Engineering and Technology, Baghpat Bypass Road, NH- 58, Meerut, UPTU, (India)

More information

Data Representation. Types of data: Numbers Text Audio Images & Graphics Video

Data Representation. Types of data: Numbers Text Audio Images & Graphics Video Data Representation Data Representation Types of data: Numbers Text Audio Images & Graphics Video Analog vs Digital data How is data represented? What is a signal? Transmission of data Analog vs Digital

More information

Comparison of Text Data Compression Using Run Length Encoding, Arithmetic Encoding, Punctured Elias Code and Goldbach Code

Comparison of Text Data Compression Using Run Length Encoding, Arithmetic Encoding, Punctured Elias Code and Goldbach Code Comparison of Text Data Compression Using Run Length Encoding, Arithmetic Encoding, Punctured Elias Code and Goldbach Code Kenang Eko Prasetyo 1, Tito Waluyo Purboyo 2, Randy Erfa Saputra 3 Computer Engineering,

More information