GUJARAT TECHNOLOGICAL UNIVERSITY

Similar documents
Silver Oak College of Engineering and Technology Information Technology Department Mid Semester 2 Syllabus 6 th IT

GUJARAT TECHNOLOGICAL UNIVERSITY

GUJARAT TECHNOLOGICAL UNIVERSITY

GUJARAT TECHNOLOGICAL UNIVERSITY

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University

GUJARAT TECHNOLOGICAL UNIVERSITY

Multimedia Communications ECE 728 (Data Compression)

Lecture 6 Review of Lossless Coding (II)

So, what is data compression, and why do we need it?

GUJARAT TECHNOLOGICAL UNIVERSITY

Image coding and compression

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

GUJARAT TECHNOLOGICAL UNIVERSITY

Data Compression. An overview of Compression. Multimedia Systems and Applications. Binary Image Compression. Binary Image Compression

7.5 Dictionary-based Coding

Electrical Engineering

Lossless compression II

Information Retrieval. Lecture 3 - Index compression. Introduction. Overview. Characterization of an index. Wintersemester 2007

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

GUJARAT TECHNOLOGICAL UNIVERSITY, AHMEDABAD, GUJARAT. Computer Engineering

GUJARAT TECHNOLOGICAL UNIVERSITY, AHMEDABAD, GUJARAT. COURSE CURRICULUM COURSE TITLE: OBJECT ORIENTED PROGRAMMING (Code: )

Optimized Compression and Decompression Software

GUJARAT TECHNOLOGICAL UNIVERSITY, AHMEDABAD, GUJARAT. Course Curriculum. DATA COMMUNICATION AND NETWORKING (Code: ) Biomedical engineering

GUJARAT TECHNOLOGICAL UNIVERSITY, AHMEDABAD, GUJARAT

Multimedia Communications. Transform Coding

A Novel Image Compression Technique using Simple Arithmetic Addition

Multimedia Networking ECE 599

Welcome Back to Fundamentals of Multimedia (MR412) Fall, 2012 Lecture 10 (Chapter 7) ZHU Yongxin, Winson

Lecture Information. Mod 01 Part 1: The Need for Compression. Why Digital Signal Coding? (1)

Lecture Information Multimedia Video Coding & Architectures

Lecture 5 Lossless Coding (II)

GUJARAT TECHNOLOGICAL UNIVERSITY, AHMEDABAD, GUJARAT. COURSE CURRICULUM COURSE TITLE: OBJECT ORINTED PROGRAMMING (Code: )

GUJARAT TECHNOLOGICAL UNIVERSITY

Lecture 5 Lossless Coding (II) Review. Outline. May 20, Image and video encoding: A big picture

David Rappaport School of Computing Queen s University CANADA. Copyright, 1996 Dale Carnegie & Associates, Inc.

Compression II: Images (JPEG)

A Simple Lossless Compression Heuristic for Grey Scale Images

Topic 5 Image Compression

Image compression. Stefano Ferrari. Università degli Studi di Milano Methods for Image Processing. academic year

Index Compression. David Kauchak cs160 Fall 2009 adapted from:

Administrative. Distributed indexing. Index Compression! What I did last summer lunch talks today. Master. Tasks

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

Lossless Compression Algorithms

Information Technology Department, PCCOE-Pimpri Chinchwad, College of Engineering, Pune, Maharashtra, India 2

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

0ШШШ J&> ELSEVIER. Lossless. (ompression Handbook KHALID SAYOOD EDITOR ^ 1 R H F A»V. -ашштюшшг

Efficiency. Efficiency: Indexing. Indexing. Efficiency Techniques. Inverted Index. Inverted Index (COSC 488)

A Research Paper on Lossless Data Compression Techniques

Course Title: Fundamental of Information Technology (Code: )

CS 335 Graphics and Multimedia. Image Compression

Image Coding and Data Compression

GUJARAT TECHNOLOGICAL UNIVERSITY, AHMEDABAD, GUJARAT COURSE CURRICULUM COURSE TITLE: DCS AND SCADA (COURSE CODE: )

Digital Communication Prof. Bikash Kumar Dey Department of Electrical Engineering Indian Institute of Technology, Bombay

Data Representation and Networking

Digital Image Processing

Data Compression Fundamentals

IMAGE COMPRESSION TECHNIQUES

EE-575 INFORMATION THEORY - SEM 092

Simple variant of coding with a variable number of symbols and fixlength codewords.

IJSRD - International Journal for Scientific Research & Development Vol. 3, Issue 10, 2015 ISSN (online):

Compression. storage medium/ communications network. For the purpose of this lecture, we observe the following constraints:

A Public Domain Tool for Wavelet Image Coding for Remote Sensing and GIS Applications

Lecture 5: Compression I. This Week s Schedule

Data Compression Techniques for Big Data

Fundamentals of Video Compression. Video Compression

EE67I Multimedia Communication Systems Lecture 4

Maharashtra State Board of Technical Education (MSBTE) 'I' Scheme VI Semester Course Curriculum

Bi-Level Image Compression

COMPRESSION TECHNIQUES

Information Retrieval

Introduction to Information Retrieval

End-to-End Data. Presentation Formatting. Difficulties. Outline Formatting Compression

What is multimedia? Multimedia. Continuous media. Most common media types. Continuous media processing. Interactivity. What is multimedia?

yintroduction to compression ytext compression yimage compression ysource encoders and destination decoders

Introduction to Information Retrieval (Manning, Raghavan, Schutze)

CS6200 Information Retrieval. David Smith College of Computer and Information Science Northeastern University

Optimization of Bit Rate in Medical Image Compression

Journal of Computer Engineering and Technology (IJCET), ISSN (Print), International Journal of Computer Engineering

Outline of the course

GUJARAT TECHNOLOGICAL UNIVERSITY COMPUTER ENGINEERING (07) / INFORMATION TECHNOLOGY (16) / INFORMATION & COMMUNICATION TECHNOLOGY (32) DATA STRUCTURES

CS/COE 1501

Engineering Mathematics II Lecture 16 Compression

Spatio-temporal Range Searching Over Compressed Kinetic Sensor Data. Sorelle A. Friedler Google Joint work with David M. Mount

Multimedia. What is multimedia? Media types. Interchange formats. + Text +Graphics +Audio +Image +Video. Petri Vuorimaa 1

Entropy Coding. - to shorten the average code length by assigning shorter codes to more probable symbols => Morse-, Huffman-, Arithmetic Code

Introduction to Compression. Norm Zeck

Study of LZ77 and LZ78 Data Compression Techniques

An Optimum Novel Technique Based on Golomb-Rice Coding for Lossless Image Compression of Digital Images

Repetition 1st lecture

Differential Compression and Optimal Caching Methods for Content-Based Image Search Systems

Mobile Computing and Application Development Course code :

Highly Secure Invertible Data Embedding Scheme Using Histogram Shifting Method

Data Compression. Guest lecture, SGDS Fall 2011

Data Storage. Slides derived from those available on the web site of the book: Computer Science: An Overview, 11 th Edition, by J.

Image Coding and Compression

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Coding of Still Pictures

Data Representation. Types of data: Numbers Text Audio Images & Graphics Video

CS 493: Algorithms for Massive Data Sets Dictionary-based compression February 14, 2002 Scribe: Tony Wirth LZ77

CMPT 365 Multimedia Systems. Media Compression - Image

Transcription:

GUJARAT TECHNOLOGICAL UNIVERSITY INFORMATION TECHNOLOGY DATA COMPRESSION AND DATA RETRIVAL SUBJECT CODE: 2161603 B.E. 6 th SEMESTER Type of course: Core Prerequisite: None Rationale: Data compression refers to the process of encoding information such that memory/transmission capacity requirements are minimized. Though there is an exponential growth in memory and transmission capacity, many high-bandwidth applications, such as digital storage and transmission of video, would not work without compression. Teaching and Examination Scheme: Teaching Scheme Credits Examination Marks Total L T P C Theory Marks Practical Marks Marks ESE PA (M) ESE (V) PA (E) PA ALA ESE OEP (I) 3 0 2 5 70 20 10 20 10 20 150 Content: Sr. No. Content Total Hrs % Weightage 1 Compression Techniques :Lossless Compression, Lossy Compression,Measures of Performance 2 Mathematical Preliminaries for Lossless Compression Models : Physical Models Probability Models Markov Models Composite Source Model Coding Uniquely Decodable Codes Prefix Codes Algorithmic Information Theory Minimum Description Length Principle 3 Huffman Coding The Huffman Coding Algorithm 41 Minimum Variance Huffman Codes Adaptive Huffman Coding Update Procedure Encoding Procedure Decoding Procedure Golomb Codes Rice Codes 2 5 4 10 6 15

Tunstall Codes Applications of Huffman Coding Lossless Image Compression Text Compression Audio Compression 4 Arithmetic Coding Introduction Coding a Sequence Generating a Tag Deciphering the Tag Generating a Binary Code Uniqueness and Efficiency of the Arithmetic Code Algorithm Implementation Integer Implementation Comparison of Huffman and Arithmetic Coding Adaptive Arithmetic Coding 5 Dictionary Techniques Static Dictionary Digram Coding Adaptive Dictionary The LZ77 Approach The LZ78 Approach Applications File Compression UNIX compress Image Compression The Graphics Interchange Format (GIF) Image Compression Portable Network Graphics (PNG) Compression over Modems V.42 bis 6 Predictive Coding: Prediction with Partial match (ppm): The basic algorithm, The ESCAPE SYMBOL, Length of context, The Exclusion Principle, The Burrows-Wheeler Transform: Move-to-front coding Lossless Image Compression CALIC, JPEG-LS, Multi-resolution Approaches Facsimile Encoding Dynamic Markoy Compression. 7 Mathematical Preliminaries for Lossy Coding Distortion criteria, Models, The Quantization Problem Uniform Quantizer Adaptive Quantization Forward Adaptive Quantization Backward Adaptive Quantization Nonuniform Quantization pdf-optimized Quantization Companded Quantization 8 Vector Quantization Advantages of Vector Quantization over Scalar Quantization The Linde-Buzo-Gray Algorithm 5 10 6 15 6 10 06 10 07 10

Initializing the LBG Algorithm The Empty Cell Problem Use of LBG for Image Compression Tree-Structured Vector Quantizers Design of Tree-Structured Vector Quantizers Pruned Tree-Structured Vector Quantizers Structured Vector Quantizers Pyramid Vector Quantization Polar and Spherical Vector Quantizers Lattice Vector Quantizers 9 Boolean retrieval An example information retrieval problem A first take at building an inverted index Processing Boolean queries The extended Boolean model versus ranked retrieval The term vocabulary and postings lists Document delineation and character sequence decoding Obtaining the character sequence in a document Choosing a document unit Determining the vocabulary of terms Tokenization Dropping common terms: stop words Normalization (equivalence classing of terms) Stemming and lemmatization Faster postings list intersection via skip pointers Positional postings and phrase queries Biword indexes Positional indexes 10 XML retrieval Basic XML concepts Challenges in XML retrieval A vector space model for XML retrieval Evaluation of XML retrieval Text-centric vs. data-centric XML retrieval 04 10 02 5 Suggested Specification table with Marks (Theory): Distribution of Theory Marks R Level U Level A Level N Level E Level C Level 15 35 15 5 00 00 Legends: R: Remembrance; U: Understanding; A: Application, N: Analyze and E: Evaluate C: Create and above Levels (Revised Bloom s Taxonomy) Note: This specification table shall be treated as a general guideline for students and teachers. The actual distribution of marks in the question paper may vary slightly from above table.

Reference Books: 1. Introduction to Data Compression, Khalid Sayood, Morgan Kaufmann 2. Publishers 3. The Data Compression book, Mark Nelson,Jean Loup Gaily 4. Data Compression : The Complete Reference, David Saloman, Springer 5. An Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, Hinrich Schütze, Cambridge University Press,Cambridge, England 6. Information storage and retrieval, Robert Korfhage, WILEY Course Outcome: After learning the course the students should be able to: 1. Understand and apply various coding techniques for compression. 2. Differentiate between Lossy and Lossless compression. 3. Understand basic concept of information retrieval List of Experiments: 1. Write a program that compresses and displays uncompressed windows BMP image file. 2. Write a program to generate binary code in case of arithmetic coding. 3. Implement Huffman Code(HC) to generate binary code when symbol and probabilities are given. 4. Implement Huffman code which can compress given file and decompress compressed file. 5. Implement adaptive Huffman program to compress decompressed file. 6. Write a program to Implement LZ77 algorithm. 7. Write a program to Implement LZ55 algorithm. 8. Write a program to Implement LZ78 algorithm 9. Write a program which performs JPEG compression, process step by step for given 8x8 block and decompression also. 10. Write a program to find tokens from the files and eliminate stop words. 11. Write a program to implement vector space model for XML retrieval. Design based Problems (DP)/Open Ended Problem: 1. Design an architecture and algorithm for data compression in cache and main memory. 2. Design an algorithm for compressing photo or video that is shared across social media. 3. Design an algorithm for compressing data at sensor which is reporting temperature data. Major Equipment: Computer,Laptop List of Open Source Software/learning website: 1) http://ocw.usu.edu/electrical_and_computer_engineering/information_theory/ ACTIVE LEARNING ASSIGNMENTS: Preparation of power-point slides, which include videos, animations, pictures, graphics for better understanding theory and practical work The faculty will allocate chapters/ parts of chapters to groups of students so that the entire syllabus to be covered. The power-point slides should be put up on the web-site of the College/ Institute, along with the names of the students of the group, the name of the faculty, Department and College on the first slide. The best three works should submit to GTU.