Optimal Data Compression Techniquesfor Smart Grid and Po

Size: px
Start display at page:

Download "Optimal Data Compression Techniquesfor Smart Grid and Po"

Transcription

1 Optimal Data Compression Techniques for Smart Grid and Power Quality Trend Data Faculty of Mechatronics Technical university in Liberec Research and development department KMB systems, s.r.o. ICHQP 2012, June 2012

2 Motivation: Smart grids, buildings, vehicles, energy producers and consumers > all generate an extreme amount of sensitive data comunication and storage is time/space demanding lossy algorithms are effective but these unpredictably deform the reality archived information is mostly jung - doesn t contain many useful features Project target: Format for efective storage and processing of PQ data (PQDIF, XLS or CSV is NOT storage friendly!) Is it possible to optimise the already best performing classical compression algorithm?

3 Motivation: Smart grids, buildings, vehicles, energy producers and consumers > all generate an extreme amount of sensitive data comunication and storage is time/space demanding lossy algorithms are effective but these unpredictably deform the reality archived information is mostly jung - doesn t contain many useful features Project target: Format for efective storage and processing of PQ data (PQDIF, XLS or CSV is NOT storage friendly!) Is it possible to optimise the already best performing classical compression algorithm?

4 Motivation: Smart grids, buildings, vehicles, energy producers and consumers > all generate an extreme amount of sensitive data comunication and storage is time/space demanding lossy algorithms are effective but these unpredictably deform the reality archived information is mostly jung - doesn t contain many useful features Project target: Format for efective storage and processing of PQ data (PQDIF, XLS or CSV is NOT storage friendly!) Is it possible to optimise the already best performing classical compression algorithm?

5 Tabulka : Compression ratios - assorted competitors Instrument(Mfg) File Original [kb] ZIP [kb] CR LZMA [kb] CR PQBox Sample PAS (Satec) Sample Sample Sample Sample MiQen (Iskra) MC760 Demo X4G_Demo PQInvest. (ELSPEC) Main 22KV ENA (ELCOM) QNA (Circutor) SPQ SPQ GridViz (Janitza) c c

6 Tabulka : Actual situation: RETIS, CETIS and ENVIS ratios File Original [kb] ZIP [kb] CR LZMA [kb] CR RETIS SMZ-Test SMZ-T CETIS Test Demo ENVIS 1.0 KMB SMY-Demo

7 SMPQ analyser: capacity of internal archives 256 MB Main archive (light blue) 15 MB PQ events (red) 11 MB PQ main archiv (blue) 1,5 MB LOG,1 MB HDO (RCS) (green) 0,7 MB daily profiles, 0,1 MB electricity meter (black) 1 MB reserve (for invalid flash blocks) (pink) PQ Event Trend and Ocilogram (saves to RAM)

8

9 Distribution of values of various measured quantities in the archived files.

10 Distribution of U, I, P a Q in a sample archive.

11 Distributions of values of selected harmonicks

12 vlastnosti komprimovaných dat Voltage harmonics absolute. Jan Kraus, Pavel Šte pán, Leoš Kukac ka relative to limits->

13 Selected loss-less compression techniques tested entropy encodings Huffman, Arithmetic, Range vocabulary methods (LZ77) DEFLATE 1, LZO combined methods BZIP2, LZMA 1 ZIP, GZIP

14 Type of compression and size of input. Voltage and current TRMS and harmonics included.

15 CR: LZ77 (ZIP) and LZMA versus input data size. Size of input versus CR for voltage (left) and active power (right) for ZIP and LZMA compression.

16 LZMA CR stability for charakteristical quantities.

17 LZMA compression of harmonics (U, orig. 700 kb)

18 LZMA by experiments turned out to be the best berforming algorithm for PQ trends compression. Builds on dictionary compression LZ77, extended with bit prediction in stream and range encoding of input data. Its good for application of prediction models to reduce symbol code variability in the compressed file: dictionary: higher propability to hit the term in dictionary entropy encoder: less symbols improve its efficiency

19 Modeling and optimization of a time serie.

20 Involved modeling techniques: relational models interval characteristics polynomial models spline models transform models (FFT, DWT) Evaluated model optimizations: effective encoding of values value ordering, differential encoding definition, size and distribution of modelling intervals local optimization dynamic interval length - RMSE, LogHist

21 Involved modeling techniques: relational models interval characteristics polynomial models spline models transform models (FFT, DWT) Evaluated model optimizations: effective encoding of values value ordering, differential encoding definition, size and distribution of modelling intervals local optimization dynamic interval length - RMSE, LogHist

22 Polynomial models - oscilating nature, not useful

23 Model of relations: Tabulka : CR for Pfh, Q fh modeled with P h, Q h. KMB1 P fh 1 P fh 2 P fh 3 Q fh 1 Q fh 3 Q fh 3 CR [] Ori Mdl Tabulka : CR improvement for LZMA while calculating U LL from U LN U LL [kb] U LN [kb] Sum Data CR U1 U2 U3 U12 U23 U31 [kb] [] KMB-1d Org Mdl FOS1 Org Mdl

24

25 Spline models: Various spline models. Green is a residuum for Akima spline model.

26 CR influences by number of nodes in model (Voltage, 40kB)

27 CR influences by number of nodes in model (power, 40kB)

28

29 Modely a Metriky - Běžně používané vztahy Residual Sum of Squares RSS = n = n (f (x i ) y i ) 2 ei 2 i=1 i=1 Root Mean Square Error RMSE = n MSE = (f (x i ) y i ) 2 1 n Mean Average Error n MAE = 1 n f (x i ) y i i=1 i=1

30 Akima Spline Model: removal of node and its influence in the surrounding intervals.

31 Proposed LogHist metric - motivation and definition For CR number of bits per symbol is crucial. LogHist splits symbols by code of its MSB. Sum of symbols in each group, multiplied by groups weight represents the weight of the evaluated interval and still is much less computationaly demanding than any LZ compression. Max LogHist = Hist i (int)(log N (i) + 1), i=0 where i is symbol code (ASCII, UTF), N defines (logarithmic) order of histogram bin and Hist i is a number of occurencies of symbol i in evaluated interval.

32 LogHist is not limited by number of input intervals.

33

34 Overview and Compression - key technique for savings, potentialy faster processing» not yet broadly accepted Different algorithms and data was benchmarked - specifics and limits have been identified Data from an existing implementation of SMPQ analyser have been used in our experiments - EN 50160(S) LZMA algorithm most effective existing solution for real applications. And it is still possible to improve it... Some described techniques are already implemented in ENVIS app: (SQL DB, CEA)

35 LZMA optimalizations Order and encoding is important for CR Before compression use any technique to limit the distribution of symbols Solution: prediction modeling + residuum Best results spline, agregation model and relation model s (interval spliting): LogHist metrics for affective buildup of models LZMA + agregation model with LogHist intervals + gradient optimization + diferential encoding

36 Emerging topics cluster examples Local model optimization towards the actual dictionary Timewise DB aggregation - ready in the next ENVIS software update Dynamic visualisation - use model instead of raw data Use modelling for data mining, statistical analysis and clustering

37 Thanks for your kind attention. I ll be glad to hear your opinions. Any comments regarding the advanced evaluation of PQ data, variable load/supply monitoring, general energy efficiency evaluation and archive processing is of my primar interrest. I will be also glad to later discuss possibilities for automatic data analysis and data mining in our "smart"world. Contact: Jan Kraus, TU in Liberec, jan.kraus@tul.cz

38 Harm. components of voltage, relative to EN limit. absolutne Jan Kraus, Pavel Šte pán, Leoš Kukac ka

39 3D scatterplot - cluster P, Q, a THD i. R and D

40 Scatter 4 quantities: P, Q, THD i, čas. R and D

OPTIMAL DATA COMPRESSION TECHNIQUES FOR SMART GRID AND POWER QUALITY TREND DATA

OPTIMAL DATA COMPRESSION TECHNIQUES FOR SMART GRID AND POWER QUALITY TREND DATA OPTIMAL DATA COMPRESSION TECHNIQUES FOR SMART GRID AND POWER QUALITY TREND DATA Jan Kraus Faculty of Mechatronics Technical University of Liberec Liberec, Czech republic Email: jan.kraus@tul.cz Pavel Štěpán

More information

Smart Continuous Grid Monitoring Power Quality and Power Consumption

Smart Continuous Grid Monitoring Power Quality and Power Consumption Smart Continuous Grid Monitoring Power Quality and Power Consumption KMB systems, s.r.o Measuring, monitoring and evaluation of power quality Tools and accesories for power factor control 2011 Product

More information

Stereo Image Compression

Stereo Image Compression Stereo Image Compression Deepa P. Sundar, Debabrata Sengupta, Divya Elayakumar {deepaps, dsgupta, divyae}@stanford.edu Electrical Engineering, Stanford University, CA. Abstract In this report we describe

More information

So, what is data compression, and why do we need it?

So, what is data compression, and why do we need it? In the last decade we have been witnessing a revolution in the way we communicate 2 The major contributors in this revolution are: Internet; The explosive development of mobile communications; and The

More information

General Oscillogram fw. module

General Oscillogram fw. module KMB systems, s. r. o. Dr. M. Horákové 559, 460 06 Liberec 7, Czech Republic Tel. +420 485 130 314, Fax +420 482 736 896 E-mail: kmb@kmb.cz, Web: www.kmb.cz General Oscillogram fw. module User Guide Preliminary

More information

Data Compression Fundamentals

Data Compression Fundamentals 1 Data Compression Fundamentals Touradj Ebrahimi Touradj.Ebrahimi@epfl.ch 2 Several classifications of compression methods are possible Based on data type :» Generic data compression» Audio compression»

More information

CS/COE 1501

CS/COE 1501 CS/COE 1501 www.cs.pitt.edu/~lipschultz/cs1501/ Compression What is compression? Represent the same data using less storage space Can get more use out a disk of a given size Can get more use out of memory

More information

CS/COE 1501

CS/COE 1501 CS/COE 1501 www.cs.pitt.edu/~nlf4/cs1501/ Compression What is compression? Represent the same data using less storage space Can get more use out a disk of a given size Can get more use out of memory E.g.,

More information

Simple variant of coding with a variable number of symbols and fixlength codewords.

Simple variant of coding with a variable number of symbols and fixlength codewords. Dictionary coding Simple variant of coding with a variable number of symbols and fixlength codewords. Create a dictionary containing 2 b different symbol sequences and code them with codewords of length

More information

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression IMAGE COMPRESSION Image Compression Why? Reducing transportation times Reducing file size A two way event - compression and decompression 1 Compression categories Compression = Image coding Still-image

More information

DCT Based, Lossy Still Image Compression

DCT Based, Lossy Still Image Compression DCT Based, Lossy Still Image Compression NOT a JPEG artifact! Lenna, Playboy Nov. 1972 Lena Soderberg, Boston, 1997 Nimrod Peleg Update: April. 2009 http://www.lenna.org/ Image Compression: List of Topics

More information

Analyzing Data Properties using Statistical Sampling Techniques

Analyzing Data Properties using Statistical Sampling Techniques Analyzing Data Properties using Statistical Sampling Techniques Illustrated on Scientific File Formats and Compression Features Julian M. Kunkel kunkel@dkrz.de June 2, 2016 Outline 1 Introduction 2 Exploring

More information

Reduction Deliverable Intelligent V2V and V2I communication

Reduction Deliverable Intelligent V2V and V2I communication Reduction 2011-2014 Deliverable 1.4.1 Intelligent V2V and V2I communication Test report with communication between the Bluetooth detectors and the incar system or smart phones 26-08- 2013 Public Document

More information

Multimedia Networking ECE 599

Multimedia Networking ECE 599 Multimedia Networking ECE 599 Prof. Thinh Nguyen School of Electrical Engineering and Computer Science Based on B. Lee s lecture notes. 1 Outline Compression basics Entropy and information theory basics

More information

Compression of Stereo Images using a Huffman-Zip Scheme

Compression of Stereo Images using a Huffman-Zip Scheme Compression of Stereo Images using a Huffman-Zip Scheme John Hamann, Vickey Yeh Department of Electrical Engineering, Stanford University Stanford, CA 94304 jhamann@stanford.edu, vickey@stanford.edu Abstract

More information

Image and Video Coding I: Fundamentals

Image and Video Coding I: Fundamentals Image and Video Coding I: Fundamentals Thomas Wiegand Technische Universität Berlin T. Wiegand (TU Berlin) Image and Video Coding Organization Vorlesung: Donnerstag 10:15-11:45 Raum EN-368 Material: http://www.ic.tu-berlin.de/menue/studium_und_lehre/

More information

Technical lossless / near lossless data compression

Technical lossless / near lossless data compression Technical lossless / near lossless data compression Nigel Atkinson (Met Office, UK) ECMWF/EUMETSAT NWP SAF Workshop 5-7 Nov 2013 Contents Survey of file compression tools Studies for AVIRIS imager Study

More information

Lossless Compression Algorithms

Lossless Compression Algorithms Multimedia Data Compression Part I Chapter 7 Lossless Compression Algorithms 1 Chapter 7 Lossless Compression Algorithms 1. Introduction 2. Basics of Information Theory 3. Lossless Compression Algorithms

More information

Data Analyst Nanodegree Syllabus

Data Analyst Nanodegree Syllabus Data Analyst Nanodegree Syllabus Discover Insights from Data with Python, R, SQL, and Tableau Before You Start Prerequisites : In order to succeed in this program, we recommend having experience working

More information

STUDY OF VARIOUS DATA COMPRESSION TOOLS

STUDY OF VARIOUS DATA COMPRESSION TOOLS STUDY OF VARIOUS DATA COMPRESSION TOOLS Divya Singh [1], Vimal Bibhu [2], Abhishek Anand [3], Kamalesh Maity [4],Bhaskar Joshi [5] Senior Lecturer, Department of Computer Science and Engineering, AMITY

More information

metrum PQ 110 Power Quality Unit, Class A Normative reference instrument for MV substation monitoring

metrum PQ 110 Power Quality Unit, Class A Normative reference instrument for MV substation monitoring metrum PQ 110 Power Quality Unit, Class A Normative reference instrument for MV substation monitoring Background The Metrum PQ instrument range are specially designed for normative measurements in the

More information

MIGRATORY COMPRESSION Coarse-grained Data Reordering to Improve Compressibility

MIGRATORY COMPRESSION Coarse-grained Data Reordering to Improve Compressibility MIGRATORY COMPRESSION Coarse-grained Data Reordering to Improve Compressibility Xing Lin *, Guanlin Lu, Fred Douglis, Philip Shilane, Grant Wallace * University of Utah EMC Corporation Data Protection

More information

XML Description Schema for Power Quality Data

XML Description Schema for Power Quality Data XML Description Schema for Power Quality J.P. Braun, V.J. Gosbell, I.S. Burnett School of Electrical, Computer and Telecommunications Engineering, University of Wollongong jbraun@ieee.org Abstract The

More information

Research Article Forecasting SPEI and SPI Drought Indices Using the Integrated Artificial Neural Networks

Research Article Forecasting SPEI and SPI Drought Indices Using the Integrated Artificial Neural Networks Computational Intelligence and Neuroscience Volume 2016, Article ID 3868519, 17 pages http://dx.doi.org/10.1155/2016/3868519 Research Article Forecasting SPEI and SPI Drought Indices Using the Integrated

More information

Hyper Text Transfer Protocol Compression

Hyper Text Transfer Protocol Compression Hyper Text Transfer Protocol Compression Dr.Khalaf Khatatneh, Professor Dr. Ahmed Al-Jaber, and Asma a M. Khtoom Abstract This paper investigates HTTP post request compression approach. The most common

More information

ENVIS Application User Guide Version 1.8 for Supported Measuring Instruments. K M B systems s.r.o

ENVIS Application User Guide Version 1.8 for Supported Measuring Instruments. K M B systems s.r.o ENVIS Application User Guide Version 1.8 for Supported Measuring Instruments K M B systems s.r.o September 5, 2017 1 ENVIS software is distributed alongside the advanced PQ monitors and energy analyzers

More information

Run-Length and Markov Compression Revisited in DNA Sequences

Run-Length and Markov Compression Revisited in DNA Sequences Run-Length and Markov Compression Revisited in DNA Sequences Saurabh Kadekodi M.S. Computer Science saurabhkadekodi@u.northwestern.edu Efficient and economical storage of DNA sequences has always been

More information

ROOT I/O compression algorithms. Oksana Shadura, Brian Bockelman University of Nebraska-Lincoln

ROOT I/O compression algorithms. Oksana Shadura, Brian Bockelman University of Nebraska-Lincoln ROOT I/O compression algorithms Oksana Shadura, Brian Bockelman University of Nebraska-Lincoln Introduction Compression Algorithms 2 Compression algorithms Los Reduces size by permanently eliminating certain

More information

Exporting the DS8000 Performance Summary

Exporting the DS8000 Performance Summary Exporting the DS8000 Performance Summary Paul Spagnolo pgspagn@us.ibm.com Storage ATS IBM Washington Systems Center - Storage Agenda What is the DS8000 Performance Summary? Offloading the Performance Summary

More information

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

2014 Summer School on MPEG/VCEG Video. Video Coding Concept 2014 Summer School on MPEG/VCEG Video 1 Video Coding Concept Outline 2 Introduction Capture and representation of digital video Fundamentals of video coding Summary Outline 3 Introduction Capture and representation

More information

Voltage Transducer UMT516 / MT516

Voltage Transducer UMT516 / MT516 Voltage Transducer UMT516 / MT516 True RMS AC voltage measurements Voltage auto range measurements up to 600V # Wide frequency measurement range 16 400 Hz High accuracy class 0.2 (IEC-688), 0.1 on communication

More information

SSIM based image quality assessment for vector quantization based lossy image compression using LZW coding

SSIM based image quality assessment for vector quantization based lossy image compression using LZW coding Available online at www.ganpatuniversity.ac.in University Journal of Research ISSN (Online) 0000 0000, ISSN (Print) 0000 0000 SSIM based image quality assessment for vector quantization based lossy image

More information

David Rappaport School of Computing Queen s University CANADA. Copyright, 1996 Dale Carnegie & Associates, Inc.

David Rappaport School of Computing Queen s University CANADA. Copyright, 1996 Dale Carnegie & Associates, Inc. David Rappaport School of Computing Queen s University CANADA Copyright, 1996 Dale Carnegie & Associates, Inc. Data Compression There are two broad categories of data compression: Lossless Compression

More information

THE NEXT TRANSDUCER GENERATION

THE NEXT TRANSDUCER GENERATION THE NEXT TRANSDUCER GENERATION ADJUSTABLE HIGH-ACCURACY HEAVY-CURRENT SENSORS SINEAX DM5S/DM5F Heavy-current sensor for grid monitoring. SINEAX DM5S and SINEAX DM5F are freeprogrammable universal measurement

More information

Rate Distortion Optimization in Video Compression

Rate Distortion Optimization in Video Compression Rate Distortion Optimization in Video Compression Xue Tu Dept. of Electrical and Computer Engineering State University of New York at Stony Brook 1. Introduction From Shannon s classic rate distortion

More information

Brotli Compression Algorithm outline of a specification

Brotli Compression Algorithm outline of a specification Brotli Compression Algorithm outline of a specification Overview Structure of backward reference commands Encoding of commands Encoding of distances Encoding of Huffman codes Block splitting Context modeling

More information

Audio Coding and MP3

Audio Coding and MP3 Audio Coding and MP3 contributions by: Torbjørn Ekman What is Sound? Sound waves: 20Hz - 20kHz Speed: 331.3 m/s (air) Wavelength: 165 cm - 1.65 cm 1 Analogue audio frequencies: 20Hz - 20kHz mono: x(t)

More information

Discover. BlackBox G4400. Outstanding Features. Designed for Your Needs. PQZIP Compression Technology. Elspec s Unique Technology

Discover. BlackBox G4400. Outstanding Features. Designed for Your Needs. PQZIP Compression Technology. Elspec s Unique Technology BlackBox G4400 Designed for Your Needs Discover Outstanding Features A comprehensive energy management and power monitoring program is the key to success for any energy provider/consumer, regardless their

More information

Data Compression. Media Signal Processing, Presentation 2. Presented By: Jahanzeb Farooq Michael Osadebey

Data Compression. Media Signal Processing, Presentation 2. Presented By: Jahanzeb Farooq Michael Osadebey Data Compression Media Signal Processing, Presentation 2 Presented By: Jahanzeb Farooq Michael Osadebey What is Data Compression? Definition -Reducing the amount of data required to represent a source

More information

CS 260: Seminar in Computer Science: Multimedia Networking

CS 260: Seminar in Computer Science: Multimedia Networking CS 260: Seminar in Computer Science: Multimedia Networking Jiasi Chen Lectures: MWF 4:10-5pm in CHASS http://www.cs.ucr.edu/~jiasi/teaching/cs260_spring17/ Multimedia is User perception Content creation

More information

UMG 20CM. 20 Channel Branch Circuit Monitoring Device with RCM

UMG 20CM. 20 Channel Branch Circuit Monitoring Device with RCM 20 Channel Branch Circuit Monitoring Device with RCM RCM Harmonics via analysis channel Alarm management GridVis Analysis software 20 current channels Interfaces / communication RS485 RTU Accuracy of measurement

More information

Tradeoffs in XML Database Compression

Tradeoffs in XML Database Compression Tradeoffs in XML Database Compression James Cheney University of Edinburgh Data Compression Conference March 30, 2006 Tradeoffs in XML Database Compression p.1/22 XML Compression XML: a format for tree-structured

More information

7.5 Dictionary-based Coding

7.5 Dictionary-based Coding 7.5 Dictionary-based Coding LZW uses fixed-length code words to represent variable-length strings of symbols/characters that commonly occur together, e.g., words in English text LZW encoder and decoder

More information

Basic Compression Library

Basic Compression Library Basic Compression Library Manual API version 1.2 July 22, 2006 c 2003-2006 Marcus Geelnard Summary This document describes the algorithms used in the Basic Compression Library, and how to use the library

More information

Image and Video Coding I: Fundamentals

Image and Video Coding I: Fundamentals Image and Video Coding I: Fundamentals Heiko Schwarz Freie Universität Berlin Fachbereich Mathematik und Informatik H. Schwarz (FU Berlin) Image and Video Coding Organization Vorlesung: Montag 14:15-15:45

More information

HYBRID TRANSFORMATION TECHNIQUE FOR IMAGE COMPRESSION

HYBRID TRANSFORMATION TECHNIQUE FOR IMAGE COMPRESSION 31 st July 01. Vol. 41 No. 005-01 JATIT & LLS. All rights reserved. ISSN: 199-8645 www.jatit.org E-ISSN: 1817-3195 HYBRID TRANSFORMATION TECHNIQUE FOR IMAGE COMPRESSION 1 SRIRAM.B, THIYAGARAJAN.S 1, Student,

More information

Comparative Study between DCT and Wavelet Transform Based Image Compression Algorithm

Comparative Study between DCT and Wavelet Transform Based Image Compression Algorithm IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 1, Ver. II (Jan Feb. 2015), PP 53-57 www.iosrjournals.org Comparative Study between DCT and Wavelet

More information

WIRE/WIRELESS SENSOR NETWORKS USING K-RLE ALGORITHM FOR A LOW POWER DATA COMPRESSION

WIRE/WIRELESS SENSOR NETWORKS USING K-RLE ALGORITHM FOR A LOW POWER DATA COMPRESSION WIRE/WIRELESS SENSOR NETWORKS USING K-RLE ALGORITHM FOR A LOW POWER DATA COMPRESSION V.KRISHNAN1, MR. R.TRINADH 2 1 M. Tech Student, 2 M. Tech., Assistant Professor, Dept. Of E.C.E, SIR C.R. Reddy college

More information

Recent developments in Zip-Ada. Part 1: Overview; new Deflate compression algorithm Part 2: New LZMA compression algorithm. Dr Gautier de Montmollin

Recent developments in Zip-Ada. Part 1: Overview; new Deflate compression algorithm Part 2: New LZMA compression algorithm. Dr Gautier de Montmollin Part 1: Overview; new Deflate compression algorithm Part 2: New LZMA compression algorithm Dr Gautier de Montmollin Swiss Ada Event 2017, Rapperswil, 2 Overview - what is Zip-Ada? Zip-Ada is a programming

More information

Opera Web Browser Archive - FTP Site Statistics. Top 20 Directories Sorted by Disk Space

Opera Web Browser Archive - FTP Site Statistics. Top 20 Directories Sorted by Disk Space Property Value FTP Server ftp.opera.com Description Opera Web Browser Archive Country United States Scan Date 04/Nov/2015 Total Dirs 1,557 Total Files 2,211 Total Data 43.83 GB Top 20 Directories Sorted

More information

LZ UTF8. LZ UTF8 is a practical text compression library and stream format designed with the following objectives and properties:

LZ UTF8. LZ UTF8 is a practical text compression library and stream format designed with the following objectives and properties: LZ UTF8 LZ UTF8 is a practical text compression library and stream format designed with the following objectives and properties: 1. Compress UTF 8 and 7 bit ASCII strings only. No support for arbitrary

More information

Introduction to Compression. Norm Zeck

Introduction to Compression. Norm Zeck Introduction to Compression 2 Vita BSEE University of Buffalo (Microcoded Computer Architecture) MSEE University of Rochester (Thesis: CMOS VLSI Design) Retired from Palo Alto Research Center (PARC), a

More information

Intro. To Multimedia Engineering Lossless Compression

Intro. To Multimedia Engineering Lossless Compression Intro. To Multimedia Engineering Lossless Compression Kyoungro Yoon yoonk@konkuk.ac.kr 1/43 Contents Introduction Basics of Information Theory Run-Length Coding Variable-Length Coding (VLC) Dictionary-based

More information

Ch. 2: Compression Basics Multimedia Systems

Ch. 2: Compression Basics Multimedia Systems Ch. 2: Compression Basics Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Outline Why compression? Classification Entropy and Information

More information

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University

ECE 499/599 Data Compression & Information Theory. Thinh Nguyen Oregon State University ECE 499/599 Data Compression & Information Theory Thinh Nguyen Oregon State University Adminstrivia Office Hours TTh: 2-3 PM Kelley Engineering Center 3115 Class homepage http://www.eecs.orst.edu/~thinhq/teaching/ece499/spring06/spring06.html

More information

Performance Benchmark and Capacity Planning. Version: 7.3

Performance Benchmark and Capacity Planning. Version: 7.3 Performance Benchmark and Capacity Planning Version: 7.3 Copyright 215 Intellicus Technologies This document and its content is copyrighted material of Intellicus Technologies. The content may not be copied

More information

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201

Source Coding Basics and Speech Coding. Yao Wang Polytechnic University, Brooklyn, NY11201 Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY1121 http://eeweb.poly.edu/~yao Outline Why do we need to compress speech signals Basic components in a source coding

More information

Processing a Trillion Cells per Mouse Click

Processing a Trillion Cells per Mouse Click Processing a Trillion Cells per Mouse Click Common Sense 13/01 21.3.2013 Alex Hall, Google Zurich Olaf Bachmann, Robert Buessow, Silviu Ganceanu, Marc Nunkesser Outline of the Talk AdSpam team at Google

More information

EE67I Multimedia Communication Systems Lecture 4

EE67I Multimedia Communication Systems Lecture 4 EE67I Multimedia Communication Systems Lecture 4 Lossless Compression Basics of Information Theory Compression is either lossless, in which no information is lost, or lossy in which information is lost.

More information

A Methodology to Detect Most Effective Compression Technique Based on Time Complexity Cloud Migration for High Image Data Load

A Methodology to Detect Most Effective Compression Technique Based on Time Complexity Cloud Migration for High Image Data Load AUSTRALIAN JOURNAL OF BASIC AND APPLIED SCIENCES ISSN:1991-8178 EISSN: 2309-8414 Journal home page: www.ajbasweb.com A Methodology to Detect Most Effective Compression Technique Based on Time Complexity

More information

Lossless Compression of Breast Tomosynthesis Objects Maximize DICOM Transmission Speed and Review Performance and Minimize Storage Space

Lossless Compression of Breast Tomosynthesis Objects Maximize DICOM Transmission Speed and Review Performance and Minimize Storage Space Lossless Compression of Breast Tomosynthesis Objects Maximize DICOM Transmission Speed and Review Performance and Minimize Storage Space David A. Clunie PixelMed Publishing What is Breast Tomosynthesis?

More information

File Fragment Encoding Classification: An Empirical Approach

File Fragment Encoding Classification: An Empirical Approach DIGITAL FORENSIC RESEARCH CONFERENCE File Fragment Encoding Classification: An Empirical Approach By Vassil Roussev and Candice Quates Presented At The Digital Forensic Research Conference DFRWS 2013 USA

More information

PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation. Simple Linear Regression Software: Stata v 10.1

PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation. Simple Linear Regression Software: Stata v 10.1 PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation Simple Linear Regression Software: Stata v 10.1 Emergency Calls to the New York Auto Club Source: Chatterjee, S; Handcock MS and

More information

An Oracle White Paper September Oracle Utilities Meter Data Management Demonstrates Extreme Performance on Oracle Exadata/Exalogic

An Oracle White Paper September Oracle Utilities Meter Data Management Demonstrates Extreme Performance on Oracle Exadata/Exalogic An Oracle White Paper September 2011 Oracle Utilities Meter Data Management 2.0.1 Demonstrates Extreme Performance on Oracle Exadata/Exalogic Introduction New utilities technologies are bringing with them

More information

CS570: Introduction to Data Mining

CS570: Introduction to Data Mining CS570: Introduction to Data Mining Fall 2013 Reading: Chapter 3 Han, Chapter 2 Tan Anca Doloc-Mihu, Ph.D. Some slides courtesy of Li Xiong, Ph.D. and 2011 Han, Kamber & Pei. Data Mining. Morgan Kaufmann.

More information

Daala: One year later

Daala: One year later Daala: One year later Timothy B. Terriberry Original Plan Finish Daala by the end of 2015 This obviously ain t gonna happen 2 Original Plan Finish Daala by the end of 2015 This obviously ain t gonna happen

More information

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value.

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value. Calibration OVERVIEW... 2 INTRODUCTION... 2 CALIBRATION... 3 ANOTHER REASON FOR CALIBRATION... 4 CHECKING THE CALIBRATION OF A REGRESSION... 5 CALIBRATION IN SIMPLE REGRESSION (DISPLAY.JMP)... 5 TESTING

More information

CS 335 Graphics and Multimedia. Image Compression

CS 335 Graphics and Multimedia. Image Compression CS 335 Graphics and Multimedia Image Compression CCITT Image Storage and Compression Group 3: Huffman-type encoding for binary (bilevel) data: FAX Group 4: Entropy encoding without error checks of group

More information

Data Warehousing & Data Mining

Data Warehousing & Data Mining Data Warehousing & Data Mining Wolf-Tilo Balke Kinda El Maarry Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Summary Last week: Logical Model: Cubes,

More information

HARDWARE IMPLEMENTATION OF LOSSLESS LZMA DATA COMPRESSION ALGORITHM

HARDWARE IMPLEMENTATION OF LOSSLESS LZMA DATA COMPRESSION ALGORITHM HARDWARE IMPLEMENTATION OF LOSSLESS LZMA DATA COMPRESSION ALGORITHM Parekar P. M. 1, Thakare S. S. 2 1,2 Department of Electronics and Telecommunication Engineering, Amravati University Government College

More information

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology Course Presentation Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology Image Compression Basics Large amount of data in digital images File size

More information

Box-Cox Transformation for Simple Linear Regression

Box-Cox Transformation for Simple Linear Regression Chapter 192 Box-Cox Transformation for Simple Linear Regression Introduction This procedure finds the appropriate Box-Cox power transformation (1964) for a dataset containing a pair of variables that are

More information

Final Review. Image Processing CSE 166 Lecture 18

Final Review. Image Processing CSE 166 Lecture 18 Final Review Image Processing CSE 166 Lecture 18 Topics covered Basis vectors Matrix based transforms Wavelet transform Image compression Image watermarking Morphological image processing Segmentation

More information

Reversible Wavelets for Embedded Image Compression. Sri Rama Prasanna Pavani Electrical and Computer Engineering, CU Boulder

Reversible Wavelets for Embedded Image Compression. Sri Rama Prasanna Pavani Electrical and Computer Engineering, CU Boulder Reversible Wavelets for Embedded Image Compression Sri Rama Prasanna Pavani Electrical and Computer Engineering, CU Boulder pavani@colorado.edu APPM 7400 - Wavelets and Imaging Prof. Gregory Beylkin -

More information

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Project Title: Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Midterm Report CS 584 Multimedia Communications Submitted by: Syed Jawwad Bukhari 2004-03-0028 About

More information

Chapter 1. Digital Data Representation and Communication. Part 2

Chapter 1. Digital Data Representation and Communication. Part 2 Chapter 1. Digital Data Representation and Communication Part 2 Compression Digital media files are usually very large, and they need to be made smaller compressed Without compression Won t have storage

More information

Chapter 7 Lossless Compression Algorithms

Chapter 7 Lossless Compression Algorithms Chapter 7 Lossless Compression Algorithms 7.1 Introduction 7.2 Basics of Information Theory 7.3 Run-Length Coding 7.4 Variable-Length Coding (VLC) 7.5 Dictionary-based Coding 7.6 Arithmetic Coding 7.7

More information

Seagate Enterprise SATA SSD with DuraWrite Technology Competitive Evaluation

Seagate Enterprise SATA SSD with DuraWrite Technology Competitive Evaluation August 2018 Seagate Enterprise SATA SSD with DuraWrite Technology Competitive Seagate Enterprise SATA SSDs with DuraWrite Technology have the best performance for compressible Database, Cloud, VDI Software

More information

An Asymmetric, Semi-adaptive Text Compression Algorithm

An Asymmetric, Semi-adaptive Text Compression Algorithm An Asymmetric, Semi-adaptive Text Compression Algorithm Harry Plantinga Department of Computer Science University of Pittsburgh Pittsburgh, PA 15260 planting@cs.pitt.edu Abstract A new heuristic for text

More information

Zeppelin quick start guide

Zeppelin quick start guide Zeppelin quick start guide 1. Preface Our Big Data Machine Learning Service (BDMLS) is built on the Hadoop platform. It primarily provides big data analysis algorithm via Apache Spark. We pack Spark algorithm

More information

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK Professor Laurence S. Dooley School of Computing and Communications Milton Keynes, UK How many bits required? 2.4Mbytes 84Kbytes 9.8Kbytes 50Kbytes Data Information Data and information are NOT the same!

More information

Understanding Your Virtual Workload. Irfan Ahmad CTO CloudPhysics

Understanding Your Virtual Workload. Irfan Ahmad CTO CloudPhysics What s PRESENTATION Your Shape? TITLE GOES 5 HERE Steps to Understanding Your Virtual Workload Irfan Ahmad CTO CloudPhysics SNIA Legal Notice The material contained in this tutorial is copyrighted by the

More information

Source coding and compression

Source coding and compression Computer Mathematics Week 5 Source coding and compression College of Information Science and Engineering Ritsumeikan University last week binary representations of signed numbers sign-magnitude, biased

More information

Making the Most of Hadoop with Optimized Data Compression (and Boost Performance) Mark Cusack. Chief Architect RainStor

Making the Most of Hadoop with Optimized Data Compression (and Boost Performance) Mark Cusack. Chief Architect RainStor Making the Most of Hadoop with Optimized Data Compression (and Boost Performance) Mark Cusack Chief Architect RainStor Agenda Importance of Hadoop + data compression Data compression techniques Compression,

More information

Lossless compression II

Lossless compression II Lossless II D 44 R 52 B 81 C 84 D 86 R 82 A 85 A 87 A 83 R 88 A 8A B 89 A 8B Symbol Probability Range a 0.2 [0.0, 0.2) e 0.3 [0.2, 0.5) i 0.1 [0.5, 0.6) o 0.2 [0.6, 0.8) u 0.1 [0.8, 0.9)! 0.1 [0.9, 1.0)

More information

MCS-375: Algorithms: Analysis and Design Handout #G2 San Skulrattanakulchai Gustavus Adolphus College Oct 21, Huffman Codes

MCS-375: Algorithms: Analysis and Design Handout #G2 San Skulrattanakulchai Gustavus Adolphus College Oct 21, Huffman Codes MCS-375: Algorithms: Analysis and Design Handout #G2 San Skulrattanakulchai Gustavus Adolphus College Oct 21, 2016 Huffman Codes CLRS: Ch 16.3 Ziv-Lempel is the most popular compression algorithm today.

More information

Application of Clustering Techniques to Energy Data to Enhance Analysts Productivity

Application of Clustering Techniques to Energy Data to Enhance Analysts Productivity Application of Clustering Techniques to Energy Data to Enhance Analysts Productivity Wendy Foslien, Honeywell Labs Valerie Guralnik, Honeywell Labs Steve Harp, Honeywell Labs William Koran, Honeywell Atrium

More information

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2017

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2017 CPSC 340: Machine Learning and Data Mining Principal Component Analysis Fall 2017 Assignment 3: 2 late days to hand in tonight. Admin Assignment 4: Due Friday of next week. Last Time: MAP Estimation MAP

More information

Core learning in mathematics: links to the 1999 Framework for teaching mathematics

Core learning in mathematics: links to the 1999 Framework for teaching mathematics Core learning in mathematics: links to the 1999 Framework for teaching mathematics A blue box on the right shows that there is no equivalent in the new objectives. A blue box on the left shows that there

More information

Data Mining Lecture 8: Decision Trees

Data Mining Lecture 8: Decision Trees Data Mining Lecture 8: Decision Trees Jo Houghton ECS Southampton March 8, 2019 1 / 30 Decision Trees - Introduction A decision tree is like a flow chart. E. g. I need to buy a new car Can I afford it?

More information

Excel for Gen Chem General Chemistry Laboratory September 15, 2014

Excel for Gen Chem General Chemistry Laboratory September 15, 2014 Excel for Gen Chem General Chemistry Laboratory September 15, 2014 Excel is a ubiquitous data analysis software. Mastery of Excel can help you succeed in a first job and in your further studies with expertise

More information

Video Compression An Introduction

Video Compression An Introduction Video Compression An Introduction The increasing demand to incorporate video data into telecommunications services, the corporate environment, the entertainment industry, and even at home has made digital

More information

University of Osnabruck - FTP Site Statistics. Top 20 Directories Sorted by Disk Space

University of Osnabruck - FTP Site Statistics. Top 20 Directories Sorted by Disk Space University of Osnabruck - FTP Site Statistics Property Value FTP Server ftp.usf.uni-osnabrueck.de Description University of Osnabruck Country Germany Scan Date 17/May/2014 Total Dirs 29 Total Files 92

More information

CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM

CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM CHAPTER 4 STOCK PRICE PREDICTION USING MODIFIED K-NEAREST NEIGHBOR (MKNN) ALGORITHM 4.1 Introduction Nowadays money investment in stock market gains major attention because of its dynamic nature. So the

More information

Algebra 1, 4th 4.5 weeks

Algebra 1, 4th 4.5 weeks The following practice standards will be used throughout 4.5 weeks:. Make sense of problems and persevere in solving them.. Reason abstractly and quantitatively. 3. Construct viable arguments and critique

More information

Compression in Open Source Databases. Peter Zaitsev April 20, 2016

Compression in Open Source Databases. Peter Zaitsev April 20, 2016 Compression in Open Source Databases Peter Zaitsev April 20, 2016 About the Talk 2 A bit of the History Approaches to Data Compression What some of the popular systems implement 2 Lets Define The Term

More information

Embedded lossless audio coding using linear prediction and cascade coding

Embedded lossless audio coding using linear prediction and cascade coding University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2005 Embedded lossless audio coding using linear prediction and

More information

AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE

AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE AN OVERVIEW AND EXPLORATION OF JMP A DATA DISCOVERY SYSTEM IN DAIRY SCIENCE A.P. Ruhil and Tara Chand National Dairy Research Institute, Karnal-132001 JMP commonly pronounced as Jump is a statistical software

More information

Image Compression using Haar Wavelet Transform and Huffman Coding

Image Compression using Haar Wavelet Transform and Huffman Coding Image Compression using Haar Wavelet Transform and Huffman Coding Sindhu M S, Dr. Bharathi.S.H Abstract In modern sciences there are several method of image compression techniques are exist. Huge amount

More information

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression

Data Representation. Reminders. Sound What is sound? Interpreting bits to give them meaning. Part 4: Media - Sound, Video, Compression Data Representation Interpreting bits to give them meaning Part 4: Media -, Video, Compression Notes for CSC 100 - The Beauty and Joy of Computing The University of North Carolina at Greensboro Reminders

More information