Internet Still Image and Video Formats

Size: px
Start display at page:

Download "Internet Still Image and Video Formats"

Transcription

1 Authors: Guido Heising Kai Uwe Barthel University of Applied Sciences Berlin, FB6 FHTW Berlin, FB4 Luxemburger Str. 10 Treskowallee Berlin Berlin Germany Germany gheising@epo.org barthel@fhtw-berlin.de Internet Still Image and Video Formats Nowadays it is hard to find a website without images. Even buttons and layout elements of a website consist of small bitmaps. Using images in the internet has become possible as there are sophisticated compression algorithms to keep the sizes of the images as small as possible. Video clips are still rare to be found on typical websites, but the usage of video is increasing. However opposed to still images up to now there is no commonly accepted video coding standard that has been incorporated into web-browsing applications. In addition the bandwidth that is required for the transmission of high quality videos larger than the size of a matchbox is still too high for most users having access to the internet with moderate bandwidth only (e.g. a modem). This chapter introduces some basic facts about images and videos. It explains the essential ideas of image and video compression techniques. Standards and quasi-standards are described and their features will be discussed. 1 Digital Images When images like photos or graphics are digitized (for example using a scanner) the color and the brightness information of the original image is converted to discrete digital picture elements the pixels. There are two parameters the color depth and the image resolution that control how the analog image information is converted to the digital image data (fig. 1). Fig 1: The resolution and the color depth control the quality and the size of the digitized image Due to the fact that our eyes use three kinds of cells for color perception, all colors can be described using three components or channels. Thus the information that has to be stored for each pixel consists of three values to describe the color components R, G, and B (red, green and blue). There is a standard srgb color space which is used by many scanners and other devices. If a

2 different color space is to be used, this information can be described in an ICC-profile from the international color consortium. This color profile should be joined to the image data. For grayscale images (like black and white photos) only one channel for the brightness information is needed. The number of bits that are assigned to a pixel is called the color depth and is measured in bits per pixel or bits per channel. Most computer graphic adapters use a display mode called True Color, that uses 8 bits per channel which corresponds to 24 bits per pixel. In this mode 16.7 million different colors can be described or displayed which is more than sufficient for typical digital images to be shown on computer monitors. In the prepress domain four channel images: Cyan, Magenta, Yellow and black (CMYK) are used for printing purposes. Other image types like medical images might need a higher color depth such as 12 or 16 bits per channel. A simple fax image uses only 1 bit per pixel to distinguish between black and white. If an image has only a limited number of different colors (up to 256 colors), the RGB-values colors are stored in a look up table (LUT). In this palletized image every pixels color can be described by an index referring the color from the LUT entry. By selecting a resolution the scanning device is told how many samples (pixels) are to be generated. The unit for resolution is dpi which stands for dots per inch. Typical values for scanning resolutions are in the range between 50 and 1200 dpi. If the resolution is chosen to be a high value, then the quality will be very good as even finest details of the image can be captured. However also the amount of digital data generated will be quite high. The pixel number of the image is given by the product of length and width both multiplied by the dpi number. To obtain the total number of bits for this image, the pixel number has to be multiplied by the color depth. Websites are viewed by many different users on different computer screens which have dissimilar resolutions. The actual resolution of a monitor depends on the number of pixels that can be displayed and the dimensions of the screen (measured in cm or inch). Typical computer systems have a monitor resolution of approximately 100 dpi. This should be taken into account if images are to be displayed on a website. There is no use displaying an image having more pixels than the area on the screen in which it is displayed. This may seem obvious, however many web pages can be found that use too large images. Browsers can scale images to any size, however they do not use sophisticated scaling or interpolation techniques, so that best visual results are obtained if every pixel will be used only once non-scaled. 1.1 Need for compression A typical photo (10 cm x 15 cm) scanned at a resolution of 100 dpi in true color (24 bits per pixel) will result in an uncompressed file size of 680 kb (393 rows x 590 columns = pixels x 3 Bytes). Using the fastest available modem with a bandwidth of bits/second, this image would need 100 seconds to be transmitted. If we were talking about a video clip of this size, even with a reduced frame rate of 10 fps (frames per second), a compression factor of 1000 would be needed for the 56k modem. Typical videos have frame rates of 25 (PAL) or (NTSC), the dimension of digitized video in pixels is 720x576 pixels for PAL DV and 720 x 480 for NTSC DV. These examples show how important it is to use highly efficient image compression techniques. When talking about compression three different modes should be distinguished: Lossless

3 compression does a perfect preservation of the image, every pixel is unchanged in every bit. For typical images (photos) even the best algorithms are not able to achieve a compression factor higher than two to three. Visual lossless compression is a term that was introduced by the industry trying to tell their customers that their compression technique is almost perfect. However there is no exact definition for visual lossless, usually it means that encoding errors are not noticeable to a typical viewing person. If encoding errors may be perceived depends very much on the viewing conditions and how much the viewer is trained. Typical compression factors range between three and ten. Then there is lossy compression, mostly used in the internet due to the need for very small file sizes. A lossy compressed image may look different compared to the original, however the quality is still acceptable. Needless to say that quality requirements are different in different circumstances. Typical compression factors are in the range from 10 to 100. Once a compression standard has been established, new improved ideas or algorithms can not be incorporated. However, as only the decoding part is standardized, there is always room for optimizations on the coder side, as long as the decoder can still decode the compressed data. After a standard has been settled, it usually takes some years until optimal coding settings are determined. This is particular true for video compression, as there are too many different parameter settings possible. This also means that there are JPEGs and MPEGs of similar size but different quality. It actually depends on the capability/quality of the coder used, how well an image or video is compressed. 1.2 Image format features If images are to be used in the internet several aspects have to be considered: There are hundreds of image file formats, compressed and uncompressed, however standard browsers do only support three of them. These supported image types are Compuserve GIF [1], Portable Network Graphics (PNG) [2], and the JPEG-standard from the Joint Picture Expert Group [3]. Often image file formats provide different encoding modes such as support for different color spaces, the choice between lossy and lossless compression and different transparency modes. However not all modes of these image file formats are supported by all web browsers. For example the original JPEG standard knew more than twenty different encoding modes, but only two of them are supported by standard browsers and common image viewing applications. For example there is a lossless JPEG compressing mode, which can not be used in the internet. Image file formats other than the three mentioned above can only be used if appropriate plugins are installed or a special applet image viewer is used. However none of the three image formats GIF, PNG, or JPEG is suitable for all purposes. They differ in the type of color spaces supported and other modes. The following table 1 compares the main features of these image file formats. One important distinction is the support for different color modes. Bitonal images or palletized images with a reduced number of colors are only supported by GIF and PNG. These images could however be converted to RGB images and then be stored as JPEG, but this would make these images larger than necessary. JPEG should be used for natural photographic images. The professional device independent Lab color space is not supported by any of the three formats.

4 Four channel CMYK images can be stored as JPEG, but these images should not be used in the internet, as browsers cannot display them. Table 1: Comparison of features of internet image file formats image format GIF JPEG PNG 8 / 24 color mode bitonal palletized RGB Lab CMYK features progressive transmission lossless compression lossy compression high compression efficiency *) meta data animation transparency / alpha **) ICC support random access in huge images = supported by image format & web browser = supported by image format, but not supported by web browser = supported by image format, but problems with some web browsers *) today JPEG is outperformed by newer & better algorithms like JPEG2000, which has no browser support yet **) GIF images allow one out of the possible 256 colors to be fully transparent. PNG has a much higher level of transparency support. Other distinctions between features are the available compression mode and the support for additional (meta) data or special transmission abilities. JPEG always performs a lossy compression, the desired quality can be controlled by a quality factor. GIF and PNG use lossless compression only. However, some tools can modify the image data to ease subsequent lossless compression. Compression efficiency has been permanently improved over the last years. JPEG does a very good compression job, but naturally there are far better schemes by now. All three formats have support for progressive (sometimes called interlaced) transmission of the image data, such that a reduced resolution version of the image may be seen before the entire data has been transmitted. Usually the file size increases by a few percents when using this progressive option. However, some browsers ignore this feature and do only display the image if all data has been transmitted.

5 Meta data like the author, copyright information, etc. becomes more and more important. Only JPEG supports the IPTC standard for metadata, PNG allows some annotations and comments. Usually browsers cannot display this extra information. Transparency or an alpha channel describes if the pixels of an image are fully or partially transparent. GIF images allow one out of the possible 256 colors to be fully transparent. PNG has a much higher level of transparency support, but has very reduced browser support. Transparency support is needed if the background of a website is changed very often. In order to avoid transparency information the transparent image can be overlaid on the background image/color. The resulting merged image then can be stored with no transparency even using JPEG. JPEG and PNG have limited ICC-profile support, which is ignored by most browsers. For huge images like maps or aerial views a random access to small fractions of an image would be very desirable. However none of the described formats has support for this. Usually many different small images are held on the server which then are dynamically selected. The new compression standard JPEG2000 has support for most of the abovementioned features and it does also outperform these formats in coding efficiency. Up to now there is no browser support for this new powerful standard. The choice of a particular image file format should be based on the following aspects: For photographic images JPEG is the right choice, if a lossy representation can be tolerated. If not, PNG (24 bit mode) should be used for lossless transmission. Non-natural images with only few colors should be compressed to GIF or PNG (8 bit mode). It should be noted that the PNG format does not have any patent restrictions, as opposed to GIF. Animated images have to be saved as GIF. There is an animated version of PNG called MNG, however, without support from any browser. Images with transparency need to be stored as GIF or PNG, while PNG has a much higher level of transparency support, but unfortunately is not supported by all browsers. (At a PNG transparency test web page can be found). As mentioned before there are a lot of coding tools that can differ significantly in encoding quality. PNG allows different filter settings that do affect the compressed file size. Optimal compression depends on a suitable choice of the filter type for PNG and the usage of a high quality encoding tool in the case of an JPEG image. 1.3 Image Compression Principle The core idea of image compression is to remove any redundancy or correlation that resides within the image. Redundancy is all the data, which is not important to the information. Consequently it can be removed by the coder, whereas at the decoder this data is reconstructed and added back to the image.

6 The basic idea of image compression is shown in Figure 2. In a first step all irrelevant information is discarded. This might be a too high color depth which is not needed or a too large image which is scaled down first before it is compressed and transmitted. The next step is to remove the redundancy of the image data. The efficiency of a coder depends on how good this redundancy is discovered and removed. Actually a model describes the idea, how the redundancy can be exploited. Models have become more and more elaborated over time. A very simple model could use the fact that most pixel values do not change abruptly. The value of the previously transmitted pixel can be used as a guess (a prediction) for the next pixel value. Now only the prediction error has to be transmitted or stored. If it is possible to store this prediction error more effectively than the original pixel data (with fewer bits), then there is compression. A more sophisticated model might use a better adaptive predictor taking into account more surrounding pixels. The model for JPEG compression uses the fact that it is more effective to describe an image block through a superposition of suitable basic blocks instead of transmitting it pixel by pixel. Fig 2: Block diagram of an (image) compression scheme If this model (how to remove the redundancy) is known both to coder and decoder, then there is no need to transmit the model data. However if the image data is analyzed first to adapt the model to the particular image, then some extra data (the side information) has to be transmitted to the decoder as well in order to guarantee that the same model is used at the coder and decoder side. After the redundancy reduction in a next step the remaining data can be further simplified by using a quantization process, which basically performs some sort of rounding. If a quantization is performed, this is the point where the compression gets lossy, since the decoder is not able to reverse the rounding. This is the main parameter for controlling the compression factor. As an example, let us assume that the pixel values of an 8-bit gray channel are to be coded. By employing a quantizer consisting of 2 6 =64 equal distributed values over the input range of 2 8 =256 gray values the input values can be rounded to the closest value of {2,6,10,,254}. If

7 only 2 4 =16 output values are allowed the input values can be rounded to {8,24,40,,248}. In the first case 6 bit/pixel are necessary which results in a compression factor of 8/6=1,33, whereas only 4 bit/pixel are used in the latter case leading to a compression factor of 8/4=2. Needless to say, that the losses, the quantization errors, introduced in the latter case are larger and the reconstruction quality will be worse. Finally the quantized data has to be transmitted or stored in an efficient way: Entropy encoding tries to assign short codes (bit patterns) to frequent symbols, whereas rare symbols do get longer codes (more bits). Thus, by employing a variable length code (VLC) the average code word length will be reduced compared to using a fixed length code (FLC) as assumed in the above example. At the decoder all the steps are performed in the reverse order to reconstruct the decoded image, which leads to an approximation of the original image in the case of lossy coding or an identical version in the case of lossless coding. The principle of predictive coding also known as DPCM (Differential Pulse Code Modulation) has already been coarsely described in the previous section. A very sophisticated coder using DPCM is the relative new JPEG-LS (JPEG lossless) standard also known as LOCO, however there is no browser support for it. Some other image coding approaches, especially those used by the internet image formats shall be explained in the next part of this chapter: Run length coding is one of the simplest image coding approaches particular efficient for images with a limited number of colors. Instead of repeating identical pixel values over and over, the value is reported only once, followed by the quantity of pixels of the same color (fig. 3). As an example red, red, red, blue, blue, blue, blue, could be transmitted as (red, 3); (blue,4);. This approach is in part in the Microsoft BMP file format (1,139); (2,6); (1,33); (2, 9); (1,32); (2,8); (1,32);...; (0, 400) Fig 3: Run length coding; indices are grouped and described as value followed by the number of occurrence LZW (Lempel-Ziv-Welch) is the patented coding algorithm which is used for the GIF format. It is similar to the techniques used for ZIP or COMPRESS. LZW is a sophisticated lossless compression method which analyzes the data and looks for repeating patterns within the (image) data. A library is build up during the encoding process. If a new pattern occurs in the data, this pattern will be added to the library at the coder and decoder as well. If the same pattern occurs again, then the library index can be used to describe this pattern. If LZW sees "012012", it is a clever-enough algorithm to spot the trend of alternating symbols and replace each instance with a shorter code for the index, thereby compressing the information. GIF compression does a line scan of the image data, thus only exploiting the horizontal redundancy. PNG compression can use prediction techniques (called compression filters) to

8 exploit vertical redundancy as well. In addition it uses a patent free version for the actual compression. JPEG coding splits the image into blocks of 8x8 pixels. Every block is then transformed using the discrete cosine transform (DCT), which results in energy compaction within the block. This approach corresponds to the idea that instead of describing the block on a pixel by pixel basis (fig.4, left), it is more efficient to perform a weighted overlay of only a few well chosen basis block patterns, which will generate an identical or similar block. The fixed set of basis blocks consists of 64 different cosine-shaped patterns of varying frequency, which is known at the coder and decoder side. Thus only the weights (DCT coefficients) have to be coded. Any quality can be achieved from a highly compressed coarse approximation up to a visual lossless representation by selecting the precision (quantizer) of the weights of these blocks. In the example shown in Figure 4 on the right a high precision is chosen which leads to a near lossless representation of the small block in the top right position. Fig 4: JPEG; instead of a description by pixels (left) image blocks are approximate by overlaid basis blocks The lower the contrast of a basis block, shown in the example, the closer the weight to zero and the lower the importance of that respective basis block. Therefore one can further reduce the bit rate by coding only the n largest weights of an image block out of 64. The quality of the reconstructed block often is still quite high, as can be derived from fig. 5 for n=16. There is one exception from the above stated rule as there is one basis block which does not show a contrast but in most cases does have a large weight. This is the left most block in Figure 5. It represents the mean or DC value of the image block [5]. Fig 5: JPEG; The successive overlay of basis blocks (shown in the upper line) can produce better and better approximations of any image block (shown below)

9 The DCT approach is very efficient as far as coding of natural images is concerned as in most cases the spatial variation of the image signal is rather smooth. Thus only a small number of the basis blocks will be required for the approximation, most probably those blocks in the top left corner of fig. 4 which represent lower frequencies. One mayor disadvantage of the block-based DCT scheme should be mentioned as it is often recognized in decoded images, i.e. the so called blocking artifacts. These annoying artificial block structures are related to the choice of a high compression ratio which led to a poor approximation of the image blocks. This is due to a strong rounding or the omission of DCT coefficients. In JPEG2000 overlapping basis block are employed to reduce the blocking artifacts. The DCT blocks of size 8x8 are replaced by overlapping blocks of different sizes based on a wavelet transform leading to visually improved decoded images without any blocking artifacts. 2 Video formats Accessing video via the internet will become more and more popular as the bandwidths of the internet connections will increase and the means for acquisition and processing of video data are already highly available on the consumer market. Camcorders, web cams and graphics cards with a TV tuner or video-in adapter are the devices most commonly used by consumers for bringing video into the computer. The camcorder can be connected to the computer via FireWire (IEEE 1394) for transferring the video and associated audio data. The very popular DV format is often used in digital camcorders. Its compression scheme is based on JPEG but with some modifications to better adapt to video signals. At this stage one problem may arise which is associated to interlaced video formats, like PAL or NTSC [6]. Please note that the terms interlaced and progressive have a different meaning in the context of video formats than in context of bit stream organization and transmission as introduced in the image formats section. The interlaced video formats are designed for displaying video on a TV in fields instead of full frames. There are two alternating fields, odd and even fields. An odd (even) field consists of all odd (even) numbered rows of a full frame, but odd and even fields are captured and displayed at different time instances. The advantage is that instead of having a full frame rate of 25 Hz for PAL (29.97 Hz for NTSC) a 50 Hz field rate (59.94 Hz for NTSC) is achieved without increasing the bit rate or bandwidth. Hereby flickering is reduced. Since computer monitors are designed to display progressive video formats, i.e. full frames, displaying interlaced video can introduce severe artifacts. This is especially the case if two neighboring fields are combined by simply copying the respective lines to a full frame in the presence of motion as the position of a moving object will not coincide in the combined odd and even fields. One simple way to overcome this problem is to perform an interlaced to progressive format conversion also known as deinterlacing which reduces the temporal and spatial video resolution. For example, the PAL DV interlaced format with 720x576 full frame pixels consisting of 720x288 pixels each for the odd and the even field can be converted to a full frame with 360x288 pixels at 25 Hz. This can be easily achieved by omitting one field type and omitting the missing

10 lines in the other fields as well while reducing the number of pixels of every line by a factor of two. Obviously the quality of the converted video is lower, but it is good enough for most of the internet applications. However, the amount of video data is still quite large: A video sequence in Common Intermediate Format (CIF) has 352 x 288 = pixel/frame and pixel/frame x 25 frame/s = pixel/s and pixel/s x 1.5 byte/pixel = byte/s. Please note that the number of pixels per line is reduced by 8 to cut off dark areas of a PAL signal at left and right image borders. In addition the CIF video format, which is standardized by the International Telecommunication Union (ITU), requires only 1.5 byte/pixel on average. This is due to the fact that the CIF format is not based on the RGB color format but on the YUV color format. It consists of a luminance component Y and two chrominance components U and V, each of which can be computed by a linear combination of the RGB components: Y = 0.299R G B, U = B Y, and V = R Y This is the common video color format for transmission and coding of PAL systems the weighting factors for computing the NTSC or SECAM chrominance components are slightly different [6]. There are two main reasons for utilizing the YUV format, one historical and one related to the human visual perception. The first popular video application was TV broadcast of monochrome video. Later when color TV was introduced the analog color TV signal was broadcasted in a composite video format, in which the luminance and the two chrominance components were multiplexed in order to make the signal compatible with the monochrome TV systems. In addition it was realized that the human eye is less sensitive to spatial variations of the chrominance than of the luminance. Thus the signal bandwidth can be reduced by separating chrominance from luminance and transmitting only a bandlimited chrominance resulting in a hardly visible reduction of the video quality. The CIF format employs the so called 4:2:0 format where the number of chrominance values in each of the two components is reduced by a factor of two in both the vertical and the horizontal direction. This leads on average to one U and one V value for four pixels and to the above mentioned total of 1.5 byte/pixel when using a color depth of 8 bits per channel. Since the amount of raw data of the CIF format is still too high for internet applications the video resolution can be further reduced and compression should be employed. The Quarter CIF format (QCIF) was standardized mainly for videophone-like applications and is well suited for most internet video applications. It has only half of the resolution of a CIF frame in horizontal and vertical directions (176x144) and a raw data rate of byte/s at 25 Hz. For internet applications the frame rate is often reduced to 8.33 or even 5 frames per second (fps). If an advanced video compression scheme like MPEG-4 is additionally applied the bit rate can be decreased to less than 8 kbyte/s. 2.1 Video Compression Principle In principle JPEG, like every other image compression scheme, can be used for coding of motion videos as well, simply by coding every video frame separately. This approach is simple but not as

11 efficient as it could be. In addition some flickering artifacts can be introduced when displaying a moving sequence of images although individual pictures do not show an annoying artifact. This is due to the coding artifacts related to the quantization process that change from frame to frame. These artifacts are especially visible at lower bit rates. To achieve better compression the temporal correlations between consecutive frames have to be exploited. The most commonly used approach is to perform a temporal prediction of the pixels of the current frame to be coded x (fig. 6 b) out of a previous frame y (fig. 6 a), subtract the corresponding pixels of the predicted frame from the current frame and encode the difference frame e (fig. 6 c) in a JPEG-like manner by transform coding. The difference frame shown in the example of fig. 6 c is often referred to as the prediction error frame in which the mid-gray pixels reflect areas with low prediction error and the dark and light pixels show areas with large errors. The advantage of coding the prediction error frame instead of the original is that there are normally large areas having only small errors. This will result in only a few, if any, non-zero coefficients after the transform and quantization stage that have to be encoded. a) previous frame y b) current frame x c) prediction error frame d) prediction error frame e = x - y after motion compensation Fig 6: Temporal prediction of a video frame with and without motion compensation A comparison of frames x and y shows that the image content has moved to the bottom right due to a camera pan. This leads to strong errors in the prediction error frame in textured regions and at object borders which in turn increases the bit rate required in these areas. A motion compensated prediction is often applied to further reduce the prediction error. This can be achieved by subdividing the current frame into regular, non overlapping blocks and finding a corresponding block in the previous frame which describes each block of the current frame as good as possible. These blocks are then copied for the prediction of the current frame (fig. 7). Figure 6 d shows that motion compensated prediction leads to less prediction errors. Only those areas of frame x that have new content can not be predicted properly, e.g. at the bottom of the frame. For every block a suitable motion vector has to be determined during a motion estimation process. This vector describes the relative translational motion of the block and has to be coded as additional side information. The quality of an encoder is highly dependent on the way how the motion estimation is performed. As this process is generally very time consuming most encoders allow motion vectors of lower quality which result in lower coding efficiency but faster encoding. Fortunately, the decoder does not have to perform the motion search which reduces the computational complexity a lot.

12 Fig 7: Motion compensated temporal prediction of a video frame To enable the encoder to control the video quality at the decoder side both encoder and decoder have to perform the prediction equally. Thus the last decoded frame instead of the original has to be used for prediction, since the decoder does not have the original frame. This implies that the encoder also consists of a decoder, which feeds back the last decoded frame including the coding artifacts to predict the current frame. This coding loop has a strong impact on the characteristics of the video coding scheme. Coding efficiency, i.e. the visual quality at a given bit rate, is much increased. Therefore, almost 100% of the efficient video codecs on the market are based on this scheme. Coding artifacts of the previous frame are introduced to the current frame which is not necessarily a disadvantage. Compared to the separate encoding of the frames the artifacts are temporarily more correlated thus reducing the annoying flickering artifacts. If transmission errors occur which affect the quality of a frame these errors will be propagated to the following frames. Since the decoding of a frame depends on the previous frames there is no instantaneous access to the frames in a sequence. The coding loop introduces a trade-off between coding efficiency and error robustness. The design of a video codec will stress one or the other characteristic depending on the kind of video application. I B P B P I B P B P Fig 8: Possible frame types in a video codec The frames that are coded by prediction from a previous frame are called P-frames. Obviously, the first frame of a sequence has to be coded without prediction. Therefore it is called intra or I- frame, because only information available within this frame is used for coding. A P-frame can be predicted from an I-frame or another P-frame as can be seen in figure 8, where the arrows indicate the prediction direction. There is one other frame type, namely a B-frame, which is not only predicted from one but from two frames. In most codecs a bidirectional prediction is made from one previous and one following frame which can further reduce the bit rate. B-frames introduce an additional delay as the following P-frame has to be acquisitioned, coded and

13 decoded prior to coding the current B-frame coding. Thus B-frames are not employed in delay sensitive applications such as video communication. Further I-frames will be inserted, e.g. one per second, to allow for starting the decoding process at more or less arbitrary time instances of the sequence. All frames depending on one I-frame are often grouped in a group of pictures GOP structure. 2.2 Internet video applications For most of the internet video applications the two main problems related to transmission via the internet are that the available average network bit rate is to low and that some of the video packets do not arrive timely enough to be displayed. The first problem has an impact on all internet video applications and can be reduced by video codecs with improved coding efficiency. The second problem addresses only those internet applications where simultaneous transmission and display of video is required, i.e. for live video reception and video streaming. This is a more serious problem, because loosing one packet containing for example a P-frame means that all following frames of a GOP cannot be decoded. Here very sophisticated techniques have to be combined to achieve a properly working system. These techniques might fall within the following categories: video buffer management, coder control, feed back, scalable coding, quality of service, streaming and real time protocols, error concealment. For more detailed information on this topic the reader is referred to [6]. A very common technique to improve the error robustness of video stream for transmission over bandlimited channels is to use a rate-control. The rate-control controls the encoder coding options in order to achieve at least on average a constant target bit rate (CBR). Without ratecontrol a constant quality at a variable bit rate (VBR) is achieved. Large fluctuations of the amount of bits per frame can occur due to the content of the images, the variations thereof over time and the chosen frame type. An I-frame may need a bit amount ten times higher than that of a P-frame. The rate-control can influence the bit rate by changing the accuracy of the quantization, which effects picture quality as well, or by frame-skipping. By skipping input frames at the encoder side the bit rate and temporal resolution will be reduced. The missing frames are substituted at the decoder side by repeating the respective previous frame. Some encoders provide a 2-pass coding as an optional feature to better adapt to a target bit rate. In a first pass the sequence is coded and the required bit amounts per frame for the chosen encoder parameters are evaluated. In the second pass the actual coding is done with corrected encoder setting. In the following we will concentrate on the more simple application of video downloading and successive decoding. For downloading of pre-encoded video encoding time and delay are generally not important factors. The main objective is achieving a high coding efficiency in order to allow small video file sizes and a reasonable video quality. Thus the GOP structure similar to fig. 8 including B-frames can be used. An exhaustive but time-consuming motion search can be chosen and the variable bit rate mode as well since it often leads to better visual results.

14 2.3 Video coding standards, quasi standards and file formats First of all a distinction has to be made between the coding scheme and the file format containing the bit stream. There are file formats that allow compression with a variety of different coding schemes. In addition a single file may contain more than one video and audio track. Currently there are more or less four popular internet video formats on the market, i.e. MPEG (*.mpg), Windows Media Video (*.wmv, *.wm), Real Video (*.rm) and QuickTime (*.mov). They first three have their own video coding scheme and file format whereas Apple s QuickTime format is basically a file format which can be used with several codecs. This is also true for the Windows AVI format (*.avi). A tag in the header of the file indicates the used compression scheme. It therefore might happen that a player cannot decode an AVI file found in the web since the corresponding decoder is not installed in the player. Thus the availability of players for the respective format is an important aspect for choosing an appropriate video format. Players for the four formats mentioned above are widely installed and can be downloaded for free at [7-9]. There are several players for the MPEG format, but it can also be played with the WindowsMediaPlayer. In the following some characteristics of the most popular video formats are given. Motion JPEG, an extension of the JPEG standard, is a simple but not very efficient scheme for video compression since every frame is stored as an I-frame. The very popular DV format is based on motion JPEG but with some slide modifications to better adapt to video signals. It is rarely used for internet applications. The different MPEG formats have been developed by the ISO Moving Picture Experts Group mainly for video and audio compression, storage and transmission. There is a file format specified for each version. All MPEG video coding standards are based on motion compensated prediction and the GOP structure. MPEG-1 is well suited for compression of progressive video in CIF format at rates of Mbit/s including audio. For audio compression the very popular MPEG-1 layer3 scheme can be used which is also known as MP3. It has been used for storing video on CD-ROM at a quality comparable to an analog consumer VCR. MPEG-2 has been derived from MPEG-1 but with improved performance and additional features for coding interlaced video at higher bit rates (4 15 Mbit/s) and qualities. It is widely used, e.g. for DVD and Digital Video Broadcast (DVB). Both formats are more appropriate for high bit rates but can operate at lower bit rates as well, thus they are used as an internet video format for downloading. MPEG-4 is much better suited as an internet format as it has been especially designed for low bit rates, though it covers a wide range of bit rates (8 kbit/s-1200mbit/s), video resolutions and qualities [10]. Furthermore it allows to encode arbitrarily shaped video objects, 2D- and 3Dscenes composed of natural and synthetic still image, video and audio content. BIFS (binary format for scenes) has been specified for the scene description. It was derived from VRML but allows compression of the scene description, user interaction and streaming. An introduction to the MPEG-4 multimedia standard can be found in [6] and [11]. Since the MPEG-4 standard is

15 rather new, up to now only rectangular videos compressed by MPEG-4 can be found in the internet. A new video coding standard with much better coding efficiency is standardized in It is a joint development of the ITU-T and MPEG, the H.264 MPEG-4 AVC advanced video coding standard. Although being part of MPEG-4 it is not backward compatible with MPEG-4. In comparison to MPEG-4 and MPEG-2, the bit rate at the same visual quality can be further reduced by about 40 % and 63 % respectively [12]. Unfortunately, the complexity especially at the encoder is largely increased. Due to the good performance, this coding scheme will probably become very popular. For the Windows Media Video format the Windows Media Encoder is available [7]. Since this coding scheme is a proprietary standard of Microsoft and the used coding techniques are currently hidden, it can only be decoded by the Windows Media Player. But as Microsoft participated in the development of MPEG-4 AVC some of its efficient new features might have gone in the latest Media Encoder. At least its coding efficiency is now better than that of the (older) MPEG-4. The latest Windows Media file format can also include video compressed with other codecs such as MPEG-4. The situation for the Real Video format is quite similar [8]. The video coding scheme is hidden, encoding can be performed using RealSystem Producer and decoding can only be done by RealPlayer. This is the dominant format for streaming applications. It provides slightly better qualities at low bit rates compared to the competitors. In addition it supports more platforms than Windows Media and QuickTime, not only Windows and Mac OS but also Linux and Unix. The latest RealMedia file format can also include video compressed with other codecs such as MPEG-4. Apple s QuickTime format is capable of carrying other media data than pure audio and video as well [9]. Animations, graphics, texts and even 360-degree virtual reality scenes can be stored in QuickTime format. That is the main reason why MPEG has based his MPEG-4 file format on QuickTime. Apple s QuickTime Pro package provides a variety of different video codecs. Formerly, Sorenson Video was often chosen as the preferred video codec, but this may change since the latest version of QuickTime supports MPEG-4 video. Please note that the abovementioned qualities of the video codecs only reflect the situation at the beginning of the year Every new release of a codec can lead to large improvements. On the other hand one should be aware of the fact that standardized coding schemes, like MPEG-4, are public and the format will not change. Proprietary standards will change and future players may not support older versions.

16 References [1] [2] [3] [4] D.S. Taubman, M.W. Marcellin: JPEG2000: Image compression Fundamentals, Standards and Practice. Kluwer International Series in Engineering and Computer Science, [5] R. J. Clarke: Transform coding of images. Microelectronics and Signal Processing. Academic Press, London, [6] Y. Wang; J. Ostermann, and Y.-Q. Zhang: Video Processing and Communications. Signal Processing Series. Prentice Hall, Upper Saddle River, New Jersey, [7] [8] [9] [10] ISO/IEC JTC1, Coding of audio-visual objects Part 2: Visual. ISO/IEC (MPEG-4 Visual version 1). April 1999, Amendment 1 (version 2), February [11] ISO/IEC/JTC1/SC29/WG11, "MPEG-4 Overview (Version 21)", Dokument N4668, Jeju Island, Korea, March (accessible via the official mpeg web page: [12] H. Schwarz, and Th. Wiegand: The Emerging JVT/H.26L Video Coding Standard, Proc. of IBC 2002, Amsterdam, NL, September 2002.

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS Television services in Europe currently broadcast video at a frame rate of 25 Hz. Each frame consists of two interlaced fields, giving a field rate of 50

More information

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013

ECE 417 Guest Lecture Video Compression in MPEG-1/2/4. Min-Hsuan Tsai Apr 02, 2013 ECE 417 Guest Lecture Video Compression in MPEG-1/2/4 Min-Hsuan Tsai Apr 2, 213 What is MPEG and its standards MPEG stands for Moving Picture Expert Group Develop standards for video/audio compression

More information

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression

IMAGE COMPRESSION. Image Compression. Why? Reducing transportation times Reducing file size. A two way event - compression and decompression IMAGE COMPRESSION Image Compression Why? Reducing transportation times Reducing file size A two way event - compression and decompression 1 Compression categories Compression = Image coding Still-image

More information

Wireless Communication

Wireless Communication Wireless Communication Systems @CS.NCTU Lecture 6: Image Instructor: Kate Ching-Ju Lin ( 林靖茹 ) Chap. 9 of Fundamentals of Multimedia Some reference from http://media.ee.ntu.edu.tw/courses/dvt/15f/ 1 Outline

More information

Video Compression An Introduction

Video Compression An Introduction Video Compression An Introduction The increasing demand to incorporate video data into telecommunications services, the corporate environment, the entertainment industry, and even at home has made digital

More information

Tech Note - 05 Surveillance Systems that Work! Calculating Recorded Volume Disk Space

Tech Note - 05 Surveillance Systems that Work! Calculating Recorded Volume Disk Space Tech Note - 05 Surveillance Systems that Work! Surveillance Systems Calculating required storage drive (disk space) capacity is sometimes be a rather tricky business. This Tech Note is written to inform

More information

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami to MPEG Prof. Pratikgiri Goswami Electronics & Communication Department, Shree Swami Atmanand Saraswati Institute of Technology, Surat. Outline of Topics 1 2 Coding 3 Video Object Representation Outline

More information

Digital Video Processing

Digital Video Processing Video signal is basically any sequence of time varying images. In a digital video, the picture information is digitized both spatially and temporally and the resultant pixel intensities are quantized.

More information

Image and video processing

Image and video processing Image and video processing Digital video Dr. Pengwei Hao Agenda Digital video Video compression Video formats and codecs MPEG Other codecs Web video - 2 - Digital Video Until the arrival of the Pentium

More information

Image, video and audio coding concepts. Roadmap. Rationale. Stefan Alfredsson. (based on material by Johan Garcia)

Image, video and audio coding concepts. Roadmap. Rationale. Stefan Alfredsson. (based on material by Johan Garcia) Image, video and audio coding concepts Stefan Alfredsson (based on material by Johan Garcia) Roadmap XML Data structuring Loss-less compression (huffman, LZ77,...) Lossy compression Rationale Compression

More information

Video coding. Concepts and notations.

Video coding. Concepts and notations. TSBK06 video coding p.1/47 Video coding Concepts and notations. A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds. Each image is either

More information

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology Course Presentation Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology Image Compression Basics Large amount of data in digital images File size

More information

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression

DigiPoints Volume 1. Student Workbook. Module 8 Digital Compression Digital Compression Page 8.1 DigiPoints Volume 1 Module 8 Digital Compression Summary This module describes the techniques by which digital signals are compressed in order to make it possible to carry

More information

About MPEG Compression. More About Long-GOP Video

About MPEG Compression. More About Long-GOP Video About MPEG Compression HD video requires significantly more data than SD video. A single HD video frame can require up to six times more data than an SD frame. To record such large images with such a low

More information

VC 12/13 T16 Video Compression

VC 12/13 T16 Video Compression VC 12/13 T16 Video Compression Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos Miguel Tavares Coimbra Outline The need for compression Types of redundancy

More information

Fundamentals of Video Compression. Video Compression

Fundamentals of Video Compression. Video Compression Fundamentals of Video Compression Introduction to Digital Video Basic Compression Techniques Still Image Compression Techniques - JPEG Video Compression Introduction to Digital Video Video is a stream

More information

Image and Video Coding I: Fundamentals

Image and Video Coding I: Fundamentals Image and Video Coding I: Fundamentals Thomas Wiegand Technische Universität Berlin T. Wiegand (TU Berlin) Image and Video Coding Organization Vorlesung: Donnerstag 10:15-11:45 Raum EN-368 Material: http://www.ic.tu-berlin.de/menue/studium_und_lehre/

More information

Part 1 of 4. MARCH

Part 1 of 4. MARCH Presented by Brought to You by Part 1 of 4 MARCH 2004 www.securitysales.com A1 Part1of 4 Essentials of DIGITAL VIDEO COMPRESSION By Bob Wimmer Video Security Consultants cctvbob@aol.com AT A GLANCE Compression

More information

Image coding and compression

Image coding and compression Image coding and compression Robin Strand Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University Today Information and Data Redundancy Image Quality Compression Coding

More information

MULTIMEDIA AND CODING

MULTIMEDIA AND CODING 07 MULTIMEDIA AND CODING WHAT MEDIA TYPES WE KNOW? TEXTS IMAGES SOUNDS MUSIC VIDEO INTERACTIVE CONTENT Games Virtual reality EXAMPLES OF MULTIMEDIA MOVIE audio + video COMPUTER GAME audio + video + interactive

More information

Introduction to Video Compression

Introduction to Video Compression Insight, Analysis, and Advice on Signal Processing Technology Introduction to Video Compression Jeff Bier Berkeley Design Technology, Inc. info@bdti.com http://www.bdti.com Outline Motivation and scope

More information

CSCD 443/533 Advanced Networks Fall 2017

CSCD 443/533 Advanced Networks Fall 2017 CSCD 443/533 Advanced Networks Fall 2017 Lecture 18 Compression of Video and Audio 1 Topics Compression technology Motivation Human attributes make it possible Audio Compression Video Compression Performance

More information

Module 7 VIDEO CODING AND MOTION ESTIMATION

Module 7 VIDEO CODING AND MOTION ESTIMATION Module 7 VIDEO CODING AND MOTION ESTIMATION Lesson 20 Basic Building Blocks & Temporal Redundancy Instructional Objectives At the end of this lesson, the students should be able to: 1. Name at least five

More information

Image and Video Coding I: Fundamentals

Image and Video Coding I: Fundamentals Image and Video Coding I: Fundamentals Heiko Schwarz Freie Universität Berlin Fachbereich Mathematik und Informatik H. Schwarz (FU Berlin) Image and Video Coding Organization Vorlesung: Montag 14:15-15:45

More information

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri

Interframe coding A video scene captured as a sequence of frames can be efficiently coded by estimating and compensating for motion between frames pri MPEG MPEG video is broken up into a hierarchy of layer From the top level, the first layer is known as the video sequence layer, and is any self contained bitstream, for example a coded movie. The second

More information

Lecture Information Multimedia Video Coding & Architectures

Lecture Information Multimedia Video Coding & Architectures Multimedia Video Coding & Architectures (5LSE0), Module 01 Introduction to coding aspects 1 Lecture Information Lecturer Prof.dr.ir. Peter H.N. de With Faculty Electrical Engineering, University Technology

More information

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK Professor Laurence S. Dooley School of Computing and Communications Milton Keynes, UK How many bits required? 2.4Mbytes 84Kbytes 9.8Kbytes 50Kbytes Data Information Data and information are NOT the same!

More information

Megapixel Video for. Part 2 of 4. Brought to You by. Presented by Video Security Consultants

Megapixel Video for. Part 2 of 4. Brought to You by. Presented by Video Security Consultants rought to You by 2009 Video Security Consultants Presented by Part 2 of 4 A1 Part 2 of 4 How to Avert a Compression Depression Illustration by Jerry King While bandwidth is widening, larger video systems

More information

CMPT 365 Multimedia Systems. Media Compression - Image

CMPT 365 Multimedia Systems. Media Compression - Image CMPT 365 Multimedia Systems Media Compression - Image Spring 2017 Edited from slides by Dr. Jiangchuan Liu CMPT365 Multimedia Systems 1 Facts about JPEG JPEG - Joint Photographic Experts Group International

More information

The Scope of Picture and Video Coding Standardization

The Scope of Picture and Video Coding Standardization H.120 H.261 Video Coding Standards MPEG-1 and MPEG-2/H.262 H.263 MPEG-4 H.264 / MPEG-4 AVC Thomas Wiegand: Digital Image Communication Video Coding Standards 1 The Scope of Picture and Video Coding Standardization

More information

Lecture Information. Mod 01 Part 1: The Need for Compression. Why Digital Signal Coding? (1)

Lecture Information. Mod 01 Part 1: The Need for Compression. Why Digital Signal Coding? (1) Multimedia Video Coding & Architectures (5LSE0), Module 01 Introduction to coding aspects 1 Lecture Information Lecturer Prof.dr.ir. Peter H.N. de With Faculty Electrical Engineering, University Technology

More information

Chapter 1. Digital Data Representation and Communication. Part 2

Chapter 1. Digital Data Representation and Communication. Part 2 Chapter 1. Digital Data Representation and Communication Part 2 Compression Digital media files are usually very large, and they need to be made smaller compressed Without compression Won t have storage

More information

Lecture 6: Compression II. This Week s Schedule

Lecture 6: Compression II. This Week s Schedule Lecture 6: Compression II Reading: book chapter 8, Section 1, 2, 3, 4 Monday This Week s Schedule The concept behind compression Rate distortion theory Image compression via DCT Today Speech compression

More information

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

International Journal of Emerging Technology and Advanced Engineering Website:   (ISSN , Volume 2, Issue 4, April 2012) A Technical Analysis Towards Digital Video Compression Rutika Joshi 1, Rajesh Rai 2, Rajesh Nema 3 1 Student, Electronics and Communication Department, NIIST College, Bhopal, 2,3 Prof., Electronics and

More information

Lecture 5: Compression I. This Week s Schedule

Lecture 5: Compression I. This Week s Schedule Lecture 5: Compression I Reading: book chapter 6, section 3 &5 chapter 7, section 1, 2, 3, 4, 8 Today: This Week s Schedule The concept behind compression Rate distortion theory Image compression via DCT

More information

CS 335 Graphics and Multimedia. Image Compression

CS 335 Graphics and Multimedia. Image Compression CS 335 Graphics and Multimedia Image Compression CCITT Image Storage and Compression Group 3: Huffman-type encoding for binary (bilevel) data: FAX Group 4: Entropy encoding without error checks of group

More information

Course Syllabus. Website Multimedia Systems, Overview

Course Syllabus. Website   Multimedia Systems, Overview Course Syllabus Website http://ce.sharif.edu/courses/93-94/2/ce342-1/ Page 1 Course Syllabus Textbook Z-N. Li, M.S. Drew, Fundamentals of Multimedia, Pearson Prentice Hall Upper Saddle River, NJ, 2004.*

More information

Mahdi Amiri. February Sharif University of Technology

Mahdi Amiri. February Sharif University of Technology Course Presentation Multimedia Systems Overview of the Course Mahdi Amiri February 2014 Sharif University of Technology Course Syllabus Website http://ce.sharif.edu/courses/92-93/2/ce342-1/ Page 1 Course

More information

MPEG: It s Need, Evolution and Processing Methods

MPEG: It s Need, Evolution and Processing Methods MPEG: It s Need, Evolution and Processing Methods Ankit Agarwal, Prateeksha Suwalka, Manohar Prajapati ECE DEPARTMENT, Baldev Ram mirdha institute of technology (EC) ITS- 3,EPIP SItapura, Jaipur-302022(India)

More information

Multimedia Standards

Multimedia Standards Multimedia Standards SS 2017 Lecture 5 Prof. Dr.-Ing. Karlheinz Brandenburg Karlheinz.Brandenburg@tu-ilmenau.de Contact: Dipl.-Inf. Thomas Köllmer thomas.koellmer@tu-ilmenau.de 1 Organisational issues

More information

Video Compression MPEG-4. Market s requirements for Video compression standard

Video Compression MPEG-4. Market s requirements for Video compression standard Video Compression MPEG-4 Catania 10/04/2008 Arcangelo Bruna Market s requirements for Video compression standard Application s dependent Set Top Boxes (High bit rate) Digital Still Cameras (High / mid

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

Introduction to Video Coding

Introduction to Video Coding Introduction to Video Coding o Motivation & Fundamentals o Principles of Video Coding o Coding Standards Special Thanks to Hans L. Cycon from FHTW Berlin for providing first-hand knowledge and much of

More information

Advanced Video Coding: The new H.264 video compression standard

Advanced Video Coding: The new H.264 video compression standard Advanced Video Coding: The new H.264 video compression standard August 2003 1. Introduction Video compression ( video coding ), the process of compressing moving images to save storage space and transmission

More information

Advanced High Graphics

Advanced High Graphics VISUAL MEDIA FILE TYPES JPG/JPEG: (Joint photographic expert group) The JPEG is one of the most common raster file formats. It s a format often used by digital cameras as it was designed primarily for

More information

Georgios Tziritas Computer Science Department

Georgios Tziritas Computer Science Department New Video Coding standards MPEG-4, HEVC Georgios Tziritas Computer Science Department http://www.csd.uoc.gr/~tziritas 1 MPEG-4 : introduction Motion Picture Expert Group Publication 1998 (Intern. Standardization

More information

Digital video coding systems MPEG-1/2 Video

Digital video coding systems MPEG-1/2 Video Digital video coding systems MPEG-1/2 Video Introduction What is MPEG? Moving Picture Experts Group Standard body for delivery of video and audio. Part of ISO/IEC/JTC1/SC29/WG11 150 companies & research

More information

Multimedia Technology CHAPTER 4. Video and Animation

Multimedia Technology CHAPTER 4. Video and Animation CHAPTER 4 Video and Animation - Both video and animation give us a sense of motion. They exploit some properties of human eye s ability of viewing pictures. - Motion video is the element of multimedia

More information

Compression and File Formats

Compression and File Formats Compression and File Formats 1 Compressing Moving Images Methods: Motion JPEG, Cinepak, Indeo, MPEG Known as CODECs compression / decompression algorithms hardware and software implementations symmetrical

More information

MEDIA RELATED FILE TYPES

MEDIA RELATED FILE TYPES MEDIA RELATED FILE TYPES Data Everything on your computer is a form of data or information and is ultimately reduced to a binary language of ones and zeros. If all data stayed as ones and zeros the information

More information

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing

VIDEO AND IMAGE PROCESSING USING DSP AND PFGA. Chapter 3: Video Processing ĐẠI HỌC QUỐC GIA TP.HỒ CHÍ MINH TRƯỜNG ĐẠI HỌC BÁCH KHOA KHOA ĐIỆN-ĐIỆN TỬ BỘ MÔN KỸ THUẬT ĐIỆN TỬ VIDEO AND IMAGE PROCESSING USING DSP AND PFGA Chapter 3: Video Processing 3.1 Video Formats 3.2 Video

More information

Image and Video Compression Fundamentals

Image and Video Compression Fundamentals Video Codec Design Iain E. G. Richardson Copyright q 2002 John Wiley & Sons, Ltd ISBNs: 0-471-48553-5 (Hardback); 0-470-84783-2 (Electronic) Image and Video Compression Fundamentals 3.1 INTRODUCTION Representing

More information

Multimedia Signals and Systems Motion Picture Compression - MPEG

Multimedia Signals and Systems Motion Picture Compression - MPEG Multimedia Signals and Systems Motion Picture Compression - MPEG Kunio Takaya Electrical and Computer Engineering University of Saskatchewan March 9, 2008 MPEG video coding A simple introduction Dr. S.R.

More information

A QUAD-TREE DECOMPOSITION APPROACH TO CARTOON IMAGE COMPRESSION. Yi-Chen Tsai, Ming-Sui Lee, Meiyin Shen and C.-C. Jay Kuo

A QUAD-TREE DECOMPOSITION APPROACH TO CARTOON IMAGE COMPRESSION. Yi-Chen Tsai, Ming-Sui Lee, Meiyin Shen and C.-C. Jay Kuo A QUAD-TREE DECOMPOSITION APPROACH TO CARTOON IMAGE COMPRESSION Yi-Chen Tsai, Ming-Sui Lee, Meiyin Shen and C.-C. Jay Kuo Integrated Media Systems Center and Department of Electrical Engineering University

More information

2014 Summer School on MPEG/VCEG Video. Video Coding Concept

2014 Summer School on MPEG/VCEG Video. Video Coding Concept 2014 Summer School on MPEG/VCEG Video 1 Video Coding Concept Outline 2 Introduction Capture and representation of digital video Fundamentals of video coding Summary Outline 3 Introduction Capture and representation

More information

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION

15 Data Compression 2014/9/21. Objectives After studying this chapter, the student should be able to: 15-1 LOSSLESS COMPRESSION 15 Data Compression Data compression implies sending or storing a smaller number of bits. Although many methods are used for this purpose, in general these methods can be divided into two broad categories:

More information

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Project Title: Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Midterm Report CS 584 Multimedia Communications Submitted by: Syed Jawwad Bukhari 2004-03-0028 About

More information

CMPT 365 Multimedia Systems. Media Compression - Video Coding Standards

CMPT 365 Multimedia Systems. Media Compression - Video Coding Standards CMPT 365 Multimedia Systems Media Compression - Video Coding Standards Spring 2017 Edited from slides by Dr. Jiangchuan Liu CMPT365 Multimedia Systems 1 Video Coding Standards H.264/AVC CMPT365 Multimedia

More information

Audio and video compression

Audio and video compression Audio and video compression 4.1 introduction Unlike text and images, both audio and most video signals are continuously varying analog signals. Compression algorithms associated with digitized audio and

More information

INF5063: Programming heterogeneous multi-core processors. September 17, 2010

INF5063: Programming heterogeneous multi-core processors. September 17, 2010 INF5063: Programming heterogeneous multi-core processors September 17, 2010 High data volumes: Need for compression PAL video sequence 25 images per second 3 bytes per pixel RGB (red-green-blue values)

More information

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm International Journal of Engineering Research and General Science Volume 3, Issue 4, July-August, 15 ISSN 91-2730 A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

More information

7: Image Compression

7: Image Compression 7: Image Compression Mark Handley Image Compression GIF (Graphics Interchange Format) PNG (Portable Network Graphics) MNG (Multiple-image Network Graphics) JPEG (Join Picture Expert Group) 1 GIF (Graphics

More information

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy

Features. Sequential encoding. Progressive encoding. Hierarchical encoding. Lossless encoding using a different strategy JPEG JPEG Joint Photographic Expert Group Voted as international standard in 1992 Works with color and grayscale images, e.g., satellite, medical,... Motivation: The compression ratio of lossless methods

More information

VIDEO COMPRESSION STANDARDS

VIDEO COMPRESSION STANDARDS VIDEO COMPRESSION STANDARDS Family of standards: the evolution of the coding model state of the art (and implementation technology support): H.261: videoconference x64 (1988) MPEG-1: CD storage (up to

More information

Image Compression for Mobile Devices using Prediction and Direct Coding Approach

Image Compression for Mobile Devices using Prediction and Direct Coding Approach Image Compression for Mobile Devices using Prediction and Direct Coding Approach Joshua Rajah Devadason M.E. scholar, CIT Coimbatore, India Mr. T. Ramraj Assistant Professor, CIT Coimbatore, India Abstract

More information

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck

Compression Part 2 Lossy Image Compression (JPEG) Norm Zeck Compression Part 2 Lossy Image Compression (JPEG) General Compression Design Elements 2 Application Application Model Encoder Model Decoder Compression Decompression Models observe that the sensors (image

More information

Image Compression - An Overview Jagroop Singh 1

Image Compression - An Overview Jagroop Singh 1 www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 5 Issues 8 Aug 2016, Page No. 17535-17539 Image Compression - An Overview Jagroop Singh 1 1 Faculty DAV Institute

More information

Digital Image Processing

Digital Image Processing Lecture 9+10 Image Compression Lecturer: Ha Dai Duong Faculty of Information Technology 1. Introduction Image compression To Solve the problem of reduncing the amount of data required to represent a digital

More information

Digital Image Representation Image Compression

Digital Image Representation Image Compression Digital Image Representation Image Compression 1 Image Representation Standards Need for compression Compression types Lossless compression Lossy compression Image Compression Basics Redundancy/redundancy

More information

Multimedia Technology

Multimedia Technology Multimedia Application An (usually) interactive piece of software which communicates to the user using several media e.g Text, graphics (illustrations, photos), audio (music, sounds), animation and video.

More information

VIDEO COMPRESSION. Image Compression. Multimedia File Formats. Lossy Compression. Multimedia File Formats. October 8, 2009

VIDEO COMPRESSION. Image Compression. Multimedia File Formats. Lossy Compression. Multimedia File Formats. October 8, 2009 File Formats Lossy Compression Image Compression File Formats VIDEO COMPRESSION 121 (Basics) video := time sequence of single images frequent point of view: video compression = image compression with a

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Fundamentals of Image Compression DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR) IN SIGNAL PROCESSING IMPERIAL COLLEGE LONDON Compression New techniques have led to the development

More information

IO [io] MAYAH. IO [io] Audio Video Codec Systems

IO [io] MAYAH. IO [io] Audio Video Codec Systems IO [io] MAYAH IO [io] Audio Video Codec Systems MPEG 4 Audio Video Embedded 24/7 Real-Time Solution MPEG 4 Audio Video Production and Streaming Solution ISMA compliant 24/7 Audio Video Realtime Solution

More information

Multimedia on the Web

Multimedia on the Web Multimedia on the Web Graphics in web pages Downloading software & media Digital photography JPEG & GIF Streaming media Macromedia Flash Graphics in web pages Graphics are very popular in web pages Graphics

More information

Week 14. Video Compression. Ref: Fundamentals of Multimedia

Week 14. Video Compression. Ref: Fundamentals of Multimedia Week 14 Video Compression Ref: Fundamentals of Multimedia Last lecture review Prediction from the previous frame is called forward prediction Prediction from the next frame is called forward prediction

More information

255, 255, 0 0, 255, 255 XHTML:

255, 255, 0 0, 255, 255 XHTML: Colour Concepts How Colours are Displayed FIG-5.1 Have you looked closely at your television screen recently? It's in full colour, showing every colour and shade that your eye is capable of seeing. And

More information

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Chapter 11.3 MPEG-2 MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Simple, Main, SNR scalable, Spatially scalable, High, 4:2:2,

More information

Cross Layer Protocol Design

Cross Layer Protocol Design Cross Layer Protocol Design Radio Communication III The layered world of protocols Video Compression for Mobile Communication » Image formats» Pixel representation Overview» Still image compression Introduction»

More information

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao Video Coding Standards Yao Wang Polytechnic University, Brooklyn, NY11201 http: //eeweb.poly.edu/~yao Outline Overview of Standards and Their Applications ITU-T Standards for Audio-Visual Communications

More information

JPEG 2000 vs. JPEG in MPEG Encoding

JPEG 2000 vs. JPEG in MPEG Encoding JPEG 2000 vs. JPEG in MPEG Encoding V.G. Ruiz, M.F. López, I. García and E.M.T. Hendrix Dept. Computer Architecture and Electronics University of Almería. 04120 Almería. Spain. E-mail: vruiz@ual.es, mflopez@ace.ual.es,

More information

Advanced Encoding Features of the Sencore TXS Transcoder

Advanced Encoding Features of the Sencore TXS Transcoder Advanced Encoding Features of the Sencore TXS Transcoder White Paper November 2011 Page 1 (11) www.sencore.com 1.605.978.4600 Revision 1.0 Document Revision History Date Version Description Author 11/7/2011

More information

Streaming Technologies Glossary

Streaming Technologies Glossary Streaming Technologies Glossary A ACELP.net ADSL AIFF Artifact Asf Asx Avi B Bitrate C C Capture card Codec CIF (QCIF) Client Industry standard audio compression technology developed by VoiceAge Corporation.

More information

VIDEO SIGNALS. Lossless coding

VIDEO SIGNALS. Lossless coding VIDEO SIGNALS Lossless coding LOSSLESS CODING The goal of lossless image compression is to represent an image signal with the smallest possible number of bits without loss of any information, thereby speeding

More information

Compression; Error detection & correction

Compression; Error detection & correction Compression; Error detection & correction compression: squeeze out redundancy to use less memory or use less network bandwidth encode the same information in fewer bits some bits carry no information some

More information

MPEG-l.MPEG-2, MPEG-4

MPEG-l.MPEG-2, MPEG-4 The MPEG Handbook MPEG-l.MPEG-2, MPEG-4 Second edition John Watkinson PT ^PVTPR AMSTERDAM BOSTON HEIDELBERG LONDON. NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO Focal Press is an

More information

06/12/2017. Image compression. Image compression. Image compression. Image compression. Coding redundancy: image 1 has four gray levels

06/12/2017. Image compression. Image compression. Image compression. Image compression. Coding redundancy: image 1 has four gray levels Theoretical size of a file representing a 5k x 4k colour photograph: 5000 x 4000 x 3 = 60 MB 1 min of UHD tv movie: 3840 x 2160 x 3 x 24 x 60 = 36 GB 1. Exploit coding redundancy 2. Exploit spatial and

More information

Compression II: Images (JPEG)

Compression II: Images (JPEG) Compression II: Images (JPEG) What is JPEG? JPEG: Joint Photographic Expert Group an international standard in 1992. Works with colour and greyscale images Up 24 bit colour images (Unlike GIF) Target Photographic

More information

color bit depth dithered

color bit depth dithered EPS The EPS (Encapsulated PostScript) format is widely accepted by the graphic arts industry for saving images that will be placed into programs such as Adobe Illustrator and QuarkXPress. It is used on

More information

In the name of Allah. the compassionate, the merciful

In the name of Allah. the compassionate, the merciful In the name of Allah the compassionate, the merciful Digital Video Systems S. Kasaei Room: CE 315 Department of Computer Engineering Sharif University of Technology E-Mail: skasaei@sharif.edu Webpage:

More information

Lecture 8 JPEG Compression (Part 3)

Lecture 8 JPEG Compression (Part 3) CS 414 Multimedia Systems Design Lecture 8 JPEG Compression (Part 3) Klara Nahrstedt Spring 2012 Administrative MP1 is posted Today Covered Topics Hybrid Coding: JPEG Coding Reading: Section 7.5 out of

More information

Lecture Coding Theory. Source Coding. Image and Video Compression. Images: Wikipedia

Lecture Coding Theory. Source Coding. Image and Video Compression. Images: Wikipedia Lecture Coding Theory Source Coding Image and Video Compression Images: Wikipedia Entropy Coding: Unary Coding Golomb Coding Static Huffman Coding Adaptive Huffman Coding Arithmetic Coding Run Length Encoding

More information

REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ADVANTAGES OF IMAGE COMPRESSION

REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ADVANTAGES OF IMAGE COMPRESSION REVIEW ON IMAGE COMPRESSION TECHNIQUES AND ABSTRACT ADVANTAGES OF IMAGE COMPRESSION Amanpreet Kaur 1, Dr. Jagroop Singh 2 1 Ph. D Scholar, Deptt. of Computer Applications, IK Gujral Punjab Technical University,

More information

Repetition 1st lecture

Repetition 1st lecture Repetition 1st lecture Human Senses in Relation to Technical Parameters Multimedia - what is it? Human senses (overview) Historical remarks Color models RGB Y, Cr, Cb Data rates Text, Graphic Picture,

More information

IMAGE COMPRESSION USING FOURIER TRANSFORMS

IMAGE COMPRESSION USING FOURIER TRANSFORMS IMAGE COMPRESSION USING FOURIER TRANSFORMS Kevin Cherry May 2, 2008 Math 4325 Compression is a technique for storing files in less space than would normally be required. This in general, has two major

More information

Image compression. Stefano Ferrari. Università degli Studi di Milano Methods for Image Processing. academic year

Image compression. Stefano Ferrari. Università degli Studi di Milano Methods for Image Processing. academic year Image compression Stefano Ferrari Università degli Studi di Milano stefano.ferrari@unimi.it Methods for Image Processing academic year 2017 2018 Data and information The representation of images in a raw

More information

Elementary Computing CSC 100. M. Cheng, Computer Science

Elementary Computing CSC 100. M. Cheng, Computer Science Elementary Computing CSC 100 1 Graphics & Media Scalable Outline & Bit- mapped Fonts Binary Number Representation & Text Pixels, Colors and Resolution Sound & Digital Audio Film & Digital Video Data Compression

More information

In the first part of our project report, published

In the first part of our project report, published Editor: Harrick Vin University of Texas at Austin Multimedia Broadcasting over the Internet: Part II Video Compression Borko Furht Florida Atlantic University Raymond Westwater Future Ware Jeffrey Ice

More information

Topic 5 Image Compression

Topic 5 Image Compression Topic 5 Image Compression Introduction Data Compression: The process of reducing the amount of data required to represent a given quantity of information. Purpose of Image Compression: the reduction of

More information

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21

Lesson 6. MPEG Standards. MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21 Lesson 6 MPEG Standards MPEG - Moving Picture Experts Group Standards - MPEG-1 - MPEG-2 - MPEG-4 - MPEG-7 - MPEG-21 What is MPEG MPEG: Moving Picture Experts Group - established in 1988 ISO/IEC JTC 1 /SC

More information

MPEG-2. ISO/IEC (or ITU-T H.262)

MPEG-2. ISO/IEC (or ITU-T H.262) MPEG-2 1 MPEG-2 ISO/IEC 13818-2 (or ITU-T H.262) High quality encoding of interlaced video at 4-15 Mbps for digital video broadcast TV and digital storage media Applications Broadcast TV, Satellite TV,

More information