Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/ PDF Free Download

To: Dr. K. R. Rao From: Kaustubh V. Dhonsale (UTA id: - 1000699333) Date: 04/24/2012 Subject: EE-5359: Class project interim report Proposed project topic: Overview, implementation and comparison of Audio Video Standard (AVS) of China and H.264/MPEG -4 part 10 or Advanced Video Coding Standard List of acronyms in alphabetical order: AVC: AVS: HDTV: IP: ISO: ITU-T: MPEG: MSE: NAL: PSNR: SSIM: TV: QP: MB: Advanced video coding Audio video standard High definition television Internet protocol International organization for standardization International telecommunication union s telecommunication standardization sector Moving picture experts group Mean square error network adaptation layer Peak signal to noise ratio Structural similarity Television Quantization parameter Macroblock Objective: The main objective of this project is to implement two compression techniques AVS of China and H.264 based on various factors like complexity, video quality, bit rate compression ratio etc. This project will emphasize on comparison between AVS china and H.264 based on various factors like PSNR, MSE, SSIM, compression ratio at different bit rates. Introduction: In real world applications compression is the most important part of the data transmission. Compression depends on the fact that the data is redundant in nature. It implies that up to some extent it was generated using some set of rules, and if those rules are known, the data can be accurately predicted. An encoder creates a compressed file that is ready for transmission or storage, retaining the information in the encoded data bits. Usually, video and audio files are large in nature so, compression becomes Page 1

unavoidable in such conditions. The general block diagram explaining the scope of coding standardization is shown in fig. 1. Fig. 1 Scope of video coding standardization [1] To play such a compressed file, an inverse algorithm is applied to produce a video that shows virtually the same content as the original source video. The time it takes to compress, send, decompress and display a file is called latency. The more advanced the compression algorithm, the higher the latency, given the same processing power. A pair of algorithms that works together is called a video codec (encoder/decoder). Video codecs that implement different standards are normally not compatible with each other; that is, video content that is compressed using one standard cannot be decompressed with a different standard. For instance, an MPEG-4 Part 2 decoder will not work with an H.264 encoder [2]. Different video compression standards use different methods to avoid data redundancy and therefore result in different bit rates, qualities and latency. The purpose of this project is to understand the performance of H.264 [1] and AVS of China [9] in terms of parameters peak signal-to-noise ratio (PSNR), mean square error (MSE), structural similarity (SSIM) [15] and compression ratio using various test sequences at different bit rates etc. Fig. 2 shows the history of the audio and video coding standards. Fig.2 History of audio / video coding standards [3]. Page 2

AVS of China: Audio Video coding Standard (AVS) [8] is established by the Working Group of China in the same name which was approved by the Chinese Science and Technology Department of Ministry of Information Industry in June 2002 [9]. AVS comprises of two major video parts- AVS Part 2 [10] for high-definition digital video broadcasting and high-density storage media and AVS Part 7 [11] for low complexity, low resolution mobility applications. Compared with other standards, AVS has been designed to provide near optimum performance and a considerable reduction in complexity. AVS will therefore provide lowcost implementations [12]. Figures 3 and 4 show the block diagrams of the AVS China encoder and decoder respectively. Fig. 3 Block diagram of AVS encoder [13] Page 3

Fig. 4 Block diagram of AVS decoder [13]. Various profiles of AVS China: Profiles Jizhun profile Jiben Profile Shenzhan profile Jiaqiang profile Key application Television broadcasting, HDTV Mobility applications Video Surveillance Multimedia entertainment Table 1 AVS video profiles and applications [14]. AVS video Jizhun profile: Jizhun profile is defined as the first profile in the national standards of AVS of China-Part 2 [14], approved as national standard in 2006. This profile mainly focuses on digital video applications like commercial broadcasting and storage media, including high-definition applications. Basically it is preferred for high coding efficiency on video sequences of higher resolutions, at the expense of moderate computational complexity. AVS of China Jiben profile: Jiben profile is defined in AVS-Part 7 [14] taking into account mainly mobility video applications with smaller picture resolution. Thus, computational complexity becomes a critical issue. Also the ability of error resilience is needed due to the wireless transporting environment. AVS-Part 7 reached to final committee draft at the end of 2004 [14]. Page 4

AVS of China Shenzhan profile: The profile AVS-Shenzhan focuses exclusively on standardizing the video surveillance applications. Especially, there are special features of sequences from video surveillance such as the random noise appearing in pictures, relatively lower encoding complexity affordable, and friendliness to events detection and searching required [14]. Hence corresponding techniques considering a proper process on these special features will be encouraged. AVS of china Jiaqiang profile: Jiaqiang profile is mainly focused to fulfill the needs of multimedia entertainment; one of the major concerns of this profile is movie compression for high-density storage [14]. Relatively higher computational complexity can be tolerated at the encoder side to provide higher video quality, with compatibility to AVS-Part 2 as well. Table 2 shows the summary of AVS of China profiles. Table 2 Summary of profiles in AVs of China [14] Parts of AVS Contents Stage Part 1 System for broadcasting Final committee draft Part 2 SD/HD Video National Standard Part 3 Audio Final draft Part4 Conformance test Final committee draft Part5 Reference software Final committee draft Part 6 Digital right management Final committee draft Part 7 Mobility video Final draft Part 8 System over IP Final draft Part 9 File format Final draft Table 3 AVS China parts [14] Page 5

Layered structure of AVS china: AVS of China uses coded structure for the data compression and decompression. It can be easily explained with the help of coded bit stream. Fig.5 shows the layered data structure. Fig. 5 Layered data structure. At the top layer, multiple frames of the video are put in sequence into a buffer. This sequence is then combined to form frames, in the layer of Picture/ Frame and this combined sequence is called as picture. Then in the next layer of slice, these pictures are divided into rectangular regions and these regions are called as slices. Slices are divided into Macroblocks (MBs) in the subsequent layer. MB consists of set blocks which has luminance and chrominance. Sequence The sequence layer consists of a set of mandatory and optional downloaded system parameters [17]. The mandatory parameters are required to initialize decoder systems. The optional parameters are used for other system settings at the discretion of the network provider. Sometimes user data can be contained in the sequence header. The sequence layer provides an entry point into the coded video. Sequence headers should be placed in the bitstream to support user access appropriately for the given distribution medium. Repeat sequence headers may be inserted to support random access. Sequences are terminated with a sequence end code. Page 6

Picture The picture layer provides the coded representation of a video frame. It comprises a header with mandatory and optional parameters and optionally with user data. Three types of pictures are defined by AVS [17]: 1. Intra pictures (I-pictures) 2. Predicted pictures (P-pictures) 3. Interpolated pictures (B-pictures) Slice The slice structure provides the lowest-layer mechanism for re-synchronizing the bitstream in case of transmission error. Slices comprise of a series of MBs. Slices must not overlap, must be contiguous, must begin and terminate at the left and right edges of the Picture. It is possible for a single slice to cover the entire Picture. The slice structure is optional. Slices are independently coded and no slice can refer to another slice during the decoding process. Fig. 6 Normal slice structure and flexible slice set in AVS-video storage [17] (left: normal slice structure where slice can only contain continual lines of macroblocks; right: flexible slice set allowing more flexible grouping of MBs in slice and slice set, where B0, B1and B2 are slices of the same slice group). Macroblock Picture is divided into several macroblocks (MB). The upper-left sample of each MB should not exceed picture boundary. Macroblock partitioning is shown in Fig. 7. The partitioning is used for motion compensation. The number in each rectangle specifies the order of appearance of motion vectors and reference indices in a bitstream. Page 7

Fig. 7 Macroblock partitioning A MB includes the luminance and chrominance component pixels that collectively represent a 16x16 region of the picture. In 4:2:0 mode, the chrominance pixels are sub-sampled by a factor of two in each dimension; therefore each chrominance component contains only one 8x8 block. In 4:2:2 mode, the chrominance pixels are sub-sampled by a factor of two in the horizontal dimension; therefore each chrominance component contains two 8x8 blocks. The MB header contains information about the coding mode and the motion vectors. It may optionally contain the quantization parameter (QP). Block The block is the smallest coded unit and contains the transform coefficient data for the prediction errors. In the case of intra-coded blocks, intra prediction is performed from neighboring blocks. Coding tools in AVs of China: The major coding tools of AVS china part-2 are listed below: 1. 8 X 8 spatial prediction: Spatial prediction is used in intra coding in AVS Part 2 to exploit spatial correlation of picture. The intra prediction is based on 8x8 block. It uses decoded information in the current frame as the reference of prediction, exploiting statistical spatial dependencies between pixels within a picture. If MBPAFF is applied, intra-frame prediction can only take the MBs within the same stage as reference. There are five luminance intra prediction modes, and four chrominance intra prediction modes. Each of the four 8x8 luminance blocks can be predicted using one of the five intra-prediction modes. Ahead of prediction of DC mode (Mode2), diagonal down left (Mode3) mode and diagonal downright mode(mode 4), a three-tap low-pass filter (1,2,1) is applied on the samples that will be used as references of prediction [17]. It needs to be pointed out that in DC mode each pixel of current block is predicted by an Page 8

average of the vertically and horizontally corresponding reference pixels. Hence, the prediction values of different pixels in a block might be different. This results in a fine prediction for a large block. Prediction of the most probable mode is according to the intra-prediction modes of neighboring blocks. This will help to reduce average bits needed to describe the intra-prediction mode in video bitstream. The reconstructed pixels of neighboring blocks before deblocking filtered is used as reference pixels for the current block is shown in fig. 8 Fig. 8 Neighbor pixels in luminance intra prediction [14] Four luminance intra prediction directions are illustrated in Fig. 8. Five luminance intra prediction modes are illustrated in Fig. 9. Fig. 9 Five luminance intra prediction modes [14] Page 9

2. Inter prediction P-picture and B-picture are specified in AVS Part 2. There are four macroblock partition types for inter prediction, 16x16, 16x8, 8x16 and 8x8 [17]. Fig. 10 shows different types of macroblock and sub-macroblock partitions. Fig. 10 Macroblock and sub-macroblock partitions. 2.1 P-Prediction In P-picture, there are 5 inter prediction modes, PSkip (16x16), P_16x16, P_16x8, P_8x16, and P_8x8. For the latter 4 modes in P-frame, each partition of macroblock is predicted from one of the two candidate reference frames, which are latest decoded I- or P-frame. For the latter 4 modes in P-field, each partition of macroblock is predicted from one of the four latest decoded reference fields. 2.2 Bi-prediction There are two kinds of bi-predictions in AVS Part 2, symmetric-prediction and direct-prediction. In symmetric-prediction, only one forward motion vector is transmitted for each partition. The backward motion vector is derived from the forward one by a symmetric rule (as shown in Fig. 11). In direct-prediction, forward and backward motion vectors are all derived from the motion vector of the collocated block in the backward reference picture. Page 10

Fig. 11 Symmetric mode of AVS Part 2 [14] 2.3 Interpolation A 1/4-pixel interpolation method named as Two Steps Four Taps interpolation (TSFT) is adopted in AVS Part 2. 1/2-pixel samples are interpolated in step 1 and 1/4-pixel samples in step 2. 1/2-pixel interpolation filter is a 4-tap filter Hi (-1/8, 5/8, 5/8, -1/8). For ordinary 1/4-pixel samples, a, c, d, f, I, k, n and q in Fig.6, a 4-tap filter H2 (1/16, 7/16, 7/16, 1/16) is applied, and four special 1/4- pixel samples, e, g, p and r, are filtered by 2-tap bi-linear filter H3(1/2, 1/2)" [18]. The positions of the pixels are illustrated in Fig. 12. Fig. 12 Position ofinteger pixels, 1/2 pixels and 1/4 pixels [14]. The grey pixels are the integer pixel samples, blue ones are 1/2-pixel samples and white ones are 1/4-pixel samples Page 11

2.4 Reference frames Maximum of two reference pictures is allowed in inter-frame prediction of AVS-video. In particular, two reference frames are maximum when the current frame is coded with frame coding, while the reference index is no more than three (beginning from zero) when applying field coding. References are the nearest coded frames/fields of the current frame/field, except that the second field in B picture cannot take the first field of the same picture as its reference, as shown in Fig. 13. Especially, reference can only be taken from the previous stages of the current macroblock pair in P-prediction when MBPAFF is applied. For video surveillance, normally the scene is not changed or changed slightly, background reference frames can replace one of the above two reference pictures and provide better coding efficiency. Fig. 13 Maximum of two reference pictures in AVS-video: [14] (a) reference of current (the second) field in field-coded I picture, (b) references of current frame in frame-coded P picture, (c) references of current (the first) field in field-coded P picture, (d) references of current (the second) field in field-coded P picture, (e) references of current frame in frame-coded B picture (f) references of current (the first) field in field-coded B picture and (g) references of current (the second) field in field-coded B picture. Page 12

3. Transform and Quantization In a typical block based compression scheme like AVS, the residual block is transformed using a 8x8 integer transform. These integer transforms are a variation of discrete cosine transform (DCT) [22]. The transform outputs a set of coefficients, each of which is a weighting value for a standard basis pattern. Transform Quantize Coefficients 8x8 Image Block Fig.14 Quantization of the transform coefficients of the image block [17] The output of the transform, a block of transform coefficients, is quantized, i.e. each coefficient is divided by an integer value. Quantization reduces the precision of the transform coefficients according to a quantization parameter (QP). Typically, the result is a block in which most or all of the coefficients are zero, with a few non-zero coefficients. Setting QP to a high value means that more coefficients are set to zero; resulting in high compression at the expense of poor decoded image quality. Setting QP to a low value means that more non-zero coefficients remain after quantization, resulting in better decoded image quality but lower compression. 4. In-loop deblocking filter AVS uses the de-blocking filter in motion compensation loop. In-loop deblocking filter is applied in AVS Part 2, to reduce the blocking artifacts and enhance both subjective and objective performance. The deblocking process directly acts on the reconstructed reference first across vertical edges and then across horizontal edges. Obviously, different image regions and different bit rates need different smoothes. Therefore, the de-blocking filter is automatically adjusted in AVS depending on activities of blocks and QPs. Page 13

5. 2D VLC (Variable Length Coding) In AVS Part 2, an efficient context-based 2D-VLC entropy coder is designed for coding 8x8 block-size transform coefficients. 2D-VLC means that a pair of Run-Level is regarded as one event and jointly coded. Context-based is a technique, which uses the coefficient information to switch among different VLC tables. High performance can be achieved with the cost of relatively low complexity. MPEG-4 Part 10/AVC: In order to produce the standard method of compression, a group of experts known as moving picture experts group (MPEG) was formed by international organization for standardization (ISO) [4] and international telecommunication union s telecommunication standardization sector [5] (ITU-T) to establish an international standard for coded representation of moving picture and associated audio on digital storage media.. Over the period of time various standards such as MPEG -1, MPEG-2, MPEG-4, MPEG-4 part 2,MPEG -4 AVC etc. [4] have been developed by this group for video as well as audio compression. Page 14

Fig. 15 Block diagram of the H.264 encoder and decoder [6]. This joint group involved in development of H.264 was focused on creating a simple and clean solution, limiting options and features to a minimum for video data. An important aspect of the standard, as with other video standards, is to provide the capabilities in profiles (sets of algorithmic features) and levels (performance classes) that optimally support popular productions and common formats. H.264 has seven profiles, each targeting a specific class of applications. Each profile defines what feature set the encoder may use and limits the decoder implementation complexity. Fig.15 shows the encoder and decoder for the H.264/AVC. Page 15

The AVC/H.264 standard defines four important Profiles: Baseline Profile Primarily used for lower-cost applications with limited computing resources, this profile is used widely in videoconferencing and mobile applications. Extended Profile Intended to use for streaming video profile, this profile has relatively high compression capability and some extra robustness to data losses and server stream switching can be achieved. Main Profile Originally intended as the mainstream consumer profile for broadcast and storage applications, the importance of this profile faded when the High profile was developed for those applications.. High Profile The primary profile for broadcast and disc storage applications, particularly for high-definition television applications (this is the profile adopted into HD DVD and Blu-ray Disc, for example). Fig. 16 Profiles of H.264 /AVC [8] Some advantages of H.264: Up to 50% in bit rate savings: Compared to H.263v2 (H.263+) or MPEG-4 [7] Simple Profile, H.264 permits a reduction in bit rate by up to 50% for a similar degree of encoder optimization at most bit rates. High quality video: H.264 offers consistent video quality at high and low bit rates. Error resilience: H.264 provides the tools that are necessary to deal with packet losses in packet networks and bit errors in error-prone wireless networks. Network friendliness: Through the network adaptation layer (NAL), H.264 bit streams can be easily transported over different networks. Wide areas of application streaming mobile TV, HDTV over IP, extended PVR and storage options for the home user Page 16

Video evaluation parameters: Peak signal to noise ratio (PSNR): PSNR is commonly used to measure the quality of reconstruction. The higher the PSNR higher the reconstruction quality. Mean Square Error: MSE is the difference between values estimated by estimator and true value being measured. MSE value of zero implies that the estimator has predicted the values correctly. Structural similarity (SSIM): SSIM measures the similarity between the two images. SSIM is designed to improve on traditional methods like PSNR and MSE [15]. Experimental results for AVS of China: The software which has been used to perform AVS of China is BM1.0v3 [19] obtained from the ftp server [19]. Microsoft Visual C++ 2008 express edition [20] has been used to run the code and build the project for the AVS China codec. After building the project, code will generate two application files namely lencod.exe and ldecod.exe. By running these two files using appropriate and necessary parameters final result which is a decoded file is obtained. The original file and decoded file are than evaluated using MSU video quality measurement tool. The values of PSNR, MSE and SSIM are obtained from it [21]. Page 17

AVS China results for sequence foreman_cif.yuv: File used foreman_cif.yuv Resolution: 358X 288 Frame rate: 25 fps Original file size: 44550 Kilobytes Number of frames used: 100 Table 4 shows PSNR, MSE, SSIM and compression ratios for different bitrates. Fig. 17 shows decoded images at different bitrates for AVS China. QP Compressed File size(kb) Bit rate (Kbits/s) Bits/pixel Y-PSNR (db) Y-MSE Y-SSIM Compression ratio 0 20734 17100 6.63 60.54 0.0573 0.9997 2.14 : 1 15 6213 5123.57 1.9877 45.8679 1.6837 0.9942 7.1 : 1 31 1045 860.94 0.3340 37.6255 11.2340 0.9737 42.63:1 45 206 169.21 0.0656 31.2507 48.7547 0.9186 216.3:1 63 34 27.67 0.0107 23.8649 267.0482 0.7114 1317.3:1 Table 4 MSE, PSNR and SSIM for foreman_cif.yuv file for AVS China Page 18

Bitrate=17100 Kbits/s Bitrate= 860.94 Kbits/s PSNR= 60.54 db PSNR = 37.6255 db MSE= 0.0573 MSE= 11.2340 SSIM= 0.9997 SSIM= 0.9737 Bitrate = 27.67 Kbits/s PSNR = 23.8649 db MSE = 267.0482 SSIM = 0.7114 Fig. 17 Foreman_cif.yuv sequence compressed AVS China files at different bitrates. Page 19

MSE PSNR in db 65 PSNR Vs. Bitrate 60 55 50 45 40 35 30 0 2000 4000 6000 8000 10000 12000 14000 16000 18000 Bitrate in Kbps Fig. 18 PSNR Vs. Bitrate for foreman_cif.yuv sequence for AVs China 50 45 40 35 30 25 20 15 10 5 MSE Vs. Bitrate 0 0 2000 4000 6000 8000 10000 12000 14000 16000 18000 Bitrate in Kbps Fig. 19 MSE Vs. Bitrate for foreman_cif.yuv sequence for AVS china Page 20

SSIM 1 SSIM Vs. Bitrate 0.95 0.9 0.85 0.8 0.75 0 2000 4000 6000 8000 10000 12000 14000 16000 18000 Bitrate in Kbps Fig. 20 SSIM Vs. Bitrate for foreman_cif.yuv sequence for AVS China Page 21

AVS China results for sequence foreman_qcif.yuv file: File used foreman_qcif.yuv Resolution: 176 X 144 Frame rate: 25 fps Original file size: 11138 Kilobytes Number of frames used: 300 Table 5 shows PSNR, MSE, SSIM and compression ratios for different bitrates. Fig. 21 shows decoded images at different bitrates. QP Compressed File size(kb) Bit rate (Kbits/s) Bits/pixel Y-PSNR (db) Y-MSE Y-SSIM Compression ratio 0 5762 4751.41 7.49 53.658 0.2823 0.9985 1.93 : 1 15 1843 1519.72 2.3985 44.894 2.214 0.9905 6.04 : 1 31 436 350.44 0.5531 36.038 16.319 0.9493 25.56 : 1 45 118 96.68 0.1526 29.161 79.499 0.855 94.39 : 1 63 14 11.12 0.0176 20.357 603.59 0.5186 795.57 : 1 Table 5 MSE, PSNR and SSIM for foreman_qcif.yuv file for AVS China Page 22

Bitrate=4751.41 Kbits/s Bitrate= 350.44 Kbits/s PSNR= 53.65 db PSNR = 36.038 db MSE= 0.2823 MSE= 16.139 SSIM= 0.9985 SSIM= 0.9493 Bitrate = 11.12 Kbits/s PSNR = 20.357 db MSE = 603.59 SSIM = 0.5186 Fig. 21 foreman_qcif.yuv sequence compressed AVS China files at different bitrates. Page 23

MSE PSNR in db 55 PSNR Vs. Bitrate 50 45 40 35 30 25 20 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 Bitrate in Kbps Fig. 22 PSNR Vs. Bitrate for foreman_qcif.yuv sequence for AVS China 700 MSE Vs. Bitrate 600 500 400 300 200 100 0 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 Bitrate in Kbps Fig. 23 MSE Vs. Bitrate for foreman_qcif.yuv sequence for AVS China Page 24

SSIM 1 SSIM Vs. Bitrate 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 0 500 1000 1500 2000 2500 3000 3500 4000 4500 5000 Bitrate in Kbps Fig. 24 SSIM Vs. Bitrate for foreman_qcif.yuv sequence for AVS China Page 25

AVS China results for akiyo_qcif.yuv file: File used: akiyo_qcif.yuv Resolution: 176 X 144 Frame rate: 25 fps Original file size: 11138 Kilobytes Number of frames used: 300 Table 6 shows PSNR, MSE, SSIM and compression ratios for different bitrates. Fig. 25 shows decoded images at different bitrates. QP Compressed File size(kb) Bit rate (Kbits/s) Bits/pixel Y-PSNR (db) Y-MSE Y-SSIM Compression ratio 0 2059 1697.68 2.6794 58.8256 0.0852 0.9993 5.41 : 1 15 391 322.26 0.5086 47.3606 1.1940 0.9902 28.49 : 1 31 99 80.79 0.1275 41.5042 4.5989 0.9747 112.51 : 1 45 37 29.64 0.0468 34.3129 24.087 0.9231 301.03 : 1 63 10 7.84 0.0124 26.8455 134.43 0.8022 1113.8 : 1 Table 6 MSE, PSNR and SSIM for akiyo_qcif.yuv file for AVS China Page 26

Bitrate=1697.68 Kbits/s Bitrate= 80.79 Kbits/s PSNR= 58.82 db PSNR = 41.5042 db MSE= 0.0852 MSE= 4.5989 SSIM= 0.9993 SSIM= 0.9747 Bitrate = 7.84 Kbits/s PSNR = 26.8455 db MSE = 134.43 SSIM = 0.8022 Fig. 25 akiyo_qcif.yuv sequence compressed AVS China files at different bitrates. Page 27

MSE PSNR in db 60 PSNR Vs. Bitrate 55 50 45 40 35 30 25 0 200 400 600 800 1000 1200 1400 1600 1800 Bitrate in Kbps Fig. 26 PSNR Vs. Bitrate for foreman_qcif.yuv sequence for AVS China 140 MSE Vs. Bitrate 120 100 80 60 40 20 0 0 200 400 600 800 1000 1200 1400 1600 1800 Bitrate in Kbps Fig. 27 MSE Vs. Bitrate for foreman_qcif.yuv sequence for AVS China Page 28

SSIM 1 SSIM Vs. Bitrate 0.98 0.96 0.94 0.92 0.9 0.88 0.86 0.84 0.82 0.8 0 200 400 600 800 1000 1200 1400 1600 1800 Bitrate in Kbps Fig. 28 SSIM Vs. Bitrate for foreman_qcif.yuv sequence for AVS China Proceeding work: This project will focus on key aspects of MPEG-4/H.264 and AVS China Standards. Furthermore it will look forward in simulation with different test sequences at different bitrates. For the implementation purpose of AVS of China BM1.0v3 [19] is used and for H.264 implementation JM18.3 will be used for the testing of the parameters like MSE, PSNR, SSIM and compression ratio [16]. This project report will also emphasize on comparison between the two formats based on various performance parameters such as PSNR, MSE, SSIM and compression ratio for different video sequences at different bit rates. Page 29

References: 1. T. Wiegand, G. J. Sullivan, G. Bjøntegaard, and A. Luthra, Overview of the H.264/AVC Video Coding Standard, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 13, pp. 560-576, JULY 2003. 2. H.264 video compression standard : New possibilities within video surveillance, Axis Commuincation,2008, Feb. 22nd 2012, <http://www.axis.com/files/whitepaper/wp_h264_31669_en_0803_lo.pdf> 3. ITU H.263 Video Compression : http://www.h263l.com/ 4. ITU-T: http://en.wikipedia.org/wiki/itu-t 5. S-K Kwon, A. Tamhankar and K.R. Rao Overview of H.264 / MPEG-4 Part 10, J. Visual Communication and Image Representation, vol. 17, pp. 186-216, April 2006. 6. X. Zhou, E. Q. Li, and Y.-K. Chen, Implementation of H.264 decoder on general purpose processors with media instructions, SPIE Conference on Image and Video Communications and Processing, vol. 5022, pp.224-235, May 2003. 7. J. Watkinson, The MPEG Handbook, Second Edition, Elsevier/Focal Press, pp. 1, 2004. 8. A. Puri, X. Chen and A. Luthra, Video coding using the H.264/MPEG- 4 AVC compression standard, Signal Processing: Image Communication, vol. 19, pp. 793-849, Oct. 2004. 9. Audio Video Coding Standard Group of China (AVS), Advanced Audio Video Coding Standard Part 2: Video, Doc. AVS-N1063, Dec, 2003. 10. L. Fan, S. Ma and F. Wu, Overview of AVS Video Standards, IEEE International Conference on Multimedia and Expo (ICME), pp. 423-426, 2004. 11. Audio Video Coding Standard Workgroup of China (AVS), Video Coding Standard FCD1.0, NOV.2003. 12. AVS Video Expert Group, Information Technology Advanced Audio Video Coding Standard Part 7: Mobility Video, Audio Video Coding Standard Group of China (AVS), Doc. AVS-N1151, Dec.2004. 13. W. Gao et al, AVS The Chinese next-generation video coding standard, NAB, Las Vegas, 2004. 14. L. Yu, S. Chen and J. Wang, Overview of AVS-video coding standards, Signal Processing: Image Communication, Vol. 24, Issue 4, Special Issue on AVS and its Application, pp. 247-262, April 2009. Page 30

15. J.B. Maretens and L. Meesters, Image dissimilarity, Signal Processing, vol. 70, pp. 155-176, Nov. 1998. 16. 18.3 JM H.264 software: http://iphome.hhi.de/suehring/tml/ 17. Project on "Low Complexity AVS-China Part-2 video using data mining techniques" by Jennie Abraham: http://www-ee.uta.edu/dip/courses/ee5359/thesis%20project%20table%20docs/ jennieproposal.doc 18. L. Yu et al., Overview of AVS-Video: Tools, performance and complexity, SPIE VCIP, vol. 5960, pp. 596021-1~ 596021-12, Beijing, China, July 2005. 19. AVS of China software: ftp://124.207.250.92 (password required) 20. Microsoft Visual C++ 2008 express edition: http://msdn.microsoft.com/enus/express/future/bb421473 21. MSU video quality measurement tool: http://compression.ru/video/quality_measure/vqmt_download_en.html#start 22. C. Zhang et al, The Technique of Prescaled Integer Transform: Concept, Design and Applications, IEEE transaction on circuits and systems for video technology, vol. 18, pp. 84-97, January 2008 23. I.E. Richardson, The H.264 Advanced Video compression standard, second edition, John Wiley & sons,2010 Page 31

Objective: Introduction: To: Dr. K. R. Rao. From: Kaustubh V. Dhonsale (UTA id: ) Date: 04/24/2012