SPATIO-TEMPORAL SIGNATURES FOR VIDEO COPY DETECTION

Size: px
Start display at page:

Download "SPATIO-TEMPORAL SIGNATURES FOR VIDEO COPY DETECTION"

Transcription

1 SPATIO-TEMPORAL SIGNATURES FOR VIDEO COPY DETECTION Isabelle Simand, 2 Denis Pellerin, 3 Stephane Bres and 3 Jean-Michel Jolion Isabelle.Simand@liris.cnrs.fr 3 LIRIS, bat. J. Verne, INSA, 6962 Villeurbanne Cedex, FRANCE 2 LIS, 46 av. F. Viallet, 3803 Grenoble Cedex, FRANCE ABSTRACT The number of copied videos is growing rapidly on television broadcast networks as well as on the world wide web. The existing copy detection methods resort either to image techniques or to video ones. We propose a spatio-temporal for the automatic detection of video extracts, based on the evolution of gray level centroids along time. We have obtained good results for a base of more than 50 Gb of data and for numerous tests of robustness. Our algorithm is robust to changes in contrast and brightness, zooms, modification of frame rate, a logo superimposition on the image, etc.. INTRODUCTION The number of copies of a given video grows rapidly on television broadcast networks as well as on the world wide web. That causes problems of law and efficiency, among others for search engines. Video copy detection is a recent research domain also named video monitoring or video fingerprinting. It is generally based on the creation of a compact representation () of each video in a database. Copy detection is performed by comparison between the of a given video and those stored in a database. Thus, this approach is different from basic image or video indexing as the search is based on a strong matching criteria instead of a similarity one. The research in video copy detection or video fingerprinting is recent. The approaches described in the literature can be grouped according to the type of extracted features: spatial features in images or spatio-temporal features in video. Cheung et al. [, 2] have proposed a by selecting a few frames similar to seed images. The introduced by Joly et al. [3] consists in extracting local features around interest points. These two previous techniques are not robust to changes in the order of the images in the video. Indyk et al. [4] use the shot change duration as for a video, but video sequences, especially short one, contain few shot changes. Hampapur et al. [5] propose two sequence-matching methods based on motion direction histograms or average gray level over image blocks, and compare them on three videos. Oostveen et al. [6] consider the sign of the differences of mean luminance over blocks. The is compact and facilitates the research step, but is sensitive to global geometric operations. Finally, Hoad and Zobel [7] use the magnitude of two motion vectors calculated for the darkest (respectively lightest) 5% of the pixels. They evaluate the robustness with several degraded versions of the clips as queries and notice that it is sensitive to changes in luminance and resolution. These previous spatio-temporal methods have been tested on a few hours of video. In this paper we propose a spatio-temporal for the automatic detection of video extracts, based on the evolution of gray level centroids along time. This is compact enough to keep storage and computing time low. Moreover, it is discriminant, and robust to several transformations. Particular applications are the supervision of television broadcast and video control storage to avoid duplication, or the verification of the terms of a contract. In section 2, we present the method for the extraction and the video search. Section 3 includes the results obtained for a base of more than 50 Gb of data and tests of robustness, and section 4 summaries the work. 2.. Signature extraction 2. METHOD We have chosen to characterize each video by calculating the gray level centroid of each frame of the video. The coordinates (c x, c y ) of the centroid (Eq. ) are calculated as the weighted sum of the pixel positions, the weights being the gray values of the pixels: c x = size (L i x i) size L i c y = size (L i y i) size L i () where size is the number of pixels considered in the image, and L i the gray level or the luminance of the pixel. Then, these coordinates are normalized to a single frame size. This computation is not costly for each frame. The resulting point is obviously not significant for one frame, but becomes very characteristic for several successive frames,

2 Figure : Example of motion of a centroid on a few frames. due to its movement during a few seconds. As a consequence, it leads to an efficient spatio-temporal (Fig. ). This corresponds to a fingerprinting approach rather than a watermarking one, because no information is added. This first method has the advantage to be applicable even for videos which are already on the net. On the other hand, this is a low level one and does not allow semantic comparison, which is beyond the scope of this project. This approach is based on the gray levels, obtained from the RGB color components. Indeed, combining these three components in one centroid is faster and simpler than using them separately. As a result, the is identical for a color video and its gray value version. As a video can be related to quite a complex scene, only one centroid is not sufficient to characterize a video extract. We thus propose to divide the frames in several subimages. Based on our experiments, dividing a frame into four quarters is the best compromise between the complexity of the and its discriminative power. For each quarter, a centroid is calculated (Fig. 2). This permits us to keep minimal spatial information on the gray level repartition. Thus, without increasing the number of computations (which depends on the frame size and not on the number of centroids calculated), we passed from to 4 centroids and increased the precision. A test with 6 centroids has shown the limits of this approch: for /6 of the image s size, the motion of a centroid is not much of importance. The computation of the centroids location is biased toward brighter areas in order to amplify the centroids movements. This is done by a look-up table, so without any additional cost (Eq. 2). Then, the motion is less uniform and easier to be discriminated. f(l) = L (2) where L is the original gray level between 0 and 255. The of an image is composed of the eight values of the four centroids coordinates (x, y, x 2, y 2, x 3, y 3, x 4, y 4 ). The of a video consists of the s of its images. We have sub-sampled the image horizontally and vertically to pixel out of 8 to speed up the computation. This selection is founded on the redundancy due to the gray level correlation of neighbor pixels. Effectively, we can notice (Fig. 3) that the variations of a given coordinate are very similar before and after the sub-sampling of pixel out of 64, which has no harmful effect on the characterization of the image. Moreover, we can see that a subsampling of pixel out of 256 leads to noised variations. The is then less characteristic of the video than the previous versions. This sub-sampling strategy is very significant in terms of computation time for the creation of the s associated with a video base. This is even more important during the search phase, which begins with the online generation of the of the video extract to be compared. Variations of x Number of frame Figure 2: Example of a frame extracted from the movie Avengers, on which the four centroids are superimposed (one per quarter) Figure 3: Example of variations of the coordinate x during 0 s. Top: without sub-sampling, center: subsampling of pixel out of 64, bottom: sub-sampling of pixel out of 256

3 The uniqueness of the cannot of course be proved but we shall assume this uniqueness as a consequence of the complexity of the : 8n pixel locations, where n is the number of frames Searching a video using a video extract It is possible to perform fast comparisons in order to check a video toward a database starting from a short query based on the proposed. Among all the possible measures (the L and L 2 distances, the covariance, the correlation,...), the L -distance is the most robust as well as the least expensive in terms of computational requirements. Our process involves the computation of the distances between the centroids positions of the, (x, y) j, and those of the videos of the database, (x, y ) j. For frame i, the distance is (Eq. 3): D i = 4 ( x j x j + y j y j ) (3) j= The distances D i are computed between the images of the query and the images of each video of the base, from the beginning to the end. The queries are composed of 200 frames, which corresponds to 7 or 8 seconds of video. It is comparable to the length of the jingles used in audio experiments [8, 9]. In particular, it is shorter than most of commercials. Among these 200 frames, only one out of two is used for the comparison (Fig. 4). Indeed, 00 D i distances are required to comfort the uniqueness, and sufficient to ensure a good recognition, with a minimum computation cost. Obviously, taking 00 frames in a block of 200 images is more discriminant than choosing them consecutively, since there is generally more motion over 200 frames. To search the video, we use a sliding window which is moved by steps of four frames to reduce the scan time, assuming that during four consecutive frames the motion of the centroids is negligible. More formally, we have: At time t : At time t+ : D = 0 for i = nb nb + 200, i = i + 2 do D = D + D i D = 0 for i = nb + 4 nb , i = i + 2 do D = D + D i The decision about a query depends on an empirically determined adaptive threshold on the global distance values, which can take three values (high=60, medium=45, low=30). The actual value is fixed according to the temporal variance of the centroids coordinates. We call variance of a video extract the sum of the eight coordinate variances over the 00 frames which compose the. A video extract has experimentally been defined as static, moderately dynamic and dynamic when its variance is, respectively, bellow 400, between 400 and 800, and above 800 (Fig. 5 and 6). Recognition threshold Video Variance of the query Request shift (4 frames) t t time Figure 4: Representation of the comparisons between the video and the query, along time. Figure 5: Variation of the recognition threshold depending on the variance of the video extract over 00 frames. The smaller the variance (static scenes), the lower is the recognition threshold, since dynamic scene centroids have characteristic motion, which is less true for static ones. Hence, moderately dynamic and static scenes need a tighter threshold. We can remark that the L distance, (contrary to the correlation method) takes into account the relative position of the centroids in the image, and not only their motion. That is why the method is working even for static scenes.

4 Distance D Number of frame Figure 6: Distance between the video analyzed and the extract used as a request. This video query, a part of advertisement, is correctly located two times in the video, where the peaks are passing over the detection threshold. 3. RESULTS We have built a database of 53 Gb of data in order to obtain significant results. This corresponds to 0 hours of videos (i.e. more than 380 videos). There are English and French TV news, cartoons, advertisements, soap operas, many documentaries from the TREC video 2002 competition [0] and from the AIM corpus developed within the French inter-laboratory research group ISIS. These gray level or color videos are in Mpeg or DivX format. The samples are short video clips or long movies. The experimental protocol consists of a search, in the database, of a video using an extract of video. In order to evaluate our results, we have established a ground truth of our database. It allows us to estimate the effectiveness of the method in terms of recall and precision (R and P). Recall measures the relevant results returned, while precision measures the proportion of results returned that actually are relevant (Eq. 4, 5). In contrast to classical Content Based Image Retrieval, we are not looking for the n best similar videos, i.e. there is no result set window. As the search process ends up with a binary decision for any video, a recall of means that the actual number of copies is extracted and a precision of means that no more wrong videos are detected in the database. For these robustness tests, we have created different versions of 0 videos by changing, each time, one of their characteristics. It is important to indicate that all the searches done during those tests have been realized using the whole database, even if, for each request, we are interested only in a particular aspect of the videos. 3.. Random search in the database We are interested in the search of videos which do exist in the database, therefore we have randomly selected 50 video extracts of the database and used them as requests. The results are shown in Table. Recall in 00% of the cases Precision in 92% of the cases 0.67 in 2% of the cases 0.5 in 4% of the cases 0.07 in 2% of the cases Table : Results obtained for 50 video extracts randomly chosen as request among the 386 videos of the database. Assume that these 50 requests were not in the database. According to these precision ratios, we can conclude that in only 8% of the cases, the system returns a non empty answer Robustness tests within the database Robustness to modification of frame size and zooms Consequently to the normalization of the frame size, the method is not sensitive to global size modification. The zoomed videos on which we have done our experiments have been realized by cutting and resizing the frame to its initial size (Fig 7). The offers a recall of in 00% of the cases for a wide range of variations, and a slight decrease for extreme zooms: indeed, we obtain a recall of 0.9 for a 20% zoom in and 0.8 for a 23% zoom out. The precision is for 9 cases out of 0, and 0.47 for the last one. R = # relevant results returned # relevant results in the entire database (4) P = # relevant results returned # total results returned We give the results of a series of random requests in the database, and several illustrations of the robustness of the in different situations. (5) (a) (b) Figure 7: Example of zoom changes (a) original video, (b) video zoomed in by 20%.

5 Robustness to variations in contrast and brightness As far as contrast is concerned, our experiments have been realized for two types of modifications: an increase of 25% of the contrast, and then a decrease of 25% (Fig. 8, b and c). For these kinds of contrast variations, we obtained a recall of in all the experiments, and a precision of in 90% of the cases, which shows the reliability of the. Regarding brightness, the results were also very satisfying (Fig. 9), for decreases as well for increases of 5%, 0% or 20% (Fig. 8, d and e). The results in terms of recall and precision are recorded in Table 2. Recall Precision in 70% of the cases in 90% of the cases 0.83 in 20% of the cases 0.5 in 0% of the cases 0.82 in 0% of the cases (b) (c) (d) (e) (a) Figure 8: Example of contrast and brightness changes (a) original video, contrast with respect to the original (b) 25%, (c) +25%, brightness with respect to the original (d) 20%, (e) +20%. Nb of videos 0 Table 2: Results obtained after a change in brightness Robustness to change in compression format Our database is exclusively made of videos in Mpeg and DivX formats. Fifteen among these 386 videos are in both formats. They allow us to test the format effect as shown in Table 3. Video requests Compres. Video format num. -2 Mpeg DivX Nb of videos Recall Precision Table 3: Results obtained when the request and the video found are in a different compression format. The recall and precision ratios are maximal in most cases, and reach the value 0.5 in the worst case, which is satisfying. The video number 5 gives different results depending on the format of the request (Mpeg or DivX) Robustness after a noise addition The presence of noise in images is a current phenomenon as soon as a video is manipulated. We have therefore cho % of brightness Figure 9: Number of videos recognized, depending on brightness in the video. In each of this 0 cases, the request is the original sen to add gaussian white noise of variance v to the image, in order to test the robustness of the towards this aspect. Results for this kind of noisy query are shown in Table 4. Observing this table and Figure 0, we notice that the performances of the s are declining when v reaches 0%. Figure illustrates an image disturbed by this kind of noise Robustness to change in frame rate Concerning similar frame rates, such as 24 fps and 25 fps, the method is very robust. For more important variations we have proceeded as follows. The database contains videos at both 25 and 30 fps. In order to test the robustness towards a possible change in frame rate, we have transformed three videos from 25 to 30 fps and three others from 30 to 25 fps, be-

6 Variance Recall Precision (9/0) (8/0) v=2% 0.77 (/0) 0.67 (/0) 0.4 (/0) (6/0) (6/0) v=6% 0.69 (/0) 0.22 (/0) 0.5 (/0) 0.4 (/0) 0 (2/0) 0 (2/0) (3/0) v=0% (5/0) 0.3 (/0) 0 (5/0) 0.09 (/0) 0 (5/0) Table 4: Results obtained for videos distorted by a gaussian white noise of variance v. They are decreasing as the noise is growing. Nb of videos v parameter Original video at 25 fps Extraction of the Original at 25 fps Modification of the frame rate Re-sampling of the Modified video at 30 fps Recognition of the modified video in the base Modified at 30 fps Figure 2: Transformations performed to improve the detection of videos whose frame rates have been modified. transformations. We repeat this process with the s of videos which had a frame rate of 30 fps, and which have been transformed to 25 fps. In Figure 3, the curves represent the same in each of these cases. We can observe that the coordinate variations are nearly the same for the 30 fps video and after re-sampling the original 25 fps video. After this re-sampling, the recognition ratio is better than before. Indeed, we have recorded, in 00% of the cases (6/6), a recall ratio of, and the same precision ratio in 5 cases out of 6. Variations of x Figure 0: Number of original videos recognized using a noisy query image, depending on the value of the variance parameter applied Number of frame (a) (b) Figure : Example of noise addition (a) original video without noise, (b) noised video: variance v=0%. fore extracting their s. The results obtained by comparing the s at 25 fps and 30 fps were not satisfying. We then decided to proceed to their re-sampling. As a result, the s of the original videos at 25 fps have been re-sampled to create modified s at a simulated frame rate of 30 fps. Figure 2 shows those Figure 3: Example of variations of x. Top: of the original video (at 25 fps), center: of the modified video (at 30 fps), bottom: modified, obtained by re-sampling the original one Robustness after adding logos Here, we deal with the robustness of the towards videos on which a logo has been superimposed. We have considered realistic cases for this experiment: the superimposition of the logo has been done on one of the frame s borders. Its position, in a corner for instance, can have an

7 influence on the result, and above all its content: a white logo will attract the centroid more than a black one. Even after this modification, the original video and the one with the logo should still be considered as identical by the comparison method. We have considered the detection of videos with logos using queries without, and vice-versa (Tab. 5). We obtained a precision of, in 90% of the cases, after adding a small logo, and in 60% of the cases for a medium one. As we can see, the performances tend to decrease when logos reach large sizes. Our method remains relatively sensitive to the addition of an element to the frame. A solution could be to consider only the central part of the frame during the computation of centroids. Size of logo Logo size compared with the frame size Nb of videos detected if the request has... a logo Small 0.95% 0/0 9/0 Medium 3.3% 5/0 5/0 no logo Table 5: Number of original videos (without logo) detected by copies with logos, and vice-versa. 4. CONCLUSION In this paper, we have proposed an efficient and robust video copy detection tool using a spatio-temporal. It has been developed in order to detect videos which have the same content as a video query. These videos are spotted by considering the distance between the four centroids of the request and those of the videos analyzed, on 200 consecutive images. Using a database of more than 0 hours of video, we have obtained a recall of for each of the 50 video extracts randomly drawn in the database, and the same precision in 92% of the cases. A good robustness to different transforms has also been observed, but a problem still remains concerning large logo superimposition. The size of the s is 72 Mb for the whole database, which is 730 times less than the videos themselves. The search only takes about 20 seconds on a 2 GHz Celeron processor. A future implementation, however, will accelerate this phase thanks to an intelligent scanning of the database. Combining audio fingerprinting to the current is also studied to improve results, as well as an improvement of the robustness against complex distortions of the signal, such as embedded videos. 5. REFERENCES [] S-C.S. Cheung and A. Zakhor, Estimation of web video multiplicity, in Proceedings of the SPIE - Internet Imaging, San Jose, California. January 22-28, 2000, pp [2] S-C.S. Cheung and A. Zakhor, Efficient video similarity measurement with video, IEEE Transactions on Circuits and Systems for Video Technology, vol. 3, no., pp , [3] A. Joly, C. Frelicot, and O. Buisson, Robust content-based video copy identification in a large reference database, in Conference on Image and Video Retrieval, CIVR 2003, Urbana-Champaign, IL, USA, July 24-25, 2003, pp [4] P. Indyk, G. Iyengar, and N. Shivakumar, Finding pirated video sequences on the internet, Tech. Rep., Computer Science Department, Stanford University, 999. [5] A. Hampapur, KH Hyun, and R. Bolle, Comparison of sequence matching techniques for video copy detection, in SPIE Conference on Storage and Retrieval for Media Databases, USA, January 20-25, [6] J. Oostveen, T. Kalder, and J. Haitsma, Feature extraction and a database strategy for video fingerprinting, in VISUAL 2002, Taipei, Taiwan, March -3, [7] T.C. Hoad and J. Zobel, Fast video matching with alignment, in Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, Berkeley, California, USA, November 7, [8] E. Allamanche, J. Herre, O. Hellmuth, B. Froba, T. Kastner, and M. Cremer, Content-based identification of audio material using mpeg-7 low level description, in Proceedings of the Second Annual International Symposium on Music Information Retrieval: ISMIR 200, Indiana, USA, October 5-7, 200. [9] P. Cano, E. Batlle, T. Kalker, and J. Haitsma, A review of algorithms for audio fingerprinting, in International Workshop on Multimedia Signal Processing, US Virgin Islands, December, [0] Trec video 2002,

A Robust Video Hash Scheme Based on. 2D-DCT Temporal Maximum Occurrence

A Robust Video Hash Scheme Based on. 2D-DCT Temporal Maximum Occurrence A Robust Video Hash Scheme Based on 1 2D-DCT Temporal Maximum Occurrence Qian Chen, Jun Tian, and Dapeng Wu Abstract In this paper, we propose a video hash scheme that utilizes image hash and spatio-temporal

More information

IN THE LAST decade, the amount of video contents digitally

IN THE LAST decade, the amount of video contents digitally IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 18, NO. 7, JULY 2008 983 Robust Video Fingerprinting for Content-Based Video Identification Sunil Lee, Member, IEEE, and Chang D. Yoo,

More information

Copyright Detection System for Videos Using TIRI-DCT Algorithm

Copyright Detection System for Videos Using TIRI-DCT Algorithm Research Journal of Applied Sciences, Engineering and Technology 4(24): 5391-5396, 2012 ISSN: 2040-7467 Maxwell Scientific Organization, 2012 Submitted: March 18, 2012 Accepted: June 15, 2012 Published:

More information

Comparison of Sequence Matching Techniques for Video Copy Detection

Comparison of Sequence Matching Techniques for Video Copy Detection Comparison of Sequence Matching Techniques for Video Copy Detection Arun Hampapur a, Ki-Ho Hyun b and Ruud Bolle a a IBM T.J Watson Research Center, 3 Saw Mill River Road, Hawthorne, NY 1532, USA b School

More information

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING

DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING DUPLICATE DETECTION AND AUDIO THUMBNAILS WITH AUDIO FINGERPRINTING Christopher Burges, Daniel Plastina, John Platt, Erin Renshaw, and Henrique Malvar March 24 Technical Report MSR-TR-24-19 Audio fingerprinting

More information

Time Stamp Detection and Recognition in Video Frames

Time Stamp Detection and Recognition in Video Frames Time Stamp Detection and Recognition in Video Frames Nongluk Covavisaruch and Chetsada Saengpanit Department of Computer Engineering, Chulalongkorn University, Bangkok 10330, Thailand E-mail: nongluk.c@chula.ac.th

More information

AN IMPROVISED LOSSLESS DATA-HIDING MECHANISM FOR IMAGE AUTHENTICATION BASED HISTOGRAM MODIFICATION

AN IMPROVISED LOSSLESS DATA-HIDING MECHANISM FOR IMAGE AUTHENTICATION BASED HISTOGRAM MODIFICATION AN IMPROVISED LOSSLESS DATA-HIDING MECHANISM FOR IMAGE AUTHENTICATION BASED HISTOGRAM MODIFICATION Shaik Shaheena 1, B. L. Sirisha 2 VR Siddhartha Engineering College, Vijayawada, Krishna, Andhra Pradesh(520007),

More information

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain Automatic Video Caption Detection and Extraction in the DCT Compressed Domain Chin-Fu Tsao 1, Yu-Hao Chen 1, Jin-Hau Kuo 1, Chia-wei Lin 1, and Ja-Ling Wu 1,2 1 Communication and Multimedia Laboratory,

More information

A Robust Wipe Detection Algorithm

A Robust Wipe Detection Algorithm A Robust Wipe Detection Algorithm C. W. Ngo, T. C. Pong & R. T. Chin Department of Computer Science The Hong Kong University of Science & Technology Clear Water Bay, Kowloon, Hong Kong Email: fcwngo, tcpong,

More information

Elimination of Duplicate Videos in Video Sharing Sites

Elimination of Duplicate Videos in Video Sharing Sites Elimination of Duplicate Videos in Video Sharing Sites Narendra Kumar S, Murugan S, Krishnaveni R Abstract - In some social video networking sites such as YouTube, there exists large numbers of duplicate

More information

Real-time Monitoring System for TV Commercials Using Video Features

Real-time Monitoring System for TV Commercials Using Video Features Real-time Monitoring System for TV Commercials Using Video Features Sung Hwan Lee, Won Young Yoo, and Young-Suk Yoon Electronics and Telecommunications Research Institute (ETRI), 11 Gajeong-dong, Yuseong-gu,

More information

AIIA shot boundary detection at TRECVID 2006

AIIA shot boundary detection at TRECVID 2006 AIIA shot boundary detection at TRECVID 6 Z. Černeková, N. Nikolaidis and I. Pitas Artificial Intelligence and Information Analysis Laboratory Department of Informatics Aristotle University of Thessaloniki

More information

A Short Introduction to Audio Fingerprinting with a Focus on Shazam

A Short Introduction to Audio Fingerprinting with a Focus on Shazam A Short Introduction to Audio Fingerprinting with a Focus on Shazam MUS-17 Simon Froitzheim July 5, 2017 Introduction Audio fingerprinting is the process of encoding a (potentially) unlabeled piece of

More information

Content Based Video Copy Detection: Issues and Practices

Content Based Video Copy Detection: Issues and Practices Content Based Video Copy Detection: Issues and Practices Sanjoy Kumar Saha CSE Department, Jadavpur University Kolkata, India With the rapid development in the field of multimedia technology, it has become

More information

Video Key-Frame Extraction using Entropy value as Global and Local Feature

Video Key-Frame Extraction using Entropy value as Global and Local Feature Video Key-Frame Extraction using Entropy value as Global and Local Feature Siddu. P Algur #1, Vivek. R *2 # Department of Information Science Engineering, B.V. Bhoomraddi College of Engineering and Technology

More information

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Partha Sarathi Giri Department of Electronics and Communication, M.E.M.S, Balasore, Odisha Abstract Text data

More information

Fingerprint Image Enhancement Algorithm and Performance Evaluation

Fingerprint Image Enhancement Algorithm and Performance Evaluation Fingerprint Image Enhancement Algorithm and Performance Evaluation Naja M I, Rajesh R M Tech Student, College of Engineering, Perumon, Perinad, Kerala, India Project Manager, NEST GROUP, Techno Park, TVM,

More information

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS

DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS DIGITAL TELEVISION 1. DIGITAL VIDEO FUNDAMENTALS Television services in Europe currently broadcast video at a frame rate of 25 Hz. Each frame consists of two interlaced fields, giving a field rate of 50

More information

Block Mean Value Based Image Perceptual Hashing for Content Identification

Block Mean Value Based Image Perceptual Hashing for Content Identification Block Mean Value Based Image Perceptual Hashing for Content Identification Abstract. Image perceptual hashing has been proposed to identify or authenticate image contents in a robust way against distortions

More information

Detector. Flash. Detector

Detector. Flash. Detector CLIPS at TRECvid: Shot Boundary Detection and Feature Detection Georges M. Quénot, Daniel Moraru, and Laurent Besacier CLIPS-IMAG, BP53, 38041 Grenoble Cedex 9, France Georges.Quenot@imag.fr Abstract This

More information

Perceptual Quality Improvement of Stereoscopic Images

Perceptual Quality Improvement of Stereoscopic Images Perceptual Quality Improvement of Stereoscopic Images Jong In Gil and Manbae Kim Dept. of Computer and Communications Engineering Kangwon National University Chunchon, Republic of Korea, 200-701 E-mail:

More information

Content Based Video Copy Detection Based on Motion Vectors Estimated Using a Lower Frame Rate

Content Based Video Copy Detection Based on Motion Vectors Estimated Using a Lower Frame Rate Signal, Image and Video Processing manuscript No. (will be inserted by the editor) Content Based Video Copy Detection Based on Motion Vectors Estimated Using a Lower Frame Rate Kasım Taşdemir A. Enis Çetin

More information

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University. 3D Computer Vision Structured Light II Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Introduction

More information

Invisible Watermarking Using Eludician Distance and DWT Technique

Invisible Watermarking Using Eludician Distance and DWT Technique Invisible Watermarking Using Eludician Distance and DWT Technique AMARJYOTI BARSAGADE # AND AWADHESH K.G. KANDU* 2 # Department of Electronics and Communication Engineering, Gargi Institute of Science

More information

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Structured Light II Johannes Köhler Johannes.koehler@dfki.de Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Introduction Previous lecture: Structured Light I Active Scanning Camera/emitter

More information

Multi-pass approach to adaptive thresholding based image segmentation

Multi-pass approach to adaptive thresholding based image segmentation 1 Multi-pass approach to adaptive thresholding based image segmentation Abstract - Thresholding is still one of the most common approaches to monochrome image segmentation. It often provides sufficient

More information

Locating 1-D Bar Codes in DCT-Domain

Locating 1-D Bar Codes in DCT-Domain Edith Cowan University Research Online ECU Publications Pre. 2011 2006 Locating 1-D Bar Codes in DCT-Domain Alexander Tropf Edith Cowan University Douglas Chai Edith Cowan University 10.1109/ICASSP.2006.1660449

More information

CHAPTER-4 WATERMARKING OF GRAY IMAGES

CHAPTER-4 WATERMARKING OF GRAY IMAGES CHAPTER-4 WATERMARKING OF GRAY IMAGES 4.1 INTRODUCTION Like most DCT based watermarking schemes, Middle-Band Coefficient Exchange scheme has proven its robustness against those attacks, which anyhow, do

More information

CHAPTER 5 MOTION DETECTION AND ANALYSIS

CHAPTER 5 MOTION DETECTION AND ANALYSIS CHAPTER 5 MOTION DETECTION AND ANALYSIS 5.1. Introduction: Motion processing is gaining an intense attention from the researchers with the progress in motion studies and processing competence. A series

More information

TEVI: Text Extraction for Video Indexing

TEVI: Text Extraction for Video Indexing TEVI: Text Extraction for Video Indexing Hichem KARRAY, Mohamed SALAH, Adel M. ALIMI REGIM: Research Group on Intelligent Machines, EIS, University of Sfax, Tunisia hichem.karray@ieee.org mohamed_salah@laposte.net

More information

Similar Fragment Retrieval of Animations by a Bag-of-features Approach

Similar Fragment Retrieval of Animations by a Bag-of-features Approach Similar Fragment Retrieval of Animations by a Bag-of-features Approach Weihan Sun, Koichi Kise Dept. Computer Science and Intelligent Systems Osaka Prefecture University Osaka, Japan sunweihan@m.cs.osakafu-u.ac.jp,

More information

Region-based Segmentation

Region-based Segmentation Region-based Segmentation Image Segmentation Group similar components (such as, pixels in an image, image frames in a video) to obtain a compact representation. Applications: Finding tumors, veins, etc.

More information

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VI (Nov Dec. 2014), PP 29-33 Analysis of Image and Video Using Color, Texture and Shape Features

More information

A Robust Video Copy Detection System using TIRI-DCT and DWT Fingerprints

A Robust Video Copy Detection System using TIRI-DCT and DWT Fingerprints A Robust Video Copy Detection System using and Fingerprints Devi. S, Assistant Professor, Department of CSE, PET Engineering College, Vallioor, Tirunelvel N. Vishwanath, Phd, Professor, Department of CSE,

More information

Scalable Coding of Image Collections with Embedded Descriptors

Scalable Coding of Image Collections with Embedded Descriptors Scalable Coding of Image Collections with Embedded Descriptors N. Adami, A. Boschetti, R. Leonardi, P. Migliorati Department of Electronic for Automation, University of Brescia Via Branze, 38, Brescia,

More information

Repeating Segment Detection in Songs using Audio Fingerprint Matching

Repeating Segment Detection in Songs using Audio Fingerprint Matching Repeating Segment Detection in Songs using Audio Fingerprint Matching Regunathan Radhakrishnan and Wenyu Jiang Dolby Laboratories Inc, San Francisco, USA E-mail: regu.r@dolby.com Institute for Infocomm

More information

A Fast Shot Matching Strategy for Detecting Duplicate Sequences in a Television Stream

A Fast Shot Matching Strategy for Detecting Duplicate Sequences in a Television Stream A Fast Shot Matching Strategy for Detecting Duplicate Sequences in a Television Stream Xavier Naturel IRISA-INRIA Rennes Campus de Beaulieu Rennes, France Patrick Gros IRISA-CNRS Campus de Beaulieu Rennes,

More information

AN EFFECTIVE APPROACH FOR VIDEO COPY DETECTION USING SIFT FEATURES

AN EFFECTIVE APPROACH FOR VIDEO COPY DETECTION USING SIFT FEATURES AN EFFECTIVE APPROACH FOR VIDEO COPY DETECTION USING SIFT FEATURES Miss. S. V. Eksambekar 1 Prof. P.D.Pange 2 1, 2 Department of Electronics & Telecommunication, Ashokrao Mane Group of Intuitions, Wathar

More information

ACCURACY AND STABILITY IMPROVEMENT OF TOMOGRAPHY VIDEO SIGNATURES

ACCURACY AND STABILITY IMPROVEMENT OF TOMOGRAPHY VIDEO SIGNATURES ACCURACY AND STABILITY IMPROVEMENT OF TOMOGRAPHY VIDEO SIGNATURES POSSOS, Sebastian. KALVA, Hari Department of Computer Science and Engineering, Florida Atlantic University, Boca Raton, FL 33431 Email:

More information

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM 1 PHYO THET KHIN, 2 LAI LAI WIN KYI 1,2 Department of Information Technology, Mandalay Technological University The Republic of the Union of Myanmar

More information

Video Inter-frame Forgery Identification Based on Optical Flow Consistency

Video Inter-frame Forgery Identification Based on Optical Flow Consistency Sensors & Transducers 24 by IFSA Publishing, S. L. http://www.sensorsportal.com Video Inter-frame Forgery Identification Based on Optical Flow Consistency Qi Wang, Zhaohong Li, Zhenzhen Zhang, Qinglong

More information

Texture Segmentation by Windowed Projection

Texture Segmentation by Windowed Projection Texture Segmentation by Windowed Projection 1, 2 Fan-Chen Tseng, 2 Ching-Chi Hsu, 2 Chiou-Shann Fuh 1 Department of Electronic Engineering National I-Lan Institute of Technology e-mail : fctseng@ccmail.ilantech.edu.tw

More information

Fast and Robust Short Video Clip Search for Copy Detection

Fast and Robust Short Video Clip Search for Copy Detection Fast and Robust Short Video Clip Search for Copy Detection Junsong Yuan 1,2, Ling-Yu Duan 1, Qi Tian 1, Surendra Ranganath 2, and Changsheng Xu 1 1 Institute for Infocomm Research, 21 Heng Mui Keng Terrace,

More information

A Video Watermarking Algorithm Based on the Human Visual System Properties

A Video Watermarking Algorithm Based on the Human Visual System Properties A Video Watermarking Algorithm Based on the Human Visual System Properties Ji-Young Moon 1 and Yo-Sung Ho 2 1 Samsung Electronics Co., LTD 416, Maetan3-dong, Paldal-gu, Suwon-si, Gyenggi-do, Korea jiyoung.moon@samsung.com

More information

Searching Video Collections:Part I

Searching Video Collections:Part I Searching Video Collections:Part I Introduction to Multimedia Information Retrieval Multimedia Representation Visual Features (Still Images and Image Sequences) Color Texture Shape Edges Objects, Motion

More information

Adaptive Fingerprint Image Enhancement Techniques and Performance Evaluations

Adaptive Fingerprint Image Enhancement Techniques and Performance Evaluations Adaptive Fingerprint Image Enhancement Techniques and Performance Evaluations Kanpariya Nilam [1], Rahul Joshi [2] [1] PG Student, PIET, WAGHODIYA [2] Assistant Professor, PIET WAGHODIYA ABSTRACT: Image

More information

Robust color segmentation algorithms in illumination variation conditions

Robust color segmentation algorithms in illumination variation conditions 286 CHINESE OPTICS LETTERS / Vol. 8, No. / March 10, 2010 Robust color segmentation algorithms in illumination variation conditions Jinhui Lan ( ) and Kai Shen ( Department of Measurement and Control Technologies,

More information

Scene Change Detection Based on Twice Difference of Luminance Histograms

Scene Change Detection Based on Twice Difference of Luminance Histograms Scene Change Detection Based on Twice Difference of Luminance Histograms Xinying Wang 1, K.N.Plataniotis 2, A. N. Venetsanopoulos 1 1 Department of Electrical & Computer Engineering University of Toronto

More information

28 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 12, NO. 1, JANUARY 2010

28 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 12, NO. 1, JANUARY 2010 28 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 12, NO. 1, JANUARY 2010 Camera Motion-Based Analysis of User Generated Video Golnaz Abdollahian, Student Member, IEEE, Cuneyt M. Taskiran, Member, IEEE, Zygmunt

More information

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK Xiangyun HU, Zuxun ZHANG, Jianqing ZHANG Wuhan Technique University of Surveying and Mapping,

More information

Fast Denoising for Moving Object Detection by An Extended Structural Fitness Algorithm

Fast Denoising for Moving Object Detection by An Extended Structural Fitness Algorithm Fast Denoising for Moving Object Detection by An Extended Structural Fitness Algorithm ALBERTO FARO, DANIELA GIORDANO, CONCETTO SPAMPINATO Dipartimento di Ingegneria Informatica e Telecomunicazioni Facoltà

More information

VC 12/13 T16 Video Compression

VC 12/13 T16 Video Compression VC 12/13 T16 Video Compression Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos Miguel Tavares Coimbra Outline The need for compression Types of redundancy

More information

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm Group 1: Mina A. Makar Stanford University mamakar@stanford.edu Abstract In this report, we investigate the application of the Scale-Invariant

More information

Fast Fuzzy Clustering of Infrared Images. 2. brfcm

Fast Fuzzy Clustering of Infrared Images. 2. brfcm Fast Fuzzy Clustering of Infrared Images Steven Eschrich, Jingwei Ke, Lawrence O. Hall and Dmitry B. Goldgof Department of Computer Science and Engineering, ENB 118 University of South Florida 4202 E.

More information

Video shot segmentation using late fusion technique

Video shot segmentation using late fusion technique Video shot segmentation using late fusion technique by C. Krishna Mohan, N. Dhananjaya, B.Yegnanarayana in Proc. Seventh International Conference on Machine Learning and Applications, 2008, San Diego,

More information

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.11, November 2013 1 Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial

More information

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Project Title: Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding. Midterm Report CS 584 Multimedia Communications Submitted by: Syed Jawwad Bukhari 2004-03-0028 About

More information

Effects Of Shadow On Canny Edge Detection through a camera

Effects Of Shadow On Canny Edge Detection through a camera 1523 Effects Of Shadow On Canny Edge Detection through a camera Srajit Mehrotra Shadow causes errors in computer vision as it is difficult to detect objects that are under the influence of shadows. Shadow

More information

Scanner Parameter Estimation Using Bilevel Scans of Star Charts

Scanner Parameter Estimation Using Bilevel Scans of Star Charts ICDAR, Seattle WA September Scanner Parameter Estimation Using Bilevel Scans of Star Charts Elisa H. Barney Smith Electrical and Computer Engineering Department Boise State University, Boise, Idaho 8375

More information

Unsupervised Camera Motion Estimation and Moving Object Detection in Videos

Unsupervised Camera Motion Estimation and Moving Object Detection in Videos Proceedings of the Irish Machine Vision and Image Processing conference, pp. 102-109, 2006 Unsupervised Camera Motion Estimation and Moving Object Detection in Videos Rozenn Dahyot School of Computer Science

More information

Redundancy and Correlation: Temporal

Redundancy and Correlation: Temporal Redundancy and Correlation: Temporal Mother and Daughter CIF 352 x 288 Frame 60 Frame 61 Time Copyright 2007 by Lina J. Karam 1 Motion Estimation and Compensation Video is a sequence of frames (images)

More information

Multimedia Database Systems. Retrieval by Content

Multimedia Database Systems. Retrieval by Content Multimedia Database Systems Retrieval by Content MIR Motivation Large volumes of data world-wide are not only based on text: Satellite images (oil spill), deep space images (NASA) Medical images (X-rays,

More information

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval Lesson 11 Media Retrieval Information Retrieval Image Retrieval Video Retrieval Audio Retrieval Information Retrieval Retrieval = Query + Search Informational Retrieval: Get required information from database/web

More information

Mesh Based Interpolative Coding (MBIC)

Mesh Based Interpolative Coding (MBIC) Mesh Based Interpolative Coding (MBIC) Eckhart Baum, Joachim Speidel Institut für Nachrichtenübertragung, University of Stuttgart An alternative method to H.6 encoding of moving images at bit rates below

More information

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Gaussian Weighted Histogram Intersection for License Plate Classification, Pattern

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Gaussian Weighted Histogram Intersection for License Plate Classification, Pattern [6] IEEE. Reprinted, with permission, from [Wening Jia, Gaussian Weighted Histogram Intersection for License Plate Classification, Pattern Recognition, 6. ICPR 6. 8th International Conference on (Volume:3

More information

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV

Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV Comparative Study of Partial Closed-loop Versus Open-loop Motion Estimation for Coding of HDTV Jeffrey S. McVeigh 1 and Siu-Wai Wu 2 1 Carnegie Mellon University Department of Electrical and Computer Engineering

More information

Object Detection in Video Streams

Object Detection in Video Streams Object Detection in Video Streams Sandhya S Deore* *Assistant Professor Dept. of Computer Engg., SRES COE Kopargaon *sandhya.deore@gmail.com ABSTRACT Object Detection is the most challenging area in video

More information

Tracking of video objects using a backward projection technique

Tracking of video objects using a backward projection technique Tracking of video objects using a backward projection technique Stéphane Pateux IRISA/INRIA, Temics Project Campus Universitaire de Beaulieu 35042 Rennes Cedex, FRANCE ABSTRACT In this paper, we present

More information

Character Recognition

Character Recognition Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches

More information

Content-Based Real Time Video Copy Detection Using Hadoop

Content-Based Real Time Video Copy Detection Using Hadoop IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Volume 6, PP 70-74 www.iosrjen.org Content-Based Real Time Video Copy Detection Using Hadoop Pramodini Kamble 1, Priyanka

More information

MAXIMIZING BANDWIDTH EFFICIENCY

MAXIMIZING BANDWIDTH EFFICIENCY MAXIMIZING BANDWIDTH EFFICIENCY Benefits of Mezzanine Encoding Rev PA1 Ericsson AB 2016 1 (19) 1 Motivation 1.1 Consumption of Available Bandwidth Pressure on available fiber bandwidth continues to outpace

More information

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform

Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block Transform Circuits and Systems, 2010, 1, 12-17 doi:10.4236/cs.2010.11003 Published Online July 2010 (http://www.scirp.org/journal/cs) Fast Implementation of VC-1 with Modified Motion Estimation and Adaptive Block

More information

Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH

Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH 2011 International Conference on Document Analysis and Recognition Real-Time Document Image Retrieval for a 10 Million Pages Database with a Memory Efficient and Stability Improved LLAH Kazutaka Takeda,

More information

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image [6] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image Matching Methods, Video and Signal Based Surveillance, 6. AVSS

More information

8.5 Application Examples

8.5 Application Examples 8.5 Application Examples 8.5.1 Genre Recognition Goal Assign a genre to a given video, e.g., movie, newscast, commercial, music clip, etc.) Technology Combine many parameters of the physical level to compute

More information

Robust Lossless Data Hiding. Outline

Robust Lossless Data Hiding. Outline Robust Lossless Data Hiding Yun Q. Shi, Zhicheng Ni, Nirwan Ansari Electrical and Computer Engineering New Jersey Institute of Technology October 2010 1 Outline What is lossless data hiding Existing robust

More information

DATA and signal modeling for images and video sequences. Region-Based Representations of Image and Video: Segmentation Tools for Multimedia Services

DATA and signal modeling for images and video sequences. Region-Based Representations of Image and Video: Segmentation Tools for Multimedia Services IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 8, DECEMBER 1999 1147 Region-Based Representations of Image and Video: Segmentation Tools for Multimedia Services P. Salembier,

More information

Modified SPIHT Image Coder For Wireless Communication

Modified SPIHT Image Coder For Wireless Communication Modified SPIHT Image Coder For Wireless Communication M. B. I. REAZ, M. AKTER, F. MOHD-YASIN Faculty of Engineering Multimedia University 63100 Cyberjaya, Selangor Malaysia Abstract: - The Set Partitioning

More information

Image Matching Using Run-Length Feature

Image Matching Using Run-Length Feature Image Matching Using Run-Length Feature Yung-Kuan Chan and Chin-Chen Chang Department of Computer Science and Information Engineering National Chung Cheng University, Chiayi, Taiwan, 621, R.O.C. E-mail:{chan,

More information

Evaluation of GIST descriptors for web scale image search

Evaluation of GIST descriptors for web scale image search Evaluation of GIST descriptors for web scale image search Matthijs Douze Hervé Jégou, Harsimrat Sandhawalia, Laurent Amsaleg and Cordelia Schmid INRIA Grenoble, France July 9, 2009 Evaluation of GIST for

More information

5. Feature Extraction from Images

5. Feature Extraction from Images 5. Feature Extraction from Images Aim of this Chapter: Learn the Basic Feature Extraction Methods for Images Main features: Color Texture Edges Wie funktioniert ein Mustererkennungssystem Test Data x i

More information

Binju Bentex *1, Shandry K. K 2. PG Student, Department of Computer Science, College Of Engineering, Kidangoor, Kottayam, Kerala, India

Binju Bentex *1, Shandry K. K 2. PG Student, Department of Computer Science, College Of Engineering, Kidangoor, Kottayam, Kerala, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Survey on Summarization of Multiple User-Generated

More information

Compression of Stereo Images using a Huffman-Zip Scheme

Compression of Stereo Images using a Huffman-Zip Scheme Compression of Stereo Images using a Huffman-Zip Scheme John Hamann, Vickey Yeh Department of Electrical Engineering, Stanford University Stanford, CA 94304 jhamann@stanford.edu, vickey@stanford.edu Abstract

More information

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain Author manuscript, published in "International Symposium on Broadband Multimedia Systems and Broadcasting, Bilbao : Spain (2009)" One-pass bitrate control for MPEG-4 Scalable Video Coding using ρ-domain

More information

Non-Linearly Quantized Moment Shadow Maps

Non-Linearly Quantized Moment Shadow Maps Non-Linearly Quantized Moment Shadow Maps Christoph Peters 2017-07-30 High-Performance Graphics 2017 These slides include presenter s notes for your convenience. 1 In this presentation we discuss non-linearly

More information

DESIGNING A REAL TIME SYSTEM FOR CAR NUMBER DETECTION USING DISCRETE HOPFIELD NETWORK

DESIGNING A REAL TIME SYSTEM FOR CAR NUMBER DETECTION USING DISCRETE HOPFIELD NETWORK DESIGNING A REAL TIME SYSTEM FOR CAR NUMBER DETECTION USING DISCRETE HOPFIELD NETWORK A.BANERJEE 1, K.BASU 2 and A.KONAR 3 COMPUTER VISION AND ROBOTICS LAB ELECTRONICS AND TELECOMMUNICATION ENGG JADAVPUR

More information

Fast Min-hashing Indexing and Robust Spatiotemporal Matching for Detecting Video Copies

Fast Min-hashing Indexing and Robust Spatiotemporal Matching for Detecting Video Copies To Appear in ACM Transactions on Multimedia Computing, Communications, and Applications, 2010 Fast Min-hashing Indexing and Robust Spatiotemporal Matching for Detecting Video Copies CHIH-YI CHIU Institute

More information

FPGA IMPLEMENTATION FOR REAL TIME SOBEL EDGE DETECTOR BLOCK USING 3-LINE BUFFERS

FPGA IMPLEMENTATION FOR REAL TIME SOBEL EDGE DETECTOR BLOCK USING 3-LINE BUFFERS FPGA IMPLEMENTATION FOR REAL TIME SOBEL EDGE DETECTOR BLOCK USING 3-LINE BUFFERS 1 RONNIE O. SERFA JUAN, 2 CHAN SU PARK, 3 HI SEOK KIM, 4 HYEONG WOO CHA 1,2,3,4 CheongJu University E-maul: 1 engr_serfs@yahoo.com,

More information

QR Code Watermarking Algorithm based on Wavelet Transform

QR Code Watermarking Algorithm based on Wavelet Transform 2013 13th International Symposium on Communications and Information Technologies (ISCIT) QR Code Watermarking Algorithm based on Wavelet Transform Jantana Panyavaraporn 1, Paramate Horkaew 2, Wannaree

More information

Integrating Low-Level and Semantic Visual Cues for Improved Image-to-Video Experiences

Integrating Low-Level and Semantic Visual Cues for Improved Image-to-Video Experiences Integrating Low-Level and Semantic Visual Cues for Improved Image-to-Video Experiences Pedro Pinho, Joel Baltazar, Fernando Pereira Instituto Superior Técnico - Instituto de Telecomunicações IST, Av. Rovisco

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Fundamentals of Image Compression DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR) IN SIGNAL PROCESSING IMPERIAL COLLEGE LONDON Compression New techniques have led to the development

More information

EXTRACTING TEXT FROM VIDEO

EXTRACTING TEXT FROM VIDEO EXTRACTING TEXT FROM VIDEO Jayshree Ghorpade 1, Raviraj Palvankar 2, Ajinkya Patankar 3 and Snehal Rathi 4 1 Department of Computer Engineering, MIT COE, Pune, India jayshree.aj@gmail.com 2 Department

More information

Text Area Detection from Video Frames

Text Area Detection from Video Frames Text Area Detection from Video Frames 1 Text Area Detection from Video Frames Xiangrong Chen, Hongjiang Zhang Microsoft Research China chxr@yahoo.com, hjzhang@microsoft.com Abstract. Text area detection

More information

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications:

Chapter 11.3 MPEG-2. MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Chapter 11.3 MPEG-2 MPEG-2: For higher quality video at a bit-rate of more than 4 Mbps Defined seven profiles aimed at different applications: Simple, Main, SNR scalable, Spatially scalable, High, 4:2:2,

More information

A DWT, DCT AND SVD BASED WATERMARKING TECHNIQUE TO PROTECT THE IMAGE PIRACY

A DWT, DCT AND SVD BASED WATERMARKING TECHNIQUE TO PROTECT THE IMAGE PIRACY A DWT, DCT AND SVD BASED WATERMARKING TECHNIQUE TO PROTECT THE IMAGE PIRACY Md. Maklachur Rahman 1 1 Department of Computer Science and Engineering, Chittagong University of Engineering and Technology,

More information

IDIAP IDIAP. Martigny ffl Valais ffl Suisse

IDIAP IDIAP. Martigny ffl Valais ffl Suisse R E S E A R C H R E P O R T IDIAP IDIAP Martigny - Valais - Suisse ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO Datong Chen, Kim Shearer IDIAP Case Postale 592 Martigny Switzerland IDIAP RR 00-37 Nov.

More information

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Literature Survey Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Omer Shakil Abstract This literature survey compares various methods

More information

IST MPEG-4 Video Compliant Framework

IST MPEG-4 Video Compliant Framework IST MPEG-4 Video Compliant Framework João Valentim, Paulo Nunes, Fernando Pereira Instituto de Telecomunicações, Instituto Superior Técnico, Av. Rovisco Pais, 1049-001 Lisboa, Portugal Abstract This paper

More information

New Edge-Enhanced Error Diffusion Algorithm Based on the Error Sum Criterion

New Edge-Enhanced Error Diffusion Algorithm Based on the Error Sum Criterion New Edge-Enhanced Error Diffusion Algorithm Based on the Error Sum Criterion Jae Ho Kim* Tae Il Chung Hyung Soon Kim* Kyung Sik Son* Pusan National University Image and Communication Laboratory San 3,

More information

DIGITAL IMAGE WATERMARKING BASED ON A RELATION BETWEEN SPATIAL AND FREQUENCY DOMAINS

DIGITAL IMAGE WATERMARKING BASED ON A RELATION BETWEEN SPATIAL AND FREQUENCY DOMAINS DIGITAL IMAGE WATERMARKING BASED ON A RELATION BETWEEN SPATIAL AND FREQUENCY DOMAINS Murat Furat Mustafa Oral e-mail: mfurat@cu.edu.tr e-mail: moral@mku.edu.tr Cukurova University, Faculty of Engineering,

More information