Video Syntax Analysis

Size: px
Start display at page:

Download "Video Syntax Analysis"

Transcription

1 1 Video Syntax Analysis Wei-Ta Chu 2008/10/9

2 Outline 2 Scene boundary detection Key frame selection

3 3 Announcement of HW #1 Shot Change Detection Goal: automatic shot change detection Requirements 1. Write a program to automatically perform shot change detection 2. Write a report that describes 2.1. How to run your program 2.2. What kinds of features or algorithms you used or compared Detection performance in precision and recall, or even ROC/PR curves

4 4 Announcement of HW #1 Shot Change Detection Evaluation data Homework submission Pack your programs and report into one zip file, and mail it to Deadline: 12:00, Oct. 26, 2008 Grade will be given based on detection performance and descriptions in your report.

5 Implementation 5 Do this homework with any tool you are familiar with. Related resources libmpeg2: FFMPEG: Opencv library: Matlab toolbox:

6 6 Scene Transition Graph Yeung, et al. Segmentation of video by clustering and graph analysis Computer Vision and Image Understanding, vol. 71, no. 1, pp , 1998.

7 Observations 7 Shots in a scene are often repetitive. We are able to classify shots by grouping shots of similar visual contents. Often, a scene is made up of temporally adjacent shots indicating their interrelationships.

8 Similarity of Video Shots 8 D(.,.) measures the dissimilarity between two image frames.

9 Similarity of Video Shots 9 Dissimilarity based on color histogram intersection Dissimilarity based on luminance projection Yeungand Liu, Eficient matching and clustering of video shots Proc. of IEEE International Conference on Image Processing, vol. 1, pp , 1995.

10 10 Representative Image Set for a Video Shot Selection of representative set is achieved by nonlinear temporal sampling

11 11 Representative Image Set for a Video Shot Only 2 to 5% of frames are needed in comparison to achieve good matching results. In addition to temporal subsampling, spatial subsampling can also be used to improve matching efficiency.

12 Clustering of Video Shots 12 Shots in the same cluster are similar Any other shot outside of the cluster must have a dissimilarity greater than the dissimilarity between any shot in the cluster. C i : the ith cluster

13 Clustering of Video Shots 13 Dissimilarity between two clusters: Using the shot pair, in which two shots are in two different clusters, that has the largest dissimilarity value. Dissimilarity between two clusters should be updated at each iteration.

14 Clustering of Video Shots 14

15 Time-Constrained Clustering 15 Any two shots that are far apart in time, even if they share similar visual contents, they potentially represent different contents or occur in different scenes. Temporal distance between two shots The distance in number of frames The distance in number of frames from the end of the earlier shot to the beginning of the latter one.

16 Scene Transition Graph 16 A scene transition graph is a directed graph with the property G=(V,E,F) V: each node represents a cluster of shots E: a directed edge is drawn from node U to W if there is a shot represented by node U that immediately precedes any shots represented by node W. F: a mapping that partitions the set of shots into clusters STG is able to represent compactly the structures of shots and the temporal flow of the story for many video programs.

17 Example of STG 17

18 Cut Edges 18 An edge is a cut edge, if when is removed, results in two disconnected graphs. Each partitioned STG G i represents the interactions of shots in a story unit.

19 19 STG After Time Confining and Cut Edges Finding

20 Framework 20 Shot segmentation Time-constrained clustering Building of scene transition graph Scene segmentation

21 Influences of Parameters 21 Without the knowledge of how long each individual scene lasts, T cannot be approximated well. If T is too large, shots from different scenes are clustered together. If T is too small, shots in the same scene may be separated into different scenes. It s les detrimental to have several story units represent a scene than to have one story unit represent several scenes.

22 Influences of Time Constraints 22 T = 20s. d t (B 1,B 3 ) > T STG B 1 B 2 A 1 A 2 A 3 Clustering results are {B 1,B 2 },{A 1,A 2,A 3 },{B 3,B 4 },{C 1 },{D 1 } Story unit results are {B 1,A 1,B 2,A 2,B 3,A 3,B 4 },{C 1 },{D 1 } B 3 B 4 C 1 D 1

23 Influences of Time Constraints 23 T = 20s. STG B 1 A 1 A 2 A 3 B 4 Clustering results are {B 1 },{B 2,B 3 },{A 1,A 2,A 3 },{B 4 },{C 1 },{D 1 } Story unit results are {B 1 },{A 1,B 2,A 2,B 3,A 3 },{B 4 },{C 1 },{D 1 } B 2 B 3 C1 D 1

24 Refined Analysis 24 Make the time-window more elastic Compute the duration of each story unit and adjust Given a story unit, examination of the next story unit by relaxing the temporal windows and reclustering the shots in these two units. If there exists at least one new cluster that contains shots from the two units, two story units are merged into one.

25 Refined Analysis 25

26 Example 26

27 Results 27 STG constructed from the sitcom Friends. There are frames, each at a spatial resolution of 320x240. There are 313 shots.

28 Results 28 Time-constrained clustering of video shots is able to identify individual story units. The resulting STG permits rapid nonlinear browsing of long video programs.

29 Variations of Clustering Parameters 29 Smaller delta values result in more clusters and thus more story units. Users often prefer over-segmentation rather than under-segmentation.

30 Refining the Segmentation Results 30 The first two story units in Scene 1 are merged into one. The number of story units in Scene is reduced from 4 to 2.

31 Conclusion 31 Analysis based on time-constrained clustering and scene transition graph analysis has contributed to the extraction of story units. The building of story structure provides nonlinear access to video contents. Identification, integration, and application of domain-dependent and semantic features tend to improve segmentation accuracy.

32 32 Scene Detection in Movies and TV shows Rasheed, et al. Scene detection in Holywood movies and tvshows Proc. of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp , 2003.

33 Introduction 33 Every year around 4500 motion pictures are released around the world spanning over approximately 9000 hours of video. Inexpensive and popular digital technology is available through cable and internet, such as video on demand. Accessing video content Detect shots and set of key frames Combine similar shots and form scenes or story units

34 Problems in Previous Works 34 A false color match between shots of two different scenes may wrongly combine scenes. Action scenes may be broken into many scenes for not satisfying the color matching criterion.

35 System Flowchart 35 BSC: backward shot coherence PSB: potential scene boundaries

36 Shot Detection 36 Based on histogram intersection 16 bin HSV normalized color histogram 8 bins for hue, 4 bins each for saturation and value

37 Key frame Selection 37 (1) Initially, the middle frame of the shot is selected and added to the null set K i. (2) Each frame within a shot is compared to every frame in K i. (3) If the frame differs from all previously chosen key frames by a fixed threshold, it is added in K i.

38 Shot-based Features 38 Shot length Shot motion content Estimate the parameters of a global affine motion model Calculate the difference between the actual and the reprojected motion of blocks

39 Shot Motion Content 39

40 Scene Boundary Detection Algorithm 40 Pass 1: Detecting potential scene boundaries based on color properties Pass 2: Removal of weak scene boundaries by analyzing the shot length and motion content

41 Potential Scene Boundaries 41 Backward shot coherence

42 Potential Scene Boundaries 42 Backward shot coherence Compute the shot coherence of the shot I in a window of previous shots Taking the maximum shot coherence in a window of length N Post-processing: if a pair of key frames of two adjacent potential scenes are similar, merger them into one scene.

43 Potential Scene Boundaries 43

44 Selection of Window Size 44 The computation of BSC is controlled by the selection of window size N. A memory parameter which mimics a human s ability to recall a shot seen in the past. If N is too large, it may span over several scenes. If N is too small, over-segmentation of video may be obtained. N=10 in this paper

45 Scene Dynamics Analysis 45 Scenes with weak structure are often broken in several scenes. E.g. action scenes non-repetitiveness of shots Scene dynamics Action scenes: larger SMC and smaller L The PSB between two consecutive scenes will be removed if SD of both scenes exceed a fixed threshold.

46 Scene Dynamics Analysis 46

47 Scene Representation 47 A shot is a good representative when The shot is shown several times (higher SC) The shot spans over longer period of time (larger shot length) The shot has minimal motion content (smaller SMC) Multiple faces are preferred.

48 Shot Goodness 48 A correlation matrix of dimension N X N is constructed where element (i,j) is the coherence of shot i with shot j. Three shots with the highest W are selected as candidate shots and face detection is performed.

49 Detection of Faces 49 A method based on skin detection is adopted. The middle frame of candidate shots are tested. Each isolated segment of skin is considered as face and the frame with highest votes is taken as the scene key frame. In the case of a tie or no face, the key frame of the shot with the highest goodness value is selected.

50 Scene Key Frame 50

51 More Examples 51

52 Experimental Results 52 Five movies, one sitcom, and one TV show Miss (false negative) False alarm (false positive)

53 Experimental Results 53 Slightly over segmentation is preferable over undersegmentation. While browsing a video, it s beter to have two segments of one scene rather than on segment consisting of two scenes.

54 Experimental Results 54

55 References 55 Rasheed, et al. Scene detection in Holywood movies and tvshows Proc. of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp , Yeung, et al. Segmentation of video by clustering and graph analysis Computer Vision and Image Understanding, vol. 71, no. 1, pp , Vendrig, et al. Systematic evaluation of logical story unit segmentation IEEE Transactions on Multimedia, vol. 4, no. 4, pp , 2002.

56 56 Brief Introduction of Montage

57 Montage 57 Montage refers to the editing of the film, the cutting and piecing together of exposed film in a manner that best conveys the intent of the work

58 Methods of Montage 58 Metric The editing follows a specific number of frames, cutting to the next shot no matter what is happening within the image. This montage is used to elicit the most basal and emotional of reactions in the audience. Example

59 Methods of Montage 59 Rhythmic Cutting based on time -- along with a change in the speed of the metric cuts -- to induce more complex meanings than what is possible with metric montage. Once sound was introduced, rhythmic montage also included aural elements (music, dialogue, sounds). Example

60 Methods of Montage 60 Tonal A tonal montage uses the emotional meaning of the shots -- not just manipulating the temporal length of the cuts or its rhythmical characteristics -- to elicit a reaction from the audience even more complex than from the metric or rhythmic montage. For example, a sleeping baby would emote calmness and relaxation. Example: This is the clip following the death of the revolutionary sailor Vakulinchuk, a martyr for sailors and workers.

61 Methods of Montage 61 Overtonal/Associational The overtonal montage is the accumulation of metric, rhythmic, and tonal montage to synthesize its affect on the audience for an even more abstract and complicated effect. Example: In this clip, the men are workers walking towards a confrontation at their factory, and later in the movie, the protagonist uses ice as a means of escape.

62 Methods of Montage 62 Intellectual Uses shots which, combined, elicit an intellectual meaning Example: from Eisenstein's October and Strike. In Strike, a shot of striking workers being attacked cut with a shot of a bull being slaughtered creates a film metaphor suggesting that the workers are being treated like cattle. This meaning does not exist in the individual shots; it only arises when they are juxtaposed.

63 Next Week 63 Introduction of content-based image retrieval

A Robust Wipe Detection Algorithm

A Robust Wipe Detection Algorithm A Robust Wipe Detection Algorithm C. W. Ngo, T. C. Pong & R. T. Chin Department of Computer Science The Hong Kong University of Science & Technology Clear Water Bay, Kowloon, Hong Kong Email: fcwngo, tcpong,

More information

Scene Detection Media Mining I

Scene Detection Media Mining I Scene Detection Media Mining I Multimedia Computing, Universität Augsburg Rainer.Lienhart@informatik.uni-augsburg.de www.multimedia-computing.{de,org} Overview Hierarchical structure of video sequence

More information

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain Automatic Video Caption Detection and Extraction in the DCT Compressed Domain Chin-Fu Tsao 1, Yu-Hao Chen 1, Jin-Hau Kuo 1, Chia-wei Lin 1, and Ja-Ling Wu 1,2 1 Communication and Multimedia Laboratory,

More information

Video De-interlacing with Scene Change Detection Based on 3D Wavelet Transform

Video De-interlacing with Scene Change Detection Based on 3D Wavelet Transform Video De-interlacing with Scene Change Detection Based on 3D Wavelet Transform M. Nancy Regina 1, S. Caroline 2 PG Scholar, ECE, St. Xavier s Catholic College of Engineering, Nagercoil, India 1 Assistant

More information

Video Key-Frame Extraction using Entropy value as Global and Local Feature

Video Key-Frame Extraction using Entropy value as Global and Local Feature Video Key-Frame Extraction using Entropy value as Global and Local Feature Siddu. P Algur #1, Vivek. R *2 # Department of Information Science Engineering, B.V. Bhoomraddi College of Engineering and Technology

More information

Clustering Methods for Video Browsing and Annotation

Clustering Methods for Video Browsing and Annotation Clustering Methods for Video Browsing and Annotation Di Zhong, HongJiang Zhang 2 and Shih-Fu Chang* Institute of System Science, National University of Singapore Kent Ridge, Singapore 05 *Center for Telecommunication

More information

AUTOMATIC VIDEO INDEXING

AUTOMATIC VIDEO INDEXING AUTOMATIC VIDEO INDEXING Itxaso Bustos Maite Frutos TABLE OF CONTENTS Introduction Methods Key-frame extraction Automatic visual indexing Shot boundary detection Video OCR Index in motion Image processing

More information

Exemplar candidate work. Introduction

Exemplar candidate work. Introduction Exemplar candidate work Introduction OCR has produced these simulated candidate style answers to support teachers in interpreting the assessment criteria for the new Creative imedia specifications and

More information

Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features. Wei-Ta Chu

Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features. Wei-Ta Chu 1 Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features Wei-Ta Chu H.-H. Yeh, C.-Y. Yang, M.-S. Lee, and C.-S. Chen, Video Aesthetic Quality Assessment by Temporal

More information

Recall precision graph

Recall precision graph VIDEO SHOT BOUNDARY DETECTION USING SINGULAR VALUE DECOMPOSITION Λ Z.»CERNEKOVÁ, C. KOTROPOULOS AND I. PITAS Aristotle University of Thessaloniki Box 451, Thessaloniki 541 24, GREECE E-mail: (zuzana, costas,

More information

PixSO: A System for Video Shot Detection

PixSO: A System for Video Shot Detection PixSO: A System for Video Shot Detection Chengcui Zhang 1, Shu-Ching Chen 1, Mei-Ling Shyu 2 1 School of Computer Science, Florida International University, Miami, FL 33199, USA 2 Department of Electrical

More information

Computer Vision 5 Segmentation by Clustering

Computer Vision 5 Segmentation by Clustering Computer Vision 5 Segmentation by Clustering MAP-I Doctoral Programme Miguel Tavares Coimbra Outline Introduction Applications Simple clustering K-means clustering Graph-theoretic clustering Acknowledgements:

More information

MULTIVIEW REPRESENTATION OF 3D OBJECTS OF A SCENE USING VIDEO SEQUENCES

MULTIVIEW REPRESENTATION OF 3D OBJECTS OF A SCENE USING VIDEO SEQUENCES MULTIVIEW REPRESENTATION OF 3D OBJECTS OF A SCENE USING VIDEO SEQUENCES Mehran Yazdi and André Zaccarin CVSL, Dept. of Electrical and Computer Engineering, Laval University Ste-Foy, Québec GK 7P4, Canada

More information

Video shot segmentation using late fusion technique

Video shot segmentation using late fusion technique Video shot segmentation using late fusion technique by C. Krishna Mohan, N. Dhananjaya, B.Yegnanarayana in Proc. Seventh International Conference on Machine Learning and Applications, 2008, San Diego,

More information

16720 Computer Vision: Homework 3 Template Tracking and Layered Motion.

16720 Computer Vision: Homework 3 Template Tracking and Layered Motion. 16720 Computer Vision: Homework 3 Template Tracking and Layered Motion. Instructor: Martial Hebert TAs: Varun Ramakrishna and Tomas Simon Due Date: October 24 th, 2011. 1 Instructions You should submit

More information

Key Frame Extraction and Indexing for Multimedia Databases

Key Frame Extraction and Indexing for Multimedia Databases Key Frame Extraction and Indexing for Multimedia Databases Mohamed AhmedˆÃ Ahmed Karmouchˆ Suhayya Abu-Hakimaˆˆ ÃÃÃÃÃÃÈÃSchool of Information Technology & ˆˆÃ AmikaNow! Corporation Engineering (SITE),

More information

A Semi-Automatic 2D-to-3D Video Conversion with Adaptive Key-Frame Selection

A Semi-Automatic 2D-to-3D Video Conversion with Adaptive Key-Frame Selection A Semi-Automatic 2D-to-3D Video Conversion with Adaptive Key-Frame Selection Kuanyu Ju and Hongkai Xiong Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China ABSTRACT To

More information

CS 664 Segmentation. Daniel Huttenlocher

CS 664 Segmentation. Daniel Huttenlocher CS 664 Segmentation Daniel Huttenlocher Grouping Perceptual Organization Structural relationships between tokens Parallelism, symmetry, alignment Similarity of token properties Often strong psychophysical

More information

FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU

FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU 1. Introduction Face detection of human beings has garnered a lot of interest and research in recent years. There are quite a few relatively

More information

CS 534: Computer Vision Segmentation and Perceptual Grouping

CS 534: Computer Vision Segmentation and Perceptual Grouping CS 534: Computer Vision Segmentation and Perceptual Grouping Ahmed Elgammal Dept of Computer Science CS 534 Segmentation - 1 Outlines Mid-level vision What is segmentation Perceptual Grouping Segmentation

More information

Image Segmentation continued Graph Based Methods. Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book

Image Segmentation continued Graph Based Methods. Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book Image Segmentation continued Graph Based Methods Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book Previously Binary segmentation Segmentation by thresholding

More information

Story Unit Segmentation with Friendly Acoustic Perception *

Story Unit Segmentation with Friendly Acoustic Perception * Story Unit Segmentation with Friendly Acoustic Perception * Longchuan Yan 1,3, Jun Du 2, Qingming Huang 3, and Shuqiang Jiang 1 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing,

More information

WITH digital technology becoming inexpensive and popular,

WITH digital technology becoming inexpensive and popular, IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 7, NO. 6, DECEMBER 2005 1097 Detection and Representation of Scenes in Videos Zeeshan Rasheed and Mubarak Shah, Fellow, IEEE Abstract This paper presents a method

More information

AIIA shot boundary detection at TRECVID 2006

AIIA shot boundary detection at TRECVID 2006 AIIA shot boundary detection at TRECVID 6 Z. Černeková, N. Nikolaidis and I. Pitas Artificial Intelligence and Information Analysis Laboratory Department of Informatics Aristotle University of Thessaloniki

More information

Video scenes clustering based on representative shots

Video scenes clustering based on representative shots ISSN 746-7233, England, UK World Journal of Modelling and Simulation Vol., No. 2, 2005, pp. -6 Video scenes clustering based on representative shots Jun Ye, Jian-liang Li 2 +, C. M. Mak 3 Dept. Applied

More information

Rushes Video Segmentation Using Semantic Features

Rushes Video Segmentation Using Semantic Features Rushes Video Segmentation Using Semantic Features Athina Pappa, Vasileios Chasanis, and Antonis Ioannidis Department of Computer Science and Engineering, University of Ioannina, GR 45110, Ioannina, Greece

More information

Image Segmentation continued Graph Based Methods

Image Segmentation continued Graph Based Methods Image Segmentation continued Graph Based Methods Previously Images as graphs Fully-connected graph node (vertex) for every pixel link between every pair of pixels, p,q affinity weight w pq for each link

More information

Video Analysis for Browsing and Printing

Video Analysis for Browsing and Printing Video Analysis for Browsing and Printing Qian Lin, Tong Zhang, Mei Chen, Yining Deng, Brian Atkins HP Laboratories HPL-2008-215 Keyword(s): video mining, video printing, user intent, video panorama, video

More information

Using Cross-Media Correlation for Scene Detection in Travel Videos

Using Cross-Media Correlation for Scene Detection in Travel Videos Using Cross-Media Correlation for Scene Detection in Travel Videos Wei-Ta Chu Department of CSIE National Chung Cheng University Taiwan wtchu@cs.ccu.edu.tw Che-Cheng Lin Department of CSIE National Chung

More information

Applications. Foreground / background segmentation Finding skin-colored regions. Finding the moving objects. Intelligent scissors

Applications. Foreground / background segmentation Finding skin-colored regions. Finding the moving objects. Intelligent scissors Segmentation I Goal Separate image into coherent regions Berkeley segmentation database: http://www.eecs.berkeley.edu/research/projects/cs/vision/grouping/segbench/ Slide by L. Lazebnik Applications Intelligent

More information

Production of Video Images by Computer Controlled Cameras and Its Application to TV Conference System

Production of Video Images by Computer Controlled Cameras and Its Application to TV Conference System Proc. of IEEE Conference on Computer Vision and Pattern Recognition, vol.2, II-131 II-137, Dec. 2001. Production of Video Images by Computer Controlled Cameras and Its Application to TV Conference System

More information

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami to MPEG Prof. Pratikgiri Goswami Electronics & Communication Department, Shree Swami Atmanand Saraswati Institute of Technology, Surat. Outline of Topics 1 2 Coding 3 Video Object Representation Outline

More information

Comparison of Sequence Matching Techniques for Video Copy Detection

Comparison of Sequence Matching Techniques for Video Copy Detection Comparison of Sequence Matching Techniques for Video Copy Detection Arun Hampapur a, Ki-Ho Hyun b and Ruud Bolle a a IBM T.J Watson Research Center, 3 Saw Mill River Road, Hawthorne, NY 1532, USA b School

More information

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VI (Nov Dec. 2014), PP 29-33 Analysis of Image and Video Using Color, Texture and Shape Features

More information

NeTra-V: Towards an Object-based Video Representation

NeTra-V: Towards an Object-based Video Representation Proc. of SPIE, Storage and Retrieval for Image and Video Databases VI, vol. 3312, pp 202-213, 1998 NeTra-V: Towards an Object-based Video Representation Yining Deng, Debargha Mukherjee and B. S. Manjunath

More information

AN ACCELERATED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION

AN ACCELERATED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION AN ACCELERATED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION 1 SEYED MOJTABA TAFAGHOD SADAT ZADEH, 1 ALIREZA MEHRSINA, 2 MINA BASIRAT, 1 Faculty of Computer Science and Information Systems, Universiti

More information

Semantic Extraction and Semantics-based Annotation and Retrieval for Video Databases

Semantic Extraction and Semantics-based Annotation and Retrieval for Video Databases Semantic Extraction and Semantics-based Annotation and Retrieval for Video Databases Yan Liu (liuyan@cs.columbia.edu) and Fei Li (fl200@cs.columbia.edu) Department of Computer Science, Columbia University

More information

Color Local Texture Features Based Face Recognition

Color Local Texture Features Based Face Recognition Color Local Texture Features Based Face Recognition Priyanka V. Bankar Department of Electronics and Communication Engineering SKN Sinhgad College of Engineering, Korti, Pandharpur, Maharashtra, India

More information

Movies Segmentation into Scenes and Chapters Using Locally Weighted Bag of Visual Words

Movies Segmentation into Scenes and Chapters Using Locally Weighted Bag of Visual Words Movies Segmentation into Scenes and Chapters Using Locally Weighted Bag of Visual Words Vasileios Chasanis Department of Computer Science University of Ioannina GR 45110 vchasani@cs.uoi.gr Argyris Kalogeratos

More information

Scene Segmentation in Artistic Archive Documentaries

Scene Segmentation in Artistic Archive Documentaries Scene Segmentation in Artistic Archive Documentaries Dalibor Mitrović, Stefan Hartlieb, Matthias Zeppelzauer, Maia Zaharieva Interactive Media Systems Group Vienna University of Technology Favoritenstrasse

More information

Region Feature Based Similarity Searching of Semantic Video Objects

Region Feature Based Similarity Searching of Semantic Video Objects Region Feature Based Similarity Searching of Semantic Video Objects Di Zhong and Shih-Fu hang Image and dvanced TV Lab, Department of Electrical Engineering olumbia University, New York, NY 10027, US {dzhong,

More information

Problem Definition. Clustering nonlinearly separable data:

Problem Definition. Clustering nonlinearly separable data: Outlines Weighted Graph Cuts without Eigenvectors: A Multilevel Approach (PAMI 2007) User-Guided Large Attributed Graph Clustering with Multiple Sparse Annotations (PAKDD 2016) Problem Definition Clustering

More information

Introduction to Medical Imaging (5XSA0) Module 5

Introduction to Medical Imaging (5XSA0) Module 5 Introduction to Medical Imaging (5XSA0) Module 5 Segmentation Jungong Han, Dirk Farin, Sveta Zinger ( s.zinger@tue.nl ) 1 Outline Introduction Color Segmentation region-growing region-merging watershed

More information

CHAPTER 3 SHOT DETECTION AND KEY FRAME EXTRACTION

CHAPTER 3 SHOT DETECTION AND KEY FRAME EXTRACTION 33 CHAPTER 3 SHOT DETECTION AND KEY FRAME EXTRACTION 3.1 INTRODUCTION The twenty-first century is an age of information explosion. We are witnessing a huge growth in digital data. The trend of increasing

More information

Navidgator. Similarity Based Browsing for Image & Video Databases. Damian Borth, Christian Schulze, Adrian Ulges, Thomas M. Breuel

Navidgator. Similarity Based Browsing for Image & Video Databases. Damian Borth, Christian Schulze, Adrian Ulges, Thomas M. Breuel Navidgator Similarity Based Browsing for Image & Video Databases Damian Borth, Christian Schulze, Adrian Ulges, Thomas M. Breuel Image Understanding and Pattern Recognition DFKI & TU Kaiserslautern 25.Sep.2008

More information

Movie Segmentation into Scenes and Chapters Using Locally Weighted Bag of Visual Words

Movie Segmentation into Scenes and Chapters Using Locally Weighted Bag of Visual Words Movie Segmentation into Scenes and Chapters Using Locally Weighted Bag of Visual Words Vasileios Chasanis Department of Computer Science University of Ioannina GR 45110 vchasani@cs.uoi.gr Argyris Kalogeratos

More information

Key Frame Extraction using Faber-Schauder Wavelet

Key Frame Extraction using Faber-Schauder Wavelet Key Frame Extraction using Faber-Schauder Wavelet ASSMA AZEROUAL Computer Systems and Vision Laboratory assma.azeroual@edu.uiz.ac.ma KARIM AFDEL Computer Systems and Vision Laboratory kafdel@ymail.com

More information

Texture Segmentation by Windowed Projection

Texture Segmentation by Windowed Projection Texture Segmentation by Windowed Projection 1, 2 Fan-Chen Tseng, 2 Ching-Chi Hsu, 2 Chiou-Shann Fuh 1 Department of Electronic Engineering National I-Lan Institute of Technology e-mail : fctseng@ccmail.ilantech.edu.tw

More information

Object Detection System

Object Detection System A Trainable View-Based Object Detection System Thesis Proposal Henry A. Rowley Thesis Committee: Takeo Kanade, Chair Shumeet Baluja Dean Pomerleau Manuela Veloso Tomaso Poggio, MIT Motivation Object detection

More information

Learning video saliency from human gaze using candidate selection

Learning video saliency from human gaze using candidate selection Learning video saliency from human gaze using candidate selection Rudoy, Goldman, Shechtman, Zelnik-Manor CVPR 2013 Paper presentation by Ashish Bora Outline What is saliency? Image vs video Candidates

More information

The Detection of Faces in Color Images: EE368 Project Report

The Detection of Faces in Color Images: EE368 Project Report The Detection of Faces in Color Images: EE368 Project Report Angela Chau, Ezinne Oji, Jeff Walters Dept. of Electrical Engineering Stanford University Stanford, CA 9435 angichau,ezinne,jwalt@stanford.edu

More information

Progress Report of Final Year Project

Progress Report of Final Year Project Progress Report of Final Year Project Project Title: Design and implement a face-tracking engine for video William O Grady 08339937 Electronic and Computer Engineering, College of Engineering and Informatics,

More information

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Mobile Human Detection Systems based on Sliding Windows Approach-A Review Mobile Human Detection Systems based on Sliding Windows Approach-A Review Seminar: Mobile Human detection systems Njieutcheu Tassi cedrique Rovile Department of Computer Engineering University of Heidelberg

More information

Bus Detection and recognition for visually impaired people

Bus Detection and recognition for visually impaired people Bus Detection and recognition for visually impaired people Hangrong Pan, Chucai Yi, and Yingli Tian The City College of New York The Graduate Center The City University of New York MAP4VIP Outline Motivation

More information

CS145: INTRODUCTION TO DATA MINING

CS145: INTRODUCTION TO DATA MINING CS145: INTRODUCTION TO DATA MINING 08: Classification Evaluation and Practical Issues Instructor: Yizhou Sun yzsun@cs.ucla.edu October 24, 2017 Learnt Prediction and Classification Methods Vector Data

More information

Chapter 9 Object Tracking an Overview

Chapter 9 Object Tracking an Overview Chapter 9 Object Tracking an Overview The output of the background subtraction algorithm, described in the previous chapter, is a classification (segmentation) of pixels into foreground pixels (those belonging

More information

A Rapid Scheme for Slow-Motion Replay Segment Detection

A Rapid Scheme for Slow-Motion Replay Segment Detection A Rapid Scheme for Slow-Motion Replay Segment Detection Wei-Hong Chuang, Dun-Yu Hsiao, Soo-Chang Pei, and Homer Chen Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan 10617,

More information

Computable scenes and structures in Films

Computable scenes and structures in Films Computable scenes and structures in Films Hari Sundaram and Shih-Fu Chang Email: {sundaram, sfchang}@ee.columbia.edu Dept. Of Electrical Engineering, Columbia University, New York, New York 10027. Abstract

More information

CAP 6412 Advanced Computer Vision

CAP 6412 Advanced Computer Vision CAP 6412 Advanced Computer Vision http://www.cs.ucf.edu/~bgong/cap6412.html Boqing Gong April 21st, 2016 Today Administrivia Free parameters in an approach, model, or algorithm? Egocentric videos by Aisha

More information

Real-time Monitoring System for TV Commercials Using Video Features

Real-time Monitoring System for TV Commercials Using Video Features Real-time Monitoring System for TV Commercials Using Video Features Sung Hwan Lee, Won Young Yoo, and Young-Suk Yoon Electronics and Telecommunications Research Institute (ETRI), 11 Gajeong-dong, Yuseong-gu,

More information

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation IJCSNS International Journal of Computer Science and Network Security, VOL.13 No.11, November 2013 1 Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial

More information

STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES

STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES 25-29 JATIT. All rights reserved. STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES DR.S.V.KASMIR RAJA, 2 A.SHAIK ABDUL KHADIR, 3 DR.S.S.RIAZ AHAMED. Dean (Research),

More information

Highlights Extraction from Unscripted Video

Highlights Extraction from Unscripted Video Highlights Extraction from Unscripted Video T 61.6030, Multimedia Retrieval Seminar presentation 04.04.2008 Harrison Mfula Helsinki University of Technology Department of Computer Science, Espoo, Finland

More information

Cs : Computer Vision Final Project Report

Cs : Computer Vision Final Project Report Cs 600.461: Computer Vision Final Project Report Giancarlo Troni gtroni@jhu.edu Raphael Sznitman sznitman@jhu.edu Abstract Given a Youtube video of a busy street intersection, our task is to detect, track,

More information

Elimination of Duplicate Videos in Video Sharing Sites

Elimination of Duplicate Videos in Video Sharing Sites Elimination of Duplicate Videos in Video Sharing Sites Narendra Kumar S, Murugan S, Krishnaveni R Abstract - In some social video networking sites such as YouTube, there exists large numbers of duplicate

More information

Automatic Colorization of Grayscale Images

Automatic Colorization of Grayscale Images Automatic Colorization of Grayscale Images Austin Sousa Rasoul Kabirzadeh Patrick Blaes Department of Electrical Engineering, Stanford University 1 Introduction ere exists a wealth of photographic images,

More information

Iterative Image Based Video Summarization by Node Segmentation

Iterative Image Based Video Summarization by Node Segmentation Iterative Image Based Video Summarization by Node Segmentation Nalini Vasudevan Arjun Jain Himanshu Agrawal Abstract In this paper, we propose a simple video summarization system based on removal of similar

More information

Scene Change Detection Based on Twice Difference of Luminance Histograms

Scene Change Detection Based on Twice Difference of Luminance Histograms Scene Change Detection Based on Twice Difference of Luminance Histograms Xinying Wang 1, K.N.Plataniotis 2, A. N. Venetsanopoulos 1 1 Department of Electrical & Computer Engineering University of Toronto

More information

Visual Information Systems. A Course Project On Trailer Classification. Shreeharsh Kelkar

Visual Information Systems. A Course Project On Trailer Classification. Shreeharsh Kelkar Visual Information Systems A Course Project On Trailer Classification Shreeharsh Kelkar sak2010@columbia.edu 1 1.1 Goal of This Project The goal of this project is not just trailer classification, but

More information

Multimedia Databases. 9 Video Retrieval. 9.1 Hidden Markov Model. 9.1 Hidden Markov Model. 9.1 Evaluation. 9.1 HMM Example 12/18/2009

Multimedia Databases. 9 Video Retrieval. 9.1 Hidden Markov Model. 9.1 Hidden Markov Model. 9.1 Evaluation. 9.1 HMM Example 12/18/2009 9 Video Retrieval Multimedia Databases 9 Video Retrieval 9.1 Hidden Markov Models (continued from last lecture) 9.2 Introduction into Video Retrieval Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme

More information

Clustering Algorithms for general similarity measures

Clustering Algorithms for general similarity measures Types of general clustering methods Clustering Algorithms for general similarity measures general similarity measure: specified by object X object similarity matrix 1 constructive algorithms agglomerative

More information

Object Extraction Using Image Segmentation and Adaptive Constraint Propagation

Object Extraction Using Image Segmentation and Adaptive Constraint Propagation Object Extraction Using Image Segmentation and Adaptive Constraint Propagation 1 Rajeshwary Patel, 2 Swarndeep Saket 1 Student, 2 Assistant Professor 1 2 Department of Computer Engineering, 1 2 L. J. Institutes

More information

EE 701 ROBOT VISION. Segmentation

EE 701 ROBOT VISION. Segmentation EE 701 ROBOT VISION Regions and Image Segmentation Histogram-based Segmentation Automatic Thresholding K-means Clustering Spatial Coherence Merging and Splitting Graph Theoretic Segmentation Region Growing

More information

Image Segmentation. Selim Aksoy. Bilkent University

Image Segmentation. Selim Aksoy. Bilkent University Image Segmentation Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Examples of grouping in vision [http://poseidon.csd.auth.gr/lab_research/latest/imgs/s peakdepvidindex_img2.jpg]

More information

Image Segmentation. Selim Aksoy. Bilkent University

Image Segmentation. Selim Aksoy. Bilkent University Image Segmentation Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Examples of grouping in vision [http://poseidon.csd.auth.gr/lab_research/latest/imgs/s peakdepvidindex_img2.jpg]

More information

Classification of Face Images for Gender, Age, Facial Expression, and Identity 1

Classification of Face Images for Gender, Age, Facial Expression, and Identity 1 Proc. Int. Conf. on Artificial Neural Networks (ICANN 05), Warsaw, LNCS 3696, vol. I, pp. 569-574, Springer Verlag 2005 Classification of Face Images for Gender, Age, Facial Expression, and Identity 1

More information

Searching Video Collections:Part I

Searching Video Collections:Part I Searching Video Collections:Part I Introduction to Multimedia Information Retrieval Multimedia Representation Visual Features (Still Images and Image Sequences) Color Texture Shape Edges Objects, Motion

More information

Hierarchical Segmentation of Videos into Shots and Scenes using Visual Content

Hierarchical Segmentation of Videos into Shots and Scenes using Visual Content Hierarchical Segmentation of Videos into Shots and Scenes using Visual Content prepared by Andrew Thompson supervised by Robert Laganière and Pierre Payeur Thesis submitted to the Faculty of Graduate and

More information

NOVEL APPROACH TO CONTENT-BASED VIDEO INDEXING AND RETRIEVAL BY USING A MEASURE OF STRUCTURAL SIMILARITY OF FRAMES. David Asatryan, Manuk Zakaryan

NOVEL APPROACH TO CONTENT-BASED VIDEO INDEXING AND RETRIEVAL BY USING A MEASURE OF STRUCTURAL SIMILARITY OF FRAMES. David Asatryan, Manuk Zakaryan International Journal "Information Content and Processing", Volume 2, Number 1, 2015 71 NOVEL APPROACH TO CONTENT-BASED VIDEO INDEXING AND RETRIEVAL BY USING A MEASURE OF STRUCTURAL SIMILARITY OF FRAMES

More information

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM 1 PHYO THET KHIN, 2 LAI LAI WIN KYI 1,2 Department of Information Technology, Mandalay Technological University The Republic of the Union of Myanmar

More information

2D image segmentation based on spatial coherence

2D image segmentation based on spatial coherence 2D image segmentation based on spatial coherence Václav Hlaváč Czech Technical University in Prague Center for Machine Perception (bridging groups of the) Czech Institute of Informatics, Robotics and Cybernetics

More information

Color Image Segmentation

Color Image Segmentation Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.

More information

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Partha Sarathi Giri Department of Electronics and Communication, M.E.M.S, Balasore, Odisha Abstract Text data

More information

Dynamic skin detection in color images for sign language recognition

Dynamic skin detection in color images for sign language recognition Dynamic skin detection in color images for sign language recognition Michal Kawulok Institute of Computer Science, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland michal.kawulok@polsl.pl

More information

Video Scene Segmentation Using Markov Chain Monte Carlo

Video Scene Segmentation Using Markov Chain Monte Carlo IEEE TRANSACTIONS ON MULTIMEDIA, VOL. X, NO. Y, 005 Video Scene Segmentation Using Markov Chain Monte Carlo Yun Zhai, Student Member, IEEE, and Mubarak Shah, Fellow, IEEE Abstract Videos are composed of

More information

Integration of Global and Local Information in Videos for Key Frame Extraction

Integration of Global and Local Information in Videos for Key Frame Extraction Integration of Global and Local Information in Videos for Key Frame Extraction Dianting Liu 1, Mei-Ling Shyu 1, Chao Chen 1, Shu-Ching Chen 2 1 Department of Electrical and Computer Engineering University

More information

ECG782: Multidimensional Digital Signal Processing

ECG782: Multidimensional Digital Signal Processing Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu ECG782: Multidimensional Digital Signal Processing Spring 2014 TTh 14:30-15:45 CBC C313 Lecture 10 Segmentation 14/02/27 http://www.ee.unlv.edu/~b1morris/ecg782/

More information

A Geometrical Key-frame Selection Method exploiting Dominant Motion Estimation in Video

A Geometrical Key-frame Selection Method exploiting Dominant Motion Estimation in Video A Geometrical Key-frame Selection Method exploiting Dominant Motion Estimation in Video Brigitte Fauvet, Patrick Bouthemy, Patrick Gros 2 and Fabien Spindler IRISA/INRIA 2 IRISA/CNRS Campus Universitaire

More information

Content-based Image and Video Retrieval. Image Segmentation

Content-based Image and Video Retrieval. Image Segmentation Content-based Image and Video Retrieval Vorlesung, SS 2011 Image Segmentation 2.5.2011 / 9.5.2011 Image Segmentation One of the key problem in computer vision Identification of homogenous region in the

More information

Face Detection Using Color Based Segmentation and Morphological Processing A Case Study

Face Detection Using Color Based Segmentation and Morphological Processing A Case Study Face Detection Using Color Based Segmentation and Morphological Processing A Case Study Dr. Arti Khaparde*, Sowmya Reddy.Y Swetha Ravipudi *Professor of ECE, Bharath Institute of Science and Technology

More information

Types of general clustering methods. Clustering Algorithms for general similarity measures. Similarity between clusters

Types of general clustering methods. Clustering Algorithms for general similarity measures. Similarity between clusters Types of general clustering methods Clustering Algorithms for general similarity measures agglomerative versus divisive algorithms agglomerative = bottom-up build up clusters from single objects divisive

More information

Fast Face Detection Assisted with Skin Color Detection

Fast Face Detection Assisted with Skin Color Detection IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 4, Ver. II (Jul.-Aug. 2016), PP 70-76 www.iosrjournals.org Fast Face Detection Assisted with Skin Color

More information

VIDEO OBJECT SEGMENTATION BY EXTENDED RECURSIVE-SHORTEST-SPANNING-TREE METHOD. Ertem Tuncel and Levent Onural

VIDEO OBJECT SEGMENTATION BY EXTENDED RECURSIVE-SHORTEST-SPANNING-TREE METHOD. Ertem Tuncel and Levent Onural VIDEO OBJECT SEGMENTATION BY EXTENDED RECURSIVE-SHORTEST-SPANNING-TREE METHOD Ertem Tuncel and Levent Onural Electrical and Electronics Engineering Department, Bilkent University, TR-06533, Ankara, Turkey

More information

CS249: ADVANCED DATA MINING

CS249: ADVANCED DATA MINING CS249: ADVANCED DATA MINING Classification Evaluation and Practical Issues Instructor: Yizhou Sun yzsun@cs.ucla.edu April 24, 2017 Homework 2 out Announcements Due May 3 rd (11:59pm) Course project proposal

More information

Motion Detection. Final project by. Neta Sokolovsky

Motion Detection. Final project by. Neta Sokolovsky Motion Detection Final project by Neta Sokolovsky Introduction The goal of this project is to recognize a motion of objects found in the two given images. This functionality is useful in the video processing

More information

Object of interest discovery in video sequences

Object of interest discovery in video sequences Object of interest discovery in video sequences A Design Project Report Presented to Engineering Division of the Graduate School Of Cornell University In Partial Fulfillment of the Requirements for the

More information

Action recognition in videos

Action recognition in videos Action recognition in videos Cordelia Schmid INRIA Grenoble Joint work with V. Ferrari, A. Gaidon, Z. Harchaoui, A. Klaeser, A. Prest, H. Wang Action recognition - goal Short actions, i.e. drinking, sit

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

A Framework for Sub-Window Shot Detection

A Framework for Sub-Window Shot Detection A Framework for Sub-Window Shot Detection Chuohao Yeo, Yong-Wei Zhu 2, Qibin Sun 3, Shih-Fu Chang 4,2,3 Institute for Infocomm Research, Agency for Science, Technology and Research, Singapore 4 Department

More information

Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig

Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig Multimedia Databases Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Previous Lecture Audio Retrieval - Query by Humming

More information