GIST. GPU Implementation. Prakhar Jain ( ) Ejaz Ahmed ( ) 3 rd May, 2009

Size: px
Start display at page:

Download "GIST. GPU Implementation. Prakhar Jain ( ) Ejaz Ahmed ( ) 3 rd May, 2009"

Transcription

1 GIST GPU Implementation Prakhar Jain ( ) Ejaz Ahmed ( ) 3 rd May, 2009 International Institute Of Information Technology, Hyderabad

2 Table of Contents S. No. Topic Page No. 1 Abstract 3 2 Introduction 4 3 Basic Algorithm 4 4 Parallelization 4 5 Getting Descriptor 5 6 Graphs 8 7 Speed Up 9 8 Precision / Accuracy 9 9 Related work Refrences 9 Page 2 of 10

3 ABSTRACT GIST is a computational model of the recognition of real world scenes that bypasses the segmentation and the processing of individual objects or regions. The procedure is based on a very low dimensional representation of the scene called the Spatial Envelope. Torralba proposed a set of perceptual dimensions (naturalness, openness, roughness, expansion, ruggedness) that represent the dominant spatial structure of a scene. Then, he has shown that these dimensions may be reliably estimated using spectral and coarsely localized information. The model generates a multidimensional space in which scenes sharing membership in semantic categories (e.g., streets, highways, coasts) are projected closed together. The performance of the spatial envelope model shows that specific information about object shape or identity is not a requirement for scene categorization and that modeling a holistic representation of the scene informs about its probable semantic category. The implementation the authors have given is in Matlab which runs on the CPU.The objective of this project is to parallelize it using GPGPU ( CUDA ). Page 3 of 10

4 Introduction Many of the Computer Vision algorthims are computationally expensive.algorithms such as Adaboost training, SIFT Feature training, SFM etc require weeks to execute. But, these algorithms are inherently parallelizable. In this project we have parallelized the GIST feature exploiting the computation power of the GPU with the help of Nvidia s CUDA. The rest of the report descrives the Basic algorithm, flowchart and our approach to parallelize it. We conclude it with some results, speed ups and graphs to support the performance claims. BASIC ALGORITHM Create Gabor filter bank Create a Parameter Matrix Create transfer functions Preprocess Image Low contrast normalization Local luminance variance normalization Getting the descriptor For each filter Convolve image with filter Divide the image in blocks Mean of each block is the corresponding feature PARALLELIZATION Pixel level parallelization. Use one thread to calculate the value of one element of a Gabor filter. Number of threads will be equal to number of filters * size_of_image * size_of_image Preprocessing the image Page 4 of 10

5 o Fourier transform * gaussian ( element by element ) o Pixel level parallelization due to inter pixel independence. Getting the Descriptor:-> For calculating the actual descriptors three things are required:- Preprocessing of image to obtain normalized image suitable for calculating features. Calculation of Gabor filters for every orientation and scale. Number of bocks in which the image should be devided. Steps For Getting the Descriptor:- 1. Getting the Fourier Transform of an image:- For calculating the features the processing is done in frequency domain.therefore the image needs to be converted from spacial to frequency domain. Image R G B components R G B Components Spatial Domain Spatial Domain Frequency Domain Fourier Transform of each component of image is calculated using CUFFT Library's function, which is 2 times as fast as CPU implementation of FFT. 2. Applying the Filters on Images:- Number of scales Ns Number of Orientation Per Scale No Total Number of Filters Nf=Ns * No Number of Channels 3 Page 5 of 10

6 Each filter is applied to each channel of image to give Nf*3 images.this is done on GPU with the help of following Kernels Element Multiplication:- Number of Threads Created = ImageSize * imagesize*nf R G B image components in Frquency Domain Nf Number of Filters (i) Element Multiplication Result Of Element Multiplication in Frequency domain. Taking Inverse FFT of Nf * 3 Images :- Inverse Fourier Transform of Nf * 3 images is calculated using CUFFT Library's function, which is 2 times as fast as CPU implementation of IFFT. Taking Absolute of Every image :- For this absolute kernel is used. Total Number of threads created = ImageSize * ImageSize * Nf * 3. Page 6 of 10

7 3. Getting The Features:- For every Nf * 3 images (obtained from prev step) Devide the image in blocks. Mean Value of each block is the one feature. mean value of each block is one feature For calculating mean CUDPP's SEGMENTED SCAN is Used. Page 7 of 10

8 Graphs Page 8 of 10

9 SPEED UP Precision / Accuracy The values are found to be accurate upto 4 places of decimal when compared to results of Torralba s Matlab code. Related Work and Refrences Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope Aude Oliva, Antonio Torralba International Journal of Computer Vision, Vol. 42(3): , CPU Implementation: Segmented Scan: pub lic_interface.html Page 9 of 10

10 Page 10 of 10

Visual localization using global visual features and vanishing points

Visual localization using global visual features and vanishing points Visual localization using global visual features and vanishing points Olivier Saurer, Friedrich Fraundorfer, and Marc Pollefeys Computer Vision and Geometry Group, ETH Zürich, Switzerland {saurero,fraundorfer,marc.pollefeys}@inf.ethz.ch

More information

Statistics of Natural Image Categories

Statistics of Natural Image Categories Statistics of Natural Image Categories Authors: Antonio Torralba and Aude Oliva Presented by: Sebastian Scherer Experiment Please estimate the average depth from the camera viewpoint to all locations(pixels)

More information

Tag Recommendation for Photos

Tag Recommendation for Photos Tag Recommendation for Photos Gowtham Kumar Ramani, Rahul Batra, Tripti Assudani December 10, 2009 Abstract. We present a real-time recommendation system for photo annotation that can be used in Flickr.

More information

ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING. Wei Liu, Serkan Kiranyaz and Moncef Gabbouj

ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING. Wei Liu, Serkan Kiranyaz and Moncef Gabbouj Proceedings of the 5th International Symposium on Communications, Control and Signal Processing, ISCCSP 2012, Rome, Italy, 2-4 May 2012 ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING

More information

Large-Scale Scene Classification Using Gist Feature

Large-Scale Scene Classification Using Gist Feature Large-Scale Scene Classification Using Gist Feature Reza Fuad Rachmadi, I Ketut Eddy Purnama Telematics Laboratory Department of Multimedia and Networking Engineering Institut Teknologi Sepuluh Nopember

More information

Scene Recognition using Bag-of-Words

Scene Recognition using Bag-of-Words Scene Recognition using Bag-of-Words Sarthak Ahuja B.Tech Computer Science Indraprastha Institute of Information Technology Okhla, Delhi 110020 Email: sarthak12088@iiitd.ac.in Anchita Goel B.Tech Computer

More information

Computer Vision for VLFeat and more...

Computer Vision for VLFeat and more... Computer Vision for VLFeat and more... Holistic Methods Francisco Escolano, PhD Associate Professor University of Alicante, Spain Contents PCA/Karhunen-Loeve (slides appart) GIST and Spatial Evelope Image

More information

LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS

LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS 8th International DAAAM Baltic Conference "INDUSTRIAL ENGINEERING - 19-21 April 2012, Tallinn, Estonia LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS Shvarts, D. & Tamre, M. Abstract: The

More information

Recognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials

Recognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials Recognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials Yuanjun Xiong 1 Kai Zhu 1 Dahua Lin 1 Xiaoou Tang 1,2 1 Department of Information Engineering, The Chinese University

More information

Templates, Image Pyramids, and Filter Banks

Templates, Image Pyramids, and Filter Banks Templates, Image Pyramids, and Filter Banks Computer Vision James Hays, Brown Slides: Hoiem and others Reminder Project due Friday Fourier Bases Teases away fast vs. slow changes in the image. This change

More information

Evaluation of GIST descriptors for web scale image search

Evaluation of GIST descriptors for web scale image search Evaluation of GIST descriptors for web scale image search Matthijs Douze Hervé Jégou, Harsimrat Sandhawalia, Laurent Amsaleg and Cordelia Schmid INRIA Grenoble, France July 9, 2009 Evaluation of GIST for

More information

Content Based Image Retrieval

Content Based Image Retrieval Content Based Image Retrieval R. Venkatesh Babu Outline What is CBIR Approaches Features for content based image retrieval Global Local Hybrid Similarity measure Trtaditional Image Retrieval Traditional

More information

Contextual priming for artificial visual perception

Contextual priming for artificial visual perception Contextual priming for artificial visual perception Hervé Guillaume 1, Nathalie Denquive 1, Philippe Tarroux 1,2 1 LIMSI-CNRS BP 133 F-91403 Orsay cedex France 2 ENS 45 rue d Ulm F-75230 Paris cedex 05

More information

Unsupervised Deep Learning for Scene Recognition

Unsupervised Deep Learning for Scene Recognition Unsupervised Deep Learning for Scene Recognition Akram Helou and Chau Nguyen May 19, 2011 1 Introduction Object and scene recognition are usually studied separately. However, research [2]shows that context

More information

High performance 2D Discrete Fourier Transform on Heterogeneous Platforms. Shrenik Lad, IIIT Hyderabad Advisor : Dr. Kishore Kothapalli

High performance 2D Discrete Fourier Transform on Heterogeneous Platforms. Shrenik Lad, IIIT Hyderabad Advisor : Dr. Kishore Kothapalli High performance 2D Discrete Fourier Transform on Heterogeneous Platforms Shrenik Lad, IIIT Hyderabad Advisor : Dr. Kishore Kothapalli Motivation Fourier Transform widely used in Physics, Astronomy, Engineering

More information

Implementing a Speech Recognition System on a GPU using CUDA. Presented by Omid Talakoub Astrid Yi

Implementing a Speech Recognition System on a GPU using CUDA. Presented by Omid Talakoub Astrid Yi Implementing a Speech Recognition System on a GPU using CUDA Presented by Omid Talakoub Astrid Yi Outline Background Motivation Speech recognition algorithm Implementation steps GPU implementation strategies

More information

Scene-Centered Description from Spatial Envelope Properties

Scene-Centered Description from Spatial Envelope Properties Scene-Centered Description from Spatial Envelope Properties Aude Oliva 1 and Antonio Torralba 2 1 Department of Psychology and Cognitive Science Program Michigan State University, East Lansing, MI 48824,

More information

Every Picture Tells a Story: Generating Sentences from Images

Every Picture Tells a Story: Generating Sentences from Images Every Picture Tells a Story: Generating Sentences from Images Ali Farhadi, Mohsen Hejrati, Mohammad Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, David Forsyth University of Illinois

More information

Lecture 2: 2D Fourier transforms and applications

Lecture 2: 2D Fourier transforms and applications Lecture 2: 2D Fourier transforms and applications B14 Image Analysis Michaelmas 2017 Dr. M. Fallon Fourier transforms and spatial frequencies in 2D Definition and meaning The Convolution Theorem Applications

More information

Digital Image Processing. Image Enhancement in the Frequency Domain

Digital Image Processing. Image Enhancement in the Frequency Domain Digital Image Processing Image Enhancement in the Frequency Domain Topics Frequency Domain Enhancements Fourier Transform Convolution High Pass Filtering in Frequency Domain Low Pass Filtering in Frequency

More information

International Journal of Computer Science and Network (IJCSN) Volume 1, Issue 4, August ISSN

International Journal of Computer Science and Network (IJCSN) Volume 1, Issue 4, August ISSN Accelerating MATLAB Applications on Parallel Hardware 1 Kavita Chauhan, 2 Javed Ashraf 1 NGFCET, M.D.University Palwal,Haryana,India Page 80 2 AFSET, M.D.University Dhauj,Haryana,India Abstract MATLAB

More information

Visual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space.

Visual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space. Visual words Map high-dimensional descriptors to tokens/words by quantizing the feature space. Quantize via clustering; cluster centers are the visual words Word #2 Descriptor feature space Assign word

More information

Visual Object Recognition

Visual Object Recognition Visual Object Recognition Lecture 3: Descriptors Per-Erik Forssén, docent Computer Vision Laboratory Department of Electrical Engineering Linköping University 2015 2014 Per-Erik Forssén Lecture 3: Descriptors

More information

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009 Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer

More information

Periocular Biometrics: When Iris Recognition Fails

Periocular Biometrics: When Iris Recognition Fails Periocular Biometrics: When Iris Recognition Fails Samarth Bharadwaj, Himanshu S. Bhatt, Mayank Vatsa and Richa Singh Abstract The performance of iris recognition is affected if iris is captured at a distance.

More information

Human detection solution for a retail store environment

Human detection solution for a retail store environment FACULDADE DE ENGENHARIA DA UNIVERSIDADE DO PORTO Human detection solution for a retail store environment Vítor Araújo PREPARATION OF THE MSC DISSERTATION Mestrado Integrado em Engenharia Eletrotécnica

More information

Using the Forest to See the Trees: Context-based Object Recognition

Using the Forest to See the Trees: Context-based Object Recognition Using the Forest to See the Trees: Context-based Object Recognition Bill Freeman Joint work with Antonio Torralba and Kevin Murphy Computer Science and Artificial Intelligence Laboratory MIT A computer

More information

Spatial Hierarchy of Textons Distributions for Scene Classification

Spatial Hierarchy of Textons Distributions for Scene Classification Spatial Hierarchy of Textons Distributions for Scene Classification S. Battiato 1, G. M. Farinella 1, G. Gallo 1, and D. Ravì 1 Image Processing Laboratory, University of Catania, IT {battiato, gfarinella,

More information

Image processing in frequency Domain

Image processing in frequency Domain Image processing in frequency Domain Introduction to Frequency Domain Deal with images in: -Spatial domain -Frequency domain Frequency Domain In the frequency or Fourier domain, the value and location

More information

A Scene Recognition Algorithm Based on Covariance Descriptor

A Scene Recognition Algorithm Based on Covariance Descriptor A Scene Recognition Algorithm Based on Covariance Descriptor Yinghui Ge Faculty of Information Science and Technology Ningbo University Ningbo, China gyhzd@tom.com Jianjun Yu Department of Computer Science

More information

SUBSET SELECTION FOR LANDMARK MODERN AND HISTORIC IMAGES

SUBSET SELECTION FOR LANDMARK MODERN AND HISTORIC IMAGES SUBSET SELECTION FOR LANDMARK MODERN AND HISTORIC IMAGES Heider K. Ali 1 and Anthony Whitehead 2 1 Carleton University, Systems & Computer Engineering Department, Ottawa, ON, K1S 5B8, Canada heider@sce.carleton.ca

More information

Beyond bags of Features

Beyond bags of Features Beyond bags of Features Spatial Pyramid Matching for Recognizing Natural Scene Categories Camille Schreck, Romain Vavassori Ensimag December 14, 2012 Schreck, Vavassori (Ensimag) Beyond bags of Features

More information

ACCELERATION OF IMAGE RESTORATION ALGORITHMS FOR DYNAMIC MEASUREMENTS IN COORDINATE METROLOGY BY USING OPENCV GPU FRAMEWORK

ACCELERATION OF IMAGE RESTORATION ALGORITHMS FOR DYNAMIC MEASUREMENTS IN COORDINATE METROLOGY BY USING OPENCV GPU FRAMEWORK URN (Paper): urn:nbn:de:gbv:ilm1-2014iwk-140:6 58 th ILMENAU SCIENTIFIC COLLOQUIUM Technische Universität Ilmenau, 08 12 September 2014 URN: urn:nbn:de:gbv:ilm1-2014iwk:3 ACCELERATION OF IMAGE RESTORATION

More information

Schedule for Rest of Semester

Schedule for Rest of Semester Schedule for Rest of Semester Date Lecture Topic 11/20 24 Texture 11/27 25 Review of Statistics & Linear Algebra, Eigenvectors 11/29 26 Eigenvector expansions, Pattern Recognition 12/4 27 Cameras & calibration

More information

Texture Segmentation

Texture Segmentation Texture Segmentation Introduction to Signal and Image Processing Prof. Dr. Philippe Cattin MIAC, University of Basel 1 of 48 22.02.2016 09:20 Contents Contents Abstract 2 1 Introduction What is Texture?

More information

Image Enhancement Techniques for Fingerprint Identification

Image Enhancement Techniques for Fingerprint Identification March 2013 1 Image Enhancement Techniques for Fingerprint Identification Pankaj Deshmukh, Siraj Pathan, Riyaz Pathan Abstract The aim of this paper is to propose a new method in fingerprint enhancement

More information

SUPPLEMENTARY MATERIAL FOR. Do computer vision models differ systematically from human object perception? RT Pramod 1,2 & SP Arun 1

SUPPLEMENTARY MATERIAL FOR. Do computer vision models differ systematically from human object perception? RT Pramod 1,2 & SP Arun 1 SUPPLEMENTARY MATERIAL FOR Do computer vision models differ systematically from human object perception? RT Pramod 1,2 & SP Arun 1 1 Centre for Neuroscience & 2 Department of Electrical Communication Engineering

More information

An Introduction to Content Based Image Retrieval

An Introduction to Content Based Image Retrieval CHAPTER -1 An Introduction to Content Based Image Retrieval 1.1 Introduction With the advancement in internet and multimedia technologies, a huge amount of multimedia data in the form of audio, video and

More information

Real-Time Detection of Landscape Scenes

Real-Time Detection of Landscape Scenes Real-Time Detection of Landscape Scenes Sami Huttunen 1,EsaRahtu 1, Iivari Kunttu 2, Juuso Gren 2, and Janne Heikkilä 1 1 Machine Vision Group, University of Oulu, Finland firstname.lastname@ee.oulu.fi

More information

Advanced CUDA Optimization 1. Introduction

Advanced CUDA Optimization 1. Introduction Advanced CUDA Optimization 1. Introduction Thomas Bradley Agenda CUDA Review Review of CUDA Architecture Programming & Memory Models Programming Environment Execution Performance Optimization Guidelines

More information

Texture. COS 429 Princeton University

Texture. COS 429 Princeton University Texture COS 429 Princeton University Texture What is a texture? Antonio Torralba Texture What is a texture? Antonio Torralba Texture What is a texture? Antonio Torralba Texture Texture is stochastic and

More information

Weed Seeds Recognition via Support Vector Machine and Random Forest

Weed Seeds Recognition via Support Vector Machine and Random Forest Weed Seeds Recognition via Support Vector Machine and Random Forest Yilin Long and Cheng Cai * * Department of Computer Science, College of Information Engineering, Northwest A&F University, Yangling,

More information

GPU Based Face Recognition System for Authentication

GPU Based Face Recognition System for Authentication GPU Based Face Recognition System for Authentication Bhumika Agrawal, Chelsi Gupta, Meghna Mandloi, Divya Dwivedi, Jayesh Surana Information Technology, SVITS Gram Baroli, Sanwer road, Indore, MP, India

More information

Fingerprint Verification applying Invariant Moments

Fingerprint Verification applying Invariant Moments Fingerprint Verification applying Invariant Moments J. Leon, G Sanchez, G. Aguilar. L. Toscano. H. Perez, J. M. Ramirez National Polytechnic Institute SEPI ESIME CULHUACAN Mexico City, Mexico National

More information

Aerial Image Classification Using Structural Texture Similarity

Aerial Image Classification Using Structural Texture Similarity Aerial Image Classification Using Structural Texture Similarity Vladimir Risojević and Zdenka Babić Faculty of Electrical Engineering, University of Banja Luka Patre 5, 78000 Banja Luka, Bosnia and Herzegovina

More information

Computational Methods for Radiance. Render the full variety offered by the direct observation of objects. (Computationally).

Computational Methods for Radiance. Render the full variety offered by the direct observation of objects. (Computationally). Computational Methods for Radiance Render the full variety offered by the direct observation of objects. (Computationally). Methods for Plenoptic 1.0 Computing with Radiance Goal: Render the full variety

More information

5. Feature Extraction from Images

5. Feature Extraction from Images 5. Feature Extraction from Images Aim of this Chapter: Learn the Basic Feature Extraction Methods for Images Main features: Color Texture Edges Wie funktioniert ein Mustererkennungssystem Test Data x i

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

Fourier Transform and Texture Filtering

Fourier Transform and Texture Filtering Fourier Transform and Texture Filtering Lucas J. van Vliet www.ph.tn.tudelft.nl/~lucas Image Analysis Paradigm scene Image formation sensor pre-processing Image enhancement Image restoration Texture filtering

More information

Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope International Journal of Computer Vision 42(3), 145 175, 2001 c 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. Modeling the Shape of the Scene: A Holistic Representation of the Spatial

More information

Advanced Video Content Analysis and Video Compression (5LSH0), Module 4

Advanced Video Content Analysis and Video Compression (5LSH0), Module 4 Advanced Video Content Analysis and Video Compression (5LSH0), Module 4 Visual feature extraction Part I: Color and texture analysis Sveta Zinger Video Coding and Architectures Research group, TU/e ( s.zinger@tue.nl

More information

USING THE GPU FOR FAST SYMMETRY-BASED DENSE STEREO MATCHING IN HIGH RESOLUTION IMAGES

USING THE GPU FOR FAST SYMMETRY-BASED DENSE STEREO MATCHING IN HIGH RESOLUTION IMAGES USING THE GPU FOR FAST SYMMETRY-BASED DENSE STEREO MATCHING IN HIGH RESOLUTION IMAGES Vasco Mota Gabriel Falcao Michel Antunes Joao Barreto Urbano Nunes Institute of Systems and Robotics, Dept. of Electr.

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

CS 223B Computer Vision Problem Set 3

CS 223B Computer Vision Problem Set 3 CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.

More information

Short Survey on Static Hand Gesture Recognition

Short Survey on Static Hand Gesture Recognition Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of

More information

NumbaPro CUDA Python. Square matrix multiplication

NumbaPro CUDA Python. Square matrix multiplication NumbaPro Enables parallel programming in Python Support various entry points: Low-level (CUDA-C like) programming language High-level array oriented interface CUDA library bindings Also support multicore

More information

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction Preprocessing The goal of pre-processing is to try to reduce unwanted variation in image due to lighting,

More information

Artifacts and Textured Region Detection

Artifacts and Textured Region Detection Artifacts and Textured Region Detection 1 Vishal Bangard ECE 738 - Spring 2003 I. INTRODUCTION A lot of transformations, when applied to images, lead to the development of various artifacts in them. In

More information

Practical Image and Video Processing Using MATLAB

Practical Image and Video Processing Using MATLAB Practical Image and Video Processing Using MATLAB Chapter 18 Feature extraction and representation What will we learn? What is feature extraction and why is it a critical step in most computer vision and

More information

CS 229 Classification of Channel Bifurcation Points in Remote Sensing Imagery of River Deltas. Erik Nesvold

CS 229 Classification of Channel Bifurcation Points in Remote Sensing Imagery of River Deltas. Erik Nesvold CS 229 Classification of Channel Bifurcation Points in Remote Sensing Imagery of River Deltas Erik Nesvold nesvold@stanford.edu I. Introduction River deltas are very important landforms on Earth that are

More information

A SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS

A SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS A SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS Enrico Giora and Clara Casco Department of General Psychology, University of Padua, Italy Abstract Edge-based energy models

More information

ELL 788 Computational Perception & Cognition July November 2015

ELL 788 Computational Perception & Cognition July November 2015 ELL 788 Computational Perception & Cognition July November 2015 Module 6 Role of context in object detection Objects and cognition Ambiguous objects Unfavorable viewing condition Context helps in object

More information

Comparing Local Feature Descriptors in plsa-based Image Models

Comparing Local Feature Descriptors in plsa-based Image Models Comparing Local Feature Descriptors in plsa-based Image Models Eva Hörster 1,ThomasGreif 1, Rainer Lienhart 1, and Malcolm Slaney 2 1 Multimedia Computing Lab, University of Augsburg, Germany {hoerster,lienhart}@informatik.uni-augsburg.de

More information

Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs

Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs Xiaowei Li, Changchang Wu, Christopher Zach, Svetlana Lazebnik, Jan-Michael Frahm 1 Motivation Target problem: organizing

More information

Set Size, Clutter & Complexity

Set Size, Clutter & Complexity Set Size, Clutter & Complexity A review of You are simple Quantities Aude Oliva "I think the next century will be the century of complexity." Stephen Hawking To be complex or not to be complex Unfamiliar

More information

A Content Based Image Retrieval System Based on Color Features

A Content Based Image Retrieval System Based on Color Features A Content Based Image Retrieval System Based on Features Irena Valova, University of Rousse Angel Kanchev, Department of Computer Systems and Technologies, Rousse, Bulgaria, Irena@ecs.ru.acad.bg Boris

More information

ImageCLEF 2011

ImageCLEF 2011 SZTAKI @ ImageCLEF 2011 Bálint Daróczy joint work with András Benczúr, Róbert Pethes Data Mining and Web Search Group Computer and Automation Research Institute Hungarian Academy of Sciences Training/test

More information

Face Detection CUDA Accelerating

Face Detection CUDA Accelerating Face Detection CUDA Accelerating Jaromír Krpec Department of Computer Science VŠB Technical University Ostrava Ostrava, Czech Republic krpec.jaromir@seznam.cz Martin Němec Department of Computer Science

More information

Sar Image Segmentation Using Hierarchical Unequal Merging

Sar Image Segmentation Using Hierarchical Unequal Merging Sar Image Segmentation Using Hierarchical Unequal Merging V. Shalini M.E. Communication Engineering Sri Sakthi Institute Of Engineering And Technology Coimbatore, India Dr. S. Bhavani Sri Sakthi Institute

More information

B. Tech. Project Second Stage Report on

B. Tech. Project Second Stage Report on B. Tech. Project Second Stage Report on GPU Based Active Contours Submitted by Sumit Shekhar (05007028) Under the guidance of Prof Subhasis Chaudhuri Table of Contents 1. Introduction... 1 1.1 Graphic

More information

Dimensionality Reduction using Relative Attributes

Dimensionality Reduction using Relative Attributes Dimensionality Reduction using Relative Attributes Mohammadreza Babaee 1, Stefanos Tsoukalas 1, Maryam Babaee Gerhard Rigoll 1, and Mihai Datcu 1 Institute for Human-Machine Communication, Technische Universität

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

Indoor Outdoor Image Classification

Indoor Outdoor Image Classification Indoor Outdoor Image Classification Bachelor Thesis Fei Guo Department of Computer Science, ETH Zurich guof@student.ethz.ch Advisor: Lukas Bossard Supervisor: Prof. Dr. Luc van Gool August 18, 2011 Abstract

More information

Latest development in image feature representation and extraction

Latest development in image feature representation and extraction International Journal of Advanced Research and Development ISSN: 2455-4030, Impact Factor: RJIF 5.24 www.advancedjournal.com Volume 2; Issue 1; January 2017; Page No. 05-09 Latest development in image

More information

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 231A Computer Vision (Fall 2012) Problem Set 3 CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest

More information

Texture. Outline. Image representations: spatial and frequency Fourier transform Frequency filtering Oriented pyramids Texture representation

Texture. Outline. Image representations: spatial and frequency Fourier transform Frequency filtering Oriented pyramids Texture representation Texture Outline Image representations: spatial and frequency Fourier transform Frequency filtering Oriented pyramids Texture representation 1 Image Representation The standard basis for images is the set

More information

Patch-based Object Recognition. Basic Idea

Patch-based Object Recognition. Basic Idea Patch-based Object Recognition 1! Basic Idea Determine interest points in image Determine local image properties around interest points Use local image properties for object classification Example: Interest

More information

CHAPTER 5 GLOBAL AND LOCAL FEATURES FOR FACE RECOGNITION

CHAPTER 5 GLOBAL AND LOCAL FEATURES FOR FACE RECOGNITION 122 CHAPTER 5 GLOBAL AND LOCAL FEATURES FOR FACE RECOGNITION 5.1 INTRODUCTION Face recognition, means checking for the presence of a face from a database that contains many faces and could be performed

More information

Classification of objects from Video Data (Group 30)

Classification of objects from Video Data (Group 30) Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time

More information

High Performance Video Artifact Detection Enhanced with CUDA. Atul Ravindran Digimetrics

High Performance Video Artifact Detection Enhanced with CUDA. Atul Ravindran Digimetrics High Performance Video Artifact Detection Enhanced with CUDA Atul Ravindran Digimetrics Goals & Challenges Provide automated QC for digital video files with accuracy and minimum false positives Provide

More information

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37 Extended Contents List Preface... xi About the authors... xvii CHAPTER 1 Introduction 1 1.1 Overview... 1 1.2 Human and Computer Vision... 2 1.3 The Human Vision System... 4 1.3.1 The Eye... 5 1.3.2 The

More information

Texture. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors

Texture. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors Texture The most fundamental question is: How can we measure texture, i.e., how can we quantitatively distinguish between different textures? Of course it is not enough to look at the intensity of individual

More information

TEXTURE. Plan for today. Segmentation problems. What is segmentation? INF 4300 Digital Image Analysis. Why texture, and what is it?

TEXTURE. Plan for today. Segmentation problems. What is segmentation? INF 4300 Digital Image Analysis. Why texture, and what is it? INF 43 Digital Image Analysis TEXTURE Plan for today Why texture, and what is it? Statistical descriptors First order Second order Gray level co-occurrence matrices Fritz Albregtsen 8.9.21 Higher order

More information

TEXTURE ANALYSIS USING GABOR FILTERS

TEXTURE ANALYSIS USING GABOR FILTERS TEXTURE ANALYSIS USING GABOR FILTERS Texture Types Definition of Texture Texture types Synthetic Natural Stochastic < Prev Next > Texture Definition Texture: the regular repetition of an element or pattern

More information

Frequency analysis, pyramids, texture analysis, applications (face detection, category recognition)

Frequency analysis, pyramids, texture analysis, applications (face detection, category recognition) Frequency analysis, pyramids, texture analysis, applications (face detection, category recognition) Outline Measuring frequencies in images: Definitions, properties Sampling issues Relation with Gaussian

More information

Dither Removal. Bart M. ter Haar Romeny. Image Dithering. This article has not been updated for Mathematica 8.

Dither Removal. Bart M. ter Haar Romeny. Image Dithering. This article has not been updated for Mathematica 8. This article has not been updated for Mathematica 8. The Mathematica Journal Dither Removal Bart M. ter Haar Romeny Mathematica is ideal for explaining the design of seemingly complex mathematical methods.

More information

Coarse-to-fine image registration

Coarse-to-fine image registration Today we will look at a few important topics in scale space in computer vision, in particular, coarseto-fine approaches, and the SIFT feature descriptor. I will present only the main ideas here to give

More information

Final Project Report: Filterbank-Based Fingerprint Matching

Final Project Report: Filterbank-Based Fingerprint Matching Sabanci University TE 407 Digital Image Processing Final Project Report: Filterbank-Based Fingerprint Matching June 28, 2004 Didem Gözüpek & Onur Sarkan 5265 5241 1 1. Introduction The need for security

More information

Learning global properties of scene images based on their correlational structures

Learning global properties of scene images based on their correlational structures Learning global properties of scene images based on their correlational structures Wooyoung Lee Machine Learning Department Carnegie Mellon University Pittsburgh, PA 15213 wooyoung@cs.cmu.edu Michael S.

More information

Local Features Tutorial: Nov. 8, 04

Local Features Tutorial: Nov. 8, 04 Local Features Tutorial: Nov. 8, 04 Local Features Tutorial References: Matlab SIFT tutorial (from course webpage) Lowe, David G. Distinctive Image Features from Scale Invariant Features, International

More information

Multimedia Information Retrieval

Multimedia Information Retrieval Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis Why content-based? Actually, what is content-based

More information

Content Based Image Retrieval Using Curvelet Transform

Content Based Image Retrieval Using Curvelet Transform Content Based Image Retrieval Using Curvelet Transform Ishrat Jahan Sumana, Md. Monirul Islam, Dengsheng Zhang and Guojun Lu Gippsland School of Information Technology, Monash University Churchill, Victoria

More information

Automatic Classification of Outdoor Images by Region Matching

Automatic Classification of Outdoor Images by Region Matching Automatic Classification of Outdoor Images by Region Matching Oliver van Kaick and Greg Mori School of Computing Science Simon Fraser University, Burnaby, BC, V5A S6 Canada E-mail: {ovankaic,mori}@cs.sfu.ca

More information

high performance medical reconstruction using stream programming paradigms

high performance medical reconstruction using stream programming paradigms high performance medical reconstruction using stream programming paradigms This Paper describes the implementation and results of CT reconstruction using Filtered Back Projection on various stream programming

More information

Scene segmentation and pedestrian classification from 3-D range and intensity images

Scene segmentation and pedestrian classification from 3-D range and intensity images University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2012 Scene segmentation and pedestrian classification

More information

CS534: Introduction to Computer Vision Edges and Contours. Ahmed Elgammal Dept. of Computer Science Rutgers University

CS534: Introduction to Computer Vision Edges and Contours. Ahmed Elgammal Dept. of Computer Science Rutgers University CS534: Introduction to Computer Vision Edges and Contours Ahmed Elgammal Dept. of Computer Science Rutgers University Outlines What makes an edge? Gradient-based edge detection Edge Operators Laplacian

More information

Modeling Image Context using Object Centered Grid

Modeling Image Context using Object Centered Grid Modeling Image Context using Object Centered Grid Sobhan Naderi Parizi, Ivan Laptev, Alireza Tavakoli Targhi Computer Vision and Active Perception Laboratory Royal Institute of Technology (KTH) SE-100

More information

Continuous Visual Vocabulary Models for plsa-based Scene Recognition

Continuous Visual Vocabulary Models for plsa-based Scene Recognition Continuous Visual Vocabulary Models for plsa-based Scene Recognition Eva Hörster Multimedia Computing Lab University of Augsburg Augsburg, Germany hoerster@informatik.uniaugsburg.de Rainer Lienhart Multimedia

More information

Why is computer vision difficult?

Why is computer vision difficult? Why is computer vision difficult? Viewpoint variation Illumination Scale Why is computer vision difficult? Intra-class variation Motion (Source: S. Lazebnik) Background clutter Occlusion Challenges: local

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK UNSUPERVISED SEGMENTATION OF TEXTURE IMAGES USING A COMBINATION OF GABOR AND WAVELET

More information