GIST. GPU Implementation. Prakhar Jain ( ) Ejaz Ahmed ( ) 3 rd May, 2009
|
|
- Gavin Shelton
- 5 years ago
- Views:
Transcription
1 GIST GPU Implementation Prakhar Jain ( ) Ejaz Ahmed ( ) 3 rd May, 2009 International Institute Of Information Technology, Hyderabad
2 Table of Contents S. No. Topic Page No. 1 Abstract 3 2 Introduction 4 3 Basic Algorithm 4 4 Parallelization 4 5 Getting Descriptor 5 6 Graphs 8 7 Speed Up 9 8 Precision / Accuracy 9 9 Related work Refrences 9 Page 2 of 10
3 ABSTRACT GIST is a computational model of the recognition of real world scenes that bypasses the segmentation and the processing of individual objects or regions. The procedure is based on a very low dimensional representation of the scene called the Spatial Envelope. Torralba proposed a set of perceptual dimensions (naturalness, openness, roughness, expansion, ruggedness) that represent the dominant spatial structure of a scene. Then, he has shown that these dimensions may be reliably estimated using spectral and coarsely localized information. The model generates a multidimensional space in which scenes sharing membership in semantic categories (e.g., streets, highways, coasts) are projected closed together. The performance of the spatial envelope model shows that specific information about object shape or identity is not a requirement for scene categorization and that modeling a holistic representation of the scene informs about its probable semantic category. The implementation the authors have given is in Matlab which runs on the CPU.The objective of this project is to parallelize it using GPGPU ( CUDA ). Page 3 of 10
4 Introduction Many of the Computer Vision algorthims are computationally expensive.algorithms such as Adaboost training, SIFT Feature training, SFM etc require weeks to execute. But, these algorithms are inherently parallelizable. In this project we have parallelized the GIST feature exploiting the computation power of the GPU with the help of Nvidia s CUDA. The rest of the report descrives the Basic algorithm, flowchart and our approach to parallelize it. We conclude it with some results, speed ups and graphs to support the performance claims. BASIC ALGORITHM Create Gabor filter bank Create a Parameter Matrix Create transfer functions Preprocess Image Low contrast normalization Local luminance variance normalization Getting the descriptor For each filter Convolve image with filter Divide the image in blocks Mean of each block is the corresponding feature PARALLELIZATION Pixel level parallelization. Use one thread to calculate the value of one element of a Gabor filter. Number of threads will be equal to number of filters * size_of_image * size_of_image Preprocessing the image Page 4 of 10
5 o Fourier transform * gaussian ( element by element ) o Pixel level parallelization due to inter pixel independence. Getting the Descriptor:-> For calculating the actual descriptors three things are required:- Preprocessing of image to obtain normalized image suitable for calculating features. Calculation of Gabor filters for every orientation and scale. Number of bocks in which the image should be devided. Steps For Getting the Descriptor:- 1. Getting the Fourier Transform of an image:- For calculating the features the processing is done in frequency domain.therefore the image needs to be converted from spacial to frequency domain. Image R G B components R G B Components Spatial Domain Spatial Domain Frequency Domain Fourier Transform of each component of image is calculated using CUFFT Library's function, which is 2 times as fast as CPU implementation of FFT. 2. Applying the Filters on Images:- Number of scales Ns Number of Orientation Per Scale No Total Number of Filters Nf=Ns * No Number of Channels 3 Page 5 of 10
6 Each filter is applied to each channel of image to give Nf*3 images.this is done on GPU with the help of following Kernels Element Multiplication:- Number of Threads Created = ImageSize * imagesize*nf R G B image components in Frquency Domain Nf Number of Filters (i) Element Multiplication Result Of Element Multiplication in Frequency domain. Taking Inverse FFT of Nf * 3 Images :- Inverse Fourier Transform of Nf * 3 images is calculated using CUFFT Library's function, which is 2 times as fast as CPU implementation of IFFT. Taking Absolute of Every image :- For this absolute kernel is used. Total Number of threads created = ImageSize * ImageSize * Nf * 3. Page 6 of 10
7 3. Getting The Features:- For every Nf * 3 images (obtained from prev step) Devide the image in blocks. Mean Value of each block is the one feature. mean value of each block is one feature For calculating mean CUDPP's SEGMENTED SCAN is Used. Page 7 of 10
8 Graphs Page 8 of 10
9 SPEED UP Precision / Accuracy The values are found to be accurate upto 4 places of decimal when compared to results of Torralba s Matlab code. Related Work and Refrences Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope Aude Oliva, Antonio Torralba International Journal of Computer Vision, Vol. 42(3): , CPU Implementation: Segmented Scan: pub lic_interface.html Page 9 of 10
10 Page 10 of 10
Visual localization using global visual features and vanishing points
Visual localization using global visual features and vanishing points Olivier Saurer, Friedrich Fraundorfer, and Marc Pollefeys Computer Vision and Geometry Group, ETH Zürich, Switzerland {saurero,fraundorfer,marc.pollefeys}@inf.ethz.ch
More informationStatistics of Natural Image Categories
Statistics of Natural Image Categories Authors: Antonio Torralba and Aude Oliva Presented by: Sebastian Scherer Experiment Please estimate the average depth from the camera viewpoint to all locations(pixels)
More informationTag Recommendation for Photos
Tag Recommendation for Photos Gowtham Kumar Ramani, Rahul Batra, Tripti Assudani December 10, 2009 Abstract. We present a real-time recommendation system for photo annotation that can be used in Flickr.
More informationROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING. Wei Liu, Serkan Kiranyaz and Moncef Gabbouj
Proceedings of the 5th International Symposium on Communications, Control and Signal Processing, ISCCSP 2012, Rome, Italy, 2-4 May 2012 ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING
More informationLarge-Scale Scene Classification Using Gist Feature
Large-Scale Scene Classification Using Gist Feature Reza Fuad Rachmadi, I Ketut Eddy Purnama Telematics Laboratory Department of Multimedia and Networking Engineering Institut Teknologi Sepuluh Nopember
More informationScene Recognition using Bag-of-Words
Scene Recognition using Bag-of-Words Sarthak Ahuja B.Tech Computer Science Indraprastha Institute of Information Technology Okhla, Delhi 110020 Email: sarthak12088@iiitd.ac.in Anchita Goel B.Tech Computer
More informationComputer Vision for VLFeat and more...
Computer Vision for VLFeat and more... Holistic Methods Francisco Escolano, PhD Associate Professor University of Alicante, Spain Contents PCA/Karhunen-Loeve (slides appart) GIST and Spatial Evelope Image
More informationLOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS
8th International DAAAM Baltic Conference "INDUSTRIAL ENGINEERING - 19-21 April 2012, Tallinn, Estonia LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS Shvarts, D. & Tamre, M. Abstract: The
More informationRecognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials
Recognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials Yuanjun Xiong 1 Kai Zhu 1 Dahua Lin 1 Xiaoou Tang 1,2 1 Department of Information Engineering, The Chinese University
More informationTemplates, Image Pyramids, and Filter Banks
Templates, Image Pyramids, and Filter Banks Computer Vision James Hays, Brown Slides: Hoiem and others Reminder Project due Friday Fourier Bases Teases away fast vs. slow changes in the image. This change
More informationEvaluation of GIST descriptors for web scale image search
Evaluation of GIST descriptors for web scale image search Matthijs Douze Hervé Jégou, Harsimrat Sandhawalia, Laurent Amsaleg and Cordelia Schmid INRIA Grenoble, France July 9, 2009 Evaluation of GIST for
More informationContent Based Image Retrieval
Content Based Image Retrieval R. Venkatesh Babu Outline What is CBIR Approaches Features for content based image retrieval Global Local Hybrid Similarity measure Trtaditional Image Retrieval Traditional
More informationContextual priming for artificial visual perception
Contextual priming for artificial visual perception Hervé Guillaume 1, Nathalie Denquive 1, Philippe Tarroux 1,2 1 LIMSI-CNRS BP 133 F-91403 Orsay cedex France 2 ENS 45 rue d Ulm F-75230 Paris cedex 05
More informationUnsupervised Deep Learning for Scene Recognition
Unsupervised Deep Learning for Scene Recognition Akram Helou and Chau Nguyen May 19, 2011 1 Introduction Object and scene recognition are usually studied separately. However, research [2]shows that context
More informationHigh performance 2D Discrete Fourier Transform on Heterogeneous Platforms. Shrenik Lad, IIIT Hyderabad Advisor : Dr. Kishore Kothapalli
High performance 2D Discrete Fourier Transform on Heterogeneous Platforms Shrenik Lad, IIIT Hyderabad Advisor : Dr. Kishore Kothapalli Motivation Fourier Transform widely used in Physics, Astronomy, Engineering
More informationImplementing a Speech Recognition System on a GPU using CUDA. Presented by Omid Talakoub Astrid Yi
Implementing a Speech Recognition System on a GPU using CUDA Presented by Omid Talakoub Astrid Yi Outline Background Motivation Speech recognition algorithm Implementation steps GPU implementation strategies
More informationScene-Centered Description from Spatial Envelope Properties
Scene-Centered Description from Spatial Envelope Properties Aude Oliva 1 and Antonio Torralba 2 1 Department of Psychology and Cognitive Science Program Michigan State University, East Lansing, MI 48824,
More informationEvery Picture Tells a Story: Generating Sentences from Images
Every Picture Tells a Story: Generating Sentences from Images Ali Farhadi, Mohsen Hejrati, Mohammad Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, David Forsyth University of Illinois
More informationLecture 2: 2D Fourier transforms and applications
Lecture 2: 2D Fourier transforms and applications B14 Image Analysis Michaelmas 2017 Dr. M. Fallon Fourier transforms and spatial frequencies in 2D Definition and meaning The Convolution Theorem Applications
More informationDigital Image Processing. Image Enhancement in the Frequency Domain
Digital Image Processing Image Enhancement in the Frequency Domain Topics Frequency Domain Enhancements Fourier Transform Convolution High Pass Filtering in Frequency Domain Low Pass Filtering in Frequency
More informationInternational Journal of Computer Science and Network (IJCSN) Volume 1, Issue 4, August ISSN
Accelerating MATLAB Applications on Parallel Hardware 1 Kavita Chauhan, 2 Javed Ashraf 1 NGFCET, M.D.University Palwal,Haryana,India Page 80 2 AFSET, M.D.University Dhauj,Haryana,India Abstract MATLAB
More informationVisual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space.
Visual words Map high-dimensional descriptors to tokens/words by quantizing the feature space. Quantize via clustering; cluster centers are the visual words Word #2 Descriptor feature space Assign word
More informationVisual Object Recognition
Visual Object Recognition Lecture 3: Descriptors Per-Erik Forssén, docent Computer Vision Laboratory Department of Electrical Engineering Linköping University 2015 2014 Per-Erik Forssén Lecture 3: Descriptors
More informationLearning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009
Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer
More informationPeriocular Biometrics: When Iris Recognition Fails
Periocular Biometrics: When Iris Recognition Fails Samarth Bharadwaj, Himanshu S. Bhatt, Mayank Vatsa and Richa Singh Abstract The performance of iris recognition is affected if iris is captured at a distance.
More informationHuman detection solution for a retail store environment
FACULDADE DE ENGENHARIA DA UNIVERSIDADE DO PORTO Human detection solution for a retail store environment Vítor Araújo PREPARATION OF THE MSC DISSERTATION Mestrado Integrado em Engenharia Eletrotécnica
More informationUsing the Forest to See the Trees: Context-based Object Recognition
Using the Forest to See the Trees: Context-based Object Recognition Bill Freeman Joint work with Antonio Torralba and Kevin Murphy Computer Science and Artificial Intelligence Laboratory MIT A computer
More informationSpatial Hierarchy of Textons Distributions for Scene Classification
Spatial Hierarchy of Textons Distributions for Scene Classification S. Battiato 1, G. M. Farinella 1, G. Gallo 1, and D. Ravì 1 Image Processing Laboratory, University of Catania, IT {battiato, gfarinella,
More informationImage processing in frequency Domain
Image processing in frequency Domain Introduction to Frequency Domain Deal with images in: -Spatial domain -Frequency domain Frequency Domain In the frequency or Fourier domain, the value and location
More informationA Scene Recognition Algorithm Based on Covariance Descriptor
A Scene Recognition Algorithm Based on Covariance Descriptor Yinghui Ge Faculty of Information Science and Technology Ningbo University Ningbo, China gyhzd@tom.com Jianjun Yu Department of Computer Science
More informationSUBSET SELECTION FOR LANDMARK MODERN AND HISTORIC IMAGES
SUBSET SELECTION FOR LANDMARK MODERN AND HISTORIC IMAGES Heider K. Ali 1 and Anthony Whitehead 2 1 Carleton University, Systems & Computer Engineering Department, Ottawa, ON, K1S 5B8, Canada heider@sce.carleton.ca
More informationBeyond bags of Features
Beyond bags of Features Spatial Pyramid Matching for Recognizing Natural Scene Categories Camille Schreck, Romain Vavassori Ensimag December 14, 2012 Schreck, Vavassori (Ensimag) Beyond bags of Features
More informationACCELERATION OF IMAGE RESTORATION ALGORITHMS FOR DYNAMIC MEASUREMENTS IN COORDINATE METROLOGY BY USING OPENCV GPU FRAMEWORK
URN (Paper): urn:nbn:de:gbv:ilm1-2014iwk-140:6 58 th ILMENAU SCIENTIFIC COLLOQUIUM Technische Universität Ilmenau, 08 12 September 2014 URN: urn:nbn:de:gbv:ilm1-2014iwk:3 ACCELERATION OF IMAGE RESTORATION
More informationSchedule for Rest of Semester
Schedule for Rest of Semester Date Lecture Topic 11/20 24 Texture 11/27 25 Review of Statistics & Linear Algebra, Eigenvectors 11/29 26 Eigenvector expansions, Pattern Recognition 12/4 27 Cameras & calibration
More informationTexture Segmentation
Texture Segmentation Introduction to Signal and Image Processing Prof. Dr. Philippe Cattin MIAC, University of Basel 1 of 48 22.02.2016 09:20 Contents Contents Abstract 2 1 Introduction What is Texture?
More informationImage Enhancement Techniques for Fingerprint Identification
March 2013 1 Image Enhancement Techniques for Fingerprint Identification Pankaj Deshmukh, Siraj Pathan, Riyaz Pathan Abstract The aim of this paper is to propose a new method in fingerprint enhancement
More informationSUPPLEMENTARY MATERIAL FOR. Do computer vision models differ systematically from human object perception? RT Pramod 1,2 & SP Arun 1
SUPPLEMENTARY MATERIAL FOR Do computer vision models differ systematically from human object perception? RT Pramod 1,2 & SP Arun 1 1 Centre for Neuroscience & 2 Department of Electrical Communication Engineering
More informationAn Introduction to Content Based Image Retrieval
CHAPTER -1 An Introduction to Content Based Image Retrieval 1.1 Introduction With the advancement in internet and multimedia technologies, a huge amount of multimedia data in the form of audio, video and
More informationReal-Time Detection of Landscape Scenes
Real-Time Detection of Landscape Scenes Sami Huttunen 1,EsaRahtu 1, Iivari Kunttu 2, Juuso Gren 2, and Janne Heikkilä 1 1 Machine Vision Group, University of Oulu, Finland firstname.lastname@ee.oulu.fi
More informationAdvanced CUDA Optimization 1. Introduction
Advanced CUDA Optimization 1. Introduction Thomas Bradley Agenda CUDA Review Review of CUDA Architecture Programming & Memory Models Programming Environment Execution Performance Optimization Guidelines
More informationTexture. COS 429 Princeton University
Texture COS 429 Princeton University Texture What is a texture? Antonio Torralba Texture What is a texture? Antonio Torralba Texture What is a texture? Antonio Torralba Texture Texture is stochastic and
More informationWeed Seeds Recognition via Support Vector Machine and Random Forest
Weed Seeds Recognition via Support Vector Machine and Random Forest Yilin Long and Cheng Cai * * Department of Computer Science, College of Information Engineering, Northwest A&F University, Yangling,
More informationGPU Based Face Recognition System for Authentication
GPU Based Face Recognition System for Authentication Bhumika Agrawal, Chelsi Gupta, Meghna Mandloi, Divya Dwivedi, Jayesh Surana Information Technology, SVITS Gram Baroli, Sanwer road, Indore, MP, India
More informationFingerprint Verification applying Invariant Moments
Fingerprint Verification applying Invariant Moments J. Leon, G Sanchez, G. Aguilar. L. Toscano. H. Perez, J. M. Ramirez National Polytechnic Institute SEPI ESIME CULHUACAN Mexico City, Mexico National
More informationAerial Image Classification Using Structural Texture Similarity
Aerial Image Classification Using Structural Texture Similarity Vladimir Risojević and Zdenka Babić Faculty of Electrical Engineering, University of Banja Luka Patre 5, 78000 Banja Luka, Bosnia and Herzegovina
More informationComputational Methods for Radiance. Render the full variety offered by the direct observation of objects. (Computationally).
Computational Methods for Radiance Render the full variety offered by the direct observation of objects. (Computationally). Methods for Plenoptic 1.0 Computing with Radiance Goal: Render the full variety
More information5. Feature Extraction from Images
5. Feature Extraction from Images Aim of this Chapter: Learn the Basic Feature Extraction Methods for Images Main features: Color Texture Edges Wie funktioniert ein Mustererkennungssystem Test Data x i
More informationPreviously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011
Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition
More informationFourier Transform and Texture Filtering
Fourier Transform and Texture Filtering Lucas J. van Vliet www.ph.tn.tudelft.nl/~lucas Image Analysis Paradigm scene Image formation sensor pre-processing Image enhancement Image restoration Texture filtering
More informationModeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
International Journal of Computer Vision 42(3), 145 175, 2001 c 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. Modeling the Shape of the Scene: A Holistic Representation of the Spatial
More informationAdvanced Video Content Analysis and Video Compression (5LSH0), Module 4
Advanced Video Content Analysis and Video Compression (5LSH0), Module 4 Visual feature extraction Part I: Color and texture analysis Sveta Zinger Video Coding and Architectures Research group, TU/e ( s.zinger@tue.nl
More informationUSING THE GPU FOR FAST SYMMETRY-BASED DENSE STEREO MATCHING IN HIGH RESOLUTION IMAGES
USING THE GPU FOR FAST SYMMETRY-BASED DENSE STEREO MATCHING IN HIGH RESOLUTION IMAGES Vasco Mota Gabriel Falcao Michel Antunes Joao Barreto Urbano Nunes Institute of Systems and Robotics, Dept. of Electr.
More informationPart-based and local feature models for generic object recognition
Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza
More informationCS 223B Computer Vision Problem Set 3
CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.
More informationShort Survey on Static Hand Gesture Recognition
Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of
More informationNumbaPro CUDA Python. Square matrix multiplication
NumbaPro Enables parallel programming in Python Support various entry points: Low-level (CUDA-C like) programming language High-level array oriented interface CUDA library bindings Also support multicore
More informationComputer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction
Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction Preprocessing The goal of pre-processing is to try to reduce unwanted variation in image due to lighting,
More informationArtifacts and Textured Region Detection
Artifacts and Textured Region Detection 1 Vishal Bangard ECE 738 - Spring 2003 I. INTRODUCTION A lot of transformations, when applied to images, lead to the development of various artifacts in them. In
More informationPractical Image and Video Processing Using MATLAB
Practical Image and Video Processing Using MATLAB Chapter 18 Feature extraction and representation What will we learn? What is feature extraction and why is it a critical step in most computer vision and
More informationCS 229 Classification of Channel Bifurcation Points in Remote Sensing Imagery of River Deltas. Erik Nesvold
CS 229 Classification of Channel Bifurcation Points in Remote Sensing Imagery of River Deltas Erik Nesvold nesvold@stanford.edu I. Introduction River deltas are very important landforms on Earth that are
More informationA SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS
A SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS Enrico Giora and Clara Casco Department of General Psychology, University of Padua, Italy Abstract Edge-based energy models
More informationELL 788 Computational Perception & Cognition July November 2015
ELL 788 Computational Perception & Cognition July November 2015 Module 6 Role of context in object detection Objects and cognition Ambiguous objects Unfavorable viewing condition Context helps in object
More informationComparing Local Feature Descriptors in plsa-based Image Models
Comparing Local Feature Descriptors in plsa-based Image Models Eva Hörster 1,ThomasGreif 1, Rainer Lienhart 1, and Malcolm Slaney 2 1 Multimedia Computing Lab, University of Augsburg, Germany {hoerster,lienhart}@informatik.uni-augsburg.de
More informationModeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs
Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs Xiaowei Li, Changchang Wu, Christopher Zach, Svetlana Lazebnik, Jan-Michael Frahm 1 Motivation Target problem: organizing
More informationSet Size, Clutter & Complexity
Set Size, Clutter & Complexity A review of You are simple Quantities Aude Oliva "I think the next century will be the century of complexity." Stephen Hawking To be complex or not to be complex Unfamiliar
More informationA Content Based Image Retrieval System Based on Color Features
A Content Based Image Retrieval System Based on Features Irena Valova, University of Rousse Angel Kanchev, Department of Computer Systems and Technologies, Rousse, Bulgaria, Irena@ecs.ru.acad.bg Boris
More informationImageCLEF 2011
SZTAKI @ ImageCLEF 2011 Bálint Daróczy joint work with András Benczúr, Róbert Pethes Data Mining and Web Search Group Computer and Automation Research Institute Hungarian Academy of Sciences Training/test
More informationFace Detection CUDA Accelerating
Face Detection CUDA Accelerating Jaromír Krpec Department of Computer Science VŠB Technical University Ostrava Ostrava, Czech Republic krpec.jaromir@seznam.cz Martin Němec Department of Computer Science
More informationSar Image Segmentation Using Hierarchical Unequal Merging
Sar Image Segmentation Using Hierarchical Unequal Merging V. Shalini M.E. Communication Engineering Sri Sakthi Institute Of Engineering And Technology Coimbatore, India Dr. S. Bhavani Sri Sakthi Institute
More informationB. Tech. Project Second Stage Report on
B. Tech. Project Second Stage Report on GPU Based Active Contours Submitted by Sumit Shekhar (05007028) Under the guidance of Prof Subhasis Chaudhuri Table of Contents 1. Introduction... 1 1.1 Graphic
More informationDimensionality Reduction using Relative Attributes
Dimensionality Reduction using Relative Attributes Mohammadreza Babaee 1, Stefanos Tsoukalas 1, Maryam Babaee Gerhard Rigoll 1, and Mihai Datcu 1 Institute for Human-Machine Communication, Technische Universität
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationIndoor Outdoor Image Classification
Indoor Outdoor Image Classification Bachelor Thesis Fei Guo Department of Computer Science, ETH Zurich guof@student.ethz.ch Advisor: Lukas Bossard Supervisor: Prof. Dr. Luc van Gool August 18, 2011 Abstract
More informationLatest development in image feature representation and extraction
International Journal of Advanced Research and Development ISSN: 2455-4030, Impact Factor: RJIF 5.24 www.advancedjournal.com Volume 2; Issue 1; January 2017; Page No. 05-09 Latest development in image
More informationCS 231A Computer Vision (Fall 2012) Problem Set 3
CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest
More informationTexture. Outline. Image representations: spatial and frequency Fourier transform Frequency filtering Oriented pyramids Texture representation
Texture Outline Image representations: spatial and frequency Fourier transform Frequency filtering Oriented pyramids Texture representation 1 Image Representation The standard basis for images is the set
More informationPatch-based Object Recognition. Basic Idea
Patch-based Object Recognition 1! Basic Idea Determine interest points in image Determine local image properties around interest points Use local image properties for object classification Example: Interest
More informationCHAPTER 5 GLOBAL AND LOCAL FEATURES FOR FACE RECOGNITION
122 CHAPTER 5 GLOBAL AND LOCAL FEATURES FOR FACE RECOGNITION 5.1 INTRODUCTION Face recognition, means checking for the presence of a face from a database that contains many faces and could be performed
More informationClassification of objects from Video Data (Group 30)
Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time
More informationHigh Performance Video Artifact Detection Enhanced with CUDA. Atul Ravindran Digimetrics
High Performance Video Artifact Detection Enhanced with CUDA Atul Ravindran Digimetrics Goals & Challenges Provide automated QC for digital video files with accuracy and minimum false positives Provide
More informationCHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37
Extended Contents List Preface... xi About the authors... xvii CHAPTER 1 Introduction 1 1.1 Overview... 1 1.2 Human and Computer Vision... 2 1.3 The Human Vision System... 4 1.3.1 The Eye... 5 1.3.2 The
More informationTexture. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors
Texture The most fundamental question is: How can we measure texture, i.e., how can we quantitatively distinguish between different textures? Of course it is not enough to look at the intensity of individual
More informationTEXTURE. Plan for today. Segmentation problems. What is segmentation? INF 4300 Digital Image Analysis. Why texture, and what is it?
INF 43 Digital Image Analysis TEXTURE Plan for today Why texture, and what is it? Statistical descriptors First order Second order Gray level co-occurrence matrices Fritz Albregtsen 8.9.21 Higher order
More informationTEXTURE ANALYSIS USING GABOR FILTERS
TEXTURE ANALYSIS USING GABOR FILTERS Texture Types Definition of Texture Texture types Synthetic Natural Stochastic < Prev Next > Texture Definition Texture: the regular repetition of an element or pattern
More informationFrequency analysis, pyramids, texture analysis, applications (face detection, category recognition)
Frequency analysis, pyramids, texture analysis, applications (face detection, category recognition) Outline Measuring frequencies in images: Definitions, properties Sampling issues Relation with Gaussian
More informationDither Removal. Bart M. ter Haar Romeny. Image Dithering. This article has not been updated for Mathematica 8.
This article has not been updated for Mathematica 8. The Mathematica Journal Dither Removal Bart M. ter Haar Romeny Mathematica is ideal for explaining the design of seemingly complex mathematical methods.
More informationCoarse-to-fine image registration
Today we will look at a few important topics in scale space in computer vision, in particular, coarseto-fine approaches, and the SIFT feature descriptor. I will present only the main ideas here to give
More informationFinal Project Report: Filterbank-Based Fingerprint Matching
Sabanci University TE 407 Digital Image Processing Final Project Report: Filterbank-Based Fingerprint Matching June 28, 2004 Didem Gözüpek & Onur Sarkan 5265 5241 1 1. Introduction The need for security
More informationLearning global properties of scene images based on their correlational structures
Learning global properties of scene images based on their correlational structures Wooyoung Lee Machine Learning Department Carnegie Mellon University Pittsburgh, PA 15213 wooyoung@cs.cmu.edu Michael S.
More informationLocal Features Tutorial: Nov. 8, 04
Local Features Tutorial: Nov. 8, 04 Local Features Tutorial References: Matlab SIFT tutorial (from course webpage) Lowe, David G. Distinctive Image Features from Scale Invariant Features, International
More informationMultimedia Information Retrieval
Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis Why content-based? Actually, what is content-based
More informationContent Based Image Retrieval Using Curvelet Transform
Content Based Image Retrieval Using Curvelet Transform Ishrat Jahan Sumana, Md. Monirul Islam, Dengsheng Zhang and Guojun Lu Gippsland School of Information Technology, Monash University Churchill, Victoria
More informationAutomatic Classification of Outdoor Images by Region Matching
Automatic Classification of Outdoor Images by Region Matching Oliver van Kaick and Greg Mori School of Computing Science Simon Fraser University, Burnaby, BC, V5A S6 Canada E-mail: {ovankaic,mori}@cs.sfu.ca
More informationhigh performance medical reconstruction using stream programming paradigms
high performance medical reconstruction using stream programming paradigms This Paper describes the implementation and results of CT reconstruction using Filtered Back Projection on various stream programming
More informationScene segmentation and pedestrian classification from 3-D range and intensity images
University of Wollongong Research Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 2012 Scene segmentation and pedestrian classification
More informationCS534: Introduction to Computer Vision Edges and Contours. Ahmed Elgammal Dept. of Computer Science Rutgers University
CS534: Introduction to Computer Vision Edges and Contours Ahmed Elgammal Dept. of Computer Science Rutgers University Outlines What makes an edge? Gradient-based edge detection Edge Operators Laplacian
More informationModeling Image Context using Object Centered Grid
Modeling Image Context using Object Centered Grid Sobhan Naderi Parizi, Ivan Laptev, Alireza Tavakoli Targhi Computer Vision and Active Perception Laboratory Royal Institute of Technology (KTH) SE-100
More informationContinuous Visual Vocabulary Models for plsa-based Scene Recognition
Continuous Visual Vocabulary Models for plsa-based Scene Recognition Eva Hörster Multimedia Computing Lab University of Augsburg Augsburg, Germany hoerster@informatik.uniaugsburg.de Rainer Lienhart Multimedia
More informationWhy is computer vision difficult?
Why is computer vision difficult? Viewpoint variation Illumination Scale Why is computer vision difficult? Intra-class variation Motion (Source: S. Lazebnik) Background clutter Occlusion Challenges: local
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK UNSUPERVISED SEGMENTATION OF TEXTURE IMAGES USING A COMBINATION OF GABOR AND WAVELET
More information