VIALACTEA: the Milky Way as a Star Formation Engine

Similar documents
Machine Learning Based Data Mining for Milky Way Filamentary Structures Reconstruction

Classification of Protein Crystallization Imagery

Stefano Cavuoti INAF Capodimonte Astronomical Observatory Napoli

CoE4TN4 Image Processing

Image Processing

Digital Image Processing

Machine Learning in Biology

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang

Lecture 7: Most Common Edge Detectors

(1) Department of Physics University Federico II, Via Cinthia 24, I Napoli, Italy (2) INAF Astronomical Observatory of Capodimonte, Via

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

COMPUTER AND ROBOT VISION

Region-based Segmentation

ECG782: Multidimensional Digital Signal Processing

Morphological track 1

Data mining and Knowledge Discovery Resources for Astronomy in the Web 2.0 Age

Efficient Image Compression of Medical Images Using the Wavelet Transform and Fuzzy c-means Clustering on Regions of Interest.

CHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS

CS 231A Computer Vision (Winter 2014) Problem Set 3

Edge detection. Stefano Ferrari. Università degli Studi di Milano Elaborazione delle immagini (Image processing I)

DATA MINING TEST 2 INSTRUCTIONS: this test consists of 4 questions you may attempt all questions. maximum marks = 100 bonus marks available = 10

SUPPLEMENTARY FILE S1: 3D AIRWAY TUBE RECONSTRUCTION AND CELL-BASED MECHANICAL MODEL. RELATED TO FIGURE 1, FIGURE 7, AND STAR METHODS.

EECS490: Digital Image Processing. Lecture #19

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT

Assignment 3: Edge Detection

The Curse of Dimensionality

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

1. INTRODUCTION. AMS Subject Classification. 68U10 Image Processing

Texture Analysis of Painted Strokes 1) Martin Lettner, Paul Kammerer, Robert Sablatnig

Analysis of Irregularly Shaped Texture Regions 1

2. LITERATURE REVIEW

Skeletonization Algorithm for Numeral Patterns

Improving the detection of excessive activation of ciliaris muscle by clustering thermal images

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Motion Estimation and Optical Flow Tracking

Edge and local feature detection - 2. Importance of edge detection in computer vision

Babu Madhav Institute of Information Technology Years Integrated M.Sc.(IT)(Semester - 7)

Bus Detection and recognition for visually impaired people

EN1610 Image Understanding Lab # 3: Edges

Edge detection. Gradient-based edge operators

Chapter 11 Representation & Description

Robust PDF Table Locator

Feature extraction. Bi-Histogram Binarization Entropy. What is texture Texture primitives. Filter banks 2D Fourier Transform Wavlet maxima points

Basic relations between pixels (Chapter 2)

FOREGROUND DETECTION ON DEPTH MAPS USING SKELETAL REPRESENTATION OF OBJECT SILHOUETTES

EECS490: Digital Image Processing. Lecture #23

CAP 6412 Advanced Computer Vision

Image Processing. Image Features

Binary Shape Characterization using Morphological Boundary Class Distribution Functions

TEXTURE. Plan for today. Segmentation problems. What is segmentation? INF 4300 Digital Image Analysis. Why texture, and what is it?

Edge and corner detection

3D Object Recognition using Multiclass SVM-KNN

Small-scale objects extraction in digital images

Edge detection. Convert a 2D image into a set of curves. Extracts salient features of the scene More compact than pixels

Enhancing Forestry Object Detection using Multiple Features

CLASSIFICATION FOR ROADSIDE OBJECTS BASED ON SIMULATED LASER SCANNING

Topic 6 Representation and Description

Image Segmentation. Srikumar Ramalingam School of Computing University of Utah. Slides borrowed from Ross Whitaker

Anno accademico 2006/2007. Davide Migliore

Local Image preprocessing (cont d)

UNIVERSITY OF OSLO. Faculty of Mathematics and Natural Sciences

Recognition of a Predefined Landmark Using Optical Flow Sensor/Camera

Sobel Edge Detection Algorithm

TEXTURE ANALYSIS USING GABOR FILTERS

EE795: Computer Vision and Intelligent Systems

DIGITAL IMAGE PROCESSING

Beyond Bags of features Spatial information & Shape models

10/14/2017. Dejan Sarka. Anomaly Detection. Sponsors

CS 4495 Computer Vision. Linear Filtering 2: Templates, Edges. Aaron Bobick. School of Interactive Computing. Templates/Edges

A new analysis technique for the Sprouts Game

Line, edge, blob and corner detection


A Review on Plant Disease Detection using Image Processing

ADAPTIVE TEXTURE IMAGE RETRIEVAL IN TRANSFORM DOMAIN

Semi-Automatic Transcription Tool for Ancient Manuscripts

Multimedia Computing: Algorithms, Systems, and Applications: Edge Detection

Modern Medical Image Analysis 8DC00 Exam

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

Morphological Image Processing GUI using MATLAB

Lorentzian Distance Classifier for Multiple Features

EE 584 MACHINE VISION

RESEARCH ON OPTIMIZATION OF IMAGE USING SKELETONIZATION TECHNIQUE WITH ADVANCED ALGORITHM

Image Enhancement: To improve the quality of images

EDGE BASED REGION GROWING

Geometric Modeling in Graphics

[Programming Assignment] (1)

A FUZZY LOGIC BASED METHOD FOR EDGE DETECTION

3D MILKY WAY SIMULATION

Automatic Image Alignment (feature-based)

TILING PROBLEMS: FROM DOMINOES, CHECKERBOARDS, AND MAZES TO DISCRETE GEOMETRY

Digital Image Processing Fundamentals

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

SLIDING WINDOW FOR RELATIONS MAPPING

2: Image Display and Digital Images. EE547 Computer Vision: Lecture Slides. 2: Digital Images. 1. Introduction: EE547 Computer Vision

INF 4300 Classification III Anne Solberg The agenda today:

CS221: Final Project Report - Object Recognition and Tracking

Face detection and recognition. Detection Recognition Sally

Visualization for Classification Problems, with Examples Using Support Vector Machines

FEATURE EXTRACTION TECHNIQUES FOR IMAGE RETRIEVAL USING HAAR AND GLCM

Object Detection Design challenges

Transcription:

Giuseppe Riccio 1, Stefano Cavuoti 1, Massimo Brescia 1, Eugenio Schisano 2, Amata Mercurio 1, Davide Elia 2, Milena Benedettini 2, Stefano Pezzuto 2, Sergio Molinari 2, Anna Maria Di Giorgio 2 (1) INAF Astronomical Observatory of Capodimonte, Via Moiariello 16, I-80131 Napoli, Italy (2) INAF IAPS, Via del Fosso del Cavaliere 100, I-00133 Roma, Italy 1

The role of WP5 in the project The goal is to exploit the combination of all the new-generation Infrared Radio surveys of the Galactic Plane from space missions and ground-based facilities, using a novel data and science analysis paradigm based on 3D visual analytics and data mining framework, to build and deliver a quantitative 3D model of our VIALACTEA Galaxy as a star formation engine that will be used as a template for external galaxies and study star formation across the cosmic time 2

WP1-task1 filamentary structure detection Task 1: Filamentary structure detection Production of column density maps of entire galactic plane Automated filament extraction workflow for Hi- GAL survey The filament extraction code was run on the column density maps covering the region between Galactic longitude 290 -- 320, with different threshold levels equal to 2.5, 3. and 3.5 times the local standard deviation of the minimum eigenvalue (Schisano et al., 2014) OACN Data Mining goal: To refine the edges; To extend filament regions. The right calculation of physical quantities related to filaments strongly depends on their dimensions, so the correct definition of edges is crucial. 3

Overview of the filament areas of interest Original image Intermediate mask TRAINING EXECUTION TRAINING + EXECUTION Traditional thresholding gradients Filament detection Filament optimization FilExSeC To combine different filament detection methods to improve the global performances OR Beam curve detection Structured Forest Filament Flux Calculation Spine Extraction Based on Z-S method SVM detection Fuzzy detection 4

FilExSeC algorithm FilExSeC (Filaments Extraction, Selection and Classification), a data mining tool to refine and optimize the detection of the edges of filamentary structures. The method consists in two main phases: Feature Extraction: a set of features depending by its neighbors is associated to each pixel of the input image Classification: image pixels are classified as filamentary or background, by using a supervised Machine Learning method, trained by these features A further phase, Feature Selection, finds the most relevant features. By reducing the initial data parameter space, it is possible indeed to improve the execution efficiency of the model, without affecting its performances. 5

Haralick feature space Haralick Features (1979) Based on co-occurrence matrix (GLCM) Element C i,j represents, for a fixed distance and direction, the probability to have two pixels in the image at that distance, with grey level Z i and Z j, respectively Haralick features extracted from C i,j /(number of pairs) Contrast m = (i j) 2 C i,j i j Energy i j 2 C i,j Entropy C i,j log C i,j i j Correlation i j (i μ i )(j μ j )C i,j σ i σ j Robert Haralick 6

Haar-like and statistical feature space Alfred Haar used templates Haar-like Features (2001) Each Haar-like variable involves 2 or 3 interconnected black and white rectangles (masks or templates) Values of each feature are obtained by sliding masks on the image and calculating: Statistical Features For each pixel, the following features are calculated in a centered window: gradients (horizontal, vertical, main diagonal, secondary diagonal) Mean, standard deviation, skewness, kurtosis, entropy, range Moreover, the pixel value is considered as a Statistical Feature too 7

Science & Technology Leo Breiman Random Forest Classifier (2001) The classifier prefigures 4 use cases: Train: the classifier is trained to discriminate between filamentary and background pixels; Test: the classifier is evaluated by a blind test dataset; Run: the trained and validated classifier is used on new real images. 8

Overview of the WP5 activities Mark A. Hall Feature Selection (Backward Elimination 1999) to reduce the parameter space, by weighting the contribution carried by each feature to the learning capability of the classifier. So it is possible to improve the execution efficiency of the model, without affecting its performances. At the end of this phase, a subset of features having higher weight (defined as importance) in recognizing filament pixels is considered. This subset is then used to definitely train and test the classifier with new training and testing subsets. Tests revealed that Haralick features are useless 9

FilExSeC pixel feature analysis Features extracted from each pixel and its neighbors Feature selection: Haralick type excluded (no information lost and improved computing time) 10

FilExSeC Filament Connections FilExSeC is able to connect, by means of NFPs, filaments that in the traditional method are tagged as disjointed objects. By joining filaments as a unique structure total mass and mass per length change, inducing a different physics of the filamentary structure. Detected Filaments 668 Confirmed Filaments 298 New Filaments 196 Extended Filaments 169 Joined Filaments 5 EXAMPLE: TEST tiles 1+2 of Hi-GAL l217-l224 A further analysis is required to verify the correctness of the reconstruction of interconnections between different filaments, to evaluate the contribution of FilExSeC to the knowledge of the physics of the filaments. Confirmed Filament Pixels New Filament Pixels Confirmed Background Undetected Filament Pixels 11

FilExSeC Example of Joined Filaments IAPS FilExSeC 1 2 3 4 5 6 Connection of 6 filaments identified by IAPS IAPS total number of pixels: 1852 Pixel added by FilExSeC : 858 New Total number of pixels: 2710 % NFP: +46.33% 12

Filament physical analysis IAPS masks 1 pixel extension IAPS masks 1 pixel extension Tests on Hi-GAL l217- l224 confirm that the FilExSeC method is able to justify a 1-pixel extension of the IAPS results, but it is always an underestimation of the IAPS masks extended with 3 pixels IAPS masks 3 pixels extension IAPS masks 3 pixels extension 13

Spine detection FilExSeC Spine alternative extraction 14

Spine detection It is a fast parallel thinning algorithm that consists of two sub-iterations: one aimed at deleting the south-east boundary points and the north-west corner points while the other one is aimed at deleting the north-west boundary points and the south-east corner points. End points and pixel connectivity are preserved. Each pattern is thinned down to a "skeleton" of unitary thickness. 15

Spine detection The medial axis of an object is the set of all points having more than one closest point on the object's boundary. In 2D, the medial axis of a subset S which is bounded by planar curve C is the locus of the centers of circles that are tangent to curve C in two or more points, where all such circles are contained in S. The medial axis together with the associated radius function of the maximally inscribed discs is called the medial axis transform (MAT). Both methods have been preliminarily validated on old simulations and tested on Lupus masks. In the following slides we report results. 16

Spine detection test Hi-GAL l217-l224 IAPS SPINE IAPS mask filament (white) pixels 78,341 Z-S Spine (blue) pixels: 30,285 MAT spine (blue) pixels: 33,949 Z-S SPINE IAPS Spine (blue) pixels: 21,146 IAPS out-of-mask (red) pixels: 98 MAT SPINE In the following slides we highlight some interesting area. 17

Spine detection IAPS SPINE Z-S SPINE White Pixels: Filament Mask Pixels Blue Pixels: Filament Spine Pixels MAT SPINE Red Pixels: Filaments Spine Pixels out the mask region 18

Spine detection IAPS SPINE Z-S SPINE MAT SPINE 19

Spine detection IAPS SPINE Z-S SPINE MAT SPINE 20

The workflow for compact source analysis The main limitation of FilExSec is that it works with masks obtained by traditional methods. This causes a bias on our performances. It is necessary to find a method directly working on original images without any priors At this time, we are under investigation on new edge detectors: Boosted Edge Learning (Dollar et al. 2006) gpb (global Probability of boundary) (Arbelaez et al. 2011) Beam-curve Pyramid based edge detector (Alpert et al. 2010) Curvelets and Wavelets (Starck et al. 2002 and Mallat 1998) Fuzzy Logic Edge Detectors (Becerikli et al. 2005) Canny and Sobel filters enhancement (Canny 1986 and Sobel 2014) 21

Conclusions The whole project has successfully passed the mid-term official EU commission review The initial inertia due to interaction problems between technology and science communities is going to be successfully overcome The data and computing infrastructures and visual analytics solutions started to host and integrate the planned scientific workflows, matching the expected capabilities The data mining paradigms are demonstrating their expected benefit to help the scientific problem solving automation as well as to manage the foreseen amount and complexity of data In other words The European project at mid-term stage (April 2015) is respecting the initial goals, among which the data mining and machine learning expectation to release useful resources and solutions for the wide scientific community, which will remain available also after the project closure (October 2016). 22

The Science Gateways 23