Classification of Protein Crystallization Imagery

Similar documents
COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

DT-Binarize: A Hybrid Binarization Method using Decision Tree for Protein Crystallization Images

COMPUTER AND ROBOT VISION

CHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS

Adaptive Learning of an Accurate Skin-Color Model

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Global Journal of Engineering Science and Research Management

Detection of Bone Fracture using Image Processing Methods

Beyond Bags of features Spatial information & Shape models

Estimating Human Pose in Images. Navraj Singh December 11, 2009

Hybrid Approach for MRI Human Head Scans Classification using HTT based SFTA Texture Feature Extraction Technique

Ulrik Söderström 16 Feb Image Processing. Segmentation

Content Based Image Retrieval

Evaluating Classifiers

Business Club. Decision Trees

CS4670: Computer Vision

List of Exercises: Data Mining 1 December 12th, 2015

Region-based Segmentation

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT

Digital Image Processing COSC 6380/4393

CS145: INTRODUCTION TO DATA MINING

VIALACTEA: the Milky Way as a Star Formation Engine

Learning to Recognize Faces in Realistic Conditions

Using Machine Learning for Classification of Cancer Cells

A Cascade of Feed-Forward Classifiers for Fast Pedestrian Detection

Bus Detection and recognition for visually impaired people

EE795: Computer Vision and Intelligent Systems

Evaluating Classifiers

Face Image Quality Assessment for Face Selection in Surveillance Video using Convolutional Neural Networks

Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features. Wei-Ta Chu

Image Processing

Schedule for Rest of Semester

Blood Microscopic Image Analysis for Acute Leukemia Detection

Network Traffic Measurements and Analysis

Evaluation Measures. Sebastian Pölsterl. April 28, Computer Aided Medical Procedures Technische Universität München

Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging

UNIVERSITY OF OSLO. Faculty of Mathematics and Natural Sciences

[Programming Assignment] (1)

CONTENT BASED IMAGE RETRIEVAL (CBIR) USING MULTIPLE FEATURES FOR TEXTILE IMAGES BY USING SVM CLASSIFIER

Automated Classification of Protein Crystallization Images Using Support Vector Machines and BlobWorld Textures

Motion Estimation and Optical Flow Tracking

WP1: Video Data Analysis

Incremental Learning of Object Detectors Using a Visual Shape Alphabet

3 October, 2013 MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos

DETECTION AND CLASSIFICATION OF ABNORMALITIES IN BRAIN MR IMAGES USING SUPPORT VECTOR MACHINE WITH DIFFERENT KERNELS

Supervised vs unsupervised clustering

CS249: ADVANCED DATA MINING

ECE 176 Digital Image Processing Handout #14 Pamela Cosman 4/29/05 TEXTURE ANALYSIS

CHAPTER 4 SEMANTIC REGION-BASED IMAGE RETRIEVAL (SRBIR)

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

An ICA based Approach for Complex Color Scene Text Binarization

CHAPTER 4 TEXTURE FEATURE EXTRACTION

Themes in the Texas CCRS - Mathematics

Object-Based Classification & ecognition. Zutao Ouyang 11/17/2015

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Edge detection. Gradient-based edge operators

Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

CS381V Experiment Presentation. Chun-Chen Kuo

Face Detection for Skintone Images Using Wavelet and Texture Features

Supervised Learning Classification Algorithms Comparison

INTELLIGENT transportation systems have a significant

Contents Machine Learning concepts 4 Learning Algorithm 4 Predictive Model (Model) 4 Model, Classification 4 Model, Regression 4 Representation

SECTION 5 IMAGE PROCESSING 2

Motion Detection. Final project by. Neta Sokolovsky

Anno accademico 2006/2007. Davide Migliore

Lecture 25: Review I

CS 221: Object Recognition and Tracking

Digital Image Processing

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

Evaluation of different biological data and computational classification methods for use in protein interaction prediction.

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

Multiple-Choice Questionnaire Group C

Simultaneous surface texture classification and illumination tilt angle prediction

IT Digital Image ProcessingVII Semester - Question Bank


8/3/2017. Contour Assessment for Quality Assurance and Data Mining. Objective. Outline. Tom Purdie, PhD, MCCPM

Object Detection by 3D Aspectlets and Occlusion Reasoning

Ensemble methods in machine learning. Example. Neural networks. Neural networks

EECS490: Digital Image Processing. Lecture #19

Global Probability of Boundary

Temperature Calculation of Pellet Rotary Kiln Based on Texture

Discriminative classifiers for image recognition

Feature Extraction and Image Processing, 2 nd Edition. Contents. Preface

Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey

Image Processing, Analysis and Machine Vision

CAP 6412 Advanced Computer Vision

Skin and Face Detection

Robust Model-Based Detection of Gable Roofs in Very-High-Resolution Aerial Images

Feature Detectors and Descriptors: Corners, Lines, etc.

Types of Edges. Why Edge Detection? Types of Edges. Edge Detection. Gradient. Edge Detection

Robotics Programming Laboratory

Feature Extraction from Wavelet Coefficients for Pattern Recognition Tasks. Rajat Aggarwal Chandu Sharvani Koteru Gopinath

Outline

Texture. Texture is a description of the spatial arrangement of color or intensities in an image or a selected region of an image.

Automatic Colorization of Grayscale Images

5. Feature Extraction from Images

EE368 Project: Visual Code Marker Detection

Contents I IMAGE FORMATION 1

Deep Face Recognition. Nathan Sun

Transcription:

Classification of Protein Crystallization Imagery Xiaoqing Zhu, Shaohua Sun, Samuel Cheng Stanford University Marshall Bern Palo Alto Research Center September 2004, EMBC 04

Outline Background X-ray crystallography High-throughput automatic crystallization Related work Proposed approach Level-set boundary detection Texture-based feature extraction SVM and decision-tree classifier Experimental results Conclusion Crystallization Imagery, EMBC 2004 2

Background X-ray crystallography For determining 3-D structures of proteins molecules Protein crystallization process highly sensitive to physico-chemical conditions High-throughput approach Robotic setup of protein crystallization trials (1000s trials per day) Periodically recorded crystallization results via digital photography High volume of data calls for automatic classification Long-term goal Automatic data inference for crystallization Prediction of crystallization results based on past experience Recommendation of subsequent experimental conditions Crystallization Imagery, EMBC 2004 3

Example Protein Crystallization Results Crystal-negative Crystal-positive Crystallization Imagery, EMBC 2004 4

Example Protein Crystallization Results Crystal-negative Crystal-positive Crystallization Imagery, EMBC 2004 5

Related Work Edge detection via Hough transform [Zuk and Ward, 1991] Spectral features from FFT [Jusica et al., 2001] Sobel edge detector with reported accuracy ~ 75% [Wilson et al.2002] Canny edge detector; lease-square circle fitting; texture and geometric features; self-organizing neural network; with reported accuracy ~ 75% [Spraggon et al. 2002] Conic curve fitting for boundary detection; spectral analysis for classification; with reported error rate ~ 15% [Cumbaa et al. 2002] Line tracking for boundary detection; manually-tuned decision tree for classification; reported error rate ~ 12% and 14% [Bern et al. 2003] Crystallization Imagery, EMBC 2004 6

Proposed Approach Drop boundary detection: The dynamic programming line-tracking algorithm The level-set method Feature exaction: Local geometric features based on gradient information and Hough transform Global texture features from gray-level co-occurrence matrix (GLCM) Combination of both Classification algorithm Automatic decision tree with winnowing (C5.0) Support vector machine (SVM-Light) Crystallization Imagery, EMBC 2004 7

System Diagram Input Images Segmented Images Feature Vectors Classification Results Image 1: [f 1, f 2, f N ] Image 2: [f 1, f 2, f N ] Image 1 = positive Image 2 = negative Drop Boundary Detection Feature Extraction Classification Automatic Feature Selection Crystallization Imagery, EMBC 2004 8

Illustration of the Level-set method The level-set method describes the 2-D boundary by constructing a 3-D surface, called the front, the zero-level set of which coincides with the 2-D boundary. The algorithm follows the changes of the 2-D boundary by tracking the simpler motion of the front in 3-D space. Courtesy of J.A. Sethian at Dept. of Mathematics, UC-Berkeley http://math.berkeley.edu/~sethian/explanations/level_set_explain.html Crystallization Imagery, EMBC 2004 9

Texture Feature Extraction Gray level co-occurrence matrix (GLCM) Joint histogram of gray levels of a pair of pixels with a given spatial relationship Captures the statistics of the gray level spatial variation Score functions from the GLCM Contrast, correlation, entropy, etc Mean and variance over several GLCMs for orientation-invariant features [R. Haralick, 1973] Only pixels within the drop boundary are considered Crystallization Imagery, EMBC 2004 10

Support Vector Machine (SVM) Classifier Non-linear mapping of the input feature vectors into a highdimensional space Construction of an optimal separating plane for dividing the transformed vectors Implementation: SVM-Light at http://svmlight.joachims.org/ Crystallization Imagery, EMBC 2004 11

Decision-Tree Classifier Automatic calculation of the decision thresholds from training samples Boosting: final classification decision based on voting from several classifiers Winnowing: selection of more important features based on statistics of the training feature vectors Implementation: the C5.0 data mining tool at http://www.rulequest.com/see5-info.html Crystallization Imagery, EMBC 2004 12

Experimental Setup Dataset: 520 manually annotated images from Joint Center for Structural Genomics (JCSG) Boundary detection: Comparison of the new level-set method with the previous line-tracking algorithm Classification: Binary classification: crystal-positive/crystal-negative 10-fold cross-validation for classification Investigation of texture and geometric features Comparison of SVM-Light vs. the C5.0 classifier Feature selection via C5.0 winnowing Crystallization Imagery, EMBC 2004 13

Example Drop detection Results From the line-tracking algorithm From the level-set method The Level-set method reduces the error rate of drop detection by 5-10% Crystallization Imagery, EMBC 2004 14

Classification Results: Error Rates 40.00% 35.00% 30.00% 25.00% 20.00% 15.00% F.N. 14.6% F.P. 9.6% 10.00% 5.00% 0.00% SVM-Light C5.0 C 5.0 with boosting Texture, False-Negative Geometric, False-Negative Combined, False-Negative Texture, False-Positive Geometric, False-Positive Combined, False-Positive Crystallization Imagery, EMBC 2004 15

Selected Features Geometric features: F2 - Overall directional gradient F4 - Normalized Hough Transform F5 - Curve ratio F7 - Gray standard deviation Texture features: F2 - Contrast F3 - Correlation F5 - Sum Average F9 - Entropy Crystallization Imagery, EMBC 2004 16

Conclusions Investigated several modern mathematical tools: Level set method for drop boundary detection Texture features based on GLCM The SVM and boosting in the C5.0 classifier Automatic feature selection Best classification results: C5.0 with boosting and both features False positive rate: 9.6%; False negative rate:14.6% Crystallization Imagery, EMBC 2004 17

Thank You