Automated Lesion Detection Methods for 2D and 3D Chest X-Ray Images

Similar documents
Improvement in automated detection of pulmonary nodules on helical x-ray CT images

Computer-Aided Diagnosis in Abdominal and Cardiac Radiology Using Neural Networks

Extraction and recognition of the thoracic organs based on 3D CT images and its application

Automated segmentations of skin, soft-tissue, and skeleton from torso CT images

Development of an Automated Method for Detecting Mammographic Masses With a Partial Loss of Region

Detection & Classification of Lung Nodules Using multi resolution MTANN in Chest Radiography Images

Cardio-Thoracic Ratio Measurement Using Non-linear Least Square Approximation and Local Minimum

Eigenspace Template Matching for Detection of Lacunar Infarcts on MR Images

A Comparison of wavelet and curvelet for lung cancer diagnosis with a new Cluster K-Nearest Neighbor classifier

Detection and Measurement of Hilar Region in Chest Radiograph

The MAGIC-5 CAD for nodule detection in low dose and thin slice lung CT. Piergiorgio Cerello - INFN

Temporal subtraction system on torso FDG-PET scans based on statistical image analysis

Detection and Identification of Lung Tissue Pattern in Interstitial Lung Diseases using Convolutional Neural Network

Combinatorial Effect of Various Features Extraction on Computer Aided Detection of Pulmonary Nodules in X-ray CT Images

Computer-aided detection of clustered microcalcifications in digital mammograms.

algorithms ISSN

Mass Classification Method in Mammogram Using Fuzzy K-Nearest Neighbour Equality

A New Algorithm for Shape Detection

CHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS

A Multiclassifier Approach for Lung Nodule Classification

FEATURE DESCRIPTORS FOR NODULE TYPE CLASSIFICATION

CHAPTER 6 HYBRID AI BASED IMAGE CLASSIFICATION TECHNIQUES

X-ray Categorization and Spatial Localization of Chest Pathologies

A novel lung nodules detection scheme based on vessel segmentation on CT images

Hybrid Approach for MRI Human Head Scans Classification using HTT based SFTA Texture Feature Extraction Technique

Computer Aided Diagnosis Based on Medical Image Processing and Artificial Intelligence Methods

Medical Image Feature, Extraction, Selection And Classification

Submitted to Transaction on Medical Image. Condential untill published. Jun Zhang 1, Kenji Suzuki 1

Lung nodule detection by using. Deep Learning

Medical Image Processing: Image Reconstruction and 3D Renderings

Copyright 2009 Society of Photo Optical Instrumentation Engineers. This paper was published in Proceedings of SPIE, vol. 7260, Medical Imaging 2009:

Computer-Aided Diagnosis for Lung Diseases based on Artificial Intelligence: A Review to Comparison of Two- Ways: BP Training and PSO Optimization

A New Method for Determining Transverse Crack Defects in Welding Radiography Images based on Fuzzy-Genetic Algorithm

Malaysian License Plate Recognition Artificial Neural Networks and Evolu Computation. The original publication is availabl

Semi-Automatic Detection of Cervical Vertebrae in X-ray Images Using Generalized Hough Transform

Prostate Detection Using Principal Component Analysis

The Institute of Telecommunications and Computer Sciences, UTP University of Science and Technology, Bydgoszcz , Poland

FINDING THE TRUE EDGE IN CTA

IDENTIFYING GEOMETRICAL OBJECTS USING IMAGE ANALYSIS

doi: /

Computer-Aided Detection system for Hemorrhage contained region

ISSN: (Online) Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

Automatic Lung Surface Registration Using Selective Distance Measure in Temporal CT Scans

Practical Image and Video Processing Using MATLAB

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 19: Machine Learning in Medical Imaging (A Brief Introduction)

Lung Nodule Detection using a Neural Classifier

Spiral CT. Protocol Optimization & Quality Assurance. Ge Wang, Ph.D. Department of Radiology University of Iowa Iowa City, Iowa 52242, USA

Detecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning

Genetic Algorithms for Vision and Pattern Recognition

Unsupervised classification of cirrhotic livers using MRI data

Radiology. Marta Anguiano Millán. Departamento de Física Atómica, Molecular y Nuclear Facultad de Ciencias. Universidad de Granada

Assessment of 3D performance metrics. X-ray based Volumetric imaging systems: Fourier-based imaging metrics. The MTF in CT

A CAD SYSTEM FOR THE DETECTION OF MALIGNANT TUMORS IN DIGITIZED MAMMOGRAM FILMS

CHAPTER-1 INTRODUCTION

Edge Detection for Dental X-ray Image Segmentation using Neural Network approach

Towards Automatic Recognition of Fonts using Genetic Approach

Comparison of Vessel Segmentations using STAPLE

By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Available Online through

Time Stamp Detection and Recognition in Video Frames

Hierarchical Shape Statistical Model for Segmentation of Lung Fields in Chest Radiographs

Recognition and Measurement of Small Defects in ICT Testing

ANATOMIA Tutorial. Fig. 1 Obtaining CT scan data

Tumor Detection and classification of Medical MRI UsingAdvance ROIPropANN Algorithm

LUNG NODULES SEGMENTATION IN CHEST CT BY LEVEL SETS APPROACH

CIRCULAR MOIRÉ PATTERNS IN 3D COMPUTER VISION APPLICATIONS

Automated segmentation methods for liver analysis in oncology applications

Ch 22 Inspection Technologies

A Novel Smoke Detection Method Using Support Vector Machine

doi: /

Comparison of Vessel Segmentations Using STAPLE

Serhat Ozekes 1 Onur Osman 1 Osman N. Ucan 2

Evaluation of Hardware Oriented MRCoHOG using Logic Simulation

Three-dimensional graphics system of medical images on the Web for patients

Lecture 11: Classification

Limitations of Projection Radiography. Stereoscopic Breast Imaging. Limitations of Projection Radiography. 3-D Breast Imaging Methods

2 OVERVIEW OF RELATED WORK

3D temporal subtraction on chest MDCT images using nonlinear image registration technique

Global Journal of Engineering Science and Research Management

Evolutionary Algorithms in the Classification of Mammograms

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Detection of Protrusions in Curved Folded Surfaces Applied to Automated Polyp Detection in CT Colonography

A COMPARISON OF WAVELET-BASED AND RIDGELET- BASED TEXTURE CLASSIFICATION OF TISSUES IN COMPUTED TOMOGRAPHY

Japan Foundry Society, Inc. Application of Recent X-ray CT Technology to Investment Casting field. Kouichi Inagaki ICC / IHI Corporation

Iterative Estimation of 3D Transformations for Object Alignment

CHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION

International Research Journal of Engineering and Technology (IRJET) e-issn:

TUMOR DETECTION IN MRI IMAGES

Genetic Algorithms. PHY 604: Computational Methods in Physics and Astrophysics II

School of Mathematics, Statistics and Computer Science. Computer Science. Object Detection using Neural Networks and Genetic Programming

Human Heart Coronary Arteries Segmentation

GENERAL AUTOMATED FLAW DETECTION SCHEME FOR NDE X-RAY IMAGES

Pattern recognition (4)

Classification of Mammographic Images Using Artificial Neural Networks

Parameter Optimization for Mammogram Image Classification with Support Vector Machine

arxiv: v1 [cs.cv] 26 Jun 2011

DEVELOPMENT OF AN AUTOMATED SOLDER INSPECTION SYSTEM WITH NEURAL NETWORK USING OBLIQUE COMPUTED TOMOGRAPHY

A. Overview of the CAD System In this paper, the classification is performed in four steps. Initially the mammogram image is smoothened using median

Wavelet-based Texture Classification of Tissues in Computed Tomography

CHAPTER 1 INTRODUCTION

Transcription:

Automated Lesion Detection Methods for 2D and 3D Chest X-Ray Images Takeshi Hara, Hiroshi Fujita,Yongbum Lee, Hitoshi Yoshimura* and Shoji Kido** Department of Information Science, Gifu University Yanagido 1-1, Gifu, Gifu 501-1193, JAPAN. TEL: +81-58-293-2757, FAX: +81-58-230-1895 {hara, fujita, lee}@fjt.info.gifu-u.ac.jp *Center Research Laboratory, Konica Co. **Osaka Medical Center for Cancer and Cardiovascular Diseases Abstract The purpose of this work is to present some technical approaches of our computer-aided detection (CAD) system for chest radiograms and helical CT scans, and also evaluate that by using three databases. The CAD includes some methods to detect lesions and to eliminate false-positive findings. The detective methods consist of template matching and artificial neural network approaches. Genetic Algorithm (GA) was employed in template matching to select a matched image from various reference patterns. Artificial Neural Networks (ANN) were also applied to eliminate the false-positive candidates. The sensitivity and the number of falsepositives were 73% and 11FPs per image on chest radiogram CAD and 77% with 2.6 FPs per image on helical CT scan CAD. These preliminary results imply that the GA and ANN based detective methods may be effective to indicate lesions on chest radiograms and helical CT scans. 1. Introduction Early detection of suspicious lesions is effective to reduce the number of death caused by lung cancer. Although a conventional chest radiogram has been used for the screening of lung cancer, mass screening by helical CT scan has started for the high risk groups of smokers in some facilities of Japan. It shows a good performance for the diagnosis. We are currently working on developing two CAD schemes for chest radiograms and helical CT scans. The purpose of this study is to show recent results for chest radiogram CAD system by using three different databases and to provide a preliminary detection method for CAD system on chest helical CT scans. 2. Materials and methods 2.1 CAD system for chest radiograms Conventional chest radiogram has been employed as a mass screening method of lung cancer for decades, however it was said that small lesions at the first stage of cancer could appear with a very subtle shadows that radiologists might be miss. We have been developing a CAD scheme by computer programs that could prepare quantitative values and the position of lesions to radiologists. If radiologists will take into account the information from CAD system, their diagnostic performance would be higher when the information was not prepared. Figure 1 shows the overview of our Detection by GA Templete Matching Original Digital Image Detection by Artificial Neural Network False-Positive Elimination by Feature Analysis False-Positive Elimination by Artificial Neural Network Annotated Image Display Fig.1 Overview of chest radiogram CAD The detection scheme consists of two different part: detection and false-positive elimination processing. Many lesion candidates were indicated by two detection process by GA template matching and ANN technique not to miss truepositive findings, however some part of those candidates include normal shadows as false-positive findings.

automated detection system. Each process of this scheme was explained as follows. 2.1.1 Image database We employed three image databases to estimate our CAD scheme. Table 1 shows the number of images in each database. Images in Group-A are collected in a prefectural mass survey in Japan and include relatively easy cases with large nodules. Group-B is a standard database published by Japanese Society of Radiological Technology (JSRT) [1]. The quality is excellent to evaluate the CAD performance because the ease of detection for each case was already classified into five categories and verified by ROC studies. Group-C is mass survey cases and is collected in a central hospital in Osaka, Japan. All the films were digitized at 0.1mm or 0.175mm with 10-bit or 12-bit density resolution. We converted all of the digitized images to 1024 x 1024 pixels with 10-bit density resolution. 2.1.2 Lung field extraction The lung field was extracted to limit the region to be processed. Thresholding techniques for gray scale images were applied to extract the rough region at first. The threshold value was determined in terms of the area of histogram of original images[2]. Larger regions that correspond to the lung field were almost extracted in this step. Secondly, smaller regions that could not be considered as lung fields were eliminated in terms of the region size. Finally, the extracted lung regions were expanded because the extracted area could not always include borders of lung field. Figure 2 shows an example of an original chest image (a) and the processed image (b). The lung area with margins was extracted correctly. 2.1.3 Nodule detection by GA and TM Genetic algorithms (GA) is a optimal value search procedure in complex problems[3, 4]. We apply the GA technique to the detection process as a pattern recognition methods based on template matching with various reference patterns[5]. Detection method at this step is based on template matching techniques with many reference images. We employed a two-dimensional Gaussian distribution for some reference ones because the shadow of lung nodules in radiogram generally shows the distributions. The size or angle was unknown in practice, Table 1 Employed three databases # normal # abnormal total Group-A 10 33 43 Group-B 93 84 177 Group-C 59 28 87 Total 162 195 357 (a) (b) Fig.2 Lung field extraction Example of original chest image (a) and extracted image (b). The precious lung area was enlarged not to miss the nodules along the lung wall. however, we prepared various patters of Gaussian distribution. Here, a GA technique was employed to select the suitable image from various reference patterns and to show the fittest position in the original chest image. Virtual creatures, which is called individuals, have chromosomes represented by a binary digit system and the chromosomes could determine the pattern and the position by calculating the ease of living of individuals in the environment. The degree of living was given by a fitness defined as a difference between a reference image (Gaussian image) and an observed image (extracted patients' image). The genetic operations including crossover and mutation were applied to all individuals to create new ones in the next generation. Figure 3 shows the overview of detection step based on GA template matching method. We prepared 512 reference images with various sizes and angles of Gaussian distributions as shown in the left half. The chromosomes determine the positions x and y of partial image as shown in the right half. The similarity was defined as a five-dimensional function with those parameters. The chromosomes have 42 bits that represent the recognized positions of x (10 bits: 0 to 1023) and y

Horizontal Size = H Angle = A Simulated Typical Nodules based on Gaussian Distribution Vertical Size = V (10 bits: 0 to 1023), horizontal and vertical size categories of H (8 bits: #1 ~ #8) and V (8 bits: #1 ~ #8) and the category of angle of reference images A (8 bits: #1 ~ #8). The population size, generation, and probability of mutation were 120, 200 and 0.25 to 0.01 as the generation gains. The mutation was set up by inverting all bits at the probability. Then, ranking selection and one-point crossover technique were employed in the GA procedures. 2.1.4 Nodule detection by ANN Chest Image Similarity (S) = f x,y,h,v,a) ( Fig. 3 Detection by using GA template matching The position of nodule candidates on a chest image is defined by two parameters of x and y (Right). Three parameters of H, V and A determines the shapes of simulated typical nodules (Left). A similarity is calculated in terms of those five parameters, so that the recognition of nodule existence is an optimal value search problem in those five dimensional space. ANN based detection method includes feed-forward y Y Similarity = f(x, y, H, V, A) x X artificial neural network with three layers. The regulated pixel values in terms of the minimum and maximum in a partial image were entered to ANN. The matrix size of inputted image was 32 x 32 and the gray scale values were converted to 0.0 to 1.0. To train the ANN by backpropagation method, we selected a hundred diagnosed shadows of fifty of nodular and fifty of obvious normal. The iteration and the errors for training were 2000 and 0.0001, respectively. By scanning the mask of 32 x 32 with eight steps, the output value of ANN was recorded, and the positions of candidates that satisfy certain conditions of output values were remained as the detection region for nodular shadows. 2.1.5 Elimination of false-positive candidates by feature analysis Candidates that were over-interpreted are called falsepositive candidates. We previously analyzed the image and found that the false positive candidates could be classified into three categories such as shadows of lung margins, shadows of bones and shadows of vessels and others. We developed the algorithm to eliminate those candidates selectively. The first classification step was based on the histogram analysis within the candidates. The candidate that includes many regions outside expanded lung field was eliminated in terms of the area of low pixel values in its histogram. The second classification step was based on the histogram analysis of (b) (a) Fig.4 Detection example Computer and radiologist s detection were superimposed with three white squares and a white arrow on lung image (a). Upper left square (Upper right lung) was detected correctly, but the others were miss detection (false-positives) by computer. (b) is the magnified image of radiologist detection (ROI indicated by white arrow). The nodule size is medium and it is said that the shadow is not so subtle to diagnose.

the vector images within the candidates. The kurtosis that is the feature value of the sharpness of histogram was extracted. The final classification step was based on the analysis of feature values of the entropy and the ratio of partial images inside the candidates. 2.1.6 Elimination of false-positive candidates by ANN This process was very similar to the ANN-baseddetection step. The remaining candidates were entered to an another ANN. To train the ANN to eliminate falsepositive candidates, we selected a hundred diagnosed shadows of fifty of nodular and fifty of normal. The conditions of iteration and the errors were the same as that of the detection step. The number of nodes of input layer, hidden layer and output layer were 32 x 32, 32 x 32 and 1, respectively. Normalize pixel values within partial images were calculated and entered to ANN. 2.1.7 Indication of detected candidates Detected candidates were indicated by squares on the original images with the size of 32x32. Figure 4 shows two sample images with nodules indicating by white arrows. There were two miss detection, called falsepositives, but a small nodule shadows was detected correctly. 2.2 CAD system on chest helical CT scans A similar procedure could be applied to the detection part of the nodules on chest helical CT scan images, and the diagnosis by CT images play an important role in not only confirming the existence of nodules but also discriminating. We have been developing an automated detection scheme to detect nodular shadows on CT scans. Figure 5 shows the overview of our automated detection system. Each process of this scheme is explained as follows. 2.2.1 Image database Table 2 shows the specification of chest helical CT scans used in this study. The datasets consist of 20 cases (5 normal and 15 abnormal cases) including 101 nodules. The abnormal cases have multiple nodules, so that a few nodules with various size could be seen in the same slice images. 2.2.2 Detection methods by GA and TM The detection technique is based on GA and template Detection by 3D & 2D Templete Matching Original CT Image Detection by TM along the lung walls False-Positive Elimination by Texture Analysis Annotated Image Display Fig.5 Overview of chest CT CAD The detection scheme consists of two different part: detection and false-positive elimination processing. Many lesion candidates were indicated by two detection process by GA template matching and template matching along the lung walls not to miss true-positive findings, ant the false candidates were also detected as chest CAD scheme, so that the falsepositive elimination process was attached after the detection process. matching method (TM) in a three-dimensional space. We supposed that it was possible to detect a nodule to set the higher adaptation scales surrounding the nodules, so that the individuals could gather on the place for the environmental configuration. We used a similarity, which was defined by the difference of CT values between three-dimensional images, for adaptation scales of individuals. The similarity was calculated by template matching between two images. Template matching is one of the most fundamental Table 2 Specification of data set Tube voltage 120 [kv] Tube current 50 [ma] Slice thickness 10 [mm] Scan time 1 [sec/round] Table feed rate 20 [mm/sec] # of cases 20 # of images 557 # of nodules 101 Fig.6 Employed four reference images The sphere size is 10, 20,30 and 40 pixels (left to right). The information of chromosomes determines automatically to employ which reference pattern in four based on the fitness values. 3D templates based on Gaussian to detect relatively large nodules

methods to detect a target object within an image field. If the similarity between an unknown object (observed image) and the template (reference image) is sufficiently high, the unknown object can be considered as the detection object. We used three-dimensional images of the chest helical CT scans and simulated nodule images for observed and reference images, respectively. Each of the CT scans (observed image) has 512x512 pixels and consists of about 30 slice images. Extraction Comparison Rotation by the estimated angle Reference Fig.7 Overview of template matching along lung walls A reference image with semi-sphere shape was used rotating by the estimated angle of chest walls. The angle was determined by curve fitting techniques on CT slice images. We made four artificial simulated nodule images which include a similar image to be detected. Here, lung nodule shadow generally shows a spherical shape and the CT values tend to have a Gaussian distribution, we used some simple sphere models with three-dimensional Gaussian distribution as simulated nodules of reference images. Four reference images ranged 3 slices and their diameters were 10, 20, 30 and 40 pixels as shown in Fig.6. The observed image was cut off at a position which was determined by place information in each chromosome of GA [6]. At the same time, each chromosome selected a reference image from four ones. The size of cut image was the same as the reference image. The similarity between a cut and a reference images was calculated to evaluate adaptation scale of the individual on GA. We used cross-correlation as the similarity. The similarity was only evaluated within the common area that contained only a part of nodules between a cut and a reference images. Therefore it was possible to evaluate the similarity without depending on the background patterns. 2.2.3 Detection method along the lung wall Conventional template matching technique was also used to detect nodules existing along the lung walls. The GA template matching method was not able to detect these nodules because they usually have semicircular (a) (b) Fig.8 Detection example Computer and radiologist s detection were superimposed with four white squares and black and white arrow on lung image (a), respectively. The white arrow indicates a small nodule on the wall. The radius was about 4mm. The magnified image of the nodule is (b). The black arrow indicates a large nodule. Computer also detected the two nodules correctly, but two false-positive candidates was detected. The black line on (b) is an approximated line from the wall shape.

Table 3 System performance TP # of FP /img Group-A 78 % (35/45) 6.0 Group-B 68 % (57/84) 16.5 Group-C 82 % (23/28) 11.7 Average 73% (115/157) 11.4 shapes as shown in Fig.7. A reference pattern with semicircular shape was employed for this template matching. The reference pattern was rotated an angle to the lung wall automatically. 2.2.4 Elimination of false-positive candidates by feature analysis Detected candidates were classified into true or falsepositives by texture analysis. Four feature values such as contrast in gray level differences and entropy, inverse difference moment and angular second moment in cooccurrence matrix were employed to eliminate the falsepositive candidates. 2.2.5 Indication of detected candidates Detected candidates were indicated by white square on the original images. Figure 8 shows a detection example with two nodules, one is a large nodule indicated by black arrow and the other is a very small nodules on the lung wall. The radius of it was approximately 4mm because the actual pixel size was about 0.6mm. The black line on Fig8 (b) was approximated lines from the lung wall shape. 3. Results 3.1 Chest radiogram CAD system We employed three image databases to estimate the detection performance. Each performance is shown in Table 3. The average of sensitivity and the number of false-positives (FPs) were 73% and about 11 per image, respectively, which require the reduction of FPs for practical use of CAD. 3.2 CAD system on chest helical CT scan The methods were applied to 20 cases (5 normal and 15 abnormal) including 101 nodules with many smaller sizes less than 10mm. It was possible to detect 78 nodules by combining the GA method and the template matching along the lung walls. However, the number of falsepositives was 2.6 per slice. 4. Conclusion Image recognition method by using Genetic Algorithm is useful for detecting and locating lung nodules on digital chest radiogram and helical CT scans. Three datasets (357 images) were tested to estimate the system performance for chest radiogram CAD system including JSRT standard image database issued in Japan. The detection performance was 73 % true-positive rate with 11.4 false-positives per image. As for the CAD system on chest helical CT scans, our detection methods identified 77% of true-positive nodules with 2.6 false-positives per image, as a preliminary study with a small dataset. Acknowledgment The authors are grateful to J. Shiraishi for his proposal for employing the JSRT database, and to A. Kojima for his helpful discussions and technical assistance. References: 1. Standard digital image database for chest radiographs with and without a lung nodule (Jun-ichi Shiraishi, Kunio Doi, Shigehiko Katsuragawa, et al.), Proc. of the 12th International Symposium and Exhibition on Computer Assisted Radiology and Surgery (CAR' 98), p.882, (1998) 2. Machine vision: theory, algorithms, practicalities (E.R. Davies) Academic press, (1997) 3. Genetic algorithms + data structures = Evolution programs (Zbigniew Michalewicz) Springer-Verlag, (1992) 4. Advances in genetic programming (edited by Kenneth E. Kinnerar, Jr.) the MIT Press, (1994) 5. Development of automated detection system for lung nodules in chest radiograms (Takeshi Hara, Hiroshi Fujita and Jing Xu) Proc. of IASTED International Conference on Intelligent Information Systems (IEEE Computer Society), ed. by H.Adeli, p.71-p.74 (1997) 6. Nodule detection on chest helical CT scans by using genetic algorithm (Yongbum Lee, Takeshi Hara, Hiroshi Fujita, et al.) Proc. of IASTED International Conference on Intelligent Information Systems (IEEE Computer Society), ed. by H.Adeli, p.67-p.70 (1997)