A supervised classification-based method for coronary calcium detection in non-contrast CT

12
ORIGINAL PAPER A supervised classification-based method for coronary calcium detection in non-contrast CT Uday Kurkure Deepak R. Chittajallu Gerd Brunner Yen H. Le Ioannis A. Kakadiaris Received: 6 February 2009 / Accepted: 23 February 2010 Ó Springer Science+Business Media, B.V. 2010 Abstract Accurate quantification of coronary artery calcium provides an opportunity to assess the extent of atherosclerosis disease. Coronary calcification burden has been reported to be associated with cardiovascular risk. Currently, an observer has to identify the coronary calcifications among a set of candidate regions, obtained by thresholding and connected component labeling, by clicking on them. To relieve the observer of such a labor-intensive task, an automated tool is needed that can detect and quantify the coronary calcifications. However, the diverse and heterogeneous nature of the candidate regions poses a significant challenge. In this paper, we investigate a supervised classification-based approach to distinguish the coro- nary calcifications from all the candidate regions and propose a two-stage, hierarchical classifier for auto- mated coronary calcium detection. At each stage, we learn an ensemble of classifiers where each classifier is a cost-sensitive learner trained on a distinct asymmet- rically sampled data subset. We compute the relative location of the calcifications with respect to a heart- centered coordinate system, and also use the neigh- boring regions of the calcifications to better character- ize their properties for discrimination. Our method detected coronary calcifications with an accuracy, sensitivity and specificity of 98.27, 92.07 and 98.62%, respectively, for a testing dataset of non-contrast computed tomography scans from 105 subjects. keywords Computed tomography Coronary calcium Supervised classification Introduction Cardiovascular disease (CVD) is one of the major causes of deaths in the western world. It is respon- sible for almost one million deaths per year in the United States [1]. Thus, appropriate preventive measures need to be taken to decrease the cardiovas- cular events. However, preventive measures can be U. Kurkure (&) D. R. Chittajallu G. Brunner Y. H. Le Computational Biomedicine Lab, Department of Computer Science, University of Houston, Houston, TX 77204, USA e-mail: [email protected] D. R. Chittajallu e-mail: [email protected] G. Brunner e-mail: [email protected] Y. H. Le e-mail: [email protected] I. A. Kakadiaris Computational Biomedicine Lab, Departments of Computer Science, Electrical and Computer Engineering and Biomedical Engineering, University of Houston, Houston, TX 77054, USA e-mail: [email protected] 123 Int J Cardiovasc Imaging DOI 10.1007/s10554-010-9607-2

Transcript of A supervised classification-based method for coronary calcium detection in non-contrast CT

ORIGINAL PAPER

A supervised classification-based method for coronarycalcium detection in non-contrast CT

Uday Kurkure • Deepak R. Chittajallu •

Gerd Brunner • Yen H. Le • Ioannis A. Kakadiaris

Received: 6 February 2009 / Accepted: 23 February 2010

� Springer Science+Business Media, B.V. 2010

Abstract Accurate quantification of coronary artery

calcium provides an opportunity to assess the extent of

atherosclerosis disease. Coronary calcification burden

has been reported to be associated with cardiovascular

risk. Currently, an observer has to identify the coronary

calcifications among a set of candidate regions,

obtained by thresholding and connected component

labeling, by clicking on them. To relieve the observer

of such a labor-intensive task, an automated tool is

needed that can detect and quantify the coronary

calcifications. However, the diverse and heterogeneous

nature of the candidate regions poses a significant

challenge. In this paper, we investigate a supervised

classification-based approach to distinguish the coro-

nary calcifications from all the candidate regions and

propose a two-stage, hierarchical classifier for auto-

mated coronary calcium detection. At each stage, we

learn an ensemble of classifiers where each classifier is

a cost-sensitive learner trained on a distinct asymmet-

rically sampled data subset. We compute the relative

location of the calcifications with respect to a heart-

centered coordinate system, and also use the neigh-

boring regions of the calcifications to better character-

ize their properties for discrimination. Our method

detected coronary calcifications with an accuracy,

sensitivity and specificity of 98.27, 92.07 and 98.62%,

respectively, for a testing dataset of non-contrast

computed tomography scans from 105 subjects.

keywords Computed tomography �Coronary calcium � Supervised classification

Introduction

Cardiovascular disease (CVD) is one of the major

causes of deaths in the western world. It is respon-

sible for almost one million deaths per year in the

United States [1]. Thus, appropriate preventive

measures need to be taken to decrease the cardiovas-

cular events. However, preventive measures can be

U. Kurkure (&) � D. R. Chittajallu � G. Brunner �Y. H. Le

Computational Biomedicine Lab,

Department of Computer Science, University of Houston,

Houston, TX 77204, USA

e-mail: [email protected]

D. R. Chittajallu

e-mail: [email protected]

G. Brunner

e-mail: [email protected]

Y. H. Le

e-mail: [email protected]

I. A. Kakadiaris

Computational Biomedicine Lab,

Departments of Computer Science,

Electrical and Computer Engineering

and Biomedical Engineering,

University of Houston, Houston,

TX 77054, USA

e-mail: [email protected]

123

Int J Cardiovasc Imaging

DOI 10.1007/s10554-010-9607-2

applied only when an accurate risk assessment can be

made. Thus, there is an urgent need to develop tools

to improve the assessment of the cardiovascular risk.

Recent studies [2–5] have shown that coronary

artery calcification (CAC) burden as measured by

non-contrast computed tomography (CT) is a signif-

icant and independent predictor of atherosclerosis

disease and is associated with future coronary events.

Therefore, accurate identification and quantification

of calcifications in the coronary arteries may allow

improved diagnosis and monitoring of progression

of atherosclerosis. Moreover, the regional coronary

calcium burden is related to the regional dysfunction

of the left ventricle [6]. Thus, the quantification of the

coronary calcifications provides a measure to assess

the risk of coronary artery disease (CAD).

With the advancements in imaging technology, the

non-invasive assessment of the coronary arteries and

their calcification is feasible. Figure 1 depicts typical

candidate regions for the coronary and the aortic

calcifications as well as the image noise present in a

CT scan. The calcifications are highly dense regions

compared to other soft tissues; hence, they appear as

bright structures in the CT scans. The coronary

calcifications are located in the three main coronary

arteries and their sub-branches—the left main/left

anterior descending artery (LM/LAD), the left cir-

cumflex (LCX), and the right coronary artery

(RCA)—that cover different portions of the heart

surface area.

The current clinical standard to detect the coronary

calcifications is to apply a connected component

labeling method using a threshold of 130 Hounsfield

units (HU) and a minimum size constraint of three/

four pixels (at least 1 mm2) [7]. This thresholding

results in candidate regions not only from coronary

calcifications but also from non-coronary calcifica-

tions, noise artifacts, and metal implants. The coro-

nary calcifications are then identified manually.

However, manual annotation is a labor intensive

and time-consuming task, especially for longitudinal

studies and large-scale screening. Hence, automated

computational methods for detecting the coronary

calcifications are needed to ease the manual burden as

well as to provide means to investigate possible

improvements in cardiovascular risk assessment.

To the best of our knowledge, Ukai et al. [8, 9]

first proposed a method for coronary calcification

detection using some diagnostic rules, and later

improved their method by utilizing a neural networks

based classification method to discriminate between

the coronary calcifications and the artifacts. The

method was evaluated on helical CT scans (acquired

in mass screening for lung cancer) of subjects with

very few coronary calcifications and, thus, is limited

in scope. The calcium candidates were obtained using

an intensity threshold of 80 HU and size limit of 6

pixels, unlike the current clinical standards [7]. The

neural network was trained using six features—size,

presence of fat in surrounding region, maximum HU,

minimum HU, difference between maximum HU

inside the candidate and mean HU outside the

candidate, and region number obtained by dividing

heart into eight regions. However, the true samples

used for training in their study included aortic

calcifications and artifacts, that satisfied some inten-

sity contraints, resulting in a contaminated model.

Recently, Isgum et al. [10] proposed a two-stage

classification method using the k-nearest neighbor

classifier for coronary calcification detection. The CT

scans used in this study were originally obtained at

high resolution and were later resampled to a lower

resolution. The candidate regions with high proba-

bility of belonging to the negative class in the first

stage of classification were discarded and only the

Fig. 1 CT image of a heart depicting typical candidate

regions: coronary calcifications in LAD (a), RCA (b), aorta

(c), and image noise (d,e)

Int J Cardiovasc Imaging

123

remaining ones were considered for the second stage

of classification. For classification, the features used

were computed based on the size, shape, location, and

appearance of the candidate regions. The appearance-

based features were computed at the peak intensity

point of each candidate region from image deriva-

tives. The use of region-based features was limited to

the mean and the maximum intensity values. Addi-

tionally, aorta segmentation was performed to

improve the performance of the classifier in separating

aortic calcifications from coronary calcifications.

However, the segmentation results were not satisfac-

tory since the problem of segmenation is inherently

difficult in the absence of any contrast. Furthermore,

the classifiers were trained on the unbalanced data

owing to the large number of negative candidates;

hence, they were biased towards the majority class.

In this paper, we investigate various factors that

can make it feasible to build an automated coronary

calcium detection system. Though the coronary

calcifications appear as high-density structures in the

non-contrast CT scans, it is inherently difficult to

identify them automatically. The difficulty arises

because of the presence of other similar high-density

structures, including the non-coronary calcifications,

and the absence of any contrast agent to identify the

blood vessels. It is apparent that the choice of features

plays an important role in solving this problem. We

compute several features based on appearance, shape

and size of the calcifications. In addition, we inves-

tigate various clues that a human observer uses to

manually annotate the coronary calcifications in the

CT scans (e.g., expected location of the coronary

arteries and surrounding regions of calcifications).

Thus, in order to better characterize their properties

for discrimination, we additionally compute relative

location of the calcifications with respect to a heart-

centered coordinate system and region-based features

for the candidates and their neighboring regions. To

the best of our knowledge, it is the first time that the

relevance of the region-based features and the neigh-

borhood region of the calcifications is investigated.

Finally, we have developed a novel two-stage

hierarchical classification-based method to detect

coronary calcifications in the non-contrast cardiac

CT scans1. In the current problem of classification,

the positive class is composed of the coronary

calcifications, while the negative class consists of

the aortic calcifications, image noise, and metal

implants, if any. In the first stage, we explicitly learn

to distinguish the arterial (the coronary and the aortic)

calcifications from other highly dense regions present

within the heart region. In the second stage, we learn

to separate the coronary calcifications from the aortic

calcifications. We investigate the possibility of such

separation without requiring the segmentation of the

aorta in this work. At each stage of the hierarchy, we

construct an ensemble of classifiers, trained on

different data subsets that are generated using an

asymmetric sampling method, to overcome the

problem of a highly unbalanced and large data set.

The decisions of each individual classifier in the

ensemble are combined together to obtain the final

decision for that stage. Furthermore, each classifier in

the ensemble is designed to accommodate asymmet-

ric penalty costs for different types of errors.

Materials and methods

Data

The heart scans were obtained by electron-beam CT

(EBCT) imaging with a slice thickness of 3 mm and

an x-y pixel spacing of 0.508–0.586 mm. Scans from

205 subjects with approximately 20–35 image slices

per scan were used in our experiments. There were 20

coronary calcifications and 335 negative candidate

regions in the heart region per scan on average. We

created two mutually exclusive data subsets, D1 with

100 subjects and D2 with 105 subjects, for the

training and testing phases, respectively.

For training and testing purposes, the arterial

calcifications were manually annotated. The manual

annotation was performed using a software developed

at the Computational Biomedicine Lab, specifically

for the purpose of calcium scoring. The vertical range

of the heart region was defined from the transverse

slice at the top in which the pulmonary artery splits to

the transverse slice at the bottom till which the

coronary arteries are present. To annotate the coro-

nary calcifications, the observer clicks points along

the trajectory of a coronary artery in the transverse

slices. Then, the software interpolates these points to

build an arterial trajectory, and detects the nearby1 Preliminary work on this topic has appeared in [11].

Int J Cardiovasc Imaging

123

coronary calcium. It computes the extent of the

detected calcifications using the minimum HU con-

straint of 130 HU and the minimum size constraint of

four pixels. The observer is allowed to include

coronary calcium not detected by the software and

exclude any non-coronary calcium detected by the

software. To annotate the aortic calcifications, the

observer clicks at any point within a calcified region

in the aorta and the software computes its extent

using the minimum HU and size constraints.

Feature extraction

In classification-based methods, the representation

of the candidate samples (i.e., the features) play the

most important role. To uniquely represent the

coronary calcifications, it is imperative to investigate

their various characteristic properties that can be used

to distinguish them apart from other candidate

regions. The various types of features that were

designed and evaluated are discussed next.

Spatial location

To identify a coronary calcification among the

candidate regions, a human observer mostly relies

on prior knowledge of the expected location of the

coronary arteries relative to the heart. To incorporate

such information about the location, two sets of

location features are computed:

F1: absolute location (x, y, z) in the image

coordinate system, and

F2: relative location (x, y, z) in a heart-centered

coordinate system.

The heart-centered coordinate system is defined by

a unit bounding box around the heart region. The

heart region is extracted using a hierarchical

approach in which the peripheral structures are

detected first and then the heart region is determined

using dynamic programming and prior knowledge of

anatomical location of the heart.

In particular, the body region is segmented first

using thresholding and morphological operators.

Next, the lung is detected within the body region by

applying a threshold (-250 HU) and extracting the

largest connected-component region from the com-

plement of the thresholded result. The boundary of

the extracted lung region may have indentations

because of the pulmonary vessels that transport blood

between the lung and the heart. Since a part of the

lung boundary defines the lateral boundary of the

heart, it is important to smooth the jagged boundary

resulting from the indentations of pulmonary vessels.

To fill the indentations on the lung boundary, a binary

morphological closing operation is performed on

the extracted lung region [12]. Then, the bones are

detected using a threshold and a region growing

algorithm because they constraint the heart boundary

on anterior and posterior sides. The bone fragments in

the first k slices are extracted by applying a threshold

(130 HU) outside the convex hull of the lung region.

The extracted bone fragments are then provided as

input to the 3D region growing algorithm to obtain

the complete bone structure.

Since the heart region lies in-between the lung

halves that define the heart’s lateral boundary, one

could simply compute the convex hull of the lung and

then subtract the lung from it to obtain the in-between

region. However, such a scheme may miss some

portions of the heart on the anterior side and include

unnecessary structures on the posterior side. To avoid

such situation, the anterior and posterior boundaries are

obtained by determining optimal shortest paths

between the left and right lung using dynamic pro-

gramming in each axial slice. Since the spine, sternum

and other bone structures constrain the anterior and

posterior boundary of the heart, these bone structures

guide the dynamic programming method to detect the

boundaries. The region between the detected anterior

and posterior boundaries, and the lungs is smoothed

using morphological operators and extracted as the

heart region. Finally, a minimum size cube that

contains the complete 3D heart region is computed to

define the unit bounding box for the heart-centered

coordinate system.

Pixel-based texture features

Texture features have been proven to improve the

accuracy of the classification-based methods in

different medical image analysis applications. We

compute texture features using the Laws texture

energy measures [13]. These texture energy measures

are computed using convolution kernels generated

from one-dimensional convolution kernels of five

pixel length characterizing level, edge, spot, wave

and ripple patterns. Since the texture features are

Int J Cardiovasc Imaging

123

pixel-based, we selected the maximum HU-density

pixel as the representative of the candidate region

for pixel-based texture features (F3).

Region-based texture features

Additionally, an observer uses the candidate’s

regional properties instead of individual pixel prop-

erties to distinguish between the coronary calcifica-

tion from the rest. Based on this observation, we

propose to extract region-based features to better

represent a candidate region in addition to the

individual pixel features.

Since the coronary arteries mostly span through the

pericardial fat after originating from the ascending

aorta, except in the distal section when they enter the

ventricular wall, the neighboring regions of the coro-

nary calcifications provide significant clues to distin-

guish the coronary calcifications from the image noise,

which mostly has blood in its neighboring region.

Hence, the observer also uses appearance clues from

the candidate region and its surrounding neighboring

region to distinguish the coronary calcification from

the rest of the candidate regions. Based on these

observations, we extract three set of the region-based

features:

F4: the set of features computed for the candidate

calcium region,

F5: the set of features computed for its neighbor-

hood region, and

F6: the set of features computed for the combined

region of the candidate and its neighborhood.

The neighborhood region is defined as a ribbon-like

region of a certain width around the candidate region.

Specifically, we compute the mean, standard devia-

tion, skewness, kurtosis, and entropy of the pixel-

based Laws texture energy measures [13] for the

region of interest. Additionally, we compute the

difference of means normalized by the sum of standard

deviations of the Laws measures inside and outside the

candidate region as a feature for the combined region.

We also compute the grey-level co-occurrence matri-

ces (GLCM) proposed by Haralick et al. [14]. The

GLCM provides the probability of occurrence of a

particular pair of the HU-density at a particular spatial

arrangement. Based on the cost statistics, we compute

the contrast, correlation, energy, and homogeneity

features for four different spatial arrangements.

Additional features

We include object-related features (F7) based on the

size, shape, and appearance of the region of inter-

est—area, shape moments, eccentricity, compactness,

anisotropy, inertia, and HU density-based first-order

statistics (e.g., mean, maximum, minimum, standard

deviation, skewness, kurtosis, and entropy), length of

major axis, length of minor axis, equivalent radius,

mean radial length and so on. To list all features

and explain their details is beyond the scope of this

paper.

Feature selection

The classification accuracy of a classifier depends

largely on the selection of useful features. One of the

criterions, often termed as maximum relevance, to

find useful features is to select features that are

individually highly correlated with the target class

distribution. However, the features thus selected may

be highly intercorrelated too. When two features are

intercorrelated, one of them can be removed without

affecting the classification performance to reduce the

computational load. Also, individually powerful

features may not be so powerful together. Thus, we

need to select features which have maximum rele-

vance and minimal redundancy.

The relevance of a feature with respect to a

particular outcome or the redundancy between any

two features can be characterized in terms of

correlation or mutual information. We used a mutual

information (MI)-based minimum-redundancy-max-

imum-relevance (mRMR) feature selection heuristic

proposed by Peng et al. [15]. The feature selection is

performed in two steps. First, a subset G of N features

is selected from the large feature set F such that an

mth feature is added incrementally to the currently

selected set Gm-1 with m - 1 features, if it

maximizes the following condition:

maxfi2fF�Gm�1g

MIðfi; LÞ �1

m� 1

X

fj2Gm�1

MIðfi; fjÞ

2

4

3

5; ð1Þ

where the first term corresponds to the relevance,

the second term corresponds to the redundancy, L is

the target class, and fi is the ith feature. In the second

step, we form P sequential feature subsets in multi-

ples of k features such that G1*k , G2*k , ... ,Gp*k

Int J Cardiovasc Imaging

123

and select the feature subset that corresponds to the

minimum cross-validation error obtained from a

classifier.

Coronary calcium detection method

For classification-based coronary calcium detection,

the negative class (aortic calcifications, image noise,

and metal implants, if any) is broader and richer than

the positive class (coronary calcifications), introduc-

ing complex class compositions. Such complex class

composition may increase the complexity of the

learner and introduce a significant number of false

positives. Moreover, the majority of the candidate

regions belong to the image noise and hence, the

negative class. Thus, there is large number of

candidates and the classes are highly unbalanced in

terms of the number of candidates. Training on such

unbalanced classes will bias the classifier towards the

majority class, and will degrade its performance.

To account for the complex class composition, we

divide the problem of detecting coronary calcifica-

tions in to two sub-problems: (1) detection of arterial

(aorta and coronary arteries) calcifications among all

the candidates, and (2) detection of the coronary

calcifications among the arterial calcifications. We

construct a two-stage hierarchical classifier to solve

these two sub-problems (Fig. 2). At the first stage of

the hierarchy, a classifier is constructed using the

features selected specifically to discriminate the

arterial (coronary and aortic) calcifications from the

rest of the candidate regions. At the second stage of

the hierarchy, another classifier is constructed using

the features selected specifically to separate the

coronary calcifications from the aortic calcifications.

The final decision of the two-stage classifier is a

combination of decisions of the individual classifiers

of each stage and is given by:

DðxÞ ¼þ1; if D2ðxÞ ¼ þ1

�1; if D2ðxÞ ¼ �1

�1; if D1ðxÞ ¼ �1

;

8<

: ð2Þ

where D1 and D2 are the hard-output decisions of the

classifiers from the two stages of the hierarchy,

respectively.

The problem of unbalanced classes persists even in

the two-stage hierarchical classification-based

approach. At the first stage, the number of arterial

calcifications is significantlty lower than the the

number of candidates belonging to the image noise.

At the second stage, the number of aortic calcifica-

tions is very small as compared to the number of

coronary calcifications. We address the problem of

the unevenness of the positive and negative classes by

employing an asymmetric random sampling strategy.

In asymmetric random sampling, the candidates are

randomly selected from the majority class until the

number of the selected candidates is equal to the

number of the candidates from the minority class.

However, a single learning agent or classifier

trained on an under-sampled data subset assumes that

the distribution of candidates in the sampled subset is

the same as in the original dataset, which may not be

true in practical situations. Thus, some potentially

relevant candidates may get eliminated during the

sampling process, hampering the performance of the

classifier. An ensemble of multiple classifiers that are

trained on different independently sampled subsets

would improve the accuracy of prediction over a

single classifier. Moreover, distortion of data distri-

bution that can be introduced via data sampling will

be minimized. Thus, for each stage of the hierarchy,

we train individual classifiers whose decisions are

combined to obtain the final decision at that stage

(Fig. 3). We investigated two combination rules—

simple majority voting rule (MVR) and weighted

majority rule (WMR)—to combine the decisions of

the individual classifiers. In the WMR, the weights

are directly related to the competency or accuracy of

the classifiers, and are computed as the logarithm of

odds of competency, wi ¼ logð Ai

1�AiÞ; where Ai is the

prior probability for correct classification of ith

classifier, computed from the validation set.

Fig. 2 A schematic of the classification framework for

coronary calcium detection

Int J Cardiovasc Imaging

123

To further reduce the error of classification,

asymmetric penalty cost may be applied to the

different types of errors. Support vector machines

(SVMs) [16] are capable of incorporating different

costs for the positive and the negative classes by

modifying their objective function as follows:

1

2wTwþ Cp

X

ijyi¼þ1

ni þ Cn

X

ijyi¼�1

ni; ð3Þ

where w is a normal vector perpendicular to the

separating hyperplane; yi is a label for each ith data

sample with ?1 for positive outcome and -1 for

negative outcome; ni is a slack variable allowing soft

margins (i.e., training errors for data that may not be

completely separable); and Cp and Cn are the cost

parameters for the positive and the negative classes,

respectively [17]. However, the specific costs for this

problem are not easy to determine. We employ a grid

search algorithm in which a classifier determines the

optimal costs based on the cross-validation accuracy

obtained using three-fold cross-validation. In n-fold

cross-validation, the training data is first partitioned

into n subsets and then, classifier uses one of the n

subsets for validation and the rest n - 1 subsets for

training. The cross-validation accuracy is computed by

averaging the accuracy of classifier from the n-folds.

In grid search method, a discrete grid is constructed

in the parameter space and the parameter values at

each grid node are evaluated using the n-fold cross-

validation method. The parameter values at the node

having the highest cross-validation accuracy are

finally selected to train the classifier [18].

Training

The candidate regions are extracted from the training

CT scans using the minimum size and HU-density

constraints. From the training set, we generate four

asymmetrically-sampled training subsets for each

stage of the hierarchy with equal number of candi-

dates in the positive and negative classes. Next, we

compute the features for all the candidate regions.

Then, we apply the feature selection method to select

the most relevant and non-redundant feature subset

for each stage of the hierarchy. The feature subset

with highest cross-validation accuracy is selected.

Finally, we construct a two-stage classifier using

multiple classifiers trained on the selected subsets of

features for each stage. We use LIBSVM [19] to train

the individual SVMs using a nonlinear Gaussian

radial basis kernel function. The width parameter c,

the penalty cost parameter C, and the ratio of

asymmetric misclassification costs Cp/Cn are opti-

mized using a grid search technique and the three-

fold cross-validation method.

Deployment

In the deployment or testing phase, the candidate

regions are extracted using the size and HU-density

constraints from the CT scan. The features from the

selected feature subsets are computed for the candi-

date regions and provided as input to the two-stage

hierarchical classifier. The classifier then predicts for

each candidate region whether it is a coronary

calcium region or not.

Family of classifiers

To demonstrate the importance of using an ensemble

of multiple classifiers over a single classifier, and a

two-stage hierarchical classifier over a single stage

classifier, we constructed the following classifiers:

L1: single stage, single classifier

L2: single stage, ensemble classifier with MVR rule

L3: single stage, ensemble classifier with WMR

rule

L4: two-stage classifier using a single classifier at

each stage

L5: two-stage classifier using an ensemble classifier

with MVR rule at each stage

Fig. 3 Depiction of the construction of an ensemble classifier

using multiple classifiers

Int J Cardiovasc Imaging

123

L6: two-stage classifier using an ensemble classifier

with WMR rule at each stage

L7: two-stage classifier using an ensemble classifier

with MVR rule and asymmetric penalty costs at

each stage

L8: two-stage classifier using an ensemble classifier

with WMR rule and asymmetric penalty costs at

each stage.

Results

The performance of the classifiers was evaluated in

terms of sensitivity, specificity and accuracy based on

metrics derived from confusion matrix. These statis-

tical performance measures can be computed in terms

of true-positives (TP), true-negatives (TN), false-

positives (FP) and false-negatives (FN) as following:

Sensitivity ¼ TP

TPþ FN;

Specificity ¼ TN

TN+FP;

Accuracy ¼ TPþ TN

TP+TN+FP+FN:

First, we present the results of the feature analysis

experiments that were conducted using single stage

classifiers to emphasize the choice of the features

used. Then, we evaluate the performance of the

proposed two-stage hierarchical classifiers and ana-

lyze the errors in detail.

Feature analysis

To systematically assess the effectiveness of different

types of feature groups to detect the coronary

calcium, we constructed various feature sets by

combining different types of feature groups incre-

mentally. The various feature sets that were con-

structed are listed in the Table 1. Each of the feature

set was further reduced to a subset of the most

relevant and non-redundant features, independently,

through the feature selection process. The effective-

ness of each feature set was assessed using the single

stage classification approach. Thus, three single stage

classifiers (i.e., L1, L2, and L3) were applied to detect

the coronary calcifications.

Table 2 presents the performance metrics for

these. Using the heart-centered coordinate system-

based features (F2) instead of the absolute image

coordinate system-based features (F1) improved the

sensitivity by approximately 7.0 and 1.5% when used

independently and in combination with the other

Table 1 The feature sets formed by combining various feature

groups

Feature set Feature groups

S1 F1

S2 F2

S3 F2 ? F7

S4 F2 ? F7 ? F3

S5 F2 ? F7 ? F3 ? F4 ? F5 ? F6

S6 F2 ? F7 ? F3 ? F4

S7 F2 ? F7 ? F3 ? F6

S8 F1 ? F7 ? F3 ? F6

Table 2 Performance evaluation of single stage classifiers

Classifier Feature

set

Sensitivity

(%)

Specificity

(%)

Accuracy

(%)

L1 S1 70.62 92.97 91.79

S2 77.64 95.61 94.65

S3 85.41 97.36 96.73

S4 89.50 97.43 97.01

S5 91.32 97.38 97.06

S6 88.39 96.18 95.77

S7 93.29 98.34 98.07

S8 91.67 97.97 97.64

L2 S1 69.46 93.30 92.04

S2 76.98 96.07 95.06

S3 85.01 97.78 97.10

S4 88.49 97.92 97.42

S5 91.62 98.41 98.05

S6 88.24 98.07 97.55

S7 92.63 98.48 98.17

S8 91.52 98.12 97.77

L3 S1 69.71 93.12 91.88

S2 77.34 95.88 94.90

S3 85.82 97.54 96.92

S4 88.89 97.72 97.26

S5 92.33 98.14 97.83

S6 88.69 97.83 97.34

S7 92.98 98.38 98.09

S8 91.82 97.99 97.66

Bold values indicate the best performing feature test

Int J Cardiovasc Imaging

123

features, respectively. Adding object features (F7) to

the spatial location features improved the sensitivity

and specificity by almost 10.0 and 1.5%, respectively,

in feature set S3. The pixel-based texture features

computed at the peak HU pixel (F3) further improved

the sensitivity by 3.5% in feature set S4. Addition of

the region-based features computed for the candidate

region (F4) in feature set S6 didn’t have significant

effect on the prediction of the classifiers. However,

addition of the region-based features computed for

the combined candidate and neighborhood region

(F6) in feature set S7 improved the sensitivity by

almost 5%. Inclusion of all the region-based features

in feature set S5 reduced the sensitivity over the

feature set S7 by almost 2.0, 1.0, and 0.7% for L1, L2,

and L3, respectively. Thus, the best performance was

obtained for the feature set S7 comprising of the

features F2, F7, F3, and F6 for all the three classifiers.

Comparison of the classifiers

The two-stage hierarchical classifiers (i.e., L4, L5, L6,

L7 and L8) were constructed for the two feature sets

(S5 and S7) that provided highest sensitivity and

accuracy for the single stage classifiers. The perfor-

mance metrics of the single stage classifiers and the

two-stage hierarchical classifiers are presented in the

Tables 2 and 3, respectively. The ensemble-based

classifiers exhibited the highest accuracy for both

single stage and two-stage classifiers. The MVR rule

proved to be the best decision fusion rule for the

ensemble-based classifiers in terms of accuracy.

Among the ensemble-based classifiers using the

MVR rule, the single stage classifiers provided higher

sensitivity; however, the two-stage hierarchical clas-

sifiers provided higher accuracy.

The feature set S7 proved to be better than the

feature set S5 for the ensemble-based classifiers in

terms of the sensitivity with specificity remaining

almost same. Using the asymmetric penalty costs

improved the sensitivity by 0.10–0.50% for the two-

stage ensemble-based classifiers. Overall, the highest

accuracy of 98.27% (sensitivity: 92.07%, specificity:

98.62%) was achieved for the two-stage hierarchical

L7 classifier for the feature set S7.

Misclassification error analysis

Table 4 presents the misclassification error analysis

for the L1, L2, and L7 classifiers. The misclassification

error percentage, E1, was computed as the ratio of the

number of misclassifications of a particular candidate

to the overall number of misclassfications. The

category-wise misclassification error percentage,

Table 3 Performance evaluation of two-stage hierarchical

classifiers

Classifier Feature

set

Sensitivity

(%)

Specificity

(%)

Accuracy

(%)

L4 S5 91.12 98.25 97.87

S7 91.67 97.66 97.34

L5 S5 90.51 98.59 98.16

S7 91.62 98.63 98.26

L6 S5 91.12 98.30 97.92

S7 92.28 98.30 97.98

L7 S5 90.66 98.63 98.21

S7 92.07 98.62 98.27

L8 S5 91.22 98.36 97.98

S7 92.48 98.35 98.04

Bold values indicate the best performing feature test

Table 4 Misclassification error analysis of L1, L2, and L7 classifiers

Classifier Error Noise AA DA LM LAD LCX RCA

L1 E1 73.02 8.58 0.00 3.60 1.52 3.46 9.82

E2 1.52 21.60 0.00 27.95 1.54 7.26 8.52

L2 E1 69.64 9.05 0.00 3.94 1.75 5.26 10.36

E2 1.37 21.60 0.00 29.03 1.68 10.46 8.52

L7 E1 70.90 4.49 0.31 6.04 1.70 3.56 13.00

E2 1.31 10.10 0.50 41.93 1.54 6.68 10.08

E1 depicts the % of misclassifications in a particular category of all the misclassifications. E2 depicts the % of the candidate regions

misclassified in a particular category of all the candidate regions in that category

Int J Cardiovasc Imaging

123

E2, was computed as the ratio of the number of

misclassified candidate regions in a particular category

to the total number of candidate regions in that

category. In the negative class, the noise artifacts had

the highest misclassification error percentage (E1),

while the ascending aorta calcifications had the highest

category-wise misclassification error percentage (E2).

Likewise in the positive class, the right coronary artery

calcifications had the highest misclassification error

percentage (E1), while the left main artery calcifica-

tions had the highest category-wise misclassification

error percentage (E2). The two-stage classifier L7

misclassified 50% less aortic calcifications, 4% less

image noise, and 7.5% more coronary calcifications

than the single stage classifier L2.

Discussion

Our results demonstrate the feasibility of an auto-

mated coronary calcium detection system using a

classification-based method. However, the choice of

features and the classification approach have played

an important role toward calcification detection. Our

analysis provides the starting point to automate the

coronary calcium detection process by careful choice

of the features and the classification approach.

Our results suggest that a heart-centered coordi-

nate system provides a compact representation for the

spatial location of the coronary arteries as opposed to

the absolute image coordinate system. However, in

this paper we have used a simple bounding box-based

coordinate system to represent the location of the

calcifications. The currently used coordinate system

is not compact, nor rotation-invariant, nor patient-

invariant. A more compact frame of reference using a

local heart-centered coordinate system, as recently

proposed by our group [20, 21], which is invariant to

translation, scale, and rotation, could further improve

the accuracy of the calcification detection system.

However, development of a patient-invariant coordi-

nate system may require actual segmentation of the

coronary arteries which is a difficult problem owing

to the low resolution, anisotropy, and lack of contrast.

Our experiments show that the texture features are

able to characterize the coronary calcifications well

and the neighborhood region of the coronary calcium

plays an important role in distinguishing the coronary

calcium from the other candidate regions. Using the

combined region—the neighborhood region and the

candidate region—to compute the region-based

texture features significantly improved the sensitivity

of the system over using these regions individually.

Our analysis indicates that the proposed hierarchi-

cal classification approach proves to be effective in

reducing the false positives from the aorta calcifica-

tions and the image noise as compared to the single

stage classifiers. Our method achieved sensitivity of

92.07% at the expense of 4.65 false positives per

scan. Also, the ensemble of multiple classifiers

further reduces the false positives occuring due to

the image noise. Nevertheless, there is high percent-

age of the misclassification errors for the calcifica-

tions in the aorta, left main artery and the right

coronary artery. This is because of the significant

overlap in the relative positions of the ascending

aorta near the origin, the left main artery and the

proximal region of the right coronary artery in

different subjects due to anatomical variations. Fur-

ther investigation is required into the design of

informative features to discriminate between the

ascending aorta calcifications and the coronary cal-

cifications near the ostia.

None of the previous studies have attempted to

explicitly deal with the issues of unbalanced class

priors, and complex class compositions that are the

characteristics of the given problem. Such issues have

been extensively studied in other applications that

have similar requirements. Wu et al. [22] proposed a

hierarchical classification scheme to overcome the

above mentioned asymmetries in face detection

problem. Yan et al. [23] proposed an ensemble of

SVMs for predicting rare classes in scene classifica-

tion. Various techniques for ensemble construction

can be categorized mainly in four categories [24]: (1)

manipulating the feature space, (2) manipulating the

training sets, (3) manipulating the output labels, and

(iv) introducing randomness in the learning process.

We have utilized the advantages of such hierarchical

and ensemble-based techniques in our classification

approach to overcome the issues of asymmetry in the

given problem.

The current cardiovascular risk scoring methods

do not take into account the wealth of information

that is available in the non-contrast CT data. Studies

indicate that pericardial fat may be a significant

cardiovascular risk factor [25]. Other studies have

shown that calcification of the thoracic aorta, aortic

Int J Cardiovasc Imaging

123

arch and aortic valve are associated with increased

risk of cardiovascular disease [26, 27]. Moreover, a

recent study [28] indicated that the spatial distribu-

tions of coronary calcifications plays an important

role in prediction of cardiovascular heart disease and

can be used to improve the current coronary calcium

scoring technique. However, there is a dearth of

automatic techniques to mine the imaging data for

required information. Furthermore, validation in large

epidemiological studies is needed to determine which

type of information will offer additive predictive

value. Our long term goal is to contribute to the

development of quantitative methods to assess cumu-

lative risk of vulnerable patients by developing new

techniques to mine additional information from the

imaging data [29–31]. Toward this goal, we are

developing novel robust computational methods for

segmentation and classification of various anatomical

structures and tissues (e.g., thoracic cavity [32], heart

[33], aorta [34, 35], and fat [36]) from non-contrast

CT data.

Study limitations

Though our study have demonstrated that a classifi-

cation-based method can detect coronary calcifica-

tions with proper selection of the type of features and

classification approach, the findings of this study are

limited with respect to the imaging modality of

EBCT. With the increased use of multi-detector CT

(MDCT) in the routine practice for calcium scoring, it

would be interesting to assess the results of our method

on MDCT scans. Another major limitation of our study

is that the cohort constitutes a higher risk population

(i.e., the patients were physician-referred for CHD risk

stratification). Currently, our method requires the user

to input the start and end slice numbers to define the

extent of the heart in transverse slices. However, we

are developing methods to automate the detection of

the heart extent in the z-axis in order to deploy our

method on a larger cohort.

Conclusion

In this paper, we have presented a classification-

based method to detect the coronary calcifications

from the non-contrast CT scans. We have demon-

strated that a two-stage hierarchical classifier using

multiple classifiers is more robust, in reducing the

false positives keeping a high detection rate than

the single stage classifiers. At the first level of the

hierarchy, the method discriminates between the

arterial (the coronary and the aortic) calcifications

and other highly dense regions present within the

heart region. At the second level, the method

distinguishes the coronary calcifications from the

aortic calcifications without requiring explicit seg-

mentation of the aorta. Our results indicate that the

neighborhood region of the coronary calcium con-

tains significant clues to distinguish them apart from

the other high density structures. Thus, a classifica-

tion-based method has potential to be used in the

calcium scoring softwares to ease the burden of the

observers by reducing the manual interaction required

and eventually, eliminate them completely.

Acknowledgments This work was supported in part by NSF

Grants IIS-0431144 and CNS-0521527.

References

1. National Institutes of Health (2007) Disease statistics.

National Heart Lung and Blood Institute, Technical report

2. Greenland P, LaBree L, Azen S, Doherty T, Detrano R

(2004) Coronary artery calcium score combined with

Framingham score for risk prediction in asymptomatic

individuals. J Am Med Assoc 291(2):210–215

3. Shaw L, Raggi P, Schisterman E, Berman D, Callister T

(2003) Prognostic value of cardiac risk factors and coro-

nary artery calcium screening for all-cause mortality.

Radiology 228(3):826–833

4. Taylor A, Bindeman J, Feuerstein I, Cao F, Brazaitis M,

O’Malley P (2005) Coronary calcium independently pre-

dicts incident premature coronary heart disease over

measured cardiovascular risk factors: mean three-year

outcomes in the prospective army coronary calcium

(PACC) project. J Am Coll Cardiol 46(5):807–814

5. Wong ND, Budoff MJ, Pio J, Detrano RC (2002) Coronary

calcium and cardiovascular event risk: evaluation by age-

and sex-specific quartiles. Am Heart J 143(3):456–459

6. Edvardsen T, Detrano R, Rosen B, Carr J, Liu K, Lai S,

Shea S, Pan L, Bluemke D, Lima J (2006) Coronary artery

atherosclerosis is related to reduced regional left ventric-

ular function in individuals without history of clinical

cardiovascular disease: the multi-ethnic study of athero-

sclerosis. Arterioscler Thromb Vasc Biol 26(1):206–211

7. Rumberger J, Kaufman L (2003) A rosetta stone for cor-

onary calcium risk stratification: agatston, volume, and

mass scores in 11,490 individuals. Am J Roentgenol

181(3):743–748

8. Ukai Y, Niki N, Satoh H, Watanabe S, Ohmatsu H, Eguchi

K, Moriyama N (1996) An algorithm for coronary

Int J Cardiovasc Imaging

123

calcification diagnosis based on helical CT images. In:

Proceedings of International conference on pattern recog-

nition, vol. 3. Washington, DC, pp 543–547

9. Ukai Y, Niki N, Satoh H, Watanabe S, Ohmatsu H, Eguchi

K, Moriyama N (1998) A coronary calcification diagnosis

system based on helical CT images. IEEE Trans Nucl Sci

45(6):3083–3088

10. Isgum I, Rutten A, Prokop M, van Ginneken B (2007)

Detection of coronary calcifications from computed

tomography scans for automated risk assessment of coro-

nary artery disease. Med Phys 34(4):1450–1461

11. Kurkure U, Chittajallu D, Brunner G, Yalamanchili R,

Kakadiaris I (2008) Detection of coronary calcifications

using supervised hierarchical classification. In: Proceed-

ings of medical image computing and computer-assisted

intervention workshop on computer vision for intravascular

and intracardiac imaging. New York, NY

12. Hu S, Hoffman E, Reinhardt J (2001) Automatic lung

segmentation for accurate quantitation of volumetric X-ray

CT images. IEEE Trans Med Imaging 20(6):490–498

13. Laws K (1980) Texture image segmentation. Ph.D. dis-

sertation, University of Southern California

14. Haralick RM, Shanmugam K, Dinstein I (1973) Textural

features for image classification. IEEE Trans Syst Man

Cybern 3:610–621

15. Peng H, Long F, Ding C (2005) Feature selection based on

mutual information: criteria of max-dependency, max-rel-

evance, and min-redundancy. IEEE Trans Pattern Anal

Mach Intell 27(8):1226–1238

16. Vapnik V (1998) Statistical learning theory. Wiley-Inter-

science, ISBN 0471030031

17. Osuna E, Freund R, Girosi F (1997) Support vector

machines: training and applications. Massachusetts Insti-

tute of Technology, Technical Report AIM-1602, [Online].

Available: http://www.dspace.mit.edu/handle/1721.1/7290

18. Hsu C, Chang C, Lin C (2003) A practical guide to support

vector classification. National Taiwan University, Techni-

cal report

19. Chang C-C, Lin C-J (2001) LIBSVM: a library for support

vector machines. Software available at http://www.csie.

ntu.edu.tw/

20. Brunner G, Chittajallu D, Kurkure U, Kakadiaris I (2008) A

heart-centered coordinate system for the detection of coronary

artery zones in non-contrast computed tomography data. In:

Proceedings of medical image computing and computer-

assisted intervention workshop on computer vision for intra-

vascular and intracardiac imaging. New York, NY

21. Brunner G, Chittajallu D, Kurkure U, Kakadiaris I (2010)

Toward the automatic detection of coronary artery calci-

fication in non-contrast computed tomography data. Int J

Cardiovasc Imaging (in press)

22. Wu J, Brubaker S, Mullin M, Rehg J (2008) Fast asym-

metric learning for cascade face detection. IEEE Trans

Pattern Anal Mach Intell 30(3):369–382

23. Yan S, Li M, Zhang H, Cheng Q (2003) Ranking prior

likelihood distributions for bayesian shape localization

framework. In: Proceedings of IEEE international confer-

ence on computer vision, pp 51–58

24. Dietterich T (2000) Ensemble methods in machine learn-

ing. Lect Notes Comput Sci 1857:1–15 [Online]. Avail-

able: http://citeseer.ist.psu.edu/dietterich00ensemble.html

25. Rosito G, Massaro J, Hoffmann U, Ruberg F, Mahabadi A,

Vasan R, O’Donnell C, Fox C (2008) Pericardial fat, vis-

ceral abdominal fat, cardiovascular disease risk factors,

and vascular calcification in a community-based sample:

the Framingham heart study. Circulation 117(5):605–613

[Online]. Available: http://www.circ.ahajournals.org/cgi/

content/abstract/117/5/605

26. Rodondi N, Taylor BC, Bauer DC, Lui LY, Vogt MT, Fink

HA, Browner WS, Cummings SR, Ensrud KE (2007)

Association between aortic calcification and total and

cardiovascular mortality in older women. J Intern Med

261(3):238–244

27. Witteman JC, Kok FJ, van Saase JL, Valkenburg HA

(1986) Aortic calcification as a predictor of cardiovascular

mortality. Lancet 2(8516):1120–1122

28. Brown E, Kronmal R, Bluemke D, Guerci A, Carr J,

Goldin J, Detrano R (2008) Coronary calcium coverage

score: determination, correlates, and predictive accuracy in

the multi-ethnic study of atherosclerosis. Radiology

247:669–675

29. Kakadiaris I, Kurkure U, Bandekar A, O’Malley S,

Naghavi M (2010) Cardiovascular informatics. In: Para-

gios N, Ayache N, Duncan J (eds) Handbook of biomedical

imaging. Springer, Berlin (in press)

30. Kakadiaris I, Kurkure U, Mendizabal-Ruiz E, Naghavi M

(2009) Towards cardiovascular risk stratification using

imaging data. In: Proceedings of 31st international con-

ference of the IEEE engineering in medicine and biology

society. Minneapolis, MN

31. Kakadiaris I, Mendizabal-Ruiz E, Kurkure U, Naghavi M

(2009) Challenges and opportunities for extracting car-

diovascular risk biomarker from imaging data. In: Pro-

ceedings of 14th Iberoamerican congress on pattern

recognition. Guadalajara, Mexico, 15–18 November, pp

227–235

32. Chittajallu D, Balanca P, Kakadiaris I (2009) Automatic

delineation of the inner thoracic region in non-contrast CT

data. In: Proceedings of 31st international conference of

the IEEE engineering in medicine and biology society.

Minneapolis, MN

33. Chittajallu D, Brunner G, Kurkure U, Yalamanchili R,

Kakadiaris I (2009) Fuzzy-cuts: a knowledge-driven graph-

based method for medical image segmentation. In: Pro-

ceedings of IEEE computer society conference on computer

vision and pattern recognition. Miami Beach, FL

34. Kurkure U, Avila-Montes O, Kakadiaris I (2008) Auto-

mated segmentation of thoracic aorta in non-contrast CT

images. In: Proceedings of IEEE international symposium

on biomedical imaging: from nano to macro. Paris,

France

35. Avila-Montes O, Kurkure U, Kakadiaris I (2010) Aorta

segmentation in non-contrast cardiac CT images using an

entropy-based cost function. In: Proceedings of society of

photographic instrumentation engineers medical imaging

conference. SPIE, San Diego, CA, Feb 13–18 (in press)

36. Yalamanchili R, Kurkure U, Dey D, Berman D, Kakadiaris

I (2010) Knowledge-based quantification of pericardial fat

in non-contrast CT data. In: Proceedings of society of

photographic instrumentation engineers medical imaging

conference. SPIE, San Diego, CA, Feb 13–18 (in press)

Int J Cardiovasc Imaging

123