In this study, the proposed method was divided into two main stages. The first stage was related to the detection of suspicious mass regions on mammograms, while the second stage was related to the classification of these regions as benign, probably benign, malignant, and probably malignant. The steps of the proposed method are presented in
Figure 1. Each step will be described in the sections below.
3.2. Suspected Region Localization (SRL)
In this stage, the pre-processed mammogram, obtained from Section 3.1, was used as the input. This study focused on finding suspicious regions (SRs). Generally, analysis of mammograms is a difficult task due to the irregularity of mammogram texture. Since mammography is a non-stationary signal, discrete wavelet transform (DWT) (
16) is an efficient tool for analyzing different signal components at different resolution scales. A signal in wavelet decomposition can be represented as follows:
where and are the basic functions, and cij is a coefficient (a numerical value). The information and nature of a given signal are determined by the coefficients since the basic functions are fixed. Also, the Fourier transform of x is defined as follows:
where zn = . The quincunx downsampling version of x [n] can be written as follows:
where D is the generating matrix. Since the quincunx wavelet transform (QWT) requires more computations, the lifting scheme (LS) (
17) is an efficient and rapid implementation method of wavelet transform with low memory and computational complexity. The basic idea of LS is to split a signal (image) into even and odd samples (Equation 4); then, the odd samples are predicted based on the even ones. The resulting prediction error is used to update the even samples:
where the split is defined as follows:
Figure 4 presents the method of finding SRs in two phases. In the first phase, the DWC based on QLS was defined as follows:
The proposed steps for finding suspicious regions (SRs) in mammograms
Phase 1: Consider C = {c1, c2, …, cm|m = 1, 2, …, n} denote containing all independent coefficients of the QLS transform in the pre-processed mammogram image (Mp); the ck is a non-zero coefficient in iteration k. Density+ and Density- are defined as follows:
where is defined as follows:
Since the wavelet transform produces a large number of artifacts, the Gaussian filter has been used to eliminate noise; it also has better edges than , where is defined as:
Finally, max (1, α) is determined as follows:
Density - is also defined as:
Finally, DWC is defined as:
Phase 2: In this phase, the boundary of ROI was traced automatically using a Canny edge detector, based on the density defined in the previous phase (Equation 13):
Canny Edge Detection: The Canny edge detector was used for SR extraction.
Hole-Filing: If S is a symmetric structuring element, the hole-filling algorithm can be defined as follows:
where the set of xi and B contains all filled holes and their boundaries. The algorithm terminates when xi = xi-1.
Morphological Operation: The morphological disks were used for filtering so as to discard any noisy objects.
Figure 3 presents the steps of mass segmentation and identification from the mammogram.
3.3. Mass Classification
The classification stage is performed using a shape feature classifier. This step aims to distinguish between benign and malignant masses. Although several morphological features have been used for the classification of benign and malignant masses in the literature (
12-
14,
18), it is preferable to use an optimal number of features that allow the CAD system to achieve the desired performance. The borders of benign tumors are relatively smooth, while malignant tumors have much more irregular borders. According to this hypothesis, the tumor compactness can be calculated based on the following equation (
19):
where CI represents the compactness index, P denotes the total number of pixels at the edge or on the margin of tumor, and A is the number of pixels or tumor area. If CI is equal to zero, the mass is circular; otherwise, it has other shapes. The compactness index is generally used to measure the tumor shape characteristics. A high CI value indicates a large perimeter enclosing a small area. As the tumor shape becomes more complex and rougher, the value of the CI index increases. Moreover, CI is independent of linear transformations, rotations, starting point, and size of a given contour and measures the degree of roughness of a region. Therefore, typical malignant masses are expected to have higher CI values as compared to typical benign tumors.
To distinguish between different mass shapes, we used certain thresholds for mass classification. The thresholds were determined by comprehensive comparisons between different mass shapes, which were annotated by the radiologists in the CBIS-DDSM dataset. First, we calculated the CI parameter for each mass in the CBIS-DDSM dataset. If CI is < 60%, the mass is benign, whereas if it is ≥ 80%, it is considered malignant. If CI
[60%, 70%) means CI ≥ 60% and < 70%, therefore, “If CI
[60%, 70%) the mass is probably benign and otherwise, the mass is probably malignant (
Figure 5). The SRs detected in the previous stage were analyzed based on the thresholds and classified into four categories.
3.4. Dataset
In this study, we employed a new updated and standardized version of the DDSM database (
20), that is, the curated breast imaging subset of DDSM (CBIS-DDSM) (
21), which contains images with a standard DICOM format. The CBIS-DDSM database was reviewed and annotated by trained mammographers. We used all mass cases in this database, which included 1593 mammograms (829 benign and 764 malignant cases) of 892 cases and a total of 1698 masses with ground truth (GT) masks (i.e., annotations).
Besides, the CBIS-DDSM database contains pixelwise annotations for ROIs, such as mass shape, Breast Imaging-Reporting and Data system (BI-RADS) rating (rating: 0, 2 - 6), and lesion pathology (two annotations of benign or malignant). The ROI segmentation, bounding boxes, and pathological GT for diagnosis were also included in this dataset. Each view was used as a separate image, and all images were resized to 1024 × 1024 pixels, using bicubic interpolation. In our experiments, the ROIs were extracted using binary segmentation masks, provided in the CBIS-DDSM dataset.
3.5. Evaluation Protocol
Several indicators were used to evaluate the performance of the proposed method in this study. The ROI denotes the SR found by the algorithm, and the region GT represents the GT mask. For simplicity, the same notation was used for a region and its number of pixels. The sensitivity, average FPPI, precision, and Jaccard index (IoU) were also measured to evaluate the performance of the proposed segmentation method. Moreover, sensitivity was examined to determine the proportion of true positives (TP) correctly identified. Sensitivity was also determined as the TPR or probability of detection (
Figure 6) and defined as follows:
Examples of sensitivity measurement for the detection of suspicious regions (SRs). A, Ground truth region (GT); B, No coverage of GT (i.e., GT ⋂ SR = ∅), sensitivity = 0; C, One of the SRs completely overlapping with the GT region (complete coverage of GT; GT ⋂ SR≠∅), sensitivity = 1; and D, One of the SRs covering a portion of the GT region (partial coverage of GT; GT ⋂ SR ≠ ∅), sensitivity = 1.
In a CAD system, high sensitivity is needed to find the SRs, even at high FP costs. The FP rate can be reduced through classification. The FPPI was defined as follows:
The precision and Jaccard index (IoU) were also defined as follows:
Generally, the Jaccard index is used to measure the SR boundary overlap with the GT boundary. To evaluate the classification method performance, we used the receiver operator characteristic (ROC) curve and measured the area under the curve (AUC) and overall accuracy. The ROC curve indicated the diagnostic ability of the classifier system and was defined based on sensitivity and specificity as follows (
14):
The overall classification accuracy was calculated as follows:
The negative predictive value (NPV) and positive predictive value (PPV) were also defined as follows: