Excellent agreement was observed between the readings of two radiologists using mammography (κ = 0.86,
Table 1). Following the review of mammograms, one breast was classified in category a by radiologist A, and in category c by radiologist B. Also, four breasts were classified in category c by radiologist A, and in category b by radiologist B. Similarly, excellent agreement was observed between the two radiologists based on CT scans (κ = 0.91,
Table 2). Following the review of CT images, four breasts were classified in category c by radiologist A, and in category b by radiologist B. Also, one breast was classified in category d by radiologist A, and in category c by radiologist B.
Agreement between the two imaging modalities was moderate for both radiologists (radiologist A: κ = 0.50; radiologist B: κ = 0.43) (
Figures 2-
4). Radiologist A classified 1.94% (2/103) of breasts in category a, 17.48% (18/103) in category b, 80.58% (83/103) in category c, and none in category d after reviewing the mammograms. The distribution of corresponding categories, based on CT scans, was 5.82% (6/103), 29.13% (30/103), 62.14% (64/103), and 2.91% (3/103), respectively. Radiologist B classified 1.94% (2/103) of breasts in category a, 20.39% (21/103) in category b, 77.67% (80/103) in category c, and none in category d after reviewing the mammograms. Also, based on CT images, the distribution of corresponding categories was 5.83% (6/103), 33.01% (34/103), 59.22% (61/103), and 1.94% (2/103), respectively.
Classification of non-dense breasts (categories a & b, defined as type I) was not significantly different between the two radiologists, based on the mammograms (19.42% vs. 22.33%, P > 0.05) and CT scans (34.95% vs. 38.84%, P > 0.05). However, both radiologists classified significantly more breasts as non-dense according to CT scans, compared to mammography (radiologist A: 34.95% vs. 19.42%, P < 0.01; radiologist B, 38.84% vs. 22.34%, P < 0.01) (
Table 3).
Regarding breast abnormalities, micro-calcification was detected in three breasts only based on the mammograms. Of nine breasts with masses on CT images, two were missed in mammography (
Figure 5).
Moreover, we retrospectively collected the chest CT scans of 1916 female examinees (age: 40-86 years) and divided them into four groups with ten-year gaps: group 1 (40 - 49 years; n = 991); group 2 (50 - 59 years; n = 649); group 3 (60 - 69 years; n = 181); and group 4 (> 70 years; n = 95). The two radiologists read the CT scans, evaluated the breast composition, and resolved disagreements through discussion.
In group 1, 0.9% (9/991) of breasts was classified in category a, 18.06% (179/991) in category b, 77.90% (772/991) in category c, and 3.13% (31/991) in category d. In group 2, 2.77% (18/649) of breasts were classified in category a, 29.74% (193/649) in category b, 67.03% (435/649) in category c, and 0.46% (3/649) in category d. Also, in group 3, 15.47% (28/181) of women were classified in category a, 46.41% (84/181) in category b, 37.57% (68/181) in category c, and 0.55% (1/181) in category d. Finally, in group 4, 23.16% (22/95) were classified in category a, 56.84% (54/95) in category b, 20.00% (19/95) in category c, and 0% in category d (
Table 4). In the older age group, the probability of non-dense breasts increased, while the probability of dense breasts decreased (
Figures 6 and
7). The difference between the groups was statistically significant (P < 0.001). The difference between each age group was statistically significant (group 1 vs. 2, P < 0.001, group 1 vs. 3 , P < 0.001, group 1 vs. 4, χ
2 = P < 0.001, group 2 vs. 3 , P < 0.001, group 2 vs. 4, P < 0.001, group 3 vs. 4 , P = 0.018). The difference between each category group was statistically significant (category a vs. b, P < 0.001, category a vs. c, P < 0.001, category a vs. d, P < 0.001, category b vs. c, P < 0.001, category b vs. d, P < 0.001, category c vs. d, P = 0.007).