In our study, the majority of patients (91%) were women, and out of 70 malignant nodules, 7 cases (10%) were men, and 63 cases (90%) were women. This gender predominance in suffering from nodular thyroid diseases and malignant thyroid nodules is likely due to differences in estrogen hormone levels between the sexes and its effect on hormone receptors in thyroid cells (
6,
7). In our study, the mean age of patients with malignant nodules was significantly lower compared to those with benign nodules, consistent with findings from other studies (
9).
Fine needle aspiration has long been considered the gold standard for evaluating thyroid nodules. However, ultrasound evaluation has recently gained increasing attention in diagnosing malignant nodules and determining the treatment plan in patients with nodular thyroid diseases. In older guidelines, FNA was routinely performed on nodules larger than 10 mm to distinguish benign from malignant nodules. This approach has led to unnecessary FNAs of benign nodules, which are not only unpleasant for the patient but also costly, and sometimes result in indeterminate cytopathologic reports. There are also challenges in differentiating among various follicular pathologies (e.g., follicular neoplasms including follicular adenoma, follicular carcinoma, or follicular variant of papillary thyroid cancer), Hurthle cell neoplasm, and new Bethesda classification pathologies such as atypia of undetermined significance (AUS) and follicular lesion of undetermined significance (FLUS) in FNA, which may lead to lobectomy, isthmectomy, and decision-making based on frozen sections during surgery.
With the introduction of ultrasonography and new classification guidelines, especially TIRADS, it has become possible to predict the potential malignancy of nodules based on composition, echogenicity, margins, microcalcifications, and echogenic foci. Consequently, unnecessary FNAs have been reduced, and the predictive accuracy of the nature of nodules has significantly improved before definitive surgery (
2,
10). With the revision of these guidelines, the sensitivity and diagnostic accuracy of TIRADS have gradually increased (
11). The latest guideline in this field, AI-TIRADS, has higher sensitivity, predictive value, and diagnostic accuracy compared to ACR-TIRADS, as demonstrated in multiple studies (
6,
7,
12). Sensitivity and diagnostic accuracy are even relatively higher in cases with TIRADS 2 and 5 sonographic evaluations.
Similarly, in our study, the sensitivity and positive predictive value of AI-TIRADS were found to be 0.94 and 0.76 for all TR scores, while these values were 0.97 and 0.97 in patients with TR = 2 and TR points ≥ 7 (classic TIRADS 2 and 5 sonographic reports). Meanwhile, the negative predictive value of AI-TIRADS was 0.91 across all TR groups, and 0.97 in patients with TR = 2 and TR points ≥ 7. The overall diagnostic accuracy of this index was 0.81, and it was 0.97 for the TIRADS 2 and 5 groups. All of these indices were comparatively stronger than other well-known classification guidelines, such as ACR-TIRADS or British Thyroid Association (BTA), and notably more pronounced in the extreme ends of classification, where benign lesions and high cancer probabilities were found (TR = 2 and TR ≥ 7) (
11).
Many clinicians recommend surgical resection when the thyroid nodule is larger than 4 cm, as larger nodules have an increased probability of being follicular thyroid cancer or the follicular variant of papillary thyroid cancer, and the false-negative rate of FNAB increases (
13). However, the size of the nodule is not an absolute criterion for thyroidectomy; surgery is determined based on pressure or aesthetic symptoms, or the suspicion of malignancy based on clinical presentation or malignant manifestations in ultrasound.
In this study, 86 patients were referred with pressure or aesthetic symptoms (24 patients with pressure symptoms and 62 with external symptoms). In the subgroup with pressure symptoms, 8 patients (33%) had malignant pathology, while 47 patients who were either incidentally found or followed up due to thyroid disease had malignant nodules. Among patients with nodules larger than 25 mm, 18 (29.5%) had malignancy, with papillary thyroid cancer (PTC) being the most common type in all size subgroups. Nodules that present with compression or external symptoms tend to be larger, while those discovered incidentally or during follow-up of thyroid disease are smaller and asymptomatic.
In this study, malignant pathology was inversely related to nodule size, meaning that patients with pressure symptoms often had benign pathology, whereas nodules found incidentally or during follow-up were more frequently malignant. In the majority of articles, smaller nodules are more likely to be malignant (
14,
15). However, there are conflicting results in the literature, and relying solely on size for decision-making is not a reliable approach (
13). It is now more accepted to make decisions based on ultrasound features, even for large nodules with benign FNA results. A higher probability of underlying malignancy should be considered if suspicious sonographic features are present. The new AI-TIRADS guideline, as demonstrated in our study, performs well in these problematic cases.
In Bethesda III cases, molecular and genetic tests are used to determine whether to follow up, repeat FNA, or consider surgery. These tests are not widely available, are expensive, and require expert interpretation. Their diagnostic accuracy is still under investigation, with varying results reported in the literature (
16). Therefore, in cases where FNA results fall into this category, ultrasound findings can guide the treatment plan, and surgery may be strongly recommended if the nodule has TR points ≥ 4.
In Bethesda IV cases, differentiating benign from malignant pathology in FNA results of follicular neoplasms can be challenging. Thyroidectomy remains a common recommendation for these nodules. Nodule size and sonographic features can also help guide treatment decisions, with short-term follow-up and repeat FNA recommended for nodules smaller than 25 mm with no suspicious sonographic features. Surgery is reserved for larger nodules (> 25 mm) or those with suspicious imaging findings (
17).
The future looks promising. With the use of artificial intelligence and the validation of ultrasound features for thyroid nodules, particularly through AI-TIRADS compared to ACR-TIRADS, it will be possible to differentiate malignant from benign nodules more accurately, especially in difficult thyroid nodule cases, as reviewed briefly above.