Today, breast cancer (BC) is considered the leading mortality culprit in the female population. In the Western communities, about 10% of women are susceptible to the disease, which makes up 12.5% worldwide (
1,
2). In the United States, one in eight women is at greater risk of BC during their lifetime (
3,
4). Additionally, it is estimated that about 200 million people will suffer from the disease annually in India by 2030, which is, per se, an epidemic for the country (
5). Today, evidence implies that the BC is considered a global challenge due to its heterogeneous, multifactorial, violent nature, and destructive effects on health (
6,
7). According to reports, it has been well established that malignant BC is often invasive and develops in the early stages in the mammary glands and ducts (
8). It is followed by diffusion to the surrounding tissues, adjacent lymph nodes and metastasizes to the bones, liver, brain, or lungs in the advanced stages (
9,
10). Most regretfully, many malignancies are diagnosed late in the advanced stages, with the tumor metastasizing to tissues around the breast, axillary lymph nodes, and even other organs (
11,
12). Reportedly, Numerous clinical and nonclinical factors may affect the incidence of BC (
13). Hence, the most effective way to reduce BC mortality is timely detection and treatment, which, in turn, necessitates faster diagnosis in the early stages.
Moreover, it is very demanding to differentiate between benign and malignant cancers in the initial diagnosis (
14,
15). Therefore, to come up with an accurate and correct method for early detection is of great significance. A biopsy is the best way to diagnose benign or malignant cancers. However, it is an invasive and expensive procedure (
16). Also, physicians and cancer specialists usually analyze clinical and laboratory data manually and then opt for a relevant decision, making the method slow, expensive, time-consuming, and subjective (
1). Given the different stages and severity of the disease and some ambiguities and unpredictable conditions about its consequences, it is imperative to adopt innovative technologies for screening (
17). Also, so much research has focused on statistical methods and artificial intelligence (AI) in predicting cancer (
16).
Recently, researchers have shown great interest in developing new and non-invasive digital technologies such as AI that can effectively prompt accurate and timely detection of malignancies (
18). It is claimed that these technologies may minimize diagnostic errors and discrepancies among observers at any level of prediction, prognosis, and treatment. Therefore, diagnostic and prognostic models can help identify at-risk patients and adopt the most effective support and treatment programs (
3,
19-
21). Machine learning (ML), a branch of AI, can extract high-quality knowledge and patterns from a substantial raw dataset. Also, it can ease evidence-based risk analysis, screening, predictive, and care planning research and support reliable clinical decisions. Thus, it might improve patient care outcomes and quality and reduce uncertainty and ambiguity (
3,
4,
22).
Data mining (DM) methods are used for BC in various areas, including early detection, differentiation of benign or malignant nature, prediction of patient survival after treatment, and the possibility of its recurrence (
1,
2,
5,
23). It can also help physicians achieve significant results without dependency on invasive and complicated procedures (
1). In this regard, many ML-based algorithms are applied for predicting and classifying BC outcomes.