This study systematically reviewed the published evidence regarding the optimal policies based on MDPs and POMDPs in cancer screening, and six publications were identified. The review results suggest that the optimal cancer screening policies are control-limit types; they outperform guidelines in terms of benefits, and the cancer risk threshold is a non-decreasing function of age. The results showed that individual screening based on cancer risk and age would have more benefits.
Based on our knowledge, this is the first study that has reviewed the optimization of the economic and clinical benefits for different cancer screening programs using MDPs and POMDPs models. The first objective was to describe the optimal policy in cancer screening programs. According to the current review, all studies showed that the optimal screening policy should be a control-limit type. They suggested that at every age, there is a risk threshold such that if the cancer probability is greater than the threshold, screening should be stopped, and the patient must be referred for biopsy or treatment (
14,
27,
32-
35). This policy can exclude some individuals from undergoing unnecessary screenings and consequentially reduces costs.
Age and cancer risk are two crucial factors in cancer screening; nevertheless, the guidelines do not provide a quantitative relationship between these factors and screening decisions. For example, the American Cancer Society (ACS) recommends annual BC screening for average-risk women aged 45 to 54 years; nevertheless, screening decisions need to be taken based on personal characteristics, such as gender, risk score, age, and life expectancy (
16,
35). One of the important objectives of this study was to explore how, when a patient gets older, the cancer risk threshold, as a function of age, changes mathematically. When a patient grows older, life expectancy decreases, and cancer detection becomes less beneficial (
12,
14,
33). Therefore, biopsy referral decisions will be beneficial if the cancer probability is higher than the risk threshold (
14). As a result, the referral threshold for biopsy and treatment should be a non-decreasing function of patient age. All the reviewed studies showed that the risk threshold is non-decreasing concerning age; therefore, when age increases, the referral decision will be optimal if the probability of having cancer increases (
14,
27,
32-
35); nevertheless, cancer screening guidelines have not mentioned this issue, and they have not used an age-dependent referral threshold for each patient in their screening decisions (
12,
15-
18).
Cancer screening in older adults is more complicated. Older patients usually have multi-morbidity (the co-existence of multiple diseases), and the treatment of cancer might have bad effects on other diseases. Furthermore, the probability of dying from cancer might be smaller than the risk of death from other competing diseases (
29). Therefore, after some age, it is not logical to treat a patient, and screening should be stopped. The clinical reasoning behind the way to calculate stopping age is that if the expected future reward from biopsy for a 100% cancer patient is less than the expected future reward for the patient who is never referred for biopsy, then screening should be terminated. Three studies used a stopping age in cancer screening (
14,
33,
35). After this age, even if the cancer probability is 100%, it is not logical to refer a patient for biopsy or treatment.
The quality-adjusted life-year is the most common criterion used in the medical decision-making literature to measure the benefits of interventions for patients (
37). With regard to effectiveness outcome, the findings of this study showed that QALYs were applied in all studies. Six studies only used QALYs as an outcome to measure the health and economic effects (
14,
27,
32-
35).
A screening program is successful if it can attain maximum net benefits (
11). Five studies compared the benefits of optimal policy to guidelines in terms of QALYs. These studies showed that the optimal policy outperforms the guidelines (
14,
27,
32-
35).
Finally, the benefits of screening decrease as patients grow older due to the increased mortality risk and reduction in life expectancy. Therefore, screening intervals might be changed for older patients (
35); however, guidelines suggest a constant screening interval or do not mention screening intervals. For example, ACS recommended annual mammography screening for women aged 50 to 54 years and biennially screening for women aged 55 years and older (
15,
16). Two studies showed that it is necessary to use a variable screening interval based on risk and age to achieve an optimal policy (
32,
35).
The most important limitation was that all studies were performed in the USA, and we had to compare the optimal policies to the American guidelines. All studies, but one, have implemented MDP and POMDP formulation only using QALYs and have not considered screening and treatment costs as factors to optimize policies. Limited resources in the health sector imply that healthcare costs need to be involved in decision-making to allocate resources (
38). If these models can attain QALYs and costs of the optimal policy, cost-effectiveness analysis for optimal policies and national guidelines might be carried out. Thirdly, these studies have only been performed on breast, prostate, and colorectal cancer screening.
5.1. Conclusions
This study shows that cancer screening based on guidelines for all patients has some limitations. MDPs and POMDPs suggest that age and cancer risk are important variables in the decision for cancer screening. These factors can impact screening intervals, optimal policy, and referral decisions for biopsy and treatment. Markov decision processes and POMDPs quantified these effects. These models highlight personalized screening and show that this type of screening can outperform cancer screening guidelines regarding economic and clinical benefits. This study showed that the optimal screening policy is control-limit. There is a cancer risk threshold that is non-decreasing for age, and the screening interval is not necessarily constant for a group of individuals. These models highlight the importance of individual-based screening.