The present study investigated the clinical significance of carpal bone analysis in improving the accuracy of BAAs. Additionally, a hybrid model was established by incorporating a carpal bone analysis using an open dataset. Both models demonstrated human expert-level accuracy, with an upper 95% CI of < 0.5 for the MAD between the BAA models and the reference standard. The MAD values for the M1 and M2 models were 0.366 and 0.388 years, respectively, which are similar to or lower than those reported in previous research (
21-
23). The current study showed that the model integrating a carpal bone analysis was more accurate than the model without a carpal bone analysis. Particularly, the model using a carpal bone analysis showed significantly improved accuracy in pre-pubertal patients.
The classical method of BAA in children is based on bone growth in the phalanges, carpal bones, and wrist joints. Carpal bone maturity varies widely, and the analysis of this variable does not provide accurate or significant information in patients > 7 years of age (
24). In girls aged >6 years or boys > 8 years, a phalangeal feature analysis yields more reliable information than a carpal bone analysis; accordingly, most recent deep learning methods do not utilize the carpal bones for analysis (
9-
14). However, carpal bones are a vital and distinctive feature of skeletal maturity, which provide accurate and significant information for determining the bone age of young children (
3). Combined with the existing phalangeal and wrist joint analyses, a carpal bone analysis significantly improves the accuracy of BAAs in young children.
Several studies have used carpal bone extraction to calculate the bone age (
25-
27). Most of these studies focused on classical image segmentation rather than bone assessments, without applying a deep learning algorithm. Meanwhile, the accuracy of BAAs was unacceptable. In this regard, Hao et al. (
28) exploited carpal bones using a regression CNN to evaluate the bone age. Although they achieved an accuracy of 90.15% within six months from the ground truth for males, their study was only performed on individuals with a chronological age of 0 - 6 years, and a limited BAA was performed for children of all ages.
Additionally, Iglovikov et al. (
29) trained several deep network architectures using different parts of images (whole hand, carpal bones, and metal carpals/proximal phalanges) to evaluate the contribution of various bones to the model performance. Moreover, the linear ensemble of these regional models outperformed all the aforementioned models, which is consistent with our results. However, our study reported a higher accuracy and a lower MAD compared to the study conducted by Iglovikov et al. (
29) with the lowest MAD at 6.1 months.
Recently, the survival rates of infants in all preterm and birth weight categories have improved owing to advances in obstetric and perinatal care (
15). As the survival rates of preterm and low-birth-weight infants increase, concerns regarding the developmental outcomes and growth of survivors also increase. Most infants exhibit an accelerated compensatory growth pattern (known as “catch-up growth”), which is usually completed by two years of age (
30). Nevertheless, in the absence of compensatory growth, infants are unlikely to grow to the extent of their peers and reach their target height in adulthood. Consequently, the need for recombinant human growth hormone therapy has increased in these young children (
31), and the need for BAA for these young children has also increased. Our model incorporating a carpal bone analysis yielded better predictions compared to the model without a similar analysis in prepubertal men, suggesting its improved accuracy in young children.
Our hybrid model overcame the limitations of GP and TW3 methods by focusing on regions that are highly related to changes in bone maturity and by applying finer-grained maturity stages compared to TW3, yielding a reliable and accurate bone age estimate (
18). Similar to our approach, the BoneXpert (Visiana Aps, Holte, Denmark; http://www.boneexpert.com), an automated commercially available BAA system (recently released, version 3.0), is used to evaluate the bone age using both GP and alternative TW2 methods (
32). However, unlike our model, the BoneXpert is based on a feature extraction technique, which reconstructs the boundaries of 15 bones (i.e., metacarpals, phalanges, distal radius, and ulna) (
33). Overall, the present results showed that the accuracy of bone age predictions improved remarkably for males versus females, which is consistent with the results of a previous carpal bone analysis (
25,
29), probably due to the fact that girls mature sooner than boys. This finding is evident in the dataset applied in the experiment, where the carpal bones of girls ossified at a much earlier age than those of boys.
The present study had several limitations. First, although previous studies reported ethnic differences in the growth patterns of certain age groups (
34), our study was limited to patients of a single ethnicity. Therefore, a prospective multicenter study with a large sample size is required. Second, our model required radiographs of diagnostic quality. The model itself did not evaluate the radiographic quality, such as the rotation or incomplete filming of the left hand. Third, our model could not detect traumatic or congenital deformities. Finally, the reference standard for the external validation set was based on the GP Atlas, which is inherently limited to young children. There are also differences in the nutritional and ethnic characteristics of children today compared to those during the 1930’s and 1940’s, which were used to generate standards.
In conclusion, the present results showed the improved accuracy of the hybrid GP and modified TW model for BAA by integrating a carpal bone analysis. Particularly, the model utilizing a carpal bone analysis showed a significantly improved predictive ability in pre-pubertal patients; therefore, it can be sufficiently used in young children.