I J Radiol

Image Credit:I J Radiol

Comparative Image Quality of Deep Learning Reconstruction Against Conventional Reconstruction for T2-weighted Prostate Imaging: A Systematic Review and Meta-Analysis

Author(s):
Narges AziziNarges AziziNarges Azizi ORCID1, Hoda BorooghaniHoda Borooghani2, Manouchehr Nasrollahzadeh SaraviManouchehr Nasrollahzadeh SaraviManouchehr Nasrollahzadeh Saravi ORCID3, Sina DelazarSina DelazarSina Delazar ORCID1,*
1Advanced Diagnostic and Interventional Radiology Research Center (ADIR), Tehran University of Medical Sciences, Tehran, Iran
2Iran University of Medical Science, Tehran, Iran
3Shahid Beheshti University of Medical Science, Tehran, Iran

IJ Radiology:Vol. 23, issue 1; e168277
Published online:May 03, 2026
Article type:Systematic Review
Received:Nov 22, 2025
Accepted:Apr 06, 2026
How to Cite:Azizi N, Borooghani H, Nasrollahzadeh Saravi M, Delazar S. Comparative Image Quality of Deep Learning Reconstruction Against Conventional Reconstruction for T2-weighted Prostate Imaging: A Systematic Review and Meta-Analysis. I J Radiol. 2026;23(1):e168277. doi: https://doi.org/10.5812/iranjradiol-168277

Abstract

Context:

Deep learning reconstruction is increasingly proposed to accelerate prostate T2-weighted magnetic resonance imaging (MRI) acquisition without loss of diagnostic quality. However, the magnitude of image quality benefits and the influence of deep learning network design remain unclear.

Objectives:

We aimed to compare the image quality of deep learning versus conventional prostate T2 MRI reconstruction and assess how acceleration factors and network architecture influence performance.

Methods:

A PRISMA search of PubMed/Medline, Embase, Web of Science, and Scopus (22 February 2026) identified studies comparing deep learning with conventional T2 MRI reconstruction. Paired standardized mean change (SMCC) values were pooled with a three-level random-effects model from extracted paired image-quality scores; meta-regressions examined scan-time acceleration. Small-study bias was tested with a precision-effect method.

Results:

Five studies (602 participants; 20 comparisons) included in the study. Overall deep learning reconstruction benefit was +0.14 standard deviation (SD) [95% confidence interval (CI) -2.00 to 2.28; I2 = 99 %]. At equal scan times deep learning reconstruction improved quality by +2.09 SD; for each fold increase in acceleration, image quality declined by 0.48 SD (95% CI -0.91 to -0.05; P = 0.030). C-SENSE AI at acceleration factors of 1.7, 3.4, and 4.8 demonstrated superior performance, whereas CycleGAN showed inferior results. Small-study effects were evident.

Conclusion:

Deep learning reconstruction can increase prostate T2-weighted MRI speed without significant changes in overall image quality; however, overly aggressive implementations may lead to image degradation. Careful validation and protocol-specific tuning are therefore essential prior to clinical adoption.

1. Context

Prostate cancer is one of the most frequent cancers among men and has high mortality and morbidity rates globally (1-3). Although many cases show slow progression, some forms are more aggressive, necessitating precise risk assessment and treatment (1, 4-5). Magnetic resonance imaging (MRI) now facilitates prostate cancer diagnosis, localization, staging, and follow-up because of its suitable soft-tissue contrast; multiparametric MRI (mpMRI) remains the standard protocol, whereas biparametric MRI (bpMRI) provides a faster, contrast-free alternative with fewer side effects (4-8).

T2-weighted imaging (T2WI) is part of both mpMRI and bpMRI and provides important anatomical details of prostate zones (9). In daily practice, image quality fluctuates because sequence parameters, patient motion, and noise can blur fine details. Although conventional reconstruction methods, such as fast spin-echo reconstruction with parallel imaging or compressed sensing, partially recover lost signal, residual noise and artifacts may still obscure small tumors (6, 10-11).

Recent advances in deep learning have changed MRI reconstruction by improving the signal-to-noise ratio (SNR), spatial resolution, and acquisition time (3, 6, 9, 12). Nevertheless, current deep learning implementations still entail trade-offs: performance may vary with deep learning architecture, parameter changes, or scanners, and the balance between accelerated acquisition and preserved image quality remains incompletely defined (12-14).

2. Objectives

We conducted a systematic review and meta-analysis comparing deep learning versus conventional T2-weighted reconstructions in prostate MRI, with the aim of determining whether deep learning techniques provide clinically meaningful gains in image quality and pinpointing the architectures most suitable for routine practice.

3. Methods

The study followed the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines (15-16) and was registered in the PROSPERO database (CRD42024613297).

3.1. Search Strategy

A systematic literature search was conducted across PubMed/MEDLINE, Embase, Web of Science, and Scopus for studies published up to February 22, 2026. We included original English-language articles using the following MeSH terms: 'Prostate' OR 'Prostatic Neoplasms'; 'Magnetic Resonance Imaging' OR 'Diagnostic Imaging'; and 'Deep Learning' OR 'Artificial Intelligence' OR 'Image Processing, Computer-Assisted' OR 'Image Enhancement' (see Supplementary Information, Tables S1, S2, and S3). In addition, the reference lists and citations of relevant articles were manually screened.

3.2. Study Selection

Two reviewers independently screened all retrieved studies, with any disagreements resolved by a third reviewer. We included original, English-language studies, in which men underwent prostate imaging and assessed deep learning reconstruction of T2-weighted MRI compared to conventional reconstruction methods. Studies were eligible only if they reported image quality assessment as an outcome by mean and standard deviation (SD) for both of the deep learning model and the conventional method. Reviews, conference proceedings, non-English studies, and studies lacking the required data were excluded.

3.3. Data Extraction and Study-Level Internal Validity

Data extraction was performed independently by two blinded reviewers using a standardized spreadsheet template, with discrepancies resolved by a third reviewer. Extracted data included study characteristics, MRI acquisition parameters (both conventional and deep learning reconstructions), sequence types, reconstruction models, image quality metrics (mean and SD for image noise, sharpness, lesion detectability, artifacts, overall image quality, and diagnostic confidence), and other technical parameters such as voxel size, acquisition time, field strength, and bandwidth. The complete data structure is available in (Online spreadsheet). Study-level internal validity was assessed using the QUADAS tool by two independent reviewers and visualized using the QUADAS module in Stata 17 (StataCorp, College Station, TX, USA) (17).

3.4. Statistical Analysis

Paired Standardized Mean Change: Image quality was treated as a pseudo-continuous variable. The within-group standardized mean change (SMCC) served as the effect size, accounting for the paired nature of observations and the varying number of levels across Likert scales. The correlation between paired observations, required for standard error estimation, was empirically determined as r = 0.85 using in-house data. A three-level random-effects meta-analysis was performed to account for multiple, non-independent observations within the same cohort. All analyses were implemented using the metafor package (version 4.8-0) in R (version 4.4.1). The heterogeneity was quantified using the I2 statistic.

3.5. Leave-One-Out Analysis

Leave-One-Out Analysis: A leave-one-study-out sensitivity analysis was performed to assess the robustness of the pooled effect estimates. The three-level random-effects meta-analysis was re-estimated repeatedly after sequential exclusion of each study, while preserving the original hierarchical structure with random effects specified at the study, model-within-study, and reader-within-study levels. Changes in the pooled SMCC, 95% confidence intervals, and statistical significance were examined across all iterations to identify potential undue influence of individual studies.

3.6. Meta-Regression by Acceleration

Meta-Regression by Acceleration: To evaluate whether the degree of scan acceleration influenced image quality, we performed a meta-regression. The acceleration factor was included as a continuous moderator within the three-level random-effects analysis. The intercept represents the pooled SMCC when acquisition speed is equivalent to the reference protocol (acceleration = 1), and the slope quantifies the change in SMCC associated with each one-unit increase in acceleration. The omnibus QM test was used to assess whether acceleration explained a significant portion of between-effect heterogeneity. Statistical significance was defined as P ≤ 0.05, and meta-regression results are illustrated using bubble plots.

3.7. Risk of Bias Assessment

Risk of Bias Assessment: Small-study effects and publication bias were examined with a multilevel funnel plot and a Precision-Effect Test (PET). The PET intercept gives the bias-adjusted SMD at infinite precision, and a positive slope with P ≤ 0.05 suggests selective reporting due to the larger effects in less precise studies. Risk of bias is illustrated with a funnel plot overlaid by the Precision-Effect Test regression line.

4. Results

The meta-analysis includes twenty paired deep learning versus conventional comparisons from five studies, comprising 602 participants, with a mean age of 66.6 ± 1.74 years (Figure 1). Patients with suspected prostate cancer, histologically proven prostate cancer, and screening or follow-up imaging were included for prostate imaging. The regional distribution of studies consists of three from the United States, one from Germany, and one from the United Kingdom. Patient characteristics are summarized in Table 1. Study-level internal validity shows most concerns centered on reporting indeterminate results, whereas reference and test-execution details were sound in nearly all papers. The distribution of ratings is illustrated in Figures S1 and S2.

Table 1.Demographic and Clinical Characteristics of Patients in the Included Studies
StudyYearCountryParticipants, No.Age (mean ± SD)PSA (ng/mL)Included Patients
Liu et al. (22)2021United States346N/AN/ASuspected prostate cancer
Johnson et al. (23)2022United States11368.0 ± 7.0N/ASuspected prostate cancer
Harder et al. (21)2022Germany2364.4 ± 6.214.4 ± 18.4Histologically proven prostate cancer
Tong et al. (18)2023United States8066 ± N/AN/ASuspected prostate cancer
Lee et al. (25)2023United Kingdom4066 ± N/A4.70 (median; IQR 2.9)Suspected prostate cancer

Abbreviation: N/A, Not Available; PSA, prostate specific antigen; SD, standard deviation; IQR, interquartile range.

PRISMA 2020 flow diagram.
Figure 1.

PRISMA 2020 flow diagram.

4.1. MRI Sequences and Deep Learning Methods

The included studies evaluated deep learning-based reconstruction methods primarily in T2-weighted prostate MRI and axial planes (Table 2). Conventional turbo/fast spin-echo (TSE/FSE) T2-weighted sequences served as the reference standard across all studies. Deep learning approaches encompassed a range of architectures and vendor-specific implementations, including variational networks (Siemens), GAN-based super-resolution models (CycleGAN), compressed sensing combined with AI (C-SENSE AI) at 1.7, 3.4, and 4.8 acceleration factors, and AIR Recon DL (GE Healthcare) applied with low, medium, and high denoising levels and acquisition of standard, fast, and high-resolution. All examinations were performed at 3-Tesla field strength on systems from Siemens, Philips, or GE Healthcare.

Table 2.Technical Settings and Results Reported in the Included Studies
Study, YearDeep Learning ReconstructionReferenceVendorSNR (model / ref)Scan Time (s), Model / RefOverall Image Quality (DL Model / Ref)Likert Scale
Liu et al. (22)CycleGAN super-res. T2-SRStandard T2W-TSESiemensN/A75 / 750R1: 3.11 ± 0.41 / 3.81 ± 0.424
Johnson et al. (23)Variational-Network DLGRAPPA (parallel)SiemensN/A42 / 232.2R1: 3.90 ± 0.64 / 4.00 ± 0.56; R2: 3.80 ± 0.89 / 4.35 ± 0.74; R3: 4.55 ± 0.60 / 4.60 ± 0.50; R4: 3.60 ± 1.00 / 3.65 ± 0.995
Harder et al. (21)C-SENSE AI 1.7C-SENSE 1.7Philips7.68 ± 1.50 / 4.30 ± 0.47285 / 285Mean of 4 readers: 5.06 ± 0.79 / 4.60 ± 1.176
Harder et al. (21)C-SENSE AI 3.4C-SENSE 1.7Philips6.61 ± 1.78 / 4.30 ± 0.47156 / 285Mean of 4 readers: 5.34 ± 0.69 / 3.05 ± 0.766
Harder et al. (21)C-SENSE AI 4.8C-SENSE 1.7Philips5.79 ± 0.72 / 4.30 ± 0.47119 / 285Mean of 4 readers: 4.28 ± 0.51 / 3.05 ± 0.766
Tong et al. (18)Variational-Network DLConventional Cartesian T2-TSESiemensN/A68 / 226R1: 3.89 ± 0.39 / 3.72 ± 0.53; R2: 3.31 ± 0.74 / 3.33 ± 0.82; R3: 3.51 ± 0.62 / 3.67 ± 0.634
Lee et al. (25)AIR Recon DL - Standard, Low denoisingT2WI SoC (DLR off)GE HealthcareMedian 12.38 (IQR 3.11) / Median 10.07 (IQR 2.12)362 / 3623.42 ± 0.98 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - Standard, Medium denoisingT2WI SoC (DLR off)GE HealthcareMedian 14.24 (IQR 4.17) / Median 10.07 (IQR 2.12)362 / 3623.35 ± 1.14 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - Standard, High denoisingT2WI SoC (DLR off)GE HealthcareMedian 17.28 (IQR 6.49) / Median 10.07 (IQR 2.12)362 / 3623.12 ± 1.07 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - Fast, Low denoisingT2WI SoC (DLR off)GE HealthcareMedian 11.65 (IQR 3.47) / Median 10.07 (IQR 2.12)244 / 3623.58 ± 1.01 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - Fast, Medium denoisingT2WI SoC (DLR off)GE HealthcareMedian 14.13 (IQR 4.74) / Median 10.07 (IQR 2.12)244 / 3623.78 ± 0.89 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - Fast, High denoisingT2WI SoC (DLR off)GE HealthcareMedian 18.70 (IQR 6.74) / Median 10.07 (IQR 2.12)244 / 3623.38 ± 0.95 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - High-resolution, Low denoisingT2WI SoC (DLR off)GE HealthcareMedian 10.57 (IQR 2.87) / Median 10.07 (IQR 2.12)244 / 3623.30 ± 0.88 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - High-resolution, Medium denoisingT2WI SoC (DLR off)GE HealthcareMedian 10.57 (IQR 3.61) / Median 10.07 (IQR 2.12)244 / 3623.40 ± 0.98 / 3.23 ± 0.925
Lee et al. (25)AIR Recon DL - High-resolution, High denoisingT2WI SoC (DLR off)GE HealthcareMedian 13.17 (IQR 4.43) / Median 10.07 (IQR 2.12)244 / 3623.12 ± 0.76 / 3.23 ± 0.925

Abbreviations: DL, deep learning; SNR, signal-to-noise ratio; R, reader; N/A, not available; CI, confidence interval; IQR, interquartile range; SoC, standard of care; GRAPPA, generalized autocalibrating partially parallel acquisitions; DLR, deep learning reconstruction; CycleGAN, GAN-based super-resolution models; C-SENSE AI, compressed sensing combined with AI; SR, super resolution; T2WI, T2 weighted imaging; TSE: turbo spin echo; Ref, reference; Recon, reconstruction.

4.2. Overall Image-Quality Paired Standardized Mean Change

Overall Image-Quality Paired Standardized Mean Change: The pooled SMCC for overall image quality was 0.14, which did not reach statistical significance [SE = 1.11; z = 0.15; P = 0.88; 95% confidence interval (CI) -2.00 to 2.28] (Figure 2). Heterogeneity was substantial (Q19 = 854.3, P < 0.001). Variance component analysis demonstrated that the largest proportion of true variability arose from between-study differences (σ² = 1.91), followed by variability between models within studies (σ² = 0.31) and readers within studies (σ² = 0.25). When expressed relative to total variance, heterogeneity was almost entirely attributable to real between-unit differences, yielding a total I2 of 99.9%. Sensitivity analyses assuming alternative within-pair correlations (r = 0.70 and 0.50) produced comparable pooled estimates, indicating robustness of the findings. Sensitivity analyses assuming lower within-pair correlations yielded comparable results. Using r = 0.70, the pooled SMCC was 0.12 (SE 0.80; z = 0.15, P = 0.88; 95% CI -1.44 to 1.68), and using r = 0.50, the pooled SMCC was 0.10 (SE 0.63; z = 0.16, P = 0.87; 95% CI -1.12 to 1.33). In all sensitivity analyses, the overall effect remained statistically non-significant, with overlapping confidence intervals and substantial residual heterogeneity.

Forest plot of the paired standardized mean change (SMCC) in overall image quality comparing deep-learning versus standard T2-weighted reconstructions across different models and readers. Squares show individual study effects (size proportional to study weight); horizontal bars represent 95% confidence intervals (CIs). The diamond indicates the overall pooled SMCC, and the vertical line at 0 indicates no difference in image quality-values to the right favor deep learning reconstruction. SoC: Standard of Care; HR: High Resolution; DL, deep learning.
Figure 2.

Forest plot of the paired standardized mean change (SMCC) in overall image quality comparing deep-learning versus standard T2-weighted reconstructions across different models and readers. Squares show individual study effects (size proportional to study weight); horizontal bars represent 95% confidence intervals (CIs). The diamond indicates the overall pooled SMCC, and the vertical line at 0 indicates no difference in image quality-values to the right favor deep learning reconstruction. SoC: Standard of Care; HR: High Resolution; DL, deep learning.

4.3. Leave-One-Out Analysis

Leave-One-Out Analysis: Leave-one-study-out sensitivity analysis at the study level showed that the overall results were robust to the exclusion of any single study. After sequentially removing each study, the pooled SMCC estimates ranged from -0.42 to 0.53, and in all iterations the 95% confidence intervals crossed zero (all p-values > 0.30). No individual study exerted a disproportionate influence on the pooled effect estimate (Supplementary figure S3).

4.4. Acceleration Meta-Regression

Acceleration Meta-Regression: Including acquisition-time acceleration in our three-level meta-regression revealed a statistically significant trade-off between faster imaging and image quality (Figure 3). At an acceleration of 1.0, the pooled SMCC was estimated at 2.21 SD (95% CI -0.17 to 4.58; P = 0.069), indicating a non-significant trend toward improved image quality under near-isochronous conditions. For each fold increase in acceleration, image quality declined by 0.48 SD (95% CI -0.91 to -0.05; P = 0.030). Substantial residual heterogeneity persisted (QE18 = 399.5, P < 0.0001). Given the limited number of included effects, these findings should be interpreted as exploratory.

Bubble plot illustrating the meta-regression of scan-time acceleration versus paired standardized mean change (SMCC) in overall image quality.
Figure 3.

Bubble plot illustrating the meta-regression of scan-time acceleration versus paired standardized mean change (SMCC) in overall image quality.

4.5. Risk-of-Bias Analysis

Risk-of-Bias Analysis Visual inspection of the funnel plot, together with PET line (Figure 4), suggests the presence of small-study effects for Image-Quality. The PET slope (β1 = 8.86, P < 0.0001) shows a positive association between study-level sampling error and reported effect size, indicating that smaller studies tend to report larger estimates of deep learning image quality.

Funnel plot with precision-effect test (PET) line assessing small-study effects.
Figure 4.

Funnel plot with precision-effect test (PET) line assessing small-study effects.

5. Discussion

Deep learning reconstruction evaluations have largely focused on image quality and methodological heterogeneity and variability in quality metrics restricted quantitative comparisons. However, our study indicates that, in prostate imaging, deep learning reconstruction neither significantly improves nor degrades the overall image quality of T2-weighted prostate MRI compared with conventional reconstruction, while enabling reductions in scan time. Meta-regression analysis demonstrated that faster deep learning models compared to conventional models are associated with significant decline in image quality.

These results are clinically important given the increasing need to shorten MRI examination times (18). Nevertheless, a critical question in deep learning models implementation is whether faster models come at the cost of image quality and finally diagnostic accuracy. Our meta-regression between acceleration and image quality differences showed that faster deep-learning models relative to conventional reconstruction were significantly associated with reduced image quality, corresponding to a decline of 0.48 SD for each one-fold increase in speed. Importantly, the observed heterogeneity necessitates cautious interpretation of these results. A previous study showed that radiologists can comfortably use deep learning acceleration in the range of 2-3 for routine imaging without decreasing diagnostic confidence (19). Ursprung et al. study suggests deep learning reconstruction can shorten prostate MRI acquisitions while often preserving image quality, but the evidence base is highly heterogeneous across sequences, vendors, study designs (20). Radiologists thus face a dilemma in choosing the acceleration, balancing patient burden and comfort against the need for anatomic details. Clinical teams need to create models with the acceptable acceleration level based on their aim. As an example, using higher acceleration where some image blurring is acceptable (screening or routine follow-up) and using lower acceleration when detail must be visualized.

A significant concern in deep learning reconstruction is the occurrence of hallucinations, where realistic-looking but incorrect image details are added or removed. This issue arises when there are multiple plausible solutions to the inverse image reconstruction problem. A model, influenced by the patterns learned from training data, may favor solutions that introduce fabricated features (20). Deep learning models can also cause over-smoothing, particularly in the transition zone, which may mimic the charcoal sign, leading to false positives or lost subtle findings near the capsule, resulting in false negatives (20). The risk of hallucinations and artifacts increases with stronger acceleration and decreased sampling rates. Artifacts can also vary based on algorithm settings, vendors, scanners, and patient anatomy (20).

Among the models assessed, C-SENSE AI by Harder et al, delivered the best image quality over their respective conventional models (21). In contrast, GAN with CycleGAN for super-resolution reconstruction degraded image quality compared to its conventional models (22). These differences can be traced to balances in under-sampling, noise suppression, detail preservation, and acceleration, pointing the need for careful selection and validation of deep learning models in clinical setting. C-SENSE AI paired modest k-space under-sampling with a multiscale CNN embedded directly in the raw-data (21). They kept the acceleration within a range that preserved SNR and concentrated on denoising and edge sharpening rather than hallucinating missing anatomy (21). By contrast, architectures that pushed acceleration or super-resolution too far fared poorly. A CycleGAN super-resolution model up-sampled thick 2-D T2-TSE slices to 0.75 mm isotropic but introduced striping, hallucinated textures, and uneven high-frequency noise, leading readers to rate it below the conventional model (22). In evaluated underperformed models, resolution or speed was gained at the expense of raw SNR and new artifacts the network could not fully correct. Vendor-specific training limits how well they generalize, and radiologists still encounter over-smoothed or noisy images even when metrics look good, so each center should validate any deep learning tool on its own scanners, especially in thin-slice or under-sampled protocols where weaknesses emerge first.

Our findings provide several implications for radiologists and clinical practice. Firstly, a radiologist can expect that a well-trained deep learning reconstruction algorithm will preserve diagnostic information present in a conventional T2-weighted image. Secondly, the ability to shorten MRI exams without loss of information directly benefits patient care. Prostate MRI, which traditionally could take 30-45 minutes for a full multiparametric study, may be shorten into a sub-10-minute with deep learning acceleration (23-24). Shorter scan times translate to less time that patients must remain in the scanner, alleviating discomfort (especially important for elderly patients or those with pain) and reducing anxiety for claustrophobic individuals. They also diminish the likelihood of motion artifacts, since there is a smaller window during which the patient might move. By increasing throughput, faster exams allow more patients to be scanned per day, which can help address long MRI waiting lists and improve access to prostate MRI as a screening or diagnostic tool. However, our findings also provide a note of caution. Clinicians need to be aware that deep learning reconstructions may sometimes introduce changes to image appearance (smoothing, minor artifacts) that differ from conventional images. Training and experience are needed to recognize normal vs. abnormal after deep learning processing. There can be a learning curve which radiologists must get comfortable with the DL-enhanced images and ensure they detect any failure case.

This study is not without limitations. First, the number of studies available for inclusion was relatively small (five studies with 20 comparisons), showing the early stage of this technology’s evaluation. The high heterogeneity we observed indicates that results varied widely between settings. We addressed some of this by meta-regressions on acceleration, but other sources of heterogeneity likely remain, such as how conventional reconstruction was defined. Second, we observed asymmetry in funnel plots, with smaller studies tending to report larger deep learning benefits. Third, our meta-analysis focused on image quality rather than direct patient outcomes. Future research should evaluate clinical end-points such as diagnostic accuracy, sensitivity, and specificity. Fourth, the rapid evolution of deep learning models means our meta-analysis is a moving target. It will be important to periodically update analyses to include newer studies and possibly head-to-head comparisons of different deep learning approaches. Multi-center trials with a bulk of patients, randomizing or crossing over between deep learning and conventional reconstructions, could provide stronger evidence of diagnostic performance and capture fewer failure modes. Likert-type scales with ≥4 ordered categories were treated as semi-continuous measures, while acknowledging that the assumption of equal spacing between adjacent categories may not strictly hold. Assessment of small-study effects using the Precision-Effect Test (PET) was limited by the small number of included studies. As PET has low statistical power when fewer than ten studies are available, these results should be interpreted with caution and viewed as exploratory rather than confirmatory.

To put these findings into clinical context, we assessed the certainty of evidence using a GRADE framework. The overall certainty for image-quality outcomes was rated as low. This rating reflects the substantial inter-study heterogeneity, the use of surrogate image-quality metrics instead of direct clinical outcomes (indirectness), and the limited precision resulting from the small number of included studies. Therefore, while deep learning reconstruction shows promise for scan-time acceleration, our results should be interpreted as exploratory rather than definitive.

Footnotes

References


Crossmark
Crossmark
Checking
Share on
Metrics

Purchasing Reprints

  • Copyright Clearance Center (CCC) handles bulk orders for article reprints for Brieflands. To place an order for reprints, please click here (   https://www.copyright.com/landing/reprintsinquiryform/ ). Clicking this link will bring you to a CCC request form where you can provide the details of your order. Once complete, please click the ‘Submit Request’ button and CCC’s Reprints Services team will generate a quote for your review.
Search Relations

Author(s):

Related Articles