人工智能应用于盆腔妇科肿瘤的超声诊断：一项系统评价和荟萃分析。

Artificial intelligence applied to ultrasound diagnosis of pelvic gynecological tumors: a systematic review and meta-analysis.

作者信息

Geysels Axel, Garofalo Giulia, Timmerman Stefan, Barreñada Lasai, De Moor Bart, Timmerman Dirk, Froyman Wouter, Van Calster Ben

出版信息

Gynecol Obstet Invest. 2025 May 8:1-29. doi: 10.1159/000545850.

OBJECTIVE

To perform a systematic review on artificial intelligence (AI) studies focused on identifying and differentiating pelvic gynecological tumors on ultrasound scans.

METHODS

Studies developing or validating AI models for diagnosing gynecological pelvic tumors on ultrasound scans were eligible for inclusion. We systematically searched PubMed, Embase, Web of Science, and Cochrane Central from their database inception until April 30th, 2024. To assess the quality of the included studies, we adapted the QUADAS-2 risk of bias tool to address the unique challenges of AI in medical imaging. Using multi-level random effects models, we performed a meta-analysis to generate summary estimates of the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity. To provide a reference point of current diagnostic support tools for ultrasound examiners, we descriptively compared the pooled performance to that of the well-recognized ADNEX model on external validation. Subgroup analyses were performed to explore sources of heterogeneity.

RESULTS

From 9151 records retrieved, 44 studies were eligible: 40 on ovarian, three on endometrial, and one on myometrial pathology. Overall, 95% were at high risk of bias - primarily due to inappropriate study inclusion criteria, the absence of a patient-level split of training and testing image sets, and no calibration assessment. For ovarian tumors, the summary AUC for AI models distinguishing benign from malignant tumors was 0.89 (95% CI: 0.85-0.92). In lower-risk studies (at least three low-risk domains), the summary AUC dropped to 0.87 (0.83-0.90), with deep learning models outperforming radiomics-based machine learning approaches in this subset. Only five studies included an external validation, and six evaluated calibration performance. In a recent systematic review of external validation studies, the ADNEX model had a pooled AUC of 0.93 (0.91-0.94) in studies at low risk of bias. Studies on endometrial and myometrial pathologies were reported individually.

CONCLUSION

Although AI models show promising discriminative performances for diagnosing gynecological tumors on ultrasound, most studies have methodological shortcomings that result in a high risk of bias. In addition, the ADNEX model appears to outperform most AI approaches for ovarian tumors. Future research should emphasize robust study designs - ideally large, multicenter, and prospective cohorts that mirror real-world populations - along with external validation, proper calibration, and standardized reporting.

REGISTRATION

This study was pre-registered with Open Science Framework (OSF): https://doi.org/10.17605/osf.io/bhkst.

目的

对聚焦于在超声扫描中识别和区分盆腔妇科肿瘤的人工智能（AI）研究进行系统评价。

方法

开发或验证用于在超声扫描中诊断妇科盆腔肿瘤的AI模型的研究符合纳入标准。我们从数据库建立至2024年4月30日，系统检索了PubMed、Embase、Web of Science和Cochrane Central。为评估纳入研究的质量，我们采用QUADAS-2偏倚风险工具来应对AI在医学成像中的独特挑战。使用多层次随机效应模型，我们进行了荟萃分析以生成受试者工作特征曲线下面积（AUC）、敏感性和特异性的汇总估计值。为给超声检查人员提供当前诊断支持工具的参考点，我们将汇总性能与外部验证中公认的ADNEX模型的性能进行了描述性比较。进行亚组分析以探索异质性来源。

结果

从检索到的9151条记录中，44项研究符合条件：40项关于卵巢，3项关于子宫内膜，1项关于子宫肌层病变。总体而言，95%的研究存在高偏倚风险——主要是由于研究纳入标准不当、缺乏训练和测试图像集的患者水平划分以及未进行校准评估。对于卵巢肿瘤，区分良性与恶性肿瘤的AI模型的汇总AUC为0.89（95%CI：0.85 - 0.92）。在低风险研究（至少三个低风险领域）中，汇总AUC降至0.87（0.83 - 0.90），在此子集中深度学习模型优于基于放射组学的机器学习方法。只有5项研究纳入了外部验证，6项评估了校准性能。在最近一项外部验证研究的系统评价中，ADNEX模型在低偏倚风险研究中的汇总AUC为0.93（0.91 - 0.94）。关于子宫内膜和子宫肌层病变的研究单独报告。

结论

尽管AI模型在超声诊断妇科肿瘤方面显示出有前景的判别性能，但大多数研究存在方法学缺陷，导致高偏倚风险。此外，ADNEX模型在卵巢肿瘤方面似乎优于大多数AI方法。未来的研究应强调稳健的研究设计——理想情况下是反映真实世界人群的大型、多中心和前瞻性队列——以及外部验证、适当校准和标准化报告。

注册情况

本研究已在开放科学框架（OSF）上预注册：https://doi.org/10.17605/osf.io/bhkst。

Artificial intelligence applied to ultrasound diagnosis of pelvic gynecological tumors: a systematic review and meta-analysis.

作者信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

REGISTRATION

目的

方法

结果

结论

注册情况

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献