Suppr超能文献

深度学习模型用于产前先天性心脏病筛查可推广至社区环境,并优于临床检测。

Deep-learning model for prenatal congenital heart disease screening generalizes to community setting and outperforms clinical detection.

机构信息

Division of Cardiology, Department of Medicine, University of California, San Francisco, San Francisco, CA, USA.

Department of Obstetrics, Division of Fetal Medicine, Leiden University Medical Center, Leiden, The Netherlands.

出版信息

Ultrasound Obstet Gynecol. 2024 Jan;63(1):44-52. doi: 10.1002/uog.27503.

Abstract

OBJECTIVES

Despite nearly universal prenatal ultrasound screening programs, congenital heart defects (CHD) are still missed, which may result in severe morbidity or even death. Deep machine learning (DL) can automate image recognition from ultrasound. The main aim of this study was to assess the performance of a previously developed DL model, trained on images from a tertiary center, using fetal ultrasound images obtained during the second-trimester standard anomaly scan in a low-risk population. A secondary aim was to compare initial screening diagnosis, which made use of live imaging at the point-of-care, with diagnosis by clinicians evaluating only stored images.

METHODS

All pregnancies with isolated severe CHD in the Northwestern region of The Netherlands between 2015 and 2016 with available stored images were evaluated, as well as a sample of normal fetuses' examinations from the same region and time period. We compared the accuracy of the initial clinical diagnosis (made in real time with access to live imaging) with that of the model (which had only stored imaging available) and with the performance of three blinded human experts who had access only to the stored images (like the model). We analyzed performance according to ultrasound study characteristics, such as duration and quality (scored independently by investigators), number of stored images and availability of screening views.

RESULTS

A total of 42 normal fetuses and 66 cases of isolated CHD at birth were analyzed. Of the abnormal cases, 31 were missed and 35 were detected at the time of the clinical anatomy scan (sensitivity, 53%). Model sensitivity and specificity were 91% and 78%, respectively. Blinded human experts (n = 3) achieved mean ± SD sensitivity and specificity of 55 ± 10% (range, 47-67%) and 71 ± 13% (range, 57-83%), respectively. There was a statistically significant difference in model correctness according to expert-graded image quality (P = 0.03). The abnormal cases included 19 lesions that the model had not encountered during its training; the model's performance in these cases (16/19 correct) was not statistically significantly different from that for previously encountered lesions (P = 0.41).

CONCLUSIONS

A previously trained DL algorithm had higher sensitivity than initial clinical assessment in detecting CHD in a cohort in which over 50% of CHD cases were initially missed clinically. Notably, the DL algorithm performed well on community-acquired images in a low-risk population, including lesions to which it had not been exposed previously. Furthermore, when both the model and blinded human experts had access to only stored images and not the full range of images available to a clinician during a live scan, the model outperformed the human experts. Together, these findings support the proposition that use of DL models can improve prenatal detection of CHD. © 2023 International Society of Ultrasound in Obstetrics and Gynecology.

摘要

目的

尽管几乎普遍进行了产前超声筛查计划,但仍会漏诊先天性心脏病(CHD),这可能导致严重的发病率,甚至死亡。深度学习(DL)可以实现超声图像的自动识别。本研究的主要目的是评估以前开发的 DL 模型的性能,该模型在一个三级中心的图像上进行训练,并在低风险人群中使用妊娠中期标准异常扫描获得的胎儿超声图像进行评估。次要目的是比较初始筛查诊断,即利用即时护理点的实时成像,与仅评估存储图像的临床医生的诊断。

方法

评估了 2015 年至 2016 年荷兰西北部地区所有孤立性严重 CHD 病例的可存储图像,以及同一地区和时间段的正常胎儿检查样本。我们比较了初始临床诊断(实时进行,可访问实时成像)的准确性、模型(仅可访问存储图像)的准确性以及仅访问存储图像的三名盲法人类专家(与模型相同)的性能。我们根据超声研究的特征(例如,由研究人员独立评分的持续时间和质量)、存储图像的数量以及筛查视图的可用性来分析性能。

结果

共分析了 42 例正常胎儿和 66 例出生时孤立性 CHD 病例。异常病例中,31 例漏诊,35 例在临床解剖扫描时发现(敏感性,53%)。模型的敏感性和特异性分别为 91%和 78%。盲法人类专家(n=3)的平均±标准差敏感性和特异性分别为 55±10%(范围,47-67%)和 71±13%(范围,57-83%)。根据专家分级图像质量,模型的正确性存在统计学显著差异(P=0.03)。异常病例包括模型在训练过程中未遇到的 19 个病变;模型在这些病例中的表现(16/19 正确)与以前遇到的病变无统计学显著差异(P=0.41)。

结论

在超过 50%的 CHD 病例在临床初始阶段漏诊的队列中,以前训练的 DL 算法在检测 CHD 方面的敏感性高于初始临床评估。值得注意的是,该 DL 算法在低风险人群中社区获得的图像中表现良好,包括其以前未接触过的病变。此外,当模型和盲法人类专家都仅访问存储图像而不是临床医生在实时扫描期间可访问的全部范围的图像时,模型的表现优于人类专家。这些发现共同支持使用 DL 模型可以提高产前 CHD 检测的主张。

相似文献

3
Why are congenital heart defects being missed?为什么会漏诊先天性心脏病?
Ultrasound Obstet Gynecol. 2020 Jun;55(6):747-757. doi: 10.1002/uog.20358.

引用本文的文献

本文引用的文献

9
Why are congenital heart defects being missed?为什么会漏诊先天性心脏病?
Ultrasound Obstet Gynecol. 2020 Jun;55(6):747-757. doi: 10.1002/uog.20358.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验