Chen Xu, Yang Feifei, Zhang Peifang, Lin Xixiang, Wang Wenjun, Pu Haitao, Chen Xiaotian, Chen Yixin, Yu Liheng, Deng Yujiao, Liu Bohan, Bai Yongyi, Burkhoff Daniel, He Kunlun
Medical Big Data Research Center, Chinese PLA General Hospital, Beijing, China; Department of Cardiology, The Second Medical Center of Chinese PLA General Hospital, Beijing, China.
Medical Big Data Research Center, Chinese PLA General Hospital, Beijing, China; Department of Cardiology, The Fourth Medical Center of Chinese PLA General Hospital, Beijing, China.
J Am Soc Echocardiogr. 2023 Oct;36(10):1064-1078. doi: 10.1016/j.echo.2023.07.001. Epub 2023 Jul 10.
Clinical assessment and grading of left ventricular diastolic function (LVDF) requires quantification of multiple echocardiographic parameters interpreted according to established guidelines, which depends on experienced clinicians and is time consuming. The aim of this study was to develop an artificial intelligence (AI)-assisted system to facilitate the clinical assessment of LVDF.
In total, 1,304 studies (33,404 images) were used to develop a view classification model to select six specific views required for LVDF assessment. A total of 2,238 studies (16,794 two-dimensional [2D] images and 2,198 Doppler images) to develop 2D and Doppler segmentation models, respectively, to quantify key metrics of diastolic function. We used 2,150 studies with definite LVDF labels determined by two experts to train single-view classification models by AI interpretation of strain metrics or video. The accuracy and efficiency of these models were tested in an external data set of 388 prospective studies.
The view classification model identified views required for LVDF assessment with good sensitivity (>0.9), and view segmentation models successfully outlined key regions of these views with intersection over union > 0.8 in the internal validation data set. In the external test data set of 388 cases, AI quantification of 2D and Doppler images showed narrow limits of agreement compared with the two experts (e.g., left ventricular ejection fraction, -12.02% to 9.17%; E/e' ratio, -3.04 to 2.67). These metrics were used to detect LV diastolic dysfunction (DD) and grade DD with accuracy of 0.9 and 0.92, respectively. Concerning the single-view method, the overall accuracy of DD detection was 0.83 and 0.75 by strain-based and video-based models, and the accuracy of DD grading was 0.85 and 0.8, respectively. These models could achieve diagnosis and grading of LVDD in a few seconds, greatly saving time and labor.
AI models successfully achieved LVDF assessment and grading that compared favorably with human experts reading according to guideline-based algorithms. Moreover, when Doppler variables were missing, AI models could provide assessment by interpreting 2D strain metrics or videos from a single view. These models have the potential to save labor and cost and to facilitate work flow of clinical LVDF assessment.
左心室舒张功能(LVDF)的临床评估和分级需要根据既定指南对多个超声心动图参数进行量化,这依赖于经验丰富的临床医生,且耗时较长。本研究的目的是开发一种人工智能(AI)辅助系统,以促进LVDF的临床评估。
总共使用1304项研究(33404张图像)来开发视图分类模型,以选择LVDF评估所需的六个特定视图。分别使用2238项研究(16794张二维[2D]图像和2198张多普勒图像)来开发2D和多普勒分割模型,以量化舒张功能的关键指标。我们使用由两名专家确定具有明确LVDF标签的2150项研究,通过对应变指标或视频的AI解释来训练单视图分类模型。在388项前瞻性研究的外部数据集中测试了这些模型的准确性和效率。
视图分类模型以良好的敏感性(>0.9)识别出LVDF评估所需的视图,并且视图分割模型在内部验证数据集中成功勾勒出这些视图的关键区域,交并比>0.8。在388例的外部测试数据集中,与两名专家相比,2D和多普勒图像的AI量化显示出较窄的一致性界限(例如,左心室射血分数,-12.02%至9.17%;E/e'比值,-3.04至2.67)。这些指标分别用于检测左心室舒张功能障碍(DD)和对DD进行分级时,准确率分别为0.9和0.92。关于单视图方法,基于应变和基于视频的模型检测DD的总体准确率分别为0.83和0.75,DD分级的准确率分别为0.85和0.8。这些模型可以在几秒钟内实现左心室舒张功能障碍的诊断和分级,大大节省了时间和人力。
AI模型成功实现了LVDF评估和分级,与根据基于指南的算法进行解读的人类专家相比表现出色。此外,当多普勒变量缺失时,AI模型可以通过解释单视图的2D应变指标或视频来提供评估。这些模型有可能节省人力和成本,并促进临床LVDF评估的工作流程。