Research and Development, PAII Inc., Bethesda, MD 20817, United States.
Department of Gastroenterology and Hepatology, Chang Gung Memorial Hospital, Linkou Medical Center, Taoyuan 33305, Taiwan.
World J Gastroenterol. 2022 Jun 14;28(22):2494-2508. doi: 10.3748/wjg.v28.i22.2494.
Hepatic steatosis is a major cause of chronic liver disease. Two-dimensional (2D) ultrasound is the most widely used non-invasive tool for screening and monitoring, but associated diagnoses are highly subjective.
To develop a scalable deep learning (DL) algorithm for quantitative scoring of liver steatosis from 2D ultrasound images.
Using multi-view ultrasound data from 3310 patients, 19513 studies, and 228075 images from a retrospective cohort of patients received elastography, we trained a DL algorithm to diagnose steatosis stages (healthy, mild, moderate, or severe) from clinical ultrasound diagnoses. Performance was validated on two multi-scanner unblinded and blinded (initially to DL developer) histology-proven cohorts (147 and 112 patients) with histopathology fatty cell percentage diagnoses and a subset with FibroScan diagnoses. We also quantified reliability across scanners and viewpoints. Results were evaluated using Bland-Altman and receiver operating characteristic (ROC) analysis.
The DL algorithm demonstrated repeatable measurements with a moderate number of images (three for each viewpoint) and high agreement across three premium ultrasound scanners. High diagnostic performance was observed across all viewpoints: Areas under the curve of the ROC to classify mild, moderate, and severe steatosis grades were 0.85, 0.91, and 0.93, respectively. The DL algorithm outperformed or performed at least comparably to FibroScan control attenuation parameter (CAP) with statistically significant improvements for all levels on the unblinded histology-proven cohort and for "= severe" steatosis on the blinded histology-proven cohort.
The DL algorithm provides a reliable quantitative steatosis assessment across view and scanners on two multi-scanner cohorts. Diagnostic performance was high with comparable or better performance than the CAP.
肝脂肪变性是慢性肝病的主要原因。二维(2D)超声是最广泛用于筛查和监测的非侵入性工具,但相关诊断具有高度主观性。
开发一种可扩展的深度学习(DL)算法,用于从 2D 超声图像定量评分肝脂肪变性。
使用来自 3310 名患者、19513 项研究和 228075 张图像的多视图超声数据,对接受弹性成像的患者进行回顾性队列研究,我们训练了一种 DL 算法,根据临床超声诊断诊断肝脂肪变性阶段(健康、轻度、中度或重度)。在两个多扫描仪未盲和盲(最初对 DL 开发人员)组织学证实的队列(147 名和 112 名患者)中验证了性能,这些队列具有组织病理学脂肪细胞百分比诊断和 FibroScan 诊断的子集。我们还量化了跨扫描仪和视角的可靠性。结果使用 Bland-Altman 和接收者操作特征(ROC)分析进行评估。
DL 算法在具有中等数量图像(每个视角三个)的情况下显示可重复的测量值,并且在三个优质超声扫描仪之间具有高度一致性。在所有视角均观察到高诊断性能:用于分类轻度、中度和重度脂肪变性等级的 ROC 曲线下面积分别为 0.85、0.91 和 0.93。DL 算法的表现优于或至少与 FibroScan 控制衰减参数(CAP)相当,在未盲组织学证实队列的所有级别和盲组织学证实队列的“=严重”脂肪变性方面均具有统计学意义的改善。
该 DL 算法在两个多扫描仪队列的多个视角和扫描仪上提供了可靠的定量脂肪变性评估。诊断性能很高,与 CAP 相比具有可比性或更好的性能。