Department of Information Engineering and Computer Science, Feng Chia University, Taichung City, Taiwan.
Institute of Biomedical Engineering and Nanomedicine, National Health Research Institutes, Miaoli County, Taiwan.
PLoS One. 2022 Oct 19;17(10):e0276278. doi: 10.1371/journal.pone.0276278. eCollection 2022.
Early confirmation or ruling out biliary atresia (BA) is essential for infants with delayed onset of jaundice. In the current practice, percutaneous liver biopsy and intraoperative cholangiography (IOC) remain the golden standards for diagnosis. In Taiwan, the diagnostic methods are invasive and can only be performed in selective medical centers. However, referrals from primary physicians and local pediatricians are often delayed because of lacking clinical suspicions. Ultrasounds (US) are common screening tools in local hospitals and clinics, but the pediatric hepatobiliary US particularly requires well-trained imaging personnel. The meaningful comprehension of US is highly dependent on individual experience. For screening BA through human observation on US images, the reported sensitivity and specificity were achieved by pediatric radiologists, pediatric hepatobiliary experts, or pediatric surgeons. Therefore, this research developed a tool based on deep learning models for screening BA to assist pediatric US image reading by general physicians and pediatricians.
De-identified hepatobiliary US images of 180 patients from Taichung Veterans General Hospital were retrospectively collected under the approval of the Institutional Review Board. Herein, the top network models of ImageNet Large Scale Visual Recognition Competition and other network models commonly used for US image recognition were included for further study to classify US images as BA or non-BA. The performance of different network models was expressed by the confusion matrix and receiver operating characteristic curve. There were two methods proposed to solve disagreement by US image classification of a single patient. The first and second methods were the positive-dominance law and threshold law. During the study, the US images of three successive patients suspected to have BA were classified by the trained models.
Among all included patients contributing US images, 41 patients were diagnosed with BA by surgical intervention and 139 patients were either healthy controls or had non-BA diagnoses. In this study, a total of 1,976 original US images were enrolled. Among them, 417 and 1,559 raw images were from patients with BA and without BA, respectively. Meanwhile, ShuffleNet achieved the highest accuracy of 90.56% using the same training parameters as compared with other network models. The sensitivity and specificity were 67.83% and 96.76%, respectively. In addition, the undesired false-negative prediction was prevented by applying positive-dominance law to interpret different images of a single patient with an acceptable false-positive rate, which was 13.64%. For the three consecutive patients with delayed obstructive jaundice with IOC confirmed diagnoses, ShuffleNet achieved accurate diagnoses in two patients.
The current study provides a screening tool for identifying possible BA by hepatobiliary US images. The method was not designed to replace liver biopsy or IOC, but to decrease human error for interpretations of US. By applying the positive-dominance law to ShuffleNet, the false-negative rate and the specificities were 0 and 86.36%, respectively. The trained deep learning models could aid physicians other than pediatric surgeons, pediatric gastroenterologists, or pediatric radiologists, to prevent misreading pediatric hepatobiliary US images. The current artificial intelligence (AI) tool is helpful for screening BA in the real world.
早期确认或排除先天性胆道闭锁(BA)对于黄疸延迟发作的婴儿至关重要。目前,经皮肝活检和术中胆管造影(IOC)仍然是诊断的金标准。在台湾,这些诊断方法具有侵入性,只能在选择性医疗中心进行。然而,由于缺乏临床怀疑,初级医生和当地儿科医生的转诊常常被延迟。超声(US)是当地医院和诊所常用的筛查工具,但儿科肝胆 US 特别需要经过良好培训的影像人员。对 US 图像的有意义的理解高度依赖于个人经验。为了通过对 US 图像的人工观察来筛查 BA,已报道的敏感性和特异性是由儿科放射科医生、儿科肝胆专家或儿科外科医生获得的。因此,本研究开发了一种基于深度学习模型的工具,用于筛查 BA,以协助普通医生和儿科医生阅读儿科 US 图像。
本研究回顾性收集了台中荣民总医院 180 名患者的肝胆 US 图像,这些图像是在机构审查委员会的批准下获得的。在此,包括了 ImageNet 大规模视觉识别竞赛的顶级网络模型和其他常用于 US 图像识别的网络模型,以进一步研究将 US 图像分类为 BA 或非 BA。不同网络模型的性能通过混淆矩阵和接收器工作特征曲线来表示。为了解决单个患者 US 图像分类的分歧,提出了两种方法。第一种和第二种方法是阳性优势法则和阈值法则。在研究过程中,使用训练好的模型对连续三个疑似 BA 的患者的 US 图像进行分类。
在所有纳入的贡献 US 图像的患者中,41 名患者通过手术干预被诊断为 BA,139 名患者为健康对照或患有非 BA。在这项研究中,共纳入了 1976 张原始 US 图像。其中,417 张和 1559 张原始图像分别来自 BA 患者和非 BA 患者。同时,与其他网络模型相比,ShuffleNet 使用相同的训练参数达到了最高的 90.56%的准确率。敏感性和特异性分别为 67.83%和 96.76%。此外,通过应用阳性优势法则来解释单个患者的不同图像,可以在可接受的假阳性率(13.64%)下防止不理想的假阴性预测,从而预防不必要的假阴性预测。对于经 IOC 证实为延迟性梗阻性黄疸的连续 3 名患者,ShuffleNet 对其中 2 名患者进行了准确诊断。
本研究提供了一种通过肝胆 US 图像识别可能的 BA 的筛查工具。该方法不是为了替代肝活检或 IOC,而是为了减少 US 解释中的人为错误。通过应用阳性优势法则到 ShuffleNet,假阴性率和特异性分别为 0 和 86.36%。训练有素的深度学习模型可以帮助除儿科外科医生、儿科胃肠病学家或儿科放射科医生以外的医生,防止对儿科肝胆 US 图像的误读。目前的人工智能(AI)工具有助于在现实世界中筛查 BA。