IEEE Trans Ultrason Ferroelectr Freq Control. 2020 Nov;67(11):2312-2320. doi: 10.1109/TUFFC.2020.3002249. Epub 2020 Jun 15.
Shortness of breath is a major reason that patients present to the emergency department (ED) and point-of-care ultrasound (POCUS) has been shown to aid in diagnosis, particularly through evaluation for artifacts known as B-lines. B-line identification and quantification can be a challenging skill for novice ultrasound users, and experienced users could benefit from a more objective measure of quantification. We sought to develop and test a deep learning (DL) algorithm to quantify the assessment of B-lines in lung ultrasound. We utilized ultrasound clips ( n = 400 ) from an existing database of ED patients to provide training and test sets to develop and test the DL algorithm based on deep convolutional neural networks. Interpretations of the images by algorithm were compared to expert human interpretations on binary and severity (a scale of 0-4) classifications. Our model yielded a sensitivity of 93% (95% confidence interval (CI) 81%-98%) and a specificity of 96% (95% CI 84%-99%) for the presence or absence of B-lines compared to expert read, with a kappa of 0.88 (95% CI 0.79-0.97). Model to expert agreement for severity classification yielded a weighted kappa of 0.65 (95% CI 0.56-074). Overall, the DL algorithm performed well and could be integrated into an ultrasound system in order to help diagnose and track B-line severity. The algorithm is better at distinguishing the presence from the absence of B-lines but can also be successfully used to distinguish between B-line severity. Such methods could decrease variability and provide a standardized method for improved diagnosis and outcome.
呼吸困难是患者到急诊科就诊的主要原因,床边超声(POCUS)已被证明有助于诊断,特别是通过评估被称为 B 线的伪影。B 线的识别和定量对于新手超声用户来说是一项具有挑战性的技能,而有经验的用户可能会受益于更客观的定量测量方法。我们试图开发和测试一种深度学习(DL)算法来定量评估肺部超声中的 B 线。我们利用来自 ED 患者现有数据库的超声剪辑(n = 400),为开发和测试基于深度卷积神经网络的 DL 算法提供了训练集和测试集。通过算法对图像的解释与专家对二进制和严重程度(0-4 级)分类的解释进行了比较。与专家阅读相比,我们的模型在存在或不存在 B 线时的灵敏度为 93%(95%置信区间(CI)81%-98%),特异性为 96%(95%CI 84%-99%),kappa 值为 0.88(95%CI 0.79-0.97)。用于严重程度分类的模型与专家的一致性产生了加权 kappa 值为 0.65(95%CI 0.56-074)。总体而言,DL 算法性能良好,可以集成到超声系统中,以帮助诊断和跟踪 B 线严重程度。该算法更擅长区分 B 线的存在与不存在,但也可以成功用于区分 B 线的严重程度。这种方法可以减少变异性,并提供一种标准化方法,以改善诊断和结果。