Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:1675-1681. doi: 10.1109/EMBC48229.2022.9871894.
Lung ultrasound (LUS) as a diagnostic tool is gaining support for its role in the diagnosis and management of COVID-19 and a number of other lung pathologies. B-lines are a predominant feature in COVID-19, however LUS requires a skilled clinician to interpret findings. To facilitate the interpretation, our main objective was to develop automated methods to classify B-lines as pathologic vs. normal. We developed transfer learning models based on ResNet networks to classify B-lines as pathologic (at least 3 B-lines per lung field) vs. normal using COVID-19 LUS data. Assessment of B-line severity on a 0-4 multi-class scale was also explored. For binary B-line classification, at the frame-level, all ResNet models pretrained with ImageNet yielded higher performance than the baseline nonpretrained ResNet-18. Pretrained ResNet-18 has the best Equal Error Rate (EER) of 9.1% vs the baseline of 11.9%. At the clip-level, all pretrained network models resulted in better Cohen's kappa agreement (linear-weighted) and clip score accuracy, with the pretrained ResNet-18 having the best Cohen's kappa of 0.815 [95% CI: 0.804-0.826], and ResNet-101 the best clip scoring accuracy of 93.6%. Similar results were shown for multi-class scoring, where pretrained network models outperformed the baseline model. A class activation map is also presented to guide clinicians in interpreting LUS findings. Future work aims to further improve the multi-class assessment for severity of B-lines with a more diverse LUS dataset.
肺部超声(LUS)作为一种诊断工具,因其在 COVID-19 及多种肺部疾病的诊断和管理中的作用而受到越来越多的关注。B 线是 COVID-19 的主要特征,但 LUS 需要有经验的临床医生来解读结果。为了便于解读,我们的主要目标是开发自动化方法,将 B 线分类为病理性与正常。我们开发了基于 ResNet 网络的迁移学习模型,使用 COVID-19 LUS 数据将 B 线分类为病理性(每个肺场至少有 3 条 B 线)与正常。还探索了 B 线严重程度的 0-4 多类分级评估。对于二分类 B 线分类,在帧级,所有基于 ImageNet 预训练的 ResNet 模型的性能均高于非预训练的基线 ResNet-18。预训练的 ResNet-18 的等错误率(EER)为 9.1%,优于基线的 11.9%。在剪辑级,所有预训练网络模型的 Cohen's kappa 一致性(线性加权)和剪辑评分准确性都更好,预训练的 ResNet-18 的 Cohen's kappa 最佳为 0.815 [95%CI:0.804-0.826],ResNet-101 的剪辑评分准确性最佳为 93.6%。对于多类评分,也显示出类似的结果,预训练网络模型优于基线模型。还展示了类激活图,以指导临床医生解读 LUS 结果。未来的工作旨在通过更具多样性的 LUS 数据集,进一步提高 B 线严重程度的多类评估。