Department of Radiology, Weill Cornell Medicine, New York, New York, United States of America.
Information Technologies and Services, Weill Cornell Medicine, New York, New York, United States of America.
PLoS One. 2020 Jul 30;15(7):e0236827. doi: 10.1371/journal.pone.0236827. eCollection 2020.
Heart failure (HF) is a major cause of morbidity and mortality. However, much of the clinical data is unstructured in the form of radiology reports, while the process of data collection and curation is arduous and time-consuming.
We utilized a machine learning (ML)-based natural language processing (NLP) approach to extract clinical terms from unstructured radiology reports. Additionally, we investigate the prognostic value of the extracted data in predicting all-cause mortality (ACM) in HF patients.
This observational cohort study utilized 122,025 thoracoabdominal computed tomography (CT) reports from 11,808 HF patients obtained between 2008 and 2018. 1,560 CT reports were manually annotated for the presence or absence of 14 radiographic findings, in addition to age and gender. Thereafter, a Convolutional Neural Network (CNN) was trained, validated and tested to determine the presence or absence of these features. Further, the ability of CNN to predict ACM was evaluated using Cox regression analysis on the extracted features.
11,808 CT reports were analyzed from 11,808 patients (mean age 72.8 ± 14.8 years; 52.7% (6,217/11,808) male) from whom 3,107 died during the 10.6-year follow-up. The CNN demonstrated excellent accuracy for retrieval of the 14 radiographic findings with area-under-the-curve (AUC) ranging between 0.83-1.00 (F1 score 0.84-0.97). Cox model showed the time-dependent AUC for predicting ACM was 0.747 (95% confidence interval [CI] of 0.704-0.790) at 30 days.
An ML-based NLP approach to unstructured CT reports demonstrates excellent accuracy for the extraction of predetermined radiographic findings, and provides prognostic value in HF patients.
心力衰竭(HF)是发病率和死亡率的主要原因。然而,大部分临床数据以放射学报告的形式呈现为非结构化数据,而数据收集和整理的过程既艰巨又耗时。
我们利用基于机器学习(ML)的自然语言处理(NLP)方法从非结构化放射学报告中提取临床术语。此外,我们还研究了提取数据在预测心力衰竭患者全因死亡率(ACM)方面的预后价值。
这项观察性队列研究使用了 2008 年至 2018 年间从 11808 例心力衰竭患者中获得的 122025 例胸腹部计算机断层扫描(CT)报告。为了确定 14 种影像学表现的存在与否,除了年龄和性别外,还对 1560 份 CT 报告进行了手动注释。此后,训练、验证和测试了卷积神经网络(CNN),以确定这些特征的存在与否。此外,还使用 Cox 回归分析对提取的特征进行了评估,以确定 CNN 预测 ACM 的能力。
对 11808 名患者(平均年龄 72.8 ± 14.8 岁;52.7%(6217/11808)为男性)的 11808 份 CT 报告进行了分析,这些患者在 10.6 年的随访中,有 3107 人死亡。CNN 在提取 14 种影像学表现方面具有出色的准确性,曲线下面积(AUC)范围为 0.83-1.00(F1 评分 0.84-0.97)。Cox 模型显示,预测 ACM 的时间依赖 AUC 在 30 天时为 0.747(95%置信区间[CI]:0.704-0.790)。
基于机器学习的 NLP 方法可从非结构化 CT 报告中准确提取预定的影像学表现,并为心力衰竭患者提供预后价值。