Department of Computer Science and Engineering, Korea University, Seoul, Korea.
Department of Radiology and Research Institute of Radiological Science, Severance Hospital, Yonsei University College of Medicine, Seoul, Korea.
JCO Clin Cancer Inform. 2024 Aug;8:e2400021. doi: 10.1200/CCI.24.00021.
To explore the predictive potential of serial computed tomography (CT) radiology reports for pancreatic cancer survival using natural language processing (NLP).
Deep-transfer-learning-based NLP models were retrospectively trained and tested with serial, free-text CT reports, and survival information of consecutive patients diagnosed with pancreatic cancer in a Korean tertiary hospital was extracted. Randomly selected patients with pancreatic cancer and their serial CT reports from an independent tertiary hospital in the United States were included in the external testing data set. The concordance index (c-index) of predicted survival and actual survival, and area under the receiver operating characteristic curve (AUROC) for predicting 1-year survival were calculated.
Between January 2004 and June 2021, 2,677 patients with 12,255 CT reports and 670 patients with 3,058 CT reports were allocated to training and internal testing data sets, respectively. ClinicalBERT (Bidirectional Encoder Representations from Transformers) model trained on the single, first CT reports showed a c-index of 0.653 and AUROC of 0.722 in predicting the overall survival of patients with pancreatic cancer. ClinicalBERT trained on up to 15 consecutive reports from the initial report showed an improved c-index of 0.811 and AUROC of 0.911. On the external testing set with 273 patients with 1,947 CT reports, the AUROC was 0.888, indicating the generalizability of our model. Further analyses showed our model's contextual interpretation beyond specific phrases.
Deep-transfer-learning-based NLP model of serial CT reports can predict the survival of patients with pancreatic cancer. Clinical decisions can be supported by the developed model, with survival information extracted solely from serial radiology reports.
利用自然语言处理(NLP)探索连续计算机断层扫描(CT)影像学报告对胰腺癌生存的预测潜力。
回顾性地训练和测试基于深度迁移学习的 NLP 模型,使用连续的自由文本 CT 报告和从韩国一家三级医院连续诊断为胰腺癌的患者的生存信息进行训练和测试。从美国的另一家独立的三级医院随机选择胰腺癌患者及其连续 CT 报告作为外部测试数据集。计算预测生存和实际生存的一致性指数(c-index)以及预测 1 年生存率的受试者工作特征曲线下面积(AUROC)。
2004 年 1 月至 2021 年 6 月,共纳入 2677 例患者的 12255 份 CT 报告和 670 例患者的 3058 份 CT 报告,分别分配到训练和内部测试数据集。基于单份、首份 CT 报告训练的 ClinicalBERT(来自变压器的双向编码器表示)模型,预测胰腺癌患者总生存率的 c-index 为 0.653,AUROC 为 0.722。从初始报告开始训练多达 15 份连续报告的 ClinicalBERT,c-index 提高到 0.811,AUROC 提高到 0.911。在包含 273 例患者和 1947 份 CT 报告的外部测试集中,AUROC 为 0.888,表明了我们模型的泛化能力。进一步分析表明,我们的模型可以对上下文进行解释,而不仅仅是对特定短语进行解释。
基于深度迁移学习的连续 CT 报告 NLP 模型可预测胰腺癌患者的生存情况。该模型可从连续的影像学报告中提取生存信息,为临床决策提供支持。