Vagliano Iacopo, Rios Miguel, Abukmeil Mohanad, Schut Martijn C, Luik Torec T, van Asselt Kristel M, van Weert Henk C P M, Abu-Hanna Ameen
Department of Medical Informatics, Amsterdam University Medical Centers, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands.
Amsterdam Public Health, Amsterdam University Medical Center, 1105 AZ Amsterdam, The Netherlands.
Cancers (Basel). 2025 Mar 29;17(7):1151. doi: 10.3390/cancers17071151.
Improving prediction models to timely detect lung cancer is paramount. Our aim is to develop and validate prediction models for early detection of lung cancer in primary care, based on free-text consultation notes, that exploit the order and context among words and sentences. Data of all patients enlisted in 49 general practices between 2002 and 2021 were assessed, and we included those older than 30 years with at least one free-text note. We developed two models using a hierarchical architecture that relies on attention and bidirectional long short-term memory networks. One model used only text, while the other combined text with clinical variables. The models were trained on data excluding the five months leading up to the diagnosis, using target replication and a tuning set, and were tested on a separate dataset for discrimination, PPV, and calibration. A total of 250,021 patients were enlisted, with 1507 having a lung cancer diagnosis. Included in the analysis were 183,012 patients, of which 712 had the diagnosis. From the two models, the combined model showed slightly better performance, achieving an AUROC on the test set of 0.91, an AUPRC of 0.05, and a PPV of 0.034 (0.024, 0.043), and showed good calibration. To early detect one cancer patient, 29 high-risk patients would require additional diagnostic testing. Our models showed excellent discrimination by leveraging the word and sentence structure. Including clinical variables in addition to text slightly improved performance. The number needed to treat holds promise for clinical practice. Investigating external validation and model suitability in clinical practice is warranted.
改进预测模型以及时检测肺癌至关重要。我们的目标是基于自由文本咨询记录开发并验证用于基层医疗中肺癌早期检测的预测模型,该模型利用单词和句子之间的顺序和上下文。对2002年至2021年期间49家普通诊所登记的所有患者的数据进行了评估,我们纳入了年龄超过30岁且至少有一条自由文本记录的患者。我们使用了一种依赖注意力和双向长短期记忆网络的分层架构开发了两个模型。一个模型仅使用文本,另一个模型将文本与临床变量相结合。这些模型在排除诊断前五个月的数据上进行训练,使用目标复制和一个调优集,并在一个单独的数据集上进行测试,以评估区分度、阳性预测值和校准情况。总共登记了250,021名患者,其中1507人被诊断为肺癌。纳入分析的有183,012名患者,其中712人被诊断为肺癌。在这两个模型中,组合模型表现略好,在测试集上的受试者工作特征曲线下面积为0.91,精确率-召回率曲线下面积为0.05,阳性预测值为0.034(0.024,0.043),并显示出良好的校准。为了早期检测出一名癌症患者,需要对29名高危患者进行额外的诊断测试。我们的模型通过利用单词和句子结构显示出出色的区分度。除文本外纳入临床变量略微提高了性能。治疗所需人数对临床实践具有前景。有必要在临床实践中研究外部验证和模型适用性。