Lee Jaimie J, Zepeda Andres, Arbour Gregory, Isaac Kathryn V, Ng Raymond T, Nichol Alan M
Department of Radiation Oncology, BC Cancer, Vancouver, BC, Canada.
Department of Surgery, University of British Columbia, Vancouver, BC, Canada.
JCO Clin Cancer Inform. 2024 Dec;8:e2400107. doi: 10.1200/CCI.24.00107. Epub 2024 Dec 20.
Breast cancer relapses are rarely collected by cancer registries because of logistical and financial constraints. Hence, we investigated natural language processing (NLP), enhanced with state-of-the-art deep learning transformer tools and large language models, to automate relapse identification in the text of computed tomography (CT) reports.
We analyzed follow-up CT reports from patients diagnosed with breast cancer between January 1, 2005, and December 31, 2014. The reports were curated and annotated for the presence or absence of local, regional, and distant breast cancer relapses. We performed 10-fold cross-validation to evaluate models identifying different types of relapses in CT reports. Model performance was assessed with classification metrics, reported with 95% confidence intervals.
In our data set of 1,445 CT reports, 799 (55.3%) described any relapse, 72 (5.0%) local relapses, 97 (6.7%) regional relapses, and 743 (51.4%) distant relapses. The any-relapse model achieved an accuracy of 89.6% (87.8-91.1), with a sensitivity of 93.2% (91.4-94.9) and a specificity of 84.2% (80.9-87.1). The local relapse model achieved an accuracy of 94.6% (93.3-95.7), a sensitivity of 44.4% (32.8-56.3), and a specificity of 97.2% (96.2-98.0). The regional relapse model showed an accuracy of 93.6% (92.3-94.9), a sensitivity of 70.1% (60.0-79.1), and a specificity of 95.3% (94.2-96.5). Finally, the distant relapse model demonstrated an accuracy of 88.1% (86.2-89.7), a sensitivity of 91.8% (89.9-93.8), and a specificity of 83.7% (80.5-86.4).
We developed NLP models to identify local, regional, and distant breast cancer relapses from CT reports. Automating the identification of breast cancer relapses can enhance data collection about patient outcomes.
由于后勤和资金限制,癌症登记处很少收集乳腺癌复发数据。因此,我们研究了自然语言处理(NLP)技术,利用最先进的深度学习变压器工具和大语言模型进行增强,以自动识别计算机断层扫描(CT)报告文本中的复发情况。
我们分析了2005年1月1日至2014年12月31日期间被诊断为乳腺癌的患者的随访CT报告。对报告进行整理并标注是否存在局部、区域和远处乳腺癌复发情况。我们进行了10折交叉验证,以评估识别CT报告中不同类型复发的模型。使用分类指标评估模型性能,并报告95%置信区间。
在我们的1445份CT报告数据集中,799份(55.3%)描述了任何复发情况,72份(5.0%)为局部复发,97份(6.7%)为区域复发,743份(51.4%)为远处复发。任何复发模型的准确率为89.6%(87.8 - 91.1),灵敏度为93.2%(91.4 - 94.9),特异度为84.2%(80.9 - 87.1)。局部复发模型的准确率为94.6%(93.3 - 95.7),灵敏度为44.4%(32.8 - 56.3),特异度为97.2%(96.2 - 98.0)。区域复发模型的准确率为93.6%(92.3 - 94.9),灵敏度为70.1%(60.0 - 79.1),特异度为95.3%(94.2 - 96.5)。最后,远处复发模型的准确率为88.1%(86.2 - 89.7),灵敏度为91.8%(89.9 - 93.8),特异度为83.7%(80.5 - 86.4)。
我们开发了NLP模型,用于从CT报告中识别局部、区域和远处乳腺癌复发情况。自动识别乳腺癌复发情况可以加强关于患者预后的数据收集。