• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解读早期及局部晚期非小细胞肺癌的复发情况:来自电子健康记录和自然语言处理的见解

Decoding Recurrence in Early-Stage and Locoregionally Advanced Non-Small Cell Lung Cancer: Insights From Electronic Health Records and Natural Language Processing.

作者信息

Lee Kyeryoung, Liu Zongzhi, Huang Qing, Corrigan David, Kalsekar Iftekhar, Jun Tomi, Stolovitzky Gustavo, Oh William K, Rajaram Ravi, Wang Xiaoyan

机构信息

GeneDx (Sema4), Stamford, CT.

Lung Cancer Initiative, Johnson & Johnson, New Brunswick, NJ.

出版信息

JCO Clin Cancer Inform. 2025 Apr;9:e2400227. doi: 10.1200/CCI-24-00227. Epub 2025 Apr 18.

DOI:10.1200/CCI-24-00227
PMID:40249880
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12011440/
Abstract

PURPOSE

Recurrences after curative resection in early-stage and locoregionally advanced non-small cell lung cancer (NSCLC) are common, necessitating a nuanced understanding of associated risk factors. This study aimed to establish a natural language processing (NLP) system to efficiently curate recurrence data in NSCLC and analyze risk factors longitudinally.

PATIENTS AND METHODS

Electronic health records of 6,351 patients with NSCLC with >700,000 notes were obtained from Mount Sinai's data sets. A deep learning-based customized NLP system was developed to identify cohorts experiencing recurrence. Recurrence types and rates over time were stratified by various clinical features. Cohort description analysis, Kaplan-Meier analysis for overall recurrence-free survival (RFS) and distant metastasis-free survival (DMFS), and Cox proportional hazards analysis were performed.

RESULTS

Of 1,295 patients with stage I-IIIA NSCLC with surgical resections, 336 patients (25.9%) experienced recurrence, as identified through NLP. The NLP system achieved a precision of 94.3%, a recall of 93%, and an F1 score of 93.5. Among 336 patients, 52.4% had local/regional recurrences, 44% distant metastases, and 3.6% unknown recurrence. RFS rates at years 1-5 were 93%, 81%, 73%, 67%, and 61%, respectively (96%, 89%, 84%, 80%, and 75% for distant metastasis). Stage-specific RFS rates at year 5 were 73% (IA), 62% (IB), 47% (IIA), 46% (IIB), and 20% (IIIA). Stage IB patients had a significantly higher likelihood of recurrence versus stage IA (adjusted hazard ratio [aHR], 1.63; = .02). The RFS was lower in patients with clinically significant alteration ( -negative or unknown significance), affecting overall RFS (aHR, 1.89; = .007) and DMFS (aHR, 2.47; = .009) among stage IA/IB patients.

CONCLUSION

Our scalable NLP system enabled us to generate real-world insights into NSCLC recurrences, paving the way for predictive models for preventing, diagnosing, and treating NSCLC recurrence.

摘要

目的

早期和局部晚期非小细胞肺癌(NSCLC)根治性切除术后复发很常见,因此有必要对相关风险因素有细致入微的了解。本研究旨在建立一个自然语言处理(NLP)系统,以高效整理NSCLC的复发数据并纵向分析风险因素。

患者与方法

从西奈山数据集获取了6351例NSCLC患者的电子健康记录,记录数量超过70万条。开发了一个基于深度学习的定制NLP系统,以识别经历复发的队列。根据各种临床特征对复发类型和随时间的复发率进行分层。进行了队列描述分析、总体无复发生存期(RFS)和无远处转移生存期(DMFS)的Kaplan-Meier分析,以及Cox比例风险分析。

结果

在1295例接受手术切除的I-IIIA期NSCLC患者中,通过NLP识别出336例(25.9%)患者出现复发。NLP系统的精确率为94.3%,召回率为93%,F1评分为93.5%。在336例患者中,52.4%有局部/区域复发,44%有远处转移,3.6%复发情况不明。1-5年的RFS率分别为93%、81%、73%、67%和61%(远处转移的RFS率分别为96%、89%、84%、80%和75%)。5年时各分期的RFS率分别为73%(IA期)、62%(IB期)、47%(IIA期)、46%(IIB期)和20%(IIIA期)。IB期患者与IA期相比复发可能性显著更高(调整后风险比[aHR],1.63;P = 0.02)。在IA/IB期患者中,具有临床显著改变(KRAS阴性或意义不明)的患者RFS较低,这影响了总体RFS(aHR,1.89;P = 0.007)和DMFS(aHR,2.47;P = 0.009)。

结论

我们可扩展的NLP系统使我们能够对NSCLC复发产生真实世界的见解,为预防、诊断和治疗NSCLC复发的预测模型铺平了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/8129fa9f7a34/cci-9-e2400227-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/a84be3c6be0a/cci-9-e2400227-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/3fd6807e9d55/cci-9-e2400227-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/71aa699349c3/cci-9-e2400227-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/8129fa9f7a34/cci-9-e2400227-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/a84be3c6be0a/cci-9-e2400227-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/3fd6807e9d55/cci-9-e2400227-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/71aa699349c3/cci-9-e2400227-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eeb4/12011440/8129fa9f7a34/cci-9-e2400227-g004.jpg

相似文献

1
Decoding Recurrence in Early-Stage and Locoregionally Advanced Non-Small Cell Lung Cancer: Insights From Electronic Health Records and Natural Language Processing.解读早期及局部晚期非小细胞肺癌的复发情况:来自电子健康记录和自然语言处理的见解
JCO Clin Cancer Inform. 2025 Apr;9:e2400227. doi: 10.1200/CCI-24-00227. Epub 2025 Apr 18.
2
Prognostic impact of vascular invasion and standardization of its evaluation in stage I non-small cell lung cancer.I期非小细胞肺癌中血管侵犯的预后影响及其评估的标准化
Diagn Pathol. 2015 Apr 2;10:17. doi: 10.1186/s13000-015-0249-5.
3
Association of Clinicopathologic and Molecular Tumor Features With Recurrence in Resected Early-Stage Epidermal Growth Factor Receptor-Positive Non-Small Cell Lung Cancer.早期表皮生长因子受体阳性非小细胞肺癌切除术后复发的临床病理和分子肿瘤特征的相关性研究。
JAMA Netw Open. 2021 Nov 1;4(11):e2131892. doi: 10.1001/jamanetworkopen.2021.31892.
4
Circulating 25-hydroxyvitamin D levels predict survival in early-stage non-small-cell lung cancer patients.循环25-羟维生素D水平可预测早期非小细胞肺癌患者的生存率。
J Clin Oncol. 2007 Feb 10;25(5):479-85. doi: 10.1200/JCO.2006.07.5358.
5
[Prognosis after complete surgical resection for non-small cell lung cancer based on the staging classification].基于分期分类的非小细胞肺癌完全手术切除后的预后
Dtsch Med Wochenschr. 2006 Nov 24;131(47):2643-8. doi: 10.1055/s-2006-956268.
6
Long-Term Results for Clinical Stage IA Lung Cancer: Comparing Lobectomy and Sublobar Resection.临床ⅠA 期肺癌的长期结果:肺叶切除术与亚肺叶切除术比较。
Ann Thorac Surg. 2018 Aug;106(2):375-381. doi: 10.1016/j.athoracsur.2018.02.049. Epub 2018 Mar 23.
7
Nomograms for Predicting Overall and Recurrence-free Survival From Pathologic Stage IA and IB Lung Cancer After Lobectomy.肺癌根治术后 IA 和 IB 期患者总生存和无复发生存的预测列线图
Clin Lung Cancer. 2021 Jul;22(4):e574-e583. doi: 10.1016/j.cllc.2020.10.009. Epub 2020 Oct 22.
8
Clinicopathological variables influencing overall survival, recurrence and post-recurrence survival in resected stage I non-small-cell lung cancer.影响Ⅰ期非小细胞肺癌切除术后总生存、复发和复发后生存的临床病理变量。
BMC Cancer. 2020 Feb 24;20(1):150. doi: 10.1186/s12885-020-6621-1.
9
Early Distant Recurrence in Patients With Resected Stage I Lung Cancer: A Case Series of "Blast Metastasis".根治性切除术后 I 期肺癌患者的早期远处复发:“爆炸转移”病例系列。
Clin Lung Cancer. 2021 Jan;22(1):e132-e135. doi: 10.1016/j.cllc.2020.09.002. Epub 2020 Sep 18.
10
Extracapsular extension is a powerful prognostic factor in stage IIA-IIIA non-small cell lung cancer patients with completely resection.在完全切除的IIA-IIIA期非小细胞肺癌患者中,肿瘤囊外扩展是一个有力的预后因素。
Int J Clin Exp Pathol. 2015 Sep 1;8(9):11268-77. eCollection 2015.

本文引用的文献

1
Detecting Ground Glass Opacity Features in Patients With Lung Cancer: Automated Extraction and Longitudinal Analysis via Deep Learning-Based Natural Language Processing.检测肺癌患者的磨玻璃影特征:基于深度学习的自然语言处理实现自动提取与纵向分析
JMIR AI. 2023 Jun 1;2:e44537. doi: 10.2196/44537.
2
Recurrence-Free Survival in Patients With Surgically Resected Non-Small Cell Lung Cancer: A Systematic Literature Review and Meta-Analysis.手术切除的非小细胞肺癌患者的无复发生存:系统文献回顾和荟萃分析。
Chest. 2024 May;165(5):1260-1270. doi: 10.1016/j.chest.2023.11.042. Epub 2023 Dec 6.
3
Prognosis of recurrence after complete resection in early-stage lung adenocarcinoma based on molecular alterations: a systematic review and meta-analysis.
基于分子改变的早期肺腺癌完全切除术后复发的预后:系统评价和荟萃分析。
Sci Rep. 2023 Oct 31;13(1):18710. doi: 10.1038/s41598-023-42851-2.
4
Machine Learning-Assisted Recurrence Prediction for Patients With Early-Stage Non-Small-Cell Lung Cancer.机器学习辅助早期非小细胞肺癌患者的复发预测。
JCO Clin Cancer Inform. 2023 Jul;7:e2200062. doi: 10.1200/CCI.22.00062.
5
Leveraging natural language processing to identify eligible lung cancer screening patients with the electronic health record.利用自然语言处理技术从电子健康记录中识别符合条件的肺癌筛查患者。
Int J Med Inform. 2023 Sep;177:105136. doi: 10.1016/j.ijmedinf.2023.105136. Epub 2023 Jun 26.
6
Lobar or Sublobar Resection for Peripheral Stage IA Non-Small-Cell Lung Cancer.肺段或亚肺叶切除术治疗外周型ⅠA 期非小细胞肺癌。
N Engl J Med. 2023 Feb 9;388(6):489-498. doi: 10.1056/NEJMoa2212083.
7
Treatment patterns and survival of patients with locoregional recurrence in early-stage NSCLC: a literature review of real-world evidence.早期非小细胞肺癌局部区域复发患者的治疗模式和生存:真实世界证据的文献回顾。
Med Oncol. 2022 Oct 29;40(1):4. doi: 10.1007/s12032-022-01790-0.
8
Machine learning application in personalised lung cancer recurrence and survivability prediction.机器学习在个性化肺癌复发和生存预测中的应用。
Comput Struct Biotechnol J. 2022 Apr 4;20:1811-1820. doi: 10.1016/j.csbj.2022.03.035. eCollection 2022.
9
Using Natural Language Processing and Machine Learning to Preoperatively Predict Lymph Node Metastasis for Non-Small Cell Lung Cancer With Electronic Medical Records: Development and Validation Study.利用自然语言处理和机器学习,通过电子病历术前预测非小细胞肺癌的淋巴结转移:开发与验证研究
JMIR Med Inform. 2022 Apr 25;10(4):e35475. doi: 10.2196/35475.
10
On Predicting Recurrence in Early Stage Non-small Cell Lung Cancer.早期非小细胞肺癌复发预测。
AMIA Annu Symp Proc. 2022 Feb 21;2021:853-862. eCollection 2021.