Medical Oncology Department, Hospital Universitario Puerta de Hierro Majadahonda, Madrid, Spain.
TIB Leibniz Information Centre for Science and Technology, Hannover, Germany; L3S Research, Hannover, Germany.
Lung Cancer. 2024 Sep;195:107920. doi: 10.1016/j.lungcan.2024.107920. Epub 2024 Aug 9.
Lung Cancer (LC) is a multifactorial disease for which the role of genetic susceptibility has become increasingly relevant. Our aim was to use artificial intelligence (AI) to analyze differences between patients with LC based on family history of cancer (FHC).
From August 2016 to June 2020 clinical information was obtained from Thoracic Tumors Registry (TTR), a nationwide database sponsored by the Spanish Lung Cancer Group. In addition to descriptive statistical analysis, an AI-assisted analysis was performed. The German Technical Information Library supported the merging of data from the electronic medical records and database of the TTR. The results of the AI-assisted analysis were reported using Knowledge Graph, Unified Schema and descriptive and predictive analyses.
Analyses were performed in two phases: first, conventional statistical analysis including 11,684 patients of those 5,806 had FHC. Median overall survival (OS) for the global population was 23 months (CI 95 %: 21.39-24.61) in patients with FHC versus 21 months (CI 95 %: 19.53-22.48) in patients without FHC (NFHC), p < 0.001. The second AI-assisted analysis included 5,788 patients of those 939 had FHC. 58.48 % of women with FHC had LC. 9.53 % of patients had an EGFR or HER2 mutation or ALK translocation and at least one relative with cancer. A family history of LC was associated with an increased risk of smoking-related LC. Non-smokers with a family history of LC were more likely to have an EGFR mutation in NSCLC. In Bayesian network analysis, 55 % of patients with a family history of LC and never-smokers had an EGFR mutation.
In our population, the incidence of LC in patients with a FHC is higher in women and younger patients. FHC is a risk factor and predictor of LC development, especially in people ≤ 50 years. These results were confirmed by conventional statistics and AI-assisted analysis.
肺癌(LC)是一种多因素疾病,遗传易感性的作用变得越来越重要。我们的目的是使用人工智能(AI)来分析基于癌症家族史(FHC)的 LC 患者之间的差异。
从 2016 年 8 月至 2020 年 6 月,从西班牙肺癌组赞助的全国性数据库——胸部肿瘤登记处(TTR)获得临床信息。除了描述性统计分析外,还进行了人工智能辅助分析。德国技术信息图书馆支持合并电子病历和 TTR 数据库的数据。人工智能辅助分析的结果使用知识图、统一模式以及描述性和预测性分析进行报告。
分析分两个阶段进行:首先,进行了常规统计分析,共纳入了 11684 名患者,其中 5806 名患者有 FHC。在有 FHC 的患者中,总体生存率(OS)中位数为 23 个月(95%CI:21.39-24.61),而无 FHC 的患者为 21 个月(95%CI:19.53-22.48),p<0.001。其次,进行了人工智能辅助分析,共纳入了 5788 名患者,其中 939 名患者有 FHC。有 FHC 的女性中,58.48%患有 LC。9.53%的患者有 EGFR 或 HER2 突变或 ALK 易位,并且至少有一位亲属患有癌症。LC 的家族史与吸烟相关的 LC 风险增加相关。有 LC 家族史的不吸烟者更有可能患有非小细胞肺癌的 EGFR 突变。在贝叶斯网络分析中,有 FHC 且从不吸烟的患者中,55%患有 EGFR 突变。
在我们的人群中,有 FHC 的患者 LC 的发病率在女性和年轻患者中更高。FHC 是 LC 发生的危险因素和预测因子,尤其是在≤50 岁的人群中。这些结果通过常规统计学和人工智能辅助分析得到了证实。