Boosting predictive models and augmenting patient data with relevant genomic and pathway information.

Affiliation

Data Science Institute, University of Galway, University Road, H91 TK33, Co. Galway, Ireland.

Publication information

Comput Biol Med. 2024 May;174:108398. doi: 10.1016/j.compbiomed.2024.108398. Epub 2024 Apr 3.

DOI:10.1016/j.compbiomed.2024.108398

Abstract

The recurrence of low-stage lung cancer poses a challenge due to its unpredictable nature and diverse patient responses to treatments. Personalized care and patient outcomes heavily rely on early relapse identification, yet current predictive models, despite their potential, lack comprehensive genetic data. This inadequacy fuels our research focus: integrating specific genetic information, such as pathway scores, into clinical data. Our aim is to refine machine learning models for more precise relapse prediction in early-stage non-small cell lung cancer. To address the scarcity of genetic data, we employ imputation techniques, leveraging publicly available datasets such as The Cancer Genome Atlas (TCGA), integrating pathway scores into our patient cohort from the Cancer Long Survivor Artificial Intelligence Follow-up (CLARIFY) project. Through the integration of imputed pathway scores from the TCGA dataset with clinical data, our approach achieves notable strides in predicting relapse among a held-out test set of 200 patients. By training machine learning models on enriched knowledge graph data, inclusive of triples derived from pathway score imputation, we achieve a promising precision of 82% and specificity of 91%. These outcomes highlight the potential of our models as supplementary tools within tumour, node, and metastasis (TNM) classification systems, offering improved prognostic capabilities for lung cancer patients. In summary, our research underscores the significance of refining machine learning models for relapse prediction in early-stage non-small cell lung cancer. Our approach, centered on imputing pathway scores and integrating them with clinical data, not only enhances predictive performance but also demonstrates the promising role of machine learning in anticipating relapse and ultimately elevating patient outcomes.
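
The abstract does not say which imputation technique was used, so the following is only a minimal sketch under stated assumptions: a pathway score for a patient without genomic data is borrowed from the nearest patients in a reference cohort (standing in for TCGA), matched on shared clinical features. It also shows how precision and specificity, the two metrics reported above, are computed from binary predictions. All function names, feature vectors, and values are hypothetical and do not come from the paper.

```python
from math import dist


def impute_pathway_score(patient, reference, k=3):
    """Average the pathway scores of the k reference patients that are
    closest to `patient` in clinical-feature space (Euclidean distance).

    ASSUMPTION: nearest-neighbour averaging is an illustrative choice,
    not necessarily the method used by the authors.
    """
    if not reference:
        raise ValueError("reference cohort is empty")
    nearest = sorted(reference, key=lambda r: dist(patient, r[0]))[:k]
    return sum(score for _, score in nearest) / len(nearest)


def precision_and_specificity(y_true, y_pred):
    """Return (precision, specificity) for binary labels, 1 = relapse."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return precision, specificity


# Hypothetical reference cohort: ((age, stage_code), pathway_score)
reference = [
    ((60.0, 1.0), 0.2),
    ((62.0, 1.0), 0.4),
    ((45.0, 0.0), 0.9),
    ((70.0, 1.0), 0.3),
]

# Impute a score for a clinical-only patient from the two nearest neighbours.
imputed = impute_pathway_score((61.0, 1.0), reference, k=2)

# Hypothetical relapse labels and model predictions.
y_true = [1, 0, 0, 1, 0, 1]
y_pred = [1, 0, 1, 1, 0, 0]
precision, specificity = precision_and_specificity(y_true, y_pred)
```

In practice the clinical features would need scaling before distances are computed, and the imputed score would be added as one more feature (or knowledge graph triple) alongside the clinical data before training.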

Similar articles

Synergy between imputed genetic pathway and clinical information for predicting recurrence in early stage non-small cell lung cancer.

J Biomed Inform. 2023 Aug;144:104424. doi: 10.1016/j.jbi.2023.104424. Epub 2023 Jun 21.

Integration of Clinical Information and Imputed Aneuploidy Scores to Enhance Relapse Prediction in Early Stage Lung Cancer Patients.

AMIA Annu Symp Proc. 2023 Apr 29;2022:1062-1071. eCollection 2022.

Machine Learning-Assisted Recurrence Prediction for Patients With Early-Stage Non-Small-Cell Lung Cancer.

JCO Clin Cancer Inform. 2023 Jul;7:e2200062. doi: 10.1200/CCI.22.00062.

A Genomic-Pathologic Annotated Risk Model to Predict Recurrence in Early-Stage Lung Adenocarcinoma.

JAMA Surg. 2021 Feb 1;156(2):e205601. doi: 10.1001/jamasurg.2020.5601. Epub 2021 Feb 10.

A novel 12-gene signature as independent prognostic model in stage IA and IB lung squamous cell carcinoma patients.

Clin Transl Oncol. 2021 Nov;23(11):2368-2381. doi: 10.1007/s12094-021-02638-1. Epub 2021 May 24.

A comparison of machine learning methods for predicting recurrence and death after curative-intent radiotherapy for non-small cell lung cancer: Development and validation of multivariable clinical prediction models.

EBioMedicine. 2022 Mar;77:103911. doi: 10.1016/j.ebiom.2022.103911. Epub 2022 Mar 3.

Machine learning application in personalised lung cancer recurrence and survivability prediction.

Comput Struct Biotechnol J. 2022 Apr 4;20:1811-1820. doi: 10.1016/j.csbj.2022.03.035. eCollection 2022.

Lymph Node Metastasis Prediction From In Situ Lung Squamous Cell Carcinoma Histopathology Images Using Deep Learning.

Lab Invest. 2025 Jan;105(1):102187. doi: 10.1016/j.labinv.2024.102187. Epub 2024 Nov 13.

Machine learning for predicting colon cancer recurrence.

Surg Oncol. 2024 Jun;54:102079. doi: 10.1016/j.suronc.2024.102079. Epub 2024 Apr 19.

PMID of this article: 38608322
