• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

肺腺癌和鳞状细胞癌转录组的比较研究以揭示潜在候选生物标志物:一种可解释的人工智能方法。

Comparative investigation of lung adenocarcinoma and squamous cell carcinoma transcriptome to reveal potential candidate biomarkers: An explainable AI approach.

作者信息

Datta Ankur, C George Priya Doss

机构信息

Laboratory of Integrative Genomics, Department of Integrative Biology, School of BioSciences and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu 632014, India.

出版信息

Comput Biol Chem. 2025 Apr;115:108333. doi: 10.1016/j.compbiolchem.2024.108333. Epub 2024 Dec 27.

DOI:10.1016/j.compbiolchem.2024.108333
PMID:39787672
Abstract

Patients with Non-Small Cell Lung Cancer (NSCLC) present a variety of clinical symptoms, such as dyspnea and chest pain, complicating accurate diagnosis. NSCLC includes subtypes distinguished by histological characteristics, specifically lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC). This study aims to compare and identify abnormal gene expression patterns in LUAD and LUSC samples relative to adjacent healthy tissues using an explainable artificial intelligence (XAI) framework. The LASSO algorithm was employed to identify the top gene features in the LUAD and LUSC datasets. An ensemble-based extreme gradient boosting (XGBoost) machine learning (ML) algorithm was trained and interpreted using SHapley Additive exPlanations (SHAP), with top features undergoing biological annotation through survival and functional enrichment analyses. The XAI-based SHAP module addresses the opaque nature of ML models. Notably, 35 and 33 genes were identified for LUAD and LUSC, respectively, using the LASSO algorithm. Performance metrics such as average accuracy and Matthew's correlation coefficient were evaluated. The XGBoost model demonstrated an average accuracy of 99.1 % for LUAD and 98.6 % for LUSC. The SFTPC gene emerged as the most significant feature across both NSCLC subtypes. For LUAD, genes such as STX11, CLEC3B, EMP2, and LYVE1 significantly influenced the XAI-SHAP framework. Conversely, GKN2, OGN, SLC39A8, and MMRN1 were identified for LUSC. Survival analysis and functional validation of these genes highlighted the physiological functions observed to be dysregulated in the NSCLC subtypes. These identified genes have the potential to enhance current medical diagnostics and therapeutics.

摘要

非小细胞肺癌(NSCLC)患者会出现多种临床症状,如呼吸困难和胸痛,这使得准确诊断变得复杂。NSCLC包括根据组织学特征区分的亚型,具体为肺腺癌(LUAD)和肺鳞状细胞癌(LUSC)。本研究旨在使用可解释人工智能(XAI)框架比较并识别LUAD和LUSC样本相对于相邻健康组织的异常基因表达模式。采用LASSO算法识别LUAD和LUSC数据集中的顶级基因特征。使用SHapley加法解释(SHAP)对基于集成的极端梯度提升(XGBoost)机器学习(ML)算法进行训练和解释,通过生存和功能富集分析对顶级特征进行生物学注释。基于XAI的SHAP模块解决了ML模型的不透明性问题。值得注意的是,使用LASSO算法分别为LUAD和LUSC鉴定出35个和33个基因。评估了平均准确率和马修斯相关系数等性能指标。XGBoost模型对LUAD的平均准确率为99.1%,对LUSC的平均准确率为98.6%。SFTPC基因是两种NSCLC亚型中最显著的特征。对于LUAD,STX11、CLEC3B、EMP2和LYVE1等基因显著影响XAI-SHAP框架。相反,LUSC鉴定出了GKN2、OGN、SLC39A8和MMRN1。这些基因的生存分析和功能验证突出了在NSCLC亚型中观察到的失调生理功能。这些鉴定出的基因有可能增强当前的医学诊断和治疗方法。

相似文献

1
Comparative investigation of lung adenocarcinoma and squamous cell carcinoma transcriptome to reveal potential candidate biomarkers: An explainable AI approach.肺腺癌和鳞状细胞癌转录组的比较研究以揭示潜在候选生物标志物:一种可解释的人工智能方法。
Comput Biol Chem. 2025 Apr;115:108333. doi: 10.1016/j.compbiolchem.2024.108333. Epub 2024 Dec 27.
2
Differential expression by epigenetic alterations in human lung squamous cell carcinoma and adenocarcinoma.人类肺鳞癌和腺癌中表观遗传改变的差异表达。
Oncol Rep. 2021 May;45(5). doi: 10.3892/or.2021.8029. Epub 2021 Mar 31.
3
Interpreting Lung Cancer Health Disparity between African American Males and European American Males.解读非裔美国男性与欧美裔美国男性之间的肺癌健康差异。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2024 Dec;2024:7141-7143. doi: 10.1109/bibm62325.2024.10822014.
4
Exploring and comparing of the gene expression and methylation differences between lung adenocarcinoma and squamous cell carcinoma.探索并比较肺腺癌和肺鳞癌之间的基因表达和甲基化差异。
J Cell Physiol. 2019 Apr;234(4):4454-4459. doi: 10.1002/jcp.27240. Epub 2018 Oct 14.
5
Differential prognostic impact and potential molecular mechanisms of PCDHGA12 expression in lung adenocarcinoma and squamous cell carcinoma.PCDHGA12 在肺腺癌和鳞癌中的差异预后影响及潜在分子机制。
Int Immunopharmacol. 2024 Sep 30;139:112727. doi: 10.1016/j.intimp.2024.112727. Epub 2024 Jul 26.
6
[Identification of differentially expressed genes between lung adenocarcinoma and squamous cell carcinoma using transcriber signature analysis].[利用转录特征分析鉴定肺腺癌与肺鳞癌之间的差异表达基因]
Nan Fang Yi Ke Da Xue Xue Bao. 2019 Jun 30;39(6):641-649. doi: 10.12122/j.issn.1673-4254.2019.06.03.
7
GPC3 affects the prognosis of lung adenocarcinoma and lung squamous cell carcinoma.GPC3 影响肺腺癌和肺鳞癌的预后。
BMC Pulm Med. 2021 Jun 10;21(1):199. doi: 10.1186/s12890-021-01549-9.
8
Comprehensive Analysis of Aberrantly Expressed Profiles of lncRNAs and miRNAs with Associated ceRNA Network in Lung Adenocarcinoma and Lung Squamous Cell Carcinoma.肺腺癌和肺鳞癌中异常表达的 lncRNAs 和 miRNAs 谱的综合分析及其相关 ceRNA 网络。
Pathol Oncol Res. 2020 Jul;26(3):1935-1945. doi: 10.1007/s12253-019-00780-4. Epub 2020 Jan 2.
9
An explainable AI-driven biomarker discovery framework for Non-Small Cell Lung Cancer classification.基于可解释人工智能的非小细胞肺癌分类生物标志物发现框架。
Comput Biol Med. 2023 Feb;153:106544. doi: 10.1016/j.compbiomed.2023.106544. Epub 2023 Jan 12.
10
Prognostic value of immune-related genes in the tumor microenvironment of lung adenocarcinoma and lung squamous cell carcinoma.免疫相关基因在肺腺癌和肺鳞癌肿瘤微环境中的预后价值。
Aging (Albany NY). 2020 Mar 25;12(6):4757-4777. doi: 10.18632/aging.102871.