• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于Shapley值加法解释的特征选择揭示CXCL14是预测特发性肺纤维化的关键免疫相关基因。

Shapley additive explanations based feature selection reveals CXCL14 as a key immune-related gene in predicting idiopathic pulmonary fibrosis.

作者信息

Chen Bin, Huan Lu, Lu Junyu, Yuan Jinhe

机构信息

Department of Geriatric Palliative Medicine, Chongqing Liangjiang New District People Hospital, Chongqing, China.

Department of Respiratory and Critical Care Medicine, Renji Hospital, School of Medicine, Chongqing University, Chongqing, China.

出版信息

Front Med (Lausanne). 2025 Aug 6;12:1608078. doi: 10.3389/fmed.2025.1608078. eCollection 2025.

DOI:10.3389/fmed.2025.1608078
PMID:40842541
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12364669/
Abstract

BACKGROUND

Idiopathic pulmonary fibrosis (IPF) is a progressive lung disease marked by excessive fibrous tissue accumulation in the lung interstitium, leading to a gradual deterioration of respiratory function and significantly impairing patients' quality of life. Despite advances in understanding its etiology and pathogenesis, the exact mechanisms remain unclear, underscoring the need for novel biomarkers and therapeutic targets.

METHODS

We analyzed five publicly available datasets from the Gene Expression Omnibus (GEO), specifically "GSE15197," "GSE53845," "GSE135065," "GSE185691," and "GSE195770," to identify gene expression changes associated with IPF. Data were annotated and normalized to minimize batch effects and technical variability. Principal Component Analysis (PCA) verified preprocessing efficacy. Differentially expressed genes (DEGs) were identified using linear modeling. Core DEGs were selected via integrative analysis across datasets.

RESULTS

Our analysis revealed DEGs that are substantially linked to crucial biological processes such as extracellular matrix organization and immune response regulation. Integrative analysis of five GEO datasets identified CXCL14, MMP7, and MDK as core differentially expressed genes in the final predictive model. Using Least Absolute Shrinkage and Selection Operator (LASSO) regression and Random Forest, we constructed a logistic regression model with robust predictive performance, achieving an AUC of 0.92 in the training cohort and 0.89 in the validation cohort, with sensitivity of 88% and specificity of 85%. The Shapley Additive Explanations (SHAP) method identified CXCL14 (mean SHAP value = 0.38) as the most influential feature, followed by MMP7 and MDK. Functional enrichment analyses highlighted significant enrichment of TGF- signaling, extracellular matrix organization, and chemokine signaling pathways. Immune infiltration analysis revealed positive correlations between CXCL14 expression and alveolar macrophage/activated fibroblast populations, while SHAP interaction analysis identified synergistic effects between CXCL14 and TGF-β1 in driving fibrosis.

CONCLUSION

These findings substantiate the hypothesis that IPF pathogenesis is closely linked to extracellular matrix remodeling and immune dysregulation. This suggests that future investigations should delve deeper into the practical applications of identified biomarkers in the early diagnosis and management of IPF. Furthermore, the machine learning-based predictive model demonstrates strong clinical potential and merits further validation in prospective trials to assess its utility and therapeutic implications in real-world settings.

摘要

背景

特发性肺纤维化(IPF)是一种进行性肺部疾病,其特征是肺间质中纤维组织过度积聚,导致呼吸功能逐渐恶化,并严重损害患者的生活质量。尽管在理解其病因和发病机制方面取得了进展,但确切机制仍不清楚,这凸显了对新型生物标志物和治疗靶点的需求。

方法

我们分析了来自基因表达综合数据库(GEO)的五个公开可用数据集,具体为“GSE15197”、“GSE53845”、“GSE135065”、“GSE185691”和“GSE195770”,以确定与IPF相关的基因表达变化。对数据进行注释和标准化,以尽量减少批次效应和技术变异性。主成分分析(PCA)验证了预处理效果。使用线性模型识别差异表达基因(DEG)。通过跨数据集的综合分析选择核心DEG。

结果

我们的分析揭示了与细胞外基质组织和免疫反应调节等关键生物学过程密切相关的DEG。对五个GEO数据集的综合分析确定CXCL14、MMP7和MDK为最终预测模型中的核心差异表达基因。使用最小绝对收缩和选择算子(LASSO)回归和随机森林,我们构建了一个具有强大预测性能的逻辑回归模型,在训练队列中的AUC为0.92,在验证队列中的AUC为0.89,敏感性为88%,特异性为85%。Shapley加性解释(SHAP)方法确定CXCL14(平均SHAP值=0.38)为最具影响力的特征,其次是MMP7和MDK。功能富集分析突出了TGF-信号传导、细胞外基质组织和趋化因子信号通路的显著富集。免疫浸润分析显示CXCL14表达与肺泡巨噬细胞/活化成纤维细胞群体之间呈正相关,而SHAP相互作用分析确定CXCL14和TGF-β1在驱动纤维化方面具有协同作用。

结论

这些发现证实了IPF发病机制与细胞外基质重塑和免疫失调密切相关的假设。这表明未来的研究应更深入地探讨已识别生物标志物在IPF早期诊断和管理中的实际应用。此外,基于机器学习的预测模型显示出强大的临床潜力,值得在前瞻性试验中进一步验证,以评估其在实际临床环境中的效用和治疗意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/8031fb4e2f0e/fmed-12-1608078-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/0e81d92a7dbe/fmed-12-1608078-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/d9a7c7ba9aeb/fmed-12-1608078-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/c1bdccabfd05/fmed-12-1608078-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/ebe25de22583/fmed-12-1608078-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/5eb017500419/fmed-12-1608078-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/8031fb4e2f0e/fmed-12-1608078-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/0e81d92a7dbe/fmed-12-1608078-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/d9a7c7ba9aeb/fmed-12-1608078-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/c1bdccabfd05/fmed-12-1608078-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/ebe25de22583/fmed-12-1608078-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/5eb017500419/fmed-12-1608078-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cab/12364669/8031fb4e2f0e/fmed-12-1608078-g006.jpg

相似文献

1
Shapley additive explanations based feature selection reveals CXCL14 as a key immune-related gene in predicting idiopathic pulmonary fibrosis.基于Shapley值加法解释的特征选择揭示CXCL14是预测特发性肺纤维化的关键免疫相关基因。
Front Med (Lausanne). 2025 Aug 6;12:1608078. doi: 10.3389/fmed.2025.1608078. eCollection 2025.
2
Machine learning based screening of biomarkers associated with cell death and immunosuppression of multiple life stages sepsis populations.基于机器学习对与多生命阶段脓毒症人群细胞死亡和免疫抑制相关生物标志物的筛选。
Sci Rep. 2025 Aug 19;15(1):30302. doi: 10.1038/s41598-025-14600-0.
3
Potential role of DKK3 and WIF1 in prostate cancer: bioinformatics and clinical analysis.DKK3和WIF1在前列腺癌中的潜在作用:生物信息学与临床分析
Discov Oncol. 2025 Aug 27;16(1):1637. doi: 10.1007/s12672-025-03488-x.
4
Molecular mechanisms of efferocytosis imbalance in the idiopathic pulmonary fibrosis microenvironment: from gene screening to dynamic regulation analysis.特发性肺纤维化微环境中胞葬作用失衡的分子机制:从基因筛选到动态调控分析
Biol Direct. 2025 Jul 15;20(1):83. doi: 10.1186/s13062-025-00658-3.
5
Machine learning and bioinformatics analysis to identify and validate diagnostic model associated with immune infiltration in rheumatoid arthritis.机器学习和生物信息学分析以识别和验证与类风湿关节炎免疫浸润相关的诊断模型。
Clin Rheumatol. 2025 Jun 11. doi: 10.1007/s10067-025-07514-9.
6
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
7
Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.通过综合生物信息学分析和机器学习破译妊娠期糖尿病和子痫前期之间共享的基因特征及免疫浸润特征
Reprod Sci. 2025 May 15. doi: 10.1007/s43032-025-01847-1.
8
Systematic Analysis of an Immune-Related Gene Signature for Predicting Prognosis and Immune Characteristics in Primary Lower Grade Glioma.用于预测原发性低级别胶质瘤预后和免疫特征的免疫相关基因特征的系统分析
Biomed Res Int. 2025 Aug 12;2025:6180391. doi: 10.1155/bmri/6180391. eCollection 2025.
9
Developing a Panel of Shared Susceptibility Genes as Diagnostic Biomarkers for chronic obstructive pulmonary disease and Heart Failure.开发一组共享易感性基因作为慢性阻塞性肺疾病和心力衰竭的诊断生物标志物。
Comput Biol Med. 2025 Sep;196(Pt A):110657. doi: 10.1016/j.compbiomed.2025.110657. Epub 2025 Jul 4.
10
LYN and CYBB are pivotal immune and inflammatory genes as diagnostic biomarkers in recurrent spontaneous abortion.LYN和CYBB作为复发性自然流产的诊断生物标志物,是关键的免疫和炎症基因。
Front Immunol. 2025 Jul 7;16:1568536. doi: 10.3389/fimmu.2025.1568536. eCollection 2025.

本文引用的文献

1
Cluster analysis of blood biomarkers to identify molecular patterns in pulmonary fibrosis: assessment of a multicentre, prospective, observational cohort with independent validation.对血液生物标志物进行聚类分析,以确定肺纤维化的分子模式:对多中心、前瞻性、观察性队列进行评估,并进行独立验证。
Lancet Respir Med. 2024 Sep;12(9):681-692. doi: 10.1016/S2213-2600(24)00147-4. Epub 2024 Jul 15.
2
Monocytes and macrophages: emerging mechanisms and novel therapeutic targets in pulmonary fibrosis.单核细胞和巨噬细胞:肺纤维化中新兴的机制和新的治疗靶点。
Am J Physiol Cell Physiol. 2023 Oct 1;325(4):C1046-C1057. doi: 10.1152/ajpcell.00302.2023. Epub 2023 Sep 11.
3
Fibroblast-derived CXCL14 aggravates crystalline silica-induced pulmonary fibrosis by mediating polarization and recruitment of interstitial macrophages.
成纤维细胞衍生的趋化因子 14 通过调节间质性巨噬细胞的极化和募集加重二氧化硅诱导的肺纤维化。
J Hazard Mater. 2023 Oct 15;460:132489. doi: 10.1016/j.jhazmat.2023.132489. Epub 2023 Sep 8.
4
Exploration of the shared genes and signaling pathways between lung adenocarcinoma and idiopathic pulmonary fibrosis.肺腺癌与特发性肺纤维化之间共享基因及信号通路的探索。
J Thorac Dis. 2023 Jun 30;15(6):3054-3068. doi: 10.21037/jtd-22-1522. Epub 2023 May 9.
5
An explainable machine learning-driven proposal of pulmonary fibrosis biomarkers.一种基于可解释机器学习的肺纤维化生物标志物提议。
Comput Struct Biotechnol J. 2023;21:2305-2315. doi: 10.1016/j.csbj.2023.03.043. Epub 2023 Mar 25.
6
Cell-Surface Programmed Death Ligand-1 Expression Identifies a Sub-Population of Distal Epithelial Cells Enriched in Idiopathic Pulmonary Fibrosis.细胞表面程序性死亡配体-1 的表达鉴定出特发性肺纤维化中富含的远端上皮细胞亚群。
Cells. 2022 May 10;11(10):1593. doi: 10.3390/cells11101593.
7
Idiopathic Pulmonary Fibrosis (an Update) and Progressive Pulmonary Fibrosis in Adults: An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline.特发性肺纤维化(更新版)和成人进展性肺纤维化:美国胸科学会/欧洲呼吸学会/日本呼吸学会/拉丁美洲胸科学会临床实践指南。
Am J Respir Crit Care Med. 2022 May 1;205(9):e18-e47. doi: 10.1164/rccm.202202-0399ST.
8
Differential LysoTracker Uptake Defines Two Populations of Distal Epithelial Cells in Idiopathic Pulmonary Fibrosis.差异 LysoTracker 摄取定义特发性肺纤维化中两个远端上皮细胞群体。
Cells. 2022 Jan 11;11(2):235. doi: 10.3390/cells11020235.
9
Integrated analysis of multimodal single-cell data.多模态单细胞数据的综合分析。
Cell. 2021 Jun 24;184(13):3573-3587.e29. doi: 10.1016/j.cell.2021.04.048. Epub 2021 May 31.
10
Title of article: Mucosal-associated invariant T cells in lung diseases.文章标题:肺部疾病中的黏膜相关恒定 T 细胞。
Int Immunopharmacol. 2021 May;94:107485. doi: 10.1016/j.intimp.2021.107485. Epub 2021 Feb 26.