• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习驱动的圆锥角膜转录组分析以识别预测性生物标志物

Machine Learning-Driven Transcriptome Analysis of Keratoconus for Predictive Biomarker Identification.

作者信息

Chang Shao-Hsuan, Yeh Lung-Kun, Hung Kuo-Hsuan, Chiu Yen-Jung, Hsieh Chia-Hsun, Ma Chung-Pei

机构信息

Department of Biomedical Engineering, Chang Gung University, Taoyuan 33302, Taiwan.

Department of Ophthalmology, Linkou Chang Gung Memorial Hospital, Taoyuan 33305, Taiwan.

出版信息

Biomedicines. 2025 Apr 24;13(5):1032. doi: 10.3390/biomedicines13051032.

DOI:10.3390/biomedicines13051032
PMID:40426861
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12109562/
Abstract

Keratoconus (KTCN) is a multifactorial disease characterized by progressive corneal degeneration. Recent studies suggest that a gene expression analysis of corneas may uncover potential novel biomarkers involved in corneal matrix remodeling. However, identifying reliable combinations of biomarkers that are linked to disease risk or progression remains a significant challenge. This study employed multiple machine learning algorithms to analyze the transcriptomes of keratoconus patients, identifying feature gene combinations and their functional associations, with the aim of enhancing the understanding of keratoconus pathogenesis. We analyzed the GSE77938 (PRJNA312169) dataset for differential gene expression (DGE) and performed gene set enrichment analysis (GSEA) using Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways to identify enriched pathways in keratoconus (KTCN) versus controls. Machine learning algorithms were then used to analyze the gene sets, with SHapley Additive exPlanations (SHAP) applied to assess the contribution of key feature genes in the model's predictions. Selected feature genes were further analyzed through Gene Ontology (GO) enrichment to explore their roles in biological processes and cellular functions. Machine learning models, including XGBoost, Random Forest, Logistic Regression, and SVM, identified a set of important feature genes associated with keratoconus, with 15 notable genes appearing across multiple models, such as , , , , , , , , and others. The under-expressed genes in KTCN were involved in the mechanical resistance of the epidermis (, ) and in inflammation pathways (, , , , and ), as compared to controls. The GO analysis highlighted that the complex and its associated genes were primarily involved in biological processes related to the cytoskeleton organization, inflammation, and immune response. Furthermore, we expanded our analysis by incorporating additional datasets from PRJNA636666 and PRJNA1184491, thereby offering a broader representation of gene features and increasing the generalizability of our results across diverse cohorts. The differing gene sets identified by XGBoost and SVM may reflect distinct but complementary aspects of keratoconus pathophysiology. Meanwhile, XGBoost captured key immune and chemotactic regulators (e.g., , ), suggesting upstream inflammatory signaling pathways. SVM highlighted structural and epithelial differentiation markers (e.g., , ), possibly reflecting downstream tissue remodeling and stress responses. Our findings provide a novel research platform for the evaluation of keratoconus using machine learning-based approaches, offering valuable insights into its pathogenesis and potential therapeutic targets.

摘要

圆锥角膜(KTCN)是一种以进行性角膜变性为特征的多因素疾病。最近的研究表明,对角膜进行基因表达分析可能会发现参与角膜基质重塑的潜在新生物标志物。然而,确定与疾病风险或进展相关的可靠生物标志物组合仍然是一项重大挑战。本研究采用多种机器学习算法分析圆锥角膜患者的转录组,确定特征基因组合及其功能关联,旨在加深对圆锥角膜发病机制的理解。我们分析了GSE77938(PRJNA312169)数据集的差异基因表达(DGE),并使用京都基因与基因组百科全书(KEGG)通路进行基因集富集分析(GSEA),以确定圆锥角膜(KTCN)与对照组中富集的通路。然后使用机器学习算法分析基因集,并应用SHapley加性解释(SHAP)来评估关键特征基因在模型预测中的贡献。通过基因本体(GO)富集进一步分析选定的特征基因,以探索它们在生物过程和细胞功能中的作用。包括XGBoost、随机森林、逻辑回归和支持向量机在内的机器学习模型确定了一组与圆锥角膜相关的重要特征基因,有15个显著基因出现在多个模型中,如 、 、 、 、 、 、 、 等。与对照组相比,圆锥角膜中表达下调的基因参与表皮的机械抗性( 、 )和炎症通路( 、 、 、 、 )。GO分析突出显示, 复合体及其相关基因主要参与与细胞骨架组织、炎症和免疫反应相关的生物过程。此外,我们通过纳入来自PRJNA636666和PRJNA1184491的其他数据集扩展了分析,从而更广泛地展示了基因特征,并提高了我们结果在不同队列中的通用性。XGBoost和支持向量机确定的不同基因集可能反映了圆锥角膜病理生理学中不同但互补的方面。同时,XGBoost捕获了关键的免疫和趋化调节因子(如 、 ),提示上游炎症信号通路。支持向量机突出显示了结构和上皮分化标志物(如 、 ),可能反映了下游组织重塑和应激反应。我们的研究结果为使用基于机器学习的方法评估圆锥角膜提供了一个新的研究平台,为其发病机制和潜在治疗靶点提供了有价值的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/56809a256e00/biomedicines-13-01032-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/fffe71114fec/biomedicines-13-01032-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/8f598151be1f/biomedicines-13-01032-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/f0525f9f602c/biomedicines-13-01032-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/5db1d68315ec/biomedicines-13-01032-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/493135ec99de/biomedicines-13-01032-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/14c21c9ca1e5/biomedicines-13-01032-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/1bd8ee0b835f/biomedicines-13-01032-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/56809a256e00/biomedicines-13-01032-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/fffe71114fec/biomedicines-13-01032-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/8f598151be1f/biomedicines-13-01032-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/f0525f9f602c/biomedicines-13-01032-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/5db1d68315ec/biomedicines-13-01032-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/493135ec99de/biomedicines-13-01032-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/14c21c9ca1e5/biomedicines-13-01032-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/1bd8ee0b835f/biomedicines-13-01032-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9c48/12109562/56809a256e00/biomedicines-13-01032-g008.jpg

相似文献

1
Machine Learning-Driven Transcriptome Analysis of Keratoconus for Predictive Biomarker Identification.机器学习驱动的圆锥角膜转录组分析以识别预测性生物标志物
Biomedicines. 2025 Apr 24;13(5):1032. doi: 10.3390/biomedicines13051032.
2
Development of machine learning models for diagnostic biomarker identification and immune cell infiltration analysis in PCOS.用于多囊卵巢综合征诊断生物标志物识别和免疫细胞浸润分析的机器学习模型的开发。
J Ovarian Res. 2025 Jan 3;18(1):1. doi: 10.1186/s13048-024-01583-1.
3
Machine learning based identification of anoikis related gene classification patterns and immunoinfiltration characteristics in diabetic nephropathy.基于机器学习的糖尿病肾病中失巢凋亡相关基因分类模式及免疫浸润特征的识别
Sci Rep. 2025 May 1;15(1):15271. doi: 10.1038/s41598-025-99395-w.
4
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis.基于机器学习的糖尿病肾病肾小管间质损伤免疫相关生物标志物的鉴定:一项生物信息学多芯片综合分析
BioData Min. 2024 Jul 1;17(1):20. doi: 10.1186/s13040-024-00369-x.
5
Exploration and verification a 13-gene diagnostic framework for ulcerative colitis across multiple platforms via machine learning algorithms.基于机器学习算法的多平台溃疡性结肠炎 13 基因诊断框架的探索与验证。
Sci Rep. 2024 Jul 1;14(1):15009. doi: 10.1038/s41598-024-65481-8.
6
Identification of Hub Genes and Key Pathways Associated with Sepsis Progression Using Weighted Gene Co-Expression Network Analysis and Machine Learning.使用加权基因共表达网络分析和机器学习识别与脓毒症进展相关的枢纽基因和关键通路
Int J Mol Sci. 2025 May 7;26(9):4433. doi: 10.3390/ijms26094433.
7
Identification of diagnostic biomarkers and molecular subtype analysis associated with m6A in Tuberculosis immunopathology using machine learning.利用机器学习鉴定结核病免疫病理学中与m6A相关的诊断生物标志物及分子亚型分析
Sci Rep. 2024 Dec 2;14(1):29982. doi: 10.1038/s41598-024-81790-4.
8
Exploration of the shared diagnostic genes and mechanisms between periodontitis and primary Sjögren's syndrome by integrated comprehensive bioinformatics analysis and machine learning.通过综合全面的生物信息学分析和机器学习,探讨牙周炎和原发性干燥综合征之间的共享诊断基因和机制。
Int Immunopharmacol. 2024 Nov 15;141:112899. doi: 10.1016/j.intimp.2024.112899. Epub 2024 Aug 13.
9
Integrative analysis of signaling and metabolic pathways, immune infiltration patterns, and machine learning-based diagnostic model construction in major depressive disorder.重度抑郁症中信号传导与代谢途径、免疫浸润模式的综合分析以及基于机器学习的诊断模型构建
Sci Rep. 2025 Apr 19;15(1):13519. doi: 10.1038/s41598-025-97623-x.
10
Identification of hub biomarkers of myocardial infarction by single-cell sequencing, bioinformatics, and machine learning.通过单细胞测序、生物信息学和机器学习鉴定心肌梗死的核心生物标志物
Front Cardiovasc Med. 2022 Jul 25;9:939972. doi: 10.3389/fcvm.2022.939972. eCollection 2022.

本文引用的文献

1
Non-genetic risk factors for keratoconus and its progression.圆锥角膜及其进展的非遗传危险因素。
Clin Exp Optom. 2025 Aug;108(6):648-656. doi: 10.1080/08164622.2024.2443454. Epub 2025 Jan 6.
2
A novel combined oxidative stress and extracellular matrix related predictive gene signature for keratoconus.一种用于圆锥角膜的新型联合氧化应激和细胞外基质相关预测基因特征。
Biochem Biophys Res Commun. 2025 Jan;742:151144. doi: 10.1016/j.bbrc.2024.151144. Epub 2024 Dec 5.
3
Construction and SHAP interpretability analysis of a risk prediction model for feeding intolerance in preterm newborns based on machine learning.
基于机器学习的早产儿喂养不耐受风险预测模型的构建及 SHAP 可解释性分析。
BMC Med Inform Decis Mak. 2024 Nov 18;24(1):342. doi: 10.1186/s12911-024-02751-5.
4
Prediction of Rock Unloading Strength Based on PSO-XGBoost Hybrid Models.基于粒子群优化-极限梯度提升混合模型的岩石卸荷强度预测
Materials (Basel). 2024 Aug 26;17(17):4214. doi: 10.3390/ma17174214.
5
Discovering biomarkers associated and predicting cardiovascular disease with high accuracy using a novel nexus of machine learning techniques for precision medicine.利用机器学习技术的新型融合,准确发现与心血管疾病相关的生物标志物并进行预测,为精准医疗提供支持。
Sci Rep. 2024 Jan 2;14(1):1. doi: 10.1038/s41598-023-50600-8.
6
AIPs-SnTCN: Predicting Anti-Inflammatory Peptides Using fastText and Transformer Encoder-Based Hybrid Word Embedding with Self-Normalized Temporal Convolutional Networks.AIPs-SnTCN:使用基于fastText和基于Transformer编码器的混合词嵌入与自归一化时间卷积网络预测抗炎肽
J Chem Inf Model. 2023 Nov 13;63(21):6537-6554. doi: 10.1021/acs.jcim.3c01563. Epub 2023 Oct 31.
7
Artificial intelligence for predictive biomarker discovery in immuno-oncology: a systematic review.人工智能在免疫肿瘤学预测生物标志物发现中的应用:系统评价。
Ann Oncol. 2024 Jan;35(1):29-65. doi: 10.1016/j.annonc.2023.10.125. Epub 2023 Oct 23.
8
Genes selection using deep learning and explainable artificial intelligence for chronic lymphocytic leukemia predicting the need and time to therapy.使用深度学习和可解释人工智能进行基因选择以预测慢性淋巴细胞白血病的治疗需求和治疗时机
Front Oncol. 2023 Aug 31;13:1198992. doi: 10.3389/fonc.2023.1198992. eCollection 2023.
9
The benefits and pitfalls of machine learning for biomarker discovery.机器学习在生物标志物发现中的优势和陷阱。
Cell Tissue Res. 2023 Oct;394(1):17-31. doi: 10.1007/s00441-023-03816-z. Epub 2023 Jul 27.
10
A biomarker discovery of acute myocardial infarction using feature selection and machine learning.利用特征选择和机器学习发现急性心肌梗死的生物标志物。
Med Biol Eng Comput. 2023 Oct;61(10):2527-2541. doi: 10.1007/s11517-023-02841-y. Epub 2023 May 18.