• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基于直方图的图表示学习和梯度提升来解码潜在的长链非编码RNA与疾病的关联。

Decoding potential lncRNA and disease associations through graph representation learning and gradient boosting with histogram.

作者信息

Tang Lili, Liu Longlong, Jiang Yan, Yuan Yi

机构信息

School of Computer Science and Artificial Intelligence, Hunan University of Technology, Zhuzhou, 412007, China.

School of Biological Science and Medical Engineering, Hunan University of Technology, Zhuzhou, 412007, China.

出版信息

Sci Rep. 2025 Aug 26;15(1):31407. doi: 10.1038/s41598-025-16177-0.

DOI:10.1038/s41598-025-16177-0
PMID:40858876
Abstract

Long noncoding RNAs (lncRNAs) are important regulators and promising targets for complex diseases. They have manifested dense relationships with various diseases. Although laboratory techniques have validated many lncRNA-disease associations (LDAs), they are costly, laborious, and time-consuming. This study introduces LDA-GMCB, an LDA inference model, by leveraging graph embedding learning, multi-head self-attention mechanism (MSA) with convolutional neural network (CNN), low-rank singular value decomposition (SVD), and histogram-based gradient boosting (HGBoost). For all lncRNAs and diseases, LDA-GMCB first deciphers their nonlinear features by incorporating graph embedding learning and MSA with CNN, then captures their linear features through low-rank SVD, and finally infers their relationships based on HGBoost. LDA-GMCB was compared with four baselines (i.e., SDLDA, LDNFSGB, IPCARF and LDA-VGHB) under 5-fold cross validation and two cold start scenarios, and four popular classifiers (i.e., multi-layer perceptron, SVM, random forest, and XGBoost). Additionally, LDA-GMCB implemented ablation study. The outcomes demonstrated that LDA-GMCB greatly surpassed the above models and gained significant improvement on two public databases (i.e., lncRNADisease and MNDR) under most conditions. Moreover, LDA-GMCB was further applied to infer potential lncRNAs for Alzheimer's disease and Parkinson's disease. It identified that DGCR5 and HIF1A could link with the two diseases, respectively. We hope that LDA-GMCB help infer potential lncRNAs for various complex diseases. LDA-GMCB is freely available at https://github.com/smiling199/LDA-GMCB .

摘要

长链非编码RNA(lncRNAs)是复杂疾病的重要调节因子和有前景的靶点。它们已显示出与各种疾病的密切关系。尽管实验室技术已验证了许多lncRNA-疾病关联(LDAs),但这些技术成本高、费力且耗时。本研究通过利用图嵌入学习、结合卷积神经网络(CNN)的多头自注意力机制(MSA)、低秩奇异值分解(SVD)和基于直方图的梯度提升(HGBoost),引入了一种LDA推理模型LDA-GMCB。对于所有lncRNAs和疾病,LDA-GMCB首先通过将图嵌入学习和MSA与CNN相结合来解读它们的非线性特征,然后通过低秩SVD捕捉它们的线性特征,最后基于HGBoost推断它们之间的关系。在5折交叉验证和两种冷启动场景下,将LDA-GMCB与四个基线(即SDLDA、LDNFSGB、IPCARF和LDA-VGHB)以及四个流行的分类器(即多层感知器、支持向量机、随机森林和XGBoost)进行了比较。此外,LDA-GMCB进行了消融研究。结果表明,LDA-GMCB在大多数情况下大大超过了上述模型,并在两个公共数据库(即lncRNADisease和MNDR)上取得了显著改进。此外,LDA-GMCB还被进一步应用于推断阿尔茨海默病和帕金森病的潜在lncRNAs。研究发现DGCR5和HIF1A可能分别与这两种疾病相关。我们希望LDA-GMCB有助于推断各种复杂疾病的潜在lncRNAs。LDA-GMCB可在https://github.com/smiling199/LDA-GMCB上免费获取。

相似文献

1
Decoding potential lncRNA and disease associations through graph representation learning and gradient boosting with histogram.通过基于直方图的图表示学习和梯度提升来解码潜在的长链非编码RNA与疾病的关联。
Sci Rep. 2025 Aug 26;15(1):31407. doi: 10.1038/s41598-025-16177-0.
2
Predicting lncRNA and disease associations with graph autoencoder and noise robust gradient boosting.使用图自动编码器和噪声鲁棒梯度提升预测长链非编码RNA与疾病的关联
Sci Rep. 2025 May 31;15(1):19178. doi: 10.1038/s41598-025-03269-0.
3
LDA-SCGB: inferring lncRNA-disease associations based on condensed gradient boosting.LDA-SCGB:基于凝聚梯度提升推断长链非编码RNA与疾病的关联
BMC Bioinformatics. 2025 Jul 22;26(1):190. doi: 10.1186/s12859-025-06169-2.
4
LDA-VGHB: identifying potential lncRNA-disease associations with singular value decomposition, variational graph auto-encoder and heterogeneous Newton boosting machine.LDA-VGHB:基于奇异值分解、变分图自动编码器和异质牛顿提升机识别潜在的 lncRNA-疾病关联。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad466.
5
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
6
GEnDDn: An lncRNA-Disease Association Identification Framework Based on Dual-Net Neural Architecture and Deep Neural Network.GEnDDn:一种基于双网络神经架构和深度神经网络的 lncRNA-疾病关联识别框架。
Interdiscip Sci. 2024 Jun;16(2):418-438. doi: 10.1007/s12539-024-00619-w. Epub 2024 May 11.
7
Self-Supervised Contrastive Learning on Attribute and Topology Graphs for Predicting Relationships Among lncRNAs, miRNAs and Diseases.基于属性和拓扑图的自监督对比学习用于预测长链非编码RNA、微小RNA与疾病之间的关系
IEEE J Biomed Health Inform. 2025 Jan;29(1):657-668. doi: 10.1109/JBHI.2024.3467101. Epub 2025 Jan 7.
8
LncRNA-miRNA interaction prediction based on multi-source heterogeneous graph neural network and multi-level attention mechanism.基于多源异构图神经网络和多层次注意力机制的长链非编码RNA-微小RNA相互作用预测
Int J Biol Macromol. 2025 Aug;319(Pt 4):145614. doi: 10.1016/j.ijbiomac.2025.145614. Epub 2025 Jun 27.
9
Development of Machine Learning-based Algorithms to Predict the 2- and 5-year Risk of TKA After Tibial Plateau Fracture Treatment.基于机器学习的算法用于预测胫骨平台骨折治疗后2年和5年全膝关节置换风险的研究进展
Clin Orthop Relat Res. 2025 Mar 12. doi: 10.1097/CORR.0000000000003442.
10
The landscape of long non-coding RNA during cSCC progression.皮肤鳞状细胞癌进展过程中长链非编码RNA的情况
Br J Dermatol. 2025 Mar 27. doi: 10.1093/bjd/ljaf108.

本文引用的文献

1
Prediction of ncRNA-Disease Association Based on Correntropy Induced Loss Matrix Factorization Model.基于核相关向量机诱导损失矩阵分解模型的非编码RNA-疾病关联预测
IEEE Trans Comput Biol Bioinform. 2025 Jul-Aug;22(4):1861-1874. doi: 10.1109/TCBBIO.2025.3574189.
2
Prediction of Ligand-Receptor Interactions Based on CatBoost and Deep Forest and Their Application in Cell-Cell Communication Analysis.基于CatBoost和深度森林的配体-受体相互作用预测及其在细胞-细胞通讯分析中的应用
J Chem Inf Model. 2025 Jun 23;65(12):6341-6366. doi: 10.1021/acs.jcim.5c00726. Epub 2025 May 30.
3
Neighborhood-Regularized Matrix Factorization for lncRNA-Disease Association Identification.
用于长链非编码RNA-疾病关联识别的邻域正则化矩阵分解
Int J Mol Sci. 2025 Apr 30;26(9):4283. doi: 10.3390/ijms26094283.
4
A novel end-to-end learning framework for inferring lncRNA-disease associations based on convolution neural network.一种基于卷积神经网络推断长链非编码RNA与疾病关联的新型端到端学习框架。
Front Genet. 2025 Apr 9;16:1580512. doi: 10.3389/fgene.2025.1580512. eCollection 2025.
5
DeepWalk-Based Graph Embeddings for miRNA-Disease Association Prediction Using Deep Neural Network.基于深度游走的图嵌入用于使用深度神经网络进行miRNA-疾病关联预测
Biomedicines. 2025 Feb 20;13(3):536. doi: 10.3390/biomedicines13030536.
6
MFF-nDA: A Computational Model for ncRNA-Disease Association Prediction Based on Multimodule Fusion.MFF-nDA:一种基于多模块融合的非编码RNA-疾病关联预测计算模型。
J Chem Inf Model. 2025 Apr 14;65(7):3324-3342. doi: 10.1021/acs.jcim.5c00174. Epub 2025 Mar 24.
7
lncRNA-disease association prediction based on optimizing measures of multi-graph regularized matrix factorization.基于多图正则化矩阵分解优化方法的长链非编码RNA-疾病关联预测
Comput Methods Biomech Biomed Engin. 2025 Mar 20:1-16. doi: 10.1080/10255842.2025.2479854.
8
A heterogeneous information network learning model with neighborhood-level structural representation for predicting lncRNA-miRNA interactions.一种具有邻域级结构表示的异构信息网络学习模型,用于预测lncRNA- miRNA相互作用。
Comput Struct Biotechnol J. 2024 Jul 6;23:2924-2933. doi: 10.1016/j.csbj.2024.06.032. eCollection 2024 Dec.
9
DO-GMA: An End-to-End Drug-Target Interaction Identification Framework with a Depthwise Overparameterized Convolutional Network and the Gated Multihead Attention Mechanism.DO-GMA:一种具有深度超参数化卷积网络和门控多头注意力机制的端到端药物-靶点相互作用识别框架。
J Chem Inf Model. 2025 Feb 10;65(3):1318-1337. doi: 10.1021/acs.jcim.4c02088. Epub 2025 Jan 28.
10
Graph Convolutional Network with Neural Collaborative Filtering for Predicting miRNA-Disease Association.基于神经协同过滤的图卷积网络用于预测miRNA-疾病关联
Biomedicines. 2025 Jan 8;13(1):136. doi: 10.3390/biomedicines13010136.