Node-adaptive graph Transformer with structural encoding for accurate and robust lncRNA-disease association prediction.

Affiliations

School of Information Engineering, East China Jiaotong University, Nanchang, China.

School of Information Science and Engineering, Shandong Normal University, Jinan, China.

Publication information

BMC Genomics. 2024 Jan 18;25(1):73. doi: 10.1186/s12864-024-09998-2.

DOI:10.1186/s12864-024-09998-2

PMID:38233788

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10795365/

Abstract

BACKGROUND

Long noncoding RNAs (lncRNAs) are integral to a plethora of critical cellular biological processes, including the regulation of gene expression, cell differentiation, and the development of tumors and cancers. Predicting the relationships between lncRNAs and diseases can contribute to a better understanding of the pathogenic mechanisms of disease and provide strong support for the development of advanced treatment methods.

RESULTS

We present NAGTLDA, a Node-Adaptive Graph Transformer model for predicting unknown LncRNA-Disease Associations. First, we use the node-adaptive feature smoothing (NAFS) method to learn the local feature information of nodes, and we encode the structural information of the fused similarity network of diseases and lncRNAs using Structural Deep Network Embedding (SDNE). Next, a Transformer module captures potential association information between the network nodes. Finally, a Transformer module with two multi-head attention layers learns the global-level embedding fusion. The network structure encoding is added as a structural inductive bias to compensate for the message-passing mechanism that the Transformer lacks. In 5-fold cross-validation, NAGTLDA achieved an average AUC of 0.9531 and AUPR of 0.9537, significantly higher than state-of-the-art methods. In case studies on 4 diseases, 55 of 60 associations between lncRNAs and diseases have been validated in the literature. The results demonstrate the potential of a graph Transformer that incorporates graph structural information for uncovering unknown lncRNA-disease associations.
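The AUC and AUPR figures above are averages over 5 cross-validation folds. As a rough illustration of how such averages can be computed, the following is a minimal sketch in plain Python, not the authors' evaluation code. The function names (`k_fold_indices`, `roc_auc`, `average_precision`, `mean_over_folds`) are hypothetical, and the use of average precision as the AUPR estimator is an assumption, since the abstract does not say how the area under the precision-recall curve was computed.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Tied scores count as half a correct pair.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Average precision, one common estimator of the area under the PR curve.

    Assumption: tied scores are ordered arbitrarily (by the stable sort).
    """
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive sample")
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits = 0
    total = 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank  # precision at each positive hit
    return total / n_pos


def mean_over_folds(labels, scores, folds, metric):
    """Evaluate `metric` on each held-out fold and return the mean.

    `scores[i]` is assumed to come from a model trained without sample i's fold.
    """
    values = []
    for fold in folds:
        fold_labels = [labels[i] for i in fold]
        fold_scores = [scores[i] for i in fold]
        values.append(metric(fold_labels, fold_scores))
    return sum(values) / len(values)
```

In use, one would call `k_fold_indices(n, k=5)` over the labelled lncRNA-disease pairs, train on four folds, score the held-out fold, and pass the collected scores to `mean_over_folds` with `roc_auc` or `average_precision`. With strongly imbalanced labels, a stratified split is usually preferable so that every fold contains both classes.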

CONCLUSIONS

Our proposed NAGTLDA model can serve as a highly efficient computational method for predicting biological information associations.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea6b/10795365/a8e1e321310f/12864_2024_9998_Fig1_HTML.jpg

Similar articles

Learning global dependencies and multi-semantics within heterogeneous graph for predicting disease-related lncRNAs.

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac361.

GCNFORMER: graph convolutional network and transformer for predicting lncRNA-disease associations.

BMC Bioinformatics. 2024 Jan 2;25(1):5. doi: 10.1186/s12859-023-05625-1.

Mask-Guided Target Node Feature Learning and Dynamic Detailed Feature Enhancement for lncRNA-Disease Association Prediction.

J Chem Inf Model. 2024 Aug 26;64(16):6662-6675. doi: 10.1021/acs.jcim.4c00652. Epub 2024 Aug 7.

Graph Convolutional Network and Convolutional Neural Network Based Method for Predicting lncRNA-Disease Associations.

Cells. 2019 Aug 30;8(9):1012. doi: 10.3390/cells8091012.

Attentional multi-level representation encoding based on convolutional and variance autoencoders for lncRNA-disease association prediction.

Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa067.

Predicting lncRNA-disease associations using multiple metapaths in hierarchical graph attention networks.

BMC Bioinformatics. 2024 Jan 29;25(1):46. doi: 10.1186/s12859-024-05672-2.

gGATLDA: lncRNA-disease association prediction based on graph-level graph attention network.

BMC Bioinformatics. 2022 Jan 4;23(1):11. doi: 10.1186/s12859-021-04548-z.

Heterogeneous graph attention network based on meta-paths for lncRNA-disease association prediction.

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab407.

Multi-task prediction-based graph contrastive learning for inferring the relationship among lncRNAs, miRNAs and diseases.

Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad276.

Cited by

LDA-SCGB: inferring lncRNA-disease associations based on condensed gradient boosting.

BMC Bioinformatics. 2025 Jul 22;26(1):190. doi: 10.1186/s12859-025-06169-2.

HGCMLDA: predicting lncRNA-disease associations using hypergraph contrastive learning and multi-scale attentional feature fusion.

Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf262.

AMFCL: Predicting miRNA-Disease Associations Through Adaptive Multi-source Modality Fusion and Contrastive Learning.

Interdiscip Sci. 2025 Jun 2. doi: 10.1007/s12539-025-00724-4.

RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.

Heliyon. 2025 Jan 6;11(2):e41488. doi: 10.1016/j.heliyon.2024.e41488. eCollection 2025 Jan 30.

Herb-disease association prediction model based on network consistency projection.

Sci Rep. 2025 Jan 27;15(1):3328. doi: 10.1038/s41598-025-87521-7.

Predicting noncoding RNA and disease associations using multigraph contrastive learning.

Sci Rep. 2025 Jan 2;15(1):230. doi: 10.1038/s41598-024-81862-5.

Predicting microbe-disease associations via graph neural network and contrastive learning.

Front Microbiol. 2024 Dec 13;15:1483983. doi: 10.3389/fmicb.2024.1483983. eCollection 2024.

Prediction of miRNA-disease association based on multisource inductive matrix completion.

Sci Rep. 2024 Nov 11;14(1):27503. doi: 10.1038/s41598-024-78212-w.

Deep learning model for protein multi-label subcellular localization and function prediction based on multi-task collaborative training.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae568.

Predicting lncRNA-Disease Associations Based on a Dual-Path Feature Extraction Network with Multiple Sources of Information Integration.

ACS Omega. 2024 Jul 30;9(32):35100-35112. doi: 10.1021/acsomega.4c05365. eCollection 2024 Aug 13.
