• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

X-LDA:一种用于lncRNA-疾病关联预测的可解释且基于知识的异构图学习框架。

X-LDA: An interpretable and knowledge-informed heterogeneous graph learning framework for LncRNA-disease association prediction.

作者信息

Cao Yangkun, Xiao Jun, Sheng Nan, Qu Yinwei, Wang Zhihang, Sun Chang, Mu Xuechen, Huang Zhenyu, Li Xuan

机构信息

School of Artificial Intelligence, Jilin University, Changchun, 130012, China.

College of Computer Science and Technology, Jilin University, Changchun, 130012, China.

出版信息

Comput Biol Med. 2023 Oct 27;167:107634. doi: 10.1016/j.compbiomed.2023.107634.

DOI:10.1016/j.compbiomed.2023.107634
PMID:39491920
Abstract

The identification of disease-related long noncoding RNAs (lncRNAs) is beneficial to unravel the intricacies of gene expression regulation and epigenetic signatures. Computational methods provide a cost-effective means to explore lncRNA-disease associations (LDAs). However, these methods often lack interpretability, leaving their predictions less convincing to biological and medical researchers. We propose an interpretable and knowledge-informed heterogeneous graph learning framework based on graph patch convolution and integrated gradients to predict LDAs and provides intuitive explanations for its predictions, called X-LDA. The heterogeneous graph is the foundation of the predictions of LDAs, we construct the knowledge-informed heterogeneous graph including LDAs drawn from biological experiments, lncRNA similarities rooted in gene sequences, disease similarities constructed based on disease categorizations. To integrate diverse biological premises and facilitate interpretability, we define nine distinct graph patch types, which encapsulate essential topological relationships within lncRNA-disease node pairs. X-LDA is designed to employ parameter sharing and multi-convolution kernels to grasp common and multiple perspectives of the graph patches, respectively. This approach culminates in the fusion of various semantic information into context embeddings. These post-hoc explanations hinge on graph patch features and integrated gradients, shedding light on the underlying factors driving predictions. Cross validation experiment on the dataset curated from databases and literatures demonstrates that the superior performance of X-LDA in comparison to nine state-of-the-art methods of three categories. X-LDA achieves a larger average area under the receiver operating curve 0.9891 (by at least 6.68%), and a larger average area under the precision-recall curve 0.7907 (by at least 23.2%) than competitive methods. The results of our well-designed ablation and interpretability experiments and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis demonstrate X-LDA's robustness, learnability, predictability, and interpretability. The applicability of X-LDA is also demonstrated through a case study involving the investigation of associated lncRNAs in prostate cancer, colorectal cancer, and breast cancer.

摘要

疾病相关长链非编码RNA(lncRNA)的鉴定有助于揭示基因表达调控和表观遗传特征的复杂性。计算方法为探索lncRNA与疾病的关联(LDA)提供了一种经济高效的手段。然而,这些方法往往缺乏可解释性,使得它们的预测结果难以令生物和医学研究人员信服。我们提出了一种基于图块卷积和集成梯度的可解释且知识驱动的异构图学习框架,用于预测LDA,并为其预测结果提供直观解释,称为X-LDA。异构图是LDA预测的基础,我们构建了知识驱动的异构图,包括从生物学实验中得出的LDA、基于基因序列的lncRNA相似性以及基于疾病分类构建的疾病相似性。为了整合各种生物学前提并促进可解释性,我们定义了九种不同的图块类型,它们封装了lncRNA - 疾病节点对中的基本拓扑关系。X-LDA旨在采用参数共享和多卷积核,分别把握图块的共同和多个视角。这种方法最终将各种语义信息融合到上下文嵌入中。这些事后解释依赖于图块特征和集成梯度,揭示了驱动预测的潜在因素。对从数据库和文献中整理的数据集进行的交叉验证实验表明,与三类九种现有最先进方法相比,X-LDA具有卓越的性能。X-LDA在接收器操作曲线下的平均面积更大,为0.9891(至少提高6.68%),在精确召回曲线下的平均面积更大,为0.7907(至少提高23.2%)。我们精心设计的消融实验、可解释性实验以及京都基因与基因组百科全书(KEGG)富集分析的结果证明了X-LDA的稳健性、可学习性、可预测性和可解释性。通过对前列腺癌、结直肠癌和乳腺癌中相关lncRNA的调查案例研究,也证明了X-LDA的适用性。

相似文献

1
X-LDA: An interpretable and knowledge-informed heterogeneous graph learning framework for LncRNA-disease association prediction.X-LDA:一种用于lncRNA-疾病关联预测的可解释且基于知识的异构图学习框架。
Comput Biol Med. 2023 Oct 27;167:107634. doi: 10.1016/j.compbiomed.2023.107634.
2
Multi-task prediction-based graph contrastive learning for inferring the relationship among lncRNAs, miRNAs and diseases.基于多任务预测的图对比学习推断 lncRNAs、miRNAs 和疾病之间的关系。
Brief Bioinform. 2023 Sep 20;24(5). doi: 10.1093/bib/bbad276.
3
Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression.通过基于图回归的统一框架预测二元、离散和连续的长链非编码RNA-疾病关联。
BMC Med Genomics. 2017 Dec 21;10(Suppl 4):65. doi: 10.1186/s12920-017-0305-y.
4
Multi-view contrastive heterogeneous graph attention network for lncRNA-disease association prediction.用于长链非编码RNA-疾病关联预测的多视图对比异构图注意力网络
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac548.
5
gGATLDA: lncRNA-disease association prediction based on graph-level graph attention network.基于图级图注意力网络的 lncRNA-疾病关联预测
BMC Bioinformatics. 2022 Jan 4;23(1):11. doi: 10.1186/s12859-021-04548-z.
6
ACLNDA: an asymmetric graph contrastive learning framework for predicting noncoding RNA-disease associations in heterogeneous graphs.ACLNDA:一种用于在异质图中预测非编码 RNA-疾病关联的非对称图对比学习框架。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae533.
7
Graph Convolutional Network and Convolutional Neural Network Based Method for Predicting lncRNA-Disease Associations.基于图卷积网络和卷积神经网络的 lncRNA-疾病关联预测方法。
Cells. 2019 Aug 30;8(9):1012. doi: 10.3390/cells8091012.
8
LDA-VGHB: identifying potential lncRNA-disease associations with singular value decomposition, variational graph auto-encoder and heterogeneous Newton boosting machine.LDA-VGHB:基于奇异值分解、变分图自动编码器和异质牛顿提升机识别潜在的 lncRNA-疾病关联。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad466.
9
LncRNA-disease association identification using graph auto-encoder and learning to rank.基于图自动编码器和排序学习的长链非编码RNA-疾病关联识别
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac539.
10
A lncRNA-disease association prediction tool development based on bridge heterogeneous information network via graph representation learning for family medicine and primary care.一种基于桥梁异构信息网络并通过图表示学习的用于家庭医学和初级保健的lncRNA-疾病关联预测工具开发。
Front Genet. 2023 May 18;14:1084482. doi: 10.3389/fgene.2023.1084482. eCollection 2023.

引用本文的文献

1
FR-BINN: Biologically Informed Neural Networks for Enhanced Biomarker Discovery and Pathway Analysis.FR-BINN:用于增强生物标志物发现和通路分析的生物信息神经网络。
Int J Mol Sci. 2025 Jul 11;26(14):6670. doi: 10.3390/ijms26146670.
2
Sleep traits and physical activity mediate the causal association between depression and age-related diseases.睡眠特征和身体活动介导了抑郁症与年龄相关疾病之间的因果关联。
Eur Arch Psychiatry Clin Neurosci. 2025 Jul 25. doi: 10.1007/s00406-025-02068-y.
3
Predicting lncRNA and disease associations with graph autoencoder and noise robust gradient boosting.
使用图自动编码器和噪声鲁棒梯度提升预测长链非编码RNA与疾病的关联
Sci Rep. 2025 May 31;15(1):19178. doi: 10.1038/s41598-025-03269-0.
4
The complexities of cell death mechanisms: a new perspective in systemic sclerosis therapy.细胞死亡机制的复杂性:系统性硬化症治疗的新视角。
Apoptosis. 2025 Apr;30(3-4):636-651. doi: 10.1007/s10495-025-02082-4. Epub 2025 Feb 9.