• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于矩阵分解的数据融合方法用于 lncRNA-疾病关联预测。

Matrix factorization-based data fusion for the prediction of lncRNA-disease associations.

机构信息

College of Computer and Information Science, Southwest University, Chongqing 400715, China.

Department of Computer Science, George Mason University, Farifax, VA 22030, USA.

出版信息

Bioinformatics. 2018 May 1;34(9):1529-1537. doi: 10.1093/bioinformatics/btx794.

DOI:10.1093/bioinformatics/btx794
PMID:29228285
Abstract

MOTIVATION

Long non-coding RNAs (lncRNAs) play crucial roles in complex disease diagnosis, prognosis, prevention and treatment, but only a small portion of lncRNA-disease associations have been experimentally verified. Various computational models have been proposed to identify lncRNA-disease associations by integrating heterogeneous data sources. However, existing models generally ignore the intrinsic structure of data sources or treat them as equally relevant, while they may not be.

RESULTS

To accurately identify lncRNA-disease associations, we propose a Matrix Factorization based LncRNA-Disease Association prediction model (MFLDA in short). MFLDA decomposes data matrices of heterogeneous data sources into low-rank matrices via matrix tri-factorization to explore and exploit their intrinsic and shared structure. MFLDA can select and integrate the data sources by assigning different weights to them. An iterative solution is further introduced to simultaneously optimize the weights and low-rank matrices. Next, MFLDA uses the optimized low-rank matrices to reconstruct the lncRNA-disease association matrix and thus to identify potential associations. In 5-fold cross validation experiments to identify verified lncRNA-disease associations, MFLDA achieves an area under the receiver operating characteristic curve (AUC) of 0.7408, at least 3% higher than those given by state-of-the-art data fusion based computational models. An empirical study on identifying masked lncRNA-disease associations again shows that MFLDA can identify potential associations more accurately than competing models. A case study on identifying lncRNAs associated with breast, lung and stomach cancers show that 38 out of 45 (84%) associations predicted by MFLDA are supported by recent biomedical literature and further proves the capability of MFLDA in identifying novel lncRNA-disease associations. MFLDA is a general data fusion framework, and as such it can be adopted to predict associations between other biological entities.

AVAILABILITY AND IMPLEMENTATION

The source code for MFLDA is available at: http://mlda.swu.edu.cn/codes.php? name = MFLDA.

CONTACT

gxyu@swu.edu.cn.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

长非编码 RNA(lncRNA)在复杂疾病的诊断、预后、预防和治疗中起着至关重要的作用,但只有一小部分 lncRNA-疾病关联已通过实验验证。各种计算模型已经被提出,通过整合异构数据源来识别 lncRNA-疾病关联。然而,现有的模型通常忽略了数据源的内在结构,或者将它们视为同等相关,而实际上它们可能并不相关。

结果

为了准确识别 lncRNA-疾病关联,我们提出了一种基于矩阵分解的 lncRNA-疾病关联预测模型(简称 MFLDA)。MFLDA 通过矩阵三因子分解将异构数据源的数据矩阵分解为低秩矩阵,以探索和利用它们的内在和共享结构。MFLDA 可以通过为它们分配不同的权重来选择和整合数据源。进一步引入了一种迭代解决方案,以同时优化权重和低秩矩阵。接下来,MFLDA 使用优化后的低秩矩阵来重构 lncRNA-疾病关联矩阵,从而识别潜在的关联。在 5 折交叉验证实验中,用于识别已验证的 lncRNA-疾病关联,MFLDA 的接收者操作特征曲线(AUC)下面积达到 0.7408,至少比基于最先进的数据融合的计算模型高 3%。对识别掩蔽 lncRNA-疾病关联的实证研究再次表明,MFLDA 能够比竞争模型更准确地识别潜在的关联。对识别与乳腺癌、肺癌和胃癌相关的 lncRNA 的案例研究表明,MFLDA 预测的 45 个(84%)关联中的 38 个得到了最近生物医学文献的支持,进一步证明了 MFLDA 识别新型 lncRNA-疾病关联的能力。MFLDA 是一个通用的数据融合框架,因此可以应用于预测其他生物实体之间的关联。

可用性和实现

MFLDA 的源代码可在 http://mlda.swu.edu.cn/codes.php? name = MFLDA 获得。

联系人

gxyu@swu.edu.cn。

补充信息

补充数据可在《生物信息学》在线获得。

相似文献

1
Matrix factorization-based data fusion for the prediction of lncRNA-disease associations.基于矩阵分解的数据融合方法用于 lncRNA-疾病关联预测。
Bioinformatics. 2018 May 1;34(9):1529-1537. doi: 10.1093/bioinformatics/btx794.
2
Weighted matrix factorization on multi-relational data for LncRNA-disease association prediction.基于多关系数据的加权矩阵分解预测 LncRNA 疾病关联。
Methods. 2020 Feb 15;173:32-43. doi: 10.1016/j.ymeth.2019.06.015. Epub 2019 Jun 18.
3
A random forest based computational model for predicting novel lncRNA-disease associations.基于随机森林的计算模型预测新型 lncRNA-疾病关联。
BMC Bioinformatics. 2020 Mar 27;21(1):126. doi: 10.1186/s12859-020-3458-1.
4
Hierarchical Extension Based on the Boolean Matrix for LncRNADisease Association Prediction.基于布尔矩阵的lncRNA疾病关联预测的层次扩展
Curr Mol Med. 2020;20(6):452-460. doi: 10.2174/1566524019666191119104212.
5
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method.IDSSIM:一种基于改进疾病语义相似性方法的 lncRNA 功能相似性计算模型。
BMC Bioinformatics. 2020 Jul 31;21(1):339. doi: 10.1186/s12859-020-03699-9.
6
LDAP: a web server for lncRNA-disease association prediction.LDAP:lncRNA 疾病关联预测的 Web 服务器。
Bioinformatics. 2017 Feb 1;33(3):458-460. doi: 10.1093/bioinformatics/btw639.
7
A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations.基于新型目标收敛集的重启动随机游走算法预测潜在的 lncRNA-疾病关联
BMC Bioinformatics. 2019 Dec 3;20(1):626. doi: 10.1186/s12859-019-3216-4.
8
LncDisAP: a computation model for LncRNA-disease association prediction based on multiple biological datasets.LncDisAP:基于多个生物数据集的 LncRNA 疾病关联预测计算模型。
BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):582. doi: 10.1186/s12859-019-3081-1.
9
DMFLDA: A Deep Learning Framework for Predicting lncRNA-Disease Associations.DMFLDA:一种用于预测 lncRNA-疾病关联的深度学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2353-2363. doi: 10.1109/TCBB.2020.2983958. Epub 2021 Dec 8.
10
Predicting lncRNA-disease associations using network topological similarity based on deep mining heterogeneous networks.基于深度挖掘异质网络的网络拓扑相似性预测 lncRNA-疾病关联。
Math Biosci. 2019 Sep;315:108229. doi: 10.1016/j.mbs.2019.108229. Epub 2019 Jul 16.

引用本文的文献

1
Decoding potential lncRNA and disease associations through graph representation learning and gradient boosting with histogram.通过基于直方图的图表示学习和梯度提升来解码潜在的长链非编码RNA与疾病的关联。
Sci Rep. 2025 Aug 26;15(1):31407. doi: 10.1038/s41598-025-16177-0.
2
Dual balanced augmented topological noncoding RNA disease triplet association in heterogeneous graphs.异质图中的双平衡增强拓扑非编码RNA疾病三联体关联
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf389.
3
LDA-SCGB: inferring lncRNA-disease associations based on condensed gradient boosting.
LDA-SCGB:基于凝聚梯度提升推断长链非编码RNA与疾病的关联
BMC Bioinformatics. 2025 Jul 22;26(1):190. doi: 10.1186/s12859-025-06169-2.
4
Neighborhood-Regularized Matrix Factorization for lncRNA-Disease Association Identification.用于长链非编码RNA-疾病关联识别的邻域正则化矩阵分解
Int J Mol Sci. 2025 Apr 30;26(9):4283. doi: 10.3390/ijms26094283.
5
Predicting noncoding RNA and disease associations using multigraph contrastive learning.使用多重图对比学习预测非编码RNA与疾病的关联
Sci Rep. 2025 Jan 2;15(1):230. doi: 10.1038/s41598-024-81862-5.
6
Modeling ncRNA Synergistic Regulation in Cancer.癌症中ncRNA协同调控的建模
Methods Mol Biol. 2025;2883:377-402. doi: 10.1007/978-1-0716-4290-0_17.
7
ACLNDA: an asymmetric graph contrastive learning framework for predicting noncoding RNA-disease associations in heterogeneous graphs.ACLNDA:一种用于在异质图中预测非编码 RNA-疾病关联的非对称图对比学习框架。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae533.
8
LDAGM: prediction lncRNA-disease asociations by graph convolutional auto-encoder and multilayer perceptron based on multi-view heterogeneous networks.LDAGM:基于多视图异质网络的图卷积自动编码器和多层感知机预测 lncRNA-疾病关联。
BMC Bioinformatics. 2024 Oct 15;25(1):332. doi: 10.1186/s12859-024-05950-z.
9
Non-Coding RNAs Extended Omnigenic Module of Cancers.癌症的非编码RNA扩展全基因模块
Entropy (Basel). 2024 Jul 27;26(8):640. doi: 10.3390/e26080640.
10
Predicting lncRNA-Disease Associations Based on a Dual-Path Feature Extraction Network with Multiple Sources of Information Integration.基于多源信息整合的双路径特征提取网络预测长链非编码RNA与疾病的关联
ACS Omega. 2024 Jul 30;9(32):35100-35112. doi: 10.1021/acsomega.4c05365. eCollection 2024 Aug 13.