
A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest.

Authors

Guo Zhen-Hao, You Zhu-Hong, Wang Yan-Bin, Yi Hai-Cheng, Chen Zhan-Heng

Affiliations

Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China; University of Chinese Academy of Sciences, Beijing 100049, China.

Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.

Publication information

iScience. 2019 Sep 27;19:786-795. doi: 10.1016/j.isci.2019.08.030. Epub 2019 Aug 23.

DOI:10.1016/j.isci.2019.08.030
PMID:31494494
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6733997/
Abstract

Long non-coding RNAs (lncRNAs) play critical roles in the occurrence and development of various diseases. Determining lncRNA-disease associations would therefore provide new insights into disease pathogenesis, diagnosis, and gene therapy. Because traditional experimental approaches struggle to detect potential human lncRNA-disease associations in the vast amount of biological data, computational methods could be of significant value. In this paper, we propose a novel computational method, LDASR, that identifies associations between lncRNAs and diseases by analyzing known lncRNA-disease associations. First, the feature vectors of the lncRNA-disease pairs are obtained by integrating lncRNA Gaussian interaction profile kernel similarity, disease semantic similarity, and disease Gaussian interaction profile kernel similarity. Second, an autoencoder neural network is employed to reduce the feature dimension and obtain an optimal feature subspace from the original feature set. Finally, Rotation Forest is used to predict lncRNA-disease associations. The proposed method achieves excellent performance, with an AUC of 0.9502 in leave-one-out cross-validation (LOOCV) and 0.9428 in 5-fold cross-validation, significantly outperforming previous methods. Moreover, two kinds of case studies on identifying lncRNAs associated with colorectal cancer and glioma further demonstrate the capability of LDASR to identify novel lncRNA-disease associations. These promising experimental results show that LDASR can be a valuable addition to future biomedical research.
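
The first step of the abstract relies on Gaussian interaction profile (GIP) kernel similarity. As a rough illustration only, the sketch below computes GIP similarity from a binary association matrix using the commonly used definition, K(i, j) = exp(-γ‖IP(i) − IP(j)‖²) with γ = γ′ divided by the mean squared norm of the profiles. The abstract does not state the exact formula or bandwidth used by LDASR, so the default `gamma_prime=1.0`, the toy matrix, and the hypothetical `pair_features` helper (which simply concatenates one lncRNA similarity row with one disease similarity row, and omits disease semantic similarity) are all assumptions, not the authors' implementation.

```python
import math


def gip_kernel(profiles, gamma_prime=1.0):
    """GIP kernel similarity between the rows of a binary association matrix.

    Uses the commonly cited definition; gamma_prime=1.0 is an assumed default.
    """
    n = len(profiles)
    # The mean squared norm of the interaction profiles normalizes the bandwidth.
    mean_sq_norm = sum(sum(v * v for v in row) for row in profiles) / n
    if mean_sq_norm == 0:
        raise ValueError("all interaction profiles are empty")
    gamma = gamma_prime / mean_sq_norm
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            sq_dist = sum((a - b) ** 2 for a, b in zip(profiles[i], profiles[j]))
            sim[i][j] = math.exp(-gamma * sq_dist)
    return sim


def transpose(matrix):
    """Swap rows and columns so that diseases become the rows."""
    return [list(col) for col in zip(*matrix)]


# Made-up toy data: rows are lncRNAs, columns are diseases, 1 = known association.
assoc = [
    [1, 0, 1],
    [1, 0, 0],
    [0, 1, 0],
]

lnc_sim = gip_kernel(assoc)             # lncRNA-by-lncRNA similarity
dis_sim = gip_kernel(transpose(assoc))  # disease-by-disease similarity


def pair_features(lnc_idx, dis_idx):
    """Hypothetical pair descriptor: lncRNA row followed by disease row."""
    return lnc_sim[lnc_idx] + dis_sim[dis_idx]
```

In a full pipeline of the kind the abstract outlines, vectors like these would then be compressed by an autoencoder and passed to a classifier; those stages are omitted here because the abstract gives no architectural details.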


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d21/6733997/67c76ad444bc/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d21/6733997/b6446914affe/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d21/6733997/d38719980972/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d21/6733997/5be02a4f72f4/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d21/6733997/e4f616d87c04/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d21/6733997/10806cc6f2fe/gr4.jpg

Similar articles

1
A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest.
iScience. 2019 Sep 27;19:786-795. doi: 10.1016/j.isci.2019.08.030. Epub 2019 Aug 23.
2
IPCARF: improving lncRNA-disease association prediction using incremental principal component analysis feature selection and a random forest classifier.
BMC Bioinformatics. 2021 Apr 1;22(1):175. doi: 10.1186/s12859-021-04104-9.
3
LDNFSGB: prediction of long non-coding rna and disease association using network feature similarity and gradient boosting.
BMC Bioinformatics. 2020 Sep 3;21(1):377. doi: 10.1186/s12859-020-03721-0.
4
A random forest based computational model for predicting novel lncRNA-disease associations.
BMC Bioinformatics. 2020 Mar 27;21(1):126. doi: 10.1186/s12859-020-3458-1.
5
LDAEXC: LncRNA-Disease Associations Prediction with Deep Autoencoder and XGBoost Classifier.
Interdiscip Sci. 2023 Sep;15(3):439-451. doi: 10.1007/s12539-023-00573-z. Epub 2023 Jun 12.
6
Predicting potential lncRNA biomarkers for lung cancer and neuroblastoma based on an ensemble of a deep neural network and LightGBM.
Front Genet. 2023 Aug 16;14:1238095. doi: 10.3389/fgene.2023.1238095. eCollection 2023.
7
A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs.
BMC Bioinformatics. 2020 Dec 2;21(1):555. doi: 10.1186/s12859-020-03906-7.
8
BPLLDA: Predicting lncRNA-Disease Associations Based on Simple Paths With Limited Lengths in a Heterogeneous Network.
Front Genet. 2018 Oct 16;9:411. doi: 10.3389/fgene.2018.00411. eCollection 2018.
9
Inferring Latent Disease-lncRNA Associations by Faster Matrix Completion on a Heterogeneous Network.
Front Genet. 2019 Sep 4;10:769. doi: 10.3389/fgene.2019.00769. eCollection 2019.
10
A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations.
BMC Bioinformatics. 2019 Dec 3;20(1):626. doi: 10.1186/s12859-019-3216-4.

Cited by

1
LDA-SCGB: inferring lncRNA-disease associations based on condensed gradient boosting.
BMC Bioinformatics. 2025 Jul 22;26(1):190. doi: 10.1186/s12859-025-06169-2.
2
MLWNNR: LncRNA-Disease Association Prediction with Multi-Kernel Learning-Driven Weighted Nuclear Norm Regularization.
Interdiscip Sci. 2025 Jun 23. doi: 10.1007/s12539-025-00717-3.
3
Predicting lncRNA and disease associations with graph autoencoder and noise robust gradient boosting.
Sci Rep. 2025 May 31;15(1):19178. doi: 10.1038/s41598-025-03269-0.
4
Predicting noncoding RNA and disease associations using multigraph contrastive learning.
Sci Rep. 2025 Jan 2;15(1):230. doi: 10.1038/s41598-024-81862-5.
5
A multichannel graph neural network based on multisimilarity modality hypergraph contrastive learning for predicting unknown types of cancer biomarkers.
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae575.
6
GEnDDn: An lncRNA-Disease Association Identification Framework Based on Dual-Net Neural Architecture and Deep Neural Network.
Interdiscip Sci. 2024 Jun;16(2):418-438. doi: 10.1007/s12539-024-00619-w. Epub 2024 May 11.
7
Finding potential lncRNA-disease associations using a boosting-based ensemble learning model.
Front Genet. 2024 Mar 1;15:1356205. doi: 10.3389/fgene.2024.1356205. eCollection 2024.
8
LDA-VGHB: identifying potential lncRNA-disease associations with singular value decomposition, variational graph auto-encoder and heterogeneous Newton boosting machine.
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad466.
9
Predicting lncRNA-disease associations based on heterogeneous graph convolutional generative adversarial network.
PLoS Comput Biol. 2023 Nov 29;19(11):e1011634. doi: 10.1371/journal.pcbi.1011634. eCollection 2023 Nov.
10
Screening potential lncRNA biomarkers for breast cancer and colorectal cancer combining random walk and logistic matrix factorization.
Front Genet. 2023 Jan 20;13:1023615. doi: 10.3389/fgene.2022.1023615. eCollection 2022.

References

1
CRlncRNA: a manually curated database of cancer-related long non-coding RNAs with experimental proof of functions on clinicopathological and molecular features.
BMC Med Genomics. 2018 Dec 31;11(Suppl 6):114. doi: 10.1186/s12920-018-0430-2.
2
Prediction of lncRNA-disease associations based on inductive matrix completion.
Bioinformatics. 2018 Oct 1;34(19):3357-3364. doi: 10.1093/bioinformatics/bty327.
3
LncRNA AB073614 induces epithelial-mesenchymal transition of colorectal cancer cells via regulating the JAK/STAT3 pathway.
Cancer Biomark. 2018;21(4):849-858. doi: 10.3233/CBM-170780.
4
Long Noncoding RNA Discovery in Cardiovascular Disease: Decoding Form to Function.
Circ Res. 2018 Jan 5;122(1):155-166. doi: 10.1161/CIRCRESAHA.117.311802.
5
Matrix factorization-based data fusion for the prediction of lncRNA-disease associations.
Bioinformatics. 2018 May 1;34(9):1529-1537. doi: 10.1093/bioinformatics/btx794.
6
MNDR v2.0: an updated resource of ncRNA-disease associations in mammals.
Nucleic Acids Res. 2018 Jan 4;46(D1):D371-D374. doi: 10.1093/nar/gkx1025.
7
Emerging mechanisms of long noncoding RNA function during normal and malignant hematopoiesis.
Blood. 2017 Nov 2;130(18):1965-1975. doi: 10.1182/blood-2017-06-788695. Epub 2017 Sep 19.
8
LncRNA AB073614 regulates proliferation and metastasis of colorectal cancer cells via the PI3K/AKT signaling pathway.
Biomed Pharmacother. 2017 Sep;93:1230-1237. doi: 10.1016/j.biopha.2017.07.024. Epub 2017 Jul 20.
9
The long non-coding RNA SNHG3 functions as a competing endogenous RNA to promote malignant development of colorectal cancer.
Oncol Rep. 2017 Sep;38(3):1402-1410. doi: 10.3892/or.2017.5837. Epub 2017 Jul 18.
10
Linc00152 promotes malignant progression of glioma stem cells by regulating miR-103a-3p/FEZF1/CDC25A pathway.
Mol Cancer. 2017 Jun 26;16(1):110. doi: 10.1186/s12943-017-0677-9.