• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过异构信息网络中的嵌入学习HOPE预测长链非编码RNA-疾病关联

Prediction of lncRNA-disease associations via an embedding learning HOPE in heterogeneous information networks.

作者信息

Zhou Ji-Ren, You Zhu-Hong, Cheng Li, Ji Bo-Ya

机构信息

The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.

University of Chinese Academy of Sciences, Beijing 100049, China.

出版信息

Mol Ther Nucleic Acids. 2020 Nov 4;23:277-285. doi: 10.1016/j.omtn.2020.10.040. eCollection 2021 Mar 5.

DOI:10.1016/j.omtn.2020.10.040
PMID:33425486
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7773765/
Abstract

Uncovering additional long non-coding RNA (lncRNA)-disease associations has become increasingly important for developing treatments for complex human diseases. Identification of lncRNA biomarkers and lncRNA-disease associations is central to diagnoses and treatment. However, traditional experimental methods are expensive and time-consuming. Enormous amounts of data present in public biological databases are available for computational methods used to predict lncRNA-disease associations. In this study, we propose a novel computational method to predict lncRNA-disease associations. More specifically, a heterogeneous network is first constructed by integrating the associations among microRNA (miRNA), lncRNA, protein, drug, and disease, Second, high-order proximity preserved embedding (HOPE) was used to embed nodes into a network. Finally, the rotation forest classifier was adopted to train the prediction model. In the 5-fold cross-validation experiment, the area under the curve (AUC) of our method achieved 0.8328 ± 0.0236. We compare it with the other four classifiers, in which the proposed method remarkably outperformed other comparison methods. Otherwise, we constructed three case studies for three excess death rate cancers, respectively. The results show that 9 (lung cancer, gastric cancer, and hepatocellular carcinomas) out of the top 15 predicted disease-related lncRNAs were confirmed by our method. In conclusion, our method could predict the unknown lncRNA-disease associations effectively.

摘要

发现更多长链非编码RNA(lncRNA)与疾病的关联对于开发复杂人类疾病的治疗方法变得越来越重要。lncRNA生物标志物和lncRNA与疾病关联的识别是诊断和治疗的核心。然而,传统的实验方法既昂贵又耗时。公共生物数据库中存在大量数据,可用于预测lncRNA与疾病关联的计算方法。在本研究中,我们提出了一种预测lncRNA与疾病关联的新计算方法。更具体地说,首先通过整合微小RNA(miRNA)、lncRNA、蛋白质、药物和疾病之间的关联构建一个异质网络,其次,使用高阶接近度保留嵌入(HOPE)将节点嵌入网络。最后,采用旋转森林分类器训练预测模型。在5折交叉验证实验中,我们方法的曲线下面积(AUC)达到0.8328±0.0236。我们将其与其他四个分类器进行比较,结果表明所提出的方法明显优于其他比较方法。此外,我们分别针对三种超额死亡率癌症构建了三个案例研究。结果表明,我们的方法证实了前15个预测的疾病相关lncRNA中的9个(肺癌、胃癌和肝细胞癌)。总之,我们的方法可以有效地预测未知的lncRNA与疾病的关联。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/da4529eb3ea7/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/848386c58b8f/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/96e142d3f95d/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/918f82321cc5/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/7afee5724de8/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/70b09462d2b3/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/07a4144c2a7c/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/da4529eb3ea7/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/848386c58b8f/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/96e142d3f95d/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/918f82321cc5/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/7afee5724de8/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/70b09462d2b3/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/07a4144c2a7c/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1700/7773765/da4529eb3ea7/gr6.jpg

相似文献

1
Prediction of lncRNA-disease associations via an embedding learning HOPE in heterogeneous information networks.通过异构信息网络中的嵌入学习HOPE预测长链非编码RNA-疾病关联
Mol Ther Nucleic Acids. 2020 Nov 4;23:277-285. doi: 10.1016/j.omtn.2020.10.040. eCollection 2021 Mar 5.
2
HOPEXGB: A Consensual Model for Predicting miRNA/lncRNA-Disease Associations Using a Heterogeneous Disease-miRNA-lncRNA Information Network.HOPEXGB:一种基于异质疾病-miRNA-lncRNA 信息网络的 miRNA/lncRNA-疾病关联预测共识模型。
J Chem Inf Model. 2024 Apr 8;64(7):2863-2877. doi: 10.1021/acs.jcim.3c00856. Epub 2023 Aug 21.
3
Predicting miRNA-disease association from heterogeneous information network with GraRep embedding model.基于 GraRep 嵌入模型的异质信息网络预测 miRNA-疾病关联
Sci Rep. 2020 Apr 20;10(1):6658. doi: 10.1038/s41598-020-63735-9.
4
HEGANLDA: A Computational Model for Predicting Potential Lncrna-Disease Associations Based On Multiple Heterogeneous Networks.HEGANLDA:一种基于多个异构网络预测潜在长链非编码RNA-疾病关联的计算模型。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):388-398. doi: 10.1109/TCBB.2021.3136886. Epub 2023 Feb 3.
5
Geometric complement heterogeneous information and random forest for predicting lncRNA-disease associations.用于预测长链非编码RNA-疾病关联的几何互补异构信息与随机森林
Front Genet. 2022 Aug 24;13:995532. doi: 10.3389/fgene.2022.995532. eCollection 2022.
6
IPCARF: improving lncRNA-disease association prediction using incremental principal component analysis feature selection and a random forest classifier.IPCARF:利用增量主成分分析特征选择和随机森林分类器改进 lncRNA-疾病关联预测。
BMC Bioinformatics. 2021 Apr 1;22(1):175. doi: 10.1186/s12859-021-04104-9.
7
Predicting lncRNA-disease associations using network topological similarity based on deep mining heterogeneous networks.基于深度挖掘异质网络的网络拓扑相似性预测 lncRNA-疾病关联。
Math Biosci. 2019 Sep;315:108229. doi: 10.1016/j.mbs.2019.108229. Epub 2019 Jul 16.
8
Predicting LncRNA-Disease Association by a Random Walk With Restart on Multiplex and Heterogeneous Networks.基于多重异构网络上带重启的随机游走预测长链非编码RNA与疾病的关联
Front Genet. 2021 Aug 19;12:712170. doi: 10.3389/fgene.2021.712170. eCollection 2021.
9
Heterogeneous graph neural network for lncRNA-disease association prediction.用于 lncRNA-疾病关联预测的异质图神经网络。
Sci Rep. 2022 Oct 20;12(1):17519. doi: 10.1038/s41598-022-22447-y.
10
A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs.基于 LncRNA-疾病对的直接和间接特征预测潜在 LncRNA-疾病关联的新型计算模型。
BMC Bioinformatics. 2020 Dec 2;21(1):555. doi: 10.1186/s12859-020-03906-7.

引用本文的文献

1
Protein Sequence Analysis landscape: A Systematic Review of Task Types, Databases, Datasets, Word Embeddings Methods, and Language Models.蛋白质序列分析全景:任务类型、数据库、数据集、词嵌入方法和语言模型的系统综述
Database (Oxford). 2025 May 30;2025. doi: 10.1093/database/baaf027.
2
DNA sequence analysis landscape: a comprehensive review of DNA sequence analysis task types, databases, datasets, word embedding methods, and language models.DNA序列分析全景:对DNA序列分析任务类型、数据库、数据集、词嵌入方法和语言模型的全面综述。
Front Med (Lausanne). 2025 Apr 8;12:1503229. doi: 10.3389/fmed.2025.1503229. eCollection 2025.
3

本文引用的文献

1
LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse.LncRNA2Target v2.0:一个综合性的数据库,包含人类和小鼠中 lncRNA 的靶基因。
Nucleic Acids Res. 2019 Jan 8;47(D1):D140-D144. doi: 10.1093/nar/gky1051.
2
HMDD v3.0: a database for experimentally supported human microRNA-disease associations.HMDD v3.0:一个实验支持的人类 microRNA-疾病关联数据库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1013-D1017. doi: 10.1093/nar/gky1010.
3
The Comparative Toxicogenomics Database: update 2019.比较毒理学基因组学数据库:2019 年更新。
Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.
从湿实验室到人工智能的转变:对CRISPR中人工智能预测因子的系统综述
J Transl Med. 2025 Feb 4;23(1):153. doi: 10.1186/s12967-024-06013-w.
4
RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.RNA序列分析全景:任务类型、数据库、数据集、词嵌入方法及语言模型的全面综述
Heliyon. 2025 Jan 6;11(2):e41488. doi: 10.1016/j.heliyon.2024.e41488. eCollection 2025 Jan 30.
5
A Novel Framework for Data Assessment That Uses Edge Technology to Improve the Detection of Communicable Diseases.一种利用边缘技术改进传染病检测的数据评估新框架。
Diagnostics (Basel). 2024 May 30;14(11):1148. doi: 10.3390/diagnostics14111148.
6
DDK-Linker: a network-based strategy identifies disease signals by linking high-throughput omics datasets to disease knowledge.DDK-Linker:一种基于网络的策略,通过将高通量组学数据集与疾病知识联系起来,识别疾病信号。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae111.
7
Multi-Omics Mining of lncRNAs with Biological and Clinical Relevance in Cancer.癌症中具有生物学和临床相关性的 lncRNAs 的多组学挖掘。
Int J Mol Sci. 2023 Nov 22;24(23):16600. doi: 10.3390/ijms242316600.
8
Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review.使用基因表达数据进行癌症分类的机器学习方法:综述
Bioengineering (Basel). 2023 Jan 28;10(2):173. doi: 10.3390/bioengineering10020173.
9
SCCPMD: Probability matrix decomposition method subject to corrected similarity constraints for inferring long non-coding RNA-disease associations.SCCPMD:基于校正相似性约束的概率矩阵分解方法用于推断长链非编码RNA与疾病的关联
Front Microbiol. 2023 Jan 11;13:1093615. doi: 10.3389/fmicb.2022.1093615. eCollection 2022.
10
Geometric complement heterogeneous information and random forest for predicting lncRNA-disease associations.用于预测长链非编码RNA-疾病关联的几何互补异构信息与随机森林
Front Genet. 2022 Aug 24;13:995532. doi: 10.3389/fgene.2022.995532. eCollection 2022.
Nucleic Acids Res. 2019 Jan 8;47(D1):D948-D954. doi: 10.1093/nar/gky868.
4
miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions.miRTarBase 更新 2018:一个经过实验验证的 microRNA-靶标相互作用的资源库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D296-D302. doi: 10.1093/nar/gkx1067.
5
DrugBank 5.0: a major update to the DrugBank database for 2018.DrugBank 5.0:2018 年 DrugBank 数据库的重大更新。
Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082. doi: 10.1093/nar/gkx1037.
6
lncRNASNP2: an updated database of functional SNPs and mutations in human and mouse lncRNAs.lncRNASNP2:人源和鼠源长非编码 RNA 中功能 SNP 和突变的更新数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D276-D280. doi: 10.1093/nar/gkx1004.
7
PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction.PBMDA:一种用于miRNA-疾病关联预测的新型有效基于路径的计算模型。
PLoS Comput Biol. 2017 Mar 24;13(3):e1005455. doi: 10.1371/journal.pcbi.1005455. eCollection 2017 Mar.
8
DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants.DisGeNET:一个整合人类疾病相关基因和变异信息的综合平台。
Nucleic Acids Res. 2017 Jan 4;45(D1):D833-D839. doi: 10.1093/nar/gkw943. Epub 2016 Oct 19.
9
The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible.2017年的STRING数据库:质量可控的蛋白质-蛋白质相互作用网络,广泛可用。
Nucleic Acids Res. 2017 Jan 4;45(D1):D362-D368. doi: 10.1093/nar/gkw937. Epub 2016 Oct 18.
10
Long noncoding RNA NEAT1 is an unfavorable prognostic factor and regulates migration and invasion in gastric cancer.长链非编码RNA NEAT1是一种不良预后因素,并调控胃癌的迁移和侵袭。
J Cancer Res Clin Oncol. 2016 Jul;142(7):1571-9. doi: 10.1007/s00432-016-2152-1. Epub 2016 Apr 19.