• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合随机游走和逻辑矩阵分解筛选乳腺癌和结直肠癌的潜在长链非编码RNA生物标志物

Screening potential lncRNA biomarkers for breast cancer and colorectal cancer combining random walk and logistic matrix factorization.

作者信息

Li Shijun, Chang Miaomiao, Tong Ling, Wang Yuehua, Wang Meng, Wang Fang

机构信息

Department of Pathology, Chifeng Municipal Hospital, Chifeng, China.

出版信息

Front Genet. 2023 Jan 20;13:1023615. doi: 10.3389/fgene.2022.1023615. eCollection 2022.

DOI:10.3389/fgene.2022.1023615
PMID:36744179
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9895102/
Abstract

Breast cancer and colorectal cancer are two of the most common malignant tumors worldwide. They cause the leading causes of cancer mortality. Many researches have demonstrated that long noncoding RNAs (lncRNAs) have close linkages with the occurrence and development of the two cancers. Therefore, it is essential to design an effective way to identify potential lncRNA biomarkers for them. In this study, we developed a computational method (LDA-RWLMF) by integrating random walk with restart and Logistic Matrix Factorization to investigate the roles of lncRNA biomarkers in the prognosis and diagnosis of the two cancers. We first fuse disease semantic and Gaussian association profile similarities and lncRNA functional and Gaussian association profile similarities. Second, we design a negative selection algorithm to extract negative LncRNA-Disease Associations (LDA) based on random walk. Third, we develop a logistic matrix factorization model to predict possible LDAs. We compare our proposed LDA-RWLMF method with four classical LDA prediction methods, that is, LNCSIM1, LNCSIM2, ILNCSIM, and IDSSIM. The results from 5-fold cross validation on the MNDR dataset show that LDA-RWLMF computes the best AUC value of 0.9312, outperforming the above four LDA prediction methods. Finally, we rank all lncRNA biomarkers for the two cancers after determining the performance of LDA-RWLMF, respectively. We find that 48 and 50 lncRNAs have the highest association scores with breast cancer and colorectal cancer among all lncRNAs known to associate with them on the MNDR dataset, respectively. We predict that lncRNAs HULC and HAR1A could be separately potential biomarkers for breast cancer and colorectal cancer and need to biomedical experimental validation.

摘要

乳腺癌和结直肠癌是全球最常见的两种恶性肿瘤。它们是癌症死亡的主要原因。许多研究表明,长链非编码RNA(lncRNAs)与这两种癌症的发生发展密切相关。因此,设计一种有效的方法来识别它们潜在的lncRNA生物标志物至关重要。在本研究中,我们通过整合带重启的随机游走和逻辑矩阵分解开发了一种计算方法(LDA - RWLMF),以研究lncRNA生物标志物在这两种癌症的预后和诊断中的作用。我们首先融合疾病语义和高斯关联谱相似性以及lncRNA功能和高斯关联谱相似性。其次,我们设计了一种负选择算法,基于随机游走提取负向的长链非编码RNA - 疾病关联(LDA)。第三,我们开发了一个逻辑矩阵分解模型来预测可能的LDA。我们将我们提出的LDA - RWLMF方法与四种经典的LDA预测方法,即LNCSIM1、LNCSIM2、ILNCSIM和IDSSIM进行比较。在MNDR数据集上进行的5折交叉验证结果表明,LDA - RWLMF计算出的最佳AUC值为0.9312,优于上述四种LDA预测方法。最后,在确定LDA - RWLMF的性能后,我们分别对这两种癌症的所有lncRNA生物标志物进行排名。我们发现,在MNDR数据集上已知与乳腺癌和结直肠癌相关的所有lncRNA中,分别有48个和50个lncRNA与它们的关联得分最高。我们预测lncRNAs HULC和HAR1A可能分别是乳腺癌和结直肠癌的潜在生物标志物,需要生物医学实验验证。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/fb7970fd8ec0/fgene-13-1023615-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/e4fadc60d81b/fgene-13-1023615-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/1961b57c0725/fgene-13-1023615-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/df3a8a4ee0de/fgene-13-1023615-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/fb7970fd8ec0/fgene-13-1023615-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/e4fadc60d81b/fgene-13-1023615-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/1961b57c0725/fgene-13-1023615-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/df3a8a4ee0de/fgene-13-1023615-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e3a/9895102/fb7970fd8ec0/fgene-13-1023615-g004.jpg

相似文献

1
Screening potential lncRNA biomarkers for breast cancer and colorectal cancer combining random walk and logistic matrix factorization.结合随机游走和逻辑矩阵分解筛选乳腺癌和结直肠癌的潜在长链非编码RNA生物标志物
Front Genet. 2023 Jan 20;13:1023615. doi: 10.3389/fgene.2022.1023615. eCollection 2022.
2
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method.IDSSIM:一种基于改进疾病语义相似性方法的 lncRNA 功能相似性计算模型。
BMC Bioinformatics. 2020 Jul 31;21(1):339. doi: 10.1186/s12859-020-03699-9.
3
LDA-VGHB: identifying potential lncRNA-disease associations with singular value decomposition, variational graph auto-encoder and heterogeneous Newton boosting machine.LDA-VGHB:基于奇异值分解、变分图自动编码器和异质牛顿提升机识别潜在的 lncRNA-疾病关联。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad466.
4
Finding Lung-Cancer-Related lncRNAs Based on Laplacian Regularized Least Squares With Unbalanced Bi-Random Walk.基于拉普拉斯正则化最小二乘与不平衡双随机游走寻找肺癌相关长链非编码RNA
Front Genet. 2022 Jul 22;13:933009. doi: 10.3389/fgene.2022.933009. eCollection 2022.
5
Detecting lncRNA-Cancer Associations by Combining miRNAs, Genes, and Prognosis With Matrix Factorization.通过结合微小RNA、基因和预后信息利用矩阵分解法检测长链非编码RNA与癌症的关联
Front Genet. 2021 Jun 28;12:639872. doi: 10.3389/fgene.2021.639872. eCollection 2021.
6
GEnDDn: An lncRNA-Disease Association Identification Framework Based on Dual-Net Neural Architecture and Deep Neural Network.GEnDDn:一种基于双网络神经架构和深度神经网络的 lncRNA-疾病关联识别框架。
Interdiscip Sci. 2024 Jun;16(2):418-438. doi: 10.1007/s12539-024-00619-w. Epub 2024 May 11.
7
Predicting potential lncRNA biomarkers for lung cancer and neuroblastoma based on an ensemble of a deep neural network and LightGBM.基于深度神经网络和LightGBM集成模型预测肺癌和神经母细胞瘤的潜在长链非编码RNA生物标志物
Front Genet. 2023 Aug 16;14:1238095. doi: 10.3389/fgene.2023.1238095. eCollection 2023.
8
A random forest based computational model for predicting novel lncRNA-disease associations.基于随机森林的计算模型预测新型 lncRNA-疾病关联。
BMC Bioinformatics. 2020 Mar 27;21(1):126. doi: 10.1186/s12859-020-3458-1.
9
DNILMF-LDA: Prediction of lncRNA-Disease Associations by Dual-Network Integrated Logistic Matrix Factorization and Bayesian Optimization.DNILMF-LDA:基于双网络集成逻辑矩阵分解和贝叶斯优化的 lncRNA-疾病关联预测。
Genes (Basel). 2019 Aug 12;10(8):608. doi: 10.3390/genes10080608.
10
LDA-LNSUBRW: lncRNA-Disease Association Prediction Based on Linear Neighborhood Similarity and Unbalanced bi-Random Walk.LDA-LNSUBRW:基于线性近邻相似性和不平衡双随机游走的 lncRNA-疾病关联预测。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):989-997. doi: 10.1109/TCBB.2020.3020595. Epub 2022 Apr 1.

引用本文的文献

1
Role of autophagy‑modulating long non‑coding RNAs in tumor radioresistance (Review).自噬调节性长链非编码RNA在肿瘤放射抗性中的作用(综述)
Oncol Rep. 2025 Nov;54(5). doi: 10.3892/or.2025.8975. Epub 2025 Aug 24.

本文引用的文献

1
Finding Lung-Cancer-Related lncRNAs Based on Laplacian Regularized Least Squares With Unbalanced Bi-Random Walk.基于拉普拉斯正则化最小二乘与不平衡双随机游走寻找肺癌相关长链非编码RNA
Front Genet. 2022 Jul 22;13:933009. doi: 10.3389/fgene.2022.933009. eCollection 2022.
2
A deep learning method for predicting metabolite-disease associations via graph neural network.基于图神经网络的代谢物-疾病关联预测深度学习方法。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac266.
3
Hyb4mC: a hybrid DNA2vec-based model for DNA N4-methylcytosine sites prediction.
Hyb4mC:一种基于 DNA2vec 的混合模型,用于预测 DNA N4-甲基胞嘧啶位点。
BMC Bioinformatics. 2022 Jun 29;23(1):258. doi: 10.1186/s12859-022-04789-6.
4
Cell-cell communication inference and analysis in the tumour microenvironments from single-cell transcriptomics: data resources and computational strategies.单细胞转录组学解析肿瘤微环境中的细胞间通讯:数据资源与计算策略。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac234.
5
Retraction Note to: LncRNA BCRT1 promotes breast cancer progression by targeting miR-1303/PTBP3 axis.撤回说明:长链非编码RNA BCRT1通过靶向miR-1303/PTBP3轴促进乳腺癌进展。
Mol Cancer. 2022 Jun 17;21(1):131. doi: 10.1186/s12943-022-01576-y.
6
MAGCNSE: predicting lncRNA-disease associations using multi-view attention graph convolutional network and stacking ensemble model.MAGCNSE:基于多视图注意力图卷积网络和堆叠集成模型预测 lncRNA-疾病关联
BMC Bioinformatics. 2022 May 19;23(1):189. doi: 10.1186/s12859-022-04715-w.
7
Discovery of Potential Therapeutic Drugs for COVID-19 Through Logistic Matrix Factorization With Kernel Diffusion.通过带核扩散的逻辑矩阵分解发现新冠病毒潜在治疗药物
Front Microbiol. 2022 Feb 28;13:740382. doi: 10.3389/fmicb.2022.740382. eCollection 2022.
8
Long non‑coding RNA PCAT1 sponges miR‑134‑3p to regulate PITX2 expression in breast cancer.长链非编码 RNA PCAT1 通过海绵吸附 miR-134-3p 调控乳腺癌中 PITX2 的表达。
Mol Med Rep. 2022 Mar;25(3). doi: 10.3892/mmr.2022.12591. Epub 2022 Jan 11.
9
EnANNDeep: An Ensemble-based lncRNA-protein Interaction Prediction Framework with Adaptive k-Nearest Neighbor Classifier and Deep Models.EnANNDeep:基于集成学习的 lncRNA-蛋白质相互作用预测框架,采用自适应 k-最近邻分类器和深度模型。
Interdiscip Sci. 2022 Mar;14(1):209-232. doi: 10.1007/s12539-021-00483-y. Epub 2022 Jan 10.
10
VDA-RWLRLS: An anti-SARS-CoV-2 drug prioritizing framework combining an unbalanced bi-random walk and Laplacian regularized least squares.VDA-RWLRLS:一种结合非平衡双向随机游走和拉普拉斯正则化最小二乘法的抗SARS-CoV-2药物优先级框架。
Comput Biol Med. 2022 Jan;140:105119. doi: 10.1016/j.compbiomed.2021.105119. Epub 2021 Dec 7.