• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LPI-SKMSC:基于分段 k--mer 频率和多空间聚类的长链非编码 RNA-蛋白质相互作用预测。

LPI-SKMSC: Predicting LncRNA-Protein Interactions with Segmented k-mer Frequencies and Multi-space Clustering.

机构信息

School of Electrical Engineering and Automation, Anhui University, Hefei, 230601, China.

School of Computer Science and Technology, Anhui University, Hefei, 230601, China.

出版信息

Interdiscip Sci. 2024 Jun;16(2):378-391. doi: 10.1007/s12539-023-00598-4. Epub 2024 Jan 11.

DOI:10.1007/s12539-023-00598-4
PMID:38206558
Abstract

Long noncoding RNAs (lncRNAs) have significant regulatory roles in gene expression. Interactions with proteins are one of the ways lncRNAs play their roles. Since experiments to determine lncRNA-protein interactions (LPIs) are expensive and time-consuming, many computational methods for predicting LPIs have been proposed as alternatives. In the LPIs prediction problem, there commonly exists the imbalance in the distribution of positive and negative samples. However, there are few existing methods that give specific consideration to this problem. In this paper, we proposed a new clustering-based LPIs prediction method using segmented k-mer frequencies and multi-space clustering (LPI-SKMSC). It was dedicated to handling the imbalance of positive and negative samples. We constructed segmented k-mer frequencies to obtain global and local features of lncRNA and protein sequences. Then, the multi-space clustering was applied to LPI-SKMSC. The convolutional neural network (CNN)-based encoders were used to map different features of a sample to different spaces. It used multiple spaces to jointly constrain the classification of samples. Finally, the distances between the output features of the encoder and the cluster center in each space were calculated. The sum of distances in all spaces was compared with the cluster radius to predict the LPIs. We performed cross-validation on 3 public datasets and LPI-SKMSC showed the best performance compared to other existing methods. Experimental results showed that LPI-SKMSC could predict LPIs more effectively when faced with imbalanced positive and negative samples. In addition, we illustrated that our model was better at uncovering potential lncRNA-protein interaction pairs.

摘要

长链非编码 RNA(lncRNA)在基因表达中具有重要的调节作用。与蛋白质的相互作用是 lncRNA 发挥作用的方式之一。由于确定 lncRNA-蛋白质相互作用(LPI)的实验既昂贵又耗时,因此已经提出了许多用于预测 LPI 的计算方法作为替代方法。在 LPI 预测问题中,阳性和阴性样本的分布通常存在不平衡。然而,现有的很少有方法专门考虑到这个问题。在本文中,我们提出了一种新的基于聚类的 LPI 预测方法,使用分段 k-mer 频率和多空间聚类(LPI-SKMSC)。它专门用于处理正负样本的不平衡问题。我们构建了分段 k-mer 频率,以获得 lncRNA 和蛋白质序列的全局和局部特征。然后,将多空间聚类应用于 LPI-SKMSC。基于卷积神经网络(CNN)的编码器用于将样本的不同特征映射到不同的空间。它使用多个空间来共同约束样本的分类。最后,计算编码器输出特征与每个空间中聚类中心之间的距离。所有空间中的距离总和与聚类半径进行比较,以预测 LPI。我们在 3 个公共数据集上进行了交叉验证,与其他现有方法相比,LPI-SKMSC 表现出了最佳的性能。实验结果表明,当面对不平衡的正负样本时,LPI-SKMSC 可以更有效地预测 LPI。此外,我们还说明了我们的模型更善于发现潜在的 lncRNA-蛋白质相互作用对。

相似文献

1
LPI-SKMSC: Predicting LncRNA-Protein Interactions with Segmented k-mer Frequencies and Multi-space Clustering.LPI-SKMSC:基于分段 k--mer 频率和多空间聚类的长链非编码 RNA-蛋白质相互作用预测。
Interdiscip Sci. 2024 Jun;16(2):378-391. doi: 10.1007/s12539-023-00598-4. Epub 2024 Jan 11.
2
LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification.LPI-HyADBS:一种集成特征选择和分类的 lncRNA-蛋白质相互作用预测的混合框架。
BMC Bioinformatics. 2021 Nov 26;22(1):568. doi: 10.1186/s12859-021-04485-x.
3
EnANNDeep: An Ensemble-based lncRNA-protein Interaction Prediction Framework with Adaptive k-Nearest Neighbor Classifier and Deep Models.EnANNDeep:基于集成学习的 lncRNA-蛋白质相互作用预测框架,采用自适应 k-最近邻分类器和深度模型。
Interdiscip Sci. 2022 Mar;14(1):209-232. doi: 10.1007/s12539-021-00483-y. Epub 2022 Jan 10.
4
LPI-deepGBDT: a multiple-layer deep framework based on gradient boosting decision trees for lncRNA-protein interaction identification.LPI-deepGBDT:基于梯度提升决策树的多层深度框架,用于 lncRNA-蛋白质相互作用识别。
BMC Bioinformatics. 2021 Oct 4;22(1):479. doi: 10.1186/s12859-021-04399-8.
5
Capsule-LPI: a LncRNA-protein interaction predicting tool based on a capsule network.胶囊-LPI:一种基于胶囊网络的 LncRNA-蛋白质相互作用预测工具。
BMC Bioinformatics. 2021 May 13;22(1):246. doi: 10.1186/s12859-021-04171-y.
6
LPI-CNNCP: Prediction of lncRNA-protein interactions by using convolutional neural network with the copy-padding trick.LPI-CNNCP:利用卷积神经网络和复制填充技术预测 lncRNA-蛋白质相互作用。
Anal Biochem. 2020 Jul 15;601:113767. doi: 10.1016/j.ab.2020.113767. Epub 2020 May 23.
7
LPI-CSFFR: Combining serial fusion with feature reuse for predicting LncRNA-protein interactions.LPI-CSFFR:结合序列融合与特征重用进行长链非编码 RNA-蛋白质相互作用预测。
Comput Biol Chem. 2022 Aug;99:107718. doi: 10.1016/j.compbiolchem.2022.107718. Epub 2022 Jun 27.
8
A novel lncRNA-protein interaction prediction method based on deep forest with cascade forest structure.基于级联森林结构的深度森林新型长链非编码 RNA-蛋白质相互作用预测方法。
Sci Rep. 2021 Sep 23;11(1):18881. doi: 10.1038/s41598-021-98277-1.
9
Predicting potential interactions between lncRNAs and proteins via combined graph auto-encoder methods.通过组合图自动编码器方法预测 lncRNAs 与蛋白质之间的潜在相互作用。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac527.
10
Prediction of LncRNA-protein Interactions Using Auto-Encoder, SE-ResNet Models and Transfer Learning.使用自动编码器、SE-ResNet模型和迁移学习预测长链非编码RNA与蛋白质的相互作用
Microrna. 2024;13(2):155-165. doi: 10.2174/0122115366288068240322064431.

引用本文的文献

1
Graph-RPI: predicting RNA-protein interactions via graph autoencoder and self-supervised learning strategies.Graph-RPI:通过图自动编码器和自监督学习策略预测RNA-蛋白质相互作用
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf292.

本文引用的文献

1
ProtInteract: A deep learning framework for predicting protein-protein interactions.ProtInteract:一种用于预测蛋白质-蛋白质相互作用的深度学习框架。
Comput Struct Biotechnol J. 2023 Jan 25;21:1324-1348. doi: 10.1016/j.csbj.2023.01.028. eCollection 2023.
2
RLF-LPI: An ensemble learning framework using sequence information for predicting lncRNA-protein interaction based on AE-ResLSTM and fuzzy decision.RLF-LPI:一种基于 AE-ResLSTM 和模糊决策的利用序列信息进行 lncRNA-蛋白质相互作用预测的集成学习框架。
Math Biosci Eng. 2022 Mar 11;19(5):4749-4764. doi: 10.3934/mbe.2022222.
3
Mechanisms of lncRNA biogenesis as revealed by nascent transcriptomics.
新生转录组学揭示的长链非编码RNA生物合成机制
Nat Rev Mol Cell Biol. 2022 Jun;23(6):389-406. doi: 10.1038/s41580-021-00447-6. Epub 2022 Jan 25.
4
LGFC-CNN: Prediction of lncRNA-Protein Interactions by Using Multiple Types of Features through Deep Learning.LGFC-CNN:通过深度学习利用多种类型特征预测 lncRNA-蛋白质相互作用
Genes (Basel). 2021 Oct 24;12(11):1689. doi: 10.3390/genes12111689.
5
A novel lncRNA-protein interaction prediction method based on deep forest with cascade forest structure.基于级联森林结构的深度森林新型长链非编码 RNA-蛋白质相互作用预测方法。
Sci Rep. 2021 Sep 23;11(1):18881. doi: 10.1038/s41598-021-98277-1.
6
Transfer learning via multi-scale convolutional neural layers for human-virus protein-protein interaction prediction.基于多尺度卷积神经网络的迁移学习方法在人类-病毒蛋白质相互作用预测中的应用。
Bioinformatics. 2021 Dec 11;37(24):4771-4778. doi: 10.1093/bioinformatics/btab533.
7
Long Non-Coding RNAs: The Regulatory Mechanisms, Research Strategies, and Future Directions in Cancers.长链非编码RNA:癌症中的调控机制、研究策略及未来方向
Front Oncol. 2020 Dec 18;10:598817. doi: 10.3389/fonc.2020.598817. eCollection 2020.
8
Gene regulation by long non-coding RNAs and its biological functions.长非编码 RNA 的基因调控及其生物学功能。
Nat Rev Mol Cell Biol. 2021 Feb;22(2):96-118. doi: 10.1038/s41580-020-00315-9. Epub 2020 Dec 22.
9
Disease-Causing Mutations and Rearrangements in Long Non-coding RNA Gene Loci.长链非编码RNA基因座中的致病突变和重排
Front Genet. 2020 Nov 30;11:527484. doi: 10.3389/fgene.2020.527484. eCollection 2020.
10
LncMirNet: Predicting LncRNA-miRNA Interaction Based on Deep Learning of Ribonucleic Acid Sequences.LncMirNet:基于 RNA 序列深度学习的长非编码 RNA- miRNA 相互作用预测。
Molecules. 2020 Sep 23;25(19):4372. doi: 10.3390/molecules25194372.