• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用具有二核苷酸-密码子融合特征编码的混合深度学习模型预测长链非编码RNA-蛋白质相互作用。

Predicting lncRNA-protein interactions using a hybrid deep learning model with dinucleotide-codon fusion feature encoding.

作者信息

Tan Li, Mengshan Li, Yu Fu, Yelin Li, Jihong Zhu, Lixin Guan

机构信息

College of Physics and Electronic Information, Gannan Normal University, Ganzhou, 341000, Jiangxi, China.

Ganzhou Power Supply Branch of State Grid Jiangxi Electric Power Co., Ltd, Ganzhou, 341000, Jiangxi, China.

出版信息

BMC Genomics. 2024 Dec 28;25(1):1253. doi: 10.1186/s12864-024-11168-3.

DOI:10.1186/s12864-024-11168-3
PMID:39732642
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11682639/
Abstract

Long non-coding RNAs (lncRNAs) play crucial roles in numerous biological processes and are involved in complex human diseases through interactions with proteins. Accurate identification of lncRNA-protein interactions (LPI) can help elucidate the functional mechanisms of lncRNAs and provide scientific insights into the molecular mechanisms underlying related diseases. While many sequence-based methods have been developed to predict LPIs, efficiently extracting and effectively integrating potential feature information that reflects functional attributes from lncRNA and protein sequences remains a significant challenge. This paper proposes a Dinucleotide-Codon Fusion Feature encoding (DNCFF) and constructs an LPI prediction model based on deep learning, termed LPI-DNCFF. The Dual Nucleotide Visual Fusion Feature encoding (DNVFF) incorporates positional information of single nucleotides with subsequent nucleotide connections, while Codon Fusion Feature encoding (CFF) considers the specificity, molecular weight, and physicochemical properties of each amino acid. These encoding methods encapsulate rich and intuitive sequence information in limited encoding dimensions. The model comprehensively predicts LPIs by integrating global, local, and structural features, and inputs them into BiLSTM and attention layers to form a hybrid deep learning model. Experimental results demonstrate that LPI-DNCFF effectively predicts LPIs. The BiLSTM layer and attention mechanism can learn long-term dependencies and identify weighted key features, enhancing model performance. Compared to one-hot encoding, DNCFF more efficiently and thoroughly extracts potential sequence features. Compared to other existing methods, LPI-DNCFF achieved the best performance on the RPI1847 and ATH948 datasets, with MCC values of approximately 97.84% and 84.58%, respectively, outperforming the state-of-the-art method by about 1.44% and 3.48%.

摘要

长链非编码RNA(lncRNAs)在众多生物过程中发挥着关键作用,并通过与蛋白质相互作用参与复杂的人类疾病。准确识别lncRNA-蛋白质相互作用(LPI)有助于阐明lncRNAs的功能机制,并为相关疾病的分子机制提供科学见解。虽然已经开发了许多基于序列的方法来预测LPI,但从lncRNA和蛋白质序列中有效提取并有效整合反映功能属性的潜在特征信息仍然是一项重大挑战。本文提出了一种二核苷酸-密码子融合特征编码(DNCFF),并构建了一个基于深度学习的LPI预测模型,称为LPI-DNCFF。双核苷酸视觉融合特征编码(DNVFF)将单核苷酸的位置信息与后续核苷酸连接相结合,而密码子融合特征编码(CFF)则考虑了每个氨基酸的特异性、分子量和理化性质。这些编码方法在有限的编码维度中封装了丰富而直观的序列信息。该模型通过整合全局、局部和结构特征来全面预测LPI,并将其输入到双向长短期记忆网络(BiLSTM)和注意力层中,形成一个混合深度学习模型。实验结果表明,LPI-DNCFF能够有效地预测LPI。BiLSTM层和注意力机制可以学习长期依赖性并识别加权关键特征,从而提高模型性能。与独热编码相比,DNCFF能更高效、更全面地提取潜在序列特征。与其他现有方法相比,LPI-DNCFF在RPI1847和ATH948数据集上取得了最佳性能,马修斯相关系数(MCC)值分别约为97.84%和84.58%,比现有最先进的方法分别高出约1.44%和3.48%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/d82e8f71697e/12864_2024_11168_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/caa0e19b1225/12864_2024_11168_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/12a45784baa8/12864_2024_11168_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/1d92fad7da23/12864_2024_11168_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/a0ffc4112e8a/12864_2024_11168_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/a14d87055eca/12864_2024_11168_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/e896ad2759de/12864_2024_11168_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/a48c2bc019e8/12864_2024_11168_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/bd3db15e79e1/12864_2024_11168_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/23888a2d6b39/12864_2024_11168_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/5d924e91eb84/12864_2024_11168_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/d82e8f71697e/12864_2024_11168_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/caa0e19b1225/12864_2024_11168_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/12a45784baa8/12864_2024_11168_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/1d92fad7da23/12864_2024_11168_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/a0ffc4112e8a/12864_2024_11168_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/a14d87055eca/12864_2024_11168_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/e896ad2759de/12864_2024_11168_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/a48c2bc019e8/12864_2024_11168_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/bd3db15e79e1/12864_2024_11168_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/23888a2d6b39/12864_2024_11168_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/5d924e91eb84/12864_2024_11168_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b30/11682639/d82e8f71697e/12864_2024_11168_Fig11_HTML.jpg

相似文献

1
Predicting lncRNA-protein interactions using a hybrid deep learning model with dinucleotide-codon fusion feature encoding.使用具有二核苷酸-密码子融合特征编码的混合深度学习模型预测长链非编码RNA-蛋白质相互作用。
BMC Genomics. 2024 Dec 28;25(1):1253. doi: 10.1186/s12864-024-11168-3.
2
LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification.LPI-HyADBS:一种集成特征选择和分类的 lncRNA-蛋白质相互作用预测的混合框架。
BMC Bioinformatics. 2021 Nov 26;22(1):568. doi: 10.1186/s12859-021-04485-x.
3
Finding lncRNA-Protein Interactions Based on Deep Learning With Dual-Net Neural Architecture.基于双网络神经架构深度学习的长链非编码RNA-蛋白质相互作用研究
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3456-3468. doi: 10.1109/TCBB.2021.3116232. Epub 2022 Dec 8.
4
LPI-deepGBDT: a multiple-layer deep framework based on gradient boosting decision trees for lncRNA-protein interaction identification.LPI-deepGBDT:基于梯度提升决策树的多层深度框架,用于 lncRNA-蛋白质相互作用识别。
BMC Bioinformatics. 2021 Oct 4;22(1):479. doi: 10.1186/s12859-021-04399-8.
5
Capsule-LPI: a LncRNA-protein interaction predicting tool based on a capsule network.胶囊-LPI:一种基于胶囊网络的 LncRNA-蛋白质相互作用预测工具。
BMC Bioinformatics. 2021 May 13;22(1):246. doi: 10.1186/s12859-021-04171-y.
6
RLF-LPI: An ensemble learning framework using sequence information for predicting lncRNA-protein interaction based on AE-ResLSTM and fuzzy decision.RLF-LPI:一种基于 AE-ResLSTM 和模糊决策的利用序列信息进行 lncRNA-蛋白质相互作用预测的集成学习框架。
Math Biosci Eng. 2022 Mar 11;19(5):4749-4764. doi: 10.3934/mbe.2022222.
7
SFPEL-LPI: Sequence-based feature projection ensemble learning for predicting LncRNA-protein interactions.SFPEL-LPI:基于序列的特征投影集成学习预测 LncRNA-蛋白质相互作用。
PLoS Comput Biol. 2018 Dec 11;14(12):e1006616. doi: 10.1371/journal.pcbi.1006616. eCollection 2018 Dec.
8
Prediction of LncRNA-protein Interactions Using Auto-Encoder, SE-ResNet Models and Transfer Learning.使用自动编码器、SE-ResNet模型和迁移学习预测长链非编码RNA与蛋白质的相互作用
Microrna. 2024;13(2):155-165. doi: 10.2174/0122115366288068240322064431.
9
A novel lncRNA-protein interaction prediction method based on deep forest with cascade forest structure.基于级联森林结构的深度森林新型长链非编码 RNA-蛋白质相互作用预测方法。
Sci Rep. 2021 Sep 23;11(1):18881. doi: 10.1038/s41598-021-98277-1.
10
LPI-SKMSC: Predicting LncRNA-Protein Interactions with Segmented k-mer Frequencies and Multi-space Clustering.LPI-SKMSC:基于分段 k--mer 频率和多空间聚类的长链非编码 RNA-蛋白质相互作用预测。
Interdiscip Sci. 2024 Jun;16(2):378-391. doi: 10.1007/s12539-023-00598-4. Epub 2024 Jan 11.

本文引用的文献

1
DrugReAlign: a multisource prompt framework for drug repurposing based on large language models.DrugReAlign:一种基于大型语言模型的药物重定位多源提示框架。
BMC Biol. 2024 Oct 8;22(1):226. doi: 10.1186/s12915-024-02028-3.
2
Efficient Deep Model Ensemble Framework for Drug-Target Interaction Prediction.用于药物-靶点相互作用预测的高效深度模型集成框架
J Phys Chem Lett. 2024 Aug 1;15(30):7681-7693. doi: 10.1021/acs.jpclett.4c01509. Epub 2024 Jul 22.
3
Revisiting drug-protein interaction prediction: a novel global-local perspective.
重新审视药物-蛋白质相互作用预测:一种新颖的全局-局部视角。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae271.
4
AMGDTI: drug-target interaction prediction based on adaptive meta-graph learning in heterogeneous network.AMGDTI:基于异构网络中自适应元图学习的药物-靶标相互作用预测。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad474.
5
A high level of the long non-coding RNA is associated with poor prognosis in breast cancer and activates YAP transcriptional activity to enhance breast cancer proliferation and metastasis.高水平的长链非编码 RNA 与乳腺癌的不良预后相关,并激活 YAP 转录活性,从而增强乳腺癌的增殖和转移。
Bioengineered. 2022 May;13(5):13437-13451. doi: 10.1080/21655979.2022.2074108.
6
RLF-LPI: An ensemble learning framework using sequence information for predicting lncRNA-protein interaction based on AE-ResLSTM and fuzzy decision.RLF-LPI:一种基于 AE-ResLSTM 和模糊决策的利用序列信息进行 lncRNA-蛋白质相互作用预测的集成学习框架。
Math Biosci Eng. 2022 Mar 11;19(5):4749-4764. doi: 10.3934/mbe.2022222.
7
Mechanisms of lncRNA biogenesis as revealed by nascent transcriptomics.新生转录组学揭示的长链非编码RNA生物合成机制
Nat Rev Mol Cell Biol. 2022 Jun;23(6):389-406. doi: 10.1038/s41580-021-00447-6. Epub 2022 Jan 25.
8
LPI-EnEDT: an ensemble framework with extra tree and decision tree classifiers for imbalanced lncRNA-protein interaction data classification.LPI-EnEDT:一种用于不平衡长链非编码RNA-蛋白质相互作用数据分类的集成框架,包含额外树和决策树分类器。
BioData Min. 2021 Dec 3;14(1):50. doi: 10.1186/s13040-021-00277-4.
9
LGFC-CNN: Prediction of lncRNA-Protein Interactions by Using Multiple Types of Features through Deep Learning.LGFC-CNN:通过深度学习利用多种类型特征预测 lncRNA-蛋白质相互作用
Genes (Basel). 2021 Oct 24;12(11):1689. doi: 10.3390/genes12111689.
10
LncRNA ZFAS1: Role in tumorigenesis and other diseases.长链非编码 RNA ZFAS1:在肿瘤发生和其他疾病中的作用。
Biomed Pharmacother. 2021 Oct;142:111999. doi: 10.1016/j.biopha.2021.111999. Epub 2021 Aug 9.