• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度MC-iNABP:用于核酸结合蛋白多类识别和分类的深度学习

DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins.

作者信息

Cui Feifei, Li Shuang, Zhang Zilong, Sui Miaomiao, Cao Chen, El-Latif Hesham Abd, Zou Quan

机构信息

School of Computer Science and Technology, Hainan University, Haikou 570228, China.

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China.

出版信息

Comput Struct Biotechnol J. 2022 Apr 26;20:2020-2028. doi: 10.1016/j.csbj.2022.04.029. eCollection 2022.

DOI:10.1016/j.csbj.2022.04.029
PMID:35521556
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9065708/
Abstract

Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play vital roles in gene expression. Accurate identification of these proteins is crucial. However, there are two existing challenges: one is the problem of ignoring DNA- and RNA-binding proteins (DRBPs), and the other is a cross-predicting problem referring to DBP predictors predicting DBPs as RBPs, and vice versa. In this study, we proposed a computational predictor, called DeepMC-iNABP, with the goal of solving these difficulties by utilizing a multiclass classification strategy and deep learning approaches. DBPs, RBPs, DRBPs and non-NABPs as separate classes of data were used for training the DeepMC-iNABP model. The results on test data collected in this study and two independent test datasets showed that DeepMC-iNABP has a strong advantage in identifying the DRBPs and has the ability to alleviate the cross-prediction problem to a certain extent. The web-server of DeepMC-iNABP is freely available at http://www.deepmc-inabp.net/. The datasets used in this research can also be downloaded from the website.

摘要

核酸结合蛋白(NABP),包括DNA结合蛋白(DBP)和RNA结合蛋白(RBP),在基因表达中发挥着至关重要的作用。准确识别这些蛋白至关重要。然而,目前存在两个挑战:一是忽略DNA和RNA结合蛋白(DRBP)的问题,另一个是交叉预测问题,即DBP预测器将DBP预测为RBP,反之亦然。在本研究中,我们提出了一种计算预测器,称为DeepMC-iNABP,旨在通过利用多类分类策略和深度学习方法来解决这些难题。将DBP、RBP、DRBP和非NABP作为单独的数据类别用于训练DeepMC-iNABP模型。本研究收集的测试数据以及两个独立测试数据集的结果表明,DeepMC-iNABP在识别DRBP方面具有强大优势,并且能够在一定程度上缓解交叉预测问题。DeepMC-iNABP的网络服务器可在http://www.deepmc-inabp.net/免费获取。本研究中使用的数据集也可从该网站下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/d494296e29b2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/6f432fed4949/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/34f068b48ac3/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/956d0b4f37fe/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/5658c331eb6b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/b0403e086a4a/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/d494296e29b2/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/6f432fed4949/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/34f068b48ac3/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/956d0b4f37fe/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/5658c331eb6b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/b0403e086a4a/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/349c/9065708/d494296e29b2/gr5.jpg

相似文献

1
DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins.深度MC-iNABP:用于核酸结合蛋白多类识别和分类的深度学习
Comput Struct Biotechnol J. 2022 Apr 26;20:2020-2028. doi: 10.1016/j.csbj.2022.04.029. eCollection 2022.
2
iDRBP_MMC: Identifying DNA-Binding Proteins and RNA-Binding Proteins Based on Multi-Label Learning Model and Motif-Based Convolutional Neural Network.iDRBP_MMC:基于多标签学习模型和基于模体的卷积神经网络的 DNA 结合蛋白和 RNA 结合蛋白的鉴定。
J Mol Biol. 2020 Nov 6;432(22):5860-5875. doi: 10.1016/j.jmb.2020.09.008. Epub 2020 Sep 11.
3
IDRBP-PPCT: Identifying Nucleic Acid-Binding Proteins Based on Position-Specific Score Matrix and Position-Specific Frequency Matrix Cross Transformation.基于位置特异得分矩阵和位置特异频率矩阵交叉变换的核酸结合蛋白识别方法(IDRBP-PPCT)
IEEE/ACM Trans Comput Biol Bioinform. 2022 Jul-Aug;19(4):2284-2293. doi: 10.1109/TCBB.2021.3069263. Epub 2022 Aug 8.
4
DeepDRBP-2L: A New Genome Annotation Predictor for Identifying DNA-Binding Proteins and RNA-Binding Proteins Using Convolutional Neural Network and Long Short-Term Memory.DeepDRBP-2L:一种新的基因组注释预测器,用于使用卷积神经网络和长短期记忆识别 DNA 结合蛋白和 RNA 结合蛋白。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1451-1463. doi: 10.1109/TCBB.2019.2952338. Epub 2021 Aug 6.
5
Improved prediction of DNA and RNA binding proteins with deep learning models.深度学习模型提高 DNA 和 RNA 结合蛋白的预测能力。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae285.
6
iDRBP-EL: Identifying DNA- and RNA- Binding Proteins Based on Hierarchical Ensemble Learning.iDRBP-EL:基于分层集成学习识别DNA和RNA结合蛋白。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):432-441. doi: 10.1109/TCBB.2021.3136905. Epub 2023 Feb 3.
7
iDRBP-ECHF: Identifying DNA- and RNA-binding proteins based on extensible cubic hybrid framework.iDRBP-ECHF:基于可扩展立方混合框架的 DNA 和 RNA 结合蛋白识别。
Comput Biol Med. 2022 Oct;149:105940. doi: 10.1016/j.compbiomed.2022.105940. Epub 2022 Aug 13.
8
DeepDNAbP: A deep learning-based hybrid approach to improve the identification of deoxyribonucleic acid-binding proteins.DeepDNAbP:一种基于深度学习的混合方法,用于提高脱氧核糖核酸结合蛋白的识别能力。
Comput Biol Med. 2022 Jun;145:105433. doi: 10.1016/j.compbiomed.2022.105433. Epub 2022 Mar 30.
9
NCBRPred: predicting nucleic acid binding residues in proteins based on multilabel learning.NCBRPred:基于多标签学习的蛋白质核酸结合残基预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa397.
10
Immobilized Titanium (IV) Ion Affinity Chromatography Contributes to Efficient Proteomics Analysis of Cellular Nucleic Acid-Binding Proteins.固定化钛(IV)离子亲和色谱有助于细胞核酸结合蛋白的高效蛋白质组学分析。
J Proteome Res. 2022 Jan 7;21(1):220-231. doi: 10.1021/acs.jproteome.1c00788. Epub 2021 Nov 15.

引用本文的文献

1
GTE-PPIS: a protein-protein interaction site predictor based on graph transformer and equivariant graph neural network.GTE-PPIS:一种基于图变换器和等变图神经网络的蛋白质-蛋白质相互作用位点预测器。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf290.
2
RBPWorld for exploring functions and disease associations of RNA-binding proteins across species.RBPWorld用于探索跨物种RNA结合蛋白的功能和疾病关联。
Nucleic Acids Res. 2025 Jan 6;53(D1):D220-D232. doi: 10.1093/nar/gkae1028.
3
DPNN-ac4C: a dual-path neural network with self-attention mechanism for identification of N4-acetylcytidine (ac4C) in mRNA.

本文引用的文献

1
Protein-DNA/RNA interactions: Machine intelligence tools and approaches in the era of artificial intelligence and big data.蛋白质 - DNA/RNA 相互作用:人工智能与大数据时代的机器智能工具及方法
Proteomics. 2022 Apr;22(8):e2100197. doi: 10.1002/pmic.202100197. Epub 2022 Feb 13.
2
BIN1 rs744373 located in enhancers of brain tissues upregulates BIN1 mRNA expression, thereby leading to Alzheimer's disease.位于脑组织增强子区域的BIN1基因rs744373上调BIN1信使核糖核酸的表达,从而引发阿尔茨海默病。
Alzheimers Dement. 2022 Aug;18(8):1587-1588. doi: 10.1002/alz.12548. Epub 2022 Jan 3.
3
iTTCA-RF: a random forest predictor for tumor T cell antigens.
DPNN-ac4C:一种具有自注意力机制的双路径神经网络,用于识别 mRNA 中的 N4-乙酰胞嘧啶(ac4C)。
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae625.
4
MultiFeatVotPIP: a voting-based ensemble learning framework for predicting proinflammatory peptides.MultiFeatVotPIP:一种基于投票的集成学习框架,用于预测促炎肽。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae505.
5
PMTPred: machine-learning-based prediction of protein methyltransferases using the composition of k-spaced amino acid pairs.PMTPred:基于k间隔氨基酸对组成的蛋白质甲基转移酶的机器学习预测
Mol Divers. 2024 Aug;28(4):2301-2315. doi: 10.1007/s11030-024-10937-2. Epub 2024 Jul 21.
6
iNP_ESM: Neuropeptide Identification Based on Evolutionary Scale Modeling and Unified Representation Embedding Features.iNP_ESM:基于进化尺度建模和统一表示嵌入特征的神经肽识别。
Int J Mol Sci. 2024 Jun 27;25(13):7049. doi: 10.3390/ijms25137049.
7
EPDRNA: A Model for Identifying DNA-RNA Binding Sites in Disease-Related Proteins.EPDRNA:一种用于识别疾病相关蛋白质中DNA-RNA结合位点的模型。
Protein J. 2024 Jun;43(3):513-521. doi: 10.1007/s10930-024-10183-3. Epub 2024 Mar 16.
8
Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides.利用随机森林识别氨基酸的关键物理化学性质,以区分抗癌肽和非抗癌肽。
Int J Mol Sci. 2023 Jun 29;24(13):10854. doi: 10.3390/ijms241310854.
9
A systematic pan-cancer analysis reveals the clinical prognosis and immunotherapy value of C-X3-C motif ligand 1 (CX3CL1).一项系统性泛癌分析揭示了C-X3-C基序配体1(CX3CL1)的临床预后及免疫治疗价值。
Front Genet. 2023 Apr 20;14:1183795. doi: 10.3389/fgene.2023.1183795. eCollection 2023.
10
A Machine Learning Method to Identify Umami Peptide Sequences by Using Multiplicative LSTM Embedded Features.一种利用乘法长短期记忆网络嵌入特征识别鲜味肽序列的机器学习方法。
Foods. 2023 Apr 2;12(7):1498. doi: 10.3390/foods12071498.
iTTCA-RF:一种用于肿瘤 T 细胞抗原的随机森林预测器。
J Transl Med. 2021 Oct 27;19(1):449. doi: 10.1186/s12967-021-03084-x.
4
Detection of alcoholism using EEG signals and a CNN-LSTM-ATTN network.基于 EEG 信号和 CNN-LSTM-ATTN 网络的酒精中毒检测。
Comput Biol Med. 2021 Nov;138:104940. doi: 10.1016/j.compbiomed.2021.104940. Epub 2021 Oct 13.
5
Integration of Multiple-Omics Data to Analyze the Population-Specific Differences for Coronary Artery Disease.多组学数据的整合分析冠心病的人群特异性差异。
Comput Math Methods Med. 2021 Aug 17;2021:7036592. doi: 10.1155/2021/7036592. eCollection 2021.
6
Deepred-Mt: Deep representation learning for predicting C-to-U RNA editing in plant mitochondria.Deepred-Mt:用于预测植物线粒体 C 到 U RNA 编辑的深度表示学习。
Comput Biol Med. 2021 Sep;136:104682. doi: 10.1016/j.compbiomed.2021.104682. Epub 2021 Jul 27.
7
Dual targeting of cytokine storm and viral replication in COVID-19 by plant-derived steroidal pregnanes: An in silico perspective.植物源性甾体孕烷通过靶向细胞因子风暴和 COVID-19 中的病毒复制:一种计算机视角。
Comput Biol Med. 2021 Jul;134:104406. doi: 10.1016/j.compbiomed.2021.104406. Epub 2021 Apr 18.
8
A comprehensive review of the imbalance classification of protein post-translational modifications.蛋白质翻译后修饰不平衡分类的全面综述。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab089.
9
Critical downstream analysis steps for single-cell RNA sequencing data.单细胞 RNA 测序数据分析的关键下游步骤。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab105.
10
Evaluation of machine learning algorithms for health and wellness applications: A tutorial.机器学习算法在健康和养生应用中的评估:教程。
Comput Biol Med. 2021 May;132:104324. doi: 10.1016/j.compbiomed.2021.104324. Epub 2021 Mar 13.