• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机的 RNA 结合蛋白结合残基和进化信息预测。

SVM based prediction of RNA-binding proteins using binding residues and evolutionary information.

机构信息

Department of Biology, McGill University, Montreal, QC, H3A 1B1, Canada.

出版信息

J Mol Recognit. 2011 Mar-Apr;24(2):303-13. doi: 10.1002/jmr.1061.

DOI:10.1002/jmr.1061
PMID:20677174
Abstract

RNA-binding proteins (RBPs) play crucial role in transcription and gene-regulation. This paper describes a support vector machine (SVM) based method for discriminating and classifying RNA-binding and non-binding proteins using sequence features. With the threshold of 30% interacting residues, RNA-binding amino acid prediction method PPRINT achieved the Matthews correlation coefficient (MCC) of 0.32. BLAST and PSI-BLAST identified RBPs with the coverage of 32.63 and 33.16%, respectively, at the e-value of 1e-4. The SVM models developed with amino acid, dipeptide and four-part amino acid compositions showed the MCC of 0.60, 0.46, and 0.53, respectively. This is the first study in which evolutionary information in form of position specific scoring matrix (PSSM) profile has been successfully used for predicting RBPs. We achieved the maximum MCC of 0.62 using SVM model based on PSSM called PSSM-400. Finally, we developed different hybrid approaches and achieved maximum MCC of 0.66. We also developed a method for predicting three subclasses of RNA binding proteins (e.g., rRNA, tRNA, mRNA binding proteins). The performance of the method was also evaluated on an independent dataset of 69 RBPs and 100 non-RBPs (NBPs). An additional benchmarking was also performed using gene ontology (GO) based annotation. Based on the hybrid approach a web-server RNApred has been developed for predicting RNA binding proteins from amino acid sequences (http://www.imtech.res.in/raghava/rnapred/).

摘要

RNA 结合蛋白(RBPs)在转录和基因调控中起着至关重要的作用。本文描述了一种基于支持向量机(SVM)的方法,用于使用序列特征区分和分类 RNA 结合和非结合蛋白。使用 30%相互作用残基的阈值,RNA 结合氨基酸预测方法 PPRINT 实现了 Matthews 相关系数(MCC)为 0.32。BLAST 和 PSI-BLAST 分别以覆盖率 32.63%和 33.16%,在 e 值为 1e-4 的情况下鉴定了 RBPs。使用氨基酸、二肽和四部分氨基酸组成开发的 SVM 模型显示 MCC 分别为 0.60、0.46 和 0.53。这是首次成功使用形式为位置特异性评分矩阵(PSSM)图谱的进化信息来预测 RBPs 的研究。我们使用基于 PSSM 的 SVM 模型 PSSM-400 实现了最大 MCC 为 0.62。最后,我们开发了不同的混合方法,实现了最大 MCC 为 0.66。我们还开发了一种预测三种 RNA 结合蛋白亚类(例如 rRNA、tRNA、mRNA 结合蛋白)的方法。该方法的性能也在 69 个 RBPs 和 100 个非 RBPs(NBPs)的独立数据集上进行了评估。还使用基于基因本体(GO)的注释进行了额外的基准测试。基于混合方法,我们开发了一个用于从氨基酸序列预测 RNA 结合蛋白的网络服务器 RNApred(http://www.imtech.res.in/raghava/rnapred/)。

相似文献

1
SVM based prediction of RNA-binding proteins using binding residues and evolutionary information.基于支持向量机的 RNA 结合蛋白结合残基和进化信息预测。
J Mol Recognit. 2011 Mar-Apr;24(2):303-13. doi: 10.1002/jmr.1061.
2
Prediction of RNA binding sites in a protein using SVM and PSSM profile.使用支持向量机和位置特异性得分矩阵预测蛋白质中的RNA结合位点。
Proteins. 2008 Apr;71(1):189-94. doi: 10.1002/prot.21677.
3
DPROT: prediction of disordered proteins using evolutionary information.DPROT:利用进化信息预测无序蛋白质。
Amino Acids. 2008 Oct;35(3):599-605. doi: 10.1007/s00726-008-0085-y. Epub 2008 Apr 19.
4
A machine learning based method for the prediction of secretory proteins using amino acid composition, their order and similarity-search.一种基于机器学习的方法,利用氨基酸组成、顺序和相似性搜索来预测分泌蛋白。
In Silico Biol. 2008;8(2):129-40.
5
Prediction of transmembrane regions of beta-barrel proteins using ANN- and SVM-based methods.使用基于人工神经网络和支持向量机的方法预测β-桶状蛋白的跨膜区域。
Proteins. 2004 Jul 1;56(1):11-8. doi: 10.1002/prot.20092.
6
Prediction of RNA-binding residues in proteins from primary sequence using an enriched random forest model with a novel hybrid feature.基于新型混合特征的富集随机森林模型预测蛋白质中 RNA 结合残基的一级序列
Proteins. 2011 Apr;79(4):1230-9. doi: 10.1002/prot.22958. Epub 2011 Jan 25.
7
PRINTR: prediction of RNA binding sites in proteins using SVM and profiles.PRINTR:使用支持向量机和图谱预测蛋白质中的RNA结合位点
Amino Acids. 2008 Aug;35(2):295-302. doi: 10.1007/s00726-007-0634-9. Epub 2008 Jan 31.
8
Predicting sub-cellular localization of tRNA synthetases from their primary structures.从一级结构预测 tRNA 合成酶的亚细胞定位。
Amino Acids. 2012 May;42(5):1703-13. doi: 10.1007/s00726-011-0872-8. Epub 2011 Mar 13.
9
Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile.利用氨基酸组成拆分和 PSSM 图谱预测疟原虫的线粒体蛋白。
Amino Acids. 2010 Jun;39(1):101-10. doi: 10.1007/s00726-009-0381-1. Epub 2009 Nov 12.
10
Identification of RNA-binding sites in proteins by integrating various sequence information.通过整合各种序列信息鉴定蛋白质中的 RNA 结合位点。
Amino Acids. 2011 Jan;40(1):239-48. doi: 10.1007/s00726-010-0639-7. Epub 2010 Jun 12.

引用本文的文献

1
Accurate prediction of nucleic acid binding proteins using protein language model.使用蛋白质语言模型准确预测核酸结合蛋白。
Bioinform Adv. 2025 Jan 20;5(1):vbaf008. doi: 10.1093/bioadv/vbaf008. eCollection 2025.
2
Improved prediction of DNA and RNA binding proteins with deep learning models.深度学习模型提高 DNA 和 RNA 结合蛋白的预测能力。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae285.
3
"Life is short, and art is long": RNA degradation in cyanobacteria and model bacteria.“人生短暂,艺术长久”:蓝细菌和模式细菌中的RNA降解
mLife. 2022 Mar 24;1(1):21-39. doi: 10.1002/mlf2.12015. eCollection 2022 Mar.
4
RBProkCNN: Deep learning on appropriate contextual evolutionary information for RNA binding protein discovery in prokaryotes.RBProkCNN:基于适当上下文进化信息的深度学习用于原核生物中RNA结合蛋白的发现
Comput Struct Biotechnol J. 2024 Apr 15;23:1631-1640. doi: 10.1016/j.csbj.2024.04.034. eCollection 2024 Dec.
5
Exploring new roles for RNA-binding proteins in epigenetic and gene regulation.探索 RNA 结合蛋白在表观遗传和基因调控中的新作用。
Curr Opin Genet Dev. 2024 Feb;84:102136. doi: 10.1016/j.gde.2023.102136. Epub 2023 Dec 20.
6
HydRA: Deep-learning models for predicting RNA-binding capacity from protein interaction association context and protein sequence.HydRA:基于蛋白质相互作用关联上下文和蛋白质序列预测 RNA 结合能力的深度学习模型。
Mol Cell. 2023 Jul 20;83(14):2595-2611.e11. doi: 10.1016/j.molcel.2023.06.019. Epub 2023 Jul 7.
7
Stress-related biomolecular condensates in plants.植物中与应激相关的生物分子凝聚物。
Plant Cell. 2023 Sep 1;35(9):3187-3204. doi: 10.1093/plcell/koad127.
8
Exploiting liver metabolism for tissue-specific cancer targeting.利用肝脏代谢实现组织特异性癌症靶向
Nat Cancer. 2023 Mar;4(3):310-311. doi: 10.1038/s43018-023-00530-1.
9
qNABpredict: Quick, accurate, and taxonomy-aware sequence-based prediction of content of nucleic acid binding amino acids.qNABpredict:基于序列的快速、准确且具有分类学意识的核酸结合氨基酸含量预测。
Protein Sci. 2023 Jan;32(1):e4544. doi: 10.1002/pro.4544.
10
Bioinformatics approaches for classification and investigation of the evolution of the Na/K-ATPase alpha-subunit.生物信息学方法在钠离子/钾离子-ATP 酶 α 亚基分类和进化研究中的应用。
BMC Ecol Evol. 2022 Oct 26;22(1):122. doi: 10.1186/s12862-022-02071-0.