• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 SVM 模型预测病毒和人类之间的蛋白质-蛋白质相互作用。

Prediction of protein-protein interactions between viruses and human by an SVM model.

机构信息

School of Computer Science and Engineering, Inha University, Incheon, South Korea.

出版信息

BMC Bioinformatics. 2012 May 8;13 Suppl 7(Suppl 7):S5. doi: 10.1186/1471-2105-13-S7-S5.

DOI:10.1186/1471-2105-13-S7-S5
PMID:22595002
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3348049/
Abstract

BACKGROUND

Several computational methods have been developed to predict protein-protein interactions from amino acid sequences, but most of those methods are intended for the interactions within a species rather than for interactions across different species. Methods for predicting interactions between homogeneous proteins are not appropriate for finding those between heterogeneous proteins since they do not distinguish the interactions between proteins of the same species from those of different species.

RESULTS

We developed a new method for representing a protein sequence of variable length in a frequency vector of fixed length, which encodes the relative frequency of three consecutive amino acids of a sequence. We built a support vector machine (SVM) model to predict human proteins that interact with virus proteins. In two types of viruses, human papillomaviruses (HPV) and hepatitis C virus (HCV), our SVM model achieved an average accuracy above 80%, which is higher than that of another SVM model with a different representation scheme. Using the SVM model and Gene Ontology (GO) annotations of proteins, we predicted new interactions between virus proteins and human proteins.

CONCLUSIONS

Encoding the relative frequency of amino acid triplets of a protein sequence is a simple yet powerful representation method for predicting protein-protein interactions across different species. The representation method has several advantages: (1) it enables a prediction model to achieve a better performance than other representations, (2) it generates feature vectors of fixed length regardless of the sequence length, and (3) the same representation is applicable to different types of proteins.

摘要

背景

已经开发出几种从氨基酸序列预测蛋白质-蛋白质相互作用的计算方法,但这些方法大多数旨在预测同一物种内的相互作用,而不是预测不同物种之间的相互作用。用于预测同源蛋白质相互作用的方法不适合寻找异源蛋白质之间的相互作用,因为它们无法区分同一物种蛋白质之间的相互作用与不同物种蛋白质之间的相互作用。

结果

我们开发了一种新方法,用于将可变长度的蛋白质序列表示为固定长度的频率向量,该向量编码序列中三个连续氨基酸的相对频率。我们构建了一个支持向量机 (SVM) 模型来预测与人相互作用的病毒蛋白质。在两种类型的病毒,人乳头瘤病毒 (HPV) 和丙型肝炎病毒 (HCV) 中,我们的 SVM 模型的平均准确率超过 80%,高于具有不同表示方案的另一个 SVM 模型的准确率。使用 SVM 模型和蛋白质的基因本体论 (GO) 注释,我们预测了病毒蛋白和人类蛋白之间的新相互作用。

结论

对蛋白质序列的氨基酸三联体的相对频率进行编码是一种简单而强大的预测不同物种之间蛋白质-蛋白质相互作用的表示方法。该表示方法具有几个优点:(1) 它使预测模型能够实现优于其他表示方法的性能,(2) 它生成固定长度的特征向量,而与序列长度无关,以及 (3) 相同的表示适用于不同类型的蛋白质。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/383144fe8522/1471-2105-13-S7-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/222f1bf9dd75/1471-2105-13-S7-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/4a61d7e79bd7/1471-2105-13-S7-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/7499432f47f8/1471-2105-13-S7-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/383144fe8522/1471-2105-13-S7-S5-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/222f1bf9dd75/1471-2105-13-S7-S5-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/4a61d7e79bd7/1471-2105-13-S7-S5-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/7499432f47f8/1471-2105-13-S7-S5-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f14/3348049/383144fe8522/1471-2105-13-S7-S5-4.jpg

相似文献

1
Prediction of protein-protein interactions between viruses and human by an SVM model.基于 SVM 模型预测病毒和人类之间的蛋白质-蛋白质相互作用。
BMC Bioinformatics. 2012 May 8;13 Suppl 7(Suppl 7):S5. doi: 10.1186/1471-2105-13-S7-S5.
2
An improved method for predicting interactions between virus and human proteins.一种预测病毒与人类蛋白质之间相互作用的改进方法。
J Bioinform Comput Biol. 2017 Feb;15(1):1650024. doi: 10.1142/S0219720016500244. Epub 2016 Jul 10.
3
Predicting protein-protein interactions between human and hepatitis C virus via an ensemble learning method.通过集成学习方法预测人类与丙型肝炎病毒之间的蛋白质-蛋白质相互作用。
Mol Biosyst. 2014 Dec;10(12):3147-54. doi: 10.1039/c4mb00410h. Epub 2014 Sep 18.
4
Prediction of RNA-binding amino acids from protein and RNA sequences.从蛋白质和 RNA 序列预测 RNA 结合氨基酸。
BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S7. doi: 10.1186/1471-2105-12-S13-S7. Epub 2011 Nov 30.
5
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.
6
Performance of rotation forest ensemble classifier and feature extractor in predicting protein interactions using amino acid sequences.旋转森林集成分类器和特征提取器在使用氨基酸序列预测蛋白质相互作用方面的性能。
BMC Genomics. 2019 Dec 24;20(Suppl 9):950. doi: 10.1186/s12864-019-6304-y.
7
Prediction of interactions between viral and host proteins using supervised machine learning methods.使用监督式机器学习方法预测病毒蛋白与宿主蛋白之间的相互作用。
PLoS One. 2014 Nov 6;9(11):e112034. doi: 10.1371/journal.pone.0112034. eCollection 2014.
8
mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines.mGOASVM:基于基因本体和支持向量机的多标签蛋白质亚细胞定位。
BMC Bioinformatics. 2012 Nov 6;13:290. doi: 10.1186/1471-2105-13-290.
9
Predicting the host of influenza viruses based on the word vector.基于词向量预测流感病毒的宿主。
PeerJ. 2017 Jul 18;5:e3579. doi: 10.7717/peerj.3579. eCollection 2017.
10
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.

引用本文的文献

1
Prediction of influenza A virus-human protein-protein interactions using XGBoost with continuous and discontinuous amino acids information.使用具有连续和不连续氨基酸信息的XGBoost预测甲型流感病毒与人的蛋白质-蛋白质相互作用
PeerJ. 2025 Jan 30;13:e18863. doi: 10.7717/peerj.18863. eCollection 2025.
2
HBFormer: a single-stream framework based on hybrid attention mechanism for identification of human-virus protein-protein interactions.HBFormer:一种基于混合注意力机制的单流框架,用于识别人类-病毒蛋白质-蛋白质相互作用。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae724.
3
Prediction of Human Papillomavirus-Host Oncoprotein Interactions Using Deep Learning.

本文引用的文献

1
Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data.使用流形嵌入技术从高通量实验数据中评估和预测蛋白质相互作用。
Bioinformatics. 2010 Nov 1;26(21):2744-51. doi: 10.1093/bioinformatics/btq510. Epub 2010 Sep 3.
2
Predicting protein-protein interactions from protein sequences using meta predictor.利用元预测器从蛋白质序列预测蛋白质-蛋白质相互作用。
Amino Acids. 2010 Nov;39(5):1595-9. doi: 10.1007/s00726-010-0588-1. Epub 2010 Apr 13.
3
Predicting protein-protein interactions from sequence using correlation coefficient and high-quality interaction dataset.
利用深度学习预测人乳头瘤病毒-宿主癌蛋白相互作用
Bioinform Biol Insights. 2024 Dec 10;18:11779322241304666. doi: 10.1177/11779322241304666. eCollection 2024.
4
Bioinformatic Resources for Exploring Human-virus Protein-protein Interactions Based on Binding Modes.基于结合模式探索人类-病毒蛋白质-蛋白质相互作用的生物信息学资源
Genomics Proteomics Bioinformatics. 2024 Dec 3;22(5). doi: 10.1093/gpbjnl/qzae075.
5
Oral_voting_transfer: classification of oral microorganisms' function proteins with voting transfer model.口头投票转移:用投票转移模型对口腔微生物功能蛋白进行分类
Front Microbiol. 2024 Feb 7;14:1277121. doi: 10.3389/fmicb.2023.1277121. eCollection 2023.
6
A correlation coefficient-based feature selection approach for virus-host protein-protein interaction prediction.基于相关系数的病毒-宿主蛋白质相互作用预测特征选择方法。
PLoS One. 2023 May 2;18(5):e0285168. doi: 10.1371/journal.pone.0285168. eCollection 2023.
7
Bioinformatics approaches for unveiling virus-host interactions.用于揭示病毒-宿主相互作用的生物信息学方法。
Comput Struct Biotechnol J. 2023;21:1774-1784. doi: 10.1016/j.csbj.2023.02.044. Epub 2023 Feb 27.
8
RF_phage virion: Classification of phage virion proteins with a random forest model.射频噬菌体病毒体:用随机森林模型对噬菌体病毒体蛋白进行分类
Front Genet. 2023 Feb 8;13:1103783. doi: 10.3389/fgene.2022.1103783. eCollection 2022.
9
Identification of intelligence-related proteins through a robust two-layer predictor.通过强大的双层预测器鉴定与智力相关的蛋白质。
Commun Integr Biol. 2022 Nov 15;15(1):253-264. doi: 10.1080/19420889.2022.2143101. eCollection 2022.
10
Machine learning in postgenomic biology and personalized medicine.后基因组生物学与个性化医学中的机器学习
Wiley Interdiscip Rev Data Min Knowl Discov. 2022 Mar-Apr;12(2). doi: 10.1002/widm.1451. Epub 2022 Jan 24.
利用相关系数和高质量的交互数据集从序列预测蛋白质-蛋白质相互作用。
Amino Acids. 2010 Mar;38(3):891-9. doi: 10.1007/s00726-009-0295-y. Epub 2009 Apr 24.
4
Hepatitis C virus infection protein network.丙型肝炎病毒感染蛋白网络
Mol Syst Biol. 2008;4:230. doi: 10.1038/msb.2008.66. Epub 2008 Nov 4.
5
Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences.利用支持向量机结合自协方差从蛋白质序列预测蛋白质-蛋白质相互作用。
Nucleic Acids Res. 2008 May;36(9):3025-30. doi: 10.1093/nar/gkn159. Epub 2008 Apr 4.
6
Predicting protein-protein interactions based only on sequences information.仅基于序列信息预测蛋白质-蛋白质相互作用。
Proc Natl Acad Sci U S A. 2007 Mar 13;104(11):4337-41. doi: 10.1073/pnas.0607879104. Epub 2007 Mar 5.
7
Prophylactic human papillomavirus vaccines.预防性人乳头瘤病毒疫苗
J Clin Invest. 2006 May;116(5):1167-73. doi: 10.1172/JCI28607.
8
Prediction of yeast protein-protein interaction network: insights from the Gene Ontology and annotations.酵母蛋白质-蛋白质相互作用网络的预测:来自基因本体论和注释的见解。
Nucleic Acids Res. 2006 Apr 26;34(7):2137-50. doi: 10.1093/nar/gkl219. Print 2006.
9
From structure to function: new insights into hepatitis C virus RNA replication.从结构到功能:丙型肝炎病毒RNA复制的新见解
J Biol Chem. 2006 Apr 14;281(15):9833-6. doi: 10.1074/jbc.R500026200. Epub 2005 Dec 30.
10
Predicting protein-protein interactions using signature products.使用特征产物预测蛋白质-蛋白质相互作用。
Bioinformatics. 2005 Jan 15;21(2):218-26. doi: 10.1093/bioinformatics/bth483. Epub 2004 Aug 19.