• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用氨基酸组成预测蛋白质-蛋白质相互作用。

Exploiting amino acid composition for predicting protein-protein interactions.

机构信息

Sushmita Roy Computer Science, University of New Mexico, Albuquerque, New Mexico, United States of America.

出版信息

PLoS One. 2009 Nov 20;4(11):e7813. doi: 10.1371/journal.pone.0007813.

DOI:10.1371/journal.pone.0007813
PMID:19936254
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2775920/
Abstract

BACKGROUND

Computational prediction of protein interactions typically use protein domains as classifier features because they capture conserved information of interaction surfaces. However, approaches relying on domains as features cannot be applied to proteins without any domain information. In this paper, we explore the contribution of pure amino acid composition (AAC) for protein interaction prediction. This simple feature, which is based on normalized counts of single or pairs of amino acids, is applicable to proteins from any sequenced organism and can be used to compensate for the lack of domain information.

RESULTS

AAC performed at par with protein interaction prediction based on domains on three yeast protein interaction datasets. Similar behavior was obtained using different classifiers, indicating that our results are a function of features and not of classifiers. In addition to yeast datasets, AAC performed comparably on worm and fly datasets. Prediction of interactions for the entire yeast proteome identified a large number of novel interactions, the majority of which co-localized or participated in the same processes. Our high confidence interaction network included both well-studied and uncharacterized proteins. Proteins with known function were involved in actin assembly and cell budding. Uncharacterized proteins interacted with proteins involved in reproduction and cell budding, thus providing putative biological roles for the uncharacterized proteins.

CONCLUSION

AAC is a simple, yet powerful feature for predicting protein interactions, and can be used alone or in conjunction with protein domains to predict new and validate existing interactions. More importantly, AAC alone performs at par with existing, but more complex, features indicating the presence of sequence-level information that is predictive of interaction, but which is not necessarily restricted to domains.

摘要

背景

计算蛋白质相互作用的预测通常使用蛋白质结构域作为分类器特征,因为它们捕获了相互作用表面的保守信息。然而,依赖结构域作为特征的方法不能应用于没有任何结构域信息的蛋白质。在本文中,我们探讨了纯氨基酸组成(AAC)对蛋白质相互作用预测的贡献。这个简单的特征是基于单个或氨基酸对的归一化计数,适用于任何测序生物的蛋白质,可用于弥补结构域信息的缺乏。

结果

AAC 在三个酵母蛋白质相互作用数据集上的表现与基于结构域的蛋白质相互作用预测相当。使用不同的分类器获得了相似的行为,这表明我们的结果是特征的函数,而不是分类器的函数。除了酵母数据集外,AAC 在蠕虫和苍蝇数据集上的表现也相当。对整个酵母蛋白质组的相互作用预测确定了大量新的相互作用,其中大多数共定位或参与相同的过程。我们的高置信度相互作用网络包括了研究充分的和未被描述的蛋白质。具有已知功能的蛋白质参与了肌动蛋白组装和细胞出芽。未被描述的蛋白质与涉及繁殖和细胞出芽的蛋白质相互作用,从而为未被描述的蛋白质提供了潜在的生物学作用。

结论

AAC 是一种简单而强大的预测蛋白质相互作用的特征,可以单独使用或与蛋白质结构域结合使用来预测新的和验证现有的相互作用。更重要的是,仅 AAC 与现有的、但更复杂的特征相当,这表明存在序列水平的信息,这些信息是可预测的相互作用,但不一定局限于结构域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/3e6a190738fa/pone.0007813.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/cd69393c87fa/pone.0007813.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/f6abac153f6b/pone.0007813.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/31426f1f8793/pone.0007813.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/dd9f39e58f66/pone.0007813.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/cae19d3587a6/pone.0007813.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/5d694bafabd5/pone.0007813.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/c72463a0237b/pone.0007813.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/8b671e9c1334/pone.0007813.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/a281981a7ed4/pone.0007813.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/56192e34e0e0/pone.0007813.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/3e6a190738fa/pone.0007813.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/cd69393c87fa/pone.0007813.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/f6abac153f6b/pone.0007813.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/31426f1f8793/pone.0007813.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/dd9f39e58f66/pone.0007813.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/cae19d3587a6/pone.0007813.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/5d694bafabd5/pone.0007813.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/c72463a0237b/pone.0007813.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/8b671e9c1334/pone.0007813.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/a281981a7ed4/pone.0007813.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/56192e34e0e0/pone.0007813.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f7f/2775920/3e6a190738fa/pone.0007813.g011.jpg

相似文献

1
Exploiting amino acid composition for predicting protein-protein interactions.利用氨基酸组成预测蛋白质-蛋白质相互作用。
PLoS One. 2009 Nov 20;4(11):e7813. doi: 10.1371/journal.pone.0007813.
2
Proteome-scale relationships between local amino acid composition and protein fates and functions.蛋白质组范围内的局部氨基酸组成与蛋白质命运和功能之间的关系。
PLoS Comput Biol. 2018 Sep 24;14(9):e1006256. doi: 10.1371/journal.pcbi.1006256. eCollection 2018 Sep.
3
Robust and accurate prediction of protein self-interactions from amino acids sequence using evolutionary information.利用进化信息从氨基酸序列对蛋白质自身相互作用进行稳健且准确的预测。
Mol Biosyst. 2016 Nov 15;12(12):3702-3710. doi: 10.1039/c6mb00599c.
4
Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps.通过相互作用图谱的图论分析对蛋白质功能进行全蛋白质组预测。
Bioinformatics. 2005 Jun;21 Suppl 1:i302-10. doi: 10.1093/bioinformatics/bti1054.
5
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.
6
Features analysis for identification of date and party hubs in protein interaction network of Saccharomyces Cerevisiae.酿酒酵母蛋白质相互作用网络中日期和派对中心识别的特征分析
BMC Syst Biol. 2010 Dec 19;4:172. doi: 10.1186/1752-0509-4-172.
7
Predicting protein-protein interactions from primary protein sequences using a novel multi-scale local feature representation scheme and the random forest.使用一种新颖的多尺度局部特征表示方案和随机森林从蛋白质一级序列预测蛋白质-蛋白质相互作用。
PLoS One. 2015 May 6;10(5):e0125811. doi: 10.1371/journal.pone.0125811. eCollection 2015.
8
Prediction of protein-protein interactions based on PseAA composition and hybrid feature selection.基于伪氨基酸组成和混合特征选择的蛋白质-蛋白质相互作用预测
Biochem Biophys Res Commun. 2009 Mar 6;380(2):318-22. doi: 10.1016/j.bbrc.2009.01.077. Epub 2009 Jan 24.
9
Probabilistic prediction and ranking of human protein-protein interactions.人类蛋白质-蛋白质相互作用的概率预测与排序
BMC Bioinformatics. 2007 Jul 5;8:239. doi: 10.1186/1471-2105-8-239.
10
Highly Accurate Prediction of Protein-Protein Interactions via Incorporating Evolutionary Information and Physicochemical Characteristics.通过整合进化信息和物理化学特征实现蛋白质-蛋白质相互作用的高精度预测
Int J Mol Sci. 2016 Aug 25;17(9):1396. doi: 10.3390/ijms17091396.

引用本文的文献

1
AllerTrans: a deep learning method for predicting the allergenicity of protein sequences.AllerTrans:一种预测蛋白质序列变应原性的深度学习方法。
Biol Methods Protoc. 2025 Jul 9;10(1):bpaf040. doi: 10.1093/biomethods/bpaf040. eCollection 2025.
2
BroadAMP-GPT: AI-Driven generation of broad-spectrum antimicrobial peptides for combating multidrug-resistant ESKAPE pathogens.BroadAMP-GPT:用于对抗多重耐药性ESKAPE病原体的人工智能驱动的广谱抗菌肽生成。
Gut Microbes. 2025 Dec;17(1):2523811. doi: 10.1080/19490976.2025.2523811. Epub 2025 Jun 26.
3
An optimized deep-forest algorithm using a modified differential evolution optimization algorithm: A case of host-pathogen protein-protein interaction prediction.

本文引用的文献

1
Empirically controlled mapping of the Caenorhabditis elegans protein-protein interactome network.秀丽隐杆线虫蛋白质-蛋白质相互作用组网络的经验性控制图谱绘制。
Nat Methods. 2009 Jan;6(1):47-54. doi: 10.1038/nmeth.1279.
2
Precision and recall estimates for two-hybrid screens.双杂交筛选的精确率和召回率估计。
Bioinformatics. 2009 Feb 1;25(3):372-8. doi: 10.1093/bioinformatics/btn640. Epub 2008 Dec 17.
3
High-quality binary protein interaction map of the yeast interactome network.酵母相互作用组网络的高质量二元蛋白质相互作用图谱。
一种使用改进差分进化优化算法的优化深度森林算法:宿主-病原体蛋白质-蛋白质相互作用预测实例
Comput Struct Biotechnol J. 2025 Jan 26;27:595-611. doi: 10.1016/j.csbj.2025.01.020. eCollection 2025.
4
PepNet: an interpretable neural network for anti-inflammatory and antimicrobial peptides prediction using a pre-trained protein language model.PepNet:一种基于预训练蛋白质语言模型的可解释神经网络,用于预测抗炎和抗菌肽。
Commun Biol. 2024 Sep 28;7(1):1198. doi: 10.1038/s42003-024-06911-1.
5
A Computational Predictor for Accurate Identification of Tumor Homing Peptides by Integrating Sequential and Deep BiLSTM Features.一种通过整合序列和深度 BiLSTM 特征来准确识别肿瘤归巢肽的计算预测器。
Interdiscip Sci. 2024 Jun;16(2):503-518. doi: 10.1007/s12539-024-00628-9. Epub 2024 May 11.
6
Identify Diabetes-related Targets based on ForgeNet_GPC.基于 ForgeNet_GPC 识别糖尿病相关靶点。
Curr Comput Aided Drug Des. 2024;20(7):1042-1054. doi: 10.2174/0115734099258183230929173855.
7
Determining human-coronavirus protein-protein interaction using machine intelligence.利用机器智能确定人类冠状病毒的蛋白质-蛋白质相互作用。
Med Nov Technol Devices. 2023 Jun;18:100228. doi: 10.1016/j.medntd.2023.100228. Epub 2023 Apr 6.
8
Computational Analysis Predicts Correlations among Amino Acids in SARS-CoV-2 Proteomes.计算分析预测了新冠病毒蛋白质组中氨基酸之间的相关性。
Biomedicines. 2023 Feb 10;11(2):512. doi: 10.3390/biomedicines11020512.
9
Anticoagulant Oligonucleotide-Peptide Conjugates: Identification of Thrombin Aptamer Conjugates with Improved Characteristics.抗凝寡核苷酸-肽缀合物:具有改进特性的凝血酶适体缀合物的鉴定。
Int J Mol Sci. 2022 Mar 30;23(7):3820. doi: 10.3390/ijms23073820.
10
Predicting Protein-Protein Interactions via Gated Graph Attention Signed Network.通过门控图注意力签名网络预测蛋白质-蛋白质相互作用。
Biomolecules. 2021 May 28;11(6):799. doi: 10.3390/biom11060799.
Science. 2008 Oct 3;322(5898):104-10. doi: 10.1126/science.1158684. Epub 2008 Aug 21.
4
Predicting co-complexed protein pairs from heterogeneous data.从异构数据中预测共复合蛋白质对。
PLoS Comput Biol. 2008 Apr 18;4(4):e1000054. doi: 10.1371/journal.pcbi.1000054.
5
A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans.一个单基因网络能准确预测秀丽隐杆线虫基因扰动的表型效应。
Nat Genet. 2008 Feb;40(2):181-8. doi: 10.1038/ng.2007.70. Epub 2008 Jan 27.
6
DOMINE: a database of protein domain interactions.DOMINE:一个蛋白质结构域相互作用的数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D656-61. doi: 10.1093/nar/gkm761. Epub 2007 Oct 2.
7
False positive reduction in protein-protein interaction predictions using gene ontology annotations.利用基因本体注释减少蛋白质-蛋白质相互作用预测中的假阳性
BMC Bioinformatics. 2007 Jul 23;8:262. doi: 10.1186/1471-2105-8-262.
8
Discovering functional linkages and uncharacterized cellular pathways using phylogenetic profile comparisons: a comprehensive assessment.利用系统发育谱比较发现功能联系和未表征的细胞途径:一项综合评估。
BMC Bioinformatics. 2007 May 23;8:173. doi: 10.1186/1471-2105-8-173.
9
Deciphering protein-protein interactions. Part II. Computational methods to predict protein and domain interaction partners.解析蛋白质-蛋白质相互作用。第二部分。预测蛋白质和结构域相互作用伙伴的计算方法。
PLoS Comput Biol. 2007 Apr 27;3(4):e43. doi: 10.1371/journal.pcbi.0030043.
10
Deciphering protein-protein interactions. Part I. Experimental techniques and databases.解析蛋白质-蛋白质相互作用。第一部分。实验技术与数据库。
PLoS Comput Biol. 2007 Mar 30;3(3):e42. doi: 10.1371/journal.pcbi.0030042.