• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不依赖同源蛋白质序列的基于序列的蛋白质 - 蛋白质相互作用预测方法的关键性评估。

Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences.

机构信息

Institute of Cellular and Molecular Biology (MBB 3 210B), Center for Systems and Synthetic Biology, University of Texas at Austin, 2500 Speedway, Austin, Texas, USA.

出版信息

BMC Bioinformatics. 2009 Dec 14;10:419. doi: 10.1186/1471-2105-10-419.

DOI:10.1186/1471-2105-10-419
PMID:20003442
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2803199/
Abstract

BACKGROUND

Protein-protein interactions underlie many important biological processes. Computational prediction methods can nicely complement experimental approaches for identifying protein-protein interactions. Recently, a unique category of sequence-based prediction methods has been put forward--unique in the sense that it does not require homologous protein sequences. This enables it to be universally applicable to all protein sequences unlike many of previous sequence-based prediction methods. If effective as claimed, these new sequence-based, universally applicable prediction methods would have far-reaching utilities in many areas of biology research.

RESULTS

Upon close survey, I realized that many of these new methods were ill-tested. In addition, newer methods were often published without performance comparison with previous ones. Thus, it is not clear how good they are and whether there are significant performance differences among them. In this study, I have implemented and thoroughly tested 4 different methods on large-scale, non-redundant data sets. It reveals several important points. First, significant performance differences are noted among different methods. Second, data sets typically used for training prediction methods appear significantly biased, limiting the general applicability of prediction methods trained with them. Third, there is still ample room for further developments. In addition, my analysis illustrates the importance of complementary performance measures coupled with right-sized data sets for meaningful benchmark tests.

CONCLUSIONS

The current study reveals the potentials and limits of the new category of sequence-based protein-protein interaction prediction methods, which in turn provides a firm ground for future endeavours in this important area of contemporary bioinformatics.

摘要

背景

蛋白质-蛋白质相互作用是许多重要生物过程的基础。计算预测方法可以很好地补充识别蛋白质-蛋白质相互作用的实验方法。最近,提出了一种独特的基于序列的预测方法类别——独特之处在于它不需要同源蛋白质序列。这使得它能够普遍适用于所有蛋白质序列,不像许多以前的基于序列的预测方法。如果如声称的那样有效,这些新的基于序列的、普遍适用的预测方法将在生物学研究的许多领域具有深远的应用价值。

结果

经过仔细调查,我意识到这些新方法中有许多未经充分测试。此外,新方法通常在没有与以前的方法进行性能比较的情况下发布。因此,不清楚它们的效果如何,以及它们之间是否存在显著的性能差异。在这项研究中,我在大规模、非冗余数据集上实现并彻底测试了 4 种不同的方法。它揭示了几个重要的观点。首先,不同方法之间存在显著的性能差异。其次,用于训练预测方法的数据集通常存在明显的偏差,限制了使用它们训练的预测方法的普遍适用性。第三,还有很大的发展空间。此外,我的分析说明了互补性能指标与合适大小的数据集相结合对于有意义的基准测试的重要性。

结论

本研究揭示了新型基于序列的蛋白质-蛋白质相互作用预测方法的潜力和局限性,这反过来为当代生物信息学这一重要领域的未来努力提供了坚实的基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b22/2803199/1f7fc31dcfb7/1471-2105-10-419-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b22/2803199/f8a932d4313e/1471-2105-10-419-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b22/2803199/e64913606d94/1471-2105-10-419-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b22/2803199/1f7fc31dcfb7/1471-2105-10-419-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b22/2803199/f8a932d4313e/1471-2105-10-419-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b22/2803199/e64913606d94/1471-2105-10-419-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b22/2803199/1f7fc31dcfb7/1471-2105-10-419-3.jpg

相似文献

1
Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences.不依赖同源蛋白质序列的基于序列的蛋白质 - 蛋白质相互作用预测方法的关键性评估。
BMC Bioinformatics. 2009 Dec 14;10:419. doi: 10.1186/1471-2105-10-419.
2
Imbalance Data Processing Strategy for Protein Interaction Sites Prediction.蛋白质相互作用位点预测的不平衡数据处理策略。
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):985-994. doi: 10.1109/TCBB.2019.2953908. Epub 2021 Jun 3.
3
A statistical model of protein sequence similarity and function similarity reveals overly-specific function predictions.蛋白质序列相似性和功能相似性的统计模型揭示了过于具体的功能预测。
PLoS One. 2009 Oct 21;4(10):e7546. doi: 10.1371/journal.pone.0007546.
4
An integrated approach to the prediction of domain-domain interactions.一种预测结构域-结构域相互作用的综合方法。
BMC Bioinformatics. 2006 May 25;7:269. doi: 10.1186/1471-2105-7-269.
5
A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction.十年蛋白质结构预测关键评估(CASP):蛋白质结构预测的进展、瓶颈与前景
Curr Opin Struct Biol. 2005 Jun;15(3):285-9. doi: 10.1016/j.sbi.2005.05.011.
6
Sequence-based prediction of protein-protein interaction sites with L1-logreg classifier.基于序列的 L1-logreg 分类器预测蛋白质-蛋白质相互作用位点。
J Theor Biol. 2014 May 7;348:47-54. doi: 10.1016/j.jtbi.2014.01.028. Epub 2014 Jan 31.
7
Joint evolutionary trees: a large-scale method to predict protein interfaces based on sequence sampling.联合进化树:一种基于序列采样预测蛋白质界面的大规模方法。
PLoS Comput Biol. 2009 Jan;5(1):e1000267. doi: 10.1371/journal.pcbi.1000267. Epub 2009 Jan 23.
8
Prediction of ligand binding sites using homologous structures and conservation at CASP8.利用同源结构和保守性预测配体结合位点 CASP8.
Proteins. 2009;77 Suppl 9(Suppl 9):147-51. doi: 10.1002/prot.22513.
9
Bioinformatical approaches to detect and analyze protein interactions.用于检测和分析蛋白质相互作用的生物信息学方法。
Methods Mol Biol. 2009;564:401-31. doi: 10.1007/978-1-60761-157-8_23.
10
Prediction of protein-protein interactions from primary sequences.从一级序列预测蛋白质-蛋白质相互作用。
Int J Data Min Bioinform. 2010;4(2):211-27. doi: 10.1504/ijdmb.2010.032151.

引用本文的文献

1
Performance improvement for a 2D convolutional neural network by using SSC encoding on protein-protein interaction tasks.利用 SSC 编码提高二维卷积神经网络在蛋白质相互作用任务上的性能。
BMC Bioinformatics. 2021 Apr 12;22(1):184. doi: 10.1186/s12859-021-04111-w.
2
Evolving knowledge graph similarity for supervised learning in complex biomedical domains.用于复杂生物医学领域中监督学习的进化知识图相似度。
BMC Bioinformatics. 2020 Jan 3;21(1):6. doi: 10.1186/s12859-019-3296-1.
3
Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme.

本文引用的文献

1
Physical protein-protein interactions predicted from microarrays.通过微阵列预测的物理蛋白质-蛋白质相互作用。
Bioinformatics. 2008 Nov 15;24(22):2608-14. doi: 10.1093/bioinformatics/btn498. Epub 2008 Oct 1.
2
Built-in loops allow versatility in domain-domain interactions: lessons from self-interacting domains.内置环允许结构域-结构域相互作用具有多样性:来自自相互作用结构域的经验教训。
Proc Natl Acad Sci U S A. 2008 Sep 9;105(36):13292-7. doi: 10.1073/pnas.0801207105. Epub 2008 Aug 29.
3
High-quality binary protein interaction map of the yeast interactome network.
使用混合特征表示和堆叠泛化方案进行蛋白质-蛋白质相互作用预测。
BMC Bioinformatics. 2019 Jun 10;20(1):308. doi: 10.1186/s12859-019-2907-1.
4
Reciprocal Perspective for Improved Protein-Protein Interaction Prediction.互易视角提高蛋白质-蛋白质相互作用预测。
Sci Rep. 2018 Aug 3;8(1):11694. doi: 10.1038/s41598-018-30044-1.
5
ProfPPIdb: Pairs of physical protein-protein interactions predicted for entire proteomes.ProfPPIdb:针对整个蛋白质组预测的物理蛋白质-蛋白质相互作用对。
PLoS One. 2018 Jul 18;13(7):e0199988. doi: 10.1371/journal.pone.0199988. eCollection 2018.
6
SPRINT: ultrafast protein-protein interaction prediction of the entire human interactome.SPRINT:对整个人类相互作用组进行超快速蛋白质-蛋白质相互作用预测。
BMC Bioinformatics. 2017 Nov 15;18(1):485. doi: 10.1186/s12859-017-1871-x.
7
Evolution of protein-protein interaction networks in yeast.酵母中蛋白质-蛋白质相互作用网络的进化
PLoS One. 2017 Mar 1;12(3):e0171920. doi: 10.1371/journal.pone.0171920. eCollection 2017.
8
Mapping and identification of a potential candidate gene for a novel maturity locus, E10, in soybean.大豆成熟度新位点 E10 的候选基因定位与鉴定。
Theor Appl Genet. 2017 Feb;130(2):377-390. doi: 10.1007/s00122-016-2819-7. Epub 2016 Nov 10.
9
Efficient prediction of human protein-protein interactions at a global scale.在全球范围内高效预测人类蛋白质-蛋白质相互作用。
BMC Bioinformatics. 2014 Dec 10;15(1):383. doi: 10.1186/s12859-014-0383-1.
10
Homology-based prediction of interactions between proteins using Averaged One-Dependence Estimators.基于同源性的蛋白质相互作用预测方法:使用平均单依赖估计。
BMC Bioinformatics. 2014 Jun 23;15:213. doi: 10.1186/1471-2105-15-213.
酵母相互作用组网络的高质量二元蛋白质相互作用图谱。
Science. 2008 Oct 3;322(5898):104-10. doi: 10.1126/science.1158684. Epub 2008 Aug 21.
4
An integrative approach for predicting interactions of protein regions.一种预测蛋白质区域相互作用的综合方法。
Bioinformatics. 2008 Aug 15;24(16):i35-41. doi: 10.1093/bioinformatics/btn290.
5
Message-passing algorithms for the prediction of protein domain interactions from protein-protein interaction data.基于蛋白质-蛋白质相互作用数据预测蛋白质结构域相互作用的消息传递算法。
Bioinformatics. 2008 Sep 15;24(18):2064-70. doi: 10.1093/bioinformatics/btn366. Epub 2008 Jul 17.
6
Global investigation of protein-protein interactions in yeast Saccharomyces cerevisiae using re-occurring short polypeptide sequences.利用重复出现的短多肽序列对酿酒酵母中的蛋白质-蛋白质相互作用进行全球调查。
Nucleic Acids Res. 2008 Aug;36(13):4286-94. doi: 10.1093/nar/gkn390. Epub 2008 Jun 27.
7
An in vivo map of the yeast protein interactome.酵母蛋白质相互作用组的体内图谱。
Science. 2008 Jun 13;320(5882):1465-70. doi: 10.1126/science.1153878. Epub 2008 May 8.
8
Evolution and dynamics of protein interactions and networks.蛋白质相互作用与网络的进化及动态变化
Curr Opin Struct Biol. 2008 Jun;18(3):349-57. doi: 10.1016/j.sbi.2008.03.003. Epub 2008 Apr 28.
9
A map of human protein interactions derived from co-expression of human mRNAs and their orthologs.一张源自人类mRNA及其直系同源基因共表达的人类蛋白质相互作用图谱。
Mol Syst Biol. 2008;4:180. doi: 10.1038/msb.2008.19. Epub 2008 Apr 15.
10
Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences.利用支持向量机结合自协方差从蛋白质序列预测蛋白质-蛋白质相互作用。
Nucleic Acids Res. 2008 May;36(9):3025-30. doi: 10.1093/nar/gkn159. Epub 2008 Apr 4.