• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全蛋白质组相互作用挖掘

Whole-proteome interaction mining.

作者信息

Bock Joel R, Gough David A

机构信息

Department of Bioengineering, University of California San Diego, 9500 Gilman Drive, La Jolla 92093-0412, USA.

出版信息

Bioinformatics. 2003 Jan;19(1):125-34. doi: 10.1093/bioinformatics/19.1.125.

DOI:10.1093/bioinformatics/19.1.125
PMID:12499302
Abstract

MOTIVATION

A major post-genomic scientific and technological pursuit is to describe the functions performed by the proteins encoded by the genome. One strategy is to first identify the protein-protein interactions in a proteome, then determine pathways and overall structure relating these interactions, and finally to statistically infer functional roles of individual proteins. Although huge amounts of genomic data are at hand, current experimental protein interaction assays must overcome technical problems to scale-up for high-throughput analysis. In the meantime, bioinformatics approaches may help bridge the information gap required for inference of protein function. In this paper, a previously described data mining approach to prediction of protein-protein interactions (Bock and Gough, 2001, Bioinformatics, 17, 455-460) is extended to interaction mining on a proteome-wide scale. An algorithm (the phylogenetic bootstrap) is introduced, which suggests traversal of a phenogram, interleaving rounds of computation and experiment, to develop a knowledge base of protein interactions in genetically-similar organisms.

RESULTS

The interaction mining approach was demonstrated by building a learning system based on 1,039 experimentally validated protein-protein interactions in the human gastric bacterium Helicobacter pylori. An estimate of the generalization performance of the classifier was derived from 10-fold cross-validation, which indicated expected upper bounds on precision of 80% and sensitivity of 69% when applied to related organisms. One such organism is the enteric pathogen Campylobacter jejuni, in which comprehensive machine learning prediction of all possible pairwise protein-protein interactions was performed. The resulting network of interactions shares an average protein connectivity characteristic in common with previous investigations reported in the literature, offering strong evidence supporting the biological feasibility of the hypothesized map. For inferences about complete proteomes in which the number of pairwise non-interactions is expected to be much larger than the number of actual interactions, we anticipate that the sensitivity will remain the same but precision may decrease. We present specific biological examples of two subnetworks of protein-protein interactions in C. jejuni resulting from the application of this approach, including elements of a two-component signal transduction systems for thermoregulation, and a ferritin uptake network.

摘要

动机

后基因组时代的一项主要科技追求是描述基因组所编码蛋白质的功能。一种策略是首先识别蛋白质组中的蛋白质 - 蛋白质相互作用,然后确定与这些相互作用相关的途径和整体结构,最后从统计学上推断单个蛋白质的功能作用。尽管手头有大量的基因组数据,但当前的实验性蛋白质相互作用检测方法必须克服技术问题才能扩大规模以进行高通量分析。与此同时,生物信息学方法可能有助于弥补推断蛋白质功能所需的信息差距。在本文中,一种先前描述的用于预测蛋白质 - 蛋白质相互作用的数据挖掘方法(Bock和Gough,2001年,《生物信息学》,17卷,455 - 460页)被扩展到全蛋白质组规模的相互作用挖掘。引入了一种算法(系统发育自展法),该算法建议遍历系统发育树图,交错进行多轮计算和实验,以建立遗传相似生物体中蛋白质相互作用的知识库。

结果

通过基于人胃细菌幽门螺杆菌中1039个经实验验证的蛋白质 - 蛋白质相互作用构建学习系统,展示了相互作用挖掘方法。分类器泛化性能的估计来自10折交叉验证,这表明当应用于相关生物体时,预期精度上限为80%,灵敏度为69%。一种这样的生物体是肠道病原体空肠弯曲菌,其中对所有可能的成对蛋白质 - 蛋白质相互作用进行了全面的机器学习预测。所得的相互作用网络与文献中先前报道的研究具有共同的平均蛋白质连接特征,为所假设图谱的生物学可行性提供了有力证据。对于关于完整蛋白质组的推断,预计成对非相互作用的数量将远大于实际相互作用的数量,我们预计灵敏度将保持不变,但精度可能会降低。我们展示了应用此方法在空肠弯曲菌中产生的两个蛋白质 - 蛋白质相互作用子网的具体生物学实例,包括用于温度调节的双组分信号转导系统的元件以及铁蛋白摄取网络。

相似文献

1
Whole-proteome interaction mining.全蛋白质组相互作用挖掘
Bioinformatics. 2003 Jan;19(1):125-34. doi: 10.1093/bioinformatics/19.1.125.
2
MannDB - a microbial database of automated protein sequence analyses and evidence integration for protein characterization.MannDB - 一个用于蛋白质表征的自动蛋白质序列分析和证据整合的微生物数据库。
BMC Bioinformatics. 2006 Oct 17;7:459. doi: 10.1186/1471-2105-7-459.
3
HomoMINT: an inferred human network based on orthology mapping of protein interactions discovered in model organisms.同源互作网络工具(HomoMINT):一种基于在模式生物中发现的蛋白质相互作用的直系同源映射推断出的人类网络。
BMC Bioinformatics. 2005 Dec 1;6 Suppl 4(Suppl 4):S21. doi: 10.1186/1471-2105-6-S4-S21.
4
Predicting subcellular localization of proteins using machine-learned classifiers.使用机器学习分类器预测蛋白质的亚细胞定位。
Bioinformatics. 2004 Mar 1;20(4):547-56. doi: 10.1093/bioinformatics/btg447. Epub 2004 Jan 22.
5
A nearest neighbor approach for automated transporter prediction and categorization from protein sequences.一种基于最近邻方法从蛋白质序列进行自动转运蛋白预测和分类的方法。
Bioinformatics. 2008 May 1;24(9):1129-36. doi: 10.1093/bioinformatics/btn099. Epub 2008 Mar 12.
6
Towards index-based similarity search for protein structure databases.面向蛋白质结构数据库的基于索引的相似性搜索
Proc IEEE Comput Soc Bioinform Conf. 2003;2:148-58.
7
A proteome-wide protein interaction map for Campylobacter jejuni.空肠弯曲菌的全蛋白质组蛋白质相互作用图谱。
Genome Biol. 2007;8(7):R130. doi: 10.1186/gb-2007-8-7-r130.
8
Annotating proteins by mining protein interaction networks.通过挖掘蛋白质相互作用网络对蛋白质进行注释。
Bioinformatics. 2006 Jul 15;22(14):e260-70. doi: 10.1093/bioinformatics/btl221.
9
High throughput profile-profile based fold recognition for the entire human proteome.针对整个人类蛋白质组的基于高通量图谱-图谱的折叠识别。
BMC Bioinformatics. 2006 Jun 7;7:288. doi: 10.1186/1471-2105-7-288.
10
Decision tree based information integration for automated protein classification.基于决策树的信息整合用于蛋白质自动分类
J Bioinform Comput Biol. 2005 Jun;3(3):717-42. doi: 10.1142/s0219720005001259.

引用本文的文献

1
Recent progress and future challenges in structure-based protein-protein interaction prediction.基于结构的蛋白质-蛋白质相互作用预测的最新进展与未来挑战
Mol Ther. 2025 May 7;33(5):2252-2268. doi: 10.1016/j.ymthe.2025.04.003. Epub 2025 Apr 6.
2
ProteinPrompt: a webserver for predicting protein-protein interactions.ProteinPrompt:一个用于预测蛋白质-蛋白质相互作用的网络服务器。
Bioinform Adv. 2022 Aug 17;2(1):vbac059. doi: 10.1093/bioadv/vbac059. eCollection 2022.
3
Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence.
基于蛋白质序列的集成学习模型预测蛋白质-蛋白质相互作用
Biology (Basel). 2022 Jun 30;11(7):995. doi: 10.3390/biology11070995.
4
A Survey on Deep Networks Approaches in Prediction of Sequence-Based Protein-Protein Interactions.基于序列的蛋白质-蛋白质相互作用预测中深度网络方法的综述
SN Comput Sci. 2022;3(4):298. doi: 10.1007/s42979-022-01197-8. Epub 2022 May 19.
5
Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation.基于进化矩阵表示的随机蕨类预测蛋白质-蛋白质相互作用。
Comput Math Methods Med. 2022 Feb 22;2022:7191684. doi: 10.1155/2022/7191684. eCollection 2022.
6
Prediction of host-pathogen protein interactions by extended network model.通过扩展网络模型预测宿主-病原体蛋白相互作用
Turk J Biol. 2021 Apr 20;45(2):138-148. doi: 10.3906/biy-2009-4. eCollection 2021.
7
Using Random Forest Model Combined With Gabor Feature to Predict Protein-Protein Interaction From Protein Sequence.使用随机森林模型结合Gabor特征从蛋白质序列预测蛋白质-蛋白质相互作用
Evol Bioinform Online. 2020 Jun 30;16:1176934320934498. doi: 10.1177/1176934320934498. eCollection 2020.
8
Prediction of Protein-Protein Interactions with Local Weight-Sharing Mechanism in Deep Learning.基于深度学习局部权重共享机制的蛋白质-蛋白质相互作用预测。
Biomed Res Int. 2020 Jun 13;2020:5072520. doi: 10.1155/2020/5072520. eCollection 2020.
9
Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest.基于卷积神经网络和特征选择旋转森林的矩阵基蛋白序列预测蛋白-蛋白相互作用
Sci Rep. 2019 Jul 8;9(1):9848. doi: 10.1038/s41598-019-46369-4.
10
A High Efficient Biological Language Model for Predicting Protein⁻Protein Interactions.一种用于预测蛋白质相互作用的高效生物语言模型。
Cells. 2019 Feb 3;8(2):122. doi: 10.3390/cells8020122.