• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结核分枝杆菌 H37Rv 蛋白-蛋白相互作用数据集的比较分析与评估。

Comparative analysis and assessment of M. tuberculosis H37Rv protein-protein interaction datasets.

机构信息

NUS Graduate School for Integrative Sciences & Engineering, National University of Singapore, 28 Medical Drive, Singapore 117456.

出版信息

BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S20. doi: 10.1186/1471-2164-12-S3-S20.

DOI:10.1186/1471-2164-12-S3-S20
PMID:22369691
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3333180/
Abstract

BACKGROUND

M. tuberculosis is a formidable bacterial pathogen. There is thus an increasing demand on understanding the function and relationship of proteins in various strains of M. tuberculosis. Protein-protein interactions (PPIs) data are crucial for this kind of knowledge. However, the quality of the main available M. tuberculosis PPI datasets is unclear. This hampers the effectiveness of research works that rely on these PPI datasets. Here, we analyze the two main available M. tuberculosis H37Rv PPI datasets. The first dataset is the high-throughput B2H PPI dataset from Wang et al's recent paper in Journal of Proteome Research. The second dataset is from STRING database, version 8.3, comprising entirely of H37Rv PPIs predicted using various methods. We find that these two datasets have a surprisingly low level of agreement. We postulate the following causes for this low level of agreement: (i) the H37Rv B2H PPI dataset is of low quality; (ii) the H37Rv STRING PPI dataset is of low quality; and/or (iii) the H37Rv STRING PPIs are predictions of other forms of functional associations rather than direct physical interactions.

RESULTS

To test the quality of these two datasets, we evaluate them based on correlated gene expression profiles, coherent informative GO term annotations, and conservation in other organisms. We observe a significantly greater portion of PPIs in the H37Rv STRING PPI dataset (with score ≥ 770) having correlated gene expression profiles and coherent informative GO term annotations in both interaction partners than that in the H37Rv B2H PPI dataset. Predicted H37Rv interologs derived from non-M. tuberculosis experimental PPIs are much more similar to the H37Rv STRING functional associations dataset (with score ≥ 770) than the H37Rv B2H PPI dataset. H37Rv predicted physical interologs from IntAct also show extremely low similarity with the H37Rv B2H PPI dataset; and this similarity level is much lower than that between the S. aureus MRSA252 predicted physical interologs from IntAct and S. aureus MRSA252 pull-down PPIs. Comparative analysis with several representative two-hybrid PPI datasets in other species further confirms that the H37Rv B2H PPI dataset is of low quality. Next, to test the possibility that the H37Rv STRING PPIs are not purely direct physical interactions, we compare M. tuberculosis H37Rv protein pairs that catalyze adjacent steps in enzymatic reactions to B2H PPIs and predicted PPIs in STRING, which shows it has much lower similarities with the B2H PPIs than with STRING PPIs. This result strongly suggests that the H37Rv STRING PPIs more likely correspond to indirect relationships between protein pairs than to B2H PPIs. For more precise support, we turn to S. cerevisiae for its comprehensively studied interactome. We compare S. cerevisiae predicted PPIs in STRING to three independent protein relationship datasets which respectively comprise PPIs reported in Y2H assays, protein pairs reported to be in the same protein complexes, and protein pairs that catalyze successive reaction steps in enzymatic reactions. Our analysis reveals that S. cerevisiae predicted STRING PPIs have much higher similarity to the latter two types of protein pairs than to two-hybrid PPIs. As H37Rv STRING PPIs are predicted using similar methods as S. cerevisiae predicted STRING PPIs, this suggests that these H37Rv STRING PPIs are more likely to correspond to the latter two types of protein pairs rather than to two-hybrid PPIs as well.

CONCLUSIONS

The H37Rv B2H PPI dataset has low quality. It should not be used as the gold standard to assess the quality of other (possibly predicted) H37Rv PPI datasets. The H37Rv STRING PPI dataset also has low quality; nevertheless, a subset consisting of STRING PPIs with score ≥770 has satisfactory quality. However, these STRING "PPIs" should be interpreted as functional associations, which include a substantial portion of indirect protein interactions, rather than direct physical interactions. These two factors cause the strikingly low similarity between these two main H37Rv PPI datasets. The results and conclusions from this comparative analysis provide valuable guidance in using these M. tuberculosis H37Rv PPI datasets in subsequent studies for a wide range of purposes.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/aefed353f3fc/1471-2164-12-S3-S20-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/c0017b9a4f24/1471-2164-12-S3-S20-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/3716399fd7d5/1471-2164-12-S3-S20-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/338681a249ba/1471-2164-12-S3-S20-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/fb30d0ba0d3f/1471-2164-12-S3-S20-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/a677cf03ec09/1471-2164-12-S3-S20-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/079649efb9e2/1471-2164-12-S3-S20-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/c0d16228fd7c/1471-2164-12-S3-S20-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/bee242e0e6f6/1471-2164-12-S3-S20-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/aefed353f3fc/1471-2164-12-S3-S20-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/c0017b9a4f24/1471-2164-12-S3-S20-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/3716399fd7d5/1471-2164-12-S3-S20-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/338681a249ba/1471-2164-12-S3-S20-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/fb30d0ba0d3f/1471-2164-12-S3-S20-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/a677cf03ec09/1471-2164-12-S3-S20-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/079649efb9e2/1471-2164-12-S3-S20-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/c0d16228fd7c/1471-2164-12-S3-S20-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/bee242e0e6f6/1471-2164-12-S3-S20-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/300e/3333180/aefed353f3fc/1471-2164-12-S3-S20-10.jpg
摘要

背景

结核分枝杆菌是一种强大的细菌病原体。因此,人们越来越需要了解各种结核分枝杆菌菌株中蛋白质的功能和关系。蛋白质-蛋白质相互作用(PPIs)数据对于这类知识至关重要。然而,主要的可用结核分枝杆菌 PPI 数据集的质量尚不清楚。这阻碍了依赖这些 PPI 数据集的研究工作的有效性。在这里,我们分析了两种主要的结核分枝杆菌 H37Rv PPI 数据集。第一个数据集是 Wang 等人在《蛋白质组研究杂志》上最近发表的论文中的高通量 B2H PPI 数据集。第二个数据集来自 STRING 数据库,版本 8.3,完全由使用各种方法预测的 H37Rv PPIs 组成。我们发现这两个数据集的一致性水平出人意料地低。我们提出了以下导致这种低一致性水平的原因:(i)H37Rv B2H PPI 数据集质量较低;(ii)H37Rv STRING PPI 数据集质量较低;和/或(iii)H37Rv STRING PPIs 是其他形式的功能关联的预测,而不是直接的物理相互作用。

结果

为了测试这两个数据集的质量,我们基于相关基因表达谱、连贯的信息丰富的 GO 术语注释以及在其他生物体中的保守性对它们进行评估。我们观察到 H37Rv STRING PPI 数据集中(得分≥770)具有更多的 PPI 与相互作用伙伴的相关基因表达谱和连贯的信息丰富的 GO 术语注释,而 H37Rv B2H PPI 数据集中则较少。从非结核分枝杆菌实验 PPI 中预测的结核分枝杆菌同源物与 H37Rv STRING 功能关联数据集(得分≥770)更为相似,而与 H37Rv B2H PPI 数据集相比则不相似。从 IntAct 中预测的 H37Rv 物理同源物与 H37Rv B2H PPI 数据集的相似性也极低;并且这种相似性水平远低于从 IntAct 中预测的金黄色葡萄球菌 MRSA252 物理同源物与金黄色葡萄球菌 MRSA252 下拉 PPI 之间的相似性。与其他物种中几个代表性的双杂交 PPI 数据集的比较分析进一步证实了 H37Rv B2H PPI 数据集的质量较低。接下来,为了测试 H37Rv STRING PPIs 不是纯粹的直接物理相互作用的可能性,我们将催化酶反应中相邻步骤的结核分枝杆菌 H37Rv 蛋白对与 B2H PPIs 和 STRING 中的预测 PPIs 进行比较,结果表明与 B2H PPIs 相比,它与 STRING PPIs 的相似性要低得多。这一结果强烈表明,H37Rv STRING PPIs 更可能对应于蛋白对之间的间接关系,而不是 B2H PPIs。为了更精确的支持,我们转向酿酒酵母进行全面研究的相互作用组。我们将酿酒酵母 STRING 中的预测 PPIs 与分别包含 Y2H 测定报告的 PPIs、报告为同一蛋白质复合物的蛋白质对以及催化酶反应中连续反应步骤的蛋白质对的三种独立的蛋白质关系数据集进行比较。我们的分析表明,酿酒酵母预测的 STRING PPIs 与后两种类型的蛋白质对的相似性要高于双杂交 PPIs。由于 H37Rv STRING PPIs 是使用与酿酒酵母预测的 STRING PPIs 相似的方法进行预测的,这表明这些 H37Rv STRING PPIs 更可能对应于后两种类型的蛋白质对,而不是双杂交 PPIs。

结论

H37Rv B2H PPI 数据集质量较低。它不应该被用作评估其他(可能是预测的)H37Rv PPI 数据集质量的金标准。H37Rv STRING PPI 数据集的质量也较低;然而,得分≥770 的 STRING PPI 子集具有令人满意的质量。然而,这些 STRING“PPIs”应被解释为功能关联,其中包括相当一部分间接蛋白质相互作用,而不是直接物理相互作用。这两个因素导致这两个主要的 H37Rv PPI 数据集之间的相似性非常低。从这种比较分析中得出的结果和结论为在随后的各种目的研究中使用这些结核分枝杆菌 H37Rv PPI 数据集提供了有价值的指导。

相似文献

1
Comparative analysis and assessment of M. tuberculosis H37Rv protein-protein interaction datasets.结核分枝杆菌 H37Rv 蛋白-蛋白相互作用数据集的比较分析与评估。
BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S20. doi: 10.1186/1471-2164-12-S3-S20.
2
Stringent DDI-based prediction of H. sapiens-M. tuberculosis H37Rv protein-protein interactions.基于严格药物-药物相互作用的人-结核分枝杆菌H37Rv蛋白质-蛋白质相互作用预测。
BMC Syst Biol. 2013;7 Suppl 6(Suppl 6):S6. doi: 10.1186/1752-0509-7-S6-S6. Epub 2013 Dec 13.
3
Stringent homology-based prediction of H. sapiens-M. tuberculosis H37Rv protein-protein interactions.基于严格同源性的人类-结核分枝杆菌 H37Rv 蛋白质-蛋白质相互作用预测。
Biol Direct. 2014 Apr 8;9:5. doi: 10.1186/1745-6150-9-5.
4
Mycobacterium tuberculosis and Clostridium difficille interactomes: demonstration of rapid development of computational system for bacterial interactome prediction.结核分枝杆菌和艰难梭菌相互作用组:细菌相互作用组预测计算系统快速发展的证明。
Microb Inform Exp. 2012 Mar 21;2:4. doi: 10.1186/2042-5783-2-4.
5
Global protein-protein interaction network in the human pathogen Mycobacterium tuberculosis H37Rv.人类病原体结核分枝杆菌 H37Rv 的全球蛋白质-蛋白质相互作用网络。
J Proteome Res. 2010 Dec 3;9(12):6665-77. doi: 10.1021/pr100808n. Epub 2010 Nov 10.
6
Assembling a protein-protein interaction map of the SSU processome from existing datasets.从现有数据集组装 SSU 加工体的蛋白质-蛋白质相互作用图谱。
PLoS One. 2011 Mar 10;6(3):e17701. doi: 10.1371/journal.pone.0017701.
7
False positive reduction in protein-protein interaction predictions using gene ontology annotations.利用基因本体注释减少蛋白质-蛋白质相互作用预测中的假阳性
BMC Bioinformatics. 2007 Jul 23;8:262. doi: 10.1186/1471-2105-8-262.
8
MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure.MCL-CAw:一种改进的 MCL 方法,用于通过整合核心附着结构,从加权 PPI 网络中检测酵母复合物。
BMC Bioinformatics. 2010 Oct 12;11:504. doi: 10.1186/1471-2105-11-504.
9
MEGADOCK-Web: an integrated database of high-throughput structure-based protein-protein interaction predictions.MEGADOCK-Web:高通量结构基蛋白质-蛋白质相互作用预测的综合数据库。
BMC Bioinformatics. 2018 May 8;19(Suppl 4):62. doi: 10.1186/s12859-018-2073-x.
10
Predicting protein-protein interactions using high-quality non-interacting pairs.利用高质量的非互作蛋白对预测蛋白质相互作用。
BMC Bioinformatics. 2018 Dec 31;19(Suppl 19):525. doi: 10.1186/s12859-018-2525-3.

引用本文的文献

1
A new framework for host-pathogen interaction research.宿主-病原体相互作用研究的新框架。
Front Immunol. 2022 Dec 15;13:1066733. doi: 10.3389/fimmu.2022.1066733. eCollection 2022.
2
Computational Network Inference for Bacterial Interactomics.计算网络推断在细菌相互作用组学中的应用。
mSystems. 2022 Apr 26;7(2):e0145621. doi: 10.1128/msystems.01456-21. Epub 2022 Mar 30.
3
MNBDR: A Module Network Based Method for Drug Repositioning.MNBDR:一种基于模块网络的药物重新定位方法。

本文引用的文献

1
Mapping the protein interaction network in methicillin-resistant Staphylococcus aureus.绘制耐甲氧西林金黄色葡萄球菌的蛋白质相互作用网络。
J Proteome Res. 2011 Mar 4;10(3):1139-50. doi: 10.1021/pr100918u. Epub 2011 Jan 28.
2
Integrating diverse biological and computational sources for reliable protein-protein interactions.整合多种生物和计算资源以获得可靠的蛋白质-蛋白质相互作用。
BMC Bioinformatics. 2010 Oct 15;11 Suppl 7(Suppl 7):S8. doi: 10.1186/1471-2105-11-S7-S8.
3
Global protein-protein interaction network in the human pathogen Mycobacterium tuberculosis H37Rv.
Genes (Basel). 2020 Dec 27;12(1):25. doi: 10.3390/genes12010025.
4
In silico unravelling pathogen-host signaling cross-talks via pathogen mimicry and human protein-protein interaction networks.通过病原体模拟和人类蛋白质-蛋白质相互作用网络在计算机上解析病原体-宿主信号转导的相互作用。
Comput Struct Biotechnol J. 2019 Dec 27;18:100-113. doi: 10.1016/j.csbj.2019.12.008. eCollection 2020.
5
Transferring knowledge of bacterial protein interaction networks to predict pathogen targeted human genes and immune signaling pathways: a case study on M. tuberculosis.将细菌蛋白质相互作用网络的知识转移到预测病原体靶向人类基因和免疫信号通路:以结核分枝杆菌为例的研究。
BMC Genomics. 2018 Jun 28;19(1):505. doi: 10.1186/s12864-018-4873-9.
6
Role of NSC319726 in ovarian cancer based on the bioinformatics analyses.基于生物信息学分析的NSC319726在卵巢癌中的作用
Onco Targets Ther. 2015 Dec 15;8:3757-65. doi: 10.2147/OTT.S86343. eCollection 2015.
7
Comparative Genome and Network Centrality Analysis to Identify Drug Targets of Mycobacterium tuberculosis H37Rv.比较基因组和网络中心性分析以鉴定结核分枝杆菌H37Rv的药物靶点。
Biomed Res Int. 2015;2015:212061. doi: 10.1155/2015/212061. Epub 2015 Nov 5.
8
Maximum flow approach to prioritize potential drug targets of Mycobacterium tuberculosis H37Rv from protein-protein interaction network.基于蛋白质-蛋白质相互作用网络的最大流方法对结核分枝杆菌H37Rv潜在药物靶点进行优先级排序
Clin Transl Med. 2015 Dec;4(1):61. doi: 10.1186/s40169-015-0061-6. Epub 2015 Jun 5.
9
Gene network biological validity based on gene-gene interaction relevance.基于基因-基因相互作用相关性的基因网络生物学有效性。
ScientificWorldJournal. 2014;2014:540679. doi: 10.1155/2014/540679. Epub 2014 Sep 8.
10
Design pattern mining using distributed learning automata and DNA sequence alignment.使用分布式学习自动机和DNA序列比对进行设计模式挖掘。
PLoS One. 2014 Sep 22;9(9):e106313. doi: 10.1371/journal.pone.0106313. eCollection 2014.
人类病原体结核分枝杆菌 H37Rv 的全球蛋白质-蛋白质相互作用网络。
J Proteome Res. 2010 Dec 3;9(12):6665-77. doi: 10.1021/pr100808n. Epub 2010 Nov 10.
4
The BioPAX community standard for pathway data sharing.生物通路交换(BioPAX)社区标准:用于通路数据共享。
Nat Biotechnol. 2010 Sep;28(9):935-42. doi: 10.1038/nbt.1666. Epub 2010 Sep 9.
5
Contrasting transcriptional responses of a virulent and an attenuated strain of Mycobacterium tuberculosis infecting macrophages.对比感染巨噬细胞的毒力株和减毒株结核分枝杆菌的转录反应。
PLoS One. 2010 Jun 10;5(6):e11066. doi: 10.1371/journal.pone.0011066.
6
The IntAct molecular interaction database in 2010.2010 年的 IntAct 分子相互作用数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D525-31. doi: 10.1093/nar/gkp878. Epub 2009 Oct 22.
7
Mining biological pathways using WikiPathways web services.使用WikiPathways网络服务挖掘生物途径。
PLoS One. 2009 Jul 30;4(7):e6447. doi: 10.1371/journal.pone.0006447.
8
A complex-based reconstruction of the Saccharomyces cerevisiae interactome.基于复合物的酿酒酵母相互作用组重建。
Mol Cell Proteomics. 2009 Jun;8(6):1361-81. doi: 10.1074/mcp.M800490-MCP200. Epub 2009 Jan 27.
9
Mycobacterium tuberculosis interactome analysis unravels potential pathways to drug resistance.结核分枝杆菌相互作用组分析揭示了耐药性的潜在途径。
BMC Microbiol. 2008 Dec 23;8:234. doi: 10.1186/1471-2180-8-234.
10
targetTB: a target identification pipeline for Mycobacterium tuberculosis through an interactome, reactome and genome-scale structural analysis.靶向结核杆菌:一种通过相互作用组、反应组和基因组规模结构分析来鉴定结核分枝杆菌靶点的流程
BMC Syst Biol. 2008 Dec 19;2:109. doi: 10.1186/1752-0509-2-109.