Suppr超能文献

利用 48 个细菌双组分系统中组氨酸激酶和反应调节子对之间共变残基对构建相互作用伙伴预测器。

Building interacting partner predictors using co-varying residue pairs between histidine kinase and response regulator pairs of 48 bacterial two-component systems.

机构信息

School of Informatics and Computing, Indiana University, Bloomington, Indiana 47408, USA.

出版信息

Proteins. 2011 Apr;79(4):1118-31. doi: 10.1002/prot.22948. Epub 2011 Jan 18.

Abstract

The two-component system (TCS) is a signal transduction system that involves a histidine kinase (HK) and a response regulator (RR). Although up to hundreds of TCSs may operate in parallel in a bacterial cell, the high-fidelity of a TCS signaling is well maintained, minimizing irrelevant crosstalk between TCSs. When a HK gene and a RR gene in a given TCS system exist in neighboring positions, it is almost certain that their protein products (i.e., HK and RR) are interacting partners. However, large bacterial genomes often have multiple HK genes and/or cognate RR genes that are not neighboring positions. In many partially assembled genomes, some HK genes and RR genes belong to different contigs. In these cases, it is not clear which HK(s) and RR(s) interact. By combining information-theoretic and graph-theoretic approaches, we developed a computational method identifying co-evolving residue pairs between HKs and cognate RRs and predicting the interacting HK:RR pairs for each TCS. In addition, we built a TCSppWWW webserver (http://compath.org/platcom/tcs) that takes query sequences of pairing candidates and predicts their HK:RR pairing using precomputed models. The current release of TCSppWWW provides predictors for 48 TCSs using over 20,000 protein sequences from about 900 bacterial genomes. Three different types of predictors using Random Forest, RBF Network, and Naïve Bayes are provided. Once a set of HK and RR candidate sequences are submitted, TCSppWWW aligns query sequences to the precomputed multiple sequence alignment of HK:RR pairs, extracts co-evolving column positions, then returns prediction results with prediction margin and additional information.

摘要

双组分系统(TCS)是一种信号转导系统,涉及组氨酸激酶(HK)和应答调节蛋白(RR)。尽管在细菌细胞中可能有多达数百个 TCS 同时运作,但 TCS 信号的高保真度得以很好地维持,最大限度地减少了 TCS 之间的不相关串扰。当给定 TCS 系统中的 HK 基因和 RR 基因位于相邻位置时,它们的蛋白质产物(即 HK 和 RR)几乎肯定是相互作用的伙伴。然而,大型细菌基因组通常具有多个不相邻位置的 HK 基因和/或同源 RR 基因。在许多部分组装的基因组中,一些 HK 基因和 RR 基因属于不同的连续体。在这些情况下,不清楚哪些 HK 和 RR 相互作用。通过结合信息论和图论方法,我们开发了一种计算方法,用于识别 HK 和同源 RR 之间共同进化的残基对,并预测每个 TCS 的相互作用的 HK:RR 对。此外,我们构建了一个 TCSppWWW 网络服务器(http://compath.org/platcom/tcs),该服务器接受配对候选序列的查询,并使用预先计算的模型预测它们的 HK:RR 配对。当前版本的 TCSppWWW 使用来自约 900 个细菌基因组的超过 20,000 个蛋白质序列,为 48 个 TCS 提供了预测器。提供了三种不同类型的预测器,包括随机森林、RBF 网络和朴素贝叶斯。一旦提交了一组 HK 和 RR 候选序列,TCSppWWW 将查询序列与预先计算的 HK:RR 对的多重序列比对对齐,提取共同进化的列位置,然后返回带有预测边缘和其他信息的预测结果。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验