CEA, Saint-Paul-lez-Durance, France.
BMC Genomics. 2013 Apr 20;14:269. doi: 10.1186/1471-2164-14-269.
Regulatory proteins (RPs) such as transcription factors (TFs) and two-component system (TCS) proteins control how prokaryotic cells respond to changes in their external and/or internal state. Identification and annotation of TFs and TCSs is non-trivial, and between-genome comparisons are often confounded by different standards in annotation. There is a need for user-friendly, fast and convenient tools to allow researchers to overcome the inherent variability in annotation between genome sequences.
We have developed the web server P2RP (Predicted Prokaryotic Regulatory Proteins), which enables users to identify and annotate TFs and TCS proteins within their sequences of interest. Users can input amino acid or genomic DNA sequences, and the proteins predicted from them are scanned for DNA-binding domains and/or TCS domains. Using an integrated software pipeline, RPs identified in this manner are categorised into families and unambiguously annotated, and a detailed description of their features is generated. P2RP results can then be output in user-specified formats.
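The scan-then-categorise step described above can be illustrated with a minimal sketch. It assumes domain hits per protein have already been produced by some domain search. The domain sets, the class labels, and the function names `classify` and `annotate` are hypothetical choices for illustration, not P2RP's actual domain library or classification rules.

```python
# Illustrative domain sets only; NOT P2RP's actual domain library.
DNA_BINDING = {"HTH_1", "GerE", "Trans_reg_C"}
TRANSMITTER = {"HisKA", "HATPase_c"}
RECEIVER = {"Response_reg"}


def classify(domains):
    """Assign a coarse regulatory-protein class from a protein's domain names.

    Hypothetical rule order: TCS domains take precedence over a bare
    DNA-binding domain. Returns None if no regulatory domain is present.
    """
    found = set(domains)
    has_transmitter = bool(found & TRANSMITTER)
    has_receiver = bool(found & RECEIVER)
    has_dna_binding = bool(found & DNA_BINDING)

    if has_transmitter and has_receiver:
        return "hybrid histidine kinase"
    if has_transmitter:
        return "histidine kinase"
    if has_receiver:
        return "response regulator"
    if has_dna_binding:
        return "transcription factor"
    return None


def annotate(hits):
    """Map protein IDs to classes, dropping proteins with no regulatory domain."""
    annotated = {}
    for protein_id, domains in hits.items():
        label = classify(domains)
        if label is not None:
            annotated[protein_id] = label
    return annotated
```

Under these assumed rules, a protein carrying both a receiver domain and a DNA-binding domain is labelled a response regulator rather than a transcription factor; a real pipeline would likely subdivide each class into families.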
Biologists have an increasing need for fast and intuitively usable tools, which is why P2RP has been developed as an interactive system. As well as assisting experimental biologists in interrogating novel sequence data, it is hoped that P2RP will be built into genome annotation pipelines and re-annotation processes, to increase the consistency of RP annotation in public genomic sequences. P2RP is the first publicly available tool for predicting and analysing RPs in users' sequences. The server is freely available and can be accessed, along with documentation, at http://www.p2rp.org.