Diekmann Yoan, Pereira-Leal José B
Research Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London, WC1E 6BT, UK.
Methods Mol Biol. 2015;1298:17-28. doi: 10.1007/978-1-4939-2569-8_2.
The bioinformatic annotation of Rab GTPases is important, for example, to understand the evolution of the endomembrane system. However, Rabs are particularly challenging for standard annotation pipelines because they are similar to other small GTPases and form a large family with many paralogous subfamilies. Here, we describe a bioinformatic annotation pipeline specifically tailored to Rab GTPases. It proceeds in two steps: first, Rabs are distinguished from other proteins based on GTPase-specific motifs, overall sequence similarity to other Rabs, and the occurrence of Rab-specific motifs. Second, Rabs are classified taking either a more accurate but slower phylogenetic approach or a slightly less accurate but much faster bioinformatic approach. All necessary steps can either be performed locally or using the referenced online tools. An implementation of a slightly more involved version of the pipeline presented here is available at RabDB.org.
例如,为了理解内膜系统的进化,Rab GTPases的生物信息学注释很重要。然而,Rabs对于标准注释流程来说尤其具有挑战性,因为它们与其他小GTPases相似,并且形成了一个包含许多旁系同源亚家族的大家族。在这里,我们描述了一个专门为Rab GTPases量身定制的生物信息学注释流程。它分两步进行:首先,基于GTPase特异性基序、与其他Rabs的整体序列相似性以及Rab特异性基序的出现情况,将Rabs与其他蛋白质区分开来。其次,采用更准确但较慢的系统发育方法或稍欠准确但快得多的生物信息学方法对Rabs进行分类。所有必要步骤既可以在本地执行,也可以使用引用的在线工具。此处介绍的流程的一个稍微复杂一些的版本可在RabDB.org上获取。