Dutilh Bas E, He Ying, Hekkelman Maarten L, Huynen Martijn A
Center for Molecular and Biomolecular Informatics/Nijmegen Center for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands.
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W470-4. doi: 10.1093/nar/gkn277. Epub 2008 May 17.
Signature genes are genes that are unique to a taxonomic clade and common within it. They contain a wealth of information about clade-specific processes and carry a strong evolutionary signal that can be used to phylogenetically characterize a set of sequences, such as a metagenomics sample. Because signature genes are defined by gene content, they provide a means of assessing the taxonomic origin of a sequence sample that is complementary to sequence-based analyses. Here, we introduce Signature (http://www.cmbi.ru.nl/signature), a web server that identifies the signature genes in a set of query sequences and thereby characterizes that set phylogenetically. The server produces a list of the taxonomic clades that share signature genes with the query sequences, along with an image of the tree of life in which the clades are color coded by the number of signature genes present. This allows the user to quickly see from which part(s) of the taxonomy the query sequences likely originate.
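The definition above (unique to a clade, common within it) and the per-clade counting can be illustrated with a minimal sketch. This is not the server's implementation: the function names, the `min_coverage` threshold, and the toy genome and clade data are all hypothetical, and the step that assigns query sequences to gene families (for example by homology search) is assumed to have been done already.

```python
from collections import Counter


def find_signature_genes(genome_genes, clade_members, min_coverage=1.0):
    """Return {clade: gene families} that are absent from every genome
    outside the clade and present in at least `min_coverage` (a fraction,
    assumed here) of the genomes inside it."""
    signatures = {}
    for clade, members in clade_members.items():
        inside = [genome_genes[g] for g in members]
        # All gene families seen in any genome outside the clade.
        outside = set().union(
            *(genes for g, genes in genome_genes.items() if g not in members)
        )
        # How many genomes inside the clade carry each gene family.
        counts = Counter(gene for genes in inside for gene in genes)
        signatures[clade] = {
            gene
            for gene, n in counts.items()
            if gene not in outside and n / len(inside) >= min_coverage
        }
    return signatures


def score_clades(query_genes, signatures):
    """Count, per clade, the signature genes found in the query set."""
    scores = {}
    for clade, sig in signatures.items():
        shared = sig & query_genes
        if shared:
            scores[clade] = len(shared)
    return scores


# Toy data (hypothetical): gene-family content of three genomes.
genome_genes = {
    "genomeA": {"g1", "g2", "g5"},
    "genomeB": {"g1", "g3", "g5"},
    "genomeC": {"g4", "g5"},
}
clade_members = {
    "cladeAB": {"genomeA", "genomeB"},
    "cladeC": {"genomeC"},
}

signatures = find_signature_genes(genome_genes, clade_members)
scores = score_clades({"g1", "g2"}, signatures)
print(signatures)
print(scores)
```

In this toy setup `g5` is excluded because it occurs outside every clade, and `g2` is excluded because it occurs in only one of the two `cladeAB` genomes. The resulting per-clade counts are the kind of value that could drive a color coding of the tree.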