Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom.
PLoS Negl Trop Dis. 2008 Jul 2;2(7):e258. doi: 10.1371/journal.pntd.0000258.
The phylum Nematoda is biologically diverse, including parasites of plants and animals as well as free-living taxa. This diversity is expected to be underpinned by a commensurate diversity in expressed genes, including gene sets specifically associated with the evolution of parasitism.
Here we analyzed the extensive expressed sequence tag data available for 37 nematode species, most of which are parasites, and defined over 120,000 distinct putative genes, from which we derived robust protein translations. Combined with the complete proteomes of Caenorhabditis elegans and Caenorhabditis briggsae, these proteins were grouped into 65,000 protein families that in turn contain 40,000 distinct protein domains. We mapped the occurrence of domains and families across the Nematoda and compared the nematode data with those available for other phyla. Gene loss is common; in particular, we identify nearly 5,000 genes that may have been lost from the lineage leading to the model nematode C. elegans. We find a preponderance of novelty, including 56,000 nematode-restricted protein families and 26,000 nematode-restricted domains. Mapping the latest time of origin of these new families and domains across the nematode phylogeny revealed ongoing evolution of novelty. A number of genes from parasitic species had signatures of horizontal transfer from their host organisms, and parasitic species had a greater proportion of novel, secreted proteins than did free-living ones.
These classes of genes may underpin parasitic phenotypes, and thus may be targets for development of effective control measures.