Janelia Research Campus of the Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, VA 20147, USA.
Genome Biol Evol. 2022 May 31;14(6). doi: 10.1093/gbe/evac069.
Homology of highly divergent genes often cannot be determined from sequence similarity alone. For example, we recently identified in the aphid Hormaphis cornu a family of rapidly evolving bicycle genes, which encode novel proteins implicated as plant gall effectors, but sequence similarity search methods yielded few putative bicycle homologs in other species. However, coding sequence-independent features of genes, such as intron-exon boundaries, often evolve more slowly than coding sequences and can provide complementary evidence for homology. We found that a linear logistic regression classifier using only structural features of bicycle genes identified many putative bicycle homologs in other species. Independent evidence from sequence features and intron locations supported these homology assignments. To test the potential roles of bicycle genes in other aphids, we sequenced the genome of a second gall-forming aphid, Tetraneura nigriabdominalis, and found that many bicycle genes are strongly expressed in the salivary glands of the gall-forming foundress. In addition, bicycle genes are strongly overexpressed in the salivary glands of a non-gall-forming aphid, Acyrthosiphon pisum, and in the non-gall-forming generations of H. cornu. These observations suggest that Bicycle proteins may be used by multiple aphid species to manipulate plants in diverse ways. Incorporating gene structural features into sequence search algorithms may aid identification of deeply divergent homologs, especially of rapidly evolving genes involved in host-parasite interactions.
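As a rough illustration of classifying genes from structural features alone, the sketch below fits a small logistic regression model in plain Python. It is not the authors' implementation: the three features (exon count, mean exon length in bp, gene length in bp), the helper names, and all numeric values are hypothetical and were invented for this example. The abstract does not say which structural features or training procedure were used.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_scaler(rows):
    """Per-feature mean and standard deviation, used to z-score inputs."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    stds = []
    for j, col in enumerate(zip(*rows)):
        var = sum((v - means[j]) ** 2 for v in col) / n
        stds.append(math.sqrt(var) or 1.0)  # guard against constant features
    return means, stds

def scale(row, means, stds):
    """Z-score one feature vector with previously fitted statistics."""
    return [(v - m) / s for v, m, s in zip(row, means, stds)]

def fit_logistic(X, y, lr=0.1, epochs=2000, l2=0.01):
    """Batch gradient descent on L2-penalized log loss (linear model)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * (gw / n + l2 * wj) for wj, gw in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict_proba(row, w, b, means, stds):
    """Probability that a gene belongs to the positive (family-like) class."""
    z = sum(wj * xj for wj, xj in zip(w, scale(row, means, stds))) + b
    return sigmoid(z)

# Toy training set (invented values, not real annotations).
# Each tuple: (exon count, mean exon length in bp, gene length in bp).
genes = [
    (18, 55, 12000), (22, 48, 15000), (15, 62, 9800), (20, 51, 14000),  # family-like
    (4, 310, 3000), (6, 250, 5200), (3, 420, 2500), (7, 205, 6100),     # background
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

means, stds = fit_scaler(genes)
X = [scale(g, means, stds) for g in genes]
w, b = fit_logistic(X, labels)

# Score a hypothetical candidate gene from another species.
candidate = (17, 58, 11000)
p_candidate = predict_proba(candidate, w, b, means, stds)
print(f"P(family-like | structure) = {p_candidate:.3f}")
```

Because the model sees no sequence, a high score is only a structural hint. As the abstract notes, independent evidence such as sequence features and intron locations is still needed to support a homology assignment.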