Favre Patrick, Bapaume Laure, Bossolini Eligio, Delorenzi Mauro, Falquet Laurent, Reinhardt Didier
BMC Plant Biol. 2014 Dec 3;14:333. doi: 10.1186/s12870-014-0333-0.
Genes involved in arbuscular mycorrhizal (AM) symbiosis have been identified primarily by mutant screens, followed by identification of the mutated genes (forward genetics). In addition, a number of AM-related genes has been identified by their AM-related expression patterns, and their function has subsequently been elucidated by knock-down or knock-out approaches (reverse genetics). However, genes that are members of functionally redundant gene families, or genes that have a vital function and therefore result in lethal mutant phenotypes, are difficult to identify. If such genes are constitutively expressed and therefore escape differential expression analyses, they remain elusive. The goal of this study was to systematically search for AM-related genes with a bioinformatics strategy that is insensitive to these problems. The central element of our approach is based on the fact that many AM-related genes are conserved only among AM-competent species.
Our approach involves genome-wide comparisons at the proteome level of AM-competent host species with non-mycorrhizal species. Using a clustering method we first established orthologous/paralogous relationships and subsequently identified protein clusters that contain members only of the AM-competent species. Proteins of these clusters were then analyzed in an extended set of 16 plant species and ranked based on their relatedness among AM-competent monocot and dicot species, relative to non-mycorrhizal species. In addition, we combined the information on the protein-coding sequence with gene expression data and with promoter analysis. As a result we present a list of yet uncharacterized proteins that show a strongly AM-related pattern of sequence conservation, indicating that the respective genes may have been under selection for a function in AM. Among the top candidates are three genes that encode a small family of similar receptor-like kinases that are related to the S-locus receptor kinases involved in sporophytic self-incompatibility.
We present a new systematic strategy of gene discovery based on conservation of the protein-coding sequence that complements classical forward and reverse genetics. This strategy can be applied to diverse other biological phenomena if species with established genome sequences fall into distinguished groups that differ in a defined functional trait of interest.
参与丛枝菌根(AM)共生的基因主要通过突变体筛选来鉴定,随后鉴定突变基因(正向遗传学)。此外,一些与AM相关的基因已通过其与AM相关的表达模式得以鉴定,其功能随后通过基因敲低或敲除方法得以阐明(反向遗传学)。然而,功能冗余基因家族的成员基因,或具有重要功能因而导致致死突变表型的基因,难以鉴定。如果此类基因组成型表达,从而逃避差异表达分析,它们就仍然难以捉摸。本研究的目标是采用一种对这些问题不敏感的生物信息学策略,系统地寻找与AM相关的基因。我们方法的核心要素基于这样一个事实,即许多与AM相关的基因仅在具备AM能力的物种中保守。
我们的方法涉及在蛋白质组水平上对具备AM能力的宿主物种与非菌根物种进行全基因组比较。我们首先使用聚类方法建立直系同源/旁系同源关系,随后鉴定仅包含具备AM能力物种成员的蛋白质簇。然后在一组扩展的16种植物物种中分析这些簇的蛋白质,并根据它们在具备AM能力的单子叶和双子叶物种中相对于非菌根物种的相关性进行排名。此外,我们将蛋白质编码序列信息与基因表达数据以及启动子分析相结合。结果,我们列出了一系列尚未表征的蛋白质,它们显示出与AM密切相关的序列保守模式,表明相应的基因可能在AM功能方面受到选择。顶级候选基因中有三个基因编码一小类相似的类受体激酶,它们与参与孢子体自交不亲和的S位点受体激酶相关。
我们提出了一种基于蛋白质编码序列保守性的新的系统基因发现策略,它补充了经典的正向和反向遗传学。如果具有已确定基因组序列的物种分为在感兴趣的特定功能性状上不同的不同组,那么这种策略可以应用于多种其他生物学现象。