Department of Protecting Crops and the Environment, Rothamsted Research, Harpenden, Herts, UK.
Present address: School of Agriculture, Shenzhen Campus of Sun Yat-sen University, Guangming District, Shenzhen, Guangdong, People's Republic of China.
BMC Biol. 2023 Feb 6;21(1):24. doi: 10.1186/s12915-023-01520-6.
Studying genomic variation in rapidly evolving pathogens potentially enables identification of genes supporting their "core biology", being present, functional and expressed by all strains or "flexible biology", varying between strains. Genes supporting flexible biology may be considered to be "accessory", whilst the "core" gene set is likely to be important for common features of a pathogen species biology, including virulence on all host genotypes. The wheat-pathogenic fungus Zymoseptoria tritici represents one of the most rapidly evolving threats to global food security and was the focus of this study.
We constructed a pangenome of 18 European field isolates, with 12 also subjected to RNAseq transcription profiling during infection. Combining this data, we predicted a "core" gene set comprising 9807 sequences which were (1) present in all isolates, (2) lacking inactivating polymorphisms and (3) expressed by all isolates. A large accessory genome, consisting of 45% of the total genes, was also defined. We classified genetic and genomic polymorphism at both chromosomal and individual gene scales. Proteins required for essential functions including virulence had lower-than average sequence variability amongst core genes. Both core and accessory genomes encoded many small, secreted candidate effector proteins that likely interact with plant immunity. Viral vector-mediated transient in planta overexpression of 88 candidates failed to identify any which induced leaf necrosis characteristic of disease. However, functional complementation of a non-pathogenic deletion mutant lacking five core genes demonstrated that full virulence was restored by re-introduction of the single gene exhibiting least sequence polymorphism and highest expression.
These data support the combined use of pangenomics and transcriptomics for defining genes which represent core, and potentially exploitable, weaknesses in rapidly evolving pathogens.
研究快速进化病原体的基因组变异可以帮助确定支持其“核心生物学”的基因,这些基因存在于所有菌株中,具有功能且表达,或者是“灵活生物学”,在菌株之间存在差异。支持灵活生物学的基因可以被认为是“辅助性”的,而“核心”基因集可能对病原体物种生物学的常见特征很重要,包括对所有宿主基因型的毒力。小麦病原菌叶点霉是对全球粮食安全构成的最迅速进化威胁之一,也是本研究的重点。
我们构建了 18 个欧洲田间分离株的泛基因组,其中 12 个也在感染期间进行了 RNAseq 转录谱分析。结合这些数据,我们预测了一个“核心”基因集,包含 9807 个序列,这些序列(1)存在于所有分离株中,(2)缺乏失活多态性,(3)由所有分离株表达。还定义了一个由 45%总基因组成的大型辅助基因组。我们在染色体和单个基因尺度上对遗传和基因组多态性进行了分类。包括毒力在内的必需功能所需的蛋白质在核心基因中具有低于平均的序列变异性。核心和辅助基因组都编码了许多小的、分泌的候选效应蛋白,这些蛋白可能与植物免疫相互作用。病毒载体介导的 88 个候选蛋白的瞬时体内过表达未能鉴定出任何诱导叶片坏死的候选蛋白,这是疾病的特征。然而,通过重新引入表现出最少序列多态性和最高表达的单个基因,对缺乏五个核心基因的非致病性缺失突变体进行功能互补,证明了完全毒力的恢复。
这些数据支持将泛基因组学和转录组学结合使用,以确定代表快速进化病原体核心和潜在可利用弱点的基因。