Funk Andrew, Galewski Paul, McGrath J Mitchell
Department of Plant, Soil, and Microbial Science, Plant Breeding, Genetics, and Biotechnology Program, Michigan State University, East Lansing, MI, 48824, USA.
USDA-ARS, Sugarbeet and Bean Research Unit, 1066 Bogue Street, 494 PSSB, East Lansing, MI, 48824, USA.
Plant J. 2018 May 23. doi: 10.1111/tpj.13977.
Nucleotide-binding (NB-ARC), leucine-rich-repeat genes (NLRs) account for 60.8% of resistance (R) genes molecularly characterized from plants. NLRs exist as large gene families prone to tandem duplication and transposition, with high sequence diversity among crops and their wild relatives. This diversity can be a source of new disease resistance, but difficulty in distinguishing specific sequences from homologous gene family members hinders characterization of resistance for improving crop varieties. Current genome sequencing and assembly technologies, especially those using long-read sequencing, are improving resolution of repeat-rich genomic regions and clarifying locations of duplicated genes, such as NLRs. Using the conserved NB-ARC domain as a model, 231 tentative NB-ARC loci were identified in a highly contiguous genome assembly of sugar beet, revealing diverged and truncated NB-ARC signatures as well as full-length sequences. The NB-ARC-associated proteins contained NLR resistance gene domains, including TIR, CC and LRR, as well as other integrated domains. Phylogenetic relationships of partial and complete domains were determined, and patterns of physical clustering in the genome were evaluated. Comparison of sugar beet NB-ARC domains to validated R-genes from monocots and eudicots suggested extensive Beta vulgaris-specific subfamily expansions. The NLR landscape in the rhizomania resistance conferring Rz region of Chromosome 3 was characterized, identifying 26 NLR-like sequences spanning 20 MB. This work presents the first detailed view of NLR family composition in a member of the Caryophyllales, builds a foundation for additional disease resistance work in B. vulgaris, and demonstrates an additional nucleic-acid-based method for NLR prediction in non-model plant species.
核苷酸结合(NB-ARC)富含亮氨酸重复序列的基因(NLRs)占从植物中分子鉴定出的抗性(R)基因的60.8%。NLRs以易于串联重复和转座的大基因家族形式存在,在作物及其野生近缘种中具有高度的序列多样性。这种多样性可能是新的抗病性来源,但难以从同源基因家族成员中区分特定序列阻碍了为改良作物品种而进行的抗性鉴定。当前的基因组测序和组装技术,尤其是那些使用长读长测序的技术,正在提高富含重复序列的基因组区域的分辨率,并阐明重复基因(如NLRs)的位置。以保守的NB-ARC结构域为模型,在甜菜高度连续的基因组组装中鉴定出231个暂定的NB-ARC基因座,揭示了不同的和截短的NB-ARC特征以及全长序列。与NB-ARC相关的蛋白包含NLR抗性基因结构域,包括TIR、CC和LRR,以及其他整合结构域。确定了部分和完整结构域的系统发育关系,并评估了基因组中的物理聚类模式。将甜菜NB-ARC结构域与来自单子叶植物和双子叶植物的已验证R基因进行比较,表明甜菜特异性亚家族有广泛的扩增。对3号染色体根瘤病抗性赋予Rz区域的NLR图谱进行了表征,鉴定出26个跨越20MB的NLR样序列。这项工作首次详细展示了石竹目成员中NLR家族的组成,为甜菜的其他抗病性研究奠定了基础,并展示了一种用于非模式植物物种中NLR预测的基于核酸的额外方法。