Institute of Botany, Chinese Academy of Sciences, Beijing, China.
Genome Biol. 2011 Nov 21;12(11):R114. doi: 10.1186/gb-2011-12-11-r114.
Sorghum (Sorghum bicolor) is globally produced as a source of food, feed, fiber and fuel. Grain and sweet sorghums differ in a number of important traits, including stem sugar and juice accumulation, plant height as well as grain and biomass production. The first whole genome sequence of a grain sorghum is available, but additional genome sequences are required to study genome-wide and intraspecific variation for dissecting the genetic basis of these important traits and for tailor-designed breeding of this important C4 crop.
We resequenced two sweet and one grain sorghum inbred lines, and identified a set of nearly 1,500 genes differentiating sweet and grain sorghum. These genes fall into ten major metabolic pathways involved in sugar and starch metabolisms, lignin and coumarin biosynthesis, nucleic acid metabolism, stress responses and DNA damage repair. In addition, we uncovered 1,057,018 SNPs, 99,948 indels of 1 to 10 bp in length and 16,487 presence/absence variations as well as 17,111 copy number variations. The majority of the large-effect SNPs, indels and presence/absence variations resided in the genes containing leucine rich repeats, PPR repeats and disease resistance R genes possessing diverse biological functions or under diversifying selection, but were absent in genes that are essential for life.
This is a first report of the identification of genome-wide patterns of genetic variation in sorghum. High-density SNP and indel markers reported here will be a valuable resource for future gene-phenotype studies and the molecular breeding of this important crop and related species.
高粱(高粱)在全球范围内被用作食物、饲料、纤维和燃料的来源。谷物和甜高粱在许多重要性状上存在差异,包括茎糖和汁液积累、株高以及谷物和生物量的生产。第一个谷物高粱的全基因组序列已经可用,但需要额外的基因组序列来研究全基因组和种内变异,以剖析这些重要性状的遗传基础,并为这种重要的 C4 作物进行定制化的育种。
我们重新测序了两个甜高粱和一个谷物高粱自交系,鉴定出了一套近 1500 个区分甜高粱和谷物高粱的基因。这些基因属于十个主要的代谢途径,涉及糖和淀粉代谢、木质素和香豆素生物合成、核酸代谢、应激反应和 DNA 损伤修复。此外,我们还发现了 1057018 个 SNPs、1 到 10 个碱基长度的 99948 个插入缺失和 16487 个存在/缺失变异以及 17111 个拷贝数变异。大多数大效应 SNPs、插入缺失和存在/缺失变异位于富含亮氨酸重复序列、PPR 重复序列和具有多种生物学功能或处于多样化选择下的抗病 R 基因的基因中,但不存在于对生命至关重要的基因中。
这是高粱全基因组遗传变异模式鉴定的首次报告。这里报道的高密度 SNP 和插入缺失标记将成为未来基因表型研究和这种重要作物及相关物种分子育种的宝贵资源。