Shen Xin, Liu Zhi-Quan, Mocoeur Anne, Xia Yan, Jing Hai-Chun
Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
Theor Appl Genet. 2015 Apr;128(4):623-37. doi: 10.1007/s00122-015-2458-4. Epub 2015 Jan 30.
5,511 genic small-size PAVs in sorghum were identified and examined, including the pattern and the function enrichment of PAV genes. 325 PAV markers were developed to construct a genetic map. Presence/absence variants (PAVs) correlate closely to the phenotypic variation, by impacting plant genome sizes and the adaption to the environment. To shed more light on their genome-wide patterns, functions and the possibility of using them as molecular markers, we generated next generation genome sequencing data for four sorghum inbred lines and used associated bioinformatic pipelines to identify small-size PAVs (40-10 kb). Five thousand five hundreds and eleven genic PAVs (40-10 kb) were identified and found to affect 3,238 genes. These PAVs were mainly distributed on the sub-telomeric regions, but the highest proportions occurred in the vicinity of the centromeric regions. One of the prominent features of the PAVs is the high occurrence of long terminal repeats retrotransposons and DNA transposons. PAVs caused various alterations to gene structure, primarily including the coding sequence variants, intron variants, transcript ablation, and initiator codon changes. The genes affected by PAVs were significantly enriched in those involved in stress responses and protein modification. We used 325 PAVs polymorphic between two sorghum inbred lines Ji2731 and E-Tian, together with 49 SSR markers, and constructed a genetic map, which consisted of 10 linkage groups corresponding to the 10 chromosomes of sorghum and spanned 1,430.3 cM in length covering 97% of the physical genome. The resources reported here should be useful for genetic study and breeding of sorghum and related species.
在高粱中鉴定并检测了5511个基因性小尺寸PAV(存在/缺失变异),包括PAV基因的模式和功能富集情况。开发了325个PAV标记来构建遗传图谱。存在/缺失变异(PAV)通过影响植物基因组大小和对环境的适应性,与表型变异密切相关。为了更深入了解它们在全基因组范围内的模式、功能以及将其用作分子标记的可能性,我们生成了四个高粱自交系的下一代基因组测序数据,并使用相关的生物信息学管道来鉴定小尺寸PAV(40 - 10 kb)。鉴定出了5511个基因性PAV(40 - 10 kb),发现它们影响3238个基因。这些PAV主要分布在亚端粒区域,但在着丝粒区域附近出现的比例最高。PAV的一个显著特征是长末端重复逆转座子和DNA转座子的高发生率。PAV对基因结构造成了各种改变,主要包括编码序列变异、内含子变异、转录本缺失和起始密码子变化。受PAV影响的基因在参与应激反应和蛋白质修饰的基因中显著富集。我们使用了两个高粱自交系Ji2731和E - Tian之间多态性的325个PAV,以及49个SSR标记,构建了一个遗传图谱,该图谱由对应于高粱10条染色体的10个连锁群组成,长度为1430.3 cM,覆盖了97%的物理基因组。本文报道的资源应有助于高粱及相关物种的遗传研究和育种。