Jin Shangkun, Han Zegang, Hu Yan, Si Zhanfeng, Dai Fan, He Lu, Cheng Yu, Li Yiqian, Zhao Ting, Fang Lei, Zhang Tianzhen
Zhejiang Provincial Engineering Center for Crop Precision Breeding, Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China; Hainan Institute of Zhejiang University, Sanya 572025, China.
Zhejiang Provincial Engineering Center for Crop Precision Breeding, Advanced Seed Institute, Plant Precision Breeding Academy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, China.
Mol Plant. 2023 Apr 3;16(4):678-693. doi: 10.1016/j.molp.2023.02.004. Epub 2023 Feb 9.
Structural variations (SVs) have long been described as being involved in the origin, adaption, and domestication of species. However, the underlying genetic and genomic mechanisms are poorly understood. Here, we report a high-quality genome assembly of Gossypium barbadense acc. Tanguis, a landrace that is closely related to formation of extra-long-staple (ELS) cultivated cotton. An SV-based pan-genome (Pan-SV) was then constructed using a total of 182 593 non-redundant SVs, including 2236 inversions, 97 398 insertions, and 82 959 deletions from 11 assembled genomes of allopolyploid cotton. The utility of this Pan-SV was then demonstrated through population structure analysis and genome-wide association studies (GWASs). Using segregation mapping populations produced through crossing ELS cotton and the landrace along with an SV-based GWAS, certain SVs responsible for speciation, domestication, and improvement in tetraploid cottons were identified. Importantly, some of the SVs presently identified as associated with the yield and fiber quality improvement had not been identified in previous SNP-based GWAS. In particular, a 9-bp insertion or deletion was found to associate with elimination of the interspecific reproductive isolation between Gossypium hirsutum and G. barbadense. Collectively, this study provides new insights into genome-wide, gene-scale SVs linked to important agronomic traits in a major crop species and highlights the importance of SVs during the speciation, domestication, and improvement of cultivated crop species.
长期以来,结构变异(SVs)一直被认为与物种的起源、适应和驯化有关。然而,其潜在的遗传和基因组机制却知之甚少。在此,我们报告了陆地棉种质资源坦古斯(Gossypium barbadense acc. Tanguis)的高质量基因组组装结果,该种质与超长绒(ELS)栽培棉的形成密切相关。然后,利用总共182593个非冗余SVs构建了一个基于SV的泛基因组(Pan-SV),其中包括来自异源多倍体棉花11个组装基因组的2236个倒位、97398个插入和82959个缺失。通过群体结构分析和全基因组关联研究(GWAS)证明了该Pan-SV的实用性。利用通过ELS棉花与该陆地棉杂交产生的分离作图群体以及基于SV的GWAS,鉴定出了一些与四倍体棉花的物种形成、驯化和改良相关的特定SVs。重要的是,目前确定与产量和纤维品质改良相关的一些SVs在以前基于单核苷酸多态性(SNP)的GWAS中并未被鉴定出来。特别是,发现一个9碱基对的插入或缺失与陆地棉(Gossypium hirsutum)和海岛棉(G. barbadense)之间种间生殖隔离的消除有关。总的来说,本研究为与一种主要作物重要农艺性状相关的全基因组、基因尺度的SVs提供了新的见解,并突出了SVs在栽培作物物种的物种形成、驯化和改良过程中的重要性。