State Key Laboratory of Cotton Biology/Institute of Cotton Research of Chinese Academy of Agricultural Sciences/School of Agricultural Sciences, Zhengzhou University, Henan, Zhengzhou 450001, china/Key Laboratory for Cotton Genetic Improvement, MOA, Anyang, Henan 455000, China.
Gigascience. 2022 Apr 1;11. doi: 10.1093/gigascience/giac019.
Gossypium hirsutum L. is the most widely cultivated cotton species, and a high-quality reference genome would be a huge boost for researching the molecular mechanism of agronomic traits in cotton.
Here, Pacific Biosciences and Hi-C sequencing technologies were used to assemble a new upland cotton genome of the No. 1 Chinese cotton variety CRI-12. We generated a high-quality assembled CRI-12 genome of 2.31 Gb with a contig N50 of 19.65 Mb, which was superior to previously reported genomes. Comparisons between CRI-12 and other reported genomes revealed 7,966 structural variations and 7,378 presence/absence variations. The distribution of the haplotypes among A-genome (Gossypium arboreum), D-genome (Gossypium raimondii), and AD-genome (G. hirsutum and Gossypium barbadense) suggested that many haplotypes were lost and recombined in the process of polyploidization. More than half of the haplotypes that correlated with different tolerances were located on chromosome D13, suggesting that this chromosome may be important for wide adaptation. Finally, it was demonstrated that DNA methylation may provide advantages in environmental adaptation through whole-genome bisulfite sequencing analysis.
This research provides a new reference genome for molecular biology research on Gossypium hirsutum L. and helps decode the broad environmental adaptation mechanisms in the No. 1 Chinese cotton variety CRI-12.
陆地棉(Gossypium hirsutum L.)是种植最广泛的棉花物种,如果能拥有一个高质量的参考基因组,将极大地促进棉花农艺性状分子机制的研究。
本研究利用 Pacific Biosciences 和 Hi-C 测序技术组装了中国第一个棉花品种中棉所 12 号的新型陆地棉基因组。我们生成了一个高质量的 CRI-12 基因组,大小为 23.1 亿碱基对,其 contig N50 为 19.65Mb,优于之前报道的基因组。CRI-12 与其他已报道基因组的比较揭示了 7966 个结构变异和 7378 个存在/缺失变异。A 基因组(亚洲棉)、D 基因组(雷蒙德氏棉)和 AD 基因组(陆地棉和草棉)的单倍型分布表明,在多倍化过程中许多单倍型丢失并重组。与不同耐受相关的单倍型中有一半以上位于染色体 D13 上,这表明该染色体可能对广泛适应很重要。最后,通过全基因组亚硫酸氢盐测序分析表明,DNA 甲基化可能通过全基因组甲基化分析为环境适应提供优势。
本研究为陆地棉分子生物学研究提供了一个新的参考基因组,有助于解码中国第一个棉花品种中棉所 12 号的广泛环境适应机制。