State Key Laboratory of Cotton Biology, Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, China.
Nat Genet. 2012 Oct;44(10):1098-103. doi: 10.1038/ng.2371. Epub 2012 Aug 26.
We have sequenced and assembled a draft genome of G. raimondii, whose progenitor is the putative contributor of the D subgenome to the economically important fiber-producing cotton species Gossypium hirsutum and Gossypium barbadense. Over 73% of the assembled sequences were anchored on 13 G. raimondii chromosomes. The genome contains 40,976 protein-coding genes, with 92.2% of these further confirmed by transcriptome data. Evidence of the hexaploidization event shared by the eudicots as well as of a cotton-specific whole-genome duplication approximately 13-20 million years ago was observed. We identified 2,355 syntenic blocks in the G. raimondii genome, and we found that approximately 40% of the paralogous genes were present in more than 1 block, which suggests that this genome has undergone substantial chromosome rearrangement during its evolution. Cotton, and probably Theobroma cacao, are the only sequenced plant species that possess an authentic CDN1 gene family for gossypol biosynthesis, as revealed by phylogenetic analysis.
我们已经对 G. raimondii 的基因组进行了测序和组装,其祖先是经济上重要的纤维生产棉花物种 Gossypium hirsutum 和 Gossypium barbadense 的 D 亚基因组的可能供体。组装序列的超过 73%被锚定在 13 条 G. raimondii 染色体上。该基因组包含 40,976 个蛋白质编码基因,其中 92.2%的基因进一步得到转录组数据的证实。观察到了真双子叶植物和棉花特有的全基因组加倍事件的证据,该事件发生在大约 1300 万至 2000 万年前。我们在 G. raimondii 基因组中鉴定出 2,355 个共线性块,并且发现大约 40%的基因对存在于 1 个以上的块中,这表明该基因组在其进化过程中经历了大量的染色体重排。系统发育分析表明,棉花和可能的可可豆是唯一具有真正的 CDN1 基因家族用于棉酚生物合成的已测序植物物种。