Zhao Ting, Guan Xueying, Hu Yan, Zhang Ziqian, Yang Han, Shi Xiaowen, Han Jin, Mei Huan, Wang Luyao, Shao Lei, Wu Hongyu, Chen Qianqian, Zhao Yongyan, Pan Jiaying, Hao Yupeng, Dong Zeyu, Long Xuan, Deng Qian, Zhao Shengjun, Zhang Mengke, Zhu Yumeng, Ma Xiaowei, Chen Zequan, Deng Yayuan, Si Zhanfeng, Li Xin, Zhang Tianzhen, Gu Fei, Gu Xiaofeng, Fang Lei
Zhejiang Provincial Key Laboratory of Crop Genetic Resources, the Advance Seed Institute, Key Laboratory of Plant FactoryGeneration-adding Breeding, Ministry of Agriculture and Rural Affairs, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, Zhejiang, China.
Damo Academy, Alibaba Group, Hangzhou, Zhejiang, China.
Cell Res. 2024 Dec;34(12):859-872. doi: 10.1038/s41422-024-01027-x. Epub 2024 Oct 17.
DNA methylation plays multiple regulatory roles in crop development. However, the relationships of methylation polymorphisms with genetic polymorphisms, gene expression, and phenotypic variation in natural crop populations remain largely unknown. Here, we surveyed high-quality methylomes, transcriptomes, and genomes obtained from the 20-days-post-anthesis (DPA) cotton fibers of 207 accessions and extended the classical framework of population genetics to epigenetics. Over 287 million single methylation polymorphisms (SMPs) were identified, 100 times more than the number of single nucleotide polymorphisms (SNPs). These SMPs were significantly enriched in intragenic regions while depleted in transposable elements. Association analysis further identified a total of 5,426,782 cis-methylation quantitative trait loci (cis-meQTLs), 5078 cis-expression quantitative trait methylation (cis-eQTMs), and 9157 expression quantitative trait loci (eQTLs). Notably, 36.39% of cis-eQTM genes were not associated with genetic variation, indicating that a large number of SMPs associated with gene expression variation are independent of SNPs. In addition, out of the 1715 epigenetic loci associated with yield and fiber quality traits, only 36 (2.10%) were shared with genome-wide association study (GWAS) loci. The construction of multi-omics regulatory networks revealed 43 cis-eQTM genes potentially involved in fiber development, which cannot be identified by GWAS alone. Among these genes, the role of one encoding CBL-interacting protein kinase 10 in fiber length regulation was successfully validated through gene editing. Taken together, our findings prove that DNA methylation data can serve as an additional resource for breeding purposes and can offer opportunities to enhance and expedite the crop improvement process.
DNA甲基化在作物发育中发挥着多种调控作用。然而,在天然作物群体中,甲基化多态性与遗传多态性、基因表达和表型变异之间的关系仍 largely 未知。在这里,我们对 207 份材料花后 20 天(DPA)棉纤维的高质量甲基化组、转录组和基因组进行了调查,并将群体遗传学的经典框架扩展到表观遗传学。共鉴定出超过 2.87 亿个单甲基化多态性(SMP),比单核苷酸多态性(SNP)的数量多 100 倍。这些 SMP 在基因内区域显著富集,而在转座元件中则减少。关联分析进一步鉴定出总共 5426782 个顺式甲基化数量性状位点(cis-meQTL)、5078 个顺式表达定量性状甲基化(cis-eQTM)和 9157 个表达数量性状位点(eQTL)。值得注意的是,36.39% 的 cis-eQTM 基因与遗传变异无关,这表明大量与基因表达变异相关的 SMP 独立于 SNP。此外,在与产量和纤维品质性状相关的 1715 个表观遗传位点中,只有 36 个(2.10%)与全基因组关联研究(GWAS)位点共享。多组学调控网络的构建揭示了 43 个可能参与纤维发育的 cis-eQTM 基因,这些基因仅通过 GWAS 无法鉴定。在这些基因中,通过基因编辑成功验证了一个编码 CBL 相互作用蛋白激酶 10 的基因在纤维长度调控中的作用。综上所述,我们的研究结果证明,DNA 甲基化数据可以作为育种目的的额外资源,并为加强和加速作物改良过程提供机会。