A haplotype-resolved genome provides insight into allele-specific expression in wild walnut (Juglans regia L.).

Affiliations

Institute of Horticulture Crops, Xinjiang Academy of Agricultural Sciences, the State Key Laboratory of Genetic Improvement and Germplasm Innovation of Crop Resistance in Arid Desert Regions, Key Laboratory of Genome Research and Genetic Improvement of Xinjiang Characteristic Fruits and Vegetables, Urumqi, China.

College of Agriculture, Henan University, Zhengzhou, China.

Publication information

Sci Data. 2024 Mar 8;11(1):278. doi: 10.1038/s41597-024-03096-4.

Abstract

Wild germplasm resources are crucial for gene mining and molecular breeding because of their distinctive trait performance. A haplotype-resolved genome is an ideal solution for fully understanding the biology of subgenomes in highly heterozygous species. Here, we surveyed the genome of a wild walnut tree from Gongliu County, Xinjiang, China, and used PacBio high-fidelity (HiFi) reads and Hi-C technology to generate a haplotype-resolved reference genome of 562.99 Mb (contig N50 = 34.10 Mb) for one haplotype (hap1) and 561.07 Mb (contig N50 = 33.91 Mb) for the other (hap2). Approximately 527.20 Mb (93.64%) of hap1 and 526.40 Mb (93.82%) of hap2 were assigned to 16 pseudochromosomes. A total of 41,039 and 39,744 protein-coding gene models were predicted for hap1 and hap2, respectively. Moreover, 123 structural variations (SVs) were identified between the two haplotype genomes. Allele-specific expression genes (ASEGs) that respond to cold stress were ultimately identified. These datasets can be used to study subgenome evolution, to mine elite functional genes, and to discover the transcriptional basis of specific traits related to environmental adaptation in wild walnut.
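The anchoring percentages quoted in the abstract follow directly from the reported sizes. The short sketch below only re-derives them as a consistency check; the function name `anchoring_rate` is a hypothetical helper introduced here for illustration and is not part of the authors' pipeline.

```python
def anchoring_rate(anchored_mb: float, assembly_mb: float) -> float:
    """Return the share of an assembly placed on pseudochromosomes, in percent."""
    if assembly_mb <= 0:
        raise ValueError("assembly size must be positive")
    return anchored_mb / assembly_mb * 100.0


# Sizes in Mb, taken from the abstract: (anchored, total assembly).
haplotypes = {
    "hap1": (527.20, 562.99),
    "hap2": (526.40, 561.07),
}

for name, (anchored, total) in haplotypes.items():
    print(f"{name}: {anchoring_rate(anchored, total):.2f}% anchored")
```

Rounded to two decimals, the results agree with the 93.64% and 93.82% stated in the abstract.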
