State Key Laboratory of Agricultural Microbiology, College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China.
College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
Genes (Basel). 2020 Apr 29;11(5):483. doi: 10.3390/genes11050483.
is an important model legume for studying symbiotic nitrogen fixation as well as plant development. A genomic sequence of (MG20) has been available for more than ten years. However, the low quality of the genome limits its application in functional genomic studies. Therefore, it is necessary to assemble high-quality chromosome sequences of using new sequencing technology to facilitate the study of functional genomics. In this report, we used the third-generation sequencing combined with the Illumina HiSeq platform to sequence the genome of (MG20). We obtained 544 Mb of genomic sequence using third-generation assembly. Based on sequence analysis, 357 Mb of repeats, 28,251 genes, 626 tRNAs, 1409 rRNAs, and 1233 pseudogenes were predicted in the genome. A total of 27,991 genes were annotated into databases. Compared to the previously published data, the new genome database contains complete sequences in the proper order and orientation with a contig N50 2.81Mb and an excellent genome coverage, which provides more accurate genome information and more precise assembly for functional genomic study.
是研究共生固氮以及植物发育的重要模式豆科植物。其基因组序列 (MG20) 已经有十多年的历史了。然而,基因组质量低限制了其在功能基因组研究中的应用。因此,有必要利用新的测序技术来组装 的高质量染色体序列,以促进功能基因组学的研究。在本报告中,我们使用第三代测序技术结合 Illumina HiSeq 平台对 (MG20)的基因组进行测序。我们使用第三代组装获得了 5.44Mb 的基因组序列。基于序列分析,预测到该基因组中有 357Mb 的重复序列、28251 个基因、626 个 tRNA、1409 个 rRNA 和 1233 个假基因。总共注释了 27991 个基因到数据库中。与之前公布的数据相比,新的基因组数据库包含了完整的 序列,并且按照正确的顺序和方向排列,其 contig N50 为 2.81Mb,基因组覆盖度极好,这为功能基因组研究提供了更准确的基因组信息和更精确的组装。