Human Genome Research Center, Tianjin University, Tianjin, 300309, China.
Zheng-Yuan-Tang (Tianjin) Biotechnology Co. Ltd, Tianjin, 300457, China.
Sci Rep. 2019 Jan 29;9(1):898. doi: 10.1038/s41598-018-38021-4.
The complete genome of Cordyceps militaris was sequenced using single-molecule real-time (SMRT) sequencing technology at a coverage over 300×. The genome size was 32.57 Mb, and 14 contigs ranging from 0.35 to 4.58 Mb with an N50 of 2.86 Mb were assembled, including 4 contigs with telomeric sequences on both ends and an additional 8 contigs with telomeric sequences on either the 5' or 3' end. A methylome database of the genome was constructed using SMRT and m4C and m6A methylated nucleotides, and many unknown modification types were identified. The major m6A methylation motif is GA and GGAG, and the major m4C methylation motif is GC or CG/GC. In the C. militaris genome DNA, there were four types of methylated nucleotides that we confirmed using high-resolution LCMS-IT-TOF. Using PacBio Iso-Seq, a total of 31,133 complete cDNA sequences were obtained in the fruiting body. The conserved domains of the nontranscribed regions of the genome include TATA boxes, which are the initial regions of genome replication. There were 406 structural variants between the HN and CM01 strains, and there were 1,114 structural variants between the HN and ATCC strains.
采用单分子实时(SMRT)测序技术对蛹虫草的全基因组进行测序,覆盖率超过 300×。基因组大小为 32.57 Mb,组装出 14 条大小在 0.35 到 4.58 Mb 之间的 contigs,N50 为 2.86 Mb,其中 4 条 contigs 两端具有端粒序列,另外 8 条 contigs 具有 5'或 3'端的端粒序列。使用 SMRT 和 m4C 和 m6A 甲基化核苷酸构建了基因组的甲基组数据库,并鉴定出许多未知的修饰类型。主要的 m6A 甲基化模体是 GA 和 GGAG,主要的 m4C 甲基化模体是 GC 或 CG/GC。在蛹虫草基因组 DNA 中,有四种类型的甲基化核苷酸,我们使用高分辨率 LCMS-IT-TOF 进行了确认。使用 PacBio Iso-Seq,在子实体中总共获得了 31,133 条完整的 cDNA 序列。基因组非转录区的保守结构域包括 TATA 盒,这是基因组复制的初始区域。HN 和 CM01 菌株之间有 406 个结构变异,HN 和 ATCC 菌株之间有 1,114 个结构变异。