Yadav Chanchal, Smith Myron, Ogunremi Dele, Yack Jayne
Department of Biology, Carleton University, Ottawa, Ontario K1S 5B6, Canada.
Canadian Food Inspection Agency, Ottawa Laboratory Fallowfield, Ontario K2J 4S1, Canada.
Data Brief. 2020 Nov 14;33:106531. doi: 10.1016/j.dib.2020.106531. eCollection 2020 Dec.
The masked birch caterpillar, Drepana arcuata Walker (Lepidoptera: Drepanidae), and other Drepanoidea (Lepidoptera) species are excellent organisms for investigating the function and evolution of vibratory communication and sociality in caterpillars. We present a de novo assembled draft genome and functional annotation for D. arcuata, using a combination of short and long sequencing reads generated by Illumina HiSeq X and Oxford Nanopore Technologies (ONT) MinION sequencing platforms, respectively. A total of 460,694,612 150 bp paired-end Illumina and 395,890 ONT raw reads were assembled into 11,493 scaffolds spanning a genome size of 270.5 Mb. The resulting D. arcuata genome has a GC content of 38.79% and a repeat content of 8.26%, is 86.5% complete based on Benchmarking Universal Single-Copy Orthologs (BUSCO) assessment, and comprises 10,398 predicted protein-coding genes. These data represent the first genomic resources for the lepidopteran superfamily Drepanoidea. Although the order Lepidoptera comprises numerous ecologically and economically important species, assembled genomes and annotations are available for <1% of the total species. These data can be further utilized for research on Lepidoptera genomics as well as on the function and evolution of vibratory communication and sociality in larval insects.
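For readers who want to recompute the headline assembly statistics reported above (scaffold count, total span, GC content) from a set of scaffold sequences, a minimal Python sketch follows. It is an illustration only, not the authors' pipeline: the function name `assembly_stats` and the toy sequences are hypothetical, and GC content is assumed here to be computed over called bases (A, C, G, T) only, excluding N gap characters.

```python
def assembly_stats(scaffolds):
    """Summarize a list of scaffold sequences (plain strings).

    Assumption: GC content is G+C divided by A+C+G+T, so gap
    characters such as N do not count toward the denominator.
    """
    span = 0    # total assembly length in bp, gaps included
    gc = 0      # number of G or C bases
    called = 0  # number of unambiguous bases (A, C, G, T)
    for seq in scaffolds:
        seq = seq.upper()
        span += len(seq)
        g_c = seq.count("G") + seq.count("C")
        a_t = seq.count("A") + seq.count("T")
        gc += g_c
        called += g_c + a_t
    gc_percent = 100.0 * gc / called if called else 0.0
    return {
        "scaffolds": len(scaffolds),
        "span_bp": span,
        "gc_percent": gc_percent,
    }


# Toy input (hypothetical sequences, not data from the assembly).
toy = ["ATGCGC", "ATATNN", "GGCC"]
stats = assembly_stats(toy)
print(stats)
```

On a real assembly, the list of strings would come from a FASTA parser, and the three values would correspond to the scaffold count, genome span, and GC percentage quoted in the abstract.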