State Key Laboratory of Rice Biology, Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insect Pests & Key Laboratory of Biology of Crop Pathogens and Insects of Zhejiang Province, Institute of Insect Sciences, Zhejiang University, Hangzhou, 310058, China.
Shanghai Institute for Advanced Study, Zhejiang University, Shanghai, 201203, China.
Sci Data. 2023 Mar 22;10(1):159. doi: 10.1038/s41597-023-02067-5.
The ectoparasitoid wasp Theocolax elegans is a cosmopolitan and generalist pteromalid parasitoid of several major storage insect pests, and can effectively suppress a host population in warehouses. However, little molecular information about this wasp is currently available. In this study, we assembled the genome of T. elegans using PacBio long-read sequencing, Illumina sequencing, and Hi-C methods. The genome assembly is 662.73 Mb in length with contig and scaffold N50 values of 1.15 Mb and 88.8 Mb, respectively. The genome contains 56.4% repeat sequences and 23,212 protein-coding genes were annotated. Phylogenomic analyses revealed that T. elegans diverged from the lineage leading to subfamily Pteromalinae (Nasonia vitripennis and Pteromalus puparum) approximately 110.5 million years ago. We identified 130 significantly expanded gene families, 34 contracted families, 248 fast-evolving genes, and 365 positively selected genes in T. elegans. Additionally, 260 olfactory receptors and 285 venom proteins were identified. This genome assembly provides valuable genetic bases for future investigations on evolution, molecular biology and application of T. elegans.
长管长尾小蜂是一种世界性的、兼性的长尾小蜂科寄生蜂,可有效抑制仓库中几种主要仓储害虫的种群数量。然而,目前关于这种小蜂的分子信息还很少。在本研究中,我们使用 PacBio 长读测序、Illumina 测序和 Hi-C 方法组装了 T. elegans 的基因组。基因组组装长度为 662.73 Mb,其 contig 和 scaffold N50 值分别为 1.15 Mb 和 88.8 Mb。基因组包含 56.4%的重复序列,注释了 23212 个蛋白质编码基因。系统发育分析表明,T. elegans 与亚科 Pteromalinae(Nasonia vitripennis 和 Pteromalus puparum)的进化支大约在 1.105 亿年前分化。我们在 T. elegans 中鉴定了 130 个显著扩张的基因家族、34 个收缩的家族、248 个快速进化的基因和 365 个正选择的基因。此外,还鉴定了 260 个嗅觉受体和 285 种毒液蛋白。这个基因组组装为未来研究 T. elegans 的进化、分子生物学和应用提供了有价值的遗传基础。