Department of Evolution and Ecology, University of California Davis, One Shields Ave, Davis, CA 95616, USA.
USDA, ARS, PA, US Livestock Insects Research Lab, 2700 Fredericksburg Road, Kerrville, TX 78028, USA.
G3 (Bethesda). 2024 Oct 7;14(10). doi: 10.1093/g3journal/jkae155.
Drosophila prolongata is a member of the melanogaster species group and rhopaloa subgroup, native to the subtropical highlands of Southeast Asia. This species exhibits an array of recently evolved male-specific morphological, physiological, and behavioral traits that distinguish it from its closest relatives, making it an attractive model for studying the evolution of sexual dimorphism and testing theories of sexual selection. The lack of genomic resources has impeded the dissection of the molecular basis of sex-specific development and behavior in this species. To address this, we assembled the genome of D. prolongata using long-read sequencing and Hi-C scaffolding, resulting in a highly complete and contiguous (scaffold N50 2.2 Mb) genome assembly of 220 Mb. The repetitive content of the genome is 24.6%, the plurality of which (33.2%) consists of long terminal repeat retrotransposons. Annotations based on RNA-seq data and homology to related species revealed a total of 19,330 genes, of which 16,170 are protein-coding. The assembly includes 98.5% of Diptera BUSCO genes, with 93.8% present as a single copy. Despite some likely regional duplications, the completeness of this genome suggests that it can be readily used for gene expression analysis, genome-wide association studies (GWAS), and other genomic analyses.
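For readers unfamiliar with the contiguity statistic quoted above, the following is a minimal sketch of how a scaffold N50 is conventionally computed: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly. The function name `n50` and the toy lengths are hypothetical illustrations, not the authors' pipeline or data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    Hypothetical helper for illustration; not taken from the paper.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold down, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first scaffold that brings coverage to >= 50%.
        if 2 * running >= total:
            return length


# Toy scaffold lengths (made up, not from the D. prolongata assembly).
toy_scaffolds = [5, 4, 3, 2, 1]
print(n50(toy_scaffolds))
```

In practice the lengths would be read from the assembly FASTA index; a scaffold N50 of 2.2 Mb means that half of the 220 Mb assembly lies in scaffolds of at least 2.2 Mb.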