Peterson Jennifer K, MacDonald Madolyn L, Ellis Vincenzo A
Department of Entomology and Wildlife Ecology, University of Delaware, Newark, DE USA.
Bioinformatics and Computational Biology Core, University of Delaware, Newark, DE USA.
bioRxiv. 2024 Oct 8:2024.10.07.611472. doi: 10.1101/2024.10.07.611472.
Triatoma sanguisuga is the most widespread triatomine bug species in the United States (US). The species vectors the human parasite Trypanosoma cruzi, which causes Chagas disease. Vector-borne Chagas disease is rarely diagnosed in the US, but T. sanguisuga has been implicated in a handful of cases. Despite its public health importance, little is known about the genomics or population genetics of T. sanguisuga. Here, we used long-read sequencing to assemble the first whole genome sequence for T. sanguisuga, using DNA extracted from one adult specimen from Delaware. The final size of the genome was 1.162 Gbp with 77.7x coverage. The assembly consisted of 183 contigs with an N50 size of 94.97 Kb. The Benchmarking Universal Single-Copy Ortholog (BUSCO) complete score was 99.1%, suggesting a very complete assembly. Genome-wide GC level was 33.56%, and DNA methylation was 18.84%. The genome consists of 61.4% repetitive DNA and 17,799 predicted coding genes. The assembled T. sanguisuga genome was slightly larger than those of the Triatominae species Triatoma infestans and Rhodnius prolixus (949 Mbp with 90.4% BUSCO score and 706 Mbp with 96.5% BUSCO score, respectively). The T. sanguisuga genome is the first genome of a North American triatomine species to be sequenced, and it is the most complete genome yet for any Triatominae species. The T. sanguisuga genome will allow for deeper investigations into epidemiologically relevant aspects of this important vector species, including blood feeding, host seeking, and parasite competence, thus providing potential vector-borne disease management targets and strengthening public health preparedness.
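For readers unfamiliar with the assembly statistics quoted above, the following is a minimal sketch of how an N50 value and a GC percentage are conventionally computed from contig lengths and sequence. It is not the authors' pipeline; the function names `n50` and `gc_percent` are hypothetical, and the sketch assumes the common definitions (N50 as the length of the contig at which the cumulative sum of contigs, sorted longest first, reaches half of the total assembly length; GC level as the share of G and C among unambiguous bases).

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a set of contig lengths (hypothetical helper).

    Contigs are sorted longest first; N50 is the length of the contig at
    which the running total first reaches half of the assembly size.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("need at least one contig length")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    return ordered[-1]  # not reached for non-empty input


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C among unambiguous bases (A, C, G, T).

    Ambiguous symbols such as N are ignored (an assumption of this sketch).
    """
    seq = sequence.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no unambiguous bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / counted
```

In practice these figures are usually taken from the report of an assembly evaluation tool rather than computed by hand; the sketch only illustrates what the numbers mean.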