Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig (ZFMK), Bonn, Germany.
Department of Zoology, Stockholm University, Sweden.
Genome Biol Evol. 2021 Aug 3;13(8). doi: 10.1093/gbe/evab122.
Conservation genomics has made dramatic improvements over the past decade, leveraging the power of genomes to infer diverse parameters central to conservation management questions. However, much of this effort has focused upon vertebrate species, despite insects providing similar flagship status with the added benefit of smaller genomes, shorter generation times and extensive historical collections in museums. Here we present the genome of the Apollo butterfly (Parnassius apollo, Papilionidae), an iconic endangered butterfly, which like many species in this genus, needs conservation genomic attention yet lacks a genome. Using 68.7 Gb of long-read data (N50 = 15.2 kb) we assembled a 1.4 Gb genome for the Apollo butterfly, making this the largest sequenced Lepidopteran genome to date. The assembly was highly contiguous (N50 = 7.1 Mb) and complete (97% of Lepidopteran BUSCOs were single-copy and complete) and consisted of 1,707 contigs. Using RNAseq data and Arthropoda proteins, we annotated 28.3K genes. Alignment with the closest-related chromosome-level assembly, Papilio bianor, reveals a highly conserved chromosomal organization, albeit genome size is highly expanded in the Apollo butterfly, due primarily to a dramatic increase in repetitive element content. Using this alignment for superscaffolding places the P. apollo genome in to 31 chromosomal scaffolds, and together with our functional annotation, provides an essential resource for advancing conservation genomics in a flagship species for insect conservation.
在过去的十年中,保护基因组学取得了显著的进展,利用基因组的力量推断出保护管理问题的各种核心参数。然而,这项工作的大部分重点都放在了脊椎动物物种上,尽管昆虫提供了类似的旗舰地位,并且具有基因组更小、世代时间更短以及博物馆中广泛的历史收藏等额外优势。在这里,我们展示了阿波罗蝴蝶(Parnassius apollo,凤蝶科)的基因组,这是一种标志性的濒危蝴蝶,与该属中的许多物种一样,需要保护基因组学的关注,但缺乏基因组。我们使用 68.7Gb 的长读数据(N50=15.2kb)组装了阿波罗蝴蝶的 1.4Gb 基因组,这是迄今为止测序的最大鳞翅目基因组。该组装高度连续(N50=7.1Mb)且完整(97%的鳞翅目 BUSCO 为单拷贝和完整),由 1707 个 contigs 组成。使用 RNAseq 数据和节肢动物蛋白,我们注释了 28300 个基因。与最接近的染色体水平组装 Papilio bianor 的比对显示出高度保守的染色体组织,尽管阿波罗蝴蝶的基因组大小高度扩张,这主要是由于重复元件含量的急剧增加。使用这种比对进行超级 scaffolding 将 P. apollo 基因组置于 31 个染色体 scaffolding 中,并与我们的功能注释一起,为推进旗舰昆虫保护物种的保护基因组学提供了重要资源。