Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, 4600 Rickenbacker Cswy, Miami, FL, 33149, USA.
Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO, 80302, USA.
BMC Genomics. 2024 Sep 18;25(1):871. doi: 10.1186/s12864-024-10747-8.
The family Batrachoididae are a group of ecologically important teleost fishes with unique life histories, behavior, and physiology that has made them popular model organisms. Batrachoididae remain understudied in the realm of genomics, with only four reference genome assemblies available for the family, with three being highly fragmented and not up to current assembly standards. Among these is the Gulf toadfish, Opsanus beta, a model organism for serotonin physiology which has recently been bred in captivity.
Here we present a new, de novo genome and transcriptome assemblies for the Gulf toadfish using PacBio long read technology. The genome size of the final assembly is 2.1 gigabases, which is among the largest teleost genomes. This new assembly improves significantly upon the currently available reference for Opsanus beta with a final scaffold count of 62, of which 23 are chromosome scale, an N50 of 98,402,768, and a BUSCO completeness score of 97.3%. Annotation with ab initio and transcriptome-based methods generated 41,076 gene models. The genome is highly repetitive, with ~ 70% of the genome composed of simple repeats and transposable elements. Satellite DNA analysis identified potential telomeric and centromeric regions.
This improved assembly represents a valuable resource for future research using this important model organism and to teleost genomics more broadly.
蟾鱼科是一组具有独特生活史、行为和生理特征的重要硬骨鱼类,这使它们成为受欢迎的模式生物。蟾鱼科在基因组学领域的研究仍然不足,该科仅有四个参考基因组组装,其中三个高度碎片化,不符合当前的组装标准。其中包括海湾蟾鱼,Opsanus beta,这是一种研究血清素生理学的模式生物,最近已在人工饲养中繁殖。
本文使用 PacBio 长读测序技术,为海湾蟾鱼提供了新的从头基因组和转录组组装。最终组装的基因组大小为 21 亿碱基对,是最大的硬骨鱼类基因组之一。与现有的 Opsanus beta 参考基因组相比,这个新的组装有了显著的改进,最终的支架数为 62 个,其中 23 个是染色体水平的,N50 为 98402768,BUSCO 完整性评分为 97.3%。使用从头预测和转录组预测方法注释生成了 41076 个基因模型。该基因组高度重复,约 70%的基因组由简单重复序列和转座元件组成。卫星 DNA 分析确定了潜在的端粒和着丝粒区域。
这个改进的组装为使用这一重要的模式生物进行未来的研究提供了有价值的资源,并为硬骨鱼类基因组学的研究提供了更广泛的资源。