Department of Integrative Biology, University of California, Berkeley, CA 94720, USA.
Museum of Vertebrate Zoology, Berkeley, CA 94720, USA.
G3 (Bethesda). 2023 Sep 30;13(10). doi: 10.1093/g3journal/jkad157.
North American minnows (Cypriniformes: Leuciscidae) are a taxonomically diverse group, but many members, particularly those inhabiting deserts, face elevated extinction risk. Despite these conservation concerns, leuciscids remain undersampled for reference assemblies relative to other groups of freshwater fishes. Here, we present 2 chromosome-scale reference genome assemblies, for spikedace (Meda fulgida) and loach minnow (Tiaroga cobitis), generated using PacBio, Illumina, and Omni-C technologies. The complete spikedace assembly was 882.1 Mb in total length, comprising 83 scaffolds with N50 = 34.8 Mb, L50 = 11, N75 = 32.3 Mb, and L75 = 18. The complete loach minnow assembly was 1.3 Gb in total length, comprising 550 scaffolds with N50 = 48.6 Mb, L50 = 13, N75 = 42.3 Mb, and L75 = 20. Completeness, assessed with Benchmarking Universal Single-Copy Orthologues (BUSCO) using the Actinopterygii BUSCO database, was ∼97% complete BUSCOs for spikedace and ∼98% for loach minnow. Annotation indicated that protein-coding genes make up approximately 32.58% and 29.04% of the total genome length of spikedace and loach minnow, respectively. Comparative genomic analyses of these endangered, co-distributed fishes revealed widespread structural variants, gene family expansions, and evidence of positive selection in both genomes.
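For readers unfamiliar with the contiguity statistics reported above, the following is a minimal sketch of how Nx and Lx values are conventionally computed from a list of scaffold lengths. It is not the authors' pipeline; the function name `nx_lx` and the toy lengths are hypothetical and used only for illustration.

```python
def nx_lx(lengths, fraction):
    """Return (Nx, Lx) for the given scaffold lengths.

    Nx is the length of the shortest scaffold in the smallest set of
    longest scaffolds that together cover `fraction` of the assembly;
    Lx is the number of scaffolds in that set.
    Use fraction=0.50 for N50/L50 and fraction=0.75 for N75/L75.
    """
    if not lengths:
        raise ValueError("no scaffold lengths given")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")

    ordered = sorted(lengths, reverse=True)   # longest scaffold first
    threshold = fraction * sum(ordered)       # bases that must be covered
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= threshold:
            return length, count
    raise AssertionError("unreachable when 0 < fraction <= 1")


# Hypothetical toy assembly (arbitrary units), not data from the study.
toy = [40, 30, 20, 5, 5]
n50, l50 = nx_lx(toy, 0.50)
n75, l75 = nx_lx(toy, 0.75)
print(f"N50={n50} L50={l50} N75={n75} L75={l75}")
```

Read this way, the reported L50 = 11 for spikedace means that the 11 longest scaffolds together contain at least half of the 882.1 Mb assembly, and the shortest of those 11 is 34.8 Mb long.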