Cooperative Institute of Marine and Atmospheric Science, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, FL, USA.
Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL, USA.
BMC Genomics. 2024 Feb 29;25(1):226. doi: 10.1186/s12864-024-10092-w.
Long-read sequencing is revolutionizing de-novo genome assemblies, with continued advancements making it more readily available for previously understudied, non-model organisms. Stony corals are one such example, with long-read de-novo genome assemblies now starting to become publicly available, opening the door for a wide array of 'omics-based research. Here we present a new de-novo genome assembly for the endangered Caribbean star coral, Orbicella faveolata, using PacBio circular consensus reads. Compared to the currently available reference genome, which was generated using short-read methodologies, our assembly improved contiguity (51 versus 1,933 contigs) and the proportion of complete, single-copy BUSCO orthologs (93.6% versus 85.3%; database metazoa_odb10). Our new de-novo assembled genome also showed quality metrics comparable to those of other coral long-read genomes. Telomeric repeat analysis identified putative chromosomes in our scaffolded assembly, with these repeats at one or both ends of scaffolded contigs. We identified 32,172 protein-coding genes in our assembly using long-read RNA sequencing (Iso-Seq) of additional O. faveolata fragments exposed to a range of abiotic and biotic treatments, together with publicly available short-read RNA-seq data. With anthropogenic influences heavily affecting O. faveolata, and with its increasing incorporation into reef restoration activities, this updated genome resource can be used for population genomics and other 'omics analyses to aid in the conservation of this species.