Wu Qingtao, Gao Jie, Sa Boxuan, Cong Hongtao, Deng Wenjie, Zhang Ying, Zhong Xiaojie, Zhang Jinyu, Wang Liduo, Liu Haizhou, Yan Yi, Zhang Yifei, Liu Di, Yan Wei
College of Marine Science and Technology, China University of Geosciences, Wuhan, 430074, China.
Computational Virology Group, Etiology Research Center, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China.
Sci Data. 2025 Apr 12;12(1):612. doi: 10.1038/s41597-025-04762-x.
Prochlorococcus and Synechococcus are key contributors to marine primary production and play essential roles in global biogeochemical cycles. Despite the ecological importance of these two picocyanobacterial genera, current genomic datasets still lack comprehensive representation of under-sampled ocean regions and of the bacteria and viruses associated with these genera. To address this gap, we used a combination of second- and third-generation sequencing technologies to assemble comprehensive genomic data from 105 picocyanobacterial enrichment cultures isolated from the Indian Ocean, the South China Sea, and the western Pacific Ocean. This dataset includes 55 Prochlorococcus and 50 Synechococcus genomes with high completeness (>98%) and low contamination (<2%), along with 308 non-redundant associated bacterial genomes derived from 1,457 medium- and high-quality non-cyanobacterial metagenome-assembled genomes (MAGs; completeness ≥50% and contamination ≤10%). Additionally, 2,113 non-redundant viral operational taxonomic units (vOTUs) were derived from a total of 7,632 qualified viral contigs. This dataset provides a valuable resource for improving our understanding of the complex interactions among Prochlorococcus, Synechococcus, and their associated bacteria and viruses in marine ecosystems, offering a foundation for studying their ecological roles and evolutionary dynamics.
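The quality thresholds quoted in the abstract can be read as simple filters over per-genome completeness and contamination estimates. The following is a minimal sketch of that reading only. The abstract does not say which tools or pipeline produced the estimates, and the `GenomeBin` class, the function names, and the example values are all hypothetical and invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenomeBin:
    """Hypothetical record for one assembled genome or MAG."""
    name: str             # made-up identifier, not from the dataset
    completeness: float   # estimated completeness, in percent
    contamination: float  # estimated contamination, in percent


def is_medium_or_high_quality(g: GenomeBin) -> bool:
    # MAG threshold stated in the abstract:
    # completeness >= 50% and contamination <= 10%.
    return g.completeness >= 50.0 and g.contamination <= 10.0


def meets_picocyanobacterial_threshold(g: GenomeBin) -> bool:
    # Threshold reported for the Prochlorococcus and Synechococcus genomes:
    # completeness > 98% and contamination < 2%.
    return g.completeness > 98.0 and g.contamination < 2.0


# Invented example values, used only to exercise the two filters.
bins = [
    GenomeBin("bin_a", 99.2, 0.5),
    GenomeBin("bin_b", 72.0, 4.1),
    GenomeBin("bin_c", 45.0, 1.0),   # fails: completeness below 50%
    GenomeBin("bin_d", 90.0, 12.5),  # fails: contamination above 10%
]

retained = [b.name for b in bins if is_medium_or_high_quality(b)]
near_complete = [b.name for b in bins if meets_picocyanobacterial_threshold(b)]

print(retained)
print(near_complete)
```

Note that the two criteria differ in strictness at the boundaries: the MAG threshold is inclusive (≥, ≤), while the threshold for the picocyanobacterial genomes is written with strict inequalities (>, <).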