Duncan Anthony, Barry Kerrie, Daum Chris, Eloe-Fadrosh Emiley, Roux Simon, Schmidt Katrin, Tringe Susannah G, Valentin Klaus U, Varghese Neha, Salamov Asaf, Grigoriev Igor V, Leggett Richard M, Moulton Vincent, Mock Thomas
School of Computing Sciences, University of East Anglia, Norwich Research Park, Norwich, NR47TJ, UK.
US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.
Data Brief. 2023 Feb 15;47:108990. doi: 10.1016/j.dib.2023.108990. eCollection 2023 Apr.
This article presents metagenome-assembled genomes (MAGs) for both eukaryotic and prokaryotic organisms originating from the Arctic and Atlantic oceans, along with gene prediction and functional annotation for MAGs from both domains. Eleven samples from the chlorophyll-a maximum layer of the surface ocean were collected during two cruises in 2012; six from the Arctic in June-July on ARK-XXVII/1 (PS80), and five from the Atlantic in November on ANT-XXIX/1 (PS81). Sequencing and assembly was carried out by the Joint Genome Institute (JGI), who provide annotation of the assembled sequences, and 122 MAGs for prokaryotic organisms. A subsequent binning process identified 21 MAGs for eukaryotic organisms, mostly identified as Mamiellophyceae or Bacillariophyceae. The data for each MAG includes sequences in FASTA format, and tables of functional annotation of genes. For eukaryotic MAGs, transcript and protein sequences for predicted genes are available. A spreadsheet is provided summarising quality measures and taxonomic classifications for each MAG. These data provide draft genomes for uncultured marine microbes, including some of the first MAGs for polar eukaryotes, and can provide reference genetic data for these environments, or used in genomics-based comparison between environments.
本文展示了源自北冰洋和大西洋的真核生物和原核生物的宏基因组组装基因组(MAGs),以及这两个领域MAGs的基因预测和功能注释。在2012年的两次航行中,从海洋表层叶绿素a含量最高的水层采集了11个样本;6个样本于6月至7月在ARK-XXVII/1(PS80)号船上从北极采集,5个样本于11月在ANT-XXIX/1(PS81)号船上从大西洋采集。测序和组装工作由美国能源部联合基因组研究所(JGI)进行,该研究所提供组装序列的注释,以及122个原核生物的MAGs。随后的分箱过程确定了21个真核生物的MAGs,其中大多数被鉴定为Mamiellophyceae或Bacillariophyceae。每个MAG的数据包括FASTA格式的序列,以及基因功能注释表。对于真核生物的MAGs,可获得预测基因的转录本和蛋白质序列。提供了一个电子表格,总结了每个MAG的质量指标和分类学分类。这些数据提供了未培养海洋微生物的基因组草图,包括一些首批极地真核生物的MAGs,可为这些环境提供参考遗传数据,或用于基于基因组学的环境间比较。