Lynn Hannah M, Gordon Jeffrey I
Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA; Newman Center for Gut Microbiome and Nutrition Research, Washington University School of Medicine, St. Louis, MO 63110, USA.
Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA; Newman Center for Gut Microbiome and Nutrition Research, Washington University School of Medicine, St. Louis, MO 63110, USA.
Cell Rep Methods. 2025 Mar 24;5(3):101005. doi: 10.1016/j.crmeth.2025.101005. Epub 2025 Mar 17.
Generating metagenome-assembled genomes from DNA shotgun sequencing datasets can demand considerable computational resources. Here, we describe a sequential co-assembly method that reduces the assembly of duplicate reads through successive application of single-node computing tools for read assembly and mapping. Using a simulated mouse microbiome DNA shotgun sequencing dataset, we demonstrated that this approach shortens assembly time, uses less memory than traditional co-assembly, and produces significantly fewer assembly errors. Applying sequential co-assembly to shotgun sequencing reads from (1) a longitudinal study of gut microbiomes from undernourished Bangladeshi children and (2) a 2.3-terabyte dataset generated from gnotobiotic mice colonized with pooled microbiomes from these children that was too large to be handled by a traditional co-assembly approach also demonstrated significant reductions in assembly time and memory requirements. These results suggest that this approach should be useful in resource-constrained settings, including in low- and middle-income countries.
从鸟枪法测序数据集中生成宏基因组组装基因组可能需要大量的计算资源。在此,我们描述了一种顺序共组装方法,该方法通过连续应用单节点计算工具进行读段组装和映射,减少了重复读段的组装。使用模拟的小鼠微生物组鸟枪法测序数据集,我们证明这种方法缩短了组装时间,比传统共组装使用的内存更少,并且产生的组装错误显著减少。将顺序共组装应用于以下两个数据集的鸟枪法测序读段:(1)对营养不良的孟加拉国儿童肠道微生物组的纵向研究;(2)来自无菌小鼠的一个2.3太字节的数据集,这些小鼠定殖有来自这些儿童的混合微生物组,该数据集太大而无法用传统共组装方法处理,结果也表明组装时间和内存需求显著减少。这些结果表明,这种方法在资源受限的环境中应该是有用的,包括在低收入和中等收入国家。