Jones Marcus B, Highlander Sarah K, Anderson Ericka L, Li Weizhong, Dayrit Mark, Klitgord Niels, Fabani Martin M, Seguritan Victor, Green Jessica, Pride David T, Yooseph Shibu, Biggs William, Nelson Karen E, Venter J Craig
Human Longevity, Inc., San Diego, CA 92121; Genomic Medicine, J. Craig Venter Institute, La Jolla, CA 92037;
Genomic Medicine, J. Craig Venter Institute, La Jolla, CA 92037;
Proc Natl Acad Sci U S A. 2015 Nov 10;112(45):14024-9. doi: 10.1073/pnas.1519288112. Epub 2015 Oct 28.
Observations from human microbiome studies are often conflicting or inconclusive. Many factors likely contribute to these issues including small cohort sizes, sample collection, and handling and processing differences. The field of microbiome research is moving from 16S rDNA gene sequencing to a more comprehensive genomic and functional representation through whole-genome sequencing (WGS) of complete communities. Here we performed quantitative and qualitative analyses comparing WGS metagenomic data from human stool specimens using the Illumina Nextera XT and Illumina TruSeq DNA PCR-free kits, and the KAPA Biosystems Hyper Prep PCR and PCR-free systems. Significant differences in taxonomy are observed among the four different next-generation sequencing library preparations using a DNA mock community and a cell control of known concentration. We also revealed biases in error profiles, duplication rates, and loss of reads representing organisms that have a high %G+C content that can significantly impact results. As with all methods, the use of benchmarking controls has revealed critical differences among methods that impact sequencing results and later would impact study interpretation. We recommend that the community adopt PCR-free-based approaches to reduce PCR bias that affects calculations of abundance and to improve assemblies for accurate taxonomic assignment. Furthermore, the inclusion of a known-input cell spike-in control provides accurate quantitation of organisms in clinical samples.
人类微生物组研究的观察结果往往相互矛盾或无定论。许多因素可能导致这些问题,包括队列规模小、样本采集以及处理和加工差异。微生物组研究领域正从16S rDNA基因测序转向通过对完整群落进行全基因组测序(WGS)来实现更全面的基因组和功能表征。在这里,我们使用Illumina Nextera XT和Illumina TruSeq DNA PCR-free试剂盒以及KAPA Biosystems Hyper Prep PCR和PCR-free系统,对来自人类粪便标本的WGS宏基因组数据进行了定量和定性分析。使用DNA模拟群落和已知浓度的细胞对照,在四种不同的下一代测序文库制备方法中观察到分类学上的显著差异。我们还揭示了错误谱、重复率以及代表高%G+C含量生物体的读数丢失方面的偏差,这些偏差会显著影响结果。与所有方法一样,使用基准对照揭示了影响测序结果且随后会影响研究解释的方法之间的关键差异。我们建议业界采用基于免PCR的方法,以减少影响丰度计算的PCR偏差,并改进组装以进行准确的分类学归属。此外,加入已知输入的细胞加标对照可对临床样本中的生物体进行准确定量。