Peabody Michael A, Van Rossum Thea, Lo Raymond, Brinkman Fiona S L
Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
BMC Bioinformatics. 2015 Nov 4;16:363. doi: 10.1186/s12859-015-0788-5.
The field of metagenomics (study of genetic material recovered directly from an environment) has grown rapidly, with many bioinformatics analysis methods being developed. To ensure appropriate use of such methods, robust comparative evaluation of their accuracy and features is needed. For taxonomic classification of sequence reads, such evaluation should include use of clade exclusion, which better evaluates a method's accuracy when identical sequences are not present in any reference database, as is common in metagenomic analysis. To date, relatively small evaluations have been performed, with evaluation approaches like clade exclusion limited to assessment of new methods by the authors of the given method. What is needed is a rigorous, independent comparison between multiple major methods, using the same in silico and in vitro test datasets, with and without approaches like clade exclusion, to better characterize accuracy under different conditions.
An overview of the features of 38 bioinformatics methods is provided, evaluating accuracy with a focus on 11 programs that have reference databases that can be modified and therefore most robustly evaluated with clade exclusion. Taxonomic classification of sequence reads was evaluated using both in silico and in vitro mock bacterial communities. Clade exclusion was used at taxonomic levels from species to class-identifying how well methods perform in progressively more difficult scenarios. A wide range of variability was found in the sensitivity, precision, overall accuracy, and computational demand for the programs evaluated. In experiments where distilled water was spiked with only 11 bacterial species, frequently dozens to hundreds of species were falsely predicted by the most popular programs. The different features of each method (forces predictions or not, etc.) are summarized, and additional analysis considerations discussed.
The accuracy of shotgun metagenomics classification methods varies widely. No one program clearly outperformed others in all evaluation scenarios; rather, the results illustrate the strengths of different methods for different purposes. Researchers must appreciate method differences, choosing the program best suited for their particular analysis to avoid very misleading results. Use of standardized datasets for method comparisons is encouraged, as is use of mock microbial community controls suitable for a particular metagenomic analysis.
宏基因组学(直接从环境中回收的遗传物质的研究)领域发展迅速,许多生物信息学分析方法不断涌现。为确保这些方法的合理应用,需要对其准确性和特征进行有力的比较评估。对于序列读数的分类学分类,此类评估应包括使用进化枝排除法,当任何参考数据库中都不存在相同序列时,进化枝排除法能更好地评估方法的准确性,这在宏基因组分析中很常见。迄今为止,相关评估规模相对较小,像进化枝排除法这样的评估方法仅限于给定方法的作者对新方法的评估。我们需要使用相同的计算机模拟和体外测试数据集,在有或没有进化枝排除法等方法的情况下,对多种主要方法进行严格、独立的比较,以更好地描述不同条件下的准确性。
本文概述了38种生物信息学方法的特征,重点评估了11个具有可修改参考数据库的程序的准确性,因此可以使用进化枝排除法进行最可靠的评估。使用计算机模拟和体外模拟细菌群落对序列读数进行分类学分类。在从物种到纲的分类水平上使用进化枝排除法,以确定方法在逐渐更具挑战性的情况下的表现。在所评估的程序的灵敏度、精确度、总体准确性和计算需求方面发现了广泛差异。在仅向蒸馏水中添加11种细菌物种的实验中,最流行的程序经常错误预测出数十到数百种物种。总结了每种方法的不同特征(是否强制预测等),并讨论了其他分析注意事项。
鸟枪法宏基因组学分类方法的准确性差异很大。在所有评估场景中,没有一个程序明显优于其他程序;相反,结果说明了不同方法在不同目的下的优势。研究人员必须了解方法差异,选择最适合其特定分析的程序,以避免产生极具误导性的结果。鼓励使用标准化数据集进行方法比较,以及使用适合特定宏基因组分析的模拟微生物群落对照。