Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom.
PLoS One. 2011 Apr 25;6(4):e19259. doi: 10.1371/journal.pone.0019259.
DNA barcoding and other DNA sequence-based techniques for investigating and estimating biodiversity require explicit methods for associating individual sequences with taxa, as it is at the taxon level that biodiversity is assessed. For many projects, the bioinformatic analyses required pose problems for laboratories whose prime expertise is not in bioinformatics. User-friendly tools are required for both clustering sequences into molecular operational taxonomic units (MOTU) and for associating these MOTU with known organismal taxonomies.
Here we present jMOTU, a Java program for the analysis of DNA barcode datasets that uses an explicit, determinate algorithm to define MOTU. We demonstrate its usefulness for both individual specimen-based Sanger sequencing surveys and bulk-environment metagenetic surveys using long-read next-generation sequencing data. jMOTU is driven through a graphical user interface, and can analyse tens of thousands of sequences in a short time on a desktop computer. A companion program, Taxonerator, that adds traditional taxonomic annotation to MOTU, is also presented. Clustering and taxonomic annotation data are stored in a relational database, and are thus amenable to subsequent data mining and web presentation.
jMOTU efficiently and robustly identifies the molecular taxa present in survey datasets, and Taxonerator decorates the MOTU with putative identifications. jMOTU and Taxonerator are freely available from http://www.nematodes.org/.
用于调查和估计生物多样性的 DNA 条形码和其他基于 DNA 序列的技术需要明确的方法将个体序列与分类单元相关联,因为正是在分类单元层面上评估生物多样性。对于许多项目来说,其所需的生物信息学分析给那些主要专业知识不是生物信息学的实验室带来了问题。需要用户友好的工具来将序列聚类为分子操作分类单元 (MOTU),并将这些 MOTU 与已知的生物分类单元相关联。
在这里,我们介绍了 jMOTU,这是一个用于分析 DNA 条形码数据集的 Java 程序,它使用明确的、确定的算法来定义 MOTU。我们展示了它在基于单个标本的 Sanger 测序调查和使用长读长下一代测序数据的批量环境宏基因组调查中的有用性。jMOTU 通过图形用户界面驱动,可以在台式计算机上短时间内分析数以万计的序列。还介绍了一个配套程序 Taxonerator,它为 MOTU 添加传统的分类注释。聚类和分类注释数据存储在关系数据库中,因此可以进行后续的数据挖掘和网络呈现。
jMOTU 能够有效地、稳健地识别调查数据集中存在的分子分类单元,而 Taxonerator 则为 MOTU 赋予了可能的识别信息。jMOTU 和 Taxonerator 可从 http://www.nematodes.org/ 免费获得。