Yarza Pablo, Yilmaz Pelin, Panzer Katrin, Glöckner Frank Oliver, Reich Marlis
Ribocon GmbH, Fahrenheitstrasse 1, 28359 Bremen, Germany.
Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, 28359 Bremen, Germany.
Mar Genomics. 2017 Dec;36:33-39. doi: 10.1016/j.margen.2017.05.009. Epub 2017 Jun 1.
The usage of molecular phylogenetic approaches is critical to advance the understanding of systematics and community processes in the kingdom Fungi. Among the possible phylogenetic markers (or combinations of them), the 18S rRNA gene appears currently as the most prominent candidate due to its large availability in public databases and informative content. The purpose of this work was the creation of a reference phylogenetic framework that can serve as ready-to-use package for its application on fungal classification and community analysis. The current database contains 9329 representative 18S rRNA gene sequences covering the whole fungal kingdom, a manually curated alignment, an annotated and revised phylogenetic tree with all the sequence entries, updated information on current taxonomy, and recommendations of use. Out of 201 total fungal taxa with more than two sequences in the dataset, 179 were monophyletic. From another perspective, 66% of the entries had a tree-derived classification identical to that obtained from the NCBI taxonomy, whereas 34% differed in one or the other rank. Most of the differences were associated to missing taxonomic assignments in NCBI taxonomy, or the unexpected position of sequences that positioned out of their theoretically corresponding clades. The strong correlation observed with current fungal taxonomy evidences that 18S rRNA gene sequence-based phylogenies are adequate to reflect genealogy of Fungi at the levels of order and above, and justify their further usage and exploration.
分子系统发育方法的应用对于增进对真菌界系统学和群落过程的理解至关重要。在可能的系统发育标记(或其组合)中,18S rRNA基因目前似乎是最突出的候选者,因为它在公共数据库中的可获得性高且信息丰富。这项工作的目的是创建一个参考系统发育框架,作为一个现成的包用于真菌分类和群落分析。当前数据库包含9329条代表性的18S rRNA基因序列,覆盖整个真菌界,一个人工整理的比对,一个带有所有序列条目的注释和修订的系统发育树,当前分类学的更新信息以及使用建议。在数据集中有超过两条序列的总共201个真菌分类单元中,179个是单系的。从另一个角度来看,66%的条目具有与从NCBI分类学获得的分类相同的基于树的分类,而34%在一个或另一个分类等级上有所不同。大多数差异与NCBI分类学中缺失的分类学归属相关,或者与位于其理论上相应分支之外的序列的意外位置有关。与当前真菌分类学观察到的强相关性证明,基于18S rRNA基因序列的系统发育足以在目及以上水平反映真菌的谱系,并证明其进一步使用和探索的合理性。