Suppr超能文献

真菌群落内转录间隔区序列数据的聚类掩盖了分类多样性。

Clustering of fungal community internal transcribed spacer sequence data obscures taxonomic diversity.

机构信息

Department of Environmental Health, Seoul National University, Seoul, Korea.

出版信息

Environ Microbiol. 2014 Aug;16(8):2491-500. doi: 10.1111/1462-2920.12390. Epub 2014 Mar 4.

Abstract

Next-generation DNA sequencing has enabled a rapid expansion in the size of molecular fungal ecology studies employing the nuclear internal transcribed spacer (ITS) region. Many sequence-processing pipelines and protocols require sequence clustering to generate operational taxonomic units (OTUs) based on sequence similarity as a step to reduce total data quantity and complexity prior to taxonomic assignment. However, the consequences of ITS sequence clustering in regard to sample taxonomic coverage have not been carefully examined. Here we demonstrate that typically used clustering thresholds for fungal ITS sequences result in statistically significant losses in taxonomic coverage. Analyses using environmentally derived fungal sequences indicated an average of 3.1% of species went undetected (P < 0.05) if the sequences were denoised and clustered at a 97% threshold prior to taxonomic assignment. Additionally, an in silico analysis using a reference fungal ITS database suggested that approximately 25% of species went undetected if the sequences were clustered prior to taxonomic assignment. Finally, analysis of sequences derived from pure-cultured fungal isolates of known identity indicated sequence denoising and clustering were not critical in improving identification accuracy.

摘要

下一代 DNA 测序技术使得利用核内转录间隔区(ITS)区域进行分子真菌生态学研究的规模迅速扩大。许多序列处理管道和协议需要序列聚类,以便根据序列相似性生成操作分类单元(OTUs),作为在分类分配之前减少总数据量和复杂性的步骤。然而,ITS 序列聚类对样本分类覆盖范围的影响尚未得到仔细检查。在这里,我们证明通常用于真菌 ITS 序列的聚类阈值会导致分类覆盖范围的统计学显著损失。使用环境衍生的真菌序列进行的分析表明,如果在进行分类分配之前对序列进行去噪和聚类到 97%的阈值,则平均有 3.1%的物种未被检测到(P < 0.05)。此外,使用参考真菌 ITS 数据库进行的计算机模拟分析表明,如果在进行分类分配之前对序列进行聚类,则大约有 25%的物种未被检测到。最后,对已知身份的纯培养真菌分离物的序列进行分析表明,序列去噪和聚类对于提高鉴定准确性并不是关键。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验