Assessing accuracy and specificity of faecal source library for microbial source-tracking, using SourceTracker as case study.

Author information

Lim Timothy J Y, Delgado Yussi M Palacios, Lintern Anna, McCarthy David T, Henry Rebekah

Affiliations

Department of Civil & Environmental Engineering, Monash University, Clayton, VIC 3800, Australia.

School of Environmental Sciences, University of Guelph, Guelph, ON N1G 2W1, Canada.

Publication information

Bioinform Adv. 2025 Apr 29;5(1):vbaf103. doi: 10.1093/bioadv/vbaf103. eCollection 2025.

DOI:10.1093/bioadv/vbaf103

PMID:40395502

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12092083/

Abstract

MOTIVATION

Understanding the quality of the source library prior to undertaking library-dependent microbial source-tracking (MST) is an essential, but often overlooked, primary analysis step.

RESULTS

We propose an assessment approach to validate the quality of amplicon-derived faecal source libraries. This approach was demonstrated on a faecal source library consisting of 16S rRNA paired-end amplicon sequences, obtained from various animal types in Victoria, Australia. First, a leave-one-out (LOO) analysis was performed to assess the accuracy of source category groupings by identifying the number of samples incorrectly assigned to a different source category (i.e. animal type). Following a quality control procedure to decide retaining/removing/grouping incorrectly assigned samples, we then assessed if the sample sizes for each source type were sufficient to properly characterize the source fingerprints. Results from LOO demonstrated 15.5% of samples were incorrectly assigned, with high error rates in birds and wallabies within our source library. Increasing the sample size improved source identification accuracy. However, accuracy eventually plateaued in a source-specific manner. Importantly, this highlights the importance of conducting thorough assessments to understand the quality and limitations of the source library prior to library-dependent MST applications.
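The leave-one-out step described above can be illustrated with a minimal sketch. This is not the authors' pipeline and does not call SourceTracker: as a labeled assumption, a simple nearest-centroid rule on Bray-Curtis dissimilarity stands in for the source-assignment step, and the sample labels and abundance vectors are hypothetical toy data. It only shows the shape of the procedure: hold out one library sample, rebuild the source profiles from the rest, assign the held-out sample, and count how many samples land in a different source category.

```python
from collections import defaultdict


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two equal-length abundance vectors."""
    num = sum(abs(x - y) for x, y in zip(a, b))
    den = sum(a) + sum(b)
    return num / den if den else 0.0


def centroid(profiles):
    """Feature-wise mean of a list of abundance vectors."""
    n = len(profiles)
    return [sum(col) / n for col in zip(*profiles)]


def leave_one_out(samples):
    """Hold out each sample in turn and assign it to the closest source.

    samples: list of (source_label, abundance_vector) pairs.
    Returns a list of (true_label, predicted_label) pairs.
    Stand-in classifier (assumption): nearest source centroid by Bray-Curtis.
    A source with a single sample disappears when that sample is held out,
    so it will always be counted as misassigned.
    """
    results = []
    for i, (label, profile) in enumerate(samples):
        # Rebuild the library without the held-out sample.
        by_source = defaultdict(list)
        for j, (other_label, other_profile) in enumerate(samples):
            if j != i:
                by_source[other_label].append(other_profile)
        predicted = min(
            by_source,
            key=lambda s: bray_curtis(profile, centroid(by_source[s])),
        )
        results.append((label, predicted))
    return results


def misassignment_rate(results):
    """Fraction of held-out samples assigned to a different source category."""
    wrong = sum(1 for true, pred in results if true != pred)
    return wrong / len(results)


# Hypothetical toy library: three taxa, two source categories.
library = [
    ("cow", [90, 5, 5]),
    ("cow", [85, 10, 5]),
    ("cow", [80, 10, 10]),
    ("bird", [5, 90, 5]),
    ("bird", [10, 80, 10]),
    ("bird", [70, 20, 10]),  # atypical bird sample that resembles cow
]

results = leave_one_out(library)
rate = misassignment_rate(results)
```

In this toy library the atypical bird sample is assigned to cow, so one of six samples is misassigned. In a real assessment, flagged samples like this one would go through the quality control decision (retain, remove, or regroup) described in the abstract.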

AVAILABILITY AND IMPLEMENTATION

QIIME2 is available via https://qiime2.org/; SourceTracker v2.0.1 is available via https://github.com/caporaso-lab/sourcetracker2; Pipeline for LOO is available via https://github.com/MonashOWL/Bioinformatics-IlluminaMGI/tree/main/16S/LOO; Pipeline for sample size assessment is available via https://github.com/MonashOWL/Bioinformatics-IlluminaMGI/tree/main/16S/Source%20variability.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ec7/12092083/cc3d650d43bc/vbaf103f1.jpg
