Suppr超能文献

基于文库的微生物源追踪方法中所使用统计方法的评估

Assessment of statistical methods used in library-based approaches to microbial source tracking.

作者信息

Ritter Kerry J, Carruthers Ethan, Carson C Andrew, Ellender R D, Harwood Valerie J, Kingsley Kyle, Nakatsu Cindy, Sadowsky Michael, Shear Brian, West Brian, Whitlock John E, Wiggins Bruce A, Wilbur Jayson D

机构信息

Southern California Coastal Water Research Project, Westminster, CA 92863, USA.

出版信息

J Water Health. 2003 Dec;1(4):209-23.

Abstract

Several commonly used statistical methods for fingerprint identification in microbial source tracking (MST) were examined to assess the effectiveness of pattern-matching algorithms to correctly identify sources. Although numerous statistical methods have been employed for source identification, no widespread consensus exists as to which is most appropriate. A large-scale comparison of several MST methods, using identical fecal sources, presented a unique opportunity to assess the utility of several popular statistical methods. These included discriminant analysis, nearest neighbour analysis, maximum similarity and average similarity, along with several measures of distance or similarity. Threshold criteria for excluding uncertain or poorly matched isolates from final analysis were also examined for their ability to reduce false positives and increase prediction success. Six independent libraries used in the study were constructed from indicator bacteria isolated from fecal materials of humans, seagulls, cows and dogs. Three of these libraries were constructed using the rep-PCR technique and three relied on antibiotic resistance analysis (ARA). Five of the libraries were constructed using Escherichia coli and one using Enterococcus spp. (ARA). Overall, the outcome of this study suggests a high degree of variability across statistical methods. Despite large differences in correct classification rates among the statistical methods, no single statistical approach emerged as superior. Thresholds failed to consistently increase rates of correct classification and improvement was often associated with substantial effective sample size reduction. Recommendations are provided to aid in selecting appropriate analyses for these types of data.

摘要

研究了微生物源追踪(MST)中几种常用的指纹识别统计方法,以评估模式匹配算法正确识别源的有效性。尽管已采用多种统计方法进行源识别,但对于哪种方法最合适,尚未达成广泛共识。使用相同粪便源对几种MST方法进行大规模比较,为评估几种常用统计方法的效用提供了独特机会。这些方法包括判别分析、最近邻分析、最大相似度和平均相似度,以及几种距离或相似度度量。还研究了在最终分析中排除不确定或匹配不佳的分离株的阈值标准,以评估其减少假阳性和提高预测成功率的能力。本研究中使用的六个独立文库由从人类、海鸥、奶牛和狗的粪便材料中分离出的指示菌构建而成。其中三个文库采用rep-PCR技术构建,另外三个依赖抗生素抗性分析(ARA)。五个文库使用大肠杆菌构建,一个使用肠球菌属(ARA)构建。总体而言,本研究结果表明不同统计方法之间存在高度变异性。尽管统计方法之间的正确分类率存在很大差异,但没有一种统计方法表现出明显优势。阈值未能持续提高正确分类率,且改进往往伴随着有效样本量的大幅减少。本文提供了相关建议,以帮助为这类数据选择合适的分析方法。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验