• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

随机森林统计技术在细菌溯源研究中的新应用。

Novel application of a statistical technique, Random Forests, in a bacterial source tracking study.

机构信息

Department of Life Sciences, Texas A&M University-Corpus Christi, 6300 Ocean Drive, Unit 5800, Corpus Christi, TX 78412, United States.

出版信息

Water Res. 2010 Jul;44(14):4067-76. doi: 10.1016/j.watres.2010.05.019. Epub 2010 May 31.

DOI:10.1016/j.watres.2010.05.019
PMID:20566209
Abstract

In this study, data from bacterial source tracking (BST) analysis using antibiotic resistance profiles were examined using two statistical techniques, Random Forests (RF) and discriminant analysis (DA) to determine sources of fecal contamination of a Texas water body. Cow Trap and Cedar Lakes are potential oyster harvesting waters located in Brazoria County, Texas, that have been listed as impaired for bacteria on the 2004 Texas 303(d) list. Unknown source Escherichia coli were isolated from water samples collected in the study area during two sampling events. Isolates were confirmed as E. coli using carbon source utilization profiles and then analyzed via ARA, following the Kirby-Bauer disk diffusion method. Zone diameters from ARA profiles were analyzed with both DA and RF. Using a two-way classification (human vs nonhuman), both DA and RF categorized over 90% of the 299 unknown source isolates as a nonhuman source. The average rates of correct classification (ARCCs) for the library of 1172 isolates using DA and RF were 74.6% and 82.3%, respectively. ARCCs from RF ranged from 7.7 to 12.0% higher than those from DA. Rates of correct classification (RCCs) for individual sources classified with RF ranged from 23.2 to 0.2% higher than those of DA, with a mean difference of 9.0%. Additional evidence for the outperformance of DA by RF was found in the comparison of training and test set ARCCs and examination of specific disputed isolates; RF produced higher ARCCs (ranging from 8 to 13% higher) than DA for all 1000 trials (excluding the two-way classification, in which RF outperformed DA 999 out of 1000 times). This is of practical significance for analysis of bacterial source tracking data. Overall, based on both DA and RF results, migratory birds were found to be the source of the largest portion of the unknown E. coli isolates. This study is the first known published application of Random Forests in the field of BST.

摘要

在这项研究中,使用两种统计技术,即随机森林(RF)和判别分析(DA),检查了使用抗生素耐药谱进行细菌来源追踪(BST)分析的数据,以确定德克萨斯州水体粪便污染的来源。Cow Trap 和 Cedar Lakes 是位于德克萨斯州布拉佐里亚县的潜在牡蛎捕捞水域,它们在 2004 年德克萨斯州 303(d)清单中因细菌而被列为受损水域。在两次采样事件中,从研究区域采集的水样中分离出未知来源的大肠杆菌。通过碳源利用谱对分离物进行确认,然后使用 Kirby-Bauer 圆盘扩散法通过 ARA 进行分析。使用 DA 和 RF 分析 ARA 谱的带区直径。使用双向分类(人类与非人类),DA 和 RF 均将 299 个未知来源分离物中的 90%以上归类为非人类来源。使用 DA 和 RF 对 1172 个分离物文库进行的平均正确分类率(ARCC)分别为 74.6%和 82.3%。RF 的 ARCC 比 DA 的 ARCC 高 7.7 至 12.0%。使用 RF 分类的各个来源的正确分类率(RCC)比 DA 的高 23.2%至 0.2%,平均差异为 9.0%。在比较训练集和测试集 ARCC 并检查特定有争议的分离物时,还发现了 RF 优于 DA 的更多证据;在 1000 次试验中(不包括双向分类,在双向分类中,RF 在 1000 次中有 999 次优于 DA),RF 对所有 1000 次试验的 ARCC 均产生了更高的结果(范围为高 8%至 13%)。这对于细菌来源追踪数据的分析具有实际意义。总体而言,根据 DA 和 RF 的结果,候鸟被发现是最大部分未知大肠杆菌分离物的来源。本研究是随机森林在 BST 领域的首次已知应用。

相似文献

1
Novel application of a statistical technique, Random Forests, in a bacterial source tracking study.随机森林统计技术在细菌溯源研究中的新应用。
Water Res. 2010 Jul;44(14):4067-76. doi: 10.1016/j.watres.2010.05.019. Epub 2010 May 31.
2
Direct comparison of four bacterial source tracking methods and use of composite data sets.四种细菌溯源方法的直接比较及复合数据集的使用
J Appl Microbiol. 2007 Aug;103(2):350-64. doi: 10.1111/j.1365-2672.2006.03246.x.
3
Considerations when using discriminant function analysis of antimicrobial resistance profiles to identify sources of fecal contamination of surface water in Michigan.运用抗微生物药物耐药性谱的判别函数分析来识别密歇根州地表水体粪便污染来源时的注意事项。
Appl Environ Microbiol. 2007 May;73(9):2878-90. doi: 10.1128/AEM.02376-06. Epub 2007 Mar 2.
4
Choice of indicator organism and library size considerations for phenotypic microbial source tracking by FAME profiling.基于 FAME 分析的表型微生物溯源中指示生物选择和文库大小的考虑因素。
Water Sci Technol. 2009;60(10):2659-68. doi: 10.2166/wst.2009.656.
5
Microbial source tracking in a rural watershed dominated by cattle.以牛为主导的农村流域中的微生物源追踪
Water Res. 2007 Aug;41(16):3729-39. doi: 10.1016/j.watres.2007.04.020. Epub 2007 May 1.
6
Evaluation of antibiotic resistance analysis and ribotyping for identification of faecal pollution sources in an urban watershed.评估抗生素抗性分析和核糖体分型用于识别城市流域粪便污染源的情况。
J Appl Microbiol. 2005;99(3):618-28. doi: 10.1111/j.1365-2672.2005.02612.x.
7
Occurrence of antibiotic resistance in Escherichia coli from surface waters and fecal pollution sources near Hamilton, Ontario.安大略省汉密尔顿附近地表水和粪便污染源中大肠杆菌抗生素耐药性的出现情况。
Can J Microbiol. 2005 Jun;51(6):501-5. doi: 10.1139/w05-028.
8
Identification of the sources of Escherichia coli in a watershed using carbon-utilization patterns and composite data sets.利用碳利用模式和综合数据集识别流域内大肠杆菌的来源
J Water Health. 2008 Jun;6(2):197-207. doi: 10.2166/wh.2008.021.
9
A comparison of ARA and DNA data for microbial source tracking based on source-classification models developed using classification trees.基于使用分类树开发的源分类模型,对用于微生物源追踪的ARA和DNA数据进行比较。
Water Res. 2007 Aug;41(16):3575-84. doi: 10.1016/j.watres.2007.05.026. Epub 2007 May 24.
10
Phenotypic library-based microbial source tracking methods: efficacy in the California collaborative study.基于表型文库的微生物源追踪方法:加利福尼亚合作研究中的效能
J Water Health. 2003 Dec;1(4):153-66.

引用本文的文献

1
Beyond borders: A systematic review and meta-analysis of human-specific faecal markers across geographical settings.超越国界:对不同地理环境下人类特异性粪便标志物的系统评价与荟萃分析。
Crit Rev Environ Sci Technol. 2025 Feb 6;55(7):447-464. doi: 10.1080/10643389.2025.2455031. eCollection 2025.
2
Techniques for learning and transferring knowledge for microbiome-based classification and prediction: review and assessment.基于微生物组的分类和预测的知识学习与转移技术:综述与评估
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf015.
3
Towards more accurate microbial source tracking via non-negative matrix factorization (NMF).
通过非负矩阵分解(NMF)实现更精确的微生物溯源。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i68-i78. doi: 10.1093/bioinformatics/btae227.
4
Ecological dynamics imposes fundamental challenges in community-based microbial source tracking.生态动力学给基于群落的微生物源追踪带来了根本性挑战。
Imeta. 2023 Jan 5;2(1):e75. doi: 10.1002/imt2.75. eCollection 2023 Feb.
5
Distinct Microbiotas Are Associated with Different Production Lines in the Cutting Room of a Swine Slaughterhouse.不同的微生物群与猪屠宰场切割车间的不同生产线相关联。
Microorganisms. 2023 Jan 4;11(1):133. doi: 10.3390/microorganisms11010133.
6
EXPERT: transfer learning-enabled context-aware microbial community classification.专家:基于迁移学习的上下文感知微生物群落分类。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac396.
7
Microbial Dark Matter: from Discovery to Applications.微生物暗物质:从发现到应用。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):867-881. doi: 10.1016/j.gpb.2022.02.007. Epub 2022 Apr 26.
8
Ontology-aware deep learning enables ultrafast and interpretable source tracking among sub-million microbial community samples from hundreds of niches.本体感知深度学习使来自数百个生态位的数百万个微生物群落样本中的源追踪实现超快且可解释。
Genome Med. 2022 Apr 26;14(1):43. doi: 10.1186/s13073-022-01047-5.
9
Evaluating machine learning-powered classification algorithms which utilize variants in the GCKR gene to predict metabolic syndrome: Tehran Cardio-metabolic Genetics Study.利用 GCKR 基因变异预测代谢综合征的机器学习分类算法评估:德黑兰心脏代谢遗传学研究。
J Transl Med. 2022 Apr 9;20(1):164. doi: 10.1186/s12967-022-03349-z.
10
Synergistic Application of Molecular Markers and Community-Based Microbial Source Tracking Methods for Identification of Fecal Pollution in River Water During Dry and Wet Seasons.分子标记与基于群落的微生物源追踪方法在干湿季河水粪便污染识别中的协同应用
Front Microbiol. 2021 Jun 14;12:660368. doi: 10.3389/fmicb.2021.660368. eCollection 2021.