• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机的统计分类模型鉴定地理上无关联样本的粪便污染来源。

Identification the source of fecal contamination for geographically unassociated samples with a statistical classification model based on support vector machine.

机构信息

State Key Laboratory of Pollution Control and Resource Reuse, Tongji University, Shanghai 200092, China; College of Environmental Science and Engineering, Tongji University, Shanghai 200092, China.

State Key Laboratory of Pollution Control and Resource Reuse, Tongji University, Shanghai 200092, China; College of Environmental Science and Engineering, Tongji University, Shanghai 200092, China.

出版信息

J Hazard Mater. 2021 Apr 5;407:124821. doi: 10.1016/j.jhazmat.2020.124821. Epub 2020 Dec 11.

DOI:10.1016/j.jhazmat.2020.124821
PMID:33340974
Abstract

The bacterial diversity and corresponding biological significance revealed by high-throughput sequencing contribute massive information to source tracking of fecal contamination. The performances of classification models on predicting the fecal source of geographical local and foreign samples were examined herein, by applying support vector machine (SVM) algorithm. Random forest (RF) and Adaboost were applied for comparison as well. Discriminatory sequences were selected from Clostridiale, Bacteroidales, or Lactobacillales bacterial groups using extremely randomized trees (ExtraTrees). 1.51-12.64% of the unique sequences in the original library composed the representative markers, and they contributed 70% of the discrepancies between source microbiomes. The overall accuracy of the SVM model and the RF model on local samples was 96.08% and 98.04%, respectively, higher than that of the Adaboost (90.20%). As for the non-local samples, the SVM assigned most of the fecal samples into the correct category while several false-positive judgments occurred in closely related groups. The results in this paper suggested that the SVM was a time-saving and accurate method for fecal source tracking in contaminated water body with the potential capability of executing tasks based on geographically unassociated samples, and underlined the necessity of qPCR analysis for accurate detection of human source pollution.

摘要

高通量测序揭示的细菌多样性及其相应的生物学意义为粪便污染的溯源提供了大量信息。本文应用支持向量机(SVM)算法,考察了分类模型对预测地理来源和外国样本粪便来源的性能,并与随机森林(RF)和自适应增强(Adaboost)进行了比较。使用极端随机树(ExtraTrees)从梭菌目、拟杆菌目或乳杆菌目中选择了有区别性的序列。原始文库中独特序列的 1.51%-12.64%组成了代表性标记,它们对源微生物组之间的差异贡献了 70%。SVM 模型和 RF 模型对本地样本的总体准确率分别为 96.08%和 98.04%,高于 Adaboost(90.20%)。对于非本地样本,SVM 将大多数粪便样本分配到正确的类别,但在密切相关的组中也出现了一些假阳性判断。本文的结果表明,SVM 是一种用于受污染水体中粪便溯源的省时、准确的方法,具有基于地理上不相关的样本执行任务的潜力,并强调了 qPCR 分析对准确检测人为污染的必要性。

相似文献

1
Identification the source of fecal contamination for geographically unassociated samples with a statistical classification model based on support vector machine.基于支持向量机的统计分类模型鉴定地理上无关联样本的粪便污染来源。
J Hazard Mater. 2021 Apr 5;407:124821. doi: 10.1016/j.jhazmat.2020.124821. Epub 2020 Dec 11.
2
Fecal source identification using random forest.使用随机森林进行粪便来源识别。
Microbiome. 2018 Oct 18;6(1):185. doi: 10.1186/s40168-018-0568-3.
3
Quantitative identification of fecal water pollution sources by TaqMan real-time PCR assays using Bacteroidales 16S rRNA genetic markers.利用 Bacteroidales 16S rRNA 遗传标记的 TaqMan 实时 PCR 分析法定量鉴定粪便水污染来源。
Appl Microbiol Biotechnol. 2010 Dec;88(6):1373-83. doi: 10.1007/s00253-010-2880-0. Epub 2010 Sep 25.
4
Verification of Bacteroidales 16S rRNA markers as a complementary tool for detecting swine fecal pollution in the Yangtze Delta.验证拟杆菌门 16S rRNA 标记作为一种补充工具,用于检测长江三角洲地区猪粪便污染。
J Environ Sci (China). 2020 Apr;90:59-66. doi: 10.1016/j.jes.2019.11.016. Epub 2019 Dec 9.
5
Evaluation of two library-independent microbial source tracking methods to identify sources of fecal contamination in French estuaries.评估两种不依赖文库的微生物源追踪方法以识别法国河口粪便污染来源
Appl Environ Microbiol. 2007 Aug;73(15):4857-66. doi: 10.1128/AEM.03003-06. Epub 2007 Jun 8.
6
Performance evaluation of canine-associated Bacteroidales assays in a multi-laboratory comparison study.犬源拟杆菌目检测方法在多实验室比较研究中的性能评估。
Water Res. 2013 Nov 15;47(18):6909-20. doi: 10.1016/j.watres.2013.03.062. Epub 2013 Jul 5.
7
Tracing fecal pollution sources in karst groundwater by Bacteroidales genetic biomarkers, bacterial indicators, and environmental variables.利用 Bacteroidales 遗传生物标志物、细菌指标和环境变量追踪喀斯特地下水的粪便污染源。
Sci Total Environ. 2014 Aug 15;490:1082-90. doi: 10.1016/j.scitotenv.2014.05.086. Epub 2014 Jun 10.
8
Assessing hog lagoon waste contamination in the Cape Fear Watershed using Bacteroidetes 16S rRNA gene pyrosequencing.使用 Bacteroidetes 16S rRNA 基因焦磷酸测序评估开普菲尔流域猪 lagoon 废物污染。
Appl Microbiol Biotechnol. 2015 Sep;99(17):7283-93. doi: 10.1007/s00253-015-6784-x. Epub 2015 Jul 19.
9
Uneven genotypic diversity of in fecal sources limits the performance of a library-dependent method of microbial source tracking on the southwestern French Atlantic coast.
Can J Microbiol. 2020 Dec;66(12):698-712. doi: 10.1139/cjm-2020-0244. Epub 2020 Jul 30.
10
Toolbox Approaches Using Molecular Markers and 16S rRNA Gene Amplicon Data Sets for Identification of Fecal Pollution in Surface Water.利用分子标记和16S rRNA基因扩增子数据集识别地表水中粪便污染的工具箱方法
Appl Environ Microbiol. 2015 Oct;81(20):7067-77. doi: 10.1128/AEM.02032-15. Epub 2015 Jul 31.

引用本文的文献

1
Prediction Model for Risk of Death in Elderly Critically Ill Patients with Kidney Failure.老年重症肾衰竭患者死亡风险预测模型
Medicina (Kaunas). 2025 Apr 1;61(4):640. doi: 10.3390/medicina61040640.
2
Identification of neutrophil extracellular trap-related biomarkers in non-alcoholic fatty liver disease through machine learning and single-cell analysis.通过机器学习和单细胞分析鉴定非酒精性脂肪性肝病中的中性粒细胞胞外诱捕相关生物标志物。
Sci Rep. 2024 Sep 10;14(1):21085. doi: 10.1038/s41598-024-72151-2.
3
A deep learning system for heart failure mortality prediction.
深度学习系统预测心力衰竭死亡率。
PLoS One. 2023 Feb 24;18(2):e0276835. doi: 10.1371/journal.pone.0276835. eCollection 2023.
4
Water Quality Evaluation and Pollution Source Apportionment of Surface Water in a Major City in Southeast China Using Multi-Statistical Analyses and Machine Learning Models.基于多元统计分析和机器学习模型的中国东南部一主要城市地表水水质评价及污染源解析
Int J Environ Res Public Health. 2023 Jan 3;20(1):881. doi: 10.3390/ijerph20010881.