• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多水平机器学习分类器比较及其性能指标。

Multi-Level Comparison of Machine Learning Classifiers and Their Performance Metrics.

机构信息

Plasma Chemistry Research Group, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudósok krt. 2, H-1117 Budapest, Hungary.

Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudósok krt. 2, H-1117 Budapest, Hungary.

出版信息

Molecules. 2019 Aug 1;24(15):2811. doi: 10.3390/molecules24152811.

DOI:10.3390/molecules24152811
PMID:31374986
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6695655/
Abstract

Machine learning classification algorithms are widely used for the prediction and classification of the different properties of molecules such as toxicity or biological activity. the prediction of toxic vs. non-toxic molecules is important due to testing on living animals, which has ethical and cost drawbacks as well. The quality of classification models can be determined with several performance parameters. which often give conflicting results. In this study, we performed a multi-level comparison with the use of different performance metrics and machine learning classification methods. Well-established and standardized protocols for the machine learning tasks were used in each case. The comparison was applied to three datasets (acute and aquatic toxicities) and the robust, yet sensitive, sum of ranking differences (SRD) and analysis of variance (ANOVA) were applied for evaluation. The effect of dataset composition (balanced vs. imbalanced) and 2-class vs. multiclass classification scenarios was also studied. Most of the performance metrics are sensitive to dataset composition, especially in 2-class classification problems. The optimal machine learning algorithm also depends significantly on the composition of the dataset.

摘要

机器学习分类算法被广泛应用于预测和分类分子的不同性质,如毒性或生物活性。由于对活体动物进行测试,预测有毒和无毒分子具有伦理和成本方面的缺点。分类模型的质量可以通过几个性能参数来确定,但这些参数往往会给出相互矛盾的结果。在这项研究中,我们使用不同的性能指标和机器学习分类方法进行了多层次的比较。在每种情况下,都使用了经过良好验证和标准化的机器学习任务协议。该比较应用于三个数据集(急性毒性和水生毒性),并应用稳健但敏感的排序差异总和(SRD)和方差分析(ANOVA)进行评估。还研究了数据集组成(平衡与不平衡)和 2 类与多类分类场景的影响。大多数性能指标对数据集的组成非常敏感,尤其是在 2 类分类问题中。最佳的机器学习算法也严重依赖于数据集的组成。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/a321a0e41766/molecules-24-02811-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/40bf56c2faa9/molecules-24-02811-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/66e5103fcbe9/molecules-24-02811-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/9b93c4518e8a/molecules-24-02811-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/0c465a0e53ea/molecules-24-02811-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/cac8bde57526/molecules-24-02811-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/0acf3b960e52/molecules-24-02811-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/b8b9c32ffb1d/molecules-24-02811-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/c77c44228c7a/molecules-24-02811-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/a321a0e41766/molecules-24-02811-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/40bf56c2faa9/molecules-24-02811-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/66e5103fcbe9/molecules-24-02811-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/9b93c4518e8a/molecules-24-02811-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/0c465a0e53ea/molecules-24-02811-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/cac8bde57526/molecules-24-02811-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/0acf3b960e52/molecules-24-02811-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/b8b9c32ffb1d/molecules-24-02811-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/c77c44228c7a/molecules-24-02811-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6d7/6695655/a321a0e41766/molecules-24-02811-g009.jpg

相似文献

1
Multi-Level Comparison of Machine Learning Classifiers and Their Performance Metrics.多水平机器学习分类器比较及其性能指标。
Molecules. 2019 Aug 1;24(15):2811. doi: 10.3390/molecules24152811.
2
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
3
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
4
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
5
Approaches for predicting dairy cattle methane emissions: from traditional methods to machine learning.预测奶牛甲烷排放的方法:从传统方法到机器学习。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae219.
6
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
7
Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.关于使用人工智能评估临床数据完整性并生成元数据的提案:算法开发与验证
JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
9
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
10
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

引用本文的文献

1
Assessment of university students' earthquake coping strategies using artificial intelligence methods.运用人工智能方法评估大学生的地震应对策略
Sci Rep. 2025 Aug 29;15(1):31897. doi: 10.1038/s41598-025-17555-4.
2
Comparing machine learning models with a focus on tone in grooming chat logs.比较专注于梳理聊天记录语气的机器学习模型。
Front Pediatr. 2025 Jun 19;13:1591828. doi: 10.3389/fped.2025.1591828. eCollection 2025.
3
Aerial image segmentation of embankment dams based on multispectral remote sensing: a case study in the Belo Monte Hydroelectric Complex, Pará, Brazil.

本文引用的文献

1
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
2
Applications of machine learning in drug discovery and development.机器学习在药物发现和开发中的应用。
Nat Rev Drug Discov. 2019 Jun;18(6):463-477. doi: 10.1038/s41573-019-0024-5.
3
Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR.分子描述符预选的 QSAR/QSPR 中的互相关限制。
基于多光谱遥感的堤坝式水坝航空图像分割:以巴西帕拉州贝洛蒙特水电站综合体为例
PeerJ Comput Sci. 2025 Jun 16;11:e2917. doi: 10.7717/peerj-cs.2917. eCollection 2025.
4
Direct profiling of non-adenosines in poly(A) tails of endogenous and therapeutic mRNAs with Ninetails.使用九尾狐对内源和治疗性mRNA的聚腺苷酸尾中的非腺苷进行直接分析。
Nat Commun. 2025 Mar 18;16(1):2664. doi: 10.1038/s41467-025-57787-6.
5
Machine learning assisted classification RASAR modeling for the nephrotoxicity potential of a curated set of orally active drugs.机器学习辅助分类RASAR模型用于一组精选口服活性药物的肾毒性潜力评估
Sci Rep. 2025 Jan 4;15(1):808. doi: 10.1038/s41598-024-85063-y.
6
Machine learning algorithms able to predict the prognosis of gastric cancer patients treated with immune checkpoint inhibitors.能够预测接受免疫检查点抑制剂治疗的胃癌患者预后的机器学习算法。
World J Gastroenterol. 2024 Oct 28;30(40):4354-4366. doi: 10.3748/wjg.v30.i40.4354.
7
CSEL-BGC: A Bioinformatics Framework Integrating Machine Learning for Defining the Biosynthetic Evolutionary Landscape of Uncharacterized Antibacterial Natural Products.CSEL-BGC:一种整合机器学习以定义未表征抗菌天然产物生物合成进化格局的生物信息学框架。
Interdiscip Sci. 2025 Mar;17(1):27-41. doi: 10.1007/s12539-024-00656-5. Epub 2024 Sep 30.
8
In Silico Exploration of Novel EGFR Kinase Mutant-Selective Inhibitors Using a Hybrid Computational Approach.使用混合计算方法对新型表皮生长因子受体激酶突变体选择性抑制剂进行计算机模拟探索
Pharmaceuticals (Basel). 2024 Aug 23;17(9):1107. doi: 10.3390/ph17091107.
9
The application of chemical similarity measures in an unconventional modeling framework c-RASAR along with dimensionality reduction techniques to a representative hepatotoxicity dataset.化学相似性度量在非常规建模框架 c-RASAR 中的应用以及降维技术在具有代表性的肝毒性数据集上的应用。
Sci Rep. 2024 Sep 6;14(1):20812. doi: 10.1038/s41598-024-71892-4.
10
Harnessing Machine Learning to Uncover Hidden Patterns in Azole-Resistant CYP51/ERG11 Proteins.利用机器学习揭示唑类抗性CYP51/ERG11蛋白中的隐藏模式。
Microorganisms. 2024 Jul 25;12(8):1525. doi: 10.3390/microorganisms12081525.
Mol Inform. 2019 Aug;38(8-9):e1800154. doi: 10.1002/minf.201800154. Epub 2019 Apr 4.
4
Best Practices for QSAR Model Reporting: Physical and Chemical Properties, Ecotoxicity, Environmental Fate, Human Health, and Toxicokinetics Endpoints.QSAR 模型报告的最佳实践:物理化学性质、生态毒性、环境 fate、人类健康和毒代动力学终点。
Environ Health Perspect. 2018 Dec;126(12):126001. doi: 10.1289/EHP3264.
5
Modelling methods and cross-validation variants in QSAR: a multi-level analysis.QSAR 中的建模方法和交叉验证变体:多层次分析。
SAR QSAR Environ Res. 2018 Sep;29(9):661-674. doi: 10.1080/1062936X.2018.1505778. Epub 2018 Aug 30.
6
The rise of deep learning in drug discovery.深度学习在药物发现中的崛起。
Drug Discov Today. 2018 Jun;23(6):1241-1250. doi: 10.1016/j.drudis.2018.01.039. Epub 2018 Jan 31.
7
Multivariate assessment of lipophilicity scales-computational and reversed phase thin-layer chromatographic indices.亲脂性量表的多变量评估——计算和反相薄层色谱指标
J Pharm Biomed Anal. 2016 Aug 5;127:81-93. doi: 10.1016/j.jpba.2016.04.001. Epub 2016 Apr 1.
8
Consistency of QSAR models: Correct split of training and test sets, ranking of models and performance parameters.定量构效关系(QSAR)模型的一致性:训练集和测试集的正确划分、模型排名及性能参数
SAR QSAR Environ Res. 2015;26(7-9):683-700. doi: 10.1080/1062936X.2015.1084647. Epub 2015 Oct 5.
9
Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?为什么田本系数是基于指纹的相似性计算的合适选择?
J Cheminform. 2015 May 20;7:20. doi: 10.1186/s13321-015-0069-3. eCollection 2015.
10
Count on kappa.信赖卡帕。
J Comput Aided Mol Des. 2014 Nov;28(11):1049-55. doi: 10.1007/s10822-014-9759-6. Epub 2014 Jul 11.