Suppr超能文献

避免机器学习中的过度简化:超越类别预测准确率

Avoid Oversimplifications in Machine Learning: Going beyond the Class-Prediction Accuracy.

作者信息

Ho Sung Yang, Wong Limsoon, Goh Wilson Wen Bin

机构信息

School of Biological Sciences, Nanyang Technological University, Singapore 637551, Singapore.

Department of Computer Science, National University of Singapore, Singapore 117417, Singapore.

出版信息

Patterns (N Y). 2020 May 8;1(2):100025. doi: 10.1016/j.patter.2020.100025.

Abstract

Class-prediction accuracy provides a quick but superficial way of determining classifier performance. It does not inform on the reproducibility of the findings or whether the selected or constructed features used are meaningful and specific. Furthermore, the class-prediction accuracy oversummarizes and does not inform on how training and learning have been accomplished: two classifiers providing the same performance in one validation can disagree on many future validations. It does not provide explainability in its decision-making process and is not objective, as its value is also affected by class proportions in the validation set. Despite these issues, this does not mean we should omit the class-prediction accuracy. Instead, it needs to be enriched with accompanying evidence and tests that supplement and contextualize the reported accuracy. This additional evidence serves as augmentations and can help us perform machine learning better while avoiding naive reliance on oversimplified metrics.

摘要

类别预测准确率提供了一种快速但表面的确定分类器性能的方法。它没有说明研究结果的可重复性,也没有说明所使用的选定或构建的特征是否有意义和具有特异性。此外,类别预测准确率过于概括,没有说明训练和学习是如何完成的:两个在一次验证中表现相同的分类器在许多未来验证中可能会出现分歧。它在决策过程中不提供可解释性,也不客观,因为其值也受验证集中类别比例的影响。尽管存在这些问题,但这并不意味着我们应该忽略类别预测准确率。相反,它需要用补充和解释所报告准确率的伴随证据和测试来丰富。这些额外的证据作为增强因素,可以帮助我们更好地进行机器学习,同时避免对过于简化的指标的盲目依赖。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a00/7660406/f9557e0da735/fx1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验