Ho Sung Yang, Wong Limsoon, Goh Wilson Wen Bin
School of Biological Sciences, Nanyang Technological University, Singapore 637551, Singapore.
Department of Computer Science, National University of Singapore, Singapore 117417, Singapore.
Patterns (N Y). 2020 May 8;1(2):100025. doi: 10.1016/j.patter.2020.100025.
Class-prediction accuracy provides a quick but superficial way of determining classifier performance. It does not inform on the reproducibility of the findings or whether the selected or constructed features used are meaningful and specific. Furthermore, the class-prediction accuracy oversummarizes and does not inform on how training and learning have been accomplished: two classifiers providing the same performance in one validation can disagree on many future validations. It does not provide explainability in its decision-making process and is not objective, as its value is also affected by class proportions in the validation set. Despite these issues, this does not mean we should omit the class-prediction accuracy. Instead, it needs to be enriched with accompanying evidence and tests that supplement and contextualize the reported accuracy. This additional evidence serves as augmentations and can help us perform machine learning better while avoiding naive reliance on oversimplified metrics.
类别预测准确率提供了一种快速但表面的确定分类器性能的方法。它没有说明研究结果的可重复性,也没有说明所使用的选定或构建的特征是否有意义和具有特异性。此外,类别预测准确率过于概括,没有说明训练和学习是如何完成的:两个在一次验证中表现相同的分类器在许多未来验证中可能会出现分歧。它在决策过程中不提供可解释性,也不客观,因为其值也受验证集中类别比例的影响。尽管存在这些问题,但这并不意味着我们应该忽略类别预测准确率。相反,它需要用补充和解释所报告准确率的伴随证据和测试来丰富。这些额外的证据作为增强因素,可以帮助我们更好地进行机器学习,同时避免对过于简化的指标的盲目依赖。