在回归分析评估中，决定系数R平方比对称平均绝对百分比误差（SMAPE）、平均绝对误差（MAE）、平均绝对百分比误差（MAPE）、均方误差（MSE）和均方根误差（RMSE）更具信息量。

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.

作者信息

Chicco Davide, Warrens Matthijs J, Jurman Giuseppe

机构信息

Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, Canada.

Groningen Institute for Educational Research, University of Groningen, Groningen, Netherlands.

出版信息

PeerJ Comput Sci. 2021 Jul 5;7:e623. doi: 10.7717/peerj-cs.623. eCollection 2021.

DOI:10.7717/peerj-cs.623

PMID:34307865

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8279135/

Abstract

Regression analysis makes up a large part of supervised machine learning, and consists of the prediction of a continuous independent target from a set of other predictor variables. The difference between binary classification and regression is in the target range: in binary classification, the target can have only two values (usually encoded as 0 and 1), while in regression the target can have multiple values. Even if regression analysis has been employed in a huge number of machine learning studies, no consensus has been reached on a single, unified, standard metric to assess the results of the regression itself. Many studies employ the mean square error (MSE) and its rooted variant (RMSE), or the mean absolute error (MAE) and its percentage variant (MAPE). Although useful, these rates share a common drawback: since their values can range between zero and +infinity, a single value of them does not say much about the performance of the regression with respect to the distribution of the ground truth elements. In this study, we focus on two rates that actually generate a high score only if the majority of the elements of a ground truth group has been correctly predicted: the coefficient of determination (also known as -squared or ) and the symmetric mean absolute percentage error (SMAPE). After showing their mathematical properties, we report a comparison between and SMAPE in several use cases and in two real medical scenarios. Our results demonstrate that the coefficient of determination (-squared) is more informative and truthful than SMAPE, and does not have the interpretability limitations of MSE, RMSE, MAE and MAPE. We therefore suggest the usage of -squared as standard metric to evaluate regression analyses in any scientific domain.

摘要

回归分析在有监督机器学习中占据很大一部分，它由根据一组其他预测变量对连续独立目标进行预测组成。二元分类和回归之间的区别在于目标范围：在二元分类中，目标只能有两个值（通常编码为0和1），而在回归中，目标可以有多个值。即使回归分析已在大量机器学习研究中得到应用，但对于评估回归本身结果的单一、统一标准指标尚未达成共识。许多研究采用均方误差（MSE）及其开方变体（RMSE），或平均绝对误差（MAE）及其百分比变体（MAPE）。尽管这些指标很有用，但它们有一个共同的缺点：由于其值可以在零到正无穷之间变化，单个值对于回归相对于真实元素分布的性能说明不多。在本研究中，我们关注两个只有在真实组的大多数元素被正确预测时才会产生高分的指标：决定系数（也称为R平方或R²）和对称平均绝对百分比误差（SMAPE）。在展示了它们的数学性质之后，我们报告了在几个用例和两个实际医疗场景中R²和SMAPE之间的比较。我们的结果表明，决定系数（R平方）比SMAPE更具信息性和真实性，并且没有MSE、RMSE、MAE和MAPE的解释局限性。因此，我们建议使用R平方作为评估任何科学领域回归分析的标准指标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2775/8279135/65120efdfeb2/peerj-cs-07-623-g001.jpg

相似文献

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.

PeerJ Comput Sci. 2021 Jul 5;7:e623. doi: 10.7717/peerj-cs.623. eCollection 2021.

BenchMetrics Prob: benchmarking of probabilistic error/loss performance evaluation instruments for binary classification problems.

Int J Mach Learn Cybern. 2023 Apr 19:1-31. doi: 10.1007/s13042-023-01826-5.

Regression analysis for detecting epileptic seizure with different feature extracting strategies.

Biomed Tech (Berl). 2019 Dec 18;64(6):619-642. doi: 10.1515/bmt-2018-0012.

Consultation length and no-show prediction for improving appointment scheduling efficiency at a cardiology clinic: A data analytics approach.

Int J Med Inform. 2021 Jan;145:104290. doi: 10.1016/j.ijmedinf.2020.104290. Epub 2020 Oct 1.

Chasing the objective upper eyelid symmetry formula; R, RMSE, POC, MAE, and MSE.

Int Ophthalmol. 2024 Jul 2;44(1):303. doi: 10.1007/s10792-024-03157-y.

Application of Artificial Neural Network Over Nickel-Based Catalyst for Combined Steam-Carbon Dioxide of Methane Reforming (CSDRM).

J Nanosci Nanotechnol. 2020 Sep 1;20(9):5716-5719. doi: 10.1166/jnn.2020.17627.

Predicting Readmission Charges Billed by Hospitals: Machine Learning Approach.

JMIR Med Inform. 2022 Aug 30;10(8):e37578. doi: 10.2196/37578.

Prediction of Serum Creatinine in Hemodialysis Patients Using a Kernel Approach for Longitudinal Data.

Healthc Inform Res. 2020 Apr;26(2):112-118. doi: 10.4258/hir.2020.26.2.112. Epub 2020 Apr 30.

Time-aware forecasting of search volume categories and actual purchase.

Heliyon. 2024 Jan 19;10(3):e25034. doi: 10.1016/j.heliyon.2024.e25034. eCollection 2024 Feb 15.

Advanced machine learning approaches for predicting permeability in reservoir pay zones based on core analyses.

Heliyon. 2024 Jun 11;10(12):e32666. doi: 10.1016/j.heliyon.2024.e32666. eCollection 2024 Jun 30.

引用本文的文献

AI-assisted discovery of potent FGFR1 inhibitors via virtual screening and in silico analysis.

PLoS One. 2025 Sep 11;20(9):e0331837. doi: 10.1371/journal.pone.0331837. eCollection 2025.

NFEmbed: modeling nitrogenase activity via classification and regression with pretrained protein embeddings.

Bioinform Adv. 2025 Aug 23;5(1):vbaf204. doi: 10.1093/bioadv/vbaf204. eCollection 2025.

Humans exercising in the heat: A review on sweat models and a comparison to recent experimental datasets.

Temperature (Austin). 2025 Jun 5;12(3):209-230. doi: 10.1080/23328940.2025.2508534. eCollection 2025.

Testing a Susceptible Population Density Among Other Explanatory Factors of African Swine Fever Spread in Wild Boar Using the Russian Federation Data, 2007-2023.

Transbound Emerg Dis. 2025 Aug 28;2025:6569042. doi: 10.1155/tbed/6569042. eCollection 2025.

Optimizing blood-brain barrier permeability in KRAS inhibitors: A structure-constrained molecular generation approach.

J Pharm Anal. 2025 Aug;15(8):101337. doi: 10.1016/j.jpha.2025.101337. Epub 2025 May 9.

Programmable ultrasonic modulation of viscoelasticity in polymer-based elastomers: Experiments and constitutive modeling.

Ultrason Sonochem. 2025 Aug 24;121:107517. doi: 10.1016/j.ultsonch.2025.107517.

CardioFit: a WebGL-based tool for fast and efficient parametrization of cardiac action potential models to fit user-provided data.

R Soc Open Sci. 2025 Aug 27;12(8):250048. doi: 10.1098/rsos.250048. eCollection 2025 Aug.

Predicting EGFR Inhibitory Effect of Osimertinib Derivatives by Mixed Kernel SVM Enhanced with CLPSO.

Pharmaceuticals (Basel). 2025 Jul 23;18(8):1092. doi: 10.3390/ph18081092.

Prediction of Soil Properties Using Vis-NIR Spectroscopy Combined with Machine Learning: A Review.

Sensors (Basel). 2025 Aug 14;25(16):5045. doi: 10.3390/s25165045.

Estimation of Total Hemoglobin (SpHb) from Facial Videos Using 3D Convolutional Neural Network-Based Regression.

Biosensors (Basel). 2025 Jul 25;15(8):485. doi: 10.3390/bios15080485.

本文引用的文献

Classifier uncertainty: evidence, potential impact, and probabilistic treatment.

PeerJ Comput Sci. 2021 Mar 4;7:e398. doi: 10.7717/peerj-cs.398. eCollection 2021.

The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation.

BioData Min. 2021 Feb 4;14(1):13. doi: 10.1186/s13040-021-00244-z.

A novel framework for COVID-19 case prediction through piecewise regression in India.

Int J Inf Technol. 2021;13(1):41-48. doi: 10.1007/s41870-020-00552-3. Epub 2020 Nov 10.

Count regression models for COVID-19.

Physica A. 2021 Feb 1;563:125460. doi: 10.1016/j.physa.2020.125460. Epub 2020 Oct 31.

Benign overfitting in linear regression.

Proc Natl Acad Sci U S A. 2020 Dec 1;117(48):30063-30070. doi: 10.1073/pnas.1907378117. Epub 2020 Apr 24.

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.

BMC Genomics. 2020 Jan 2;21(1):6. doi: 10.1186/s12864-019-6413-7.

Dataset for estimation of obesity levels based on eating habits and physical condition in individuals from Colombia, Peru and Mexico.

Data Brief. 2019 Aug 2;25:104344. doi: 10.1016/j.dib.2019.104344. eCollection 2019 Aug.

A coefficient of determination (R ) for generalized linear mixed models.

Biom J. 2019 Jul;61(4):860-872. doi: 10.1002/bimj.201800270. Epub 2019 Apr 8.

The coefficient of determination and intra-class correlation coefficient from generalized linear mixed-effects models revisited and expanded.

J R Soc Interface. 2017 Sep;14(134). doi: 10.1098/rsif.2017.0213. Epub 2017 Sep 13.

A new accuracy measure based on bounded relative error for time series forecasting.

PLoS One. 2017 Mar 24;12(3):e0174202. doi: 10.1371/journal.pone.0174202. eCollection 2017.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在回归分析评估中，决定系数R平方比对称平均绝对百分比误差（SMAPE）、平均绝对误差（MAE）、平均绝对百分比误差（MAPE）、均方误差（MSE）和均方根误差（RMSE）更具信息量。

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.

作者信息

Chicco Davide, Warrens Matthijs J, Jurman Giuseppe

机构信息

Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, Canada.

Groningen Institute for Educational Research, University of Groningen, Groningen, Netherlands.

出版信息

PeerJ Comput Sci. 2021 Jul 5;7:e623. doi: 10.7717/peerj-cs.623. eCollection 2021.

DOI:10.7717/peerj-cs.623

PMID:34307865

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8279135/

Abstract

摘要

在回归分析评估中，决定系数R平方比对称平均绝对百分比误差（SMAPE）、平均绝对误差（MAE）、平均绝对百分比误差（MAPE）、均方误差（MSE）和均方根误差（RMSE）更具信息量。

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

在回归分析评估中，决定系数R平方比对称平均绝对百分比误差（SMAPE）、平均绝对误差（MAE）、平均绝对百分比误差（MAPE）、均方误差（MSE）和均方根误差（RMSE）更具信息量。

The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献