Hammad Issam, El-Sankary Kamal
Department of Electrical and Computer Engineering, Dalhousie University, Halifax, NS B3H 4R2, Canada.
Sensors (Basel). 2019 Aug 9;19(16):3491. doi: 10.3390/s19163491.
Accuracy evaluation in machine learning is based on the split of data into a training set and a test set. This critical step is applied to develop machine learning models including models based on sensor data. For sensor-based problems, comparing the accuracy of machine learning models using the train/test split provides only a baseline comparison in ideal situations. Such comparisons won't consider practical production problems that can impact the inference accuracy such as the sensors' thermal noise, performance with lower inference quantization, and tolerance to sensor failure. Therefore, this paper proposes a set of practical tests that can be applied when comparing the accuracy of machine learning models for sensor-based problems. First, the impact of the sensors' thermal noise on the models' inference accuracy was simulated. Machine learning algorithms have different levels of error resilience to thermal noise, as will be presented. Second, the models' accuracy using lower inference quantization was compared. Lowering inference quantization leads to lowering the analog-to-digital converter (ADC) resolution which is cost-effective in embedded designs. Moreover, in custom designs, analog-to-digital converters' (ADCs) effective number of bits (ENOB) is usually lower than the ideal number of bits due to various design factors. Therefore, it is practical to compare models' accuracy using lower inference quantization. Third, the models' accuracy tolerance to sensor failure was evaluated and compared. For this study, University of California Irvine (UCI) 'Daily and Sports Activities' dataset was used to present these practical tests and their impact on model selection.
机器学习中的准确性评估基于将数据划分为训练集和测试集。这一关键步骤用于开发机器学习模型,包括基于传感器数据的模型。对于基于传感器的问题,使用训练/测试划分来比较机器学习模型的准确性,仅能在理想情况下提供一个基线比较。此类比较不会考虑可能影响推理准确性的实际生产问题,例如传感器的热噪声、较低推理量化时的性能以及对传感器故障的容忍度。因此,本文提出了一组实用测试,可在比较基于传感器问题的机器学习模型准确性时应用。首先,模拟了传感器热噪声对模型推理准确性的影响。如将展示的那样,机器学习算法对热噪声具有不同程度的误差弹性。其次,比较了使用较低推理量化时模型的准确性。降低推理量化会导致降低模数转换器(ADC)的分辨率,这在嵌入式设计中具有成本效益。此外,在定制设计中,由于各种设计因素,模数转换器(ADC)的有效位数(ENOB)通常低于理想位数。因此,比较使用较低推理量化时模型的准确性是切实可行的。第三,评估并比较了模型对传感器故障的准确性容忍度。在本研究中,使用了加利福尼亚大学欧文分校(UCI)的“日常和体育活动”数据集来展示这些实用测试及其对模型选择的影响。