Turrado Concepción Crespo, López María Del Carmen Meizoso, Lasheras Fernando Sánchez, Gómez Benigno Antonio Rodríguez, Rollé José Luis Calvo, Juez Francisco Javier de Cos
Maintenance Department, University of Oviedo, San Francisco 3, Oviedo 3307, Spain.
Departamento de Ingeniería Industrial, University of A Coruña, A Coruña 15405, Spain.
Sensors (Basel). 2014 Oct 29;14(11):20382-99. doi: 10.3390/s141120382.
Global solar broadband irradiance on a planar surface is measured at weather stations by pyranometers. In the case of the present research, solar radiation values from nine meteorological stations of the MeteoGalicia real-time observational network, captured and stored every ten minutes, are considered. In this kind of record, the lack of data and/or the presence of wrong values adversely affects any time series study. Consequently, when this occurs, a data imputation process must be performed in order to replace missing data with estimated values. This paper aims to evaluate the multivariate imputation of ten-minute scale data by means of the chained equations method (MICE). This method allows the network itself to impute the missing or wrong data of a solar radiation sensor, by using either all or just a group of the measurements of the remaining sensors. Very good results have been obtained with the MICE method in comparison with other methods employed in this field such as Inverse Distance Weighting (IDW) and Multiple Linear Regression (MLR). The average RMSE value of the predictions for the MICE algorithm was 13.37% while that for the MLR it was 28.19%, and 31.68% for the IDW.
全球平面表面的太阳宽带辐照度由气象站的总辐射表进行测量。在本研究中,考虑了来自加利西亚气象实时观测网络九个气象站的太阳辐射值,这些数据每十分钟采集并存储一次。在这种记录中,数据缺失和/或存在错误值会对任何时间序列研究产生不利影响。因此,当出现这种情况时,必须执行数据插补过程,以便用估计值替换缺失数据。本文旨在通过链式方程法(MICE)评估十分钟尺度数据的多变量插补。该方法允许网络自身通过使用其余传感器的全部或一组测量值来插补太阳辐射传感器的缺失或错误数据。与该领域使用的其他方法(如反距离加权法(IDW)和多元线性回归法(MLR))相比,MICE方法取得了非常好的结果。MICE算法预测的平均均方根误差值为13.37%,而MLR为28.19%,IDW为31.68%。