Hou Dake, Zhou Wenli, Zhang Qiuxia, Zhang Kun, Fang Jiaqi
School of Mathematics, Shandong University, Jinan, China.
School of Business, Wenzhou Univeristy, Wenzhou, China.
PeerJ Comput Sci. 2023 Aug 16;9:e1522. doi: 10.7717/peerj-cs.1522. eCollection 2023.
This study employs the principles of computer science and statistics to evaluate the efficacy of the linear random effect model, utilizing Lasso variable selection techniques (including Lasso, Elastic-Net, Adaptive-Lasso, and SCAD) through numerical simulation and empirical research. The analysis focuses on the model's consistency in variable selection, prediction accuracy, stability, and efficiency. This study employs a novel approach to assess the consistency of variable selection across models. Specifically, the angle between the actual coefficient vector and the estimated coefficient vector is computed to determine the degree of consistency. Additionally, the boxplot tool of statistical analysis is utilized to visually represent the distribution of model prediction accuracy data and variable selection consistency. The comparative stability of each model is assessed based on the frequency of outliers. This study conducts comparative experiments of numerical simulation to evaluate a proposed model evaluation method against commonly used analysis methods. The results demonstrate the effectiveness and correctness of the proposed method, highlighting its ability to conveniently analyze the stability and efficiency of each fitting model.
本研究运用计算机科学和统计学原理,通过数值模拟和实证研究,利用套索变量选择技术(包括套索、弹性网络、自适应套索和SCAD)评估线性随机效应模型的功效。分析重点在于模型在变量选择、预测准确性、稳定性和效率方面的一致性。本研究采用一种新颖的方法来评估不同模型间变量选择的一致性。具体而言,计算实际系数向量与估计系数向量之间的夹角以确定一致程度。此外,利用统计分析的箱线图工具直观呈现模型预测准确性数据的分布和变量选择的一致性。基于异常值出现的频率评估每个模型的相对稳定性。本研究进行数值模拟对比实验,以针对常用分析方法评估一种提出的模型评估方法。结果证明了所提方法的有效性和正确性,突出了其方便分析每个拟合模型的稳定性和效率的能力。