Nazari Sanaz, Leite Walter L, Huggins-Manley A Corinne
University of California, San Diego, La Jolla, USA.
University of Florida, Gainesville, USA.
Educ Psychol Meas. 2024 Dec;84(6):1107-1137. doi: 10.1177/00131644241255109. Epub 2024 May 29.
Social desirability bias (SDB) is a common threat to the validity of conclusions from responses to a scale or survey. There is a wide range of person-fit statistics in the literature that can be employed to detect SDB. In addition, machine learning classifiers, such as logistic regression and random forest, have the potential to distinguish between biased and unbiased responses. This study proposes a new application of these classifiers to detect SDB by considering several person-fit indices as features or predictors in the machine learning methods. The results of a Monte Carlo simulation study showed that for a single feature, applying person-fit indices directly and logistic regression led to similar classification results. However, the random forest classifier improved the classification of biased and unbiased responses substantially. Classification was improved in both logistic regression and random forest by considering multiple features simultaneously. Moreover, cross-validation indicated stable area under the curves (AUCs) across machine learning classifiers. A didactical illustration of applying random forest to detect SDB is presented.
社会期望偏差(SDB)是对量表或调查回应所得结论有效性的一种常见威胁。文献中有多种人-拟合统计量可用于检测SDB。此外,机器学习分类器,如逻辑回归和随机森林,有潜力区分有偏差和无偏差的回应。本研究提出这些分类器的一种新应用,即通过将几个人-拟合指标作为机器学习方法中的特征或预测变量来检测SDB。蒙特卡罗模拟研究结果表明,对于单个特征,直接应用人-拟合指标和逻辑回归会得到相似的分类结果。然而,随机森林分类器显著改善了有偏差和无偏差回应的分类。通过同时考虑多个特征,逻辑回归和随机森林的分类都得到了改善。此外,交叉验证表明机器学习分类器的曲线下面积(AUC)稳定。本文还给出了应用随机森林检测SDB的教学示例。