Zhou Chucheng, Zhang Yingqian, Lin Chengcong, Zhou Shuang
School of Computing and Data Science, Xiamen University Malaysia, Sepang, Selangor, 43900, Malaysia.
School of AIR, Xiamen University Malaysia, Sepang, Selangor, 43900, Malaysia.
Sci Rep. 2025 Sep 1;15(1):32135. doi: 10.1038/s41598-025-17741-4.
Traditional lie detection relies on the experience of human interrogators, making it susceptible to subjective factors and leading to misjudgments. To solve this problem, we propose an emotion-enhanced deception detection model, Lie Detection using XGBoost with RoBERTa-based Emotion Features (LieXBerta). In this framework, the Robustly Optimized BERT Pretraining Approach (RoBERTa) is used to extract emotional features from interrogation texts. The emotional features are then combined with facial and action features and subsequently fed into an Extreme Gradient Boosting (XGBoost) classifier for deception detection. This approach aims to improve the objectivity and accuracy of deception detection in courtroom settings. For verifying the proposed algorithm, we develop a trial text dataset enriched with detailed emotional features. Simulation experiments demonstrate that the LieXBerta model incorporating emotional features outperforms baseline models that use only traditional features and several classical machine learning models. The experimental results show that after parameter tuning, the accuracy of the LieXBerta model increased to 87.50%, respectively, marking a 6.5% improvement over the baseline model. Moreover, the runtime of the tuned LieXBerta model with reduced features was reduced by 42%, significantly enhancing the training efficiency and prediction performance for deception detection.
传统的测谎依赖于人类审讯人员的经验,容易受到主观因素的影响,从而导致误判。为了解决这个问题,我们提出了一种情感增强欺骗检测模型,即基于带有基于RoBERTa情感特征的XGBoost的测谎模型(LieXBerta)。在此框架中,使用稳健优化的BERT预训练方法(RoBERTa)从审讯文本中提取情感特征。然后将情感特征与面部和动作特征相结合,随后输入到极端梯度提升(XGBoost)分类器中进行欺骗检测。此方法旨在提高法庭环境中欺骗检测的客观性和准确性。为了验证所提出的算法,我们开发了一个富含详细情感特征的试验文本数据集。仿真实验表明,结合情感特征的LieXBerta模型优于仅使用传统特征的基线模型和几个经典机器学习模型。实验结果表明,经过参数调整后,LieXBerta模型的准确率分别提高到了87.50%,比基线模型提高了6.5%。此外,减少特征后的调优LieXBerta模型的运行时间减少了42%,显著提高了欺骗检测的训练效率和预测性能。