• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用大语言模型进行言语谎言检测。

Verbal lie detection using Large Language Models.

作者信息

Loconte Riccardo, Russo Roberto, Capuozzo Pasquale, Pietrini Pietro, Sartori Giuseppe

机构信息

Molecular Mind Lab, IMT School for Advanced Studies Lucca, Piazza San Francesco 19, 55100, Lucca, LU, Italy.

Department of Mathematics "Tullio Levi-Civita", University of Padova, Padova, Italy.

出版信息

Sci Rep. 2023 Dec 21;13(1):22849. doi: 10.1038/s41598-023-50214-0.

DOI:10.1038/s41598-023-50214-0
PMID:38129677
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10739834/
Abstract

Human accuracy in detecting deception with intuitive judgments has been proven to not go above the chance level. Therefore, several automatized verbal lie detection techniques employing Machine Learning and Transformer models have been developed to reach higher levels of accuracy. This study is the first to explore the performance of a Large Language Model, FLAN-T5 (small and base sizes), in a lie-detection classification task in three English-language datasets encompassing personal opinions, autobiographical memories, and future intentions. After performing stylometric analysis to describe linguistic differences in the three datasets, we tested the small- and base-sized FLAN-T5 in three Scenarios using 10-fold cross-validation: one with train and test set coming from the same single dataset, one with train set coming from two datasets and the test set coming from the third remaining dataset, one with train and test set coming from all the three datasets. We reached state-of-the-art results in Scenarios 1 and 3, outperforming previous benchmarks. The results revealed also that model performance depended on model size, with larger models exhibiting higher performance. Furthermore, stylometric analysis was performed to carry out explainability analysis, finding that linguistic features associated with the Cognitive Load framework may influence the model's predictions.

摘要

事实证明,人类通过直觉判断来检测欺骗行为的准确率不会超过随机水平。因此,人们开发了几种采用机器学习和Transformer模型的自动化言语谎言检测技术,以达到更高的准确率。本研究首次探讨了大型语言模型FLAN-T5(小型和基础版本)在包含个人观点、自传式记忆和未来意图的三个英语数据集中的谎言检测分类任务中的表现。在进行文体分析以描述这三个数据集的语言差异之后,我们使用10折交叉验证在三种场景下测试了小型和基础版本的FLAN-T5:一种场景是训练集和测试集来自同一个数据集,一种场景是训练集来自两个数据集,测试集来自剩下的第三个数据集,还有一种场景是训练集和测试集来自所有三个数据集。我们在场景1和场景3中取得了领先的结果,超过了之前的基准。结果还表明,模型性能取决于模型大小,较大的模型表现出更高的性能。此外,还进行了文体分析以进行可解释性分析,发现与认知负荷框架相关的语言特征可能会影响模型的预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/25000cd3d409/41598_2023_50214_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/83c50fc3d8c9/41598_2023_50214_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/d2cd5d71f93e/41598_2023_50214_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/03fc9ac13fdd/41598_2023_50214_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/882f6f3ac167/41598_2023_50214_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/efe8f39e7045/41598_2023_50214_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/f0ebae01953a/41598_2023_50214_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/50c4e20adf84/41598_2023_50214_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/25000cd3d409/41598_2023_50214_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/83c50fc3d8c9/41598_2023_50214_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/d2cd5d71f93e/41598_2023_50214_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/03fc9ac13fdd/41598_2023_50214_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/882f6f3ac167/41598_2023_50214_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/efe8f39e7045/41598_2023_50214_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/f0ebae01953a/41598_2023_50214_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/50c4e20adf84/41598_2023_50214_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c1df/10739834/25000cd3d409/41598_2023_50214_Fig8_HTML.jpg

相似文献

1
Verbal lie detection using Large Language Models.使用大语言模型进行言语谎言检测。
Sci Rep. 2023 Dec 21;13(1):22849. doi: 10.1038/s41598-023-50214-0.
2
Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测
Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.
3
Enhancing sentiment and intent analysis in public health via fine-tuned Large Language Models on tobacco and e-cigarette-related tweets.通过在与烟草和电子烟相关的推文上微调大型语言模型来增强公共卫生领域的情感和意图分析。
Front Big Data. 2024 Nov 28;7:1501154. doi: 10.3389/fdata.2024.1501154. eCollection 2024.
4
Understanding the vaccine stance of Italian tweets and addressing language changes through the COVID-19 pandemic: Development and validation of a machine learning model.理解意大利语推文中的疫苗立场,并通过 COVID-19 大流行解决语言变化问题:机器学习模型的开发和验证。
Front Public Health. 2022 Jul 29;10:948880. doi: 10.3389/fpubh.2022.948880. eCollection 2022.
5
A hybrid transformer and attention based recurrent neural network for robust and interpretable sentiment analysis of tweets.基于混合变压器和注意力的循环神经网络的 tweet 情感分析的鲁棒性和可解释性。
Sci Rep. 2024 Oct 22;14(1):24882. doi: 10.1038/s41598-024-76079-5.
6
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
7
Data-driven evolution of water quality models: An in-depth investigation of innovative outlier detection approaches-A case study of Irish Water Quality Index (IEWQI) model.水质模型的数据驱动演变:创新异常值检测方法的深入研究——以爱尔兰水质指数(IEWQI)模型为例
Water Res. 2024 May 15;255:121499. doi: 10.1016/j.watres.2024.121499. Epub 2024 Mar 20.
8
Leveraging GPT-4 for identifying cancer phenotypes in electronic health records: a performance comparison between GPT-4, GPT-3.5-turbo, Flan-T5, Llama-3-8B, and spaCy's rule-based and machine learning-based methods.利用GPT-4在电子健康记录中识别癌症表型:GPT-4、GPT-3.5-turbo、Flan-T5、Llama-3-8B与spaCy基于规则和基于机器学习的方法之间的性能比较。
JAMIA Open. 2024 Jul 3;7(3):ooae060. doi: 10.1093/jamiaopen/ooae060. eCollection 2024 Oct.
9
Deception detection with machine learning: A systematic review and statistical analysis.使用机器学习进行欺骗检测:系统评价和统计分析。
PLoS One. 2023 Feb 9;18(2):e0281323. doi: 10.1371/journal.pone.0281323. eCollection 2023.
10
An Explainable Artificial Intelligence Text Classifier for Suicidality Prediction in Youth Crisis Text Line Users: Development and Validation Study.用于青少年危机短信热线用户自杀倾向预测的可解释人工智能文本分类器:开发与验证研究
JMIR Public Health Surveill. 2025 Jan 29;11:e63809. doi: 10.2196/63809.

引用本文的文献

1
Examining embedded lies through computational text analysis.通过计算文本分析来审视内在的谎言。
Sci Rep. 2025 Jul 21;15(1):26482. doi: 10.1038/s41598-025-11327-w.
2
Finding deceivers in social context with large language models and how to find them: the case of the Mafia game.利用大语言模型在社交情境中识别欺骗者以及如何识别欺骗者:以黑手党游戏为例。
Sci Rep. 2024 Dec 28;14(1):30946. doi: 10.1038/s41598-024-81997-5.

本文引用的文献

1
Analysing Deception in Witness Memory through Linguistic Styles in Spontaneous Language.通过自发语言中的语言风格分析证人记忆中的欺骗行为。
Brain Sci. 2023 Feb 13;13(2):317. doi: 10.3390/brainsci13020317.
2
Truth or lie: Exploring the language of deception.真相还是谎言:探索欺骗的语言。
PLoS One. 2023 Feb 2;18(2):e0281179. doi: 10.1371/journal.pone.0281179. eCollection 2023.
3
Verbal Lie Detection: Its Past, Present and Future.言语测谎:其过去、现在与未来。
Brain Sci. 2022 Dec 1;12(12):1644. doi: 10.3390/brainsci12121644.
4
Quantifying the narrative flow of imagined versus autobiographical stories.量化想象故事与自传故事的叙述流。
Proc Natl Acad Sci U S A. 2022 Nov 8;119(45):e2211715119. doi: 10.1073/pnas.2211715119. Epub 2022 Nov 2.
5
Criteria-Based Content Analysis (CBCA) reality criteria in adults: A meta-analytic review.基于标准的内容分析(CBCA)在成年人中的现实标准:一项元分析综述。
Int J Clin Health Psychol. 2016 May-Aug;16(2):201-210. doi: 10.1016/j.ijchp.2016.01.002. Epub 2016 Mar 16.
6
Covert lie detection using keyboard dynamics.利用键盘动力学进行隐蔽性测谎。
Sci Rep. 2018 Jan 31;8(1):1976. doi: 10.1038/s41598-018-20462-6.
7
Using Named Entities for Computer-Automated Verbal Deception Detection.利用命名实体进行计算机自动言语欺骗检测。
J Forensic Sci. 2018 May;63(3):714-723. doi: 10.1111/1556-4029.13645. Epub 2017 Sep 20.
8
The source of the truth bias: Heuristic processing?真相偏差的来源:启发式加工?
Scand J Psychol. 2015 Jun;56(3):254-63. doi: 10.1111/sjop.12204. Epub 2015 Feb 23.
9
Cues to deception.欺骗的线索。
Psychol Bull. 2003 Jan;129(1):74-118. doi: 10.1037/0033-2909.129.1.74.
10
Bootstrapping, permutation testing and the method of surrogate data.
Phys Med Biol. 1999 Jun;44(6):L11-2. doi: 10.1088/0031-9155/44/6/101.