Betz Gregor, Richardson Kyle
Karlsruhe Institute of Technology, Department of Philosophy, Karlsruhe, Germany.
Allen Institute for Artificial Intelligence, Aristo, Seattle, WA, United States.
Front Artif Intell. 2022 Oct 18;5:900943. doi: 10.3389/frai.2022.900943. eCollection 2022.
Neural language models (NLMs) are susceptible to producing inconsistent output. This paper proposes a new diagnosis of, as well as a novel remedy for, NLMs' incoherence. We train NLMs on synthetic text corpora that are created by simulating text production in a society. For diagnostic purposes, we explicitly model the individual belief systems of the artificial agents (authors) who produce the corpus texts. NLMs trained on those texts can be shown to aggregate the judgments of individual authors during pre-training according to sentence-wise vote ratios (roughly, reporting frequencies), which inevitably leads to so-called discursive dilemmas: aggregate judgments are inconsistent even though all individual belief states are consistent. As a remedy for such inconsistencies, we develop a self-training procedure, inspired by the concept of reflective equilibrium, that effectively reduces the extent of logical incoherence in a model's belief system, corrects global mis-confidence, and eventually allows the model to settle on a new, epistemically superior belief state. Thus, social choice theory helps to explain why NLMs are prone to producing inconsistencies; epistemology suggests how to get rid of them.
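A minimal sketch of the discursive dilemma the abstract refers to, using hypothetical agents and propositions (not taken from the paper's corpus): each author holds a consistent belief state over two premises and their conjunction, yet proposition-wise majority aggregation, analogous to the sentence-wise vote ratios described above, yields an inconsistent collective judgment.

```python
# Hypothetical illustration of a discursive dilemma.
# Three authors judge propositions p, q, and their conjunction p & q.
agents = {
    "author_1": {"p": True,  "q": True,  "p_and_q": True},
    "author_2": {"p": True,  "q": False, "p_and_q": False},
    "author_3": {"p": False, "q": True,  "p_and_q": False},
}

# Every individual belief state is logically consistent.
for name, beliefs in agents.items():
    assert beliefs["p_and_q"] == (beliefs["p"] and beliefs["q"]), name

def majority(prop):
    """Accept a proposition iff more than half of the agents accept it."""
    votes = [beliefs[prop] for beliefs in agents.values()]
    return sum(votes) > len(votes) / 2

# Proposition-wise aggregation (roughly, what reporting frequencies encode).
aggregate = {prop: majority(prop) for prop in ["p", "q", "p_and_q"]}
print(aggregate)
# {'p': True, 'q': True, 'p_and_q': False}
# The aggregate accepts p and q but rejects p & q, which is inconsistent,
# even though each individual author's judgments were consistent.
```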