大语言模型与失语症的比较。

Comparison of Large Language Model with Aphasia.

作者信息

Watanabe Takamitsu, Inoue Katsuma, Kuniyoshi Yasuo, Nakajima Kohei, Aihara Kazuyuki

机构信息

International Research Centre for Neurointelligence, The University of Tokyo Institutes for Advanced Study, 7-3-1 Hongo Bunkyo-ku, Tokyo, 113-0033, Japan.

Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, 113-8656, Japan.

出版信息

Adv Sci (Weinh). 2025 Jun;12(22):e2414016. doi: 10.1002/advs.202414016. Epub 2025 May 14.

DOI:10.1002/advs.202414016

PMID:40369908

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12165151/

Abstract

Large language models (LLMs) respond fluently but often inaccurately, which resembles aphasia in humans. Does this behavioral similarity indicate any resemblance in internal information processing between LLMs and aphasic humans? Here, we address this question by comparing the network dynamics between LLMs-ALBERT, GPT-2, Llama-3.1 and one Japanese variant of Llama-and various aphasic brains. Using energy landscape analysis, we quantify how frequently the network activity pattern is likely to move from one state to another (transition frequency) and how long it tends to dwell in each state (dwelling time). First, by investigating the frequency spectrums of these two indices for brain dynamics, we find that the degrees of the polarization of the transition frequency and dwelling time enable accurate classification of receptive aphasia, expressive aphasia and controls: receptive aphasia shows the bimodal distributions for both indices, whereas expressive aphasia exhibits the most uniform distributions. In parallel, we identify highly polarized distributions in both transition frequency and dwelling time in the network dynamics in the four LLMs. These findings indicate the similarity in internal information processing between LLMs and receptive aphasia, and the current approach can provide a novel diagnosis and classification tool for LLMs and help their performance improve.

摘要

大型语言模型（LLMs）回答流畅但常常不准确，这与人类的失语症相似。这种行为上的相似性是否表明LLMs与失语症患者在内部信息处理方面存在任何相似之处？在这里，我们通过比较LLMs（ALBERT、GPT-2、Llama-3.1和一种日语变体Llama）与各种失语症大脑之间的网络动态来解决这个问题。使用能量景观分析，我们量化了网络活动模式从一种状态转变为另一种状态的频率（转变频率）以及它在每种状态下停留的时间（停留时间）。首先，通过研究这两个大脑动力学指标的频谱，我们发现转变频率和停留时间的极化程度能够准确区分感受性失语症、表达性失语症和对照组：感受性失语症在这两个指标上均呈现双峰分布，而表达性失语症则表现出最均匀的分布。同时，我们在四个LLMs的网络动态中识别出转变频率和停留时间的高度极化分布。这些发现表明LLMs与感受性失语症在内部信息处理方面存在相似性，并且当前的方法可以为LLMs提供一种新颖的诊断和分类工具，并有助于提高它们的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/212a/12165151/75595f8ff3be/ADVS-12-2414016-g001.jpg

相似文献

Comparison of Large Language Model with Aphasia.

Adv Sci (Weinh). 2025 Jun;12(22):e2414016. doi: 10.1002/advs.202414016. Epub 2025 May 14.

Clinical efficacy of pre-trained large language models through the lens of aphasia.

Sci Rep. 2024 Jul 6;14(1):15573. doi: 10.1038/s41598-024-66576-y.

Extracting International Classification of Diseases Codes from Clinical Documentation Using Large Language Models.

Appl Clin Inform. 2025 Mar;16(2):337-344. doi: 10.1055/a-2491-3872. Epub 2024 Nov 28.

Benchmarking large language models for biomedical natural language processing applications and recommendations.

Nat Commun. 2025 Apr 6;16(1):3280. doi: 10.1038/s41467-025-56989-2.

Privacy-ensuring Open-weights Large Language Models Are Competitive with Closed-weights GPT-4o in Extracting Chest Radiography Findings from Free-Text Reports.

Radiology. 2025 Jan;314(1):e240895. doi: 10.1148/radiol.240895.

Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.

J Med Internet Res. 2024 Apr 17;26:e56655. doi: 10.2196/56655.

Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis.

J Med Internet Res. 2025 Jun 9;27:e72062. doi: 10.2196/72062.

AI in Home Care-Evaluation of Large Language Models for Future Training of Informal Caregivers: Observational Comparative Case Study.

J Med Internet Res. 2025 Apr 28;27:e70703. doi: 10.2196/70703.

Quality of Answers of Generative Large Language Models vs Peer Patients for Interpreting Lab Test Results for Lay Patients: Evaluation Study.

ArXiv. 2024 Jan 23:arXiv:2402.01693v1.

Evaluation of Large Language Models in Tailoring Educational Content for Cancer Survivors and Their Caregivers: Quality Analysis.

JMIR Cancer. 2025 Apr 7;11:e67914. doi: 10.2196/67914.

本文引用的文献

Testing AI on language comprehension tasks reveals insensitivity to underlying meaning.

Sci Rep. 2024 Nov 14;14(1):28083. doi: 10.1038/s41598-024-79531-8.

Reply to Teeny and Matz: Toward the robust measurement of personalized persuasion with generative AI.

Proc Natl Acad Sci U S A. 2024 Oct 22;121(43):e2418817121. doi: 10.1073/pnas.2418817121. Epub 2024 Oct 17.

Contrasting Linguistic Patterns in Human and LLM-Generated News Text.

Artif Intell Rev. 2024;57(10):265. doi: 10.1007/s10462-024-10903-2. Epub 2024 Aug 23.

The Aphasia Recovery Cohort, an open-source chronic stroke repository.

Sci Data. 2024 Sep 9;11(1):981. doi: 10.1038/s41597-024-03819-7.

Atypical intrinsic neural timescale in the left angular gyrus in Alzheimer's disease.

Brain Commun. 2024 Jul 11;6(4):fcae199. doi: 10.1093/braincomms/fcae199. eCollection 2024.

Detecting hallucinations in large language models using semantic entropy.

Nature. 2024 Jun;630(8017):625-630. doi: 10.1038/s41586-024-07421-0. Epub 2024 Jun 19.

A review of resting-state fMRI and its use to examine psychiatric disorders.

Psychoradiology. 2021 May 11;1(1):42-53. doi: 10.1093/psyrad/kkab003. eCollection 2021 Mar.

Aberrant brain dynamics in individuals with clinical high risk of psychosis.

Schizophr Bull Open. 2024 Jan;5(1). doi: 10.1093/schizbullopen/sgae002. Epub 2024 Jan 18.

Large-scale evidence for logarithmic effects of word predictability on reading time.

Proc Natl Acad Sci U S A. 2024 Mar 5;121(10):e2307876121. doi: 10.1073/pnas.2307876121. Epub 2024 Feb 29.

How persuasive is AI-generated propaganda?

PNAS Nexus. 2024 Feb 20;3(2):pgae034. doi: 10.1093/pnasnexus/pgae034. eCollection 2024 Feb.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

大语言模型与失语症的比较。

Comparison of Large Language Model with Aphasia.

作者信息

Watanabe Takamitsu, Inoue Katsuma, Kuniyoshi Yasuo, Nakajima Kohei, Aihara Kazuyuki

机构信息

International Research Centre for Neurointelligence, The University of Tokyo Institutes for Advanced Study, 7-3-1 Hongo Bunkyo-ku, Tokyo, 113-0033, Japan.

Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, 113-8656, Japan.

出版信息

Adv Sci (Weinh). 2025 Jun;12(22):e2414016. doi: 10.1002/advs.202414016. Epub 2025 May 14.

DOI:10.1002/advs.202414016

PMID:40369908

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12165151/

Abstract

摘要

大语言模型与失语症的比较。

Comparison of Large Language Model with Aphasia.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

大语言模型与失语症的比较。

Comparison of Large Language Model with Aphasia.

作者信息

机构信息

出版信息

相似文献

本文引用的文献