使用自由对话的向量表示来识别神经认知障碍。

Identifying neurocognitive disorder using vector representation of free conversation.

机构信息

Department of Neuropsychiatry, Keio University School of Medicine, Tokyo, Japan.

Lifescience AI Business Division, Research Development Department, FRONTEO Inc, Tokyo, Japan.

出版信息

Sci Rep. 2022 Aug 3;12(1):12461. doi: 10.1038/s41598-022-16204-4.

DOI:10.1038/s41598-022-16204-4

PMID:35922457

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9349220/

Abstract

In recent years, studies on the use of natural language processing (NLP) approaches to identify dementia have been reported. Most of these studies used picture description tasks or other similar tasks to encourage spontaneous speech, but the use of free conversation without requiring a task might be easier to perform in a clinical setting. Moreover, free conversation is unlikely to induce a learning effect. Therefore, the purpose of this study was to develop a machine learning model to discriminate subjects with and without dementia by extracting features from unstructured free conversation data using NLP. We recruited patients who visited a specialized outpatient clinic for dementia and healthy volunteers. Participants' conversation was transcribed and the text data was decomposed from natural sentences into morphemes by performing a morphological analysis using NLP, and then converted into real-valued vectors that were used as features for machine learning. A total of 432 datasets were used, and the resulting machine learning model classified the data for dementia and non-dementia subjects with an accuracy of 0.900, sensitivity of 0.881, and a specificity of 0.916. Using sentence vector information, it was possible to develop a machine-learning algorithm capable of discriminating dementia from non-dementia subjects with a high accuracy based on free conversation.

摘要

近年来，已有研究报告使用自然语言处理（NLP）方法来识别痴呆症。这些研究大多使用图片描述任务或其他类似任务来鼓励自发语言，但在临床环境中，不要求任务的自由对话可能更容易进行。此外，自由对话不太可能产生学习效应。因此，本研究的目的是开发一种机器学习模型，通过使用 NLP 从非结构化的自由对话数据中提取特征来区分有和无痴呆症的受试者。我们招募了就诊于专门的痴呆门诊的患者和健康志愿者。参与者的对话被转录下来，然后使用 NLP 进行形态分析，将文本数据从自然语句中分解成语素，并将其转换为用于机器学习的实值向量作为特征。总共使用了 432 个数据集，由此产生的机器学习模型对痴呆症和非痴呆症受试者的数据进行分类，准确率为 0.900，灵敏度为 0.881，特异性为 0.916。通过句子向量信息，有可能开发出一种基于自由对话的机器算法，能够以高精度区分痴呆症和非痴呆症患者。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/af0d/9349220/17f4fea32e83/41598_2022_16204_Fig1_HTML.jpg

相似文献

Identifying neurocognitive disorder using vector representation of free conversation.

Sci Rep. 2022 Aug 3;12(1):12461. doi: 10.1038/s41598-022-16204-4.

Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.

J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.

Development of machine learning and natural language processing algorithms for preoperative prediction and automated identification of intraoperative vascular injury in anterior lumbar spine surgery.

Spine J. 2021 Oct;21(10):1635-1642. doi: 10.1016/j.spinee.2020.04.001. Epub 2020 Apr 12.

Natural Language Processing and Machine Learning Methods to Characterize Unstructured Patient-Reported Outcomes: Validation Study.

J Med Internet Res. 2021 Nov 3;23(11):e26777. doi: 10.2196/26777.

Natural language processing and machine learning approaches for food categorization and nutrition quality prediction compared with traditional methods.

Am J Clin Nutr. 2023 Mar;117(3):553-563. doi: 10.1016/j.ajcnut.2022.11.022. Epub 2022 Dec 23.

Natural language processing and machine learning to enable automatic extraction and classification of patients' smoking status from electronic medical records.

Ups J Med Sci. 2020 Nov;125(4):316-324. doi: 10.1080/03009734.2020.1792010. Epub 2020 Jul 22.

Data for registry and quality review can be retrospectively collected using natural language processing from unstructured charts of arthroplasty patients.

Bone Joint J. 2020 Jul;102-B(7_Supple_B):99-104. doi: 10.1302/0301-620X.102B7.BJJ-2019-1574.R1.

Performance of machine learning algorithms for dementia assessment: impacts of language tasks, recording media, and modalities.

BMC Med Inform Decis Mak. 2023 Mar 3;23(1):45. doi: 10.1186/s12911-023-02122-6.

Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.

J Digit Imaging. 2018 Apr;31(2):178-184. doi: 10.1007/s10278-017-0027-x.

Machine learning and natural language processing (NLP) approach to predict early progression to first-line treatment in real-world hormone receptor-positive (HR+)/HER2-negative advanced breast cancer patients.

Eur J Cancer. 2021 Feb;144:224-231. doi: 10.1016/j.ejca.2020.11.030. Epub 2020 Dec 26.

引用本文的文献

A Systematic Review of Natural Language Processing Techniques for Early Detection of Cognitive Impairment.

Mayo Clin Proc Digit Health. 2025 Mar 5;3(2):100205. doi: 10.1016/j.mcpdig.2025.100205. eCollection 2025 Jun.

Artificial Intelligence in Psychiatry: A Review of Biological and Behavioral Data Analyses.

Diagnostics (Basel). 2025 Feb 11;15(4):434. doi: 10.3390/diagnostics15040434.

Using voice recognition and machine learning techniques for detecting patient-reported outcomes from conversational voice in palliative care patients.

Jpn J Nurs Sci. 2025 Jan;22(1):e12644. doi: 10.1111/jjns.12644.

Actual Clinical Practice Assessment: A Rapid and Easy-to-Use Tool for Evaluating Cognitive Decline Equivalent to Dementia.

Cureus. 2024 Apr 22;16(4):e58781. doi: 10.7759/cureus.58781. eCollection 2024 Apr.

Written discourse in diagnosis for acquired neurogenic communication disorders: current evidence and future directions.

Front Hum Neurosci. 2024 Jan 11;17:1264582. doi: 10.3389/fnhum.2023.1264582. eCollection 2023.

Applications of artificial intelligence in dementia.

Geriatr Gerontol Int. 2024 Mar;24 Suppl 1(Suppl 1):25-30. doi: 10.1111/ggi.14709. Epub 2023 Nov 2.

Novel Screening Tool Using Non-linguistic Voice Features Derived from Simple Phrases to Detect Mild Cognitive Impairment and Dementia.

JAR Life. 2023 Aug 23;12:72-76. doi: 10.14283/jarlife.2023.12. eCollection 2023.

Diagnosing psychiatric disorders from history of present illness using a large-scale linguistic model.

Psychiatry Clin Neurosci. 2023 Nov;77(11):597-604. doi: 10.1111/pcn.13580. Epub 2023 Sep 7.

Comparing global and local semantic coherence of spontaneous speech in persons with Alzheimer's disease and healthy controls.

Appl Corpus Linguistics. 2023 Dec;3(3). doi: 10.1016/j.acorp.2023.100064. Epub 2023 Jun 24.

本文引用的文献

Predicting Alzheimer's Disease from Spoken and Written Language Using Fusion-Based Stacked Generalization.

J Biomed Inform. 2021 Jun;118:103803. doi: 10.1016/j.jbi.2021.103803. Epub 2021 May 19.

Changes in telepsychiatry regulations during the COVID-19 pandemic: 17 countries and regions' approaches to an evolving healthcare landscape.

Psychol Med. 2022 Oct;52(13):2606-2613. doi: 10.1017/S0033291720004584. Epub 2020 Nov 27.

The project for objective measures using computational psychiatry technology (PROMPT): Rationale, design, and methodology.

Contemp Clin Trials Commun. 2020 Aug 18;19:100649. doi: 10.1016/j.conctc.2020.100649. eCollection 2020 Sep.

Predicting Inpatient Falls Using Natural Language Processing of Nursing Records Obtained From Japanese Electronic Medical Records: Case-Control Study.

JMIR Med Inform. 2020 Apr 22;8(4):e16970. doi: 10.2196/16970.

Toward the Automation of Diagnostic Conversation Analysis in Patients with Memory Complaints.

J Alzheimers Dis. 2017;58(2):373-387. doi: 10.3233/JAD-160507.

Predicting probable Alzheimer's disease using linguistic deficits and biomarkers.

BMC Bioinformatics. 2017 Jan 14;18(1):34. doi: 10.1186/s12859-016-1456-0.

Linguistic Features Identify Alzheimer's Disease in Narrative Speech.

J Alzheimers Dis. 2016;49(2):407-22. doi: 10.3233/JAD-150520.

Cognitive Tests to Detect Dementia: A Systematic Review and Meta-analysis.

JAMA Intern Med. 2015 Sep;175(9):1450-8. doi: 10.1001/jamainternmed.2015.2152.

Diagnosis of Cognitive Impairment Compatible with Early Diagnosis of Alzheimer's Disease. A Bayesian Network Model based on the Analysis of Oral Definitions of Semantic Categories.

Methods Inf Med. 2016;55(1):42-9. doi: 10.3414/ME14-01-0071. Epub 2015 Apr 30.

A 2 year multidomain intervention of diet, exercise, cognitive training, and vascular risk monitoring versus control to prevent cognitive decline in at-risk elderly people (FINGER): a randomised controlled trial.

Lancet. 2015 Jun 6;385(9984):2255-63. doi: 10.1016/S0140-6736(15)60461-5. Epub 2015 Mar 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用自由对话的向量表示来识别神经认知障碍。

Identifying neurocognitive disorder using vector representation of free conversation.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献