使用GPT嵌入技术自动检测阿尔茨海默病的自然语言处理方法的优化

The Optimization of a Natural Language Processing Approach for the Automatic Detection of Alzheimer's Disease Using GPT Embeddings.

作者信息

Runde Benjamin S, Alapati Ajit, Bazan Nicolas G

机构信息

Science Engineering Research Center, The Potomac School, McLean, VA 22101, USA.

Neuroscience Center of Excellence, School of Medicine, New Orleans, LA 70112, USA.

出版信息

Brain Sci. 2024 Feb 25;14(3):211. doi: 10.3390/brainsci14030211.

DOI:10.3390/brainsci14030211

PMID:38539600

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10968873/

Abstract

The development of noninvasive and cost-effective methods of detecting Alzheimer's disease (AD) is essential for its early prevention and mitigation. We optimize the detection of AD using natural language processing (NLP) of spontaneous speech through the use of audio enhancement techniques and novel transcription methodologies. Specifically, we utilized Boll Spectral Subtraction to improve audio fidelity and created transcriptions using state-of-the-art AI services-locally-based Wav2Vec and Whisper, alongside cloud-based IBM Cloud and Rev AI-evaluating their performance against traditional manual transcription methods. Support Vector Machine (SVM) classifiers were then trained and tested using GPT-based embeddings of transcriptions. Our findings revealed that AI-based transcriptions largely outperformed traditional manual ones, with Wav2Vec (enhanced audio) achieving the best accuracy and F-1 score (0.99 for both metrics) for locally-based systems and Rev AI (standard audio) performing the best for cloud-based systems (0.96 for both metrics). Furthermore, this study revealed the detrimental effects of interviewer speech on model performance in addition to the minimal effect of audio enhancement. Based on our findings, current AI transcription and NLP technologies are highly effective at accurately detecting AD with available data but struggle to classify probable AD and mild cognitive impairment (MCI), a prodromal stage of AD, due to a lack of training data, laying the groundwork for the future implementation of an automatic AD detection system.

摘要

开发无创且经济高效的阿尔茨海默病（AD）检测方法对于其早期预防和缓解至关重要。我们通过使用音频增强技术和新颖的转录方法，利用自然语言处理（NLP）对自发语音进行优化，以检测AD。具体而言，我们利用博尔谱减法来提高音频保真度，并使用基于本地的Wav2Vec和Whisper以及基于云的IBM Cloud和Rev AI等先进的人工智能服务创建转录文本，同时将它们的性能与传统的人工转录方法进行比较。然后，使用基于GPT的转录嵌入对支持向量机（SVM）分类器进行训练和测试。我们的研究结果表明，基于人工智能的转录在很大程度上优于传统的人工转录，对于基于本地的系统，Wav2Vec（增强音频）在准确率和F1分数方面表现最佳（两项指标均为0.99），而对于基于云的系统，Rev AI（标准音频）表现最佳（两项指标均为0.96）。此外，这项研究揭示了采访者语音对模型性能的不利影响以及音频增强的最小影响。基于我们的研究结果，当前的人工智能转录和NLP技术在利用现有数据准确检测AD方面非常有效，但由于缺乏训练数据，在对可能的AD和轻度认知障碍（MCI，AD的前驱阶段）进行分类时存在困难，这为未来自动AD检测系统的实施奠定了基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d1ad/10968873/7c5aef4fa8c2/brainsci-14-00211-g001.jpg

相似文献

The Optimization of a Natural Language Processing Approach for the Automatic Detection of Alzheimer's Disease Using GPT Embeddings.使用GPT嵌入技术自动检测阿尔茨海默病的自然语言处理方法的优化

Brain Sci. 2024 Feb 25;14(3):211. doi: 10.3390/brainsci14030211.

medRxiv. 2024 Jan 16:2024.01.14.24301297. doi: 10.1101/2024.01.14.24301297.

Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测

Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.

Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech.利用语音的音频和基于文本的表示对阿尔茨海默病进行分类

Front Psychol. 2021 Jan 15;11:624137. doi: 10.3389/fpsyg.2020.624137. eCollection 2020.

Automatic speech analysis for the assessment of patients with predementia and Alzheimer's disease.用于评估轻度认知障碍患者和阿尔茨海默病患者的自动语音分析

Alzheimers Dement (Amst). 2015 Mar 29;1(1):112-24. doi: 10.1016/j.dadm.2014.11.012. eCollection 2015 Mar.

Predicting dementia from spontaneous speech using large language models.使用大语言模型从自发语言中预测痴呆症。

PLOS Digit Health. 2022 Dec 22;1(12):e0000168. doi: 10.1371/journal.pdig.0000168. eCollection 2022 Dec.

Use of Speech Analyses within a Mobile Application for the Assessment of Cognitive Impairment in Elderly People.在移动应用程序中使用语音分析评估老年人认知障碍

Curr Alzheimer Res. 2018;15(2):120-129. doi: 10.2174/1567205014666170829111942.

Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer's dementia.将自然语言处理和自动语音分析与临床医生评估相关联，以量化轻度认知障碍和阿尔茨海默病患者的言语变化。

Alzheimers Res Ther. 2021 Jun 4;13(1):109. doi: 10.1186/s13195-021-00848-x.

Pre-training and ensembling based Alzheimer's disease detection.基于预训练和集成的阿尔茨海默病检测。

Technol Health Care. 2024;32(1):379-395. doi: 10.3233/THC-230571.

Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease.基于 MRI 的阿尔茨海默病诊断和预测的深度学习和传统机器学习在不同队列间的泛化能力。

Neuroimage Clin. 2021;31:102712. doi: 10.1016/j.nicl.2021.102712. Epub 2021 Jun 4.

引用本文的文献

A Systematic Review of Natural Language Processing Techniques for Early Detection of Cognitive Impairment.用于早期检测认知障碍的自然语言处理技术的系统评价

Mayo Clin Proc Digit Health. 2025 Mar 5;3(2):100205. doi: 10.1016/j.mcpdig.2025.100205. eCollection 2025 Jun.

Natural language processing in Alzheimer's disease research: Systematic review of methods, data, and efficacy.阿尔茨海默病研究中的自然语言处理：方法、数据和疗效的系统综述

Alzheimers Dement (Amst). 2025 Feb 11;17(1):e70082. doi: 10.1002/dad2.70082. eCollection 2025 Jan-Mar.

Optimizing Machine Learning Models for Accessible Early Cognitive Impairment Prediction: A Novel Cost-effective Model Selection Algorithm.优化用于可及性早期认知障碍预测的机器学习模型：一种新型经济高效的模型选择算法

IEEE Access. 2024;12:180792-180814. doi: 10.1109/access.2024.3505038. Epub 2024 Nov 22.

本文引用的文献

2023 Alzheimer's disease facts and figures.2023 年阿尔茨海默病事实和数据。

Alzheimers Dement. 2023 Apr;19(4):1598-1695. doi: 10.1002/alz.13016. Epub 2023 Mar 14.

Predicting dementia from spontaneous speech using large language models.使用大语言模型从自发语言中预测痴呆症。

PLOS Digit Health. 2022 Dec 22;1(12):e0000168. doi: 10.1371/journal.pdig.0000168. eCollection 2022 Dec.

DementiaBank: Theoretical Rationale, Protocol, and Illustrative Analyses.痴呆症数据库：理论基础、方案及实例分析。

Am J Speech Lang Pathol. 2023 Mar 9;32(2):426-438. doi: 10.1044/2022_AJSLP-22-00281. Epub 2023 Feb 15.

Editorial: Alzheimer's Dementia Recognition through Spontaneous Speech.社论：通过自发言语识别阿尔茨海默病性痴呆

Front Comput Sci. 2021;3. doi: 10.3389/fcomp.2021.780169. Epub 2021 Oct 21.

Semantic Feature Extraction Using SBERT for Dementia Detection.使用SBERT进行语义特征提取以检测痴呆症

Brain Sci. 2022 Feb 15;12(2):270. doi: 10.3390/brainsci12020270.

Speech- and Language-Based Classification of Alzheimer's Disease: A Systematic Review.基于言语和语言的阿尔茨海默病分类：一项系统综述。

Bioengineering (Basel). 2022 Jan 11;9(1):27. doi: 10.3390/bioengineering9010027.

Towards Computer-Based Automated Screening of Dementia Through Spontaneous Speech.迈向基于计算机的通过自发语音进行痴呆症自动筛查

Front Psychol. 2021 Feb 12;11:623237. doi: 10.3389/fpsyg.2020.623237. eCollection 2020.

Array programming with NumPy.使用 NumPy 进行数组编程。

Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.

Analysis of word number and content in discourse of patients with mild to moderate Alzheimer's disease.轻度至中度阿尔茨海默病患者话语中的词数及内容分析

Dement Neuropsychol. 2014 Jul-Sep;8(3):260-265. doi: 10.1590/S1980-57642014DN83000010.

Innovative diagnostic tools for early detection of Alzheimer's disease.用于早期发现阿尔茨海默病的创新诊断工具。

Alzheimers Dement. 2015 May;11(5):561-78. doi: 10.1016/j.jalz.2014.06.004. Epub 2014 Nov 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用GPT嵌入技术自动检测阿尔茨海默病的自然语言处理方法的优化

The Optimization of a Natural Language Processing Approach for the Automatic Detection of Alzheimer's Disease Using GPT Embeddings.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献