使用大语言模型对阿尔茨海默病及相关痴呆症症状进行高通量表型分析：横断面研究

High-Throughput Phenotyping of the Symptoms of Alzheimer Disease and Related Dementias Using Large Language Models: Cross-Sectional Study.

作者信息

Cheng You, Malekar Mrunal, He Yingnan, Bommareddy Apoorva, Magdamo Colin, Singh Arjun, Westover Brandon, Mukerji Shibani S, Dickson John, Das Sudeshna

机构信息

Department of Neurology, Massachusetts General Hospital, Cambridge, MA, United States.

Harvard Medical School, Boston, MA, United States.

出版信息

JMIR AI. 2025 Jun 3;4:e66926. doi: 10.2196/66926.

DOI:10.2196/66926

PMID:40460418

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12174885/

Abstract

BACKGROUND

Alzheimer disease and related dementias (ADRD) are complex disorders with overlapping symptoms and pathologies. Comprehensive records of symptoms in electronic health records (EHRs) are critical for not only reaching an accurate diagnosis but also supporting ongoing research studies and clinical trials. However, these symptoms are frequently obscured within unstructured clinical notes in EHRs, making manual extraction both time-consuming and labor-intensive.

OBJECTIVE

We aimed to automate symptom extraction from the clinical notes of patients with ADRD using fine-tuned large language models (LLMs), compare its performance to regular expression-based symptom recognition, and validate the results using brain magnetic resonance imaging (MRI) data.

METHODS

We fine-tuned LLMs to extract ADRD symptoms across the following 7 domains: memory, executive function, motor, language, visuospatial, neuropsychiatric, and sleep. We assessed the algorithm's performance by calculating the area under the receiver operating characteristic curve (AUROC) for each domain. The extracted symptoms were then validated in two analyses: (1) predicting ADRD diagnosis using the counts of extracted symptoms and (2) examining the association between ADRD symptoms and MRI-derived brain volumes.

RESULTS

Symptom extraction across the 7 domains achieved high accuracy with AUROCs ranging from 0.97 to 0.99. Using the counts of extracted symptoms to predict ADRD diagnosis yielded an AUROC of 0.83 (95% CI 0.77-0.89). Symptom associations with brain volumes revealed that a smaller hippocampal volume was linked to memory impairments (odds ratio 0.62, 95% CI 0.46-0.84; P=.006), and reduced pallidum size was associated with motor impairments (odds ratio 0.73, 95% CI 0.58-0.90; P=.04).

CONCLUSIONS

These results highlight the accuracy and reliability of our high-throughput ADRD phenotyping algorithm. By enabling automated symptom extraction, our approach has the potential to assist with differential diagnosis, as well as facilitate clinical trials and research studies of dementia.

摘要

背景

阿尔茨海默病及相关痴呆症（ADRD）是具有重叠症状和病理特征的复杂疾病。电子健康记录（EHR）中的症状综合记录不仅对于准确诊断至关重要，而且对于支持正在进行的研究和临床试验也至关重要。然而，这些症状在EHR的非结构化临床记录中经常被掩盖，使得手动提取既耗时又费力。

目的

我们旨在使用微调后的大语言模型（LLM）从ADRD患者的临床记录中自动提取症状，将其性能与基于正则表达式的症状识别进行比较，并使用脑磁共振成像（MRI）数据验证结果。

方法

我们对LLM进行微调，以提取以下7个领域的ADRD症状：记忆、执行功能、运动、语言、视觉空间、神经精神和睡眠。我们通过计算每个领域的受试者工作特征曲线下面积（AUROC）来评估算法的性能。然后在两项分析中验证提取的症状：（1）使用提取症状的计数预测ADRD诊断；（2）检查ADRD症状与MRI衍生脑体积之间的关联。

结果

7个领域的症状提取取得了高精度，AUROC范围为0.97至0.99。使用提取症状的计数预测ADRD诊断的AUROC为0.83（95%CI 0.77-0.89）。症状与脑体积的关联显示，较小的海马体积与记忆障碍有关（优势比0.62，95%CI 0.46-0.84；P=0.006），苍白球体积减小与运动障碍有关（优势比0.73，95%CI 0.58-0.90；P=0.04）。

结论

这些结果突出了我们的高通量ADRD表型算法的准确性和可靠性。通过实现自动症状提取，我们的方法有可能协助进行鉴别诊断，并促进痴呆症的临床试验和研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5be0/12174885/422d5e8e7403/ai_v4i1e66926_fig1.jpg

相似文献

High-Throughput Phenotyping of the Symptoms of Alzheimer Disease and Related Dementias Using Large Language Models: Cross-Sectional Study.使用大语言模型对阿尔茨海默病及相关痴呆症症状进行高通量表型分析：横断面研究

JMIR AI. 2025 Jun 3;4:e66926. doi: 10.2196/66926.

Interventions for central serous chorioretinopathy: a network meta-analysis.中心性浆液性脉络膜视网膜病变的干预措施：一项网状Meta分析

Cochrane Database Syst Rev. 2025 Jun 16;6(6):CD011841. doi: 10.1002/14651858.CD011841.pub3.

Prognostic factors for return to work in breast cancer survivors.乳腺癌幸存者恢复工作的预后因素。

Cochrane Database Syst Rev. 2025 May 7;5(5):CD015124. doi: 10.1002/14651858.CD015124.pub2.

Aural toilet (ear cleaning) for chronic suppurative otitis media.慢性化脓性中耳炎的耳道清理（耳部清洁）

Cochrane Database Syst Rev. 2025 Jun 9;6(6):CD013057. doi: 10.1002/14651858.CD013057.pub3.

Electronic cigarettes for smoking cessation.用于戒烟的电子烟。

Cochrane Database Syst Rev. 2025 Jan 29;1(1):CD010216. doi: 10.1002/14651858.CD010216.pub9.

A Live Video Resiliency Dyadic Intervention for Persons With Dementia and Their Care-Partners Early After Diagnosis: Protocol for Open Pilot of Resilient Together for Dementia.一种针对痴呆症患者及其护理伙伴诊断后早期的实时视频复原力二元干预：痴呆症共同复原力开放试点方案。

JMIR Res Protoc. 2025 Jan 15;14:e60382. doi: 10.2196/60382.

Stakeholders' perceptions and experiences of factors influencing the commissioning, delivery, and uptake of general health checks: a qualitative evidence synthesis.利益相关者对影响一般健康检查的委托、提供和接受因素的看法与体验：一项定性证据综合分析

Cochrane Database Syst Rev. 2025 Mar 20;3(3):CD014796. doi: 10.1002/14651858.CD014796.pub2.

Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测：基于放射学报告的多中心方法学研究

J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.

Pelvic floor muscle training with feedback or biofeedback for urinary incontinence in women.针对女性尿失禁的盆底肌训练及反馈或生物反馈训练

Cochrane Database Syst Rev. 2025 Mar 11;3(3):CD009252. doi: 10.1002/14651858.CD009252.pub2.

Mucolytics for children with chronic suppurative lung disease.用于患有慢性化脓性肺病儿童的黏液溶解剂。

Cochrane Database Syst Rev. 2025 Mar 28;3(3):CD015313. doi: 10.1002/14651858.CD015313.pub2.

本文引用的文献

Extracting Critical Information from Unstructured Clinicians' Notes Data to Identify Dementia Severity Using a Rule-Based Approach: Feasibility Study.基于规则的方法从非结构化临床医生笔记数据中提取关键信息以识别痴呆严重程度的可行性研究。

JMIR Aging. 2024 Sep 24;7:e57926. doi: 10.2196/57926.

AI-based differential diagnosis of dementia etiologies on multimodal data.基于人工智能的多模态数据对痴呆病因的鉴别诊断。

Nat Med. 2024 Oct;30(10):2977-2989. doi: 10.1038/s41591-024-03118-z. Epub 2024 Jul 4.

Disparities in cannabis use and documentation in electronic health records among children and young adults.儿童和青年成年人在大麻使用及电子健康记录中的记录差异。

NPJ Digit Med. 2023 Aug 8;6(1):138. doi: 10.1038/s41746-023-00885-w.

Robust machine learning segmentation for large-scale analysis of heterogeneous clinical brain MRI datasets.用于大规模分析异质临床脑 MRI 数据集的稳健机器学习分割。

Proc Natl Acad Sci U S A. 2023 Feb 28;120(9):e2216399120. doi: 10.1073/pnas.2216399120. Epub 2023 Feb 21.

Assess the documentation of cognitive tests and biomarkers in electronic health records via natural language processing for Alzheimer's disease and related dementias.通过自然语言处理评估电子健康记录中的认知测试和生物标志物文档，用于阿尔茨海默病及相关痴呆症。

Int J Med Inform. 2023 Feb;170:104973. doi: 10.1016/j.ijmedinf.2022.104973. Epub 2022 Dec 21.

Launching into clinical space with medspaCy: a new clinical text processing toolkit in Python.医学 spaCy：Python 中的新型临床文本处理工具包，助力临床应用。

AMIA Annu Symp Proc. 2022 Feb 21;2021:438-447. eCollection 2021.

A Deep Language Model for Symptom Extraction From Clinical Text and its Application to Extract COVID-19 Symptoms From Social Media.一种从临床文本中提取症状的深度语言模型及其在从社交媒体中提取 COVID-19 症状的应用。

IEEE J Biomed Health Inform. 2022 Apr;26(4):1737-1748. doi: 10.1109/JBHI.2021.3123192. Epub 2022 Apr 14.

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT：一种用于生物医学文本挖掘的预训练生物医学语言表示模型。

Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.

Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.电子健康记录中自由文本叙述的症状的自然语言处理：系统评价。

J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173.

Machine Learning Methods to Extract Documentation of Breast Cancer Symptoms From Electronic Health Records.机器学习方法从电子健康记录中提取乳腺癌症状的文档。

J Pain Symptom Manage. 2018 Jun;55(6):1492-1499. doi: 10.1016/j.jpainsymman.2018.02.016. Epub 2018 Feb 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用大语言模型对阿尔茨海默病及相关痴呆症症状进行高通量表型分析：横断面研究

High-Throughput Phenotyping of the Symptoms of Alzheimer Disease and Related Dementias Using Large Language Models: Cross-Sectional Study.

作者信息

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献