• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用自然语言处理技术在不进行筛查的情况下开发一种预测轻度认知障碍的机器学习模型。

Development of a machine learning model to predict mild cognitive impairment using natural language processing in the absence of screening.

机构信息

Kaiser Permanente Washington Health Research Institute, 1730 Minor Ave., Suite 1600, Seattle, WA, 98101, USA.

Janssen Research and Development, LLC, Raritan, USA.

出版信息

BMC Med Inform Decis Mak. 2022 May 12;22(1):129. doi: 10.1186/s12911-022-01864-z.

DOI:10.1186/s12911-022-01864-z
PMID:35549702
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9097352/
Abstract

BACKGROUND

Patients and their loved ones often report symptoms or complaints of cognitive decline that clinicians note in free clinical text, but no structured screening or diagnostic data are recorded. These symptoms/complaints may be signals that predict who will go on to be diagnosed with mild cognitive impairment (MCI) and ultimately develop Alzheimer's Disease or related dementias. Our objective was to develop a natural language processing system and prediction model for identification of MCI from clinical text in the absence of screening or other structured diagnostic information.

METHODS

There were two populations of patients: 1794 participants in the Adult Changes in Thought (ACT) study and 2391 patients in the general population of Kaiser Permanente Washington. All individuals had standardized cognitive assessment scores. We excluded patients with a diagnosis of Alzheimer's Disease, Dementia or use of donepezil. We manually annotated 10,391 clinic notes to train the NLP model. Standard Python code was used to extract phrases from notes and map each phrase to a cognitive functioning concept. Concepts derived from the NLP system were used to predict future MCI. The prediction model was trained on the ACT cohort and 60% of the general population cohort with 40% withheld for validation. We used a least absolute shrinkage and selection operator logistic regression approach (LASSO) to fit a prediction model with MCI as the prediction target. Using the predicted case status from the LASSO model and known MCI from standardized scores, we constructed receiver operating curves to measure model performance.

RESULTS

Chart abstraction identified 42 MCI concepts. Prediction model performance in the validation data set was modest with an area under the curve of 0.67. Setting the cutoff for correct classification at 0.60, the classifier yielded sensitivity of 1.7%, specificity of 99.7%, PPV of 70% and NPV of 70.5% in the validation cohort.

DISCUSSION AND CONCLUSION

Although the sensitivity of the machine learning model was poor, negative predictive value was high, an important characteristic of models used for population-based screening. While an AUC of 0.67 is generally considered moderate performance, it is also comparable to several tests that are widely used in clinical practice.

摘要

背景

患者及其家属经常报告认知能力下降的症状或抱怨,临床医生在自由临床文本中注意到这些症状,但没有记录任何结构化的筛查或诊断数据。这些症状/抱怨可能是预示谁将被诊断为轻度认知障碍(MCI)并最终发展为阿尔茨海默病或相关痴呆的信号。我们的目标是开发一种自然语言处理系统和预测模型,以便在没有筛查或其他结构化诊断信息的情况下从临床文本中识别 MCI。

方法

有两个患者群体:1794 名成人思维变化(ACT)研究参与者和 2391 名 Kaiser Permanente Washington 普通人群患者。所有个体都有标准化的认知评估评分。我们排除了患有阿尔茨海默病、痴呆或使用多奈哌齐的患者。我们手动注释了 10391 份诊所记录来训练 NLP 模型。标准 Python 代码用于从笔记中提取短语,并将每个短语映射到认知功能概念。从 NLP 系统中获得的概念用于预测未来的 MCI。预测模型在 ACT 队列和普通人群队列的 60%上进行训练,40%被保留用于验证。我们使用最小绝对收缩和选择算子逻辑回归方法(LASSO)来拟合以 MCI 为预测目标的预测模型。使用 LASSO 模型预测的病例状态和标准化评分中的已知 MCI,我们构建了接收者操作曲线来衡量模型性能。

结果

图表抽象确定了 42 个 MCI 概念。验证数据集的预测模型性能中等,曲线下面积为 0.67。将正确分类的截止值设置为 0.60,分类器在验证队列中的灵敏度为 1.7%,特异性为 99.7%,PPV 为 70%,NPV 为 70.5%。

讨论与结论

尽管机器学习模型的灵敏度较差,但阴性预测值较高,这是用于基于人群的筛查的模型的重要特征。虽然 0.67 的 AUC 通常被认为是中等性能,但它也与许多在临床实践中广泛使用的测试相当。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fee/9097352/1ed0115fc86f/12911_2022_1864_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fee/9097352/1ed0115fc86f/12911_2022_1864_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fee/9097352/1ed0115fc86f/12911_2022_1864_Fig1_HTML.jpg

相似文献

1
Development of a machine learning model to predict mild cognitive impairment using natural language processing in the absence of screening.利用自然语言处理技术在不进行筛查的情况下开发一种预测轻度认知障碍的机器学习模型。
BMC Med Inform Decis Mak. 2022 May 12;22(1):129. doi: 10.1186/s12911-022-01864-z.
2
A Stable and Scalable Digital Composite Neurocognitive Test for Early Dementia Screening Based on Machine Learning: Model Development and Validation Study.基于机器学习的稳定且可扩展的数字化复合神经认知测试在早期痴呆筛查中的应用:模型的开发与验证研究。
J Med Internet Res. 2023 Dec 1;25:e49147. doi: 10.2196/49147.
3
Plasma and cerebrospinal fluid amyloid beta for the diagnosis of Alzheimer's disease dementia and other dementias in people with mild cognitive impairment (MCI).血浆和脑脊液β淀粉样蛋白用于诊断轻度认知障碍(MCI)患者的阿尔茨海默病性痴呆及其他痴呆。
Cochrane Database Syst Rev. 2014 Jun 10;2014(6):CD008782. doi: 10.1002/14651858.CD008782.pub4.
4
Mini-Mental State Examination (MMSE) for the detection of Alzheimer's disease and other dementias in people with mild cognitive impairment (MCI).用于检测轻度认知障碍(MCI)患者中阿尔茨海默病及其他痴呆症的简易精神状态检查表(MMSE)。
Cochrane Database Syst Rev. 2015 Mar 5;2015(3):CD010783. doi: 10.1002/14651858.CD010783.pub2.
5
Application of Machine Learning to Arterial Spin Labeling in Mild Cognitive Impairment and Alzheimer Disease.机器学习在轻度认知障碍和阿尔茨海默病动脉自旋标记中的应用。
Radiology. 2016 Dec;281(3):865-875. doi: 10.1148/radiol.2016152703. Epub 2016 Jul 6.
6
AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease.AD-BERT:利用预训练语言模型预测从轻度认知障碍到阿尔茨海默病的进展。
J Biomed Inform. 2023 Aug;144:104442. doi: 10.1016/j.jbi.2023.104442. Epub 2023 Jul 8.
7
ApoE4 effects on automated diagnostic classifiers for mild cognitive impairment and Alzheimer's disease.载脂蛋白E4对轻度认知障碍和阿尔茨海默病自动诊断分类器的影响。
Neuroimage Clin. 2014 Jan 4;4:461-72. doi: 10.1016/j.nicl.2013.12.012. eCollection 2014.
8
CSF tau and the CSF tau/ABeta ratio for the diagnosis of Alzheimer's disease dementia and other dementias in people with mild cognitive impairment (MCI).脑脊液tau蛋白及脑脊液tau蛋白与β淀粉样蛋白比值在轻度认知障碍(MCI)患者中用于诊断阿尔茨海默病性痴呆及其他痴呆。
Cochrane Database Syst Rev. 2017 Mar 22;3(3):CD010803. doi: 10.1002/14651858.CD010803.pub2.
9
Incremental value of biomarker combinations to predict progression of mild cognitive impairment to Alzheimer's dementia.生物标志物组合对预测轻度认知障碍向阿尔茨海默病痴呆进展的增量价值。
Alzheimers Res Ther. 2017 Oct 10;9(1):84. doi: 10.1186/s13195-017-0301-7.
10
Plasma d-glutamate levels for detecting mild cognitive impairment and Alzheimer's disease: Machine learning approaches.血浆谷氨酸水平检测轻度认知障碍和阿尔茨海默病:机器学习方法。
J Psychopharmacol. 2021 Mar;35(3):265-272. doi: 10.1177/0269881120972331. Epub 2021 Feb 15.

引用本文的文献

1
Natural language processing of electronic health records for early detection of cognitive decline: a systematic review.用于早期检测认知衰退的电子健康记录自然语言处理:一项系统综述
NPJ Digit Med. 2025 Mar 1;8(1):133. doi: 10.1038/s41746-025-01527-z.
2
Natural language processing in Alzheimer's disease research: Systematic review of methods, data, and efficacy.阿尔茨海默病研究中的自然语言处理:方法、数据和疗效的系统综述
Alzheimers Dement (Amst). 2025 Feb 11;17(1):e70082. doi: 10.1002/dad2.70082. eCollection 2025 Jan-Mar.
3
Machine learning based algorithms for virtual early detection and screening of neurodegenerative and neurocognitive disorders: a systematic-review.

本文引用的文献

1
Retrospective assessment of patient characteristics and healthcare costs prior to a diagnosis of Alzheimer's disease in an administrative claims database.回顾性评估在行政索赔数据库中诊断为阿尔茨海默病之前的患者特征和医疗保健费用。
BMC Geriatr. 2018 Oct 16;18(1):243. doi: 10.1186/s12877-018-0920-2.
2
The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification.非结构化电子健康记录数据在老年综合征病例识别中的价值。
J Am Geriatr Soc. 2018 Aug;66(8):1499-1507. doi: 10.1111/jgs.15411. Epub 2018 Jul 4.
3
No proof of a causal relationship between antiepileptic drug treatment and incidence of dementia. Comment on: Use of antiepileptic drugs and dementia risk-An analysis of Finnish health register and German health insurance data.
基于机器学习的神经退行性和神经认知障碍虚拟早期检测与筛查算法:一项系统综述。
Front Neurol. 2024 Dec 9;15:1413071. doi: 10.3389/fneur.2024.1413071. eCollection 2024.
4
Early prediction of cognitive impairment in adults aged 20 years and older using machine learning and biomarkers of heavy metal exposure.使用机器学习和重金属暴露生物标志物对20岁及以上成年人认知障碍进行早期预测。
Curr Res Toxicol. 2024 Oct 18;7:100198. doi: 10.1016/j.crtox.2024.100198. eCollection 2024.
5
Enhancing early detection of cognitive decline in the elderly: a comparative study utilizing large language models in clinical notes.提高老年人认知能力下降的早期检测:一项在临床记录中使用大语言模型的比较研究。
EBioMedicine. 2024 Nov;109:105401. doi: 10.1016/j.ebiom.2024.105401. Epub 2024 Oct 12.
6
A Traumatic Brain Injury Prescreening Tool for Intimate Partner Violence Patients Using Initial Clinical Reports and Machine Learning.一种利用初始临床报告和机器学习的亲密伴侣暴力患者创伤性脑损伤预筛查工具。
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:401-408. eCollection 2024.
7
Enhancing Early Detection of Cognitive Decline in the Elderly: A Comparative Study Utilizing Large Language Models in Clinical Notes.提高老年人认知衰退的早期检测:一项在临床记录中利用大语言模型的比较研究。
medRxiv. 2024 May 6:2024.04.03.24305298. doi: 10.1101/2024.04.03.24305298.
8
Year 2022 in Medical Natural Language Processing: Availability of Language Models as a Step in the Democratization of NLP in the Biomedical Area.2022 年医学自然语言处理:语言模型的可用性是生物医学领域 NLP 民主化的一步。
Yearb Med Inform. 2023 Aug;32(1):244-252. doi: 10.1055/s-0043-1768752. Epub 2023 Dec 26.
9
The Potential Influence of AI on Population Mental Health.人工智能对人群心理健康的潜在影响。
JMIR Ment Health. 2023 Nov 16;10:e49936. doi: 10.2196/49936.
10
HomeADScreen: Developing Alzheimer's disease and related dementia risk identification model in home healthcare.在家医疗保健中开发阿尔茨海默病和相关痴呆风险识别模型。
Int J Med Inform. 2023 Sep;177:105146. doi: 10.1016/j.ijmedinf.2023.105146. Epub 2023 Jul 13.
没有证据表明抗癫痫药物治疗与痴呆症发病率之间存在因果关系。评论:抗癫痫药物的使用与痴呆风险——芬兰健康登记和德国健康保险数据分析。
Epilepsia. 2018 Jul;59(7):1303-1306. doi: 10.1111/epi.14432. Epub 2018 May 28.
4
Predicting dementia from primary care records: A systematic review and meta-analysis.从初级保健记录预测痴呆症:系统评价和荟萃分析。
PLoS One. 2018 Mar 29;13(3):e0194735. doi: 10.1371/journal.pone.0194735. eCollection 2018.
5
Fitness and dementia risk: Further evidence of the heart-brain connection.健康与痴呆风险:心脑关联的进一步证据。
Neurology. 2018 Apr 10;90(15):675-676. doi: 10.1212/WNL.0000000000005282. Epub 2018 Mar 14.
6
Walking Speed, Cognitive Function, and Dementia Risk in the English Longitudinal Study of Ageing.《英国老龄化纵向研究》中的步行速度、认知功能与痴呆风险。
J Am Geriatr Soc. 2018 Sep;66(9):1670-1675. doi: 10.1111/jgs.15312. Epub 2018 Mar 6.
7
Underdiagnosis of Dementia: an Observational Study of Patterns in Diagnosis and Awareness in US Older Adults.痴呆症漏诊:美国老年人诊断和认知模式的观察性研究。
J Gen Intern Med. 2018 Jul;33(7):1131-1138. doi: 10.1007/s11606-018-4377-y. Epub 2018 Mar 5.
8
Revised Framingham Stroke Risk Score, Nontraditional Risk Markers, and Incident Stroke in a Multiethnic Cohort.修订版弗雷明汉卒中风险评分、非传统风险标志物与多种族队列中的卒中事件。
Stroke. 2018 Feb;49(2):363-369. doi: 10.1161/STROKEAHA.117.018928. Epub 2018 Jan 8.
9
Proton Pump Inhibitor Use and Dementia Risk: Prospective Population-Based Study.质子泵抑制剂的使用与痴呆风险:前瞻性基于人群的研究。
J Am Geriatr Soc. 2018 Feb;66(2):247-253. doi: 10.1111/jgs.15073. Epub 2017 Nov 14.
10
Comparing clinician descriptions of frailty and geriatric syndromes using electronic health records: a retrospective cohort study.使用电子健康记录比较临床医生对虚弱和老年综合征的描述:一项回顾性队列研究。
BMC Geriatr. 2017 Oct 25;17(1):248. doi: 10.1186/s12877-017-0645-7.