• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用可解释关联挖掘方法揭示阿尔茨海默病、帕金森病和其他痴呆症的重要诊断特征。

Uncovering Important Diagnostic Features for Alzheimer's, Parkinson's and Other Dementias Using Interpretable Association Mining Methods.

作者信息

Noshin Kazi, Boland Mary Regina, Hou Bojian, Lu Victoria, Manning Carol, Shen Li, Zhang Aidong

机构信息

Department of Computer Science, University of Virginia, VA 22903, USA.

出版信息

Pac Symp Biocomput. 2025;30:631-646.

PMID:39670401
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11649014/
Abstract

Alzheimer's Disease and Related Dementias (ADRD) afflict almost 7 million people in the USA alone. The majority of research in ADRD is conducted using post-mortem samples of brain tissue or carefully recruited clinical trial patients. While these resources are excellent, they suffer from lack of sex/gender, and racial/ethnic inclusiveness. Electronic Health Records (EHR) data has the potential to bridge this gap by including real-world ADRD patients treated during routine clinical care. In this study, we utilize EHR data from a cohort of 70,420 ADRD patients diagnosed and treated at Penn Medicine. Our goal is to uncover important risk features leading to three types of Neuro-Degenerative Disorders (NDD), including Alzheimer's Disease (AD), Parkinson's Disease (PD) and Other Dementias (OD). We employ a variety of Machine Learning (ML) Methods, including uni-variate and multivariate ML approaches and compare accuracies across the ML methods. We also investigate the types of features identified by each method, the overlapping features and the unique features to highlight important advantages and disadvantages of each approach specific for certain NDD types. Our study is important for those interested in studying ADRD and NDD in EHRs as it highlights the strengths and limitations of popular approaches employed in the ML community. We found that the uni-variate approach was able to uncover features that were important and rare for specific types of NDD (AD, PD, OD), which is important from a clinical perspective. Features that were found across all methods represent features that are the most robust.

摘要

仅在美国,阿尔茨海默病及相关痴呆症(ADRD)就折磨着近700万人。ADRD的大多数研究是使用脑组织的尸检样本或精心招募的临床试验患者进行的。虽然这些资源很棒,但它们缺乏性别和种族/民族包容性。电子健康记录(EHR)数据有可能通过纳入在常规临床护理中接受治疗的真实世界ADRD患者来弥补这一差距。在本研究中,我们利用了宾夕法尼亚大学医疗系统诊断和治疗的70420名ADRD患者队列的EHR数据。我们的目标是发现导致三种神经退行性疾病(NDD)的重要风险特征,包括阿尔茨海默病(AD)、帕金森病(PD)和其他痴呆症(OD)。我们采用了多种机器学习(ML)方法,包括单变量和多变量ML方法,并比较了这些ML方法的准确性。我们还研究了每种方法识别的特征类型、重叠特征和独特特征,以突出每种方法针对特定NDD类型的重要优缺点。我们的研究对那些对在EHR中研究ADRD和NDD感兴趣的人很重要,因为它突出了ML社区中常用方法的优势和局限性。我们发现,单变量方法能够发现特定类型NDD(AD、PD、OD)中重要且罕见的特征,从临床角度来看这很重要。在所有方法中都发现的特征代表了最稳健的特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/f5d91f2cd698/nihms-2038226-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/7ead480059d1/nihms-2038226-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/8feee7b5bccb/nihms-2038226-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/151217ae6cfe/nihms-2038226-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/901a59a9d507/nihms-2038226-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/f5d91f2cd698/nihms-2038226-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/7ead480059d1/nihms-2038226-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/8feee7b5bccb/nihms-2038226-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/151217ae6cfe/nihms-2038226-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/901a59a9d507/nihms-2038226-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/932a/11649014/f5d91f2cd698/nihms-2038226-f0005.jpg

相似文献

1
Uncovering Important Diagnostic Features for Alzheimer's, Parkinson's and Other Dementias Using Interpretable Association Mining Methods.使用可解释关联挖掘方法揭示阿尔茨海默病、帕金森病和其他痴呆症的重要诊断特征。
Pac Symp Biocomput. 2025;30:631-646.
2
Uncovering Important Diagnostic Features for Alzheimer's, Parkinson's and Other Dementias Using Interpretable Association Mining Methods.使用可解释关联挖掘方法揭示阿尔茨海默病、帕金森病和其他痴呆症的重要诊断特征。
Pac Symp Biocomput. 2025;30:631-646. doi: 10.1142/9789819807024_0045.
3
Predicting the onset of Alzheimer's disease and related dementia using electronic health records: findings from the cache county study on memory in aging (1995-2008).使用电子健康记录预测阿尔茨海默病和相关痴呆的发病:来自老年记忆的 Cache 县研究(1995-2008 年)的结果。
BMC Med Inform Decis Mak. 2024 Oct 28;24(1):316. doi: 10.1186/s12911-024-02728-4.
4
Automated phenotyping of mild cognitive impairment and Alzheimer's disease and related dementias using electronic health records.利用电子健康记录对轻度认知障碍、阿尔茨海默病及相关痴呆症进行自动表型分析。
Int J Med Inform. 2025 Aug;200:105917. doi: 10.1016/j.ijmedinf.2025.105917. Epub 2025 Apr 11.
5
Extracting Critical Information from Unstructured Clinicians' Notes Data to Identify Dementia Severity Using a Rule-Based Approach: Feasibility Study.基于规则的方法从非结构化临床医生笔记数据中提取关键信息以识别痴呆严重程度的可行性研究。
JMIR Aging. 2024 Sep 24;7:e57926. doi: 10.2196/57926.
6
Early prediction of Alzheimer's disease and related dementias using real-world electronic health records.利用真实世界的电子健康记录对阿尔茨海默病及相关痴呆症进行早期预测。
Alzheimers Dement. 2023 Aug;19(8):3506-3518. doi: 10.1002/alz.12967. Epub 2023 Feb 23.
7
Assess the documentation of cognitive tests and biomarkers in electronic health records via natural language processing for Alzheimer's disease and related dementias.通过自然语言处理评估电子健康记录中的认知测试和生物标志物文档,用于阿尔茨海默病及相关痴呆症。
Int J Med Inform. 2023 Feb;170:104973. doi: 10.1016/j.ijmedinf.2022.104973. Epub 2022 Dec 21.
8
Shared genetic risk loci between Alzheimer's disease and related dementias, Parkinson's disease, and amyotrophic lateral sclerosis.阿尔茨海默病及相关痴呆、帕金森病和肌萎缩侧索硬化症之间的共享遗传风险基因座。
Alzheimers Res Ther. 2023 Jun 16;15(1):113. doi: 10.1186/s13195-023-01244-3.
9
Unsupervised Deep Learning of Electronic Health Records to Characterize Heterogeneity Across Alzheimer Disease and Related Dementias: Cross-Sectional Study.利用电子健康记录进行无监督深度学习以表征阿尔茨海默病及相关痴呆症的异质性:横断面研究
JMIR Aging. 2025 Mar 31;8:e65178. doi: 10.2196/65178.
10
Prediction, prognosis and monitoring of neurodegeneration at biobank-scale via machine learning and imaging.通过机器学习和成像技术在生物样本库规模上对神经退行性变进行预测、预后评估和监测。
medRxiv. 2024 Oct 28:2024.10.27.24316215. doi: 10.1101/2024.10.27.24316215.

引用本文的文献

1
Determining the Importance of Clinical Modalities for NeuroDegenerative Disorders and Risk of Patient Injury Using Machine Learning and Survival Analysis.利用机器学习和生存分析确定神经退行性疾病临床模式的重要性及患者受伤风险
AMIA Jt Summits Transl Sci Proc. 2025 Jun 10;2025:385-394. eCollection 2025.
2
Untargeted Lipidomic Reveals Potential Biomarkers in Plasma Samples for the Discrimination of Patients Affected by Parkinson's Disease.非靶向脂质组学揭示血浆样本中用于鉴别帕金森病患者的潜在生物标志物。
Molecules. 2025 Feb 12;30(4):850. doi: 10.3390/molecules30040850.

本文引用的文献

1
Validation of Polygenic Risk Scores for Coronary Heart Disease in a Middle Eastern Cohort Using Whole Genome Sequencing.基于全基因组测序的中东队列冠心病多基因风险评分验证。
Circ Genom Precis Med. 2022 Dec;15(6):e003712. doi: 10.1161/CIRCGEN.122.003712. Epub 2022 Oct 12.
2
Genetic architecture of asthma in African American patients.非裔美国患者哮喘的遗传结构。
J Allergy Clin Immunol. 2023 Apr;151(4):1132-1136. doi: 10.1016/j.jaci.2022.09.001. Epub 2022 Sep 8.
3
Psychiatric manifestations of rare variation in medically actionable genes: a PheWAS approach.
医学可干预基因罕见变异的精神表现:一种 pheWAS 方法。
BMC Genomics. 2022 May 19;23(1):385. doi: 10.1186/s12864-022-08600-x.
4
Deep phenotyping of Alzheimer's disease leveraging electronic medical records identifies sex-specific clinical associations.利用电子病历对阿尔茨海默病进行深度表型分析,确定了性别特异性的临床关联。
Nat Commun. 2022 Feb 3;13(1):675. doi: 10.1038/s41467-022-28273-0.
5
Data analysis with Shapley values for automatic subject selection in Alzheimer's disease data sets using interpretable machine learning.使用可解释机器学习对阿尔茨海默病数据集进行 Shapley 值数据分析,以实现自动受试者选择。
Alzheimers Res Ther. 2021 Sep 15;13(1):155. doi: 10.1186/s13195-021-00879-4.
6
Ethical Machine Learning in Healthcare.医疗保健中的伦理机器学习。
Annu Rev Biomed Data Sci. 2021 Jul;4:123-144. doi: 10.1146/annurev-biodatasci-092820-114757. Epub 2021 May 6.
7
Data-driven discovery of probable Alzheimer's disease and related dementia subphenotypes using electronic health records.利用电子健康记录进行数据驱动的阿尔茨海默病及相关痴呆亚型的可能发现。
Learn Health Syst. 2020 Sep 10;4(4):e10246. doi: 10.1002/lrh2.10246. eCollection 2020 Oct.
8
Female Reproductive Performance and Maternal Birth Month: A Comprehensive Meta-Analysis Exploring Multiple Seasonal Mechanisms.女性生殖表现与产妇出生月份:探索多种季节性机制的综合荟萃分析。
Sci Rep. 2020 Jan 17;10(1):555. doi: 10.1038/s41598-019-57377-9.
9
Disease associations depend on visit type: results from a visit-wide association study.疾病关联取决于就诊类型:一项全就诊范围关联研究的结果
BioData Min. 2019 Jul 11;12:15. doi: 10.1186/s13040-019-0203-2. eCollection 2019.
10
GWAS and PheWAS of red blood cell components in a Northern Nevadan cohort.GWAS 和北内华达州队列的红细胞成分 phewas 分析。
PLoS One. 2019 Jun 13;14(6):e0218078. doi: 10.1371/journal.pone.0218078. eCollection 2019.