• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用血浆蛋白联合临床-人口统计学指标预测未来帕金森病。

Prediction of Future Parkinson Disease Using Plasma Proteins Combined With Clinical-Demographic Measures.

机构信息

From the Institute of Science and Technology for Brain-Inspired Intelligence (J. You, L.W., Y.W., J.K., W.C., J.F.), and Department of Neurology (J. Yu), Huashan Hospital, Fudan University; Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University) (W.C., J.F.), Ministry of Education, Shanghai; Fudan ISTBI-ZJNU Algorithm Centre for Brain-inspired Intelligence (W.C., J.F.), Zhejiang Normal University; Shanghai Medical College and Zhongshan Hospital Immunotherapy Technology Transfer Center (W.C.); Zhangjiang Fudan International Innovation Center (J.F.); and School of Data Science (J.F.), Fudan University, Shanghai, China.

出版信息

Neurology. 2024 Aug 13;103(3):e209531. doi: 10.1212/WNL.0000000000209531. Epub 2024 Jul 8.

DOI:10.1212/WNL.0000000000209531
PMID:38976826
Abstract

BACKGROUND AND OBJECTIVES

Identification of individuals at high risk of developing Parkinson disease (PD) several years before diagnosis is crucial for developing treatments to prevent or delay neurodegeneration. This study aimed to develop predictive models for PD risk that combine plasma proteins and easily accessible clinical-demographic variables.

METHODS

Using data from the UK Biobank (UKB), which recruited participants across the United Kingdom, we conducted a longitudinal study to identify predictors for incident PD. Participants with baseline plasma proteins and no PD were included. Through machine learning, we narrowed down predictors from a pool of 1,463 plasma proteins and 93 clinical-demographic. These predictors were then externally validated using the Parkinson's Progression Marker Initiative (PPMI) cohort. To further investigate the temporal trends of predictors, a nested case-control study was conducted within the UKB.

RESULTS

A total of 52,503 participants without PD (median age 58, 54% female) were included. Over a median follow-up duration of 14.0 years, 751 individuals were diagnosed with PD (median age 65, 37% female). Using a forward selection approach, we selected a panel of 22 plasma proteins for optimal prediction. Using an ensemble tree-based Light Gradient Boosting Machine (LightGBM) algorithm, the model achieved an area under the receiver operating characteristic curve (AUC) of 0.800 (95% CI 0.785-0.815). The LightGBM prediction model integrating both plasma proteins and clinical-demographic variables demonstrated enhanced predictive accuracy, with an AUC of 0.832 (95% CI 0.815-0.849). Key predictors identified included age, years of education, history of traumatic brain injury, and serum creatinine. The incorporation of 11 plasma proteins (neurofilament light, integrin subunit alpha V, hematopoietic PGD synthase, histamine N-methyltransferase, tubulin polymerization promoting protein family member 3, ectodysplasin A2 receptor, Latexin, interleukin-13 receptor subunit alpha-1, BAG family molecular chaperone regulator 3, tryptophanyl-TRNA synthetase, and secretogranin-2) augmented the model's predictive accuracy. External validation in the PPMI cohort confirmed the model's reliability, producing an AUC of 0.810 (95% CI 0.740-0.873). Notably, alterations in these predictors were detectable several years before the diagnosis of PD.

DISCUSSION

Our findings support the potential utility of a machine learning-based model integrating clinical-demographic variables with plasma proteins to identify individuals at high risk for PD within the general population. Although these predictors have been validated by PPMI, additional validation in a more diverse population reflective of the general community is essential.

摘要

背景与目的

在诊断前数年识别出患有帕金森病(PD)风险较高的个体对于开发预防或延缓神经退行性变的治疗方法至关重要。本研究旨在开发一种结合血浆蛋白和易于获得的临床-人口统计学变量的 PD 风险预测模型。

方法

我们使用来自英国生物库(UKB)的数据进行了一项纵向研究,该研究在英国各地招募了参与者,以确定 PD 发病的预测因子。纳入基线时具有血浆蛋白且无 PD 的参与者。通过机器学习,我们从 1463 种血浆蛋白和 93 种临床-人口统计学变量中筛选出预测因子。然后,我们使用帕金森氏病进展标志物倡议(PPMI)队列对这些预测因子进行了外部验证。为了进一步研究预测因子的时间趋势,我们在 UKB 中进行了嵌套病例对照研究。

结果

共纳入 52503 名无 PD 的参与者(中位年龄 58 岁,54%为女性)。在中位随访时间 14.0 年期间,有 751 人被诊断为 PD(中位年龄 65 岁,37%为女性)。使用正向选择方法,我们选择了一组 22 种血浆蛋白以进行最佳预测。使用基于集成树的 Light Gradient Boosting Machine(LightGBM)算法,模型的接收者操作特征曲线下面积(AUC)为 0.800(95%CI 0.785-0.815)。整合血浆蛋白和临床-人口统计学变量的 LightGBM 预测模型显示出更高的预测准确性,AUC 为 0.832(95%CI 0.815-0.849)。确定的关键预测因子包括年龄、受教育年限、创伤性脑损伤史和血清肌酐。纳入 11 种血浆蛋白(神经丝轻链、整合素亚单位αV、造血 PGD 合酶、组氨酸 N-甲基转移酶、微管聚合促进蛋白家族成员 3、外胚层发育不良 A2 受体、Latxin、白细胞介素-13 受体亚基α-1、BAG 家族分子伴侣调节剂 3、色氨酰-tRNA 合成酶和分泌颗粒蛋白-2)提高了模型的预测准确性。在 PPMI 队列中的外部验证证实了该模型的可靠性,AUC 为 0.810(95%CI 0.740-0.873)。值得注意的是,这些预测因子的变化在 PD 诊断前几年即可检测到。

讨论

我们的研究结果支持使用基于机器学习的模型整合临床-人口统计学变量与血浆蛋白来识别普通人群中 PD 风险较高的个体的潜力。尽管这些预测因子已经通过 PPMI 验证,但在更能反映普通社区的多样性人群中进行额外验证是必不可少的。

相似文献

1
Prediction of Future Parkinson Disease Using Plasma Proteins Combined With Clinical-Demographic Measures.使用血浆蛋白联合临床-人口统计学指标预测未来帕金森病。
Neurology. 2024 Aug 13;103(3):e209531. doi: 10.1212/WNL.0000000000209531. Epub 2024 Jul 8.
2
Large-scale identification of clinical and genetic predictors of motor progression in patients with newly diagnosed Parkinson's disease: a longitudinal cohort study and validation.新诊断帕金森病患者运动进展的临床和遗传预测因素的大规模识别:一项纵向队列研究及验证
Lancet Neurol. 2017 Nov;16(11):908-916. doi: 10.1016/S1474-4422(17)30328-9. Epub 2017 Sep 25.
3
Predictive Big Data Analytics: A Study of Parkinson's Disease Using Large, Complex, Heterogeneous, Incongruent, Multi-Source and Incomplete Observations.预测性大数据分析:一项使用大规模、复杂、异构、不一致、多源和不完整观测数据对帕金森病的研究。
PLoS One. 2016 Aug 5;11(8):e0157077. doi: 10.1371/journal.pone.0157077. eCollection 2016.
4
High-Throughput CSF Proteomics and Machine Learning to Identify Proteomic Signatures for Parkinson Disease Development and Progression.高通量 CSF 蛋白质组学和机器学习用于鉴定帕金森病发展和进展的蛋白质组学特征。
Neurology. 2023 Oct 3;101(14):e1434-e1447. doi: 10.1212/WNL.0000000000207725. Epub 2023 Aug 16.
5
Development of a novel dementia risk prediction model in the general population: A large, longitudinal, population-based machine-learning study.普通人群中新型痴呆风险预测模型的开发:一项基于人群的大型纵向机器学习研究。
EClinicalMedicine. 2022 Sep 23;53:101665. doi: 10.1016/j.eclinm.2022.101665. eCollection 2022 Nov.
6
Disease progression subtypes of Parkinson's disease based on milestone events.基于里程碑事件的帕金森病疾病进展亚型。
J Neurol. 2024 Oct;271(10):6791-6800. doi: 10.1007/s00415-024-12645-1. Epub 2024 Aug 26.
7
Large-scale plasma proteomics in the UK Biobank modestly improves prediction of major cardiovascular events in a population without previous cardiovascular disease.在没有先前心血管疾病的人群中,英国生物库中的大规模血浆蛋白质组学研究适度提高了对主要心血管事件的预测能力。
Eur J Prev Cardiol. 2024 Oct 10;31(14):1681-1689. doi: 10.1093/eurjpc/zwae124.
8
Characterization of Parkinson's disease using blood-based biomarkers: A multicohort proteomic analysis.利用基于血液的生物标志物对帕金森病进行特征分析:多队列蛋白质组学分析。
PLoS Med. 2019 Oct 11;16(10):e1002931. doi: 10.1371/journal.pmed.1002931. eCollection 2019 Oct.
9
Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants.使用自动化机器学习进行心血管疾病风险预测:对 423604 名英国生物库参与者的前瞻性研究。
PLoS One. 2019 May 15;14(5):e0213653. doi: 10.1371/journal.pone.0213653. eCollection 2019.
10
Uncovering Clinical Risk Factors and Predicting Severe COVID-19 Cases Using UK Biobank Data: Machine Learning Approach.利用英国生物库数据揭示临床风险因素并预测严重 COVID-19 病例:机器学习方法。
JMIR Public Health Surveill. 2021 Sep 30;7(9):e29544. doi: 10.2196/29544.

引用本文的文献

1
Parkinson's Disease: Bridging Gaps, Building Biomarkers, and Reimagining Clinical Translation.帕金森病:弥合差距、构建生物标志物及重塑临床转化
Cells. 2025 Jul 28;14(15):1161. doi: 10.3390/cells14151161.
2
Identification and validation of an explainable machine learning model for vascular depression diagnosis in the older adults: a multicenter cohort study.老年人血管性抑郁诊断的可解释机器学习模型的识别与验证:一项多中心队列研究
BMC Med. 2025 Jul 31;23(1):448. doi: 10.1186/s12916-025-04283-9.
3
Development and validation of risk prediction models for acute kidney disease in gout patients: a retrospective study using machine learning.
痛风患者急性肾疾病风险预测模型的开发与验证:一项使用机器学习的回顾性研究
Eur J Med Res. 2025 Jul 23;30(1):660. doi: 10.1186/s40001-025-02939-z.
4
Proteomic risk scores for predicting common diseases using linear and neural network models in the UK biobank.在英国生物银行中使用线性和神经网络模型预测常见疾病的蛋白质组学风险评分。
Sci Rep. 2025 Jul 1;15(1):20520. doi: 10.1038/s41598-025-06232-1.
5
An interpretable machine learning model based on computed tomography radiomics for predicting programmed death ligand 1 expression status in gastric cancer.一种基于计算机断层扫描影像组学的可解释机器学习模型,用于预测胃癌中程序性死亡配体1的表达状态。
Cancer Imaging. 2025 Mar 12;25(1):31. doi: 10.1186/s40644-025-00855-3.
6
Proximity Elongation Assay and ELISA for the Identification of Serum Diagnostic Biomarkers in Parkinson's Disease and Progressive Supranuclear Palsy.接近延伸测定法和 ELISA 用于鉴定帕金森病和进行性核上性麻痹中的血清诊断生物标志物。
Int J Mol Sci. 2024 Oct 30;25(21):11663. doi: 10.3390/ijms252111663.