Rojas-Velazquez David, Kidwai Sarah, Liu Ting Chia, El-Yacoubi Mounim A, Garssen Johan, Tonda Alberto, Lopez-Rincon Alejandro
Division of Pharmacology, Utrecht Institute for Pharmaceutical Sciences, Faculty of Science, Universiteitsweg 99, Utrecht 3508 TB, the Netherlands; Department of Data Science, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Heidelberglaan 100, Utrecht, 3508 GA, the Netherlands.
Division of Pharmacology, Utrecht Institute for Pharmaceutical Sciences, Faculty of Science, Universiteitsweg 99, Utrecht 3508 TB, the Netherlands.
Maturitas. 2025 Feb;193:108185. doi: 10.1016/j.maturitas.2024.108185. Epub 2024 Dec 23.
Given that Parkinson's disease is a progressive disorder, with symptoms that worsen over time, our goal is to enhance the diagnosis of Parkinson's disease by utilizing machine learning techniques and microbiome analysis. The primary objective is to identify specific microbiome signatures that can reproducibly differentiate patients with Parkinson's disease from healthy controls.
We used four Parkinson-related datasets from the NCBI repository, focusing on stool samples. Then, we applied a DADA2-based script for amplicon sequence processing and the Recursive Ensemble Feature Selection (REF) algorithm for biomarker discovery. The discovery dataset was PRJEB14674, while PRJNA742875, PRJEB27564, and PRJNA594156 served as testing datasets. The Extra Trees classifier was used to validate the selected features.
The Recursive Ensemble Feature Selection algorithm identified 84 features (Amplicon Sequence Variants) from the discovery dataset, achieving an accuracy of over 80%. The Extra Trees classifier demonstrated good diagnostic accuracy with an area under the receiver operating characteristic curve of 0.74. In the testing phase, the classifier achieved areas under the receiver operating characteristic curves of 0.64, 0.71, and 0.62 for the respective datasets, indicating sufficient to good diagnostic accuracy. The study identified several bacterial taxa associated with Parkinson's disease, such as Lactobacillus, Bifidobacterium, and Roseburia, which were increased in patients with the disease.
This study successfully identified microbiome signatures that can differentiate patients with Parkinson's disease from healthy controls across different datasets. These findings highlight the potential of integrating machine learning and microbiome analysis for the diagnosis of Parkinson's disease. However, further research is needed to validate these microbiome signatures and to explore their therapeutic implications in developing targeted treatments and diagnostics for Parkinson's disease.
鉴于帕金森病是一种进行性疾病,其症状会随时间恶化,我们的目标是通过利用机器学习技术和微生物组分析来加强帕金森病的诊断。主要目标是识别能够将帕金森病患者与健康对照者进行可重复区分的特定微生物组特征。
我们使用了来自NCBI数据库的四个与帕金森病相关的数据集,重点关注粪便样本。然后,我们应用基于DADA2的脚本进行扩增子序列处理,并使用递归集成特征选择(REF)算法进行生物标志物发现。发现数据集为PRJEB14674,而PRJNA742875、PRJEB27564和PRJNA594156用作测试数据集。使用极端随机树分类器来验证所选特征。
递归集成特征选择算法从发现数据集中识别出84个特征(扩增子序列变体),准确率超过80%。极端随机树分类器表现出良好的诊断准确性,受试者工作特征曲线下面积为0.74。在测试阶段,该分类器在各个数据集的受试者工作特征曲线下面积分别为0.64、0.71和0.62,表明诊断准确性足够好。该研究确定了几种与帕金森病相关的细菌分类群,如乳酸杆菌、双歧杆菌和罗斯氏菌,这些在患病患者中有所增加。
本研究成功识别出能够在不同数据集中区分帕金森病患者与健康对照者的微生物组特征。这些发现突出了整合机器学习和微生物组分析用于帕金森病诊断的潜力。然而,需要进一步研究来验证这些微生物组特征,并探索它们在开发帕金森病靶向治疗和诊断方法中的治疗意义。