基于机器学习从静息态脑电图检测帕金森病：一项多中心研究。

Machine Learning-Based Detection of Parkinson's Disease From Resting-State EEG: A Multi-Center Study.

作者信息

Kurbatskaya Anna, Jaramillo-Jimenez Alberto, Ochoa-Gomez John Fredy, Bronnick Kolbjorn, Fernandez-Quilez Alvaro

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340700.

DOI:10.1109/EMBC40787.2023.10340700

Abstract

Resting-state EEG (rs-EEG) has been demonstrated to aid in Parkinson's disease (PD) diagnosis. In particular, the power spectral density (PSD) of low-frequency bands (δ and θ) and high-frequency bands (α and β) has been shown to be significantly different in patients with PD as compared to subjects without PD (non-PD). However, rs-EEG feature extraction and the interpretation thereof can be time-intensive and prone to examiner variability. Machine learning (ML) has the potential to automatize the analysis of rs-EEG recordings and provides a supportive tool for clinicians to ease their workload. In this work, we use rs-EEG recordings of 84 PD and 85 non-PD subjects pooled from four datasets obtained at different centers. We propose an end-to-end pipeline consisting of preprocessing, extraction of PSD features from clinically-validated frequency bands, and feature selection. Following, we assess the classification ability of the features via ML algorithms to stratify between PD and non-PD subjects. Further, we evaluate the effect of feature harmonization, given the multi-center nature of the datasets. Our validation results show, on average, an improvement in PD detection ability (69.6% vs. 75.5% accuracy) by logistic regression when harmonizing the features and performing univariate feature selection (k = 202 features). Our final results show an average global accuracy of 72.2% with balanced accuracy results for all the centers included in the study: 60.6%, 68.7%, 77.7%, and 82.2%, respectively.Clinical relevance- We present an end-to-end pipeline to extract clinically relevant features from rs-EEG recordings that can facilitate the analysis and detection of PD. Further, we provide an ML system that shows a good performance in detecting PD, even in the presence of centers with different acquisition protocols. Our results show the relevance of harmonizing features and provide a good starting point for future studies focusing on rs-EEG analysis and multi-center data.

摘要

静息态脑电图（rs-EEG）已被证明有助于帕金森病（PD）的诊断。特别是，与非帕金森病（non-PD）受试者相比，帕金森病患者低频带（δ和θ）和高频带（α和β）的功率谱密度（PSD）已显示出显著差异。然而，rs-EEG特征提取及其解释可能耗时且容易受到检查者差异的影响。机器学习（ML）有潜力实现rs-EEG记录分析的自动化，并为临床医生提供一个支持工具以减轻其工作量。在这项工作中，我们使用了从不同中心获得的四个数据集中汇总的84名帕金森病患者和85名非帕金森病受试者的rs-EEG记录。我们提出了一个端到端的流程，包括预处理、从临床验证的频带中提取PSD特征以及特征选择。随后，我们通过ML算法评估这些特征对帕金森病患者和非帕金森病受试者进行分层的分类能力。此外，考虑到数据集的多中心性质，我们评估了特征协调的效果。我们的验证结果表明，在协调特征并进行单变量特征选择（k = 202个特征）时，逻辑回归对帕金森病检测能力平均有所提高（准确率从69.6%提高到75.5%）。我们的最终结果显示平均全局准确率为72.2%，研究中纳入的所有中心的平衡准确率结果分别为：60.6%、68.7%、77.7%和82.2%。临床相关性——我们提出了一个端到端的流程，用于从rs-EEG记录中提取临床相关特征，这有助于帕金森病的分析和检测。此外，我们提供了一个ML系统，该系统在检测帕金森病方面表现良好，即使存在采集协议不同的中心。我们的结果显示了特征协调的相关性，并为未来专注于rs-EEG分析和多中心数据的研究提供了一个良好的起点。