Rangaprakash D, Odemuyiwa Toluwanimi, Narayana Dutt D, Deshpande Gopikrishna
Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, MA, USA.
Department of Radiology, Harvard Medical School, Boston, MA, USA.
Brain Inform. 2020 Nov 26;7(1):19. doi: 10.1186/s40708-020-00120-2.
Various machine-learning classification techniques have been employed previously to classify brain states in healthy and disease populations using functional magnetic resonance imaging (fMRI). These methods generally use supervised classifiers that are sensitive to outliers and require labeling of training data to generate a predictive model. Density-based clustering, which overcomes these issues, is a popular unsupervised learning approach whose utility for high-dimensional neuroimaging data has not been previously evaluated. Its advantages include insensitivity to outliers and ability to work with unlabeled data. Unlike the popular k-means clustering, the number of clusters need not be specified. In this study, we compare the performance of two popular density-based clustering methods, DBSCAN and OPTICS, in accurately identifying individuals with three stages of cognitive impairment, including Alzheimer's disease. We used static and dynamic functional connectivity features for clustering, which captures the strength and temporal variation of brain connectivity respectively. To assess the robustness of clustering to noise/outliers, we propose a novel method called recursive-clustering using additive-noise (R-CLAN). Results demonstrated that both clustering algorithms were effective, although OPTICS with dynamic connectivity features outperformed in terms of cluster purity (95.46%) and robustness to noise/outliers. This study demonstrates that density-based clustering can accurately and robustly identify diagnostic classes in an unsupervised way using brain connectivity.
以前曾采用各种机器学习分类技术,利用功能磁共振成像(fMRI)对健康人群和疾病人群的脑状态进行分类。这些方法通常使用对异常值敏感的监督分类器,并且需要对训练数据进行标记以生成预测模型。基于密度的聚类克服了这些问题,是一种流行的无监督学习方法,其在高维神经成像数据方面的效用此前尚未得到评估。其优点包括对异常值不敏感以及能够处理未标记数据。与流行的k均值聚类不同,无需指定聚类的数量。在本研究中,我们比较了两种流行的基于密度的聚类方法DBSCAN和OPTICS在准确识别包括阿尔茨海默病在内的三个认知障碍阶段个体方面的性能。我们使用静态和动态功能连接特征进行聚类,分别捕捉脑连接的强度和时间变化。为了评估聚类对噪声/异常值的鲁棒性,我们提出了一种名为使用加性噪声的递归聚类(R-CLAN)的新方法。结果表明,两种聚类算法都是有效的,尽管具有动态连接特征的OPTICS在聚类纯度(95.46%)和对噪声/异常值的鲁棒性方面表现更优。这项研究表明,基于密度的聚类可以使用脑连接以无监督的方式准确且稳健地识别诊断类别。