Tian Qing, Yang Ning-Bo, Fan Yu, Dong Fang, Bo Qi-Jing, Zhou Fu-Chun, Zhang Ji-Cong, Li Liang, Yin Guang-Zhong, Wang Chuan-Yue, Fan Ming
Laboratory of Brain Disorders, Collaborative Innovation Center for Brain Disorders, Beijing Institute of Brain Disorders, Capital Medical University, Ministry of Science and Technology, Beijing, China.
Suzhou Guangji Hospital, The Affiliated Guangji Hospital of Soochow University, The Institute of Mental Health, Suzhou, China.
Front Psychiatry. 2022 Apr 5;13:810362. doi: 10.3389/fpsyt.2022.810362. eCollection 2022.
The search for a method that utilizes biomarkers to identify patients with schizophrenia from healthy individuals has occupied researchers for decades. However, no single indicator can be employed to achieve the good in clinical practice. We aim to develop a comprehensive machine learning pipeline based on neurocognitive and electrophysiological combined features for distinguishing schizophrenia patients from healthy people.
In the present study, 69 patients with schizophrenia and 50 healthy controls participated. Neurocognitive (contains seven specific domains of cognition) and electrophysiological [prepulse inhibition, electroencephalography (EEG) power spectrum, detrended fluctuation analysis, and fractal dimension (FD)] features were collected, all these features were taken together to generate the identification models of schizophrenia by applying logistics, random forest, and extreme gradient boosting algorithm. The classification capabilities of these models were also evaluated.
Both the neurocognitive and electrophysiological feature sets showed a good classification effect with the highest accuracy greater than 85% and AUC greater than 90%. Specifically, the performances of the combined neurocognitive and electrophysiological feature sets achieved the highest accuracy of 93.28% and AUC of 97.91%. The extreme gradient boosting algorithm as a whole presented more stably and precisely in classification efficiency.
The highest classification accuracy of 93.28% by combination of neurocognitive and electrophysiological features shows that both measurements are appropriate indicators to be used in discriminating schizophrenia patients and healthy individuals. Also, among three algorithms, extreme gradient boosting had better classified performances than logistics and random forest algorithms.
数十年来,研究人员一直在寻找一种利用生物标志物从健康个体中识别精神分裂症患者的方法。然而,在临床实践中,没有单一指标能够达到理想效果。我们旨在开发一种基于神经认知和电生理联合特征的综合机器学习流程,以区分精神分裂症患者和健康人。
在本研究中,69例精神分裂症患者和50名健康对照参与其中。收集了神经认知(包含七个特定认知领域)和电生理[预脉冲抑制、脑电图(EEG)功率谱、去趋势波动分析和分形维数(FD)]特征,通过应用逻辑回归、随机森林和极端梯度提升算法,将所有这些特征结合起来生成精神分裂症的识别模型。还评估了这些模型的分类能力。
神经认知和电生理特征集均显示出良好的分类效果,最高准确率大于85%,曲线下面积(AUC)大于90%。具体而言,神经认知和电生理联合特征集的表现达到了最高准确率93.28%和AUC 97.91%。极端梯度提升算法在分类效率上整体表现得更稳定、更精确。
神经认知和电生理特征相结合的最高分类准确率为93.28%,表明这两种测量方法都是区分精神分裂症患者和健康个体的合适指标。此外,在三种算法中,极端梯度提升算法的分类性能优于逻辑回归和随机森林算法。