Chung Jetli, Teo Jason
Faculty of Computing and Informatics, Universiti Malaysia Sabah, Jalan UMS, 88400, Kota Kinabalu, Sabah, Malaysia.
Advanced Machine Intelligence Research Group, Faculty of Computing and Informatics, Universiti Malaysia Sabah, Jalan UMS, 88400, Kota Kinabalu, Sabah, Malaysia.
Brain Inform. 2023 Jan 3;10(1):1. doi: 10.1186/s40708-022-00180-6.
Early prediction of mental health issues among individuals is paramount for early diagnosis and treatment by mental health professionals. One of the promising approaches to achieving fully automated computer-based approaches for predicting mental health problems is via machine learning. As such, this study aims to empirically evaluate several popular machine learning algorithms in classifying and predicting mental health problems based on a given data set, both from a single classifier approach as well as an ensemble machine learning approach. The data set contains responses to a survey questionnaire that was conducted by Open Sourcing Mental Illness (OSMI). Machine learning algorithms investigated in this study include Logistic Regression, Gradient Boosting, Neural Networks, K-Nearest Neighbours, and Support Vector Machine, as well as an ensemble approach using these algorithms. Comparisons were also made against more recent machine learning approaches, namely Extreme Gradient Boosting and Deep Neural Networks. Overall, Gradient Boosting achieved the highest overall accuracy of 88.80% followed by Neural Networks with 88.00%. This was followed by Extreme Gradient Boosting and Deep Neural Networks at 87.20% and 86.40%, respectively. The ensemble classifier achieved 85.60% while the remaining classifiers achieved between 82.40 and 84.00%. The findings indicate that Gradient Boosting provided the highest classification accuracy for this particular mental health bi-classification prediction task. In general, it was also demonstrated that the prediction results produced by all of the machine learning approaches studied here were able to achieve more than 80% accuracy, thereby indicating a highly promising approach for mental health professionals toward automated clinical diagnosis.
对个体心理健康问题进行早期预测,对于心理健康专业人员进行早期诊断和治疗至关重要。实现基于计算机的全自动心理健康问题预测方法的一个有前景的途径是通过机器学习。因此,本研究旨在基于给定数据集,从单分类器方法以及集成机器学习方法两方面,对几种流行的机器学习算法在分类和预测心理健康问题方面进行实证评估。该数据集包含对由开源精神疾病组织(OSMI)进行的调查问卷的回复。本研究中调查的机器学习算法包括逻辑回归、梯度提升、神经网络、K近邻和支持向量机,以及使用这些算法的集成方法。还与更新的机器学习方法,即极端梯度提升和深度神经网络进行了比较。总体而言,梯度提升的总体准确率最高,为88.80%,其次是神经网络,为88.00%。随后是极端梯度提升和深度神经网络,分别为87.20%和86.40%。集成分类器的准确率为85.60%,其余分类器的准确率在82.40%至84.00%之间。研究结果表明,梯度提升在这个特定的心理健康二分类预测任务中提供了最高的分类准确率。总体而言,还表明这里研究的所有机器学习方法产生的预测结果都能够达到超过80%的准确率,从而表明这是一种对心理健康专业人员进行自动化临床诊断非常有前景的方法。