Sarveswaran Tamilarasi, Rajangam Vijayarajan
School of Electronics Engineering, Vellore Institute of Technology, Chennai, India.
Centre for Healthcare Advancement, Innovation and Research, School of Electronics Engineering, Vellore Institute of Technology, Chennai, India.
Sci Rep. 2025 Mar 25;15(1):10257. doi: 10.1038/s41598-025-93912-7.
Schizophrenia is a complicated mental condition marked by disruptions in thought processes, perceptions, and emotional responses, which can cause severe impairment in everyday functioning. sMRI is a non-invasive neuroimaging technology that visualizes the brain's structure while providing precise information on its anatomy and potential problems. This paper investigates the role of multidimensional Convolutional Neural Network (CNN) architectures: 1D-CNN, 2D-CNN and 3D-CNN, using the DWT subbands of sMRI data. 1D-CNN involves energy features extracted from the CD subband of sMRI data. The sum of gradient magnitudes of CD subband, known as energy feature, highlights diagonal high frequency elements associated with schizophrenia. 2D-CNN uses the CH subband decomposed by DWT that enables feature extraction from horizontal high frequency coefficients of sMRI data. In the case of 3D-CNNs, the CV subband is used which leads to volumetric feature extraction from vertical high frequency coefficients. Feature extraction in DWT domain explores textural changes, edges, coarse and fine details present in sMRI data from which the multidimensional feature extraction is carried out for classification.Through maximum voting technique, the proposed model optimizes schizophrenia classification from the multidimensional CNN models. The generalization of the proposed model for the two datasets proves convincing in improving the classification accuracy. The multidimensional CNN architectures achieve an average accuracy of 93.2%, 95.8%, and 98.0%, respectively, while the proposed model achieves an average accuracy of 98.9%.
精神分裂症是一种复杂的精神疾病,其特征是思维过程、感知和情感反应受到干扰,这可能导致日常功能严重受损。结构磁共振成像(sMRI)是一种非侵入性神经成像技术,可在可视化大脑结构的同时提供有关其解剖结构和潜在问题的精确信息。本文使用sMRI数据的离散小波变换(DWT)子带,研究了多维卷积神经网络(CNN)架构(1D-CNN、2D-CNN和3D-CNN)的作用。1D-CNN涉及从sMRI数据的CD子带中提取的能量特征。CD子带的梯度幅值之和,即能量特征,突出了与精神分裂症相关的对角高频元素。2D-CNN使用由DWT分解的CH子带,从而能够从sMRI数据的水平高频系数中提取特征。对于3D-CNN,使用CV子带,从而能够从垂直高频系数中进行体素特征提取。在DWT域中的特征提取探索了sMRI数据中存在的纹理变化、边缘、粗细细节,从中进行多维特征提取以进行分类。通过最大投票技术,所提出的模型从多维CNN模型中优化了精神分裂症分类。所提出的模型在两个数据集上的泛化能力在提高分类准确率方面表现令人信服。多维CNN架构的平均准确率分别达到93.2%、95.8%和98.0%,而所提出的模型的平均准确率达到98.9%。