Wang Fengfeng, Cheung Chi Wai, Wong Stanley Sau Ching
Department of Anaesthesiology, School of Clinical Medicine, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China.
Front Genet. 2023 Mar 29;14:1026672. doi: 10.3389/fgene.2023.1026672. eCollection 2023.
The prevalence rate of depression is higher in patients with fibromyalgia syndrome, but this is often unrecognized in patients with chronic pain. Given that depression is a common major barrier in the management of patients with fibromyalgia syndrome, an objective tool that reliably predicts depression in patients with fibromyalgia syndrome could significantly enhance the diagnostic accuracy. Since pain and depression can cause each other and worsen each other, we wonder if pain-related genes can be used to differentiate between those with major depression from those without. This study developed a support vector machine model combined with principal component analysis to differentiate major depression in fibromyalgia syndrome patients using a microarray dataset, including 25 fibromyalgia syndrome patients with major depression, and 36 patients without major depression. Gene co-expression analysis was used to select gene features to construct support vector machine model. The principal component analysis can help reduce the number of data dimensions without much loss of information, and identify patterns in data easily. The 61 samples available in the database were not enough for learning based methods and cannot represent every possible variation of each patient. To address this issue, we adopted Gaussian noise to generate a large amount of simulated data for training and testing of the model. The ability of support vector machine model to differentiate major depression using microarray data was measured as accuracy. Different structural co-expression patterns were identified for 114 genes involved in pain signaling pathway by two-sample KS test ( < 0.001 for the maximum deviation D = 0.11 > D = 0.05), indicating the aberrant co-expression patterns in fibromyalgia syndrome patients. Twenty hub gene features were further selected based on co-expression analysis to construct the model. The principal component analysis reduced the dimension of the training samples from 20 to 16, since 16 components were needed to retain more than 90% of the original variance. The support vector machine model was able to differentiate between those with major depression from those without in fibromyalgia syndrome patients with an average accuracy of 93.22% based on the expression levels of the selected hub gene features. These findings would contribute key information that can be used to develop a clinical decision-making tool for the data-driven, personalized optimization of diagnosing depression in patients with fibromyalgia syndrome.
纤维肌痛综合征患者中抑郁症的患病率较高,但在慢性疼痛患者中这一点常常未被认识到。鉴于抑郁症是纤维肌痛综合征患者管理中的一个常见主要障碍,一种能够可靠预测纤维肌痛综合征患者抑郁症的客观工具可以显著提高诊断准确性。由于疼痛和抑郁症会相互影响并加重彼此,我们想知道与疼痛相关的基因是否可用于区分患有重度抑郁症的患者和未患该病的患者。本研究开发了一种结合主成分分析的支持向量机模型,使用一个微阵列数据集来区分纤维肌痛综合征患者中的重度抑郁症,该数据集包括25名患有重度抑郁症的纤维肌痛综合征患者和36名未患重度抑郁症的患者。基因共表达分析用于选择基因特征以构建支持向量机模型。主成分分析有助于在信息损失不大的情况下减少数据维度,并轻松识别数据中的模式。数据库中可用的61个样本对于基于学习的方法来说不够,并且无法代表每个患者的所有可能变异。为了解决这个问题,我们采用高斯噪声生成大量模拟数据用于模型的训练和测试。支持向量机模型使用微阵列数据区分重度抑郁症的能力以准确率来衡量。通过双样本KS检验(最大偏差D = 0.11时<0.001,D = 0.05时),确定了参与疼痛信号通路的114个基因的不同结构共表达模式,表明纤维肌痛综合征患者中存在异常的共表达模式。基于共表达分析进一步选择了20个核心基因特征来构建模型。主成分分析将训练样本的维度从20维减少到16维,因为需要16个成分来保留超过90%的原始方差。基于所选核心基因特征的表达水平,支持向量机模型能够区分纤维肌痛综合征患者中患有重度抑郁症和未患该病的患者,平均准确率为93.22%。这些发现将提供关键信息,可用于开发一种临床决策工具,以实现数据驱动的、针对纤维肌痛综合征患者抑郁症诊断的个性化优化。