Department of Virology and Experimental Therapy, Aggeu Magalhães Research Center-CPqAM/FIOCRUZ, Recife, Brazil.
PLoS One. 2010 Jun 23;5(6):e11267. doi: 10.1371/journal.pone.0011267.
Symptomatic infection by dengue virus (DENV) can range from dengue fever (DF) to dengue haemorrhagic fever (DHF), however, the determinants of DF or DHF progression are not completely understood. It is hypothesised that host innate immune response factors are involved in modulating the disease outcome and the expression levels of genes involved in this response could be used as early prognostic markers for disease severity.
METHODOLOGY/PRINCIPAL FINDINGS: mRNA expression levels of genes involved in DENV innate immune responses were measured using quantitative real time PCR (qPCR). Here, we present a novel application of the support vector machines (SVM) algorithm to analyze the expression pattern of 12 genes in peripheral blood mononuclear cells (PBMCs) of 28 dengue patients (13 DHF and 15 DF) during acute viral infection. The SVM model was trained using gene expression data of these genes and achieved the highest accuracy of approximately 85% with leave-one-out cross-validation. Through selective removal of gene expression data from the SVM model, we have identified seven genes (MYD88, TLR7, TLR3, MDA5, IRF3, IFN-alpha and CLEC5A) that may be central in differentiating DF patients from DHF, with MYD88 and TLR7 observed to be the most important. Though the individual removal of expression data of five other genes had no impact on the overall accuracy, a significant combined role was observed when the SVM model of the two main genes (MYD88 and TLR7) was re-trained to include the five genes, increasing the overall accuracy to approximately 96%.
CONCLUSIONS/SIGNIFICANCE: Here, we present a novel use of the SVM algorithm to classify DF and DHF patients, as well as to elucidate the significance of the various genes involved. It was observed that seven genes are critical in classifying DF and DHF patients: TLR3, MDA5, IRF3, IFN-alpha, CLEC5A, and the two most important MYD88 and TLR7. While these preliminary results are promising, further experimental investigation is necessary to validate their specific roles in dengue disease.
登革热病毒(DENV)引起的症状感染范围从登革热(DF)到登革出血热(DHF)不等,但DF 或 DHF 进展的决定因素尚不完全清楚。据推测,宿主先天免疫反应因素参与调节疾病结局,并且该反应涉及的基因表达水平可以用作疾病严重程度的早期预后标志物。
方法/主要发现:使用定量实时 PCR(qPCR)测量参与 DENV 先天免疫反应的基因的 mRNA 表达水平。在这里,我们提出了支持向量机(SVM)算法的新应用,以分析 28 例登革热患者(13 例 DHF 和 15 例 DF)在急性病毒感染期间外周血单核细胞(PBMC)中 12 个基因的表达模式。SVM 模型使用这些基因的基因表达数据进行训练,通过留一法交叉验证,最高准确率约为 85%。通过从 SVM 模型中选择性地去除基因表达数据,我们已经确定了七个可能在区分 DF 患者和 DHF 患者中起核心作用的基因(MYD88、TLR7、TLR3、MDA5、IRF3、IFN-α和 CLEC5A),其中 MYD88 和 TLR7 观察到最为重要。尽管单独去除五个其他基因的表达数据对整体准确性没有影响,但当重新训练包含这两个主要基因(MYD88 和 TLR7)的 SVM 模型时,观察到它们的联合作用非常显著,从而将整体准确性提高到约 96%。
结论/意义:在这里,我们提出了 SVM 算法的新用途,用于对 DF 和 DHF 患者进行分类,并阐明了涉及的各种基因的意义。观察到七个基因在区分 DF 和 DHF 患者方面至关重要:TLR3、MDA5、IRF3、IFN-α、CLEC5A,以及两个最重要的 MYD88 和 TLR7。虽然这些初步结果很有希望,但需要进一步的实验研究来验证它们在登革热疾病中的具体作用。