Department of Cardiology, Yangling Demonstration Zone Hospital, Yangling Demonstration Zone, Xianyang, Shaanxi 712100, P.R. China.
Mol Med Rep. 2018 Jan;17(1):1555-1560. doi: 10.3892/mmr.2017.8044. Epub 2017 Nov 14.
The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the differentially expressed genes (DEGs). The DEGs were used to construct a protein‑protein interaction (PPI) network of patient samples, from which the feature genes were identified using the neighboring score method. The recursive feature elimination (RFE) algorithm was employed to select the risk genes among feature genes, which were subsequently applied to perform a support vector machine (SVM) classifier to identify the specific signature in myocardial infarction samples. Another dataset, GSE61144, was also downloaded to verify the efficacy of the classifier. A total of 724 downregulated and 483 upregulated DEGs were screened in patient samples compared with control samples in the GSE34198 dataset. The PPI network of myocardial infarction was comprised of 1,083 nodes (genes) and 46,363 lines (connections). Using the neighborhood scoring method, the top 100 feature genes in myocardial infarction samples were identified as the disease feature genes, which distinguish the myocardial infarction samples from the control samples. The RFE algorithm screened 15 risk genes, which were employed to construct a SVM classifier with an average precision of 88% to the patient sample following visualization by a confusion matrix. The predictive precision of the classifier on another microarray dataset, GSE61144, was 0.92, with an average true positive of 0.9278 and an average false positive of 0.2361. A‑kinase‑anchoring protein 12 (AKAP12) and glycine receptor α2 (GLRA2) were two risk genes in the SVM classifier. Therefore, AKAP12 and GLRA2 exert potential roles in the development of myocardial infarction, potentially by influencing cardiac contractility and protecting against ischemia‑reperfusion injury, which may provide clues in developing potential diagnostic biomarkers or therapeutic targets for myocardial infarction.
本研究旨在鉴定心肌梗死的风险基因。从基因表达综合数据库中下载包含 49 例心肌梗死样本和 48 例相应对照样本外周血的微阵列数据集 GSE34198,筛选差异表达基因(DEGs)。使用邻近评分法构建患者样本的蛋白质-蛋白质相互作用(PPI)网络,从该网络中确定特征基因。递归特征消除(RFE)算法用于从特征基因中选择风险基因,然后应用支持向量机(SVM)分类器来识别心肌梗死样本中的特定特征。还下载了另一个数据集 GSE61144 来验证分类器的功效。与 GSE34198 数据集中的对照样本相比,患者样本中筛选出 724 个下调和 483 个上调的 DEG。心肌梗死的 PPI 网络由 1083 个节点(基因)和 46363 条线(连接)组成。使用邻近评分法,确定了前 100 个特征基因作为疾病特征基因,这些基因可将心肌梗死样本与对照样本区分开来。RFE 算法筛选出 15 个风险基因,用于构建 SVM 分类器,对患者样本进行可视化后,混淆矩阵显示分类器的平均精度为 88%。该分类器对另一个微阵列数据集 GSE61144 的预测精度为 0.92,平均真阳性为 0.9278,平均假阳性为 0.2361。A-激酶锚定蛋白 12(AKAP12)和甘氨酸受体α2(GLRA2)是 SVM 分类器中的两个风险基因。因此,AKAP12 和 GLRA2 在心肌梗死的发展中可能发挥潜在作用,可能通过影响心脏收缩力和保护免受缺血再灌注损伤,这可能为开发心肌梗死潜在的诊断生物标志物或治疗靶点提供线索。