基于递归特征消除算法和支持向量机分类器的心肌梗死相关风险基因鉴定。

Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier.

机构信息

Department of Cardiology, Yangling Demonstration Zone Hospital, Yangling Demonstration Zone, Xianyang, Shaanxi 712100, P.R. China.

出版信息

Mol Med Rep. 2018 Jan;17(1):1555-1560. doi: 10.3892/mmr.2017.8044. Epub 2017 Nov 14.

DOI:10.3892/mmr.2017.8044

PMID:29138828

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5780094/

Abstract

The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the differentially expressed genes (DEGs). The DEGs were used to construct a protein‑protein interaction (PPI) network of patient samples, from which the feature genes were identified using the neighboring score method. The recursive feature elimination (RFE) algorithm was employed to select the risk genes among feature genes, which were subsequently applied to perform a support vector machine (SVM) classifier to identify the specific signature in myocardial infarction samples. Another dataset, GSE61144, was also downloaded to verify the efficacy of the classifier. A total of 724 downregulated and 483 upregulated DEGs were screened in patient samples compared with control samples in the GSE34198 dataset. The PPI network of myocardial infarction was comprised of 1,083 nodes (genes) and 46,363 lines (connections). Using the neighborhood scoring method, the top 100 feature genes in myocardial infarction samples were identified as the disease feature genes, which distinguish the myocardial infarction samples from the control samples. The RFE algorithm screened 15 risk genes, which were employed to construct a SVM classifier with an average precision of 88% to the patient sample following visualization by a confusion matrix. The predictive precision of the classifier on another microarray dataset, GSE61144, was 0.92, with an average true positive of 0.9278 and an average false positive of 0.2361. A‑kinase‑anchoring protein 12 (AKAP12) and glycine receptor α2 (GLRA2) were two risk genes in the SVM classifier. Therefore, AKAP12 and GLRA2 exert potential roles in the development of myocardial infarction, potentially by influencing cardiac contractility and protecting against ischemia‑reperfusion injury, which may provide clues in developing potential diagnostic biomarkers or therapeutic targets for myocardial infarction.

摘要

本研究旨在鉴定心肌梗死的风险基因。从基因表达综合数据库中下载包含 49 例心肌梗死样本和 48 例相应对照样本外周血的微阵列数据集 GSE34198，筛选差异表达基因（DEGs）。使用邻近评分法构建患者样本的蛋白质-蛋白质相互作用（PPI）网络，从该网络中确定特征基因。递归特征消除（RFE）算法用于从特征基因中选择风险基因，然后应用支持向量机（SVM）分类器来识别心肌梗死样本中的特定特征。还下载了另一个数据集 GSE61144 来验证分类器的功效。与 GSE34198 数据集中的对照样本相比，患者样本中筛选出 724 个下调和 483 个上调的 DEG。心肌梗死的 PPI 网络由 1083 个节点（基因）和 46363 条线（连接）组成。使用邻近评分法，确定了前 100 个特征基因作为疾病特征基因，这些基因可将心肌梗死样本与对照样本区分开来。RFE 算法筛选出 15 个风险基因，用于构建 SVM 分类器，对患者样本进行可视化后，混淆矩阵显示分类器的平均精度为 88%。该分类器对另一个微阵列数据集 GSE61144 的预测精度为 0.92，平均真阳性为 0.9278，平均假阳性为 0.2361。A-激酶锚定蛋白 12（AKAP12）和甘氨酸受体α2（GLRA2）是 SVM 分类器中的两个风险基因。因此，AKAP12 和 GLRA2 在心肌梗死的发展中可能发挥潜在作用，可能通过影响心脏收缩力和保护免受缺血再灌注损伤，这可能为开发心肌梗死潜在的诊断生物标志物或治疗靶点提供线索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5256/5780094/70e1fe99a88b/MMR-17-01-1555-g01.jpg

相似文献

Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier.基于递归特征消除算法和支持向量机分类器的心肌梗死相关风险基因鉴定。

Mol Med Rep. 2018 Jan;17(1):1555-1560. doi: 10.3892/mmr.2017.8044. Epub 2017 Nov 14.

Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction.风险基因识别和支持向量机学习构建心肌梗死早期诊断模型。

Mol Med Rep. 2020 Sep;22(3):1775-1782. doi: 10.3892/mmr.2020.11247. Epub 2020 Jun 17.

Identification of feature autophagy-related genes in patients with acute myocardial infarction based on bioinformatics analyses.基于生物信息学分析鉴定急性心肌梗死患者的特征自噬相关基因。

Biosci Rep. 2020 Jul 31;40(7). doi: 10.1042/BSR20200790.

Establishment of a SVM classifier to predict recurrence of ovarian cancer.建立 SVM 分类器预测卵巢癌复发。

Mol Med Rep. 2018 Oct;18(4):3589-3598. doi: 10.3892/mmr.2018.9362. Epub 2018 Aug 8.

Construction of a 26‑feature gene support vector machine classifier for smoking and non‑smoking lung adenocarcinoma sample classification.构建一个 26 特征基因支持向量机分类器，用于吸烟和非吸烟肺腺癌样本分类。

Mol Med Rep. 2018 Feb;17(2):3005-3013. doi: 10.3892/mmr.2017.8220. Epub 2017 Dec 7.

Identification of Featured Metabolism-Related Genes in Patients with Acute Myocardial Infarction.鉴定急性心肌梗死患者的特征代谢相关基因。

Dis Markers. 2020 Nov 28;2020:8880004. doi: 10.1155/2020/8880004. eCollection 2020.

A multigene support vector machine predictor for metastasis of cutaneous melanoma.一种用于预测皮肤黑色素瘤转移的多基因支持向量机预测器。

Mol Med Rep. 2018 Feb;17(2):2907-2914. doi: 10.3892/mmr.2017.8219. Epub 2017 Dec 7.

A 15-gene signature for prediction of colon cancer recurrence and prognosis based on SVM.基于支持向量机的用于预测结肠癌复发和预后的15基因特征。

Gene. 2017 Mar 10;604:33-40. doi: 10.1016/j.gene.2016.12.016. Epub 2016 Dec 18.

An Efficient Feature Selection Strategy Based on Multiple Support Vector Machine Technology with Gene Expression Data.基于基因表达数据的多支持向量机技术的高效特征选择策略。

Biomed Res Int. 2018 Aug 30;2018:7538204. doi: 10.1155/2018/7538204. eCollection 2018.

Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods.通过 MetaDE 和 SVM 分类器方法鉴定转移性乳腺癌的特征基因。

Mol Med Rep. 2018 Mar;17(3):4281-4290. doi: 10.3892/mmr.2018.8398. Epub 2018 Jan 9.

引用本文的文献

Integrative Genetic Approach Facilitates Precision Strategies for Acute Myocardial Infarction.综合遗传方法有助于制定急性心肌梗死的精准策略。

Genes (Basel). 2023 Jun 26;14(7):1340. doi: 10.3390/genes14071340.

Identification of Risk Genes Associated with Myocardial Infarction-Big Data Analysis and Literature Review.识别与心肌梗死相关的风险基因：大数据分析和文献回顾。

Int J Mol Sci. 2022 Nov 30;23(23):15008. doi: 10.3390/ijms232315008.

Short- and long-term mortality prediction after an acute ST-elevation myocardial infarction (STEMI) in Asians: A machine learning approach.亚洲人急性ST段抬高型心肌梗死（STEMI）后的短期和长期死亡率预测：一种机器学习方法。

PLoS One. 2021 Aug 2;16(8):e0254894. doi: 10.1371/journal.pone.0254894. eCollection 2021.

Analysis of Differential Gene Expression in Three Common Rat Models of Diastolic Dysfunction.三种常见舒张功能障碍大鼠模型中的差异基因表达分析

Front Cardiovasc Med. 2018 Feb 21;5:11. doi: 10.3389/fcvm.2018.00011. eCollection 2018.

本文引用的文献

Correlation Between Posttraumatic Growth and Posttraumatic Stress Disorder Symptoms Based on Pearson Correlation Coefficient: A Meta-Analysis.基于皮尔逊相关系数的创伤后成长与创伤后应激障碍症状之间的相关性：一项元分析

J Nerv Ment Dis. 2017 May;205(5):380-389. doi: 10.1097/NMD.0000000000000605.

Loss of AKAP150 promotes pathological remodelling and heart failure propensity by disrupting calcium cycling and contractile reserve.AKAP150的缺失通过破坏钙循环和收缩储备促进病理重塑和心力衰竭倾向。

Cardiovasc Res. 2017 Feb;113(2):147-159. doi: 10.1093/cvr/cvw221. Epub 2016 Nov 17.

Multiclass Classification for the Differential Diagnosis on the ADHD Subtypes Using Recursive Feature Elimination and Hierarchical Extreme Learning Machine: Structural MRI Study.基于递归特征消除和分层极限学习机的多动症亚型鉴别诊断多分类：结构磁共振成像研究

PLoS One. 2016 Aug 8;11(8):e0160697. doi: 10.1371/journal.pone.0160697. eCollection 2016.

AKAP150 participates in calcineurin/NFAT activation during the down-regulation of voltage-gated K(+) currents in ventricular myocytes following myocardial infarction.在心肌梗死后心室肌细胞电压门控钾电流下调过程中，A激酶锚定蛋白150（AKAP150）参与钙调神经磷酸酶/活化T细胞核因子（calcineurin/NFAT）的激活。

Cell Signal. 2016 Jul;28(7):733-40. doi: 10.1016/j.cellsig.2015.12.015. Epub 2015 Dec 24.

cAMP-dependent Protein Kinase (PKA) Signaling Is Impaired in the Diabetic Heart.环磷酸腺苷（cAMP）依赖性蛋白激酶（PKA）信号传导在糖尿病心脏中受损。

J Biol Chem. 2015 Dec 4;290(49):29250-8. doi: 10.1074/jbc.M115.681767. Epub 2015 Oct 14.

A mitotic kinase scaffold depleted in testicular seminomas impacts spindle orientation in germ line stem cells.一种在睾丸精原细胞瘤中缺失的有丝分裂激酶支架会影响生殖系干细胞中的纺锤体定向。

Elife. 2015 Sep 25;4:e09384. doi: 10.7554/eLife.09384.

Protein complex detection in PPI networks based on data integration and supervised learning method.基于数据整合和监督学习方法的蛋白质-蛋白质相互作用网络中的蛋白质复合物检测

BMC Bioinformatics. 2015;16 Suppl 12(Suppl 12):S3. doi: 10.1186/1471-2105-16-S12-S3. Epub 2015 Aug 25.

Assessment and diagnostic relevance of novel serum biomarkers for early decision of ST-elevation myocardial infarction.新型血清生物标志物对ST段抬高型心肌梗死早期诊断的评估及诊断相关性

Oncotarget. 2015 May 30;6(15):12970-83. doi: 10.18632/oncotarget.4001.

Prioritization of cancer-related genomic variants by SNP association network.通过单核苷酸多态性关联网络对癌症相关基因组变异进行优先级排序。

Cancer Inform. 2015 Apr 1;14(Suppl 2):57-70. doi: 10.4137/CIN.S17288. eCollection 2015.

ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap.ClustVis：一种使用主成分分析和热图可视化多变量数据聚类的网络工具。

Nucleic Acids Res. 2015 Jul 1;43(W1):W566-70. doi: 10.1093/nar/gkv468. Epub 2015 May 12.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于递归特征消除算法和支持向量机分类器的心肌梗死相关风险基因鉴定。

Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献