Zhang Yujun, Lian Qiufang, Nie Yanwu, Zhao Wei
Data Management Center, Xianyang Hospital, Yan'an University, Xianyang, China.
Department of Cardiology, Xianyang Hospital, Yan'an University, Xianyang, China.
Front Cardiovasc Med. 2024 Jul 11;11:1414974. doi: 10.3389/fcvm.2024.1414974. eCollection 2024.
Atrial fibrillation (AF) is a common persistent arrhythmia characterized by rapid and chaotic atrial electrical activity, potentially leading to severe complications such as thromboembolism, heart failure, and stroke, significantly affecting patient quality of life and safety. As the global population ages, the prevalence of AF is on the rise, placing considerable strains on individuals and healthcare systems. This study utilizes bioinformatics and Mendelian Randomization (MR) to analyze transcriptome data and genome-wide association study (GWAS) summary statistics, aiming to identify biomarkers causally associated with AF and explore their potential pathogenic pathways.
We obtained AF microarray datasets GSE41177 and GSE79768 from the Gene Expression Omnibus (GEO) database, merged them, and corrected for batch effects to pinpoint differentially expressed genes (DEGs). We gathered exposure data from expression quantitative trait loci (eQTL) and outcome data from AF GWAS through the IEU Open GWAS database. We employed inverse variance weighting (IVW), MR-Egger, weighted median, and weighted model approaches for MR analysis to assess exposure-outcome causality. IVW was the primary method, supplemented by other techniques. The robustness of our results was evaluated using Cochran's Q test, MR-Egger intercept, MR-PRESSO, and leave-one-out sensitivity analysis. A "Veen" diagram visualized the overlap of DEGs with significant eQTL genes from MR analysis, referred to as common genes (CGs). Additional analyses, including Gene Ontology (GO) enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and immune cell infiltration studies, were conducted on these intersecting genes to reveal their roles in AF pathogenesis.
The combined dataset revealed 355 differentially expressed genes (DEGs), with 228 showing significant upregulation and 127 downregulated. Mendelian randomization (MR) analysis identified that the autocrine motility factor receptor (AMFR) [IVW: OR = 0.977; 95% CI, 0.956-0.998; = 0.030], leucine aminopeptidase 3 (LAP3) [IVW: OR = 0.967; 95% CI, 0.934-0.997; = 0.048], Rab acceptor 1 (RABAC1) [IVW: OR = 0.928; 95% CI, 0.875-0.985; = 0.015], and tryptase beta 2 (TPSB2) [IVW: OR = 0.971; 95% CI, 0.943-0.999; = 0.049] are associated with a reduced risk of atrial fibrillation (AF). Conversely, GTPase-activating SH3 domain-binding protein 2 (G3BP2) [IVW: OR = 1.030; 95% CI, 1.004-1.056; = 0.024], integrin subunit beta 2 (ITGB2) [IVW: OR = 1.050; 95% CI, 1.017-1.084; = 0.003], glutaminyl-peptide cyclotransferase (QPCT) [IVW: OR = 1.080; 95% CI, 1.010-0.997; = 1.154], and tripartite motif containing 22 (TRIM22) [IVW: OR = 1.048; 95% CI, 1.003-1.095; = 0.035] are positively associated with AF risk. Sensitivity analyses indicated a lack of heterogeneity or horizontal pleiotropy ( > 0.05), and leave-one-out analysis did not reveal any single nucleotide polymorphisms (SNPs) impacting the MR results significantly. GO and KEGG analyses showed that CG is involved in processes such as protein polyubiquitination, neutrophil degranulation, specific and tertiary granule formation, protein-macromolecule adaptor activity, molecular adaptor activity, and the SREBP signaling pathway, all significantly enriched. The analysis of immune cell infiltration demonstrated associations of CG with various immune cells, including plasma cells, CD8T cells, resting memory CD4T cells, regulatory T cells (Tregs), gamma delta T cells, activated NK cells, activated mast cells, and neutrophils.
By integrating bioinformatics and MR approaches, genes such as AMFR, G3BP2, ITGB2, LAP3, QPCT, RABAC1, TPSB2, and TRIM22 are identified as causally linked to AF, enhancing our understanding of its molecular foundations. This strategy may facilitate the development of more precise biomarkers and therapeutic targets for AF diagnosis and treatment.
心房颤动(AF)是一种常见的持续性心律失常,其特征是心房电活动快速且紊乱,可能导致严重并发症,如血栓栓塞、心力衰竭和中风,显著影响患者的生活质量和安全性。随着全球人口老龄化,AF的患病率呈上升趋势,给个人和医疗保健系统带来了相当大的压力。本研究利用生物信息学和孟德尔随机化(MR)分析转录组数据和全基因组关联研究(GWAS)汇总统计数据,旨在识别与AF因果相关的生物标志物,并探索其潜在的致病途径。
我们从基因表达综合数据库(GEO)中获取了AF微阵列数据集GSE41177和GSE79768,将它们合并,并校正批次效应以确定差异表达基因(DEG)。我们通过IEU开放GWAS数据库收集了来自表达数量性状位点(eQTL)的暴露数据和来自AF GWAS的结果数据。我们采用逆方差加权(IVW)、MR-Egger、加权中位数和加权模型方法进行MR分析,以评估暴露-结果因果关系。IVW是主要方法,辅以其他技术。我们使用Cochran's Q检验、MR-Egger截距、MR-PRESSO和留一法敏感性分析来评估结果的稳健性。一个“Veen”图可视化了DEG与MR分析中具有显著eQTL基因的重叠,这些基因被称为共同基因(CG)。对这些相交基因进行了额外的分析,包括基因本体(GO)富集、京都基因与基因组百科全书(KEGG)途径和免疫细胞浸润研究,以揭示它们在AF发病机制中的作用。
合并后的数据集显示有355个差异表达基因(DEG),其中228个显著上调而127个下调。孟德尔随机化(MR)分析确定,自分泌运动因子受体(AMFR)[IVW:OR = 0.977;95% CI,0.956 - 0.998;P = 0.030]、亮氨酸氨肽酶3(LAP3)[IVW:OR = 0.967;95% CI,0.934 - 0.997;P = 0.048]、Rab受体1(RABAC1)[IVW:OR = 0.928;95% CI,0.875 - 0.985;P = 0.015]和类胰蛋白酶β2(TPSB2)[IVW:OR = 0.971;95% CI,0.943 - 0.999;P = 0.049]与心房颤动(AF)风险降低相关。相反,GTP酶激活SH3结构域结合蛋白2(G3BP2)[IVW:OR = 1.030;95% CI,1.004 - 1.056;P = 0.024]、整合素亚基β2(ITGB2)[IVW:OR = 1.050;95% CI,1.017 - 1.084;P = 0.003]、谷氨酰胺基肽环化转移酶(QPCT)[IVW:OR = 1.080;95% CI,1.010 - 0.997;P = 1.154]和含三联基序蛋白22(TRIM22)[IVW:OR = 1.048;95% CI,1.003 - 1.095;P = 0.035]与AF风险呈正相关。敏感性分析表明缺乏异质性或水平多效性(P > 0.05),留一法分析未发现任何对MR结果有显著影响的单核苷酸多态性(SNP)。GO和KEGG分析表明,CG参与蛋白质多泛素化、中性粒细胞脱颗粒、特异性和三级颗粒形成、蛋白质 - 大分子衔接子活性、分子衔接子活性以及SREBP信号通路等过程,所有这些均显著富集。免疫细胞浸润分析表明CG与各种免疫细胞相关,包括浆细胞、CD8T细胞、静息记忆CD4T细胞、调节性T细胞(Tregs)、γδT细胞、活化的NK细胞、活化的肥大细胞和中性粒细胞。
通过整合生物信息学和MR方法,鉴定出AMFR、G3BP2、ITGB2、LAP3、QPCT、RABAC1、TPSB2和TRIM22等基因与AF存在因果联系,增强了我们对其分子基础的理解。该策略可能有助于开发更精确的生物标志物和治疗靶点用于AF的诊断和治疗。