INSERM UMR_S 937, Pierre and Marie Curie University, Paris, France.
PLoS One. 2013;8(1):e50888. doi: 10.1371/journal.pone.0050888. Epub 2013 Jan 23.
Smoking is a risk factor for atherosclerosis with reported widespread effects on gene expression in circulating blood cells. We hypothesized that a molecular signature mediating the relation between smoking and atherosclerosis may be found in the transcriptome of circulating monocytes. Genome-wide expression profiles and counts of atherosclerotic plaques in carotid arteries were collected in 248 smokers and 688 non-smokers from the general population. Patterns of co-expressed genes were identified by Independent Component Analysis (ICA) and network structure of the pattern-specific gene modules was inferred by the PC-algorithm. A likelihood-based causality test was implemented to select patterns that fit models containing a path "smoking→gene expression→plaques". Robustness of the causal inference was assessed by bootstrapping. At a FDR ≤0.10, 3,368 genes were associated to smoking or plaques, of which 93% were associated to smoking only. SASH1 showed the strongest association to smoking and PPARG the strongest association to plaques. Twenty-nine gene patterns were identified by ICA. Modules containing SASH1 and PPARG did not show evidence for the "smoking→gene expression→plaques" causality model. Conversely, three modules had good support for causal effects and exhibited a network topology consistent with gene expression mediating the relation between smoking and plaques. The network with the strongest support for causal effects was connected to plaques through SLC39A8, a gene with known association to HDL-cholesterol and cellular uptake of cadmium from tobacco, while smoking was directly connected to GAS6, a gene reported to have anti-inflammatory effects in atherosclerosis and to be up-regulated in the placenta of women smoking during pregnancy. Our analysis of the transcriptome of monocytes recovered genes relevant for association to smoking and atherosclerosis, and connected genes that before, were only studied in separate contexts. Inspection of correlation structure revealed candidates that would be missed by expression-phenotype association analysis alone.
吸烟是动脉粥样硬化的一个风险因素,据报道,它对循环血细胞中的基因表达有广泛的影响。我们假设,在循环单核细胞的转录组中可能会发现一种介导吸烟与动脉粥样硬化之间关系的分子特征。在一般人群中,从 248 名吸烟者和 688 名不吸烟者中收集了颈动脉粥样硬化斑块的全基因组表达谱和斑块数量。通过独立成分分析(ICA)识别共表达基因的模式,并通过 PC 算法推断模式特异性基因模块的网络结构。实施似然因果检验来选择适合包含路径“吸烟→基因表达→斑块”的模型的模式。通过自举法评估因果推断的稳健性。在 FDR≤0.10 时,有 3368 个基因与吸烟或斑块相关,其中 93%仅与吸烟相关。SASH1 与吸烟的相关性最强,而 PPARG 与斑块的相关性最强。通过 ICA 鉴定了 29 个基因模式。包含 SASH1 和 PPARG 的模块没有显示出“吸烟→基因表达→斑块”因果模型的证据。相反,有三个模块有很好的因果效应支持,并表现出与基因表达介导吸烟与斑块之间关系一致的网络拓扑结构。具有最强因果效应支持的网络通过 SLC39A8 与斑块连接,SLC39A8 是一个与高密度脂蛋白胆固醇和烟草中镉的细胞摄取有关的基因,而吸烟与 GAS6 直接连接,GAS6 是一种在动脉粥样硬化中具有抗炎作用的基因,并且在怀孕期间吸烟的女性胎盘中有上调。我们对单核细胞转录组的分析恢复了与吸烟和动脉粥样硬化相关的基因,并连接了以前仅在单独的背景下研究的基因。对相关结构的检查揭示了仅通过表达-表型关联分析可能会错过的候选基因。