Department of Computer Science, Xavier University of Louisiana, New Orleans, Louisiana, United States of America.
PLoS One. 2013 Oct 30;8(10):e78868. doi: 10.1371/journal.pone.0078868. eCollection 2013.
Single-nucleotide polymorphisms (SNPs) contribute to the between-individual expression variation of many genes. A regulatory (trait-associated) SNP is usually located near or within a (host) gene, possibly influencing the gene's transcription or/and post-transcriptional modification. But its targets may also include genes that are physically farther away from it. A heuristic explanation of such multiple-target interferences is that the host gene transfers the SNP genotypic effects to the distant gene(s) by a transcriptional or signaling cascade. These connections between the host genes (regulators) and the distant genes (targets) make the genetic analysis of gene expression traits a promising approach for identifying unknown regulatory relationships. In this study, through a mixed model analysis of multi-source digital expression profiling for 140 human lymphocyte cell lines (LCLs) and the genotypes distributed by the international HapMap project, we identified 45 thousands of potential SNP-induced regulatory relationships among genes (the significance level for the underlying associations between expression traits and SNP genotypes was set at FDR < 0.01). We grouped the identified relationships into four classes (paradigms) according to the two different mechanisms by which the regulatory SNPs affect their cis- and trans- regulated genes, modifying mRNA level or altering transcript splicing patterns. We further organized the relationships in each class into a set of network modules with the cis- regulated genes as hubs. We found that the target genes in a network module were often characterized by significant functional similarity, and the distributions of the target genes in three out of the four networks roughly resemble a power-law, a typical pattern of gene networks obtained from mutation experiments. By two case studies, we also demonstrated that significant biological insights can be inferred from the identified network modules.
单核苷酸多态性 (SNP) 导致许多基因在个体间的表达变化。调控(性状相关)SNP 通常位于(宿主)基因附近或内部,可能影响基因的转录或/和转录后修饰。但其靶标也可能包括与其物理距离较远的基因。这种多靶干扰的启发式解释是,宿主基因通过转录或信号级联将 SNP 基因型效应传递给远处的基因。宿主基因(调节剂)和远处基因(靶标)之间的这些连接使得基因表达性状的遗传分析成为识别未知调节关系的一种很有前途的方法。在这项研究中,通过对 140 个人类淋巴细胞系 (LCL) 的多源数字表达谱进行混合模型分析,并结合国际 HapMap 项目分布的基因型,我们鉴定了 45000 多个潜在的 SNP 诱导的基因间调控关系(表达性状与 SNP 基因型之间潜在关联的显著性水平设定为 FDR < 0.01)。我们根据调控 SNP 影响其顺式和反式调节基因的两种不同机制,将鉴定出的关系分为四类(范式),改变 mRNA 水平或改变转录剪接模式。我们进一步将每个类别的关系组织成一组网络模块,以顺式调节基因为枢纽。我们发现,网络模块中的靶基因通常具有显著的功能相似性,并且四个网络中的三个网络模块的靶基因分布大致类似于幂律分布,这是从突变实验中获得的基因网络的典型模式。通过两个案例研究,我们还证明了从鉴定出的网络模块中可以推断出重要的生物学见解。