Brown Keely E, Kelly John K
Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA.
Genetics. 2022 Jan 4;220(1). doi: 10.1093/genetics/iyab189.
We measured the floral bud transcriptome of 151 fully sequenced lines of Mimulus guttatus from one natural population. Thousands of single nucleotide polymorphisms (SNPs) are implicated as transcription regulators, but there is a striking difference in the allele frequency spectrum of cis-acting and trans-acting mutations. Cis-SNPs have intermediate frequencies (consistent with balancing selection) while trans-SNPs fit a rare-alleles model (consistent with purifying selection). This pattern only becomes clear when transcript variation is normalized on a gene-by-gene basis. If a global normalization is applied, as is typical in RNAseq experiments, asymmetric transcript distributions combined with "rarity disequilibrium" produce a superabundance of false positives for trans-acting SNPs. To explore the cause of purifying selection on trans-acting mutations, we identified gene expression modules as sets of coexpressed genes. The extent to which trans-acting mutations influence modules is a strong predictor of allele frequency. Mutations altering expression of genes with high "connectedness" (those that are highly predictive of the representative module expression value) have the lowest allele frequency. The expression modules can also predict whole-plant traits such as flower size. We find that a substantial portion of the genetic (co)variance among traits can be described as an emergent property of genetic effects on expression modules.
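The contrast between global and gene-by-gene normalization can be illustrated with a minimal sketch. The abstract does not state which transform the authors used, so the rank-based inverse normal transform below is an assumption chosen only for illustration, and the function names and toy values are hypothetical. The point it shows: a single shared transform leaves a skewed gene skewed, so one extreme line can dominate an association test, whereas a per-gene transform gives every gene the same symmetric distribution.

```python
import math
from statistics import NormalDist


def global_log_normalize(counts):
    """Apply one shared transform, log2(x + 1), to every gene (hypothetical baseline)."""
    return [[math.log2(x + 1.0) for x in gene] for gene in counts]


def per_gene_normal_scores(values):
    """Map one gene's expression values to standard normal quantiles by rank.

    Assumption: a rank-based inverse normal transform stands in for
    'gene-by-gene normalization'; the source does not name the method.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of tied values starting at sorted position i.
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2.0 + 1.0  # average 1-based rank across the tie
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    nd = NormalDist()
    # (rank - 0.5) / n keeps every quantile strictly inside (0, 1).
    return [nd.inv_cdf((r - 0.5) / n) for r in ranks]


# Toy data: one gene across five lines, with a single extreme line.
skewed_gene = [1, 2, 2, 3, 400]
global_scale = global_log_normalize([skewed_gene])[0]
gene_scale = per_gene_normal_scores(skewed_gene)
```

On the shared log scale the extreme line still sits far from the other four, while on the per-gene scale it is simply the top quantile, mirrored by the lowest line. If the extreme line also happens to carry a rare allele, the shared scale would let that single observation drive a spurious trans association, which is the kind of artifact the abstract attributes to global normalization.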