Sverdlov Serge, Thompson Elizabeth
Department of Statistics, University of Washington , Seattle, Washington.
J Comput Biol. 2017 Apr;24(4):267-279. doi: 10.1089/cmb.2016.0112. Epub 2016 Nov 21.
We develop computational tools for the analysis of nonlinear genotype-phenotype relationships with epistasis among multiple loci or dominance interactions among multiple alleles within the same locus. Theory distinguishes between separable traits, with removable epistasis, and traits with essential epistasis. Separable traits can be transformed to a natural scale where additive methods apply. The methods we present solve for the natural scale, exactly when possible and approximately when not. Through graph methods, our methods allow for enumeration, counting, or sampling of distinct trait architectures satisfying constraints from the separability theory. A tool is provided for diagnosing which separability constraints are violated by a given nonseparable architecture. For genetic traits controlled by limited numbers of loci and alleles, our algorithm enumerates all possible trait structures and finds exact or error-minimizing linearizing transformations by formulating a constrained optimization program. We find that the fraction of possible distinct genetic traits satisfying simple criteria that can be fully or approximately linearized is high for small systems and falls as the number of alleles or loci increases.
我们开发了计算工具,用于分析多基因座间存在上位性或同一位点内多等位基因间存在显性相互作用时的非线性基因型-表型关系。理论上区分了可分离性状(具有可消除的上位性)和具有本质上位性的性状。可分离性状可以转换到加法方法适用的自然尺度。我们提出的方法在可能时精确求解自然尺度,不可能时近似求解。通过图方法,我们的方法允许对满足可分离性理论约束的不同性状结构进行枚举、计数或抽样。提供了一种工具,用于诊断给定的不可分离结构违反了哪些可分离性约束。对于由有限数量的基因座和等位基因控制的遗传性状,我们的算法通过制定约束优化程序来枚举所有可能的性状结构,并找到精确的或使误差最小化的线性化变换。我们发现,对于小系统,满足可完全或近似线性化的简单标准的可能不同遗传性状的比例很高,并且随着等位基因或基因座数量的增加而下降。