School of Mathematics and Statistics, Newcastle University, Newcastle upon Tyne, United Kingdom.
Genet Epidemiol. 2011 Jan;35(1):19-45. doi: 10.1002/gepi.20547.
Many complex genetic effects, including epigenetic effects, may be expected to operate via mechanisms in the inter-uterine environment. A popular design for the investigation of such effects, including effects of parent-of-origin (imprinting), maternal genotype, and maternal-fetal genotype interactions, is to collect DNA from affected offspring and their mothers (case/mother duos) and to compare with an appropriate control sample. An alternative design uses data from cases and both parents (case/parent trios) but does not require controls. In this study, we describe a novel implementation of a multinomial modeling approach that allows the estimation of such genetic effects using either case/mother duos or case/parent trios. We investigate the performance of our approach using computer simulations and explore the sample sizes and data structures required to provide high power for detection of effects and accurate estimation of the relative risks conferred. Through the incorporation of additional assumptions (such as Hardy-Weinberg equilibrium, random mating and known allele frequencies) and/or the incorporation of additional types of control sample (such as unrelated controls, controls and their mothers, or both parents of controls), we show that the (relative risk) parameters of interest are identifiable and well estimated. Nevertheless, parameter interpretation can be complex, as we illustrate by demonstrating the mathematical equivalence between various different parameterizations. Our approach scales up easily to allow the analysis of large-scale genome-wide association data, provided both mothers and affected offspring have been genotyped at all variants of interest.
许多复杂的遗传效应,包括表观遗传效应,可能通过子宫内环境中的机制发挥作用。一种研究此类效应(包括亲本来源效应(印迹)、母体基因型和母体-胎儿基因型相互作用)的流行设计是从受影响的后代及其母亲(病例/母亲对子)中收集 DNA,并与适当的对照样本进行比较。另一种设计使用病例和父母双方的数据(病例/父母三联体),但不需要对照。在这项研究中,我们描述了一种新的多项式建模方法的实现,该方法允许使用病例/母亲对子或病例/父母三联体来估计此类遗传效应。我们使用计算机模拟研究了我们方法的性能,并探讨了检测效应和准确估计所赋予的相对风险所需的样本量和数据结构。通过纳入其他假设(例如哈迪-温伯格平衡、随机交配和已知等位基因频率)和/或纳入其他类型的对照样本(例如无关对照、对照及其母亲、或对照的父母),我们表明感兴趣的(相对风险)参数是可识别且估计良好的。然而,参数解释可能很复杂,正如我们通过证明各种不同参数化之间的数学等价性来说明的那样。我们的方法易于扩展,可用于分析大规模全基因组关联数据,前提是所有感兴趣的变体都已对母亲和受影响的后代进行了基因分型。