Lou X-Y
Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL, USA.
Heredity (Edinb). 2015 Mar;114(3):255-61. doi: 10.1038/hdy.2014.94. Epub 2014 Oct 22.
Biological outcomes are governed by multiple genetic and environmental factors that act in concert. Determining multifactor interactions is the primary topic of interest in recent genetics studies but presents enormous statistical and mathematical challenges. The computationally efficient multifactor dimensionality reduction (MDR) approach has emerged as a promising tool for meeting these challenges. On the other hand, complex traits are expressed in various forms and have different data generation mechanisms that cannot be appropriately modeled by a dichotomous model; the subjects in a study may be recruited according to its own analytical goals, research strategies and resources available, not only consisting of homogeneous unrelated individuals. Although several modifications and extensions of MDR have in part addressed the practical problems, they are still limited in statistical analyses of diverse phenotypes, multivariate phenotypes and correlated observations, correcting for potential population stratification and unifying both unrelated and family samples into a more powerful analysis. I propose a comprehensive statistical framework, referred as to unified generalized MDR (UGMDR), for systematic extension of MDR. The proposed approach is quite versatile, not only allowing for covariate adjustment, being suitable for analyzing almost any trait type, for example, binary, count, continuous, polytomous, ordinal, time-to-onset, multivariate and others, as well as combinations of those, but also being applicable to various study designs, including homogeneous and admixed unrelated-subject and family as well as mixtures of them. The proposed UGMDR offers an important addition to the arsenal of analytical tools for identifying nonlinear multifactor interactions and unraveling the genetic architecture of complex traits.
生物学结果由多种协同作用的遗传和环境因素决定。确定多因素相互作用是近期遗传学研究的主要关注主题,但面临巨大的统计和数学挑战。计算效率高的多因素降维(MDR)方法已成为应对这些挑战的一种有前途的工具。另一方面,复杂性状以多种形式表现,具有不同的数据生成机制,无法用二分模型进行适当建模;研究中的受试者可能根据其自身的分析目标、研究策略和可用资源进行招募,不仅包括同质的无关个体。尽管MDR的一些修改和扩展在一定程度上解决了实际问题,但它们在对多样的表型、多变量表型和相关观察进行统计分析、校正潜在的群体分层以及将无关样本和家系样本统一到更强大的分析中仍存在局限性。我提出了一个全面的统计框架,称为统一广义MDR(UGMDR),用于MDR的系统扩展。所提出的方法非常通用,不仅允许进行协变量调整,适用于分析几乎任何性状类型,例如二元、计数、连续、多分类、有序、发病时间、多变量等,以及它们的组合,而且还适用于各种研究设计,包括同质和混合的无关个体、家系以及它们的混合。所提出的UGMDR为识别非线性多因素相互作用和揭示复杂性状的遗传结构的分析工具库增添了重要内容。