Wu Baolin, Pankow James S
Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA.
Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, Minneapolis, MN, USA.
Comput Math Methods Med. 2018 Mar 18;2018:2564531. doi: 10.1155/2018/2564531. eCollection 2018.
Multiple correlated traits are often collected in genetic studies. By jointly analyzing multiple traits, we can increase power by aggregating multiple weak effects and reveal additional insights into the genetic architecture of complex human diseases. In this article, we propose a multivariate linear regression-based method to test the joint association of multiple quantitative traits. It is flexible to accommodate any covariates, has very accurate control of type I errors, and offers very competitive performance. We also discuss fast and accurate significance value computation especially for genome-wide association studies with small-to-medium sample sizes. We demonstrate through extensive numerical studies that the proposed method has competitive performance. Its usefulness is further illustrated with application to genome-wide association analysis of diabetes-related traits in the Atherosclerosis Risk in Communities (ARIC) study. We found some very interesting associations with diabetes traits which have not been reported before. We implemented the proposed methods in a publicly available R package.
在基因研究中,常常会收集多个相关性状。通过联合分析多个性状,我们可以通过汇总多个微弱效应来提高检验效能,并揭示复杂人类疾病遗传结构的更多见解。在本文中,我们提出了一种基于多变量线性回归的方法来检验多个数量性状的联合关联性。它能够灵活地纳入任何协变量,对I型错误有非常精确的控制,并且具有极具竞争力的性能。我们还讨论了快速且准确的显著性值计算,特别是对于中小样本量的全基因组关联研究。我们通过广泛的数值研究证明,所提出的方法具有竞争力。通过将其应用于社区动脉粥样硬化风险(ARIC)研究中与糖尿病相关性状的全基因组关联分析,进一步说明了其有用性。我们发现了一些之前未报道过的与糖尿病性状非常有趣的关联。我们在一个公开可用的R包中实现了所提出的方法。