Yan Qi, Weeks Daniel E, Tiwari Hemant K, Yi Nengjun, Zhang Kui, Gao Guimin, Lin Wan-Yu, Lou Xiang-Yang, Chen Wei, Liu Nianjun
Division of Pulmonary Medicine, Allergy and Immunology, Department of Pediatrics, Children's Hospital of Pittsburgh of UPMC, Pittsburgh, Pa., USA.
Hum Hered. 2015;80(3):126-38. doi: 10.1159/000445057. Epub 2016 Apr 29.
The kernel machine (KM) test reportedly performs well in the set-based association test of rare variants. Many studies have been conducted to measure phenotypes at multiple time points, but the standard KM methodology has only been available for phenotypes at a single time point. In addition, family-based designs have been widely used in genetic association studies; therefore, the data analysis method used must appropriately handle familial relatedness. A rare-variant test does not currently exist for longitudinal data from family samples. Therefore, in this paper, we aim to introduce an association test for rare variants, which includes multiple longitudinal phenotype measurements for either population or family samples.
This approach uses KM regression based on the linear mixed model framework and is applicable to longitudinal data from either population (L-KM) or family samples (LF-KM).
In our population-based simulation studies, L-KM has good control of Type I error rate and increased power in all the scenarios we considered compared with other competing methods. Conversely, in the family-based simulation studies, we found an inflated Type I error rate when L-KM was applied directly to the family samples, whereas LF-KM retained the desired Type I error rate and had the best power performance overall. Finally, we illustrate the utility of our proposed LF-KM approach by analyzing data from an association study between rare variants and blood pressure from the Genetic Analysis Workshop 18 (GAW18).
We propose a method for rare-variant association testing in population and family samples using phenotypes measured at multiple time points for each subject. The proposed method has the best power performance compared to competing approaches in our simulation study.
据报道,核机器(KM)检验在罕见变异的基于集合的关联检验中表现良好。已经开展了许多研究来在多个时间点测量表型,但标准的KM方法仅适用于单个时间点的表型。此外,基于家系的设计已广泛应用于基因关联研究;因此,所使用的数据分析方法必须适当地处理家族相关性。目前不存在针对家系样本纵向数据的罕见变异检验。因此,在本文中,我们旨在引入一种针对罕见变异的关联检验,该检验包括针对人群或家系样本的多个纵向表型测量。
该方法使用基于线性混合模型框架的KM回归,适用于来自人群(L-KM)或家系样本(LF-KM)的纵向数据。
在我们基于人群的模拟研究中,与其他竞争方法相比,L-KM在我们考虑的所有情景中对I型错误率有良好的控制且功效增加。相反,在基于家系的模拟研究中,我们发现将L-KM直接应用于家系样本时I型错误率膨胀,而LF-KM保持了所需的I型错误率且总体上具有最佳的功效表现。最后,我们通过分析来自遗传分析研讨会18(GAW18)的罕见变异与血压关联研究的数据来说明我们提出的LF-KM方法的效用。
我们提出了一种在人群和家系样本中使用每个受试者在多个时间点测量的表型进行罕见变异关联检验的方法。在我们的模拟研究中,与竞争方法相比,所提出的方法具有最佳的功效表现。