Ionita-Laza Iuliana, McQueen Matthew B, Laird Nan M, Lange Christoph
Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA.
Am J Hum Genet. 2007 Sep;81(3):607-14. doi: 10.1086/519748. Epub 2007 Jul 17.
For genomewide association (GWA) studies in family-based designs, we propose a novel two-stage strategy that weighs the association P values with the use of independently estimated weights. The association information contained in the family sample is partitioned into two orthogonal components--namely, the between-family information and the within-family information. The between-family component is used in the first (i.e., screening) stage to obtain a relative ranking of all the markers. The within-family component is used in the second (i.e., testing) stage in the framework of the standard family-based association test, and the resulting P values are weighted using the estimated marker ranking from the screening step. The approach is appealing, in that it ensures that all the markers are tested in the testing step and, at the same time, also uses information from the screening step. Through simulation studies, we show that testing all the markers is more powerful than testing only the most promising ones from the screening step, which was the method suggested by Van Steen et al. A comparison with a population-based approach shows that the approach achieves comparable power. In the presence of a reasonable level of population stratification, our approach is only slightly affected in terms of power and, since it is a family-based method, is completely robust to spurious effects. An application to a 100K scan in the Framingham Heart Study illustrates the practical advantages of our approach. The proposed method is of general applicability; it extends to any setting in which prior, independent ranking of hypotheses is available.
对于基于家系设计的全基因组关联(GWA)研究,我们提出了一种新颖的两阶段策略,该策略使用独立估计的权重对关联P值进行加权。家系样本中包含的关联信息被划分为两个正交成分,即家系间信息和家系内信息。家系间成分用于第一阶段(即筛选阶段),以获得所有标记的相对排名。家系内成分用于第二阶段(即检验阶段),在标准的基于家系的关联检验框架下,使用筛选步骤中估计的标记排名对所得的P值进行加权。该方法很有吸引力,因为它确保了在检验步骤中对所有标记进行检验,同时还利用了筛选步骤中的信息。通过模拟研究,我们表明对所有标记进行检验比仅对筛选步骤中最有希望的标记进行检验更具效力,后者是Van Steen等人建议的方法。与基于群体的方法进行比较表明,我们的方法具有相当的效力。在存在合理程度的群体分层的情况下,我们的方法在效力方面仅受到轻微影响,并且由于它是一种基于家系的方法,对虚假效应完全具有稳健性。在弗雷明汉心脏研究中对100K扫描的应用说明了我们方法的实际优势。所提出的方法具有普遍适用性;它适用于任何可以获得假设的先验独立排名的情况。