Lin Hui-Wen, Chen Yi-Hau
Biostatistics Research and Consulting Center, Taipei Medical University, Taipei, Taiwan, ROC.
Hum Hered. 2010;69(3):160-70. doi: 10.1159/000267996. Epub 2009 Dec 18.
The association analysis based on a population-based case-control study is convenient and powerful, but may be biased under population stratification (PS), namely the study population consists of strata heterogeneous in disease rates and allele frequencies. On the other hand, a family-based (e.g. case-parents) study is robust against the PS bias, but may be less convenient to implement. We propose an association analysis that preserves the full robustness property of the family-based analysis while allowing for borrowing information from a population-based analysis.
A two-stage procedure is proposed. In the first stage, one selects a population-based case-control sample and performs a traditional case-control association analysis. In the second stage, one randomly selects a subset of the first-stage cases and recruits their family controls (e.g. parents), and performs a family-based association analysis. An overall two-stage analysis is then performed to utilize information from the two stages.
The proposed two-stage analysis achieves higher power than the second-stage family-based analysis by utilizing information in the first-stage population study, while maintaining the full robustness of the family study and hence is still valid under PS. The proposal can also accommodate parental missingness when the case-parents study is used as the second-stage family study.
The two-stage analysis facilitates efficient and robust association analysis under PS. Its computation- and cost-effectiveness render it very promising in genome-wide association studies.
基于人群的病例对照研究进行的关联分析方便且效能强大,但在人群分层(PS)情况下可能存在偏倚,即研究人群由疾病发生率和等位基因频率存在异质性的层组成。另一方面,基于家系的(如病例-父母)研究对PS偏倚具有稳健性,但实施起来可能不太方便。我们提出一种关联分析方法,该方法在保留基于家系分析的完全稳健性的同时,还能借鉴基于人群分析的信息。
提出一个两阶段程序。在第一阶段,选取一个基于人群的病例对照样本并进行传统的病例对照关联分析。在第二阶段,从第一阶段的病例中随机选取一个子集,并招募其家系对照(如父母),然后进行基于家系的关联分析。接着进行一个整体的两阶段分析以利用两个阶段的信息。
通过利用第一阶段人群研究中的信息,所提出的两阶段分析比第二阶段基于家系的分析具有更高的效能,同时保持了家系研究的完全稳健性,因此在PS情况下仍然有效。当将病例-父母研究用作第二阶段家系研究时,该方法还能处理父母缺失的情况。
两阶段分析有助于在PS情况下进行高效且稳健的关联分析。其计算和成本效益使其在全基因组关联研究中非常有前景。