Departamento de Lenguajes y Sistemas Informáticos, ETS Ingeniera Informática y de Telecomunicaciones-CITIC, Universidad de Granada, Granada, Spain.
PLoS One. 2012;7(2):e29613. doi: 10.1371/journal.pone.0029613. Epub 2012 Feb 17.
Multimarker Transmission/Disequilibrium Tests (TDTs) are very robust association tests to population admixture and structure which may be used to identify susceptibility loci in genome-wide association studies. Multimarker TDTs using several markers may increase power by capturing high-degree associations. However, there is also a risk of spurious associations and power reduction due to the increase in degrees of freedom. In this study we show that associations found by tests built on simple null hypotheses are highly reproducible in a second independent data set regardless the number of markers. As a test exhibiting this feature to its maximum, we introduce the multimarker 2-Groups TDT (mTDT(2G)), a test which under the hypothesis of no linkage, asymptotically follows a χ2 distribution with 1 degree of freedom regardless the number of markers. The statistic requires the division of parental haplotypes into two groups: disease susceptibility and disease protective haplotype groups. We assessed the test behavior by performing an extensive simulation study as well as a real-data study using several data sets of two complex diseases. We show that mTDT(2G) test is highly efficient and it achieves the highest power among all the tests used, even when the null hypothesis is tested in a second independent data set. Therefore, mTDT(2G) turns out to be a very promising multimarker TDT to perform genome-wide searches for disease susceptibility loci that may be used as a preprocessing step in the construction of more accurate genetic models to predict individual susceptibility to complex diseases.
多标记传递/不平衡测试(TDTs)是一种非常稳健的群体混合和结构关联测试,可用于识别全基因组关联研究中的易感基因座。使用多个标记的多标记 TDT 可以通过捕获高度关联来提高功效。然而,由于自由度的增加,也存在虚假关联和功效降低的风险。在这项研究中,我们表明,基于简单零假设构建的测试所发现的关联在第二个独立数据集是高度可重复的,无论标记数量多少。作为一个表现出这种特征的测试,我们引入了多标记 2 组 TDT(mTDT(2G)),该测试在无连锁假设下,无论标记数量多少,渐近遵循自由度为 1 的 χ2 分布。该统计量需要将亲本单倍型分为两组:疾病易感性和疾病保护性单倍型组。我们通过进行广泛的模拟研究以及使用两个复杂疾病的多个数据集进行真实数据研究来评估测试行为。我们表明,mTDT(2G)测试具有很高的效率,并且在所有使用的测试中达到最高的功效,即使在第二个独立数据集上测试零假设也是如此。因此,mTDT(2G) 是一种非常有前途的多标记 TDT,可用于全基因组搜索疾病易感基因座,可作为构建更准确遗传模型以预测个体对复杂疾病易感性的预处理步骤。