关于在方差异质时进行数量性状遗传关联分析的排列检验的说明。

A note on permutation tests for genetic association analysis of quantitative traits when variances are heterogeneous.

机构信息

Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands.

出版信息

Genet Epidemiol. 2009 Dec;33(8):710-6. doi: 10.1002/gepi.20423.

DOI:10.1002/gepi.20423

PMID:19365861

Abstract

The genetic dissection of quantitative traits, or endophenotypes, usually involves genetic linkage or association analysis in pedigrees and subsequent fine mapping association analysis in the population. The ascertainment procedure for quantitative traits often results in unequal variance of observations. For example, some phenotypes may be clinically measured whilst others are from self-reports, or phenotypes may be the average of multiple measures but with the number of measurements varying. The resulting heterogeneity of variance poses no real problem for analysis, as long as it is properly modelled and thereby taken into account. However, if statistical significance is determined using an empirical permutation procedure, it is not obvious what the units of sampling are. We investigated a number of permutation approaches in a simulation study of an association analysis between a quantitative trait and a single nucleotide polymorphism. Our simulations were designed such that we knew the true p-value of the test statistics. A number of permutation methods were compared from the regression of true on empirical p-values and the precision of the empirical p-values. We show that the best procedure involves an implicit adjustment of the original data for the effects in the model before permutation, and that other methods, some of which seemed appropriate a priori, are relatively biased.

摘要

定量性状（或内表型）的遗传剖析通常涉及家系中的遗传连锁或关联分析，以及随后在人群中的精细映射关联分析。定量性状的确定程序通常会导致观察值的方差不均等。例如，一些表型可能是临床测量的，而另一些则是自我报告的，或者表型可能是多个测量值的平均值，但测量值的数量不同。只要正确建模并考虑到这种方差异质性，它就不会对分析造成真正的问题。然而，如果使用经验性置换程序确定统计显著性，那么采样的单位就不清楚了。我们在一项关于定量性状和单个核苷酸多态性之间关联分析的模拟研究中研究了几种置换方法。我们的模拟设计使得我们知道了测试统计量的真实 p 值。我们比较了从真实 p 值到经验 p 值的回归和经验 p 值的精度的几种置换方法。我们表明，最好的方法是在置换之前对原始数据进行模型中效应的隐式调整，而其他一些方法，尽管在事先看来是合适的，但相对有偏差。