Department of Twin Research and Genetic Epidemiology, King's College London, St Thomas' Hospital, Westminster Bridge Road, London SE1 7EH, UK.
G3 (Bethesda). 2024 Apr 3;14(4). doi: 10.1093/g3journal/jkae022.
Genetically associated phenotypic variability has been widely observed across organisms and traits, including in humans. Both gene-gene and gene-environment interactions can lead to an increase in genetically associated phenotypic variability. Therefore, detecting the underlying genetic variants, or variance Quantitative Trait Loci (vQTLs), can provide novel insights into complex traits. Established approaches to detect vQTLs apply different methodologies from variance-only approaches to mean-variance joint tests, but a comprehensive comparison of these methods is lacking. Here, we review available methods to detect vQTLs in humans, carry out a simulation study to assess their performance under different biological scenarios of gene-environment interactions, and apply the optimal approaches for vQTL identification to gene expression data. Overall, with a minor allele frequency (MAF) of less than 0.2, the squared residual value linear model (SVLM) and the deviation regression model (DRM) are optimal when the data follow normal and non-normal distributions, respectively. In addition, the Brown-Forsythe (BF) test is one of the optimal methods when the MAF is 0.2 or larger, irrespective of phenotype distribution. Additionally, a larger sample size and more balanced sample distribution in different exposure categories increase the power of BF, SVLM, and DRM. Our results highlight vQTL detection methods that perform optimally under realistic simulation settings and show that their relative performance depends on the phenotype distribution, allele frequency, sample size, and the type of exposure in the interaction model underlying the vQTL.
遗传相关表型变异性在生物和特征中广泛存在,包括人类。基因-基因和基因-环境相互作用都可能导致遗传相关表型变异性增加。因此,检测潜在的遗传变异体,或方差数量性状基因座 (vQTL),可以为复杂特征提供新的见解。检测 vQTL 的既定方法应用了从方差方法到均值-方差联合检验的不同方法,但缺乏对这些方法的全面比较。在这里,我们回顾了在人类中检测 vQTL 的可用方法,进行了一项模拟研究,以评估它们在不同基因-环境相互作用的生物学场景下的性能,并将 vQTL 鉴定的最佳方法应用于基因表达数据。总的来说,当数据遵循正态和非正态分布时,小等位基因频率 (MAF) 小于 0.2 时,平方残差线性模型 (SVLM) 和偏差回归模型 (DRM) 是最优的。此外,当 MAF 为 0.2 或更大时,Brown-Forsythe (BF) 检验是最优方法之一,而与表型分布无关。此外,更大的样本量和不同暴露类别中更平衡的样本分布增加了 BF、SVLM 和 DRM 的功效。我们的结果突出了在现实模拟设置下表现最佳的 vQTL 检测方法,并表明它们的相对性能取决于表型分布、等位基因频率、样本量以及 vQTL 下交互模型中的暴露类型。