Ajasa Afees A, Gjøen Hans M, Boison Solomon A, Lillehammer Marie
Department of Breeding and Genetics, Nofima (Norwegian Institute of Food, Fisheries and Aquaculture Research), P. O. Box 210, N-1431, Ås, Norway.
Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, 5003 NMBU, N-1432, Ås, Norway.
Genet Sel Evol. 2025 Feb 27;57(1):9. doi: 10.1186/s12711-025-00959-1.
In a previous study, we found low persistence of linkage disequilibrium (LD) phase across breeding populations of Atlantic salmon. Accordingly, we observed no increase in accuracy from combining these populations for genomic prediction. In this study, we aimed to examine if the same were true for detection power in genome-wide association studies (GWAS), in terms of reduction in p-values, and if the precision of mapping quantitative trait loci (QTL) would improve from such analysis. Since individual records may not always be available, e.g. due to proprietorship or confidentiality, we also compared mega-analysis and meta-analysis. Mega-analysis needs access to all individual records, whereas meta-analysis utilizes parameters, such as p-values or allele substitution effects, from multiple studies or populations. Furthermore, different methods for determining the presence or absence of independent or secondary signals, such as conditional association analysis, approximate conditional and joint analysis (COJO), and the clumping approach, were assessed.
Mega-analysis resulted in increased detection power, in terms of reduction in p-values, and increased precision, compared to the within-population GWAS. Only one QTL was detected using conditional association analysis, both within populations and in mega-analysis, while the number of QTL detected with COJO and the clumping approach ranged from 1 to 19. The allele substitution effect and -logp-values obtained from mega-analysis were highly correlated with the corresponding values from various meta-analysis methods. Compared to mega-analysis, a higher detection power and reduced precision were obtained with the meta-analysis methods.
Our results show that combining multiple datasets or populations in a mega-analysis can increase detection power and mapping precision. With meta-analysis, a higher detection power was obtained compared to mega-analysis. However, care must be taken in the interpretation of the meta-analysis results from multiple populations because their test statistics might be inflated due to population structure or cryptic relatedness.
在之前的一项研究中,我们发现大西洋鲑鱼繁殖群体间的连锁不平衡(LD)阶段持续性较低。因此,我们观察到将这些群体合并用于基因组预测时准确性并未提高。在本研究中,我们旨在检验在全基因组关联研究(GWAS)中检测能力是否也是如此,即p值是否降低,以及这种分析是否会提高定位数量性状基因座(QTL)的精度。由于个体记录可能并非总是可用,例如由于所有权或保密性原因,我们还比较了汇总分析和荟萃分析。汇总分析需要获取所有个体记录,而荟萃分析则利用来自多个研究或群体的参数,如p值或等位基因替代效应。此外,还评估了用于确定独立或次要信号存在与否的不同方法,如条件关联分析、近似条件和联合分析(COJO)以及聚类方法。
与群体内GWAS相比,汇总分析在降低p值方面提高了检测能力,并提高了精度。使用条件关联分析,无论是在群体内还是在汇总分析中,仅检测到一个QTL,而使用COJO和聚类方法检测到的QTL数量在1到19个之间。汇总分析获得的等位基因替代效应和-logp值与各种荟萃分析方法的相应值高度相关。与汇总分析相比,荟萃分析方法获得了更高的检测能力,但精度降低。
我们的结果表明,在汇总分析中合并多个数据集或群体可以提高检测能力和定位精度。与汇总分析相比,荟萃分析获得了更高的检测能力。然而,在解释来自多个群体的荟萃分析结果时必须谨慎,因为由于群体结构或潜在相关性,其检验统计量可能会膨胀。