• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在大型基因分型人群中,使用经过验证和新兴的算法进行单步 GWAS 的标记效应 p 值。

Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations.

机构信息

1Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.

出版信息

Genet Sel Evol. 2024 Aug 22;56(1):59. doi: 10.1186/s12711-024-00925-3.

DOI:10.1186/s12711-024-00925-3
PMID:39174924
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11340074/
Abstract

BACKGROUND

Single-nucleotide polymorphism (SNP) effects can be backsolved from ssGBLUP genomic estimated breeding values (GEBV) and used for genome-wide association studies (ssGWAS). However, obtaining p-values for those SNP effects relies on the inversion of dense matrices, which poses computational limitations in large genotyped populations. In this study, we present a method to approximate SNP p-values for ssGWAS with many genotyped animals. This method relies on the combination of a sparse approximation of the inverse of the genomic relationship matrix ( ) built with the algorithm for proven and young ( ) and an approximation of the prediction error variance of SNP effects which does not require the inversion of the left-hand side (LHS) of the mixed model equations. To test the proposed p-value computing method, we used a reduced genotyped population of 50K genotyped animals and compared the approximated SNP p-values with benchmark p-values obtained with the direct inverse of LHS built with an exact genomic relationship matrix ( . Then, we applied the proposed approximation method to obtain SNP p-values for a larger genotyped population composed of 450K genotyped animals.

RESULTS

The same genomic regions on chromosomes 7 and 20 were identified across all p-value computing methods when using 50K genotyped animals. In terms of computational requirements, obtaining p-values with the proposed approximation reduced the wall-clock time by 38 times and the memory requirement by ten times compared to using the exact inversion of the LHS. When the approximation was applied to a population of 450K genotyped animals, two new significant regions on chromosomes 6 and 14 were uncovered, indicating an increase in GWAS detection power when including more genotypes in the analyses. The process of obtaining p-values with the approximation and 450K genotyped individuals took 24.5 wall-clock hours and 87.66GB of memory, which is expected to increase linearly with the addition of noncore genotyped individuals.

CONCLUSIONS

With the proposed method, obtaining p-values for SNP effects in ssGWAS is computationally feasible in large genotyped populations. The computational cost of obtaining p-values in ssGWAS may no longer be a limitation in extensive populations with many genotyped animals.

摘要

背景

单核苷酸多态性(SNP)效应可以从 ssGBLUP 基因组估计育种值(GEBV)中反推出来,并用于全基因组关联研究(ssGWAS)。然而,获得那些 SNP 效应的 p 值依赖于密集矩阵的逆运算,这在大型基因分型群体中存在计算限制。在这项研究中,我们提出了一种在有大量基因分型动物的情况下,对 ssGWAS 的 SNP 效应进行近似 p 值计算的方法。该方法依赖于利用算法为已证实和年轻的( )构建的基因组关系矩阵( )的稀疏近似以及 SNP 效应预测误差方差的近似,而不需要混合模型方程的左(LHS)的逆运算。为了测试所提出的 p 值计算方法,我们使用了一个经过缩减的基因分型群体的 50K 个基因分型动物,并将近似 SNP p 值与使用具有精确基因组关系矩阵( )的 LHS 直接逆运算获得的基准 p 值进行了比较。然后,我们应用所提出的近似方法,对由 450K 个基因分型动物组成的更大基因分型群体进行 SNP p 值计算。

结果

当使用 50K 个基因分型动物时,所有 p 值计算方法都在染色体 7 和 20 上鉴定出相同的基因组区域。在计算要求方面,与使用 LHS 的精确逆运算相比,使用所提出的近似方法获得 p 值时,计算时间减少了 38 倍,内存需求减少了 10 倍。当将近似方法应用于 450K 个基因分型动物的群体时,在染色体 6 和 14 上发现了两个新的显著区域,表明在分析中包含更多的基因型时,GWAS 的检测能力有所提高。使用近似方法和 450K 个基因分型个体获得 p 值的过程需要 24.5 个时钟小时和 87.66GB 的内存,预计随着非核心基因分型个体的增加,计算时间和内存需求将呈线性增加。

结论

在所提出的方法中,在大型基因分型群体中,对 ssGWAS 的 SNP 效应进行 p 值计算在计算上是可行的。在具有大量基因分型动物的广泛群体中,获得 ssGWAS 中 p 值的计算成本可能不再是一个限制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/49160547e272/12711_2024_925_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/b3d17fdf81eb/12711_2024_925_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/4f0f5cb2684d/12711_2024_925_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/019131ba5364/12711_2024_925_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/49160547e272/12711_2024_925_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/b3d17fdf81eb/12711_2024_925_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/4f0f5cb2684d/12711_2024_925_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/019131ba5364/12711_2024_925_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ada/11340074/49160547e272/12711_2024_925_Fig4_HTML.jpg

相似文献

1
Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations.在大型基因分型人群中,使用经过验证和新兴的算法进行单步 GWAS 的标记效应 p 值。
Genet Sel Evol. 2024 Aug 22;56(1):59. doi: 10.1186/s12711-024-00925-3.
2
Efficient approximation of reliabilities for single-step genomic best linear unbiased predictor models with the Algorithm for Proven and Young.利用 Proven 和 Young 算法对单步基因组最佳线性无偏预测模型进行可靠性的有效逼近。
J Anim Sci. 2022 Jan 1;100(1). doi: 10.1093/jas/skab353.
3
Comparing algorithms to approximate accuracies for single-step genomic best linear unbiased predictor.比较算法以近似单步基因组最佳线性无偏预测器的准确性。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae195.
4
Theoretical accuracy for indirect predictions based on SNP effects from single-step GBLUP.基于单步 GBLUP 的 SNP 效应进行间接预测的理论准确性。
Genet Sel Evol. 2022 Sep 27;54(1):66. doi: 10.1186/s12711-022-00752-4.
5
Is single-step genomic REML with the algorithm for proven and young more computationally efficient when less generations of data are present?当数据的世代数较少时,采用具有成熟和年轻算法的一步法基因组 REML 是否更具计算效率?
J Anim Sci. 2022 May 1;100(5). doi: 10.1093/jas/skac082.
6
Indirect predictions with a large number of genotyped animals using the algorithm for proven and young.使用经过验证和年轻的算法对大量基因分型动物进行间接预测。
J Anim Sci. 2020 Jun 1;98(6). doi: 10.1093/jas/skaa154.
7
Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates.高效的大规模单步评估和间接基因组预测的基因分型选择候选者。
Genet Sel Evol. 2023 Jun 8;55(1):37. doi: 10.1186/s12711-023-00808-z.
8
Inexpensive Computation of the Inverse of the Genomic Relationship Matrix in Populations with Small Effective Population Size.有效种群规模较小的群体中基因组关系矩阵逆矩阵的低成本计算
Genetics. 2016 Feb;202(2):401-9. doi: 10.1534/genetics.115.182089. Epub 2015 Nov 19.
9
Leveraging low-density crossbred genotypes to offset crossbred phenotypes and their impact on purebred predictions.利用低密度杂交基因型来抵消杂交表型及其对纯种预测的影响。
J Anim Sci. 2022 Dec 1;100(12). doi: 10.1093/jas/skac359.
10
Development of genomic predictions for Angus cattle in Brazil incorporating genotypes from related American sires.发展巴西安格斯牛的基因组预测,纳入相关美国父本的基因型。
J Anim Sci. 2022 Feb 1;100(2). doi: 10.1093/jas/skac009.

引用本文的文献

1
Candidate Genes, Markers, Signatures of Selection, and Quantitative Trait Loci (QTLs) and Their Association with Economic Traits in Livestock: Genomic Insights and Selection.候选基因、标记、选择特征、数量性状位点(QTLs)及其与家畜经济性状的关联:基因组见解与选择
Int J Mol Sci. 2025 Aug 8;26(16):7688. doi: 10.3390/ijms26167688.
2
Applying the algorithm for Proven and young in GWAS Reveals high polygenicity for key traits in Nellore cattle.在基因组全基因关联研究中应用针对已证实和年轻个体的算法,揭示了内洛尔牛关键性状的高多基因性。
Front Genet. 2025 Apr 30;16:1549284. doi: 10.3389/fgene.2025.1549284. eCollection 2025.
3
Genetic inference and single cell expression analysis of potential targets in heart failure and breast cancer.

本文引用的文献

1
Single nucleotide polymorphism profile for quantitative trait nucleotide in populations with small effective size and its impact on mapping and genomic predictions.有效群体规模较小的数量性状核苷酸的单核苷酸多态性图谱及其对作图和基因组预测的影响。
Genetics. 2024 Aug 7;227(4). doi: 10.1093/genetics/iyae103.
2
Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study.基因组信息的维度及其对全基因组关联和基因组预测中变异选择的影响:一项模拟研究。
Genet Sel Evol. 2023 Jul 17;55(1):49. doi: 10.1186/s12711-023-00823-0.
3
Invited review: Reliability computation from the animal model era to the single-step genomic model era.
心力衰竭和乳腺癌中潜在靶点的遗传推断和单细胞表达分析。
J Cancer Res Clin Oncol. 2024 Oct 26;150(10):479. doi: 10.1007/s00432-024-06010-y.
特邀综述:从动物模型时代到单步基因组模型时代的可靠性计算
J Dairy Sci. 2023 Mar;106(3):1518-1532. doi: 10.3168/jds.2022-22629. Epub 2022 Dec 23.
4
On the equivalence between marker effect models and breeding value models and direct genomic values with the Algorithm for Proven and Young.基于 Proven and Young 算法的标记效应模型与育种值模型和直接基因组值的等效性
Genet Sel Evol. 2022 Jul 16;54(1):52. doi: 10.1186/s12711-022-00741-7.
5
A comprehensive study on size and definition of the core group in the proven and young algorithm for single-step GBLUP.针对一步法 GBLUP 中已证明和年轻算法的核心群体的大小和定义进行全面研究。
Genet Sel Evol. 2022 May 20;54(1):34. doi: 10.1186/s12711-022-00726-6.
6
Is single-step genomic REML with the algorithm for proven and young more computationally efficient when less generations of data are present?当数据的世代数较少时,采用具有成熟和年轻算法的一步法基因组 REML 是否更具计算效率?
J Anim Sci. 2022 May 1;100(5). doi: 10.1093/jas/skac082.
7
Multibreed genomic evaluation for production traits of dairy cattle in the United States using single-step genomic best linear unbiased predictor.利用单步基因组最佳线性无偏预测器对美国奶牛生产性状进行多品种基因组评估。
J Dairy Sci. 2022 Jun;105(6):5141-5152. doi: 10.3168/jds.2021-21505. Epub 2022 Mar 10.
8
Efficient approximation of reliabilities for single-step genomic best linear unbiased predictor models with the Algorithm for Proven and Young.利用 Proven 和 Young 算法对单步基因组最佳线性无偏预测模型进行可靠性的有效逼近。
J Anim Sci. 2022 Jan 1;100(1). doi: 10.1093/jas/skab353.
9
Accounting for Population Structure and Phenotypes From Relatives in Association Mapping for Farm Animals: A Simulation Study.家畜关联分析中考虑亲缘关系的群体结构和表型:一项模拟研究
Front Genet. 2021 Apr 29;12:642065. doi: 10.3389/fgene.2021.642065. eCollection 2021.
10
Changes in genomic predictions when new information is added.添加新信息时基因组预测的变化。
J Anim Sci. 2021 Feb 1;99(2). doi: 10.1093/jas/skab004.