• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BLUPmrMLM:全基因组关联研究中的一种快速 mrMLM 算法。

BLUPmrMLM: A Fast mrMLM Algorithm in Genome-wide Association Studies.

机构信息

College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China.

出版信息

Genomics Proteomics Bioinformatics. 2024 Sep 13;22(3). doi: 10.1093/gpbjnl/qzae020.

DOI:10.1093/gpbjnl/qzae020
PMID:39348630
Abstract

Multilocus genome-wide association study has become the state-of-the-art tool for dissecting the genetic architecture of complex and multiomic traits. However, most existing multilocus methods require relatively long computational time when analyzing large datasets. To address this issue, in this study, we proposed a fast mrMLM method, namely, best linear unbiased prediction multilocus random-SNP-effect mixed linear model (BLUPmrMLM). First, genome-wide single-marker scanning in mrMLM was replaced by vectorized Wald tests based on the best linear unbiased prediction (BLUP) values of marker effects and their variances in BLUPmrMLM. Then, adaptive best subset selection (ABESS) was used to identify potentially associated markers on each chromosome to reduce computational time when estimating marker effects via empirical Bayes. Finally, shared memory and parallel computing schemes were used to reduce the computational time. In simulation studies, BLUPmrMLM outperformed GEMMA, EMMAX, mrMLM, and FarmCPU as well as the control method (BLUPmrMLM with ABESS removed), in terms of computational time, power, accuracy for estimating quantitative trait nucleotide positions and effects, false positive rate, false discovery rate, false negative rate, and F1 score. In the reanalysis of two large rice datasets, BLUPmrMLM significantly reduced the computational time and identified more previously reported genes, compared with the aforementioned methods. This study provides an excellent multilocus model method for the analysis of large-scale and multiomic datasets. The software mrMLM v5.1 is available at BioCode (https://ngdc.cncb.ac.cn/biocode/tool/BT007388) or GitHub (https://github.com/YuanmingZhang65/mrMLM).

摘要

多基因全基因组关联研究已成为剖析复杂多组学性状遗传结构的最新工具。然而,大多数现有的多基因方法在分析大型数据集时需要相对较长的计算时间。针对这一问题,本研究提出了一种快速 mrMLM 方法,即最佳线性无偏预测多基因随机-SNP 效应混合线性模型 (BLUPmrMLM)。首先,BLUPmrMLM 中的全基因组单标记扫描被基于标记效应的最佳线性无偏预测 (BLUP) 值和 BLUPmrMLM 中标记效应方差的矢量化 Wald 检验所取代。然后,自适应最佳子集选择 (ABESS) 用于识别每条染色体上可能相关的标记,以减少通过经验 Bayes 估计标记效应的计算时间。最后,使用共享内存和并行计算方案来减少计算时间。在模拟研究中,BLUPmrMLM 在计算时间、功效、估计数量性状核苷酸位置和效应的准确性、假阳性率、假发现率、假阴性率和 F1 分数方面均优于 GEMMA、EMMAX、mrMLM 和 FarmCPU 以及对照方法(去除 ABESS 的 BLUPmrMLM)。在对两个大型水稻数据集的重新分析中,与上述方法相比,BLUPmrMLM 显著减少了计算时间并鉴定出了更多先前报道的基因。本研究为分析大规模多组学数据集提供了一种出色的多基因模型方法。mrMLM v5.1 软件可在 BioCode(https://ngdc.cncb.ac.cn/biocode/tool/BT007388)或 GitHub(https://github.com/YuanmingZhang65/mrMLM)上获取。

相似文献

1
BLUPmrMLM: A Fast mrMLM Algorithm in Genome-wide Association Studies.BLUPmrMLM:全基因组关联研究中的一种快速 mrMLM 算法。
Genomics Proteomics Bioinformatics. 2024 Sep 13;22(3). doi: 10.1093/gpbjnl/qzae020.
2
mrMLM v4.0.2: An R Platform for Multi-locus Genome-wide Association Studies.mrMLM v4.0.2:一个用于多基因座全基因组关联研究的 R 平台。
Genomics Proteomics Bioinformatics. 2020 Aug;18(4):481-487. doi: 10.1016/j.gpb.2020.06.006. Epub 2020 Dec 18.
3
pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies.pLARmEB:用于多位点全基因组关联研究的最小角回归与经验贝叶斯方法的整合
Heredity (Edinb). 2017 Jun;118(6):517-524. doi: 10.1038/hdy.2017.8. Epub 2017 Mar 15.
4
Iterative sure independence screening EM-Bayesian LASSO algorithm for multi-locus genome-wide association studies.用于多位点全基因组关联研究的迭代确定独立筛选EM-贝叶斯套索算法
PLoS Comput Biol. 2017 Jan 31;13(1):e1005357. doi: 10.1371/journal.pcbi.1005357. eCollection 2017 Jan.
5
Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology.通过多基因座混合线性模型方法提高全基因组关联研究的效能和准确性。
Sci Rep. 2016 Jan 20;6:19444. doi: 10.1038/srep19444.
6
An Efficient Score Test Integrated with Empirical Bayes for Genome-Wide Association Studies.一种用于全基因组关联研究的与经验贝叶斯相结合的高效计分检验。
Front Genet. 2021 Oct 1;12:742752. doi: 10.3389/fgene.2021.742752. eCollection 2021.
7
Phenotypic variation and genome-wide association studies of main culm panicle node number, maximum node production rate, and degree-days to heading in rice.水稻主茎穗颈节数、最大穗颈节间伸长速率和生育天数的表型变异及全基因组关联研究。
BMC Genomics. 2022 May 23;23(1):390. doi: 10.1186/s12864-022-08629-y.
8
Improving the accuracy of whole genome prediction for complex traits using the results of genome wide association studies.利用全基因组关联研究结果提高复杂性状全基因组预测的准确性。
PLoS One. 2014 Mar 24;9(3):e93017. doi: 10.1371/journal.pone.0093017. eCollection 2014.
9
Genome-wide barebones regression scan for mixed-model association analysis.全基因组简化回归扫描混合模型关联分析。
Theor Appl Genet. 2020 Jan;133(1):51-58. doi: 10.1007/s00122-019-03439-5. Epub 2019 Sep 24.
10
Genomic selection and association mapping in rice (Oryza sativa): effect of trait genetic architecture, training population composition, marker number and statistical model on accuracy of rice genomic selection in elite, tropical rice breeding lines.水稻(Oryza sativa)的基因组选择与关联图谱分析:性状遗传结构、训练群体组成、标记数量及统计模型对优质热带水稻育种系基因组选择准确性的影响
PLoS Genet. 2015 Feb 17;11(2):e1004982. doi: 10.1371/journal.pgen.1004982. eCollection 2015 Feb.

引用本文的文献

1
Fast3VmrMLM: A fast algorithm that integrates genome-wide scanning with machine learning to accelerate gene mining and breeding by design for polygenic traits in large-scale GWAS datasets.Fast3VmrMLM:一种快速算法,它将全基因组扫描与机器学习相结合,以加速大规模全基因组关联研究(GWAS)数据集中多基因性状的基因挖掘和设计育种。
Plant Commun. 2025 Jul 14;6(7):101385. doi: 10.1016/j.xplc.2025.101385. Epub 2025 May 22.
2
Phenotypic plasticity of flowering time and plant height related traits in wheat.小麦开花时间和株高相关性状的表型可塑性
BMC Plant Biol. 2025 May 14;25(1):636. doi: 10.1186/s12870-025-06489-8.
3
The prognostic significance of stress hyperglycemia ratio in evaluating all-cause and cardiovascular mortality risk among individuals across stages 0-3 of cardiovascular-kidney-metabolic syndrome: evidence from two cohort studies.

本文引用的文献

1
RiceENCODE: A comprehensive epigenomic database as a rice Encyclopedia of DNA Elements.水稻ENCODE:一个作为水稻DNA元件百科全书的综合表观基因组数据库。
Mol Plant. 2021 Oct 4;14(10):1604-1606. doi: 10.1016/j.molp.2021.08.018. Epub 2021 Aug 27.
2
Computationally efficient whole-genome regression for quantitative and binary traits.计算效率高的全基因组回归分析用于定量和二项性状。
Nat Genet. 2021 Jul;53(7):1097-1103. doi: 10.1038/s41588-021-00870-7. Epub 2021 May 20.
3
mrMLM v4.0.2: An R Platform for Multi-locus Genome-wide Association Studies.
应激性高血糖比值在评估心血管-肾脏-代谢综合征0至3期个体全因和心血管死亡风险中的预后意义:两项队列研究的证据
Cardiovasc Diabetol. 2025 Mar 24;24(1):137. doi: 10.1186/s12933-025-02689-6.
mrMLM v4.0.2:一个用于多基因座全基因组关联研究的 R 平台。
Genomics Proteomics Bioinformatics. 2020 Aug;18(4):481-487. doi: 10.1016/j.gpb.2020.06.006. Epub 2020 Dec 18.
4
A polynomial algorithm for best-subset selection problem.一种最佳子集选择问题的多项式算法。
Proc Natl Acad Sci U S A. 2020 Dec 29;117(52):33117-33123. doi: 10.1073/pnas.2014241117. Epub 2020 Dec 16.
5
Rapid epistatic mixed-model association studies by controlling multiple polygenic effects.通过控制多种多基因效应实现快速上位性混合模型关联研究。
Bioinformatics. 2020 Dec 8;36(19):4833-4837. doi: 10.1093/bioinformatics/btaa610.
6
Deshrinking ridge regression for genome-wide association studies.缩岭 Ridge 回归在全基因组关联研究中的应用。
Bioinformatics. 2020 Aug 15;36(14):4154-4162. doi: 10.1093/bioinformatics/btaa345.
7
A resource-efficient tool for mixed model association analysis of large-scale data.一种资源高效的工具,用于大规模数据的混合模型关联分析。
Nat Genet. 2019 Dec;51(12):1749-1755. doi: 10.1038/s41588-019-0530-8. Epub 2019 Nov 25.
8
A coordinate descent approach for sparse Bayesian learning in high dimensional QTL mapping and genome-wide association studies.一种用于高维 QTL 映射和全基因组关联研究中稀疏贝叶斯学习的坐标下降方法。
Bioinformatics. 2019 Nov 1;35(21):4327-4335. doi: 10.1093/bioinformatics/btz244.
9
Editorial: The Applications of New Multi-Locus GWAS Methodologies in the Genetic Dissection of Complex Traits.社论:新的多位点全基因组关联研究方法在复杂性状基因剖析中的应用
Front Plant Sci. 2019 Feb 11;10:100. doi: 10.3389/fpls.2019.00100. eCollection 2019.
10
Population structure in genetic studies: Confounding factors and mixed models.遗传研究中的群体结构:混杂因素与混合模型。
PLoS Genet. 2018 Dec 27;14(12):e1007309. doi: 10.1371/journal.pgen.1007309. eCollection 2018 Dec.