• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

KAML:使用机器学习确定的参数来提高复杂性状的基因组预测准确性。

KAML: improving genomic prediction accuracy of complex traits using machine learning determined parameters.

机构信息

Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China.

Key Laboratory of Swine Genetics and Breeding, Ministry of Agriculture, Huazhong Agricultural University, Wuhan, 430070, Hubei, People's Republic of China.

出版信息

Genome Biol. 2020 Jun 17;21(1):146. doi: 10.1186/s13059-020-02052-w.

DOI:10.1186/s13059-020-02052-w
PMID:32552725
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7386246/
Abstract

Advances in high-throughput sequencing technologies have reduced the cost of genotyping dramatically and led to genomic prediction being widely used in animal and plant breeding, and increasingly in human genetics. Inspired by the efficient computing of linear mixed model and the accurate prediction of Bayesian methods, we propose a machine learning-based method incorporating cross-validation, multiple regression, grid search, and bisection algorithms named KAML that aims to combine the advantages of prediction accuracy with computing efficiency. KAML exhibits higher prediction accuracy than existing methods, and it is available at https://github.com/YinLiLin/KAML.

摘要

高通量测序技术的进步极大地降低了基因分型的成本,并导致基因组预测在动植物育种中得到广泛应用,并且在人类遗传学中也越来越普及。受线性混合模型高效计算和贝叶斯方法精确预测的启发,我们提出了一种基于机器学习的方法,该方法结合了交叉验证、多元回归、网格搜索和二分算法,名为 KAML,旨在结合预测准确性和计算效率的优势。KAML 表现出比现有方法更高的预测准确性,并且可以在 https://github.com/YinLiLin/KAML 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/c67f36f23c63/13059_2020_2052_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/5897784e1caa/13059_2020_2052_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/d6c62b9a64d6/13059_2020_2052_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/45bd3b07d00c/13059_2020_2052_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/ef1a8395111d/13059_2020_2052_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/c67f36f23c63/13059_2020_2052_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/5897784e1caa/13059_2020_2052_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/d6c62b9a64d6/13059_2020_2052_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/45bd3b07d00c/13059_2020_2052_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/ef1a8395111d/13059_2020_2052_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa49/7388460/c67f36f23c63/13059_2020_2052_Fig5_HTML.jpg

相似文献

1
KAML: improving genomic prediction accuracy of complex traits using machine learning determined parameters.KAML:使用机器学习确定的参数来提高复杂性状的基因组预测准确性。
Genome Biol. 2020 Jun 17;21(1):146. doi: 10.1186/s13059-020-02052-w.
2
Application of linear and machine learning models to genomic prediction of fatty acid composition in Japanese Black cattle.线性和机器学习模型在日本黑牛脂肪酸组成的基因组预测中的应用。
Anim Sci J. 2023 Jan-Dec;94(1):e13883. doi: 10.1111/asj.13883.
3
Improving accuracy of genomic prediction by genetic architecture based priors in a Bayesian model.在贝叶斯模型中通过基于遗传结构的先验信息提高基因组预测的准确性。
BMC Genet. 2015 Oct 14;16:120. doi: 10.1186/s12863-015-0278-9.
4
MAK: a machine learning framework improved genomic prediction via multi-target ensemble regressor chains and automatic selection of assistant traits.MAK:一个通过多目标集成回归链和辅助性状自动选择来改进基因组预测的机器学习框架。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad043.
5
Genomic prediction of genetic merit using LD-based haplotypes in the Nordic Holstein population.在北欧荷斯坦牛群体中使用基于连锁不平衡的单倍型对遗传价值进行基因组预测。
BMC Genomics. 2014 Dec 23;15(1):1171. doi: 10.1186/1471-2164-15-1171.
6
An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction.用于基于基因组预测的多环境和多性状多环境数据的贝叶斯分析的 R 包。
G3 (Bethesda). 2019 May 7;9(5):1355-1369. doi: 10.1534/g3.119.400126.
7
Accuracy of predicting genomic breeding values for residual feed intake in Angus and Charolais beef cattle.预测 Angus 和夏洛莱肉牛剩余采食量的基因组育种值的准确性。
J Anim Sci. 2013 Oct;91(10):4669-78. doi: 10.2527/jas.2013-5715.
8
Genomic prediction of breeding values for carcass traits in Nellore cattle.内洛尔牛胴体性状育种值的基因组预测
Genet Sel Evol. 2016 Jan 29;48:7. doi: 10.1186/s12711-016-0188-y.
9
Comparison of conventional BLUP and single-step genomic BLUP evaluations for yearling weight and carcass traits in Hanwoo beef cattle using single trait and multi-trait models.利用单性状和多性状模型比较韩牛育肥牛的常规 BLUP 和单步基因组 BLUP 对周岁体重和胴体性状的评估。
PLoS One. 2019 Oct 14;14(10):e0223352. doi: 10.1371/journal.pone.0223352. eCollection 2019.
10
Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data.基于机器学习的基因组预测:在合成数据和实际数据上,正则化回归、集成、基于实例和深度学习方法的性能比较。
BMC Genomics. 2024 Feb 7;25(1):152. doi: 10.1186/s12864-023-09933-x.

引用本文的文献

1
DeepAnnotation: A novel interpretable deep learning-based genomic selection model that integrates comprehensive functional annotations.深度注释:一种基于深度学习的新型可解释基因组选择模型,该模型整合了全面的功能注释。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf083.
2
EXGEP: a framework for predicting genotype-by-environment interactions using ensembles of explainable machine-learning models.EXGEP:一个使用可解释机器学习模型集成来预测基因-环境相互作用的框架。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf414.
3
Machine Learning Analysis of Maize Seedling Traits Under Drought Stress.

本文引用的文献

1
Improved polygenic prediction by Bayesian multiple regression on summary statistics.基于汇总统计数据的贝叶斯多元回归提高多基因预测能力。
Nat Commun. 2019 Nov 8;10(1):5086. doi: 10.1038/s41467-019-12653-0.
2
G2P: a Genome-Wide-Association-Study simulation tool for genotype simulation, phenotype simulation and power evaluation.G2P:用于基因型模拟、表型模拟和功效评估的全基因组关联研究模拟工具。
Bioinformatics. 2019 Oct 1;35(19):3852-3854. doi: 10.1093/bioinformatics/btz126.
3
Harnessing genomic information for livestock improvement.利用基因组信息促进家畜改良。
干旱胁迫下玉米幼苗性状的机器学习分析
Biology (Basel). 2025 Jun 29;14(7):787. doi: 10.3390/biology14070787.
4
Machine Learning-Based Analysis of Differentially Expressed Genes in the Muscle Transcriptome Between Beef Cattle and Dairy Cattle.基于机器学习的肉牛和奶牛肌肉转录组中差异表达基因分析
Int J Mol Sci. 2025 May 23;26(11):5046. doi: 10.3390/ijms26115046.
5
Enhancing Genomic Prediction Accuracy with a Single-Step Genomic Best Linear Unbiased Prediction Model Integrating Genome-Wide Association Study Results.使用整合全基因组关联研究结果的单步基因组最佳线性无偏预测模型提高基因组预测准确性。
Animals (Basel). 2025 Apr 29;15(9):1268. doi: 10.3390/ani15091268.
6
ExAutoGP: Enhancing Genomic Prediction Stability and Interpretability with Automated Machine Learning and SHAP.ExAutoGP:通过自动机器学习和SHAP增强基因组预测的稳定性和可解释性
Animals (Basel). 2025 Apr 18;15(8):1172. doi: 10.3390/ani15081172.
7
mmGEBLUP: an advanced genomic prediction scheme for genetic improvement of complex traits in crops through integrative analysis of major genes, polygenes, and genotype-environment interactions.多性状基因组最佳线性无偏预测法(mmGEBLUP):一种通过对主基因、多基因以及基因型与环境互作进行综合分析,用于作物复杂性状遗传改良的先进基因组预测方案。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf058.
8
KPRR: a novel machine learning approach for effectively capturing nonadditive effects in genomic prediction.KPRR:一种有效捕捉基因组预测中非加性效应的新型机器学习方法。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae683.
9
IIIVmrMLM.QEI: An effective tool for indirect detection of QTN-by-environment interactions in genome-wide association studies.IIIVmrMLM.QEI:全基因组关联研究中用于间接检测QTN与环境相互作用的有效工具。
Comput Struct Biotechnol J. 2024 Dec 2;23:4357-4368. doi: 10.1016/j.csbj.2024.11.046. eCollection 2024 Dec.
10
An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers.应用于黄羽肉鸡基因组预测的机器学习方法研究。
Poult Sci. 2025 Jan;104(1):104489. doi: 10.1016/j.psj.2024.104489. Epub 2024 Nov 1.
Nat Rev Genet. 2019 Mar;20(3):135-156. doi: 10.1038/s41576-018-0082-2.
4
Common Disease Is More Complex Than Implied by the Core Gene Omnigenic Model.常见疾病比核心基因全基因组模型所暗示的更为复杂。
Cell. 2018 Jun 14;173(7):1573-1580. doi: 10.1016/j.cell.2018.05.051.
5
Non-parametric genetic prediction of complex traits with latent Dirichlet process regression models.使用潜在狄利克雷过程回归模型对复杂性状进行非参数遗传预测。
Nat Commun. 2017 Sep 6;8(1):456. doi: 10.1038/s41467-017-00470-2.
6
An efficient unified model for genome-wide association studies and genomic selection.一种用于全基因组关联研究和基因组选择的高效统一模型。
Genet Sel Evol. 2017 Aug 24;49(1):64. doi: 10.1186/s12711-017-0338-x.
7
10 Years of GWAS Discovery: Biology, Function, and Translation.全基因组关联研究十年发现:生物学、功能与转化
Am J Hum Genet. 2017 Jul 6;101(1):5-22. doi: 10.1016/j.ajhg.2017.06.005.
8
Using markers with large effect in genetic and genomic predictions.在遗传和基因组预测中使用具有大效应的标记。
J Anim Sci. 2017 Jan;95(1):59-71. doi: 10.2527/jas.2016.0754.
9
Dissecting the genetics of complex traits using summary association statistics.利用汇总关联统计剖析复杂性状的遗传学。
Nat Rev Genet. 2017 Feb;18(2):117-127. doi: 10.1038/nrg.2016.142. Epub 2016 Nov 14.
10
Developing and evaluating polygenic risk prediction models for stratified disease prevention.开发和评估用于分层疾病预防的多基因风险预测模型。
Nat Rev Genet. 2016 Jul;17(7):392-406. doi: 10.1038/nrg.2016.27. Epub 2016 May 3.