• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GMStool:基于 GWAS 的基因组预测标记选择工具,用于从基因组数据中进行基因组预测。

GMStool: GWAS-based marker selection tool for genomic prediction from genomic data.

机构信息

Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.

Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology (UST), Daejeon, 34141, Republic of Korea.

出版信息

Sci Rep. 2020 Nov 12;10(1):19653. doi: 10.1038/s41598-020-76759-y.

DOI:10.1038/s41598-020-76759-y
PMID:33184432
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7665227/
Abstract

The increased accessibility to genomic data in recent years has laid the foundation for studies to predict various phenotypes of organisms based on the genome. Genomic prediction collectively refers to these studies, and it estimates an individual's phenotypes mainly using single nucleotide polymorphism markers. Typically, the accuracy of these genomic prediction studies is highly dependent on the markers used; however, in practice, choosing optimal markers with high accuracy for the phenotype to be used is a challenging task. Therefore, we present a new tool called GMStool for selecting optimal marker sets and predicting quantitative phenotypes. The GMStool is based on a genome-wide association study (GWAS) and heuristically searches for optimal markers using statistical and machine-learning methods. The GMStool performs the genomic prediction using statistical and machine/deep-learning models and presents the best prediction model with the optimal marker-set. For the evaluation, the GMStool was tested on real datasets with four phenotypes. The prediction results showed higher performance than using the entire markers or the GWAS-top markers, which have been used frequently in prediction studies. Although the GMStool has several limitations, it is expected to contribute to various studies for predicting quantitative phenotypes. The GMStool written in R is available at www.github.com/JaeYoonKim72/GMStool .

摘要

近年来,基因组数据的可及性增加为基于基因组预测生物体各种表型的研究奠定了基础。基因组预测通常是指这些研究,它主要使用单核苷酸多态性标记来估计个体的表型。通常,这些基因组预测研究的准确性高度依赖于所使用的标记;然而,在实践中,选择用于特定表型的具有高精度的最佳标记是一项具有挑战性的任务。因此,我们提出了一种名为 GMStool 的新工具,用于选择最佳标记集和预测定量表型。GMStool 基于全基因组关联研究 (GWAS),并使用统计和机器学习方法启发式地搜索最佳标记。GMStool 使用统计和机器学习/深度学习模型进行基因组预测,并呈现具有最佳标记集的最佳预测模型。为了进行评估,我们在具有四个表型的真实数据集上测试了 GMStool。预测结果显示,其性能优于经常用于预测研究的整个标记或 GWAS 顶级标记。尽管 GMStool 存在一些局限性,但它有望为预测定量表型的各种研究做出贡献。用 R 编写的 GMStool 可在 www.github.com/JaeYoonKim72/GMStool 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/a96103f53f36/41598_2020_76759_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/c80726fd4f89/41598_2020_76759_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/075a21e42faf/41598_2020_76759_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/ac339b419f57/41598_2020_76759_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/a96103f53f36/41598_2020_76759_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/c80726fd4f89/41598_2020_76759_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/075a21e42faf/41598_2020_76759_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/ac339b419f57/41598_2020_76759_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4362/7665227/a96103f53f36/41598_2020_76759_Fig4_HTML.jpg

相似文献

1
GMStool: GWAS-based marker selection tool for genomic prediction from genomic data.GMStool:基于 GWAS 的基因组预测标记选择工具,用于从基因组数据中进行基因组预测。
Sci Rep. 2020 Nov 12;10(1):19653. doi: 10.1038/s41598-020-76759-y.
2
Genomic prediction models for traits differing in heritability for soybean, rice, and maize.大豆、水稻和玉米不同遗传力性状的基因组预测模型。
BMC Plant Biol. 2022 Feb 26;22(1):87. doi: 10.1186/s12870-022-03479-y.
3
GbyE: an integrated tool for genome widely association study and genome selection based on genetic by environmental interaction.GbyE:一种基于遗传与环境互作的全基因组关联研究和基因组选择的综合工具。
BMC Genomics. 2024 Apr 19;25(1):386. doi: 10.1186/s12864-024-10310-5.
4
Genome-wide association and genomic prediction identifies associated loci and predicts the sensitivity of Tobacco ringspot virus in soybean plant introductions.全基因组关联分析和基因组预测确定了相关位点,并预测了大豆引进品种对烟草环斑病毒的敏感性。
BMC Genomics. 2016 Feb 29;17:153. doi: 10.1186/s12864-016-2487-7.
5
DeepCGP: A Deep Learning Method to Compress Genome-Wide Polymorphisms for Predicting Phenotype of Rice.DeepCGP:一种用于压缩全基因组多态性以预测水稻表型的深度学习方法。
IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2078-2088. doi: 10.1109/TCBB.2022.3231466. Epub 2023 Jun 5.
6
Gene based markers improve precision of genome-wide association studies and accuracy of genomic predictions in rice breeding.基于基因的标记物可提高水稻育种全基因组关联研究的精度和基因组预测的准确性。
Heredity (Edinb). 2023 May;130(5):335-345. doi: 10.1038/s41437-023-00599-5. Epub 2023 Feb 15.
7
Using markers with large effect in genetic and genomic predictions.在遗传和基因组预测中使用具有大效应的标记。
J Anim Sci. 2017 Jan;95(1):59-71. doi: 10.2527/jas.2016.0754.
8
An efficient unified model for genome-wide association studies and genomic selection.一种用于全基因组关联研究和基因组选择的高效统一模型。
Genet Sel Evol. 2017 Aug 24;49(1):64. doi: 10.1186/s12711-017-0338-x.
9
A machine learning pipeline for quantitative phenotype prediction from genotype data.基于基因型数据的定量表型预测的机器学习管道。
BMC Bioinformatics. 2010 Oct 26;11 Suppl 8(Suppl 8):S3. doi: 10.1186/1471-2105-11-S8-S3.
10
Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.利用真实或推算的全基因组标记预测牛模拟多基因表型及其潜在数量性状位点基因型的准确性。
Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.

引用本文的文献

1
Beyond the genome: the role of functional markers in contemporary plant breeding.超越基因组:功能标记在当代植物育种中的作用
Front Plant Sci. 2025 Aug 5;16:1637299. doi: 10.3389/fpls.2025.1637299. eCollection 2025.
2
Holobiont-based genetic analysis reveals new plant and microbial markers for resistance against a root rot pathogen complex in pea.基于共生功能体的遗传分析揭示了豌豆抗根腐病原菌复合体的新植物和微生物标记。
BMC Plant Biol. 2025 Aug 9;25(1):1053. doi: 10.1186/s12870-025-06995-9.
3
Genome-Wide Association Analysis and Genomic Selection for Growth Traits in Grass Carp ().

本文引用的文献

1
KAML: improving genomic prediction accuracy of complex traits using machine learning determined parameters.KAML:使用机器学习确定的参数来提高复杂性状的基因组预测准确性。
Genome Biol. 2020 Jun 17;21(1):146. doi: 10.1186/s13059-020-02052-w.
2
Improved polygenic prediction by Bayesian multiple regression on summary statistics.基于汇总统计数据的贝叶斯多元回归提高多基因预测能力。
Nat Commun. 2019 Nov 8;10(1):5086. doi: 10.1038/s41467-019-12653-0.
3
Korean soybean core collection: Genotypic and phenotypic diversity population structure and genome-wide association study.
草鱼生长性状的全基因组关联分析和基因组选择
Animals (Basel). 2025 Jun 26;15(13):1888. doi: 10.3390/ani15131888.
4
Variable selection strategies for genomic prediction of growth and carcass related traits in experimental Nellore cattle herds under different selection criteria.不同选择标准下实验内洛尔牛群生长和胴体相关性状基因组预测的变量选择策略
Sci Rep. 2025 Jul 1;15(1):22266. doi: 10.1038/s41598-025-06949-z.
5
Role of multi-omics in aquaculture genetics and breeding: current status and future perspective.多组学在水产养殖遗传与育种中的作用:现状与未来展望
Sci China Life Sci. 2025 May 28. doi: 10.1007/s11427-024-2828-8.
6
Comparison of genomic prediction accuracies in dairy cattle lactation traits using five classes of functional variants versus generic SNP.使用五类功能变异与通用单核苷酸多态性对奶牛泌乳性状进行基因组预测准确性的比较。
Genet Sel Evol. 2025 Apr 11;57(1):20. doi: 10.1186/s12711-025-00966-2.
7
Exploring genetic variants affecting milk production traits through genome-wide association study in Vrindavani crossbred cattle of India.通过全基因组关联研究探索影响印度弗林达瓦尼杂交牛产奶性状的基因变异。
Trop Anim Health Prod. 2025 Mar 6;57(2):104. doi: 10.1007/s11250-025-04348-0.
8
Enhancing Genomic Prediction Accuracy of Reproduction Traits in Rongchang Pigs Through Machine Learning.通过机器学习提高荣昌猪繁殖性状的基因组预测准确性
Animals (Basel). 2025 Feb 12;15(4):525. doi: 10.3390/ani15040525.
9
Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle.整合多层生物学先验信息以提高肉牛基因组预测准确性。
Biol Direct. 2024 Dec 31;19(1):147. doi: 10.1186/s13062-024-00574-y.
10
Using the Pearson's correlation coefficient as the sole metric to measure the accuracy of quantitative trait prediction: is it sufficient?使用皮尔逊相关系数作为衡量数量性状预测准确性的唯一指标:这足够吗?
Front Plant Sci. 2024 Dec 10;15:1480463. doi: 10.3389/fpls.2024.1480463. eCollection 2024.
韩国大豆核心种质:基因型和表型多样性、群体结构和全基因组关联研究。
PLoS One. 2019 Oct 22;14(10):e0224074. doi: 10.1371/journal.pone.0224074. eCollection 2019.
4
Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits.基于参数和机器学习模型的复杂性状基因组预测的基准测试。
G3 (Bethesda). 2019 Nov 5;9(11):3691-3702. doi: 10.1534/g3.119.400498.
5
Genomic Prediction Including SNP-Specific Variance Predictors.基因组预测包括 SNP 特异性方差预测因子。
G3 (Bethesda). 2019 Oct 7;9(10):3333-3343. doi: 10.1534/g3.119.400381.
6
SPADIS: An Algorithm for Selecting Predictive and Diverse SNPs in GWAS.SPADIS:一种在 GWAS 中选择具有预测性和多样性的 SNPs 的算法。
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):1208-1216. doi: 10.1109/TCBB.2019.2935437. Epub 2021 Jun 3.
7
A Guide for Using Deep Learning for Complex Trait Genomic Prediction.深度学习在复杂性状基因组预测中的应用指南。
Genes (Basel). 2019 Jul 20;10(7):553. doi: 10.3390/genes10070553.
8
Using imputed whole-genome sequence data to improve the accuracy of genomic prediction for parasite resistance in Australian sheep.利用估算的全基因组序列数据提高澳大利亚绵羊寄生虫抗性的基因组预测准确性。
Genet Sel Evol. 2019 Jun 26;51(1):32. doi: 10.1186/s12711-019-0476-4.
9
Tournaments between markers as a strategy to enhance genomic predictions.标记物间的竞赛作为一种提升基因组预测的策略。
PLoS One. 2019 Jun 24;14(6):e0217283. doi: 10.1371/journal.pone.0217283. eCollection 2019.
10
Genetic diversity patterns and domestication origin of soybean.大豆的遗传多样性模式和驯化起源。
Theor Appl Genet. 2019 Apr;132(4):1179-1193. doi: 10.1007/s00122-018-3271-7. Epub 2018 Dec 26.