• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用可视评分进行复杂性状的基因组预测在育种计划中的应用。

Using visual scores for genomic prediction of complex traits in breeding programs.

机构信息

Statistics Department, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil.

Horticultural Sciences Department, Blueberry Breeding and Genomics Lab, University of Florida, Gainesville, FL, USA.

出版信息

Theor Appl Genet. 2023 Dec 15;137(1):9. doi: 10.1007/s00122-023-04512-w.

DOI:10.1007/s00122-023-04512-w
PMID:38102495
Abstract

An approach for handling visual scores with potential errors and subjectivity in scores was evaluated in simulated and blueberry recurrent selection breeding schemes to assist breeders in their decision-making. Most genomic prediction methods are based on assumptions of normality due to their simplicity and ease of implementation. However, in plant and animal breeding, continuous traits are often visually scored as categorical traits and analyzed as a Gaussian variable, thus violating the normality assumption, which could affect the prediction of breeding values and the estimation of genetic parameters. In this study, we examined the main challenges of visual scores for genomic prediction and genetic parameter estimation using mixed models, Bayesian, and machine learning methods. We evaluated these approaches using simulated and real breeding data sets. Our contribution in this study is a five-fold demonstration: (i) collecting data using an intermediate number of categories (1-3 and 1-5) is the best strategy, even considering errors associated with visual scores; (ii) Linear Mixed Models and Bayesian Linear Regression are robust to the normality violation, but marginal gains can be achieved when using Bayesian Ordinal Regression Models (BORM) and Random Forest Classification; (iii) genetic parameters are better estimated using BORM; (iv) our conclusions using simulated data are also applicable to real data in autotetraploid blueberry; and (v) a comparison of continuous and categorical phenotypes found that investing in the evaluation of 600-1000 categorical data points with low error, when it is not feasible to collect continuous phenotypes, is a strategy for improving predictive abilities. Our findings suggest the best approaches for effectively using visual scores traits to explore genetic information in breeding programs and highlight the importance of investing in the training of evaluator teams and in high-quality phenotyping.

摘要

一种处理视觉评分中潜在误差和评分主观性的方法在模拟和蓝莓轮回选择育种计划中进行了评估,以帮助育种者做出决策。由于其简单性和易于实现,大多数基因组预测方法都是基于正态性假设。然而,在植物和动物育种中,连续性状通常被视为分类性状进行视觉评分,并作为高斯变量进行分析,从而违反了正态性假设,这可能会影响育种值的预测和遗传参数的估计。在这项研究中,我们使用混合模型、贝叶斯和机器学习方法检查了视觉评分对基因组预测和遗传参数估计的主要挑战。我们使用模拟和真实育种数据集评估了这些方法。我们在这项研究中的贡献有五个方面:(i)即使考虑到与视觉评分相关的误差,使用中间数量的类别(1-3 和 1-5)收集数据也是最佳策略;(ii)线性混合模型和贝叶斯线性回归对正态性违反具有鲁棒性,但当使用贝叶斯有序回归模型(BORM)和随机森林分类时,可以获得边际收益;(iii)BORM 可以更好地估计遗传参数;(iv)我们使用模拟数据得出的结论也适用于同源四倍体蓝莓中的真实数据;(v)连续和分类表型的比较发现,当无法收集连续表型时,投资于评估具有低误差的 600-1000 个分类数据点是提高预测能力的策略。我们的研究结果表明了有效使用视觉评分性状的最佳方法,以探索育种计划中的遗传信息,并强调了投资于评估团队培训和高质量表型的重要性。

相似文献

1
Using visual scores for genomic prediction of complex traits in breeding programs.利用可视评分进行复杂性状的基因组预测在育种计划中的应用。
Theor Appl Genet. 2023 Dec 15;137(1):9. doi: 10.1007/s00122-023-04512-w.
2
Reduction in accuracy of genomic prediction for ordered categorical data compared to continuous observations.与连续观测相比,有序分类数据的基因组预测准确性降低。
Genet Sel Evol. 2014 Jun 9;46(1):37. doi: 10.1186/1297-9686-46-37.
3
Optimizing Genomic Parental Selection for Categorical and Continuous-Categorical Multi-Trait Mixtures.优化分类和连续分类多性状混合的基因组亲本选择。
Genes (Basel). 2024 Jul 29;15(8):995. doi: 10.3390/genes15080995.
4
Bayesian discrete lognormal regression model for genomic prediction.贝叶斯离散对数正态回归模型在基因组预测中的应用。
Theor Appl Genet. 2024 Jan 14;137(1):21. doi: 10.1007/s00122-023-04526-4.
5
Fast genomic predictions via Bayesian G-BLUP and multilocus models of threshold traits including censored Gaussian data.基于贝叶斯 G-BLUP 和阈性状的多基因模型的快速基因组预测,包括截尾正态数据。
G3 (Bethesda). 2013 Sep 4;3(9):1511-23. doi: 10.1534/g3.113.007096.
6
New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes.基于深度学习的基因组的新型预测模型,用于具有二元、有序和连续表型的多个特征。
G3 (Bethesda). 2019 May 7;9(5):1545-1556. doi: 10.1534/g3.119.300585.
7
Ridge regression and deep learning models for genome-wide selection of complex traits in New Mexican Chile peppers.岭回归和深度学习模型在新墨西哥智利辣椒全基因组复杂性状选择中的应用。
BMC Genom Data. 2023 Dec 18;24(1):80. doi: 10.1186/s12863-023-01179-6.
8
Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data.植物育种中遗传力和预测准确性的稳健估计:使用模拟和经验数据进行评估。
BMC Genomics. 2020 Jan 14;21(1):43. doi: 10.1186/s12864-019-6429-z.
9
Accounting for Correlation Between Traits in Genomic Prediction.基因组预测中性状间相关性的考量
Methods Mol Biol. 2022;2467:285-327. doi: 10.1007/978-1-0716-2205-6_10.
10
A computationally efficient algorithm for genomic prediction using a Bayesian model.一种使用贝叶斯模型进行基因组预测的计算高效算法。
Genet Sel Evol. 2015 Apr 30;47(1):34. doi: 10.1186/s12711-014-0082-4.

引用本文的文献

1
Transferability of genomic prediction models across market segments in potato and the effect of selection.马铃薯基因组预测模型在不同市场细分中的可转移性及选择效应
Theor Appl Genet. 2025 Aug 20;138(9):219. doi: 10.1007/s00122-025-05004-9.
2
Comparative genomic prediction of resistance to Fusarium wilt (Fusarium oxysporum f. sp. niveum race 2) in watermelon: parametric and nonparametric approaches.西瓜对枯萎病(尖孢镰刀菌西瓜专化型2号生理小种)抗性的比较基因组预测:参数化和非参数化方法
Theor Appl Genet. 2025 Jan 24;138(1):35. doi: 10.1007/s00122-024-04813-8.
3
Segment Anything for Comprehensive Analysis of Grapevine Cluster Architecture and Berry Properties.

本文引用的文献

1
Classification and Regression Models for Genomic Selection of Skewed Phenotypes: A Case for Disease Resistance in Winter Wheat ( L.).偏态表型基因组选择的分类与回归模型:以冬小麦(L.)的抗病性为例
Front Genet. 2022 Feb 23;13:835781. doi: 10.3389/fgene.2022.835781. eCollection 2022.
2
Genomic Selection in an Outcrossing Autotetraploid Fruit Crop: Lessons From Blueberry Breeding.异交四倍体果树中的基因组选择:来自蓝莓育种的经验教训。
Front Plant Sci. 2021 Jun 14;12:676326. doi: 10.3389/fpls.2021.676326. eCollection 2021.
3
Long-term comparison between index selection and optimal independent culling in plant breeding programs with genomic prediction.
用于葡萄果穗结构和浆果特性综合分析的任何分割方法
Plant Phenomics. 2024 Jun 27;6:0202. doi: 10.34133/plantphenomics.0202. eCollection 2024.
4
Enhancing grapevine breeding efficiency through genomic prediction and selection index.通过基因组预测和选择指数提高葡萄育种效率。
G3 (Bethesda). 2024 Apr 3;14(4). doi: 10.1093/g3journal/jkae038.
基于基因组预测的植物育种计划中指数选择与最优独立淘汰的长期比较。
PLoS One. 2021 May 10;16(5):e0235554. doi: 10.1371/journal.pone.0235554. eCollection 2021.
4
Impact of Mislabeling on Genomic Selection in Cassava Breeding.标签错误对木薯育种中基因组选择的影响。
Crop Sci. 2018 Jul-Aug;58:1470-1480. doi: 10.2135/cropsci2017.07.0442. Epub 2018 Jun 21.
5
Genome-based prediction of Bayesian linear and non-linear regression models for ordinal data.基于基因组的贝叶斯线性和非线性回归模型对有序数据的预测。
Plant Genome. 2020 Jul;13(2):e20021. doi: 10.1002/tpg2.20021. Epub 2020 May 14.
6
How can a high-quality genome assembly help plant breeders?高质量的基因组组装如何帮助植物育种者?
Gigascience. 2019 Jun 1;8(6). doi: 10.1093/gigascience/giz068.
7
Haplotype-phased genome and evolution of phytonutrient pathways of tetraploid blueberry.四倍体蓝莓的单倍型分型基因组与植物营养素代谢途径的进化
Gigascience. 2019 Mar 1;8(3). doi: 10.1093/gigascience/giz012.
8
Accurate genomic prediction of Coffea canephora in multiple environments using whole-genome statistical models.利用全基因组统计模型对多个环境中的咖啡进行精确基因组预测。
Heredity (Edinb). 2019 Mar;122(3):261-275. doi: 10.1038/s41437-018-0105-y. Epub 2018 Jun 25.
9
The effect of mislabeled phenotypic status on the identification of mutation-carriers from SNP genotypes in dairy cattle.错误标注的表型状态对从奶牛单核苷酸多态性(SNP)基因型中识别突变携带者的影响。
BMC Res Notes. 2017 Jun 26;10(1):230. doi: 10.1186/s13104-017-2540-x.
10
General Methods for Evolutionary Quantitative Genetic Inference from Generalized Mixed Models.基于广义混合模型的进化数量遗传推断通用方法。
Genetics. 2016 Nov;204(3):1281-1294. doi: 10.1534/genetics.115.186536. Epub 2016 Sep 2.