• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用推荐系统预测多性状和多环境基因组数据

Prediction of Multiple-Trait and Multiple-Environment Genomic Data Using Recommender Systems.

作者信息

Montesinos-López Osval A, Montesinos-López Abelardo, Crossa José, Montesinos-López José C, Mota-Sanchez David, Estrada-González Fermín, Gillberg Jussi, Singh Ravi, Mondal Suchismita, Juliana Philomin

机构信息

Facultad de Telemática, Universidad de Colima, 28040 Colima, México

Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, 44430 Jalisco, México.

出版信息

G3 (Bethesda). 2018 Jan 4;8(1):131-147. doi: 10.1534/g3.117.300309.

DOI:10.1534/g3.117.300309
PMID:29097376
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5765342/
Abstract

In genomic-enabled prediction, the task of improving the accuracy of the prediction of lines in environments is difficult because the available information is generally sparse and usually has low correlations between traits. In current genomic selection, although researchers have a large amount of information and appropriate statistical models to process it, there is still limited computing efficiency to do so. Although some statistical models are usually mathematically elegant, many of them are also computationally inefficient, and they are impractical for many traits, lines, environments, and years because they need to sample from huge normal multivariate distributions. For these reasons, this study explores two recommender systems: item-based collaborative filtering (IBCF) and the matrix factorization algorithm (MF) in the context of multiple traits and multiple environments. The IBCF and MF methods were compared with two conventional methods on simulated and real data. Results of the simulated and real data sets show that the IBCF technique was slightly better in terms of prediction accuracy than the two conventional methods and the MF method when the correlation was moderately high. The IBCF technique is very attractive because it produces good predictions when there is high correlation between items (environment-trait combinations) and its implementation is computationally feasible, which can be useful for plant breeders who deal with very large data sets.

摘要

在基于基因组的预测中,提高不同环境下品系预测准确性的任务颇具难度,因为可用信息通常较为稀疏,而且性状之间的相关性通常较低。在当前的基因组选择中,尽管研究人员拥有大量信息以及合适的统计模型来处理这些信息,但进行计算的效率仍然有限。虽然一些统计模型通常在数学上很精妙,但其中许多在计算上效率低下,并且对于许多性状、品系、环境和年份而言并不实用,因为它们需要从巨大的正态多元分布中进行抽样。基于这些原因,本研究在多性状和多环境的背景下探索了两种推荐系统:基于项目的协同过滤(IBCF)和矩阵分解算法(MF)。将IBCF和MF方法与两种传统方法在模拟数据和真实数据上进行了比较。模拟数据集和真实数据集的结果表明,当相关性处于中等高度时,IBCF技术在预测准确性方面比两种传统方法和MF方法略胜一筹。IBCF技术非常具有吸引力,因为当项目(环境 - 性状组合)之间存在高度相关性时,它能产生良好的预测结果,并且其实现方式在计算上是可行的,这对于处理非常大的数据集的植物育种者可能会很有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2d2/5765342/77cc0931aa11/131f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2d2/5765342/f33dd760c104/131f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2d2/5765342/fb49649e7042/131f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2d2/5765342/77cc0931aa11/131f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2d2/5765342/f33dd760c104/131f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2d2/5765342/fb49649e7042/131f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a2d2/5765342/77cc0931aa11/131f3.jpg

相似文献

1
Prediction of Multiple-Trait and Multiple-Environment Genomic Data Using Recommender Systems.使用推荐系统预测多性状和多环境基因组数据
G3 (Bethesda). 2018 Jan 4;8(1):131-147. doi: 10.1534/g3.117.300309.
2
Genomic Selection in Winter Wheat Breeding Using a Recommender Approach.利用推荐方法进行冬小麦育种中的基因组选择。
Genes (Basel). 2020 Jul 11;11(7):779. doi: 10.3390/genes11070779.
3
A Bayesian Poisson-lognormal Model for Count Data for Multiple-Trait Multiple-Environment Genomic-Enabled Prediction.贝叶斯泊松-对数正态模型在多性状多环境下基于基因组的计数数据分析中的应用。
G3 (Bethesda). 2017 May 5;7(5):1595-1606. doi: 10.1534/g3.117.039974.
4
Bayesian Genomic Prediction with Genotype × Environment Interaction Kernel Models.使用基因型×环境互作核模型的贝叶斯基因组预测
G3 (Bethesda). 2017 Jan 5;7(1):41-53. doi: 10.1534/g3.116.035584.
5
Genomic-Enabled Prediction in Maize Using Kernel Models with Genotype × Environment Interaction.利用具有基因型×环境互作的核模型对玉米进行基因组预测
G3 (Bethesda). 2017 Jun 7;7(6):1995-2014. doi: 10.1534/g3.117.042341.
6
A Bayesian Genomic Multi-output Regressor Stacking Model for Predicting Multi-trait Multi-environment Plant Breeding Data.一种用于预测多性状多环境植物育种数据的贝叶斯基因组多输出回归器堆叠模型。
G3 (Bethesda). 2019 Oct 7;9(10):3381-3393. doi: 10.1534/g3.119.400336.
7
Genomic Selection in Plant Breeding: Methods, Models, and Perspectives.基因组选择在植物育种中的应用:方法、模型与展望。
Trends Plant Sci. 2017 Nov;22(11):961-975. doi: 10.1016/j.tplants.2017.08.011. Epub 2017 Sep 28.
8
Use of multiple traits genomic prediction, genotype by environment interactions and spatial effect to improve prediction accuracy in yield data.利用多个性状基因组预测、基因型与环境互作和空间效应来提高产量数据的预测准确性。
PLoS One. 2020 May 13;15(5):e0232665. doi: 10.1371/journal.pone.0232665. eCollection 2020.
9
MegaLMM: Mega-scale linear mixed models for genomic predictions with thousands of traits.MegaLMM:用于具有数千个性状的基因组预测的大规模线性混合模型。
Genome Biol. 2021 Jul 23;22(1):213. doi: 10.1186/s13059-021-02416-w.
10
Multi-Trait Genomic Prediction of Yield-Related Traits in US Soft Wheat under Variable Water Regimes.在不同水分条件下美国软小麦产量相关性状的多性状基因组预测。
Genes (Basel). 2020 Oct 28;11(11):1270. doi: 10.3390/genes11111270.

引用本文的文献

1
Stacked ensembles on basis of parentage information can predict hybrid performance with an accuracy comparable to marker-based GBLUP.基于亲缘关系信息的堆叠集成方法能够预测杂种性能,其准确性与基于标记的基因组最佳线性无偏预测法相当。
Front Plant Sci. 2023 Jul 21;14:1178902. doi: 10.3389/fpls.2023.1178902. eCollection 2023.
2
Genetic dissection of Striga hermonthica (Del.) Benth. resistance via genome-wide association and genomic prediction in tropical maize germplasm.利用全基因组关联分析和基因组预测对热带玉米种质中 Striga hermonthica(Del.)Benth. 抗性的遗传剖析。
Theor Appl Genet. 2021 Mar;134(3):941-958. doi: 10.1007/s00122-020-03744-4. Epub 2021 Jan 3.
3

本文引用的文献

1
A Bayesian Poisson-lognormal Model for Count Data for Multiple-Trait Multiple-Environment Genomic-Enabled Prediction.贝叶斯泊松-对数正态模型在多性状多环境下基于基因组的计数数据分析中的应用。
G3 (Bethesda). 2017 May 5;7(5):1595-1606. doi: 10.1534/g3.117.039974.
2
Canopy Temperature and Vegetation Indices from High-Throughput Phenotyping Improve Accuracy of Pedigree and Genomic Selection for Grain Yield in Wheat.高通量表型分析得出的冠层温度和植被指数提高了小麦籽粒产量系谱选择和基因组选择的准确性。
G3 (Bethesda). 2016 Sep 8;6(9):2799-808. doi: 10.1534/g3.116.032888.
3
A Genomic Bayesian Multi-trait and Multi-environment Model.
Genomic Selection in Winter Wheat Breeding Using a Recommender Approach.
利用推荐方法进行冬小麦育种中的基因组选择。
Genes (Basel). 2020 Jul 11;11(7):779. doi: 10.3390/genes11070779.
4
Crop Yield Prediction Using Deep Neural Networks.使用深度神经网络进行作物产量预测。
Front Plant Sci. 2019 May 22;10:621. doi: 10.3389/fpls.2019.00621. eCollection 2019.
5
Prospects and Challenges of Applied Genomic Selection-A New Paradigm in Breeding for Grain Yield in Bread Wheat.应用基因组选择在面包小麦产量育种中的前景与挑战——一种新的模式。
Plant Genome. 2018 Nov;11(3). doi: 10.3835/plantgenome2018.03.0017.
6
An R Package for Multitrait and Multienvironment Data with the Item-Based Collaborative Filtering Algorithm.一个基于项目的协同过滤算法的多性状和多环境数据的 R 包。
Plant Genome. 2018 Nov;11(3). doi: 10.3835/plantgenome2018.02.0013.
7
Integrating genomic-enabled prediction and high-throughput phenotyping in breeding for climate-resilient bread wheat.将基因组增强预测与高通量表型分析整合到抗逆性面包小麦的育种中。
Theor Appl Genet. 2019 Jan;132(1):177-194. doi: 10.1007/s00122-018-3206-3. Epub 2018 Oct 19.
一种基因组贝叶斯多性状多环境模型。
G3 (Bethesda). 2016 Sep 8;6(9):2725-44. doi: 10.1534/g3.116.032359.
4
Genomic prediction in maize breeding populations with genotyping-by-sequencing.基于测序的基因型鉴定在玉米育种群体中的基因组预测。
G3 (Bethesda). 2013 Nov 6;3(11):1903-26. doi: 10.1534/g3.113.008227.
5
Efficient methods to compute genomic predictions.计算基因组预测的有效方法。
J Dairy Sci. 2008 Nov;91(11):4414-23. doi: 10.3168/jds.2007-0980.
6
Genomic selection.基因组选择
J Anim Breed Genet. 2007 Dec;124(6):323-30. doi: 10.1111/j.1439-0388.2007.00702.x.
7
Prediction of total genetic value using genome-wide dense marker maps.利用全基因组密集标记图谱预测总遗传值。
Genetics. 2001 Apr;157(4):1819-29. doi: 10.1093/genetics/157.4.1819.