• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新审视主成分分析:具有平滑收敛性的快速多性状遗传评估

Principal component analysis revisited: fast multitrait genetic evaluations with smooth convergence.

作者信息

Ahlinder Jon, Hall David, Suontama Mari, Sillanpää Mikko J

机构信息

Department of Tree Breeding, Skogforsk, Box 3, Tomterna 1, Sävar SE-91821, Sweden.

Department of Ecology and Environmental Science, Umeå University, Umeå SE-90736, Sweden.

出版信息

G3 (Bethesda). 2024 Oct 21;14(12). doi: 10.1093/g3journal/jkae228.

DOI:10.1093/g3journal/jkae228
PMID:39429114
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11631533/
Abstract

A cornerstone in breeding and population genetics is the genetic evaluation procedure, needed to make important decisions on population management. Multivariate mixed model analysis, in which many traits are considered jointly, utilizes genetic and environmental correlations between traits to improve the accuracy. However, the number of parameters in the multitrait model grows exponentially with the number of traits which reduces its scalability. Here, we suggest using principal component analysis to reduce the dimensions of the response variables, and then using the computed principal components as separate responses in the genetic evaluation analysis. As principal components are orthogonal to each other so that phenotypic covariance is abscent between principal components, a full multivariate analysis can be approximated by separate univariate analyses instead which should speed up computations considerably. We compared the approach to both traditional multivariate analysis and factor analytic approach in terms of computational requirement and rank lists according to predicted genetic merit on two forest tree datasets with 22 and 27 measured traits, respectively. Obtained rank lists of the top 50 individuals were in good agreement. Interestingly, the required computational time of the approach only took a few seconds without convergence issues, unlike the traditional approach which required considerably more time to run (7 and 10 h, respectively). The factor analytic approach took approximately 5-10 min. Our approach can easily handle missing data and can be used with all available linear mixed effect model softwares as it does not require any specific implementation. The approach can help to mitigate difficulties with multitrait genetic analysis in both breeding and wild populations.

摘要

育种和群体遗传学的一个基石是遗传评估程序,这是做出群体管理重要决策所必需的。多变量混合模型分析联合考虑多个性状,利用性状之间的遗传和环境相关性来提高准确性。然而,多性状模型中的参数数量会随着性状数量呈指数增长,这降低了其可扩展性。在此,我们建议使用主成分分析来降低响应变量的维度,然后将计算出的主成分作为遗传评估分析中的单独响应。由于主成分彼此正交,因此主成分之间不存在表型协方差,这样就可以通过单独的单变量分析来近似完整的多变量分析,这应该会大大加快计算速度。我们根据预测的遗传价值,在分别具有22个和27个测量性状的两个林木数据集上,从计算要求和排名列表方面将该方法与传统多变量分析和因子分析方法进行了比较。获得的前50个个体的排名列表吻合良好。有趣的是,该方法所需的计算时间仅为几秒,不存在收敛问题,而传统方法运行所需时间长得多(分别为7小时和10小时)。因子分析方法大约需要5 - 10分钟。我们的方法可以轻松处理缺失数据,并且可以与所有可用的线性混合效应模型软件一起使用,因为它不需要任何特定的实现方式。该方法有助于缓解育种和野生群体中多性状遗传分析的困难。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/fe30a7c298ff/jkae228f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/a82bf6a7f5ac/jkae228f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/f5f828d0fd0a/jkae228f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/4b846858cce5/jkae228f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/3ca85bfc6ebb/jkae228f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/d3defa107ee3/jkae228f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/cc6dcf49e7ab/jkae228f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/fe30a7c298ff/jkae228f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/a82bf6a7f5ac/jkae228f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/f5f828d0fd0a/jkae228f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/4b846858cce5/jkae228f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/3ca85bfc6ebb/jkae228f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/d3defa107ee3/jkae228f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/cc6dcf49e7ab/jkae228f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10e3/11631533/fe30a7c298ff/jkae228f7.jpg

相似文献

1
Principal component analysis revisited: fast multitrait genetic evaluations with smooth convergence.重新审视主成分分析:具有平滑收敛性的快速多性状遗传评估
G3 (Bethesda). 2024 Oct 21;14(12). doi: 10.1093/g3journal/jkae228.
2
A comparison of methods to calculate a total merit index using stochastic simulation.使用随机模拟计算总价值指数的方法比较。
Genet Sel Evol. 2015 May 2;47(1):36. doi: 10.1186/s12711-015-0118-4.
3
Improving genetic evaluation using a multitrait single-step genomic model for ability to resume cycling after calving, measured by activity tags in Holstein cows.利用荷斯坦奶牛活动标记测量的产后发情能力的多性状单步基因组模型来提高遗传评估。
J Dairy Sci. 2017 Oct;100(10):8188-8196. doi: 10.3168/jds.2017-13122. Epub 2017 Aug 2.
4
Principal component approach in variance component estimation for international sire evaluation.主成分分析法在国际种公牛评估中估计方差分量的应用。
Genet Sel Evol. 2011 May 24;43(1):21. doi: 10.1186/1297-9686-43-21.
5
Reduced-rank models of growth and reproductive traits in Nelore cattle.内洛尔牛生长和繁殖性状的降秩模型
Theriogenology. 2015 May;83(8):1338-43. doi: 10.1016/j.theriogenology.2015.01.025. Epub 2015 Jan 31.
6
Reduced rank analysis of morphometric and functional traits in Campolina horses.Campolina 马形态和功能特征的降秩分析。
J Anim Breed Genet. 2022 Mar;139(2):231-246. doi: 10.1111/jbg.12658. Epub 2021 Nov 28.
7
A stochastic simulation study on validation of an approximate multitrait model using preadjusted data for prediction of breeding values.一项关于使用预调整数据验证近似多性状模型以预测育种值的随机模拟研究。
J Dairy Sci. 2007 Jun;90(6):3002-11. doi: 10.3168/jds.2006-430.
8
Prediction accuracy of direct and indirect approaches, and their relationships with prediction ability of calibration models.直接法和间接法的预测准确性及其与校准模型预测能力的关系。
J Dairy Sci. 2018 Jul;101(7):6174-6189. doi: 10.3168/jds.2017-13322. Epub 2018 Mar 28.
9
An approximate multitrait model for genetic evaluation in dairy cattle with a robust estimation of genetic trends.一种用于奶牛遗传评估的近似多性状模型及遗传趋势的稳健估计。
Genet Sel Evol. 2007 Jul-Aug;39(4):353-67. doi: 10.1186/1297-9686-39-4-353. Epub 2007 Jul 6.
10
Application of multivariate single-step SNP best linear unbiased predictor model and revised SNP list for genomic evaluation of dairy cattle in Australia.应用多元单步 SNP 最佳线性无偏预测模型和修订的 SNP 列表对澳大利亚奶牛的基因组评估。
J Dairy Sci. 2020 Sep;103(9):8305-8316. doi: 10.3168/jds.2020-18242. Epub 2020 Jul 1.

引用本文的文献

1
Molecular Mass and Isoelectric Point Analysis of Cytokinin Sequences in the Wheat Genome.小麦基因组中细胞分裂素序列的分子量和等电点分析
Int J Mol Sci. 2025 May 30;26(11):5270. doi: 10.3390/ijms26115270.

本文引用的文献

1
Genetic parameter changes and age-age correlations in Pinus koraiensis growth over 40-year progeny testing.红松 40 年子代测定中生长的遗传参数变化及其与年龄的关系。
BMC Plant Biol. 2024 Feb 3;24(1):86. doi: 10.1186/s12870-024-04752-y.
2
Transcriptome analysis of axillary buds in low phosphorus stress and functional analysis of TaWRKY74s in wheat.腋芽在低磷胁迫下的转录组分析和小麦 TaWRKY74s 的功能分析。
BMC Plant Biol. 2024 Jan 2;24(1):1. doi: 10.1186/s12870-023-04695-w.
3
Regularized multi-trait multi-locus linear mixed models for genome-wide association studies and genomic selection in crops.
作物全基因组关联研究和基因组选择的正则化多性状多基因座线性混合模型。
BMC Bioinformatics. 2023 Oct 26;24(1):399. doi: 10.1186/s12859-023-05519-2.
4
Reducing computational demands of restricted maximum likelihood estimation with genomic relationship matrices.利用基因组关系矩阵降低限制最大似然估计的计算需求。
Genet Sel Evol. 2023 Jan 25;55(1):7. doi: 10.1186/s12711-023-00781-7.
5
A computationally efficient method for approximating reliabilities in large-scale single-step genomic prediction.一种用于大规模单步基因组预测中可靠性逼近的计算高效方法。
Genet Sel Evol. 2023 Jan 5;55(1):1. doi: 10.1186/s12711-022-00774-y.
6
Evaluation of automatic discrimination between benign and malignant prostate tissue in the era of high precision digital pathology.评价高精度数字病理学时代下自动区分前列腺良恶性组织。
BMC Bioinformatics. 2023 Jan 3;24(1):1. doi: 10.1186/s12859-022-05124-9.
7
Stan: A Probabilistic Programming Language.斯坦:一种概率编程语言。
J Stat Softw. 2017;76. doi: 10.18637/jss.v076.i01. Epub 2017 Jan 11.
8
Performance of the No-U-Turn sampler in multi-trait variance component estimation using genomic data.基于基因组数据的多性状方差分量估计中无反转抽样器的性能。
Genet Sel Evol. 2022 Jul 11;54(1):51. doi: 10.1186/s12711-022-00743-5.
9
Canonical transformation for multivariate mixed model association analyses.多元混合模型关联分析的典范变换。
Theor Appl Genet. 2022 Jun;135(6):2147-2155. doi: 10.1007/s00122-022-04103-1. Epub 2022 May 10.
10
Evaluation of the phenotypic and genomic background of variability based on litter size of Large White pigs.基于大白猪窝仔数的表型和基因组背景变异性评估。
Genet Sel Evol. 2022 Jan 3;54(1):1. doi: 10.1186/s12711-021-00692-5.