• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过留一法偏最小二乘回归(PLSR)分析遗传标记与表型的关系。

Analysis of genetic marker-phenotype relationships by jack-knifed partial least squares regression (PLSR).

作者信息

Bjørnstad Asmund, Westad Frank, Martens Harald

机构信息

Department of Plant and Environmental Sciences, Agricultural University of Norway, As, Norway.

出版信息

Hereditas. 2004;141(2):149-65. doi: 10.1111/j.1601-5223.2004.01816.x.

DOI:10.1111/j.1601-5223.2004.01816.x
PMID:15660976
Abstract

The utility of a relatively new multivariate method, bi-linear modelling by cross-validated partial least squares regression (PLSR), was investigated in the analysis of QTL. The distinguishing feature of PLSR is to reveal reliable covariance structures in data of different types with regard to the same set objects. Two matrices X (here: genetic markers) and Y (here: phenotypes) are interactively decomposed into latent variables (PLS components, or PCs) in a way which facilitates statistically reliable and graphically interpretable model building. Natural collinearities between input variables are utilized actively to stabilise the modelling, instead of being treated as a statistical problem. The importance of cross-validation/jack-knifing as an intuitively appealing way to avoid overfitting, is emphasized. Two datasets from chromosomal mapping studies of different complexity were chosen for illustration (QTL for tomato yield and for oat heading date). Results from PLSR analysis were compared to published results and to results using the package PLABQTL in these data sets. In all cases PLSR gave at least similar explained validation variances as the reported studies. An attractive feature is that PLSR allows the analysis of several traits/replicates in one analysis, and the direct visual identification of individuals with desirable marker genotypes. It is suggested that PLSR may be useful in structural and functional genomics and in marker assisted selection, particularly in cases with limited number of objects.

摘要

在数量性状基因座(QTL)分析中,研究了一种相对较新的多变量方法——通过交叉验证偏最小二乘回归(PLSR)进行双线性建模的效用。PLSR的显著特点是,对于同一组对象,能在不同类型的数据中揭示可靠的协方差结构。两个矩阵X(此处:遗传标记)和Y(此处:表型)以一种有助于构建统计可靠且可图形解释模型的方式交互分解为潜在变量(PLS成分或主成分)。积极利用输入变量之间的自然共线性来稳定建模,而非将其视为一个统计问题。强调了交叉验证/留一法作为一种直观且有吸引力的避免过拟合方法的重要性。选择了两个来自不同复杂程度染色体定位研究的数据集用于说明(番茄产量和燕麦抽穗期的QTL)。将PLSR分析的结果与已发表的结果以及在这些数据集中使用PLABQTL软件包得到的结果进行了比较。在所有情况下,PLSR给出的验证方差解释至少与已报道的研究相似。一个吸引人的特点是,PLSR允许在一次分析中分析多个性状/重复,并直接直观地识别具有理想标记基因型的个体。建议PLSR在结构和功能基因组学以及标记辅助选择中可能有用,特别是在对象数量有限的情况下。

相似文献

1
Analysis of genetic marker-phenotype relationships by jack-knifed partial least squares regression (PLSR).通过留一法偏最小二乘回归(PLSR)分析遗传标记与表型的关系。
Hereditas. 2004;141(2):149-65. doi: 10.1111/j.1601-5223.2004.01816.x.
2
Resistance to barley scald (Rhynchosporium secalis) in the Ethiopian donor lines 'Steudelli' and 'Jet', analyzed by partial least squares regression and interval mapping.
Hereditas. 2004;141(2):166-79. doi: 10.1111/j.1601-5223.2004.01817.x.
3
Hierarchical cluster-based partial least squares regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models.基于层次聚类的偏最小二乘回归(HC-PLSR)是一种用于非线性动态模型元建模的有效工具。
BMC Syst Biol. 2011 Jun 1;5:90. doi: 10.1186/1752-0509-5-90.
4
Bayesian QTL mapping using genome-wide SSR markers and segregating population derived from a cross of two commercial F hybrids of tomato.基于全基因组 SSR 标记和两个商业番茄杂交 F1 代分离群体的贝叶斯 QTL 作图
Theor Appl Genet. 2017 Aug;130(8):1601-1616. doi: 10.1007/s00122-017-2913-5. Epub 2017 May 5.
5
A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers.比较五种方法从全基因组 SNP 标记预测奶牛公牛的基因组育种值。
Genet Sel Evol. 2009 Dec 31;41(1):56. doi: 10.1186/1297-9686-41-56.
6
New Method Application for Marker-Trait Association Studies in Plants: Partial Least Square Regression Aids Detection of Simultaneous Correlations.植物中标记-性状关联研究的新方法应用:偏最小二乘回归有助于同时相关性的检测
Agric Res Technol. 2017 Dec;12(5). doi: 10.19080/ARTOAJ.2017.12.555864. Epub 2017 Dec 15.
7
Genomic selection using principal component regression.基于主成分回归的基因组选择。
Heredity (Edinb). 2018 Jul;121(1):12-23. doi: 10.1038/s41437-018-0078-x. Epub 2018 May 1.
8
A best-practice guide to predicting plant traits from leaf-level hyperspectral data using partial least squares regression.使用偏最小二乘回归从叶片高光谱数据预测植物性状的最佳实践指南。
J Exp Bot. 2021 Sep 30;72(18):6175-6189. doi: 10.1093/jxb/erab295.
9
Partial least squares methods: partial least squares correlation and partial least square regression.偏最小二乘法:偏最小二乘相关分析和偏最小二乘回归分析。
Methods Mol Biol. 2013;930:549-79. doi: 10.1007/978-1-62703-059-5_23.
10
QTL mapping in eggplant reveals clusters of yield-related loci and orthology with the tomato genome.茄子的QTL定位揭示了产量相关基因座的簇以及与番茄基因组的直系同源性。
PLoS One. 2014 Feb 21;9(2):e89499. doi: 10.1371/journal.pone.0089499. eCollection 2014.

引用本文的文献

1
Relating multivariate shapes to genescapes using phenotype-biological process associations for craniofacial shape.使用表型-生物过程关联将多变量形状与基因景观相关联,以研究颅面形状。
Elife. 2021 Nov 15;10:e68623. doi: 10.7554/eLife.68623.
2
Identifying Key Somatic Copy Number Alterations Driving Dysregulation of Cancer Hallmarks in Lower-Grade Glioma.识别驱动低级别胶质瘤中癌症特征失调的关键体细胞拷贝数改变
Front Genet. 2021 Jun 7;12:654736. doi: 10.3389/fgene.2021.654736. eCollection 2021.
3
Identifying Cancer Driver lncRNAs Bridged by Functional Effectors through Integrating Multi-omics Data in Human Cancers.
通过整合人类癌症中的多组学数据鉴定由功能效应物桥接的癌症驱动长链非编码RNA
Mol Ther Nucleic Acids. 2019 Sep 6;17:362-373. doi: 10.1016/j.omtn.2019.05.030. Epub 2019 Jun 13.
4
Multivariate Analysis of Genotype-Phenotype Association.基因型-表型关联的多变量分析
Genetics. 2016 Apr;202(4):1345-63. doi: 10.1534/genetics.115.181339. Epub 2016 Feb 19.
5
Identifying core gene modules in glioblastoma based on multilayer factor-mediated dysfunctional regulatory networks through integrating multi-dimensional genomic data.通过整合多维度基因组数据,基于多层因子介导的功能失调调控网络识别胶质母细胞瘤中的核心基因模块。
Nucleic Acids Res. 2015 Feb 27;43(4):1997-2007. doi: 10.1093/nar/gkv074. Epub 2015 Feb 4.
6
Diversity of North European oat analyzed by SSR, AFLP and DArT markers.北欧燕麦的 SSR、AFLP 和 DArT 标记分析多样性。
Theor Appl Genet. 2012 Jun;125(1):57-70. doi: 10.1007/s00122-012-1816-8. Epub 2012 Feb 18.
7
Partial least square regression applied to the QTLMAS 2010 dataset.
BMC Proc. 2011 May 27;5 Suppl 3(Suppl 3):S7. doi: 10.1186/1753-6561-5-S3-S7.
8
Comparison of analyses of the QTLMAS XIV common dataset. I: genomic selection.QTLMAS XIV通用数据集分析的比较。I:基因组选择。
BMC Proc. 2011 May 27;5 Suppl 3(Suppl 3):S1. doi: 10.1186/1753-6561-5-S3-S1.
9
Genomic value prediction for quantitative traits under the epistatic model.基于上位性模型的数量性状基因组值预测。
BMC Genet. 2011 Jan 26;12:15. doi: 10.1186/1471-2156-12-15.
10
A feeding induced switch from a variable to a homogenous state of the earthworm gut microbiota within a host population.在宿主种群中,摄食诱导蚯蚓肠道微生物群从可变状态转变为同质状态。
PLoS One. 2009 Oct 20;4(10):e7528. doi: 10.1371/journal.pone.0007528.