An expectation-maximization algorithm for the Lasso estimation of quantitative trait locus effects.

Affiliation

Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA.

Publication information

Heredity (Edinb). 2010 Nov;105(5):483-94. doi: 10.1038/hdy.2009.180. Epub 2010 Jan 6.

DOI:10.1038/hdy.2009.180

PMID:20051978

Abstract

The least absolute shrinkage and selection operator (Lasso) estimate of regression coefficients can be expressed as the Bayesian posterior mode of those coefficients under various hierarchical modeling schemes. A Bayesian hierarchical model requires hyper prior distributions. The regression coefficients are the parameters of interest. The normal distribution assigned to each regression coefficient is a prior distribution. The variance parameter of this normal prior is in turn assigned a hyper prior distribution so that it can be estimated from the data. We developed an expectation-maximization (EM) algorithm to estimate the variance parameter of the prior distribution for each regression coefficient. Performance of the EM algorithm was evaluated through a simulation study and real data analysis. We found that the Jeffreys' hyper prior for the variance component usually performs well in generating the desired sparseness of the regression model. The EM algorithm handles not only the usual regression models but also linear models in which predictors are defined as classification variables. In the context of quantitative trait loci (QTL) mapping, this new EM algorithm can estimate both genotypic values and QTL effects expressed as linear contrasts of the genotypic values.
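The abstract gives no formulas, so the following is only a minimal sketch of the kind of EM scheme it describes, not the authors' implementation. It assumes a model y = Xβ + e with e ~ N(0, σ²I), a prior β_j ~ N(0, τ_j²), and a Jeffreys-type hyper prior p(τ_j²) ∝ 1/τ_j² on each variance. The coefficients are treated as missing data and the variances are the parameters. Under these assumptions, maximizing the expected complete-data log posterior gives τ_j² = E[β_j² | y] / 3 and σ² = E‖y − Xβ‖² / n. The function name `em_normal_jeffreys`, the starting values, and the stopping rule are all illustrative choices.

```python
import numpy as np


def em_normal_jeffreys(X, y, n_iter=500, tol=1e-10):
    """EM sketch for beta_j ~ N(0, tau2_j) with p(tau2_j) proportional to 1/tau2_j.

    The coefficients beta are treated as missing data; tau2 and sigma2 are
    the parameters updated in the M-step. Returns (mu, tau2, sigma2), where
    mu is the posterior mean of beta from the last E-step.
    """
    n, p = X.shape
    tau2 = np.ones(p)            # starting prior variances (arbitrary choice)
    sigma2 = float(np.var(y))    # starting residual variance (arbitrary choice)
    mu = np.zeros(p)
    for _ in range(n_iter):
        # E-step: posterior moments of beta given the current tau2 and sigma2.
        # Everything goes through the n x n matrix V, so tau2_j = 0 is harmless.
        K = (X * tau2) @ X.T                 # X D X', with D = diag(tau2)
        V = K + sigma2 * np.eye(n)
        Vinv_y = np.linalg.solve(V, y)
        Vinv_X = np.linalg.solve(V, X)
        mu = tau2 * (X.T @ Vinv_y)           # E[beta | y]
        post_var = tau2 - tau2**2 * np.einsum("ij,ij->j", X, Vinv_X)
        second_moment = mu**2 + np.maximum(post_var, 0.0)  # E[beta_j^2 | y]

        # M-step: the divisor 3 follows from the assumed 1/tau2 hyper prior.
        new_tau2 = second_moment / 3.0
        resid = y - X @ mu
        # E||y - X beta||^2 = ||resid||^2 + sigma2 * trace(V^{-1} K)
        trace_term = sigma2 * np.trace(np.linalg.solve(V, K))
        new_sigma2 = (resid @ resid + trace_term) / n

        converged = np.max(np.abs(new_tau2 - tau2)) < tol
        tau2, sigma2 = new_tau2, new_sigma2
        if converged:
            break
    return mu, tau2, sigma2


if __name__ == "__main__":
    # Simulated data (hypothetical): 20 markers, two of them with real effects.
    rng = np.random.default_rng(0)
    X = rng.standard_normal((100, 20))
    beta = np.zeros(20)
    beta[0], beta[5] = 2.0, -1.5
    y = X @ beta + rng.standard_normal(100)
    mu, tau2, sigma2 = em_normal_jeffreys(X, y - y.mean())
    print(np.round(mu, 3))
```

In this sketch the prior variance of a marker with little support in the data shrinks geometrically toward zero, which pulls its coefficient toward zero as well. That is one way to see how a Jeffreys-type hyper prior can produce the sparseness mentioned in the abstract.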

Similar articles

An expectation-maximization algorithm for the Lasso estimation of quantitative trait locus effects.

Heredity (Edinb). 2010 Nov;105(5):483-94. doi: 10.1038/hdy.2009.180. Epub 2010 Jan 6.

[The use of the expectation-maximization (EM) algorithm for maximum likelihood estimation of gametic frequencies of multilocus polymorphic codominant systems based on sampled population data].

Genetika. 2002 Mar;38(3):407-18.

A Fisher scoring algorithm for the weighted regression method of QTL mapping.

Heredity (Edinb). 2008 Nov;101(5):453-64. doi: 10.1038/hdy.2008.78. Epub 2008 Aug 13.

An empirical Bayes method for estimating epistatic effects of quantitative trait loci.

Biometrics. 2007 Jun;63(2):513-21. doi: 10.1111/j.1541-0420.2006.00711.x.

Improved LASSO priors for shrinkage quantitative trait loci mapping.

Theor Appl Genet. 2012 May;124(7):1315-24. doi: 10.1007/s00122-012-1789-7.

Mapping a quantitative trait locus via the EM algorithm and Bayesian classification.

Genet Epidemiol. 2000 Sep;19(2):97-126. doi: 10.1002/1098-2272(200009)19:2<97::AID-GEPI1>3.0.CO;2-9.

Derivation of the shrinkage estimates of quantitative trait locus effects.

Genetics. 2007 Oct;177(2):1255-8. doi: 10.1534/genetics.107.077487. Epub 2007 Aug 24.

Shrinkage estimation method for mapping multiple quantitative trait loci.

Yi Chuan Xue Bao. 2006 Oct;33(10):861-9. doi: 10.1016/S0379-4172(06)60120-0.

Clustering expressed genes on the basis of their association with a quantitative phenotype.

Genet Res. 2005 Dec;86(3):193-207. doi: 10.1017/S0016672305007822.

An EM algorithm for mapping quantitative resistance loci.

Heredity (Edinb). 2005 Jan;94(1):119-28. doi: 10.1038/sj.hdy.6800583.

Cited by

Fast3VmrMLM: A fast algorithm that integrates genome-wide scanning with machine learning to accelerate gene mining and breeding by design for polygenic traits in large-scale GWAS datasets.

Plant Commun. 2025 Jul 14;6(7):101385. doi: 10.1016/j.xplc.2025.101385. Epub 2025 May 22.

IIIVmrMLM.QEI: An effective tool for indirect detection of QTN-by-environment interactions in genome-wide association studies.

Comput Struct Biotechnol J. 2024 Dec 2;23:4357-4368. doi: 10.1016/j.csbj.2024.11.046. eCollection 2024 Dec.

The integration of quantile regression with 3VmrMLM identifies more QTNs and QTN-by-environment interactions using SNP- and haplotype-based markers.

Plant Commun. 2025 Mar 10;6(3):101196. doi: 10.1016/j.xplc.2024.101196. Epub 2024 Nov 23.

BLUPmrMLM: A Fast mrMLM Algorithm in Genome-wide Association Studies.

Genomics Proteomics Bioinformatics. 2024 Sep 13;22(3). doi: 10.1093/gpbjnl/qzae020.

Privacy-preserving biological age prediction over federated human methylation data using fully homomorphic encryption.

Genome Res. 2024 Oct 11;34(9):1324-1333. doi: 10.1101/gr.279071.124.

Pleiotropic genetic association analysis with multiple phenotypes using multivariate response best-subset selection.

BMC Genomics. 2023 Dec 11;24(1):759. doi: 10.1186/s12864-023-09820-5.

Improving power of genome-wide association studies via transforming ordinal phenotypes into continuous phenotypes.

Front Plant Sci. 2023 Nov 2;14:1247181. doi: 10.3389/fpls.2023.1247181. eCollection 2023.

A multi-locus linear mixed model methodology for detecting small-effect QTLs for quantitative traits in MAGIC, NAM, and ROAM populations.

Comput Struct Biotechnol J. 2023 Mar 15;21:2241-2252. doi: 10.1016/j.csbj.2023.03.022. eCollection 2023.

Dissecting Complex Traits Using Omics Data: A Review on the Linear Mixed Models and Their Application in GWAS.

Plants (Basel). 2022 Nov 28;11(23):3277. doi: 10.3390/plants11233277.

Genetic Dissection of Epistatic Interactions Contributing Yield-Related Agronomic Traits in Rice Using the Compressed Mixed Model.

Plants (Basel). 2022 Sep 26;11(19):2504. doi: 10.3390/plants11192504.
