在误差服从广义高斯 - 拉普拉斯分布的假设下，通过最大化似然函数进行多元线性回归。

Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error.

作者信息

Jäntschi Lorentz, Bálint Donatella, Bolboacă Sorana D

机构信息

Department of Physics and Chemistry, Faculty of Materials and Environmental Engineering, Technical University of Cluj-Napoca, Muncii Boulevard No. 103-105, 400641 Cluj-Napoca, Romania; Doctoral School of Chemistry, Institute for Doctoral Studies, Babeş-Bolyai University, Kogălniceanu Street No. 1, 400084 Cluj-Napoca, Romania; Department of Chemistry, Faculty of Science, University of Oradea, Universităţii Street No. 1, 410087 Oradea, Romania.

Doctoral School of Chemistry, Institute for Doctoral Studies, Babeş-Bolyai University, Kogălniceanu Street No. 1, 400084 Cluj-Napoca, Romania.

出版信息

Comput Math Methods Med. 2016;2016:8578156. doi: 10.1155/2016/8578156. Epub 2016 Dec 7.

DOI:10.1155/2016/8578156

PMID:28090215

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5174750/

Abstract

Multiple linear regression analysis is widely used to link an outcome with predictors for better understanding of the behaviour of the outcome of interest. Usually, under the assumption that the errors follow a normal distribution, the coefficients of the model are estimated by minimizing the sum of squared deviations. A new approach based on maximum likelihood estimation is proposed for finding the coefficients on linear models with two predictors without any constrictive assumptions on the distribution of the errors. The algorithm was developed, implemented, and tested as proof-of-concept using fourteen sets of compounds by investigating the link between activity/property (as outcome) and structural feature information incorporated by molecular descriptors (as predictors). The results on real data demonstrated that in all investigated cases the power of the error is significantly different by the convenient value of two when the Gauss-Laplace distribution was used to relax the constrictive assumption of the normal distribution of the error. Therefore, the Gauss-Laplace distribution of the error could not be rejected while the hypothesis that the power of the error from Gauss-Laplace distribution is normal distributed also failed to be rejected.

摘要

多元线性回归分析被广泛用于将一个结果与预测变量联系起来，以便更好地理解感兴趣结果的行为。通常，在误差服从正态分布的假设下，通过最小化平方偏差之和来估计模型的系数。本文提出了一种基于最大似然估计的新方法，用于在对误差分布没有任何约束性假设的情况下，找到具有两个预测变量的线性模型的系数。通过研究活性/性质（作为结果）与分子描述符纳入的结构特征信息（作为预测变量）之间的联系，开发、实现并测试了该算法，作为概念验证使用了十四组化合物。实际数据结果表明，在所有研究案例中，当使用高斯 - 拉普拉斯分布来放宽误差正态分布的约束性假设时，误差的幂与方便值二有显著差异。因此，误差的高斯 - 拉普拉斯分布不能被拒绝，而来自高斯 - 拉普拉斯分布的误差幂呈正态分布的假设也未能被拒绝。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77fa/5174750/819ba11dac6b/CMMM2016-8578156.001.jpg

相似文献

Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error.在误差服从广义高斯 - 拉普拉斯分布的假设下，通过最大化似然函数进行多元线性回归。

Comput Math Methods Med. 2016;2016:8578156. doi: 10.1155/2016/8578156. Epub 2016 Dec 7.

Semiparametric maximum likelihood for measurement error model regression.测量误差模型回归的半参数极大似然估计

Biometrics. 2001 Mar;57(1):53-61. doi: 10.1111/j.0006-341x.2001.00053.x.

A family of linear mixed-effects models using the generalized Laplace distribution.使用广义拉普拉斯分布的线性混合效应模型族。

Stat Methods Med Res. 2020 Sep;29(9):2665-2682. doi: 10.1177/0962280220903763. Epub 2020 Mar 11.

Linear mixed function-on-function regression models.线性混合函数对函数回归模型

Biometrics. 2014 Dec;70(4):794-801. doi: 10.1111/biom.12207. Epub 2014 Jun 26.

Laplace approximation, penalized quasi-likelihood, and adaptive Gauss-Hermite quadrature for generalized linear mixed models: towards meta-analysis of binary outcome with sparse data.拉普拉斯逼近、惩罚拟似然和广义线性混合模型的自适应高斯-埃尔米特求积：用于稀疏数据二分类结局的荟萃分析。

BMC Med Res Methodol. 2020 Jun 11;20(1):152. doi: 10.1186/s12874-020-01035-6.

Secure analysis of distributed chemical databases without data integration.无需数据整合的分布式化学数据库安全分析

J Comput Aided Mol Des. 2005 Sep-Oct;19(9-10):739-47. doi: 10.1007/s10822-005-9011-5. Epub 2005 Nov 3.

Maximum likelihood orientation estimation of 1-D patterns in Laguerre-Gauss subspaces.拉盖尔-高斯子空间中一维模式的最大似然方向估计。

IEEE Trans Image Process. 2010 May;19(5):1113-25. doi: 10.1109/TIP.2010.2041395. Epub 2010 Jan 26.

Inside of the Linear Relation between Dependent and Independent Variables.因变量与自变量之间的线性关系内部。

Comput Math Methods Med. 2015;2015:360752. doi: 10.1155/2015/360752. Epub 2015 May 25.

A better lemon squeezer? Maximum-likelihood regression with beta-distributed dependent variables.更好的柠檬榨汁器？具有贝塔分布因变量的最大似然回归。

Psychol Methods. 2006 Mar;11(1):54-71. doi: 10.1037/1082-989X.11.1.54.

Laplace approximation in measurement error models.测量误差模型中的拉普拉斯近似

Biom J. 2011 May;53(3):411-25. doi: 10.1002/bimj.201000095.

引用本文的文献

Casorati Inequalities for Statistical Submanifolds in Kenmotsu Statistical Manifolds of Constant -Sectional Curvature with Semi-Symmetric Metric Connection.具有半对称度量联络的常截面曲率Kenmotsu统计流形中统计子流形的Casorati不等式

Entropy (Basel). 2022 Jun 8;24(6):800. doi: 10.3390/e24060800.

Medical Diagnostic Tests: A Review of Test Anatomy, Phases, and Statistical Treatment of Data.医学诊断测试：测试解剖学、阶段和数据统计处理的回顾。

Comput Math Methods Med. 2019 May 28;2019:1891569. doi: 10.1155/2019/1891569. eCollection 2019.

本文引用的文献

Inside of the Linear Relation between Dependent and Independent Variables.因变量与自变量之间的线性关系内部。

Comput Math Methods Med. 2015;2015:360752. doi: 10.1155/2015/360752. Epub 2015 May 25.

QSAR models for thiophene and imidazopyridine derivatives inhibitors of the Polo-Like Kinase 1.针对Polo样激酶1的噻吩和咪唑并吡啶衍生物抑制剂的定量构效关系模型

Eur J Pharm Sci. 2014 Oct 1;62:171-9. doi: 10.1016/j.ejps.2014.05.029. Epub 2014 Jun 6.

Daphnia and fish toxicity of (benzo)triazoles: validated QSAR models, and interspecies quantitative activity-activity modelling.（苯并）三唑类对水蚤和鱼类的毒性：经过验证的定量构效关系模型，以及种间定量活性-活性模型。

J Hazard Mater. 2013 Aug 15;258-259:50-60. doi: 10.1016/j.jhazmat.2013.04.025. Epub 2013 Apr 25.

The 3D-QSAR study of 110 diverse, dual binding, acetylcholinesterase inhibitors based on alignment independent descriptors (GRIND-2). The effects of conformation on predictive power and interpretability of the models.基于无对齐描述符（GRIND-2）的 110 种多样的、双重结合的乙酰胆碱酯酶抑制剂的 3D-QSAR 研究。构象对模型预测能力和可解释性的影响。

J Mol Graph Model. 2012 Sep;38:194-210. doi: 10.1016/j.jmgm.2012.08.001. Epub 2012 Sep 1.

QSPR/QSAR models for prediction of the physico-chemical properties and biological activity of polychlorinated diphenyl ethers (PCDEs).多氯二苯醚（PCDEs）物理化学性质和生物活性的定量构效关系（QSPR/QSAR）模型预测。

Chemosphere. 2010 Jul;80(6):665-70. doi: 10.1016/j.chemosphere.2010.04.050. Epub 2010 May 21.

The importance of molecular structures, endpoints' values, and predictivity parameters in QSAR research: QSAR analysis of a series of estrogen receptor binders.分子结构、终点值和预测性参数在定量构效关系研究中的重要性：一系列雌激素受体配体的定量构效关系分析。

Mol Divers. 2010 Nov;14(4):687-96. doi: 10.1007/s11030-009-9212-2. Epub 2009 Nov 17.

Comparison of quantitative structure-activity relationship model performances on carboquinone derivatives.卡波醌衍生物定量构效关系模型性能的比较

ScientificWorldJournal. 2009 Oct 14;9:1148-66. doi: 10.1100/tsw.2009.131.

Comparative QSAR study on para-substituted aromatic sulphonamides as CAII inhibitors: information versus topological (distance-based and connectivity) indices.作为碳酸酐酶II抑制剂的对位取代芳基磺酰胺的比较定量构效关系研究：信息指数与拓扑（基于距离和连接性）指数

Chem Biol Drug Des. 2008 Mar;71(3):244-59. doi: 10.1111/j.1747-0285.2007.00625.x. Epub 2008 Jan 19.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

在误差服从广义高斯 - 拉普拉斯分布的假设下，通过最大化似然函数进行多元线性回归。

Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献