• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大气浓度数据的带乘性对数正态误差项的多元线性回归模型。

A multiple linear regression model with multiplicative log-normal error term for atmospheric concentration data.

机构信息

Department of Chemistry, Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China.

Texas A&M Transportation Institute, Texas A&M University, College Station, TX 77843-3135, United States.

出版信息

Sci Total Environ. 2021 May 1;767:144282. doi: 10.1016/j.scitotenv.2020.144282. Epub 2020 Dec 24.

DOI:10.1016/j.scitotenv.2020.144282
PMID:33422960
Abstract

The homoscedasticity assumption (the variance of the error term is the same across all the observations) is a key assumption in the ordinary linear squares (OLS) solution of a linear regression model. The validity of this assumption is examined for a multiple linear regression model used to determine the source contributions to the observed black carbon concentrations at 12 background monitoring sites across China using a hybrid modeling approach. Residual analysis from the traditional OLS method, which assumes that the error term is additive and normally distributed with a mean of zero, shows pronounced heteroscedasticity based on the Breusch-Pagan test for 11 datasets. Noticing that the atmospheric black carbon data are log-normally distributed, we make a new assumption that the error terms are multiplicative and log-normally distributed. When the coefficients of the multilinear regression model are determined using the maximum likelihood estimation (MLE), the distribution of the residuals in 8 out of the 12 datasets is in good accordance with the revised assumption. Furthermore, the MLE computation under this novel assumption could be proved mathematically identical to minimizing a log-scale objective function, which considerably reduces the complexity in the MLE calculation. The new method is further demonstrated to have clear advantages in numerical simulation experiments of a 5-variable multiple linear regression model using synthesized data with prescribed coefficients and lognormally distributed multiplicative errors. Under all 9 simulation scenarios, the new method yields the most accurate estimations of the regression coefficients and has significantly higher coverage probability (on average, 95% for all five coefficients) than OLS (79%) and weighted least squares (WLS, 72%) methods.

摘要

同方差假设(误差项的方差在所有观测值中相同)是线性回归模型中普通最小二乘法(OLS)解的关键假设。使用混合建模方法来确定中国 12 个背景监测站点观测到的黑碳浓度的来源贡献,对于所使用的多元线性回归模型,通过传统的 OLS 方法(假设误差项是加性的且正态分布,均值为零)进行残差分析,根据 Breusch-Pagan 检验,对于 11 个数据集,发现存在明显的异方差。注意到大气黑碳数据呈对数正态分布,我们做出新的假设,即误差项是乘性的且对数正态分布。使用最大似然估计(MLE)确定多元线性回归模型的系数时,在 12 个数据集的 8 个数据集中,残差的分布与修正后的假设非常吻合。此外,在这种新假设下进行的 MLE 计算可以被证明在数学上等同于最小化对数尺度的目标函数,这大大降低了 MLE 计算的复杂性。新方法在使用规定系数和对数正态分布乘性误差的合成数据进行的 5 变量多元线性回归模型的数值模拟实验中,进一步显示出明显的优势。在所有 9 个模拟场景下,新方法对回归系数的估计最为准确,且覆盖概率显著高于 OLS(79%)和加权最小二乘法(WLS,72%)方法(平均而言,所有五个系数的覆盖概率均为 95%)。

相似文献

1
A multiple linear regression model with multiplicative log-normal error term for atmospheric concentration data.大气浓度数据的带乘性对数正态误差项的多元线性回归模型。
Sci Total Environ. 2021 May 1;767:144282. doi: 10.1016/j.scitotenv.2020.144282. Epub 2020 Dec 24.
2
A comparison of methods to handle skew distributed cost variables in the analysis of the resource consumption in schizophrenia treatment.精神分裂症治疗资源消耗分析中处理偏态分布成本变量方法的比较。
J Ment Health Policy Econ. 2002 Mar;5(1):21-31.
3
Pharmacokinetic parameter estimations by minimum relative entropy method.基于最小相对熵法的药代动力学参数估计
J Pharmacokinet Biopharm. 1995 Oct;23(5):479-94. doi: 10.1007/BF02353470.
4
Pharmacokinetic parameter estimates from several least squares procedures: superiority of extended least squares.几种最小二乘法的药代动力学参数估计:扩展最小二乘法的优越性
J Pharmacokinet Biopharm. 1985 Apr;13(2):185-201. doi: 10.1007/BF01059398.
5
Statistical methods for estimating doubling time in in vitro cell growth.体外细胞生长中倍增时间估计的统计方法。
In Vitro Cell Dev Biol Anim. 1997 Apr;33(4):289-93. doi: 10.1007/s11626-997-0049-7.
6
Weighted Linear Regression Improves Accuracy of Quantitative Elemental Bioimaging by Means of LA-ICP-MS.加权线性回归通过激光烧蚀电感耦合等离子体质谱法提高了定量元素生物成像的准确性。
Anal Chem. 2021 Nov 30;93(47):15720-15727. doi: 10.1021/acs.analchem.1c03630. Epub 2021 Nov 16.
7
A strategy for residual error modeling incorporating scedasticity of variance and distribution shape.一种结合方差的异方差性和分布形状的残差误差建模策略。
J Pharmacokinet Pharmacodyn. 2016 Apr;43(2):137-51. doi: 10.1007/s10928-015-9460-y. Epub 2015 Dec 17.
8
Empirical likelihood estimation for linear regression models with AR(p) error terms with numerical examples.具有AR(p)误差项的线性回归模型的经验似然估计及数值示例
J Appl Stat. 2021 Mar 13;49(9):2271-2286. doi: 10.1080/02664763.2021.1899142. eCollection 2022.
9
A comparison of two-stage procedures for testing least-squares coefficients under heteroscedasticity.两种在异方差下检验最小二乘系数的两阶段程序比较。
Br J Math Stat Psychol. 2011 May;64(Pt 2):244-58. doi: 10.1348/000711010X508683.
10
Global and Geographically and Temporally Weighted Regression Models for Modeling PM in Heilongjiang, China from 2015 to 2018.中国黑龙江省 2015-2018 年 PM 时空地理加权回归模型研究
Int J Environ Res Public Health. 2019 Dec 14;16(24):5107. doi: 10.3390/ijerph16245107.

引用本文的文献

1
Aerosol Uptake Coefficients of Isoprene Epoxides: Determination and Parameter Estimation from Online Field Measurements of Organic Molecular Markers.异戊二烯环氧化物的气溶胶吸收系数:基于有机分子标记物在线实地测量的测定与参数估计
Environ Sci Technol. 2025 Sep 2;59(34):18249-18258. doi: 10.1021/acs.est.5c09046. Epub 2025 Aug 21.
2
A new approach using the genetic algorithm for parameter estimation in multiple linear regression with long-tailed symmetric distributed error terms: An application to the Covid-19 data.一种使用遗传算法在具有长尾对称分布误差项的多元线性回归中进行参数估计的新方法:在新冠疫情数据中的应用。
Chemometr Intell Lab Syst. 2021 Sep 15;216:104372. doi: 10.1016/j.chemolab.2021.104372. Epub 2021 Jun 29.
3
Mechanisms for log normal concentration distributions in the environment.
环境中对数正态浓度分布的机制。
Sci Rep. 2021 Aug 12;11(1):16418. doi: 10.1038/s41598-021-96010-6.