• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自变量的对数转换:我们必须这样做吗?

Log-transformation of Independent Variables: Must We?

机构信息

From the Department of Environmental Health and Engineering, Bloomberg School of Public Health, Johns Hopkins, Baltimore, MD.

Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC.

出版信息

Epidemiology. 2022 Nov 1;33(6):843-853. doi: 10.1097/EDE.0000000000001534. Epub 2022 Oct 5.

DOI:10.1097/EDE.0000000000001534
PMID:36220581
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9574910/
Abstract

Epidemiologic studies often quantify exposure using biomarkers, which commonly have statistically skewed distributions. Although normality assumption is not required if the biomarker is used as an independent variable in linear regression, it has become common practice to log-transform the biomarker concentrations. This transformation can be motivated by concerns for nonlinear dose-response relationship or outliers; however, such transformation may not always reduce bias. In this study, we evaluated the validity of motivations underlying the decision to log-transform an independent variable using simulations, considering eight scenarios that can give rise to skewed X and normal Y. Our simulation study demonstrates that (1) if the skewness of exposure did not arise from a biasing factor (e.g., measurement error), the analytic approach with the best overall model fit best reflected the underlying outcome generating methods and was least biased, regardless of the skewness of X and (2) all estimates were biased if the skewness of exposure was a consequence of a biasing factor. We additionally illustrate a process to determine whether the transformation of an independent variable is needed using NHANES. Our study and suggestion to divorce the shape of the exposure distribution from the decision to log-transform it may aid researchers in planning for analysis using biomarkers or other skewed independent variables.

摘要

流行病学研究通常使用生物标志物来量化暴露,而生物标志物的分布通常存在统计学上的偏态。虽然在线性回归中,生物标志物作为自变量时不需要正态性假设,但将生物标志物浓度进行对数转换已成为一种常见做法。这种转换可能是出于对非线性剂量反应关系或异常值的担忧;然而,这种转换并不总是能减少偏差。在这项研究中,我们通过模拟评估了使用生物标志物作为自变量时进行对数转换的决策背后的动机的有效性,考虑了八种可能导致 X 偏态和 Y 正态的情况。我们的模拟研究表明:(1)如果暴露的偏态不是由偏倚因素(例如测量误差)引起的,那么具有最佳整体模型拟合度的分析方法最能反映潜在的结果生成方法,且偏差最小,而与 X 的偏态无关;(2)如果暴露的偏态是由偏倚因素引起的,那么所有的估计值都是有偏差的。我们还通过 NHANES 说明了一种确定是否需要转换自变量的过程。我们的研究和建议将暴露分布的形状与对数转换的决策分开,这可能有助于研究人员计划使用生物标志物或其他偏态自变量进行分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e3a/9574910/7980c4f8f91d/nihms-1825980-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e3a/9574910/1c6f51090006/nihms-1825980-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e3a/9574910/d7733922067a/nihms-1825980-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e3a/9574910/7980c4f8f91d/nihms-1825980-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e3a/9574910/1c6f51090006/nihms-1825980-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e3a/9574910/d7733922067a/nihms-1825980-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9e3a/9574910/7980c4f8f91d/nihms-1825980-f0003.jpg

相似文献

1
Log-transformation of Independent Variables: Must We?自变量的对数转换:我们必须这样做吗?
Epidemiology. 2022 Nov 1;33(6):843-853. doi: 10.1097/EDE.0000000000001534. Epub 2022 Oct 5.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
A comparison of methods to handle skew distributed cost variables in the analysis of the resource consumption in schizophrenia treatment.精神分裂症治疗资源消耗分析中处理偏态分布成本变量方法的比较。
J Ment Health Policy Econ. 2002 Mar;5(1):21-31.
4
Re-expressing coefficients from regression models for inclusion in a meta-analysis.重新表达回归模型中的系数,以纳入荟萃分析。
BMC Med Res Methodol. 2024 Jan 8;24(1):6. doi: 10.1186/s12874-023-02132-y.
5
Standardizing effect size from linear regression models with log-transformed variables for meta-analysis.对用于荟萃分析的具有对数转换变量的线性回归模型的效应量进行标准化。
BMC Med Res Methodol. 2017 Mar 17;17(1):44. doi: 10.1186/s12874-017-0322-8.
6
Tests of trend between disease outcomes and ordinal covariates discretized from underlying continuous variables: simulation studies and applications to NHANES 2007-2008.疾病结局与潜在连续变量离散化的有序协变量之间趋势的检验:模拟研究及 NHANES 2007-2008 的应用
BMC Med Res Methodol. 2019 Jan 5;19(1):2. doi: 10.1186/s12874-018-0630-7.
7
Is the assumption of normality or log-normality for continuous response data critical for benchmark dose estimation?对于连续反应数据,假设正态性或对数正态性对于基准剂量估计是否至关重要?
Toxicol Appl Pharmacol. 2013 Nov 1;272(3):767-79. doi: 10.1016/j.taap.2013.08.006. Epub 2013 Aug 15.
8
Accounting for measurement error in log regression models with applications to accelerated testing.在对数回归模型中考虑测量误差及其在加速测试中的应用。
PLoS One. 2018 May 30;13(5):e0197222. doi: 10.1371/journal.pone.0197222. eCollection 2018.
9
Checking distributional assumptions for pharmacokinetic summary statistics based on simulations with compartmental models.基于房室模型模拟检验药代动力学汇总统计量的分布假设。
J Biopharm Stat. 2017;27(5):756-772. doi: 10.1080/10543406.2016.1222535. Epub 2016 Sep 26.
10
Quantile regression for exposure data with repeated measures in the presence of non-detects.存在未检出值时,重复测量暴露数据的分位数回归。
J Expo Sci Environ Epidemiol. 2021 Nov;31(6):1057-1066. doi: 10.1038/s41370-021-00345-1. Epub 2021 Jun 9.

引用本文的文献

1
Effect of aspirin use on conversion risk from mild cognitive impairment to Alzheimer's disease.阿司匹林使用对轻度认知障碍转化为阿尔茨海默病风险的影响。
Front Aging Neurosci. 2025 Aug 6;17:1603892. doi: 10.3389/fnagi.2025.1603892. eCollection 2025.
2
Paternal and maternal exposures to per- and polyfluoroalkyl substances (PFAS) and birth outcomes: a multi-country cohort study.父母接触全氟和多氟烷基物质(PFAS)与出生结局:一项多国队列研究。
Environ Health. 2025 Jul 12;24(1):48. doi: 10.1186/s12940-025-01199-y.
3
Association of Per- and Polyfluoroalkyl Substance Exposure With Cataract Prevalence Among U.S. Adults: A NHANES Analysis (2005-2008).
美国成年人中全氟和多氟烷基物质暴露与白内障患病率的关联:一项美国国家健康与营养检查调查(2005 - 2008年)分析
Transl Vis Sci Technol. 2025 Apr 1;14(4):1. doi: 10.1167/tvst.14.4.1.
4
A Perspective from US Environmental Protection Agency (EPA) Scientists: How Your Epidemiologic Analyses Can Inform the Human Health Risk Assessment Process.美国环境保护局(EPA)科学家的观点:您的流行病学分析如何为人类健康风险评估过程提供信息。
Environ Health Perspect. 2025 Apr;133(3-4):45001. doi: 10.1289/EHP15203. Epub 2025 Apr 11.
5
Association between serum cotinine concentrations on red blood cell folate concentrations in pregnant women and the mediating role of lymphocytes: an NHANES Study.孕妇血清可替宁浓度与红细胞叶酸浓度之间的关联及淋巴细胞的中介作用:一项美国国家健康与营养检查调查研究
Arch Public Health. 2025 Feb 21;83(1):49. doi: 10.1186/s13690-025-01533-3.
6
Associations of PFAS concentrations during pregnancy and midlife with bone health in midlife: Cross-sectional and prospective findings from Project Viva.孕期和中年期全氟烷基和多氟烷基物质(PFAS)浓度与中年期骨骼健康的关联:生命项目的横断面和前瞻性研究结果
Environ Int. 2024 Dec;194:109177. doi: 10.1016/j.envint.2024.109177. Epub 2024 Dec 1.
7
Prenatal organophosphate pesticide exposure and sex-specific estimated fetal size.产前有机磷农药暴露与性别特异性估计胎儿大小
Am J Epidemiol. 2025 Apr 8;194(4):954-962. doi: 10.1093/aje/kwae265.
8
Geographic and dietary differences of urinary uranium levels in the Strong Heart Family Study.强心脏家族研究中尿铀水平的地理和饮食差异。
J Expo Sci Environ Epidemiol. 2025 May;35(3):393-402. doi: 10.1038/s41370-024-00695-6. Epub 2024 Jul 3.
9
Depicting deterministic variables within directed acyclic graphs: an aid for identifying and interpreting causal effects involving derived variables and compositional data.在有向无环图中描述确定性变量:有助于识别和解释涉及派生变量和组合数据的因果效应。
Am J Epidemiol. 2025 Feb 5;194(2):469-479. doi: 10.1093/aje/kwae153.
10
Residential proximity to toxic metal-emitting industrial sites and toenail metal concentrations in a United States-wide prospective cohort.美国一项前瞻性队列研究中,居住地与有毒金属排放工业场所的距离及 toenail 金属浓度。 (注:这里“toenail”直译为“脚趾甲”,结合语境可能是指与脚趾甲中金属浓度相关研究,但表述稍显奇怪,推测原文可能有更准确的关于身体某部位金属浓度的专业术语表述,不过按照要求准确翻译为上述内容)
Environ Res. 2024 Oct 1;258:119466. doi: 10.1016/j.envres.2024.119466. Epub 2024 Jun 20.