Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association, Molecular Epidemiology Research Group, Berlin, Germany.
Charité - Universitätsmedizin Berlin, Institute of Biometry and Clinical Epidemiology, Berlin, Germany.
Ann Epidemiol. 2022 Oct;74:84-96. doi: 10.1016/j.annepidem.2022.07.012. Epub 2022 Aug 5.
To investigate (1) the bias in effect estimation due to heaping or digit preference, (2) the association between age at hypertension diagnosis and risk of cardiovascular comorbidities, and (3) the influence of heaping on risk estimates.
We performed a simulation study with various scenarios, binary outcome, and normal or lognormal distributed covariables. We calculated mean logistic coefficients under the original and heaped data and their relative deviation. The association of age at hypertension diagnosis and risk of ≥1 cardiovascular comorbidity was investigated using logistic regression among 50,858 participants in the NAKO Gesundheitsstudie (German National Cohort) who reported such diagnosis. We assessed the proportion of heaped observations and to what extent heaping may have influenced risk estimates.
Based on the simulation study and assuming 50% of observations in the variable of interest to be heaped, relative bias was <6%. In NAKO, a 5-year younger age at hypertension diagnosis was associated with a 15% increased risk of having ≥1 cardiovascular comorbidity. Observed heaping in age at hypertension diagnosis was 12.6%, and bias of the risk estimate was 0.14%.
Bias in effect estimation due to heaping is low in most common scenarios. Younger age at hypertension diagnosis is associated with a higher risk of cardiovascular comorbidities.
(1)研究由于堆积或数字偏好导致的效应估计偏差,(2)高血压诊断时年龄与心血管合并症风险之间的关系,以及(3)堆积对风险估计的影响。
我们进行了一项模拟研究,涉及各种情况、二项结果和正态或对数正态分布的协变量。我们在原始数据和堆积数据下计算了平均逻辑系数及其相对偏差。在 50858 名参加 NAKO Gesundheitsstudie(德国国家队列研究)的患者中,报告了这种诊断,我们使用逻辑回归研究了高血压诊断时的年龄与≥1 种心血管合并症风险之间的关联。我们评估了堆积观察的比例以及堆积可能在多大程度上影响风险估计。
基于模拟研究并假设感兴趣变量中 50%的观察值堆积,相对偏差 <6%。在 NAKO 中,高血压诊断时年龄每年轻 5 岁,患≥1 种心血管合并症的风险增加 15%。高血压诊断时年龄的观察堆积为 12.6%,风险估计的偏差为 0.14%。
在大多数常见情况下,由于堆积导致的效应估计偏差较低。高血压诊断时年龄较小与心血管合并症风险较高相关。