Owusu-Edusei Kwame, Deb Arijita, Elbasha Elamin H
Merck & Co., Inc., Rahway, NJ, USA.
GSK, Upper Province, PA, USA.
Dialogues Health. 2025 Apr 18;6:100216. doi: 10.1016/j.dialog.2025.100216. eCollection 2025 Jun.
The risk of disease varies across populations based on factors like age, sex, race, ethnicity, socioeconomic status, and underlying medical conditions. Subgroup or subpopulation data are critical in planning, executing and evaluating public health interventions. However, most studies report the values for the overall (total) population with little or no information on the subgroups. As a result, finding subgroup specific data can be challenging.
In this report, a set of formulae that can be used to calculate subgroup or subpopulation data using the overall estimates and the reported or assumed relative estimates were derived.
A simple numerical example was used to illustrate the methodology. Next, symbolic formula for calculating the burden (e.g., incidence, prevalence, or average cost) for 3 (and extended to number of) subgroups or subpopulations were derived. To account for uncertainty in the data, two statistical methods were used to estimate confidence intervals for the point estimates.
The derived formulae indicated that each subgroup or subpopulation's burden (incidence, prevalence, or average cost) can be calculated as the overall burden adjusted by the ratio of that subgroup or subpopulation's relative burden to the sum of the proportion-weighted relative burden (incidence, prevalence, or average cost) of all the subgroups or subpopulations within the population.
These formulae can help to avoid or minimize potential quantitative and qualitative errors in subgroup or subpopulation disease burden estimates used for health research, interventions and/or policy analyses or deliberations.
基于年龄、性别、种族、民族、社会经济地位和潜在医疗状况等因素,不同人群的疾病风险存在差异。亚组或亚人群数据对于规划、实施和评估公共卫生干预措施至关重要。然而,大多数研究报告的是总体(全部)人群的数据,而关于亚组的信息很少或没有。因此,查找亚组特定数据可能具有挑战性。
在本报告中,推导了一组可用于使用总体估计值以及报告的或假设的相对估计值来计算亚组或亚人群数据的公式。
使用一个简单的数值示例来说明该方法。接下来,推导了用于计算3个(并扩展到任意数量)亚组或亚人群负担(如发病率、患病率或平均成本)的符号公式。为了考虑数据中的不确定性,使用两种统计方法来估计点估计值的置信区间。
推导的公式表明,每个亚组或亚人群的负担(发病率、患病率或平均成本)可以通过将总体负担乘以该亚组或亚人群的相对负担与人群中所有亚组或亚人群的比例加权相对负担(发病率、患病率或平均成本)之和的比率来计算。
这些公式有助于避免或最小化用于健康研究、干预措施和/或政策分析或审议的亚组或亚人群疾病负担估计中的潜在定量和定性误差。