Limoncella Giorgio, Grilli Leonardo, Dreassi Emanuela, Rampichini Carla, Platt Robert, Gini Rosa
Department of Statistics, Informatics and Applications ``Giuseppe Parenti'', University of Florence, Florence 50134, Italy.
Departments of Pediatrics and of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec H3A 0G3, Canada.
Am J Epidemiol. 2025 Sep 3;194(9):2570-2579. doi: 10.1093/aje/kwae423.
In epidemiologic database studies, the occurrence of an event is measured with error through an indicator whose specificity is often maximized, at the expense of sensitivity. However, if the indicator has low sensitivity, measures of occurrence are underestimated. In association studies, risk difference is biased, and risk ratio may be biased as well, in either direction, if the sensitivity is differential across exposure groups. In this work, we show that if an auxiliary screening indicator can be defined to complement the main indicator, estimates of the positive predictive value of both indicators provide tools to estimate the sensitivity of the primary indicator or a lower bound thereof. This mitigates bias in estimating the number of cases, prevalence, cumulative incidence, rate (particularly when the event is rare), and, in association studies, risk ratio and risk difference. They also allow testing for nondifferential sensitivity. Although direct estimation of sensitivity is often infeasible, this novel methodology improves evidence based on data obtained from reuse of existing databases, which may prove critical for regulatory and public health decisions.
在流行病学数据库研究中,事件的发生是通过一个指标来测量的,该指标存在误差,其特异性通常会被最大化,而以敏感性为代价。然而,如果该指标的敏感性较低,事件发生率的测量值就会被低估。在关联研究中,如果敏感性在暴露组之间存在差异,风险差值会产生偏差,风险比也可能会在两个方向上产生偏差。在这项研究中,我们表明,如果可以定义一个辅助筛查指标来补充主要指标,那么这两个指标的阳性预测值估计值就能提供工具来估计主要指标的敏感性或其下限。这可以减轻在估计病例数、患病率、累积发病率、发生率(尤其是当事件罕见时)以及在关联研究中估计风险比和风险差值时的偏差。它们还允许对无差异敏感性进行检验。尽管直接估计敏感性通常不可行,但这种新颖的方法改进了基于从现有数据库再利用中获得的数据的证据,这对于监管和公共卫生决策可能至关重要。