1 Department of Family and Community Medicine, Baylor College of Medicine, Houston, TX, USA.
2 Birth Defects Surveillance Program, Department of Community and Family Health, College of Public Health, University of South Florida, Tampa, FL, USA.
Public Health Rep. 2018 May/Jun;133(3):303-310. doi: 10.1177/0033354918763168. Epub 2018 Apr 5.
We identified algorithms to improve the accuracy of passive surveillance programs for birth defects that rely on administrative diagnosis codes for case ascertainment and in situations where case confirmation via medical record review is not possible or is resource prohibitive.
We linked data from the 2009-2011 Florida Birth Defects Registry, a statewide, multisource, passive surveillance program, to an enhanced surveillance database with selected cases confirmed through medical record review. For each of 13 birth defects, we calculated the positive predictive value (PPV) to compare the accuracy of 4 algorithms that varied case definitions based on the number of diagnoses, medical encounters, and data sources in which the birth defect was identified. We also assessed the degree to which accuracy-improving algorithms would affect the Florida Birth Defects Registry's completeness of ascertainment.
The PPV generated by using the original Florida Birth Defects Registry case definition (ie, suspected cases confirmed by medical record review) was 94.2%. More restrictive case definition algorithms increased the PPV to between 97.5% (identified by 1 or more codes/encounters in 1 data source) and 99.2% (identified in >1 data source). Although PPVs varied by birth defect, alternative algorithms increased accuracy for all birth defects; however, alternative algorithms also resulted in failing to ascertain 58.3% to 81.9% of cases.
We found that surveillance programs that rely on unverified diagnosis codes can use algorithms to dramatically increase the accuracy of case finding, without having to review medical records. This can be important for etiologic studies. However, the use of increasingly restrictive case definition algorithms led to a decrease in completeness and the disproportionate exclusion of less severe cases, which could limit the widespread use of these approaches.
我们确定了算法,以提高依赖于行政诊断代码进行病例确定的出生缺陷被动监测计划的准确性,以及在无法或资源有限的情况下通过病历审查进行病例确认的情况下。
我们将 2009-2011 年佛罗里达州出生缺陷登记处(一个全州范围、多源、被动监测计划)的数据与一个增强的监测数据库进行了链接,该数据库中的一些选定病例通过病历审查进行了确认。对于 13 种出生缺陷中的每一种,我们计算了阳性预测值(PPV),以比较基于诊断数量、医疗就诊次数和识别出生缺陷的数据源数量的 4 种算法的准确性。我们还评估了准确性改进算法将如何影响佛罗里达州出生缺陷登记处的病例发现完整性。
使用原始佛罗里达州出生缺陷登记处病例定义(即通过病历审查确认的疑似病例)生成的 PPV 为 94.2%。更严格的病例定义算法将 PPV提高到 97.5%(在 1 个数据源中识别出 1 个或多个代码/就诊)和 99.2%(在多个数据源中识别出)之间。尽管 PPV 因出生缺陷而异,但替代算法提高了所有出生缺陷的准确性;然而,替代算法也导致了 58.3%至 81.9%的病例无法确定。
我们发现,依赖未经证实的诊断代码的监测计划可以使用算法极大地提高病例发现的准确性,而无需审查病历。这对于病因研究很重要。然而,使用越来越严格的病例定义算法会导致完整性降低,以及较轻病例的不成比例排除,这可能会限制这些方法的广泛应用。