Department of Epidemiology, Human Genetics and Environmental Sciences, UTHealth School of Public Health, Houston, Texas.
Department of Biostatistics and Data Science, UTHealth School of Public Health, Houston, Texas.
Birth Defects Res. 2019 Nov 1;111(18):1356-1364. doi: 10.1002/bdr2.1549. Epub 2019 Jul 16.
Few studies have systematically evaluated birth defect co-occurrence patterns, perhaps, in part, due to the lack of software designed to implement large-scale, complex analytic methods.
We created an R-based platform, "co-occurring defect analysis" (CODA), designed to implement analyses of birth defect co-occurrence patterns in birth defect registries. CODA uses an established algorithm for calculating the observed-to-expected ratio of a given birth defect combination, accounting for the known tendency of birth defects to co-occur nonspecifically. To demonstrate CODA's feasibility, we evaluated the computational time needed to assess 2- to 5-way combinations of major birth defects in the Texas Birth Defects Registry (TBDR) (1999-2014). We report on two examples of pairwise patterns, defects co-occurring with trisomy 21 or with non-syndromic spina bifida, to demonstrate proof-of-concept.
We evaluated combinations of 175 major birth defects among 206,784 infants in the TBDR. CODA performed efficiently in the data set, analyzing 1.5 million 5-way combinations in 18 hr. As anticipated, we identified large observed-to-expected ratios for the birth defects that co-occur with trisomy 21 or spina bifida.
CODA is available for application to birth defect data sets and can be used to better understand co-occurrence patterns. Co-occurrence patterns elucidated by using CODA may be helpful for identifying new birth defect associations and may provide etiological insights regarding potentially shared pathogenic mechanisms. CODA may also have wider applications, such as assessing patterns of additional types of co-occurrence patterns in other large data sets (e.g., medical records).
由于缺乏专门用于实施大规模、复杂分析方法的软件,因此很少有研究系统地评估出生缺陷的并发模式。
我们创建了一个基于 R 的平台,即“并发缺陷分析”(CODA),旨在对出生缺陷登记处中的出生缺陷并发模式进行分析。CODA 使用一种已建立的算法来计算给定缺陷组合的观察到的与预期的比值,该算法考虑到了出生缺陷非特异性并发的已知趋势。为了证明 CODA 的可行性,我们评估了在德克萨斯州出生缺陷登记处(TBDR)(1999-2014 年)评估 2 至 5 种主要出生缺陷组合所需的计算时间。我们报告了两个关于二元模式的示例,即与 21 三体或非综合征性脊柱裂并发的缺陷,以证明概念验证。
我们在 TBDR 中的 206784 名婴儿中评估了 175 种主要出生缺陷的组合。CODA 在该数据集中的性能高效,在 18 小时内分析了 150 万个 5 种组合。正如预期的那样,我们确定了与 21 三体或脊柱裂并发的出生缺陷的观察到的与预期的比值较大。
CODA 可应用于出生缺陷数据集,并可用于更好地理解并发模式。使用 CODA 阐明的并发模式可能有助于识别新的出生缺陷关联,并为潜在共享的发病机制提供病因学见解。CODA 也可能具有更广泛的应用,例如评估其他大型数据集(例如医疗记录)中其他类型并发模式的模式。