Université Paris-Saclay, UVSQ, Université Paris-Sud, Inserm, High-Dimensional Biostatistics for Drug Safety and Genomics, CESP, Villejuif, France.
Obstetric Department, Lille Catholic Hospitals, Lille Catholic University, Lille, France.
Drug Saf. 2020 Jun;43(6):549-559. doi: 10.1007/s40264-020-00916-5.
Pregnant women are largely exposed to medications. However, knowledge is lacking about their effects on pregnancy and the fetus.
This study sought to evaluate the potential of high-dimensional propensity scores and high-dimensional disease risk scores for automated signal detection in pregnant women from medico-administrative databases in the context of drug-induced prematurity.
We used healthcare claims and hospitalization discharges of a 1/97th representative sample of the French population. We tested the association between prematurity and drug exposure during the trimester before delivery, for all drugs prescribed to at least five pregnancies. We compared different strategies (1) for building the two scores, including two machine-learning methods and (2) to account for these scores in the final logistic regression models: adjustment, weighting, and matching. We also proposed a new signal detection criterion derived from these scores: the p value relative decrease. Evaluation was performed by assessing the relevance of the signals using a literature review and clinical expertise.
Screening 400 drugs from a cohort of 57,407 pregnancies, we observed that choosing between the two machine-learning methods had little impact on the generated signals. Score adjustment performed better than weighting and matching. Using the p value relative decrease efficiently filtered out spurious signals while maintaining a number of relevant signals similar to score adjustment. Most of the relevant signals belonged to the psychotropic class with benzodiazepines, antidepressants, and antipsychotics.
Mining complex healthcare databases with statistical methods from the high-dimensional inference field may improve signal detection in pregnant women.
孕妇在很大程度上会接触到药物。然而,人们对这些药物对妊娠和胎儿的影响知之甚少。
本研究旨在评估高维倾向评分和高维疾病风险评分在药物诱导早产背景下,从医疗管理数据库中自动检测孕妇信号的潜力。
我们使用了法国人口的 1/97 代表性样本的医疗保健索赔和住院记录。我们测试了在分娩前三个月内,所有至少有 5 次妊娠使用的药物与早产之间的关联。我们比较了不同的策略(1)构建这两个评分,包括两种机器学习方法,以及(2)在最终的逻辑回归模型中考虑这些评分的方法:调整、加权和匹配。我们还提出了一种新的信号检测标准,该标准源自这些评分:相对 p 值下降。通过文献综述和临床专业知识评估信号的相关性来进行评估。
在 57407 例妊娠队列中筛选出 400 种药物,我们发现两种机器学习方法之间的选择对生成的信号影响不大。评分调整的效果优于加权和匹配。使用相对 p 值下降有效地筛选出虚假信号,同时保持与评分调整相似数量的相关信号。大多数相关信号属于精神类药物,包括苯二氮䓬类、抗抑郁药和抗精神病药。
使用高维推断领域的统计方法挖掘复杂的医疗保健数据库可能会提高孕妇信号检测的效果。