基于观察性医疗保健数据的人群效应估计研究的经验置信区间校准。

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data.

机构信息

Observational Health Data Sciences and Informatics, New York, NY 10032;

Epidemiology Analytics, Janssen Research & Development, Titusville, NJ 08560.

出版信息

Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2571-2577. doi: 10.1073/pnas.1708282114.

DOI:10.1073/pnas.1708282114

PMID:29531023

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5856503/

Abstract

Observational healthcare data, such as electronic health records and administrative claims, offer potential to estimate effects of medical products at scale. Observational studies have often been found to be nonreproducible, however, generating conflicting results even when using the same database to answer the same question. One source of discrepancies is error, both random caused by sampling variability and systematic (for example, because of confounding, selection bias, and measurement error). Only random error is typically quantified but converges to zero as databases become larger, whereas systematic error persists independent from sample size and therefore, increases in relative importance. Negative controls are exposure-outcome pairs, where one believes no causal effect exists; they can be used to detect multiple sources of systematic error, but interpreting their results is not always straightforward. Previously, we have shown that an empirical null distribution can be derived from a sample of negative controls and used to calibrate values, accounting for both random and systematic error. Here, we extend this work to calibration of confidence intervals (CIs). CIs require positive controls, which we synthesize by modifying negative controls. We show that our CI calibration restores nominal characteristics, such as 95% coverage of the true effect size by the 95% CI. We furthermore show that CI calibration reduces disagreement in replications of two pairs of conflicting observational studies: one related to dabigatran, warfarin, and gastrointestinal bleeding and one related to selective serotonin reuptake inhibitors and upper gastrointestinal bleeding. We recommend CI calibration to improve reproducibility of observational studies.

摘要

观察性医疗保健数据，如电子健康记录和行政索赔，提供了在大规模估计医疗产品效果的潜力。然而，观察性研究经常被发现是不可复制的，即使使用相同的数据库来回答相同的问题，也会产生相互矛盾的结果。差异的一个来源是错误，包括由抽样变异性引起的随机错误和系统错误（例如，由于混杂、选择偏差和测量误差）。只有随机误差通常是定量的，但随着数据库的增大而趋于零，而系统误差独立于样本量持续存在，因此相对重要性增加。负对照是暴露-结局对，其中人们认为不存在因果效应；它们可用于检测多种系统误差源，但解释其结果并不总是直截了当。以前，我们已经表明，可以从负对照的样本中得出一个经验性的零分布，并将其用于校准值，以考虑随机误差和系统误差。在这里，我们将这项工作扩展到置信区间（CI）的校准。CI 需要阳性对照，我们通过修改负对照来合成阳性对照。我们表明，我们的 CI 校准恢复了名义特征，例如 95%CI 以 95%的置信度覆盖真实效应大小。我们还表明，CI 校准减少了两项相互矛盾的观察性研究的重复之间的分歧：一项与达比加群、华法林和胃肠道出血有关，另一项与选择性 5-羟色胺再摄取抑制剂和上胃肠道出血有关。我们建议进行 CI 校准，以提高观察性研究的可重复性。

相似文献

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data.

Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2571-2577. doi: 10.1073/pnas.1708282114.

Assessing the effectiveness of empirical calibration under different bias scenarios.

BMC Med Res Methodol. 2022 Jul 27;22(1):208. doi: 10.1186/s12874-022-01687-6.

Adjusting for both sequential testing and systematic error in safety surveillance using observational data: Empirical calibration and MaxSPRT.

Stat Med. 2023 Feb 28;42(5):619-631. doi: 10.1002/sim.9631. Epub 2023 Jan 15.

Interpreting observational studies: why empirical calibration is needed to correct p-values.

Stat Med. 2014 Jan 30;33(2):209-18. doi: 10.1002/sim.5925. Epub 2013 Jul 30.

Bias, Confounding, and Interaction: Lions and Tigers, and Bears, Oh My!

Anesth Analg. 2017 Sep;125(3):1042-1048. doi: 10.1213/ANE.0000000000002332.

Limitations of empirical calibration of p-values using observational data.

Stat Med. 2016 Sep 30;35(22):3869-82. doi: 10.1002/sim.6936. Epub 2016 Mar 10.

The effect of sample size and bias on the reliability of estimates of error: a comparative study of Dahlberg's formula.

Eur J Orthod. 2012 Apr;34(2):158-63. doi: 10.1093/ejo/cjr010. Epub 2011 Mar 29.

Empirical performance of the self-controlled case series design: lessons for developing a risk identification and analysis system.

Drug Saf. 2013 Oct;36 Suppl 1:S83-93. doi: 10.1007/s40264-013-0100-4.

A comparison of the empirical performance of methods for a risk identification system.

Drug Saf. 2013 Oct;36 Suppl 1:S143-58. doi: 10.1007/s40264-013-0108-9.

引用本文的文献

Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies.

Stat Anal Data Min. 2024 Dec;17(6). doi: 10.1002/sam.11715. Epub 2024 Dec 4.

Immunoendocrine Insights Into Dehydroepiandrosterone by Cytokine Profiling in Sub-fertile Women Undergoing Intrauterine Insemination.

Cureus. 2025 Jun 28;17(6):e86899. doi: 10.7759/cureus.86899. eCollection 2025 Jun.

Negative control-calibrated difference-in-difference analyses: addressing unmeasured confounding in RWD with application to racial/ethnic differences.

NPJ Digit Med. 2025 Jul 17;8(1):452. doi: 10.1038/s41746-025-01821-w.

Drug combination-wide association studies of cancer.

Commun Med (Lond). 2025 Jul 9;5(1):285. doi: 10.1038/s43856-025-00991-8.

Risk of Thyroid Tumors With GLP-1 Receptor Agonists: A Retrospective Cohort Study.

Diabetes Care. 2025 Aug 1;48(8):1386-1394. doi: 10.2337/dc25-0154.

Unraveling the Link Between Obesity and Keratoconus Risk Based on Genetic Evidence.

Transl Vis Sci Technol. 2025 May 1;14(5):20. doi: 10.1167/tvst.14.5.20.

Cardiovascular post-acute sequelae of SARS-CoV-2 in children and adolescents: cohort study using electronic health records.

Nat Commun. 2025 Apr 11;16(1):3445. doi: 10.1038/s41467-025-56284-0.

Reinfection with SARS-CoV-2 in the Omicron Era is Associated with Increased Risk of Post-Acute Sequelae of SARS-CoV-2 Infection: A RECOVER-EHR Cohort Study.

medRxiv. 2025 Mar 30:2025.03.28.25324858. doi: 10.1101/2025.03.28.25324858.

Epidemiology of Shigella species and serotypes in children: a retrospective substudy of the MAL-ED observational birth cohort study.

Lancet Microbe. 2025 Jun;6(6):101064. doi: 10.1016/j.lanmic.2024.101064. Epub 2025 Mar 26.

Semaglutide and Nonarteritic Anterior Ischemic Optic Neuropathy.

JAMA Ophthalmol. 2025 Apr 1;143(4):304-314. doi: 10.1001/jamaophthalmol.2024.6555.

本文引用的文献

A New Method for Partial Correction of Residual Confounding in Time-Series and Other Observational Studies.

Am J Epidemiol. 2017 May 15;185(10):941-949. doi: 10.1093/aje/kwx013.

Negative Control Outcomes: A Tool to Detect Bias in Randomized Trials.

JAMA. 2016 Dec 27;316(24):2597-2598. doi: 10.1001/jama.2016.17700.

Accuracy of an automated knowledge base for identifying drug adverse reactions.

J Biomed Inform. 2017 Feb;66:72-81. doi: 10.1016/j.jbi.2016.12.005. Epub 2016 Dec 16.

Stroke, Bleeding, and Mortality Risks in Elderly Medicare Beneficiaries Treated With Dabigatran or Rivaroxaban for Nonvalvular Atrial Fibrillation.

JAMA Intern Med. 2016 Nov 1;176(11):1662-1671. doi: 10.1001/jamainternmed.2016.5954.

Robust empirical calibration of p-values using observational data.

Stat Med. 2016 Sep 30;35(22):3883-8. doi: 10.1002/sim.6977.

Brief Report: Negative Controls to Detect Selection Bias and Measurement Bias in Epidemiologic Studies.

Epidemiology. 2016 Sep;27(5):637-41. doi: 10.1097/EDE.0000000000000504.

Data Resource Profile: Clinical Practice Research Datalink (CPRD).

Int J Epidemiol. 2015 Jun;44(3):827-36. doi: 10.1093/ije/dyv098. Epub 2015 Jun 6.

Control Outcomes and Exposures for Improving Internal Validity of Nonrandomized Studies.

Health Serv Res. 2015 Oct;50(5):1432-51. doi: 10.1111/1475-6773.12279. Epub 2015 Jan 19.

Massive parallelization of serial inference algorithms for a complex generalized linear model.

ACM Trans Model Comput Simul. 2013 Jan;23(1). doi: 10.1145/2414416.2414791.

Zoo or savannah? Choice of training ground for evidence-based pharmacovigilance.

Drug Saf. 2014 Sep;37(9):655-9. doi: 10.1007/s40264-014-0198-z.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于观察性医疗保健数据的人群效应估计研究的经验置信区间校准。

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data.

机构信息

Observational Health Data Sciences and Informatics, New York, NY 10032;

Epidemiology Analytics, Janssen Research & Development, Titusville, NJ 08560.

出版信息

Proc Natl Acad Sci U S A. 2018 Mar 13;115(11):2571-2577. doi: 10.1073/pnas.1708282114.

DOI:10.1073/pnas.1708282114

PMID:29531023

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5856503/

Abstract

摘要

基于观察性医疗保健数据的人群效应估计研究的经验置信区间校准。

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于观察性医疗保健数据的人群效应估计研究的经验置信区间校准。

Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data.

机构信息

出版信息